Você está na página 1de 2

1- Extrair links do HTML

No evento Click do boto - Extrair Links do HTML - inclua o cdigo abaixo:

Private Sub btnExtraiLinks_Click(sender As Object, e As EventArgs) Handles


btnExtraiLinks.Click
Dim contador As Integer
Try
Dim arrLinks As ArrayList = extraiLinks(txtHTML.Text)
' extrai os links
For contador = 0 To arrLinks.Count - 1
lbLinks.Items.Add(arrLinks(contador).ToString)
Next
Catch ex As Exception
MessageBox.Show(" Erro : " + ex.Message, "Erro",
MessageBoxButtons.OK, MessageBoxIcon.Error)
End Try
End Sub

No cdigo acima estamos usando o mtodo extraiLinks() passando o HTML


retornado da pgina web.

O cdigo deste mtodo visto abaixo:

Public Function extraiLinks(ByVal HTML As String) As ArrayList


Dim objRegEx As Regex
Dim objMatch As Match

Dim arrLinks As New ArrayList


Dim strMatch As String
Try
' Cria uma expresso regular
objRegEx = New Regex("a.*href\s*=\s*(?:""(?<1>[^""]*)""|(?
<1>\S+))", RegexOptions.IgnoreCase Or RegexOptions.Compiled)
' procura no HTML
objMatch = objRegEx.Match(HTML)
' extrai Links
While objMatch.Success
strMatch = objMatch.Groups(1).ToString
arrLinks.Add(strMatch)
objMatch = objMatch.NextMatch()
End While
' retorna o resultado
Return arrLinks
Catch ex As Exception
Throw ex
End Try
End Function

Você também pode gostar