Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations strongm on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

regexp Help

Status
Not open for further replies.

gwrman

IS-IT--Management
Dec 20, 2005
94
CA
Okay, so ive got a regexp that works perfectly in an online tester. I am having trouble implementing in my existing code.

It takes a variable (xmlhttp.responseText) that holds an external page's source code, and checks it for the existance of a link to a specified domain. It should then show ONLY the HTML for that link, and then break it up in the sub strings.

What it DOES do currently, is return the whole string, EXCEPT the link it should, in effect doing the opposite.

Any help is appreciated. Source code i'm checking against to test is and looking for the link to billiardsforum.info.

The VB code thus far:

Code:
url = objRSanc("la_blt_link_url")

set xmlhttp = server.CreateObject("MSXML2.ServerXMLHTTP")  
on error resume next 
xmlhttp.open "GET", url, false 
xmlhttp.send ""  
    
if err.number <> 0 then 
VerifyUrl=false
response.redirect("[URL unfurl="true"]http://www.google.com/?0000000000000000000000")[/URL]	
else 
    
set regizzo = New RegExp
regizzo.Pattern = "(<[^>]*?a[^>]*?(?:billiardsforum.info)[^>]*>)((?:.*?(?:<[ \r\t]*a[^>]*>?.*?(?:<.*?/.*?a.*?>)?)*)*)(<[^>]*?/[^>]*?a[^>]*?>)"

regizzo.Global = True
regizzo.IgnoreCase = True

dim stipulation 
    
stipulation = regizzo.replace(xmlhttp.responseText, "$2")
    
response.write stipulation     

    
Set regizzo = Nothing
    
dim updiz
Set updiz = Server.CreateObject("ADODB.Connection")
updiz.Open strdbee
Set objRSupd = Server.CreateObject("ADODB.RecordSet")

strsqlu = "UPDATE la_blt SET la_blt_link_anchor='somevar' WHERE la_blt_id =44"
objRSupd.Open strSQLu, updiz, adLockOptimistic
	
end if
    

response.write("done.")
%>
 
[1]
>What it DOES do currently, is return the whole string, EXCEPT the link it should, in effect doing the opposite.
It means "no match".
[2]
>looking for the link to billiardsforum.info.
To be exact, the pattern should escape . in that string like this.
[tt]billiardsforum[red]\[/red].info[/tt]
[3]
Why not give a specific instance to show what you want?
 
Amendment
I might have mis-read your message in paragraph [1]. If some part of the string does replace and the reverse to what you want, you can try "$1$3..." instead of "$2". You just have to read into how the parentheses are nested.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top