Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations Mike Lewis on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Perl Getstore Problem

Status
Not open for further replies.

chitownclone

Programmer
Mar 30, 2004
22
0
0
US
I've written a perl spider to do some analysis work on the web. The program I've written successful pulls the source code for 95% of the 10k+ URLS, but there are a select few that I get the 'getstore' function to work on.

These web sites are anything specical at all, so I am not sure why it works most of the time but not for these sites...any suggestions. Below are some sites that won't spider.

Won't Work:

Will Work:

Any ideas??? Thanks
 
Looking at the HTML headers the following could give you a clue:

Code:
<META HTTP-EQUIV="Pragma" CONTENT="no-cache">
<META HTTP-EQUIV="Expires" CONTENT="Thu, 1 Jan 1970 01:00:00 GMT">

getstore may be honouring the no-cache status of the page.

Barbie
Leader of Birmingham Perl Mongers
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top