Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations SkipVought on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

How to mass download/mine historical news announcement from sites?

Status
Not open for further replies.

zetic

Programmer
Aug 12, 2002
2
0
0
HK
Hi Everyone,

I have a simple project,
I need to mine a set of html file into a local drive from web for archival purposes.
Blackwidow, a spider, cannot handle the load, and netants, a mass downloader, cant handle either.
Is there a better tool?

Thanks,
Tom
 
I'm on my way to work right now, otherwise I'd hunt out a more specific tool for you, but here is a site that contains excellent information on spiders and bots, from off-the-shelf tools to do-it-yourself projects.



I've found incredible tools there.
 
I bought a copy of Offline Explorer which I found on
It offers a very simple way to archive websites and includes very detailed parameters for controlling what portions of the website to archive.

I highly reccommend it.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top