I need to create a web bot or crawler to collect information from a group of web pages. I am NOT interested in harvesting email addresses for spamming or anything nearly as nefarious as that, and will happily comply with robots.txt files. However, most of the bot script samples I have found are in Java or Perl. As I am already familiar with PHP and wish to parse the results into a MySQL database, I would prefer PHP.
All that aside, I am specifically looking for either an example file in PHP that I could alter, or a hint as to how I "get" or open a specific URL in order to parse it.
TIA for any suggestions,
-Allen