Hi,
What is the bestway to extract only the <body>....</body> of an HTML file, and convert all links and graphics to their predefined URL.
For example, if I have
<html><head></head><body>
junk <a href="/local/files/.html"><img src="/local/images/.gif"></a>
junk <a href=" src="</body></html>
-----------------------
what i want is to get the part from <body>(this part)</body>
and convert all links and images that do not start with http:// to if we know /local/files will be
It is not that hard, but I have some problems w/ it. When I convert all local files to URL, it is messup.
Really appreciate for all you guys help.
cheer
kevin
What is the bestway to extract only the <body>....</body> of an HTML file, and convert all links and graphics to their predefined URL.
For example, if I have
<html><head></head><body>
junk <a href="/local/files/.html"><img src="/local/images/.gif"></a>
junk <a href=" src="</body></html>
-----------------------
what i want is to get the part from <body>(this part)</body>
and convert all links and images that do not start with http:// to if we know /local/files will be
It is not that hard, but I have some problems w/ it. When I convert all local files to URL, it is messup.
Really appreciate for all you guys help.
cheer
kevin