Really Clean up Word HTML

Status
Not open for further replies.

johnsmith98

Instructor
Jul 14, 2001
3
IL
I am using Word 2000 and Dreamweaver 4.

I am taking Word documents (that have pictures in them) and using Save as Web Page to create HTML. I then open the result in Dreamweaver and run Commands / Clean Up Word HTML. This is pretty good, but not good enough.

How do I take a Word document and clean up all the code so that I am left with only the text, correct links to the images, and the most basic tags like <p> and <br>?

Even after Dreamweaver's Commands / Clean Up Word HTML, there is still so much crap code in there.

I've already used Word to Save As Text and then converted that to HTML, but losing the images and then pasting them back in wastes too much time.

I wouldn't do this by choice; I was given the Word files and told to do it.

Tom
 
Hi,

When I want to strip HTML tags from a page, I use Evrsoft 1st Page 2000; it has a tool that does this. Of course, you would still have to edit the output. Too bad you have to work with pages created in Word.

Bye.
 
I have a Perl program that will strip ALL Word HTML out of a file, leaving nothing but text. You might be able to modify it to do what you want if you know Perl. I'll email it to you if you want it.

Tracy Dryden
tracy@bydisn.com

Meddle not in the affairs of dragons,
For you are crunchy, and good with mustard.
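
For reference, the cleanup asked for at the top of the thread (keep the text, the image links, and basic tags such as <p> and <br>) can be sketched as a tag whitelist. This is not the Perl program mentioned above and not what Dreamweaver's command does; it is only a rough illustration using Python's standard html.parser module. The names KEEP, VOID, SKIP, WordHtmlCleaner and clean_word_html are made up for this example, and the choice of which tags and attributes to keep is an assumption you would adjust.

```python
from html import escape
from html.parser import HTMLParser

# Assumed whitelist: tag name -> attributes worth keeping on that tag.
KEEP = {"p": (), "br": (), "a": ("href",), "img": ("src", "alt")}
VOID = {"br", "img"}                 # tags that never get a closing tag
SKIP = {"head", "style", "script"}   # drop these tags AND everything inside


class WordHtmlCleaner(HTMLParser):
    """Re-emit only whitelisted tags; keep text; drop everything else."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []
        self.skip_depth = 0  # > 0 while inside a SKIP element

    def handle_starttag(self, tag, attrs):
        if tag in SKIP:
            self.skip_depth += 1
            return
        if self.skip_depth or tag not in KEEP:
            return  # e.g. <span>, <o:p>, <font>: tag dropped, text kept
        kept = "".join(
            f' {name}="{escape(value or "", quote=True)}"'
            for name, value in attrs
            if name in KEEP[tag]
        )
        self.out.append(f"<{tag}{kept}>")

    def handle_endtag(self, tag):
        if tag in SKIP:
            self.skip_depth = max(0, self.skip_depth - 1)
            return
        if self.skip_depth or tag not in KEEP or tag in VOID:
            return
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(escape(data, quote=False))


def clean_word_html(html_text):
    """Return html_text reduced to text plus the whitelisted tags."""
    parser = WordHtmlCleaner()
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out)
```

Comments, including Word's conditional comment blocks, are dropped because the parser's default comment handler does nothing. Image src values are passed through unchanged, so the image folder that Word writes next to the HTML file has to stay where it is. The output would still need a manual pass for empty paragraphs and stray whitespace.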
 