Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations IamaSherpa on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Shell scripts to remove HTML tags from text file

Status
Not open for further replies.

ayeung

Technical User
Apr 28, 2003
2
AU
I have used wget command to convert a HTML file into a text file. However, the text file has included the HTML tags. How can I remove the tags using shell scripts? Please help... Thanks.
 
Try using sed, I am not sure if your html is in uppercase or lower case. Do you know how to put this into a script?

sed '/[Hh][Tt][Mm][Ll]/d'
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top