I have an XSLT that converts RSS to a neat and readable HTML document. The resulting HTML document uses JavaScript to do some fancy tricks. My HTML is table-free and uses a CSS style sheet to lay out elements. My problem is that some of the RSS feeds embed HTML in their content, and that HTML is sometimes malformed. Their bad HTML breaks my pretty HTML.
I need to know if there is a way to properly contain bad HTML so it won't affect mine or to test their HTML for errors before I try to display it.
I know Google Reader does something like this because the feeds that break my reader don't break Google's. If I look at the source of the page Google Reader generates, the offending elements are left out. How are they detecting and trapping that?