Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations TouchToneTommy on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

Evaluating similarity of two XML files

Status
Not open for further replies.

rtmena

MIS
May 25, 2003
9
BR
Hi,

I need to compare two XML files and determine if they are similar. Similar for me is that they share the same structure, regardless the actual content of them.

The idea is to discard the files before further processing based on this criteria. For example, suppose I've mirrored freshmeat.net and only want to store or process the articles pages, I'd submit every page through Xerces to balance the tags and through this "tool" to determine which pages are similar to a given example.

I do not want to reinvent the wheel so I am looking for algorithms, snippets of code, tools to aid me.

Any ideas ?
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top