Well-Formed or not?

Status
Not open for further replies.

jerrysheehan

Technical User
Aug 4, 2002
3
US
xml file:
<?xml version="1.0" encoding="utf-8"?>
<copyright>©</copyright>


I am not using a DTD yet; I am only checking the XML file for well-formedness on its own. I am not using any character references or entity declarations, so the character is not declared anywhere, since this is a standalone XML file at this point. This is what happens when I convert my PDF files to XML through a manual process and the characters are picked up. Some parsers tell me this is invalid and point to the symbol, while others say it is valid, and I was wondering what the truth actually is. Should this be flagged as well-formed or not?

Any help would be appreciated
 
the truth is: there is no truth.


parsers may or may not, as you've found out, think that this is valid.

a parser-proof way to do this is to use the numeric character reference instead of the actual symbol: &#xA9;
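
a quick way to see the difference is to feed a parser the same element in a few forms. the sketch below uses Python's standard xml.etree.ElementTree; the helper name is_well_formed and the three sample documents are made up for illustration. it also assumes one possible cause of the disagreement that this thread does not confirm: the file may have been saved in a single-byte encoding such as Latin-1 while the declaration says utf-8.

```python
import xml.etree.ElementTree as ET

DECL = b'<?xml version="1.0" encoding="utf-8"?>\n'


def is_well_formed(data: bytes) -> bool:
    """Return True if the parser accepts the bytes as well-formed XML."""
    try:
        ET.fromstring(data)
        return True
    except ET.ParseError:
        return False


# Literal copyright sign, actually stored as UTF-8 (bytes C2 A9).
utf8_doc = DECL + "<copyright>\u00a9</copyright>".encode("utf-8")

# Assumption for illustration: same text saved as Latin-1 (single byte A9),
# which does not match the declared utf-8 encoding.
latin1_doc = DECL + "<copyright>\u00a9</copyright>".encode("latin-1")

# Numeric character reference: pure ASCII, independent of file encoding.
charref_doc = DECL + b"<copyright>&#xA9;</copyright>"

for name, doc in [("utf-8 literal", utf8_doc),
                  ("latin-1 literal", latin1_doc),
                  ("character reference", charref_doc)]:
    print(name, "->", is_well_formed(doc))
```

the character reference form stays plain ASCII, so it should survive whatever encoding the file ends up being saved in.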


a full list of characters can be searched from here:


hope that helps

matt
 