Tek-Tips is the largest IT community on the Internet today!

Members share and learn making Tek-Tips Forums the best source of peer-reviewed technical information on the Internet!

  • Congratulations IamaSherpa on being selected by the Tek-Tips community for having the most helpful posts in the forums last week. Way to Go!

utf8 to ascii

Status
Not open for further replies.

linchpin1

Programmer
Mar 23, 2005
2
GR
i need to parse a file stored in utf-8 format from microsoft word and convert it to ascii format. the text contain greek characters.
 
1) How do you save a Word document in UTF-8? I can save a notepad document in UTF-8 but I can't figure out how you do it in Word.
2) Is it in RTF or some internal Word format?
 
well u can save in UTF-8 by chosing save as "plain text" from the save as box. when you do that if you document has special character data ypo'll see another box popup that will give you a mother list of save as types. you can chose utf-8 from there.

b/w if not this can anyone help me with getting some kind of character map for UNICODE character data?
 
Thanks for the info - I've only ever used notepad to generate UTF-8.

In what way do you need to parse it? Do you need to convert UTF-8 to printable text? Maybe this one will help
Unicode Character Data
Open up character map from Start/Programs/Accessories/System Tools. Select a font like Arial MS Unicode, change the Character set to Unicode. Alternatively, go to
Note that MS Unicode is only 16 bits. Unix and many other OSs use 32 bits.
 
Status
Not open for further replies.

Part and Inventory Search

Sponsor

Back
Top