Typing Accented and other "foreign language" Characters

Status
Not open for further replies.

Ambatim

Programmer
Feb 28, 2002
110
IN
On Windows I can type accented European characters such as É with Alt+0201.
How can I type these characters on Unix?
I am running AIX Version 5.2.



Thanks in advance
Mallik
 
It depends on your locale setting, but in general:

Code:
Acute Function 	Code Value
Acute accent 	0xef
Apostrophe (acute) 	0x27
e Acute small 	0x82
e Acute capital 	0x90
a Acute small 	0xa0
i Acute small 	0xa1
o Acute small 	0xa2
u Acute small 	0xa3
a Acute capital 	0xb5
i Acute capital 	0xd6
y Acute small 	0xec
y Acute capital 	0xed
o Acute capital 	0xe0
u Acute capital 	0xe9

The following are grave function code values:
Grave Function 	Code Value
Grave accent 	0x60
a Grave small 	0x85
e Grave small 	0x8a
i Grave small 	0x8d
o Grave small 	0x95
u Grave small 	0x97
a Grave capital 	0xb7
e Grave capital 	0xd4
i Grave capital 	0xde
o Grave capital 	0xe3
u Grave capital 	0xeb

The following are circumflex function code values:
Circumflex Function 	Code Value
^ Circumflex accent 	0x5e
a Circumflex small 	0x83
e Circumflex small 	0x88
i Circumflex small 	0x8c
o Circumflex small 	0x93
u Circumflex small 	0x96
a Circumflex capital 	0xb6
e Circumflex capital 	0xd2
i Circumflex capital 	0xd7
o Circumflex capital 	0xe2
u Circumflex capital 	0xea

The following are umlaut function code values:
Umlaut Function 	Code Value
Umlaut accent 	0xf9
u Umlaut small 	0x81
a Umlaut small 	0x84
e Umlaut small 	0x89
i Umlaut small 	0x8b
a Umlaut capital 	0x8e
O Umlaut capital 	0x99
u Umlaut capital 	0x9a
e Umlaut capital 	0xd3
i Umlaut capital 	0xd8

The following are tilde function code values:
Tilde Function 	Code Value
Tilde accent 	0x7e
n Tilde small 	0xa4
n Tilde capital 	0xa5
a Tilde small 	0xc6
a Tilde capital 	0xc7
o Tilde small 	0xe4
o Tilde capital 	0xe5

The following are overcircle function code values:
Overcircle Function 	Code Value
Overcircle accent 	0x7d
a Overcircle small 	0x86
a Overcircle capital 	0x8f

The following are cedilla function code values:
Cedilla Function 	Code Value
Cedilla accent 	0xf7
c Cedilla capital 	0x80
c Cedilla small 	0x87

Mike

"A foolproof method for sculpting an elephant: first, get a huge block of marble, then you chip away everything that doesn't look like an elephant."

 
Thanks...

But how do I use these codes to input the character?

Thanks in advance
Mallik
 
Hmm, it should work the same way.
What sort of console are you using?
The numbers must be typed on the numeric keypad (the block to the right of the QWERTY keys). Do those number keys work normally?
If not, it might be a console or keyboard mapping/definition problem.
 
I tried to get a small e acute by typing the following, using the numeric keypad:
echo '\0x82'
I just got R.
I was expecting é.

Could you please let me know where I went wrong?

Thanks in advance
Mallik
 
You need to give echo the octal value instead of hex: é is 0xe9, which is 0351 in octal.

Like I said in my first post, it's dependent on your locale.
If you can post that, I'll paste a full octal list for you.
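
The hex-to-octal step can be sketched with the shell's own printf, which accepts 0x-prefixed numbers for the %o conversion. The function names hex_to_octal and emit_byte are made up for illustration; this is a minimal sketch assuming a POSIX shell, not anything specific to AIX.

```shell
#!/bin/sh
# hex_to_octal (hypothetical helper): print a hex byte value,
# such as 0xe9, as three octal digits.
hex_to_octal() {
    printf '%03o\n' "$1"
}

# emit_byte (hypothetical helper): write the single byte with the
# given hex value to stdout by building a \nnn escape for printf.
emit_byte() {
    printf "\\$(hex_to_octal "$1")"
}

hex_to_octal 0xe9               # prints 351
hex_to_octal 0x82               # prints 202
emit_byte 0xe9 | od -An -tx1    # dumps the raw byte (e9)
```

Which glyph the terminal draws for that byte still depends on the code set it is using.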

Mike

"A foolproof method for sculpting an elephant: first, get a huge block of marble, then you chip away everything that doesn't look like an elephant."

 
This is the output of the 'locale' command:
LANG=en_US
LC_COLLATE="en_US"
LC_CTYPE="en_US"
LC_MONETARY="en_US"
LC_NUMERIC="en_US"
LC_TIME="en_US"
LC_MESSAGES="en_US"
LC_ALL=

Could you please give me the full list?




Thanks in advance
Mallik
 
I typed
echo '\0351'
but I got a character that looks different from é. It looks something like theta (a zero with a dash through it).

Thanks in advance
Mallik
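
A theta-like glyph where é was expected usually suggests that the terminal is drawing byte 0xe9 with a PC code page rather than ISO 8859-1: in code page 437 that byte is Θ, while é sits at 0x82, the value given in the first list. The sketch below picks the octal code for é by code set. The helper name e_acute_octal and the code set names ISO8859-1, IBM-850, and IBM-437 are assumptions for illustration; `locale charmap` reports the code set the locale expects, which may differ from what the terminal actually displays.

```shell
#!/bin/sh
# e_acute_octal (hypothetical helper): print the octal code for é
# in the named code set. The code set names are assumptions; check
# `locale charmap` and the terminal settings for the real ones.
e_acute_octal() {
    case "$1" in
        ISO8859-1)       echo 351 ;;   # 0xe9, as in the full table
        IBM-850|IBM-437) echo 202 ;;   # 0x82, as in the first list
        *) echo "unknown code set: $1" >&2; return 1 ;;
    esac
}

# Emit the raw byte for é as a PC code page terminal would expect it.
printf "\\$(e_acute_octal IBM-850)\n"
```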
 
See the second column (octal) and pad it with a leading zero.

Code:
    Char 	Oct Dec 	Char 	Oct Dec 	Char 	Oct Dec 	Char 	Oct Dec
    NUL 	000 000 	SP 	040 032 	@ 	100 064 	` 	140 096
    ^A 	001 001 	! 	041 033 	A 	101 065 	a 	141 097
    ^B 	002 002 	" 	042 034 	B 	102 066 	b 	142 098
    ^C 	003 003 	# 	043 035 	C 	103 067 	c 	143 099
    ^D 	004 004 	$ 	044 036 	D 	104 068 	d 	144 100
    ^E 	005 005 	% 	045 037 	E 	105 069 	e 	145 101
    ^F 	006 006 	& 	046 038 	F 	106 070 	f 	146 102
    ^G 	007 007 	' 	047 039 	G 	107 071 	g 	147 103
    BS 	010 008 	( 	050 040 	H 	110 072 	h 	150 104
    TAB 	011 009 	) 	051 041 	I 	111 073 	i 	151 105
    LF 	012 010 	* 	052 042 	J 	112 074 	j 	152 106
    VT 	013 011 	+ 	053 043 	K 	113 075 	k 	153 107
    FF 	014 012 	, 	054 044 	L 	114 076 	l 	154 108
    CR 	015 013 	- 	055 045 	M 	115 077 	m 	155 109
    ^N 	016 014 	. 	056 046 	N 	116 078 	n 	156 110
    ^O 	017 015 	/ 	057 047 	O 	117 079 	o 	157 111
    ^P 	020 016 	0 	060 048 	P 	120 080 	p 	160 112
    ^Q 	021 017 	1 	061 049 	Q 	121 081 	q 	161 113
    ^R 	022 018 	2 	062 050 	R 	122 082 	r 	162 114
    ^S 	023 019 	3 	063 051 	S 	123 083 	s 	163 115
    ^T 	024 020 	4 	064 052 	T 	124 084 	t 	164 116
    ^U 	025 021 	5 	065 053 	U 	125 085 	u 	165 117
    ^V 	026 022 	6 	066 054 	V 	126 086 	v 	166 118
    ^W 	027 023 	7 	067 055 	W 	127 087 	w 	167 119
    ^X 	030 024 	8 	070 056 	X 	130 088 	x 	170 120
    ^Y 	031 025 	9 	071 057 	Y 	131 089 	y 	171 121
    ^Z 	032 026 	: 	072 058 	Z 	132 090 	z 	172 122
    ESC 	033 027 	; 	073 059 	[ 	133 091 	{ 	173 123
    FS 	034 028 	< 	074 060 	\ 	134 092 	| 	174 124
    GS 	035 029 	= 	075 061 	] 	135 093 	} 	175 125
    RS 	036 030 	> 	076 062 	^ 	136 094 	~ 	176 126
    US 	037 031 	? 	077 063 	_ 	137 095 	DEL 	177 127




    Octal, Hexadecimal, & Decimal ASCII Character Set (Complete)

    Hex 	Oct 	Dec 	Char 	Description
    [00] 	000 	0. 	<NUL> 	 
    [01] 	001 	1. 	^A 	Control-A
    [02] 	002 	2. 	^B 	Control-B
    [03] 	003 	3. 	^C 	Control-C
    [04] 	004 	4. 	^D 	Control-D
    [05] 	005 	5. 	^E 	Control-E
    [06] 	006 	6. 	^F 	Control-F
    [07] 	007 	7. 	^G 	Bell
    [08] 	010 	8. 	<BS> 	Backspace
    [09] 	011 	9. 	<TAB> 	Horizontal tab
    [0A] 	012 	10. 	<LF> 	Line feed
    [0B] 	013 	11. 	<VT> 	Vertical tab
    [0C] 	014 	12. 	<FF> 	Form feed
    [0D] 	015 	13. 	<CR> 	Carriage return
    [0E] 	016 	14. 	<SO> 	Control-N
    [0F] 	017 	15. 	<SI> 	Control-O
    [10] 	020 	16. 	^P 	Control-P
    [11] 	021 	17. 	<XON> 	Control-Q
    [12] 	022 	18. 	^R 	Control-R
    [13] 	023 	19. 	<XOFF> 	Control-S
    [14] 	024 	20. 	^T 	Control-T
    [15] 	025 	21. 	^U 	Control-U
    [16] 	026 	22. 	^V 	Control-V
    [17] 	027 	23. 	^W 	Control-W
    [18] 	030 	24. 	^X 	Control-X
    [19] 	031 	25. 	^Y 	Control-Y
    [1A] 	032 	26. 	^Z 	Control-Z
    [1B] 	033 	27. 	<ESC> 	ESCape, control-[
    [1C] 	034 	28. 	<FS> 	Control-\
    [1D] 	035 	29. 	<GS> 	Control-]
    [1E] 	036 	30. 	<RS> 	Control-^
    [1F] 	037 	31. 	<US> 	Control-_
    [20] 	040 	32. 	  	Space
    [21] 	041 	33. 	! 	Exclamation point
    [22] 	042 	34. 	" 	Double quotation mark
    [23] 	043 	35. 	# 	Number sign
    [24] 	044 	36. 	$ 	Dollar sign
    [25] 	045 	37. 	% 	Percent sign
    [26] 	046 	38. 	& 	Ampersand
    [27] 	047 	39. 	' 	Single quote, apostrophe
    [28] 	050 	40. 	( 	Left parenthesis
    [29] 	051 	41. 	) 	Right parenthesis
    [2A] 	052 	42. 	* 	Asterisk
    [2B] 	053 	43. 	+ 	Plus
    [2C] 	054 	44. 	, 	Comma
    [2D] 	055 	45. 	- 	Hyphen, minus
    [2E] 	056 	46. 	. 	Period, decimal point
    [2F] 	057 	47. 	/ 	Slash, slant, solidus
    [30] 	060 	48. 	0 	Digit 0
    [31] 	061 	49. 	1 	Digit 1
    [32] 	062 	50. 	2 	Digit 2
    [33] 	063 	51. 	3 	Digit 3
    [34] 	064 	52. 	4 	Digit 4
    [35] 	065 	53. 	5 	Digit 5
    [36] 	066 	54. 	6 	Digit 6
    [37] 	067 	55. 	7 	Digit 7
    [38] 	070 	56. 	8 	Digit 8
    [39] 	071 	57. 	9 	Digit 9
    [3A] 	072 	58. 	: 	Colon
    [3B] 	073 	59. 	; 	Semicolon
    [3C] 	074 	60. 	< 	Left angle, less than
    [3D] 	075 	61. 	= 	Equals
    [3E] 	076 	62. 	> 	Right angle, greater than
    [3F] 	077 	63. 	? 	Question mark
    [40] 	100 	64. 	@ 	Commercial at
    [41] 	101 	65. 	A 	Upper case A
    [42] 	102 	66. 	B 	Upper case B
    [43] 	103 	67. 	C 	Upper case C
    [44] 	104 	68. 	D 	Upper case D
    [45] 	105 	69. 	E 	Upper case E
    [46] 	106 	70. 	F 	Upper case F
    [47] 	107 	71. 	G 	Upper case G
    [48] 	110 	72. 	H 	Upper case H
    [49] 	111 	73. 	I 	Upper case I
    [4A] 	112 	74. 	J 	Upper case J
    [4B] 	113 	75. 	K 	Upper case K
    [4C] 	114 	76. 	L 	Upper case L
    [4D] 	115 	77. 	M 	Upper case M
    [4E] 	116 	78. 	N 	Upper case N
    [4F] 	117 	79. 	O 	Upper case O
    [50] 	120 	80. 	P 	Upper case P
    [51] 	121 	81. 	Q 	Upper case Q
    [52] 	122 	82. 	R 	Upper case R
    [53] 	123 	83. 	S 	Upper case S
    [54] 	124 	84. 	T 	Upper case T
    [55] 	125 	85. 	U 	Upper case U
    [56] 	126 	86. 	V 	Upper case V
    [57] 	127 	87. 	W 	Upper case W
    [58] 	130 	88. 	X 	Upper case X
    [59] 	131 	89. 	Y 	Upper case Y
    [5A] 	132 	90. 	Z 	Upper case Z
    [5B] 	133 	91. 	[ 	Opening bracket
    [5C] 	134 	92. 	\ 	Backslash, reverse slant
    [5D] 	135 	93. 	] 	Closing bracket
    [5E] 	136 	94. 	^ 	Circumflex, caret, uparrow
    [5F] 	137 	95. 	_ 	Underline, underscore
    [60] 	140 	96. 	` 	Accent grave
    [61] 	141 	97. 	a 	Lower case a
    [62] 	142 	98. 	b 	Lower case b
    [63] 	143 	99. 	c 	Lower case c
    [64] 	144 	100. 	d 	Lower case d
    [65] 	145 	101. 	e 	Lower case e
    [66] 	146 	102. 	f 	Lower case f
    [67] 	147 	103. 	g 	Lower case g
    [68] 	150 	104. 	h 	Lower case h
    [69] 	151 	105. 	i 	Lower case i
    [6A] 	152 	106. 	j 	Lower case j
    [6B] 	153 	107. 	k 	Lower case k
    [6C] 	154 	108. 	l 	Lower case l
    [6D] 	155 	109. 	m 	Lower case m
    [6E] 	156 	110. 	n 	Lower case n
    [6F] 	157 	111. 	o 	Lower case o
    [70] 	160 	112. 	p 	Lower case p
    [71] 	161 	113. 	q 	Lower case q
    [72] 	162 	114. 	r 	Lower case r
    [73] 	163 	115. 	s 	Lower case s
    [74] 	164 	116. 	t 	Lower case t
    [75] 	165 	117. 	u 	Lower case u
    [76] 	166 	118. 	v 	Lower case v
    [77] 	167 	119. 	w 	Lower case w
    [78] 	170 	120. 	x 	Lower case x
    [79] 	171 	121. 	y 	Lower case y
    [7A] 	172 	122. 	z 	Lower case z
    [7B] 	173 	123. 	{ 	Opening brace
    [7C] 	174 	124. 	| 	Vertical line
    [7D] 	175 	125. 	} 	Closing brace
    [7E] 	176 	126. 	~ 	Tilde, overline, high line
    [7F] 	177 	127. 	<DEL> 	Delete, rubout
    [80] 	200 	128. 	  	 
    [81] 	201 	129. 	  	 
    [82] 	202 	130. 	  	 
    [83] 	203 	131. 	  	 
    [84] 	204 	132. 	<IND> 	 
    [85] 	205 	133. 	<NEL> 	 
    [86] 	206 	134. 	<SSA> 	 
    [87] 	207 	135. 	<ESA> 	 
    [88] 	210 	136. 	<HTS> 	 
    [89] 	211 	137. 	<HTJ> 	 
    [8A] 	212 	138. 	<VTS> 	 
    [8B] 	213 	139. 	<PLD> 	 
    [8C] 	214 	140. 	<PLU> 	 
    [8D] 	215 	141. 	<RI> 	 
    [8E] 	216 	142. 	<SS2> 	 
    [8F] 	217 	143. 	<SS3> 	 
    [90] 	220 	144. 	<DCS> 	 
    [91] 	221 	145. 	<PU1> 	 
    [92] 	222 	146. 	<PU2> 	 
    [93] 	223 	147. 	<STS> 	 
    [94] 	224 	148. 	<CCH> 	 
    [95] 	225 	149. 	<MW> 	 
    [96] 	226 	150. 	<SPA> 	 
    [97] 	227 	151. 	<EPA> 	 
    [98] 	230 	152. 	  	 
    [99] 	231 	153. 	  	 
    [9A] 	232 	154. 	  	 
    [9B] 	233 	155. 	<CSI> 	 
    [9C] 	234 	156. 	<ST> 	 
    [9D] 	235 	157. 	<OSC> 	 
    [9E] 	236 	158. 	<PM> 	 
    [9F] 	237 	159. 	<APC> 	 
    [A0] 	240 	160. 	  	-
    [A1] 	241 	161. 	  	Inverted exclamation mark
    [A2] 	242 	162. 	  	Cent sign
    [A3] 	243 	163. 	  	Pound sterling sign
    [A4] 	244 	164. 	  	reserved for future
    [A5] 	245 	165. 	  	Yen
    [A6] 	246 	166. 	  	reserved for future
    [A7] 	247 	167. 	  	Section sign
    [A8] 	250 	168. 	  	General currency sign
    [A9] 	251 	169. 	  	Copyright sign
    [AA] 	252 	170. 	  	Feminine ordinal indicator
    [AB] 	253 	171. 	  	Angle quotation mark left
    [AC] 	254 	172. 	  	reserved for future
    [AD] 	255 	173. 	  	reserved for future
    [AE] 	256 	174. 	  	reserved for future
    [AF] 	257 	175. 	  	reserved for future
    [B0] 	260 	176. 	  	Degree sign
    [B1] 	261 	177. 	  	Plus/minus sign
    [B2] 	262 	178. 	  	Superscript 2
    [B3] 	263 	179. 	  	Superscript 3
    [B4] 	264 	180. 	  	reserved for future
    [B5] 	265 	181. 	  	Micro sign, Greek mu
    [B6] 	266 	182. 	  	Paragraph sign, pilcrow
    [B7] 	267 	183. 	  	Middle dot
    [B8] 	270 	184. 	  	reserved for future
    [B9] 	271 	185. 	  	Superscript 1
    [BA] 	272 	186. 	  	Masculine ordinal indicator
    [BB] 	273 	187. 	  	Angle quotation mark right
    [BC] 	274 	188. 	  	Fraction one quarter
    [BD] 	275 	189. 	  	Fraction one half
    [BE] 	276 	190. 	  	reserved for future
    [BF] 	277 	191. 	  	Inverted question mark
    [C0] 	300 	192. 	  	A with accent grave
    [C1] 	301 	193. 	  	A with accent aigu (acute)
    [C2] 	302 	194. 	  	A with circumflex
    [C3] 	303 	195. 	  	A with tilde
    [C4] 	304 	196. 	  	A with umlaut (diaeresis)
    [C5] 	305 	197. 	  	A with ring
    [C6] 	306 	198. 	  	AE ligature
    [C7] 	307 	199. 	  	C with cedilla
    [C8] 	310 	200. 	  	E with accent grave
    [C9] 	311 	201. 	  	E with accent aigu (acute)
    [CA] 	312 	202. 	  	E with circumflex
    [CB] 	313 	203. 	  	E with umlaut (diaeresis)
    [CC] 	314 	204. 	  	I with accent grave
    [CD] 	315 	205. 	  	I with accent aigu (acute)
    [CE] 	316 	206. 	  	I with circumflex
    [CF] 	317 	207. 	  	I with umlaut (diaeresis)
    [D0] 	320 	208. 	  	reserved for future
    [D1] 	321 	209. 	  	N with tilde
    [D2] 	322 	210. 	  	O with accent grave
    [D3] 	323 	211. 	  	O with accent aigu (acute)
    [D4] 	324 	212. 	  	O with circumflex
    [D5] 	325 	213. 	  	O with tilde
    [D6] 	326 	214. 	  	O with umlaut (diaeresis)
    [D7] 	327 	215. 	  	OE ligature
    [D8] 	330 	216. 	  	O (letter) with slash
    [D9] 	331 	217. 	  	U with accent grave
    [DA] 	332 	218. 	  	U with accent aigu (acute)
    [DB] 	333 	219. 	  	U with circumflex
    [DC] 	334 	220. 	  	U with umlaut (diaeresis)
    [DD] 	335 	221. 	  	Y with umlaut (diaeresis)
    [DE] 	336 	222. 	  	reserved for future
    [DF] 	337 	223. 	  	German sharp s
    [E0] 	340 	224. 	  	a with accent grave
    [E1] 	341 	225. 	  	a with accent aigu (acute)
    [E2] 	342 	226. 	  	a with circumflex
    [E3] 	343 	227. 	  	a with tilde
    [E4] 	344 	228. 	  	a with umlaut (diaeresis)
    [E5] 	345 	229. 	  	a with ring
    [E6] 	346 	230. 	  	ae ligature
    [E7] 	347 	231. 	  	c with cedilla
    [E8] 	350 	232. 	  	e with accent grave
    [E9] 	351 	233. 	  	e with accent aigu (acute)
    [EA] 	352 	234. 	  	e with circumflex
    [EB] 	353 	235. 	  	e with umlaut (diaeresis)
    [EC] 	354 	236. 	  	i with accent grave
    [ED] 	355 	237. 	  	i with accent aigu (acute)
    [EE] 	356 	238. 	  	i with circumflex
    [EF] 	357 	239. 	  	i with umlaut (diaeresis)
    [F0] 	360 	240. 	  	reserved for future
    [F1] 	361 	241. 	  	n with tilde
    [F2] 	362 	242. 	  	o with accent grave
    [F3] 	363 	243. 	  	o with accent aigu (acute)
    [F4] 	364 	244. 	  	o with circumflex
    [F5] 	365 	245. 	  	o with tilde
    [F6] 	366 	246. 	  	o with umlaut (diaeresis)
    [F7] 	367 	247. 	  	oe ligature
    [F8] 	370 	248. 	  	o (letter) with slash
    [F9] 	371 	249. 	  	u with accent grave
    [FA] 	372 	250. 	  	u with accent aigu (acute)
    [FB] 	373 	251. 	  	u with circumflex
    [FC] 	374 	252. 	  	u with umlaut (diaeresis)
    [FD] 	375 	253. 	  	y with umlaut (diaeresis)
    [FE] 	376 	254. 	  	reserved for future
    [FF] 	377 	255. 	  	-
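
The Dec column lines up with the number typed after Alt+0 on Windows (201 for É), so the padded octal form can be derived from it directly. A small sketch, assuming a POSIX shell; alt_to_echo_escape is a hypothetical name, and whether echo interprets \0nnn depends on the shell, so printf '\nnn' is the more portable way to emit the byte.

```shell
#!/bin/sh
# alt_to_echo_escape (hypothetical helper): convert a decimal code,
# as used with Alt+0nnn on Windows, to the \0nnn octal escape form.
alt_to_echo_escape() {
    printf '\\0%03o\n' "$1"
}

alt_to_echo_escape 201   # prints \0311 (E with acute in the table)
alt_to_echo_escape 233   # prints \0351 (e with acute in the table)
```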

Mike

"A foolproof method for sculpting an elephant: first, get a huge block of marble, then you chip away everything that doesn't look like an elephant."

 