CESU-8 Charset Encoder Test


Character Description Encoded (hex bytes)
NULL (U+0000) 00
 START OF HEADING (U+0001) 01
 START OF TEXT (U+0002) 02
 END OF TEXT (U+0003) 03
 END OF TRANSMISSION (U+0004) 04
 ENQUIRY (U+0005) 05
 ACKNOWLEDGE (U+0006) 06
 BELL (U+0007) 07
 BACKSPACE (U+0008) 08
CHARACTER TABULATION (U+0009) 09
LINE FEED (LF) (U+000A) 0a
LINE TABULATION (U+000B) 0b
FORM FEED (FF) (U+000C) 0c
CARRIAGE RETURN (CR) (U+000D) 0d
 SHIFT OUT (U+000E) 0e
 SHIFT IN (U+000F) 0f
 DATA LINK ESCAPE (U+0010) 10
 DEVICE CONTROL ONE (U+0011) 11
 DEVICE CONTROL TWO (U+0012) 12
 DEVICE CONTROL THREE (U+0013) 13
 DEVICE CONTROL FOUR (U+0014) 14
 NEGATIVE ACKNOWLEDGE (U+0015) 15
 SYNCHRONOUS IDLE (U+0016) 16
 END OF TRANSMISSION BLOCK (U+0017) 17
 CANCEL (U+0018) 18
 END OF MEDIUM (U+0019) 19
 SUBSTITUTE (U+001A) 1a
 ESCAPE (U+001B) 1b
 INFORMATION SEPARATOR FOUR (U+001C) 1c
 INFORMATION SEPARATOR THREE (U+001D) 1d
 INFORMATION SEPARATOR TWO (U+001E) 1e
 INFORMATION SEPARATOR ONE (U+001F) 1f
SPACE (U+0020) 20
! EXCLAMATION MARK (U+0021) 21
" QUOTATION MARK (U+0022) 22
# NUMBER SIGN (U+0023) 23
$ DOLLAR SIGN (U+0024) 24
% PERCENT SIGN (U+0025) 25
& AMPERSAND (U+0026) 26
' APOSTROPHE (U+0027) 27
( LEFT PARENTHESIS (U+0028) 28
) RIGHT PARENTHESIS (U+0029) 29
* ASTERISK (U+002A) 2a
+ PLUS SIGN (U+002B) 2b
, COMMA (U+002C) 2c
- HYPHEN-MINUS (U+002D) 2d
. FULL STOP (U+002E) 2e
/ SOLIDUS (U+002F) 2f
0 DIGIT ZERO (U+0030) 30
1 DIGIT ONE (U+0031) 31
2 DIGIT TWO (U+0032) 32
3 DIGIT THREE (U+0033) 33
4 DIGIT FOUR (U+0034) 34
5 DIGIT FIVE (U+0035) 35
6 DIGIT SIX (U+0036) 36
7 DIGIT SEVEN (U+0037) 37
8 DIGIT EIGHT (U+0038) 38
9 DIGIT NINE (U+0039) 39
: COLON (U+003A) 3a
; SEMICOLON (U+003B) 3b
< LESS-THAN SIGN (U+003C) 3c
= EQUALS SIGN (U+003D) 3d
> GREATER-THAN SIGN (U+003E) 3e
? QUESTION MARK (U+003F) 3f
@ COMMERCIAL AT (U+0040) 40
A LATIN CAPITAL LETTER A (U+0041) 41
B LATIN CAPITAL LETTER B (U+0042) 42
C LATIN CAPITAL LETTER C (U+0043) 43
D LATIN CAPITAL LETTER D (U+0044) 44
E LATIN CAPITAL LETTER E (U+0045) 45
F LATIN CAPITAL LETTER F (U+0046) 46
G LATIN CAPITAL LETTER G (U+0047) 47
H LATIN CAPITAL LETTER H (U+0048) 48
I LATIN CAPITAL LETTER I (U+0049) 49
J LATIN CAPITAL LETTER J (U+004A) 4a
K LATIN CAPITAL LETTER K (U+004B) 4b
L LATIN CAPITAL LETTER L (U+004C) 4c
M LATIN CAPITAL LETTER M (U+004D) 4d
N LATIN CAPITAL LETTER N (U+004E) 4e
O LATIN CAPITAL LETTER O (U+004F) 4f
P LATIN CAPITAL LETTER P (U+0050) 50
Q LATIN CAPITAL LETTER Q (U+0051) 51
R LATIN CAPITAL LETTER R (U+0052) 52
S LATIN CAPITAL LETTER S (U+0053) 53
T LATIN CAPITAL LETTER T (U+0054) 54
U LATIN CAPITAL LETTER U (U+0055) 55
V LATIN CAPITAL LETTER V (U+0056) 56
W LATIN CAPITAL LETTER W (U+0057) 57
X LATIN CAPITAL LETTER X (U+0058) 58
Y LATIN CAPITAL LETTER Y (U+0059) 59
Z LATIN CAPITAL LETTER Z (U+005A) 5a
[ LEFT SQUARE BRACKET (U+005B) 5b
\ REVERSE SOLIDUS (U+005C) 5c
] RIGHT SQUARE BRACKET (U+005D) 5d
^ CIRCUMFLEX ACCENT (U+005E) 5e
_ LOW LINE (U+005F) 5f
` GRAVE ACCENT (U+0060) 60
a LATIN SMALL LETTER A (U+0061) 61
b LATIN SMALL LETTER B (U+0062) 62
c LATIN SMALL LETTER C (U+0063) 63
d LATIN SMALL LETTER D (U+0064) 64
e LATIN SMALL LETTER E (U+0065) 65
f LATIN SMALL LETTER F (U+0066) 66
g LATIN SMALL LETTER G (U+0067) 67
h LATIN SMALL LETTER H (U+0068) 68
i LATIN SMALL LETTER I (U+0069) 69
j LATIN SMALL LETTER J (U+006A) 6a
k LATIN SMALL LETTER K (U+006B) 6b
l LATIN SMALL LETTER L (U+006C) 6c
m LATIN SMALL LETTER M (U+006D) 6d
n LATIN SMALL LETTER N (U+006E) 6e
o LATIN SMALL LETTER O (U+006F) 6f
p LATIN SMALL LETTER P (U+0070) 70
q LATIN SMALL LETTER Q (U+0071) 71
r LATIN SMALL LETTER R (U+0072) 72
s LATIN SMALL LETTER S (U+0073) 73
t LATIN SMALL LETTER T (U+0074) 74
u LATIN SMALL LETTER U (U+0075) 75
v LATIN SMALL LETTER V (U+0076) 76
w LATIN SMALL LETTER W (U+0077) 77
x LATIN SMALL LETTER X (U+0078) 78
y LATIN SMALL LETTER Y (U+0079) 79
z LATIN SMALL LETTER Z (U+007A) 7a
{ LEFT CURLY BRACKET (U+007B) 7b
| VERTICAL LINE (U+007C) 7c
} RIGHT CURLY BRACKET (U+007D) 7d
~ TILDE (U+007E) 7e
 DELETE (U+007F) 7f
<control> (U+0080) c280
 <control> (U+0081) c281
BREAK PERMITTED HERE (U+0082) c282
ƒ NO BREAK HERE (U+0083) c283
<control> (U+0084) c284
NEXT LINE (NEL) (U+0085) c285
START OF SELECTED AREA (U+0086) c286
END OF SELECTED AREA (U+0087) c287
ˆ CHARACTER TABULATION SET (U+0088) c288
CHARACTER TABULATION WITH JUSTIFICATION (U+0089) c289
Š LINE TABULATION SET (U+008A) c28a
PARTIAL LINE FORWARD (U+008B) c28b
ΠPARTIAL LINE BACKWARD (U+008C) c28c
 REVERSE LINE FEED (U+008D) c28d
Ž SINGLE SHIFT TWO (U+008E) c28e
 SINGLE SHIFT THREE (U+008F) c28f
 DEVICE CONTROL STRING (U+0090) c290
PRIVATE USE ONE (U+0091) c291
PRIVATE USE TWO (U+0092) c292
SET TRANSMIT STATE (U+0093) c293
CANCEL CHARACTER (U+0094) c294
MESSAGE WAITING (U+0095) c295
START OF GUARDED AREA (U+0096) c296
END OF GUARDED AREA (U+0097) c297
˜ START OF STRING (U+0098) c298
<control> (U+0099) c299
š SINGLE CHARACTER INTRODUCER (U+009A) c29a
CONTROL SEQUENCE INTRODUCER (U+009B) c29b
œ STRING TERMINATOR (U+009C) c29c
 OPERATING SYSTEM COMMAND (U+009D) c29d
ž PRIVACY MESSAGE (U+009E) c29e
Ÿ APPLICATION PROGRAM COMMAND (U+009F) c29f
  NO-BREAK SPACE (U+00A0) c2a0
¡ INVERTED EXCLAMATION MARK (U+00A1) c2a1
¢ CENT SIGN (U+00A2) c2a2
£ POUND SIGN (U+00A3) c2a3
¤ CURRENCY SIGN (U+00A4) c2a4
¥ YEN SIGN (U+00A5) c2a5
¦ BROKEN BAR (U+00A6) c2a6
§ SECTION SIGN (U+00A7) c2a7
¨ DIAERESIS (U+00A8) c2a8
© COPYRIGHT SIGN (U+00A9) c2a9
ª FEMININE ORDINAL INDICATOR (U+00AA) c2aa
« LEFT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00AB) c2ab
¬ NOT SIGN (U+00AC) c2ac
­ SOFT HYPHEN (U+00AD) c2ad
® REGISTERED SIGN (U+00AE) c2ae
¯ MACRON (U+00AF) c2af
° DEGREE SIGN (U+00B0) c2b0
± PLUS-MINUS SIGN (U+00B1) c2b1
² SUPERSCRIPT TWO (U+00B2) c2b2
³ SUPERSCRIPT THREE (U+00B3) c2b3
´ ACUTE ACCENT (U+00B4) c2b4
µ MICRO SIGN (U+00B5) c2b5
PILCROW SIGN (U+00B6) c2b6
· MIDDLE DOT (U+00B7) c2b7
¸ CEDILLA (U+00B8) c2b8
¹ SUPERSCRIPT ONE (U+00B9) c2b9
º MASCULINE ORDINAL INDICATOR (U+00BA) c2ba
» RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK (U+00BB) c2bb
¼ VULGAR FRACTION ONE QUARTER (U+00BC) c2bc
½ VULGAR FRACTION ONE HALF (U+00BD) c2bd
¾ VULGAR FRACTION THREE QUARTERS (U+00BE) c2be
¿ INVERTED QUESTION MARK (U+00BF) c2bf
À LATIN CAPITAL LETTER A WITH GRAVE (U+00C0) c380
Á LATIN CAPITAL LETTER A WITH ACUTE (U+00C1) c381
 LATIN CAPITAL LETTER A WITH CIRCUMFLEX (U+00C2) c382
à LATIN CAPITAL LETTER A WITH TILDE (U+00C3) c383
Ä LATIN CAPITAL LETTER A WITH DIAERESIS (U+00C4) c384
Å LATIN CAPITAL LETTER A WITH RING ABOVE (U+00C5) c385
Æ LATIN CAPITAL LETTER AE (U+00C6) c386
Ç LATIN CAPITAL LETTER C WITH CEDILLA (U+00C7) c387
È LATIN CAPITAL LETTER E WITH GRAVE (U+00C8) c388
É LATIN CAPITAL LETTER E WITH ACUTE (U+00C9) c389
Ê LATIN CAPITAL LETTER E WITH CIRCUMFLEX (U+00CA) c38a
Ë LATIN CAPITAL LETTER E WITH DIAERESIS (U+00CB) c38b
Ì LATIN CAPITAL LETTER I WITH GRAVE (U+00CC) c38c
Í LATIN CAPITAL LETTER I WITH ACUTE (U+00CD) c38d
Î LATIN CAPITAL LETTER I WITH CIRCUMFLEX (U+00CE) c38e
Ï LATIN CAPITAL LETTER I WITH DIAERESIS (U+00CF) c38f
Ð LATIN CAPITAL LETTER ETH (U+00D0) c390
Ñ LATIN CAPITAL LETTER N WITH TILDE (U+00D1) c391
Ò LATIN CAPITAL LETTER O WITH GRAVE (U+00D2) c392
Ó LATIN CAPITAL LETTER O WITH ACUTE (U+00D3) c393
Ô LATIN CAPITAL LETTER O WITH CIRCUMFLEX (U+00D4) c394
Õ LATIN CAPITAL LETTER O WITH TILDE (U+00D5) c395
Ö LATIN CAPITAL LETTER O WITH DIAERESIS (U+00D6) c396
× MULTIPLICATION SIGN (U+00D7) c397
Ø LATIN CAPITAL LETTER O WITH STROKE (U+00D8) c398
Ù LATIN CAPITAL LETTER U WITH GRAVE (U+00D9) c399
Ú LATIN CAPITAL LETTER U WITH ACUTE (U+00DA) c39a
Û LATIN CAPITAL LETTER U WITH CIRCUMFLEX (U+00DB) c39b
Ü LATIN CAPITAL LETTER U WITH DIAERESIS (U+00DC) c39c
Ý LATIN CAPITAL LETTER Y WITH ACUTE (U+00DD) c39d
Þ LATIN CAPITAL LETTER THORN (U+00DE) c39e
ß LATIN SMALL LETTER SHARP S (U+00DF) c39f
à LATIN SMALL LETTER A WITH GRAVE (U+00E0) c3a0
á LATIN SMALL LETTER A WITH ACUTE (U+00E1) c3a1
â LATIN SMALL LETTER A WITH CIRCUMFLEX (U+00E2) c3a2
ã LATIN SMALL LETTER A WITH TILDE (U+00E3) c3a3
ä LATIN SMALL LETTER A WITH DIAERESIS (U+00E4) c3a4
å LATIN SMALL LETTER A WITH RING ABOVE (U+00E5) c3a5
æ LATIN SMALL LETTER AE (U+00E6) c3a6
ç LATIN SMALL LETTER C WITH CEDILLA (U+00E7) c3a7
è LATIN SMALL LETTER E WITH GRAVE (U+00E8) c3a8
é LATIN SMALL LETTER E WITH ACUTE (U+00E9) c3a9
ê LATIN SMALL LETTER E WITH CIRCUMFLEX (U+00EA) c3aa
ë LATIN SMALL LETTER E WITH DIAERESIS (U+00EB) c3ab
ì LATIN SMALL LETTER I WITH GRAVE (U+00EC) c3ac
í LATIN SMALL LETTER I WITH ACUTE (U+00ED) c3ad
î LATIN SMALL LETTER I WITH CIRCUMFLEX (U+00EE) c3ae
ï LATIN SMALL LETTER I WITH DIAERESIS (U+00EF) c3af
ð LATIN SMALL LETTER ETH (U+00F0) c3b0
ñ LATIN SMALL LETTER N WITH TILDE (U+00F1) c3b1
ò LATIN SMALL LETTER O WITH GRAVE (U+00F2) c3b2
ó LATIN SMALL LETTER O WITH ACUTE (U+00F3) c3b3
ô LATIN SMALL LETTER O WITH CIRCUMFLEX (U+00F4) c3b4
õ LATIN SMALL LETTER O WITH TILDE (U+00F5) c3b5
ö LATIN SMALL LETTER O WITH DIAERESIS (U+00F6) c3b6
÷ DIVISION SIGN (U+00F7) c3b7
ø LATIN SMALL LETTER O WITH STROKE (U+00F8) c3b8
ù LATIN SMALL LETTER U WITH GRAVE (U+00F9) c3b9
ú LATIN SMALL LETTER U WITH ACUTE (U+00FA) c3ba
û LATIN SMALL LETTER U WITH CIRCUMFLEX (U+00FB) c3bb
ü LATIN SMALL LETTER U WITH DIAERESIS (U+00FC) c3bc
ý LATIN SMALL LETTER Y WITH ACUTE (U+00FD) c3bd
þ LATIN SMALL LETTER THORN (U+00FE) c3be
ÿ LATIN SMALL LETTER Y WITH DIAERESIS (U+00FF) c3bf