Character Sets

Character Sets


This is the list of the characters sets (type=java.nio.charset.Charset) that are available here.

Also check the list by code page number.

For help figuring out which character set a file is using, try the Reverse Charset Mapping Tool.


Name Notes Aliases
Big5 csBig5
Big5-HKSCS big5-hkscs, big5hk, Big5_HKSCS, big5hkscs
EUC-JP csEUCPkdFmtjapanese, x-euc-jp, eucjis, Extended_UNIX_Code_Packed_Format_for_Japanese, euc_jp
eucjp, x-eucjp
EUC-KR ksc5601-1987, csEUCKR, ksc5601_1987, ksc5601, 5601
euc_kr, ksc_5601, ks_c_5601-1987, euckr
GB18030 gb18030-2000
GB2312 gb2312, euc-cn, x-EUC-CN, euccn, EUC_CN
gb2312-80, gb2312-1980
GBK CP936, windows-936
IBM-Thai ibm-838, ibm838, 838, cp838
IBM00858 cp858, 858, PC-Multilingual-850+euro, cp00858, ccsid00858
IBM01140 cp1140, 1140, cp01140, ebcdic-us-037+euro, ccsid01140
IBM01141 1141, cp1141, cp01141, ccsid01141, ebcdic-de-273+euro
IBM01142 1142, cp1142, cp01142, ccsid01142, ebcdic-no-277+euro
IBM01143 1143, cp01143, ccsid01143, cp1143, ebcdic-fi-278+euro
IBM01144 cp01144, ccsid01144, ebcdic-it-280+euro, cp1144, 1144
IBM01145 ccsid01145, ebcdic-es-284+euro, 1145, cp1145, cp01145
IBM01146 ebcdic-gb-285+euro, 1146, cp1146, cp01146, ccsid01146
IBM01147 cp1147, 1147, cp01147, ccsid01147, ebcdic-fr-277+euro
IBM01148 cp1148, ebcdic-international-500+euro, 1148, cp01148, ccsid01148
IBM01149 ebcdic-s-871+euro, 1149, cp1149, cp01149, ccsid01149
IBM037 cp037, ibm037, ibm-037, csIBM037, ebcdic-cp-us
ebcdic-cp-ca, ebcdic-cp-nl, ebcdic-cp-wt, 037, cpibm37
cs-ebcdic-cp-wt, ibm-37, cs-ebcdic-cp-us, cs-ebcdic-cp-ca, cs-ebcdic-cp-nl
IBM1026 cp1026, ibm-1026, 1026, ibm1026
IBM1047 ibm-1047, 1047, cp1047
IBM273 ibm-273, ibm273, 273, cp273
IBM277 ibm277, 277, cp277, ibm-277
IBM278 cp278, 278, ibm-278, ebcdic-cp-se, csIBM278
ibm278, ebcdic-sv
IBM280 ibm280, 280, cp280, ibm-280
IBM284 csIBM284, ibm-284, cpibm284, ibm284, 284
IBM285 csIBM285, cp285, ebcdic-gb, ibm-285, cpibm285
ibm285, 285, ebcdic-cp-gb
IBM290 ibm290, 290, cp290, EBCDIC-JP-kana, csIBM290
IBM297 297, csIBM297, cp297, ibm297, ibm-297
cpibm297, ebcdic-cp-fr
IBM420 ibm420, 420, cp420, csIBM420, ibm-420
IBM424 ebcdic-cp-he, csIBM424, ibm-424, ibm424, 424
IBM437 ibm437, 437, ibm-437, cspc8codepage437, cp437
IBM500 ibm-500, ibm500, 500, ebcdic-cp-bh, ebcdic-cp-ch
csIBM500, cp500
IBM775 ibm-775, ibm775, 775, cp775
IBM850 cp850, cspc850multilingual, ibm850, 850, ibm-850
IBM852 csPCp852, ibm-852, ibm852, 852, cp852
IBM855 ibm855, 855, ibm-855, cp855, cspcp855
IBM857 ibm857, 857, cp857, csIBM857, ibm-857
IBM860 ibm860, 860, cp860, csIBM860, ibm-860
IBM861 cp861, ibm861, 861, ibm-861, cp-is
IBM862 csIBM862, cp862, ibm862, 862, cspc862latinhebrew
IBM863 csIBM863, ibm-863, ibm863, 863, cp863
IBM864 csIBM864, ibm-864, ibm864, 864, cp864
IBM865 ibm-865, csIBM865, cp865, ibm865, 865
IBM866 ibm866, 866, ibm-866, csIBM866, cp866
IBM868 ibm868, 868, cp868, csIBM868, ibm-868
IBM869 cp869, ibm869, 869, ibm-869, cp-gr
IBM870 870, cp870, csIBM870, ibm-870, ibm870
ebcdic-cp-roece, ebcdic-cp-yu
IBM871 ibm871, 871, cp871, ebcdic-cp-is, csIBM871
IBM918 918, ibm-918, ebcdic-cp-ar2, cp918
ISO-2022-CN (no encoder) csISO2022CN, ISO2022CN
ISO-2022-JP csjisencoding, iso2022jp, jis_encoding, jis, csISO2022JP
ISO-2022-JP-2 csISO2022JP2, iso2022jp2
ISO-2022-KR csISO2022KR, ISO2022KR
ISO-8859-1 819, ISO8859-1, l1, ISO_8859-1:1987, ISO_8859-1
8859_1, iso-ir-100, latin1, cp819, ISO8859_1
IBM819, ISO_8859_1, IBM-819, csISOLatin1
ISO-8859-13 iso_8859-13, ISO8859-13, iso8859_13, 8859_13
ISO-8859-15 ISO8859-15, LATIN0, ISO8859_15_FDIS, ISO8859_15, cp923
8859_15, L9, ISO-8859-15, IBM923, csISOlatin9
ISO_8859-15, IBM-923, csISOlatin0, 923, LATIN9
ISO-8859-2 ISO8859-2, ibm912, l2, ISO_8859-2, 8859_2
cp912, ISO_8859-2:1987, iso8859_2, iso-ir-101, latin2
912, csISOLatin2, ibm-912
ISO-8859-3 ISO8859-3, ibm913, 8859_3, l3, cp913
ISO_8859-3, iso8859_3, latin3, csISOLatin3, 913
ISO_8859-3:1988, ibm-913, iso-ir-109
ISO-8859-4 8859_4, latin4, l4, cp914, ISO_8859-4:1988
ibm914, ISO_8859-4, iso-ir-110, iso8859_4, csISOLatin4
iso8859-4, 914, ibm-914
ISO-8859-5 ISO_8859-5:1988, csISOLatinCyrillic, iso-ir-144, iso8859_5, cp915
8859_5, ibm-915, ISO_8859-5, ibm915, 915
cyrillic, ISO8859-5
ISO-8859-6 ASMO-708, 8859_6, iso8859_6, ISO_8859-6, csISOLatinArabic
ibm1089, arabic, ibm-1089, 1089, ECMA-114
iso-ir-127, ISO_8859-6:1987, ISO8859-6, cp1089
ISO-8859-7 greek, 8859_7, greek8, ibm813, ISO_8859-7
iso8859_7, ELOT_928, cp813, ISO_8859-7:1987, sun_eu_greek
csISOLatinGreek, iso-ir-126, 813, iso8859-7, ECMA-118
ISO-8859-8 8859_8, ISO_8859-8, ISO_8859-8:1988, cp916, iso-ir-138
ISO8859-8, hebrew, iso8859_8, ibm-916, csISOLatinHebrew
916, ibm916
ISO-8859-9 ibm-920, ISO_8859-9, 8859_9, ISO_8859-9:1989, ibm920
latin5, l5, iso8859_9, cp920, 920
iso-ir-148, ISO8859-9, csISOLatin5
JIS_X0201 JIS0201, csHalfWidthKatakana, X0201, JIS_X0201
JIS_X0212-1990 JIS0212, iso-ir-159, x0212, jis_x0212-1990, csISO159JISX02121990
KOI8-R koi8_r, koi8, cskoi8r
KOI8-U koi8_u
Shift_JIS shift_jis, x-sjis, sjis, shift-jis, ms_kanji
TIS-620 tis620, tis620.2533
US-ASCII ANSI_X3.4-1968, cp367, csASCII, iso-ir-6, ASCII
iso_646.irv:1983, ANSI_X3.4-1986, ascii7, default, ISO_646.irv:1991
ISO646-US, IBM367, 646, us
UTF-16 UTF_16, unicode, utf16, UnicodeBig
UTF-16BE X-UTF-16BE, UTF_16BE, ISO-10646-UCS-2, UnicodeBigUnmarked
UTF-16LE UnicodeLittleUnmarked, UTF_16LE, X-UTF-16LE
UTF-32 UTF_32, UTF32
UTF-8 unicode-1-1-utf-8, UTF8
windows-1250 cp1250, cp5346
windows-1251 cp5347, ansi-1251, cp1251
windows-1252 cp5348, cp1252
windows-1253 cp1253, cp5349
windows-1254 cp1254, cp5350
windows-1255 cp1255
windows-1256 cp1256
windows-1257 cp1257, cp5353
windows-1258 cp1258
windows-31j MS932, windows-932, csWindows31J
x-Big5-HKSCS-2001 Big5_HKSCS_2001, big5-hkscs-2001, big5hk-2001, big5-hkscs:unicode3.0, big5hkscs-2001
x-Big5-Solaris Big5_Solaris
x-COMPOUND_TEXT COMPOUND_TEXT, x-compound-text, x11-compound_text
x-euc-jp-linux euc_jp_linux, euc-jp-linux
x-EUC-TW euctw, cns11643, EUC-TW, euc_tw
x-eucJP-Open eucJP-open, EUC_JP_Solaris
x-IBM1006 ibm1006, ibm-1006, 1006, cp1006
x-IBM1025 ibm-1025, 1025, cp1025, ibm1025
x-IBM1046 ibm1046, ibm-1046, 1046, cp1046
x-IBM1097 ibm1097, ibm-1097, 1097, cp1097
x-IBM1098 ibm-1098, 1098, cp1098, ibm1098
x-IBM1112 ibm1112, ibm-1112, 1112, cp1112
x-IBM1122 cp1122, ibm1122, ibm-1122, 1122
x-IBM1123 ibm1123, ibm-1123, 1123, cp1123
x-IBM1124 ibm-1124, 1124, cp1124, ibm1124
x-IBM1166 cp1166, ibm1166, ibm-1166, 1166
x-IBM1364 cp1364, ibm1364, ibm-1364, 1364
x-IBM1381 cp1381, ibm-1381, 1381, ibm1381
x-IBM1383 ibm1383, ibm-1383, 1383, cp1383
x-IBM300 cp300, ibm300, 300, ibm-300
x-IBM33722 33722, ibm-33722, cp33722, ibm33722, ibm-5050
x-IBM737 cp737, ibm737, 737, ibm-737
x-IBM833 ibm833, cp833, ibm-833
x-IBM834 ibm834, 834, cp834, ibm-834
x-IBM856 ibm856, 856, cp856, ibm-856
x-IBM874 ibm-874, ibm874, 874, cp874
x-IBM875 ibm-875, ibm875, 875, cp875
x-IBM921 ibm921, 921, ibm-921, cp921
x-IBM922 ibm922, 922, cp922, ibm-922
x-IBM930 ibm-930, ibm930, 930, cp930
x-IBM933 ibm933, 933, cp933, ibm-933
x-IBM935 cp935, ibm935, 935, ibm-935
x-IBM937 ibm-937, ibm937, 937, cp937
x-IBM939 ibm-939, cp939, ibm939, 939
x-IBM942 ibm-942, cp942, ibm942, 942
x-IBM942C ibm942C, cp942C, ibm-942C, 942C
x-IBM943 ibm943, 943, ibm-943, cp943
x-IBM943C 943C, cp943C, ibm943C, ibm-943C
x-IBM948 ibm-948, ibm948, 948, cp948
x-IBM949 ibm-949, ibm949, 949, cp949
x-IBM949C ibm949C, ibm-949C, cp949C, 949C
x-IBM950 cp950, ibm950, 950, ibm-950
x-IBM964 ibm-964, cp964, ibm964, 964
x-IBM970 ibm970, ibm-eucKR, 970, cp970, ibm-970
x-ISCII91 ISCII91, iso-ir-153, iscii, ST_SEV_358-88, csISO153GOST1976874
x-ISO-2022-CN-CNS ISO2022CN_CNS, ISO-2022-CN-CNS
x-ISO-2022-CN-GB ISO2022CN_GB, ISO-2022-CN-GB
x-iso-8859-11 iso-8859-11, iso8859_11
x-JIS0208 JIS0208, JIS_C6226-1983, iso-ir-87, x0208, JIS_X0208-1983
x-JISAutoDetect (no encoder) JISAutoDetect
x-Johab ms1361, ksc5601_1992, johab, ksc5601-1992
x-MacArabic MacArabic
x-MacCentralEurope MacCentralEurope
x-MacCroatian MacCroatian
x-MacCyrillic MacCyrillic
x-MacDingbat MacDingbat
x-MacGreek MacGreek
x-MacHebrew MacHebrew
x-MacIceland MacIceland
x-MacRoman MacRoman
x-MacRomania MacRomania
x-MacSymbol MacSymbol
x-MacThai MacThai
x-MacTurkish MacTurkish
x-MacUkraine MacUkraine
x-MS932_0213 (none)
x-mswin-936 ms936, ms_936
x-PCK pck
x-SJIS_0213 (none)
x-UTF-16LE-BOM UnicodeLittle
x-windows-50220 cp50220, ms50220
x-windows-50221 cp50221, ms50221
x-windows-874 ms-874, ms874, windows-874
x-windows-949 windows949, ms949, windows-949, ms_949
x-windows-950 ms950, windows-950
x-windows-iso2022jp windows-iso2022jp

Count: 171