Unicode Han Character 'CJK UNIFIED IDEOGRAPH-33001' (U+33001)

previous character next character

𳀁
U+33001 browser display

CJK UNIFIED IDEOGRAPH-33001

image of Unicode Han Character 'CJK UNIFIED IDEOGRAPH-33001' (U+33001)
Raster image of U+33001

CJK UNIFIED IDEOGRAPH-33001

Unicode Data
Name CJK UNIFIED IDEOGRAPH-33001
Block CJK Unified Ideographs Extension J
Category Letter, Other [Lo]
Script Han (Hani)
Combine 0
BIDI Left-to-Right [L]
Version Unicode 17.0 (August 2025)
Unicode Han Data
kIRG_UKSource UK-20812
kRSUnicode 142.13
kTotalStrokes 19
Encodings
HTML Entity (decimal) 𳀁
HTML Entity (hex) 𳀁
How to type in Microsoft Windows Alt +33001
UTF-8 (hex) 0xF0 0xB3 0x80 0x81 (f0b38081)
UTF-8 (binary) 11110000:10110011:10000000:10000001
UTF-16 (hex) 0xD88C 0xDC01 (d88cdc01)
UTF-16 (decimal) 55,436 56,321
UTF-32 (hex) 0x00033001 (33001)
UTF-32 (decimal) 208,897
C/C++/Java source code "\uD88C\uDC01"
Python source code u"\U00033001"
More...
Java Data
string.toUpperCase() 𳀁
string.toLowerCase() 𳀁
Character.UnicodeBlock (none)
Character.charCount() 2
Character.getDirectionality() DIRECTIONALITY_UNDEFINED [-1]
Character.getNumericValue() -1
Character.getType() 0
Character.isDefined() No
Character.isDigit() No
Character.isIdentifierIgnorable() No
Character.isISOControl() No
Character.isJavaIdentifierPart() No
Character.isJavaIdentifierStart() No
Character.isLetter() No
Character.isLetterOrDigit() No
Character.isLowerCase() No
Character.isMirrored() No
Character.isSpaceChar() No
Character.isSupplementaryCodePoint() Yes
Character.isTitleCase() No
Character.isUnicodeIdentifierPart() No
Character.isUnicodeIdentifierStart() No
Character.isUpperCase() No
Character.isValidCodePoint() Yes
Character.isWhitespace() No