Unicode Han Character 'CJK UNIFIED IDEOGRAPH-33040' (U+33040)

previous character next character

𳁀
U+33040 browser display

CJK UNIFIED IDEOGRAPH-33040

image of Unicode Han Character 'CJK UNIFIED IDEOGRAPH-33040' (U+33040)
Raster image of U+33040

CJK UNIFIED IDEOGRAPH-33040

Unicode Data
Name CJK UNIFIED IDEOGRAPH-33040
Block CJK Unified Ideographs Extension J
Category Letter, Other [Lo]
Script Han (Hani)
Combine 0
BIDI Left-to-Right [L]
Version Unicode 17.0 (August 2025)
Unicode Han Data
kIRG_GSource GKJ-00753
kRSUnicode 145.9
kTotalStrokes 15
Encodings
HTML Entity (decimal) 𳁀
HTML Entity (hex) 𳁀
How to type in Microsoft Windows Alt +33040
UTF-8 (hex) 0xF0 0xB3 0x81 0x80 (f0b38180)
UTF-8 (binary) 11110000:10110011:10000001:10000000
UTF-16 (hex) 0xD88C 0xDC40 (d88cdc40)
UTF-16 (decimal) 55,436 56,384
UTF-32 (hex) 0x00033040 (33040)
UTF-32 (decimal) 208,960
C/C++/Java source code "\uD88C\uDC40"
Python source code u"\U00033040"
More...
Java Data
string.toUpperCase() 𳁀
string.toLowerCase() 𳁀
Character.UnicodeBlock (none)
Character.charCount() 2
Character.getDirectionality() DIRECTIONALITY_UNDEFINED [-1]
Character.getNumericValue() -1
Character.getType() 0
Character.isDefined() No
Character.isDigit() No
Character.isIdentifierIgnorable() No
Character.isISOControl() No
Character.isJavaIdentifierPart() No
Character.isJavaIdentifierStart() No
Character.isLetter() No
Character.isLetterOrDigit() No
Character.isLowerCase() No
Character.isMirrored() No
Character.isSpaceChar() No
Character.isSupplementaryCodePoint() Yes
Character.isTitleCase() No
Character.isUnicodeIdentifierPart() No
Character.isUnicodeIdentifierStart() No
Character.isUpperCase() No
Character.isValidCodePoint() Yes
Character.isWhitespace() No