U+842C "萬" CJK Unified Ideograph-842C Unicode Character

Unicode Version 17.0

U+842C "萬" CJK Unified Ideograph-842C is a traditional Chinese character whose oracle bone script form depicted a scorpion. It was later borrowed to write the word for the number ten thousand (10,000), which is its standard meaning in Chinese, Japanese, and Korean. Its simplified form is U+4E07 "万". Beyond the numeral, it appears in idioms and names to convey vastness, abundance, or eternity, as in the Chinese acclamation "wànsuì" (萬歲), literally "ten thousand years" and used in the sense of "long live." The Japanese celebratory exclamation "banzai" is the same word, and the character is also used to write the Japanese surname "Yorozu." The Unihan data below gives a total stroke count of 12; a count of 13 also occurs when the grass component at the top is written with four strokes rather than three. The character also appears as a component of other characters, such as 邁 and 厲.

General Properties

Code Point U+842C
Version Added 1.1
Name CJK Unified Ideograph-842C
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding &#33836;
HTML Hex Encoding &#x842C;
UTF-8 Encoding 0xE8 0x90 0xAC
UTF-16 Encoding 0x842C
UTF-32 Encoding 0x0000842C
C/C++/Java Escape \u842c
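
The encodings listed above can be reproduced with a few lines of Python. This is only a minimal sketch using the standard library; the helper name hex_bytes is made up for this illustration, and big-endian byte order is assumed for the UTF-16 and UTF-32 forms.

```python
# Derive the encodings of U+842C from the character itself.
ch = "\u842c"  # 萬

code_point = ord(ch)

# HTML numeric character references (decimal and hexadecimal).
html_decimal = f"&#{code_point};"
html_hex = f"&#x{code_point:04X};"

# Byte-level encodings; big-endian is an assumption made here so that
# no byte order mark is emitted.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values (hypothetical helper)."""
    return " ".join(f"0x{b:02X}" for b in data)


print("Code point:   ", f"U+{code_point:04X}")
print("HTML decimal: ", html_decimal)
print("HTML hex:     ", html_hex)
print("UTF-8:        ", hex_bytes(utf8))
print("UTF-16 (BE):  ", hex_bytes(utf16))
print("UTF-32 (BE):  ", hex_bytes(utf32))
```

Because U+842C lies in the Basic Multilingual Plane, it takes three bytes in UTF-8 and a single 16-bit code unit in UTF-16, with no surrogate pair.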

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type Numeric
Numeric Value 10000
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes
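
Several of the properties above can be checked against Python's bundled Unicode database. The sketch below assumes Python 3.8 or later (for unicodedata.is_normalized); the Unicode version shipped with the interpreter may be older than 17.0, and the unicodedata module exposes only a subset of the properties listed here.

```python
import unicodedata

ch = "\u842c"  # 萬

# Properties available through the standard library's unicodedata module.
props = {
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),        # "Lo" = Other Letter
    "combining_class": unicodedata.combining(ch),         # 0 = Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),          # "L" = Left To Right
    "east_asian_width": unicodedata.east_asian_width(ch), # "W" = Wide
    "numeric_value": unicodedata.numeric(ch),
}

# Quick-check style tests: a lone ideograph is already in every normal form.
normalized = {
    form: unicodedata.is_normalized(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

print("Unicode database version:", unicodedata.unidata_version)
for key, value in props.items():
    print(f"{key}: {value}")
print("normalized:", normalized)

# str.isidentifier() reflects the XID Start / XID Continue properties.
print("valid identifier:", ch.isidentifier())
```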

Unihan Properties

kBigFive B855
kCCCII 214F22
kCNS1986 1-655C
kCNS1992 1-655C
kCangjie TWLB
kCantonese maan6
kCihaiT 1149.402
kCowles 2576
kDaeJaweon 1501.060
kDefinition ten thousand; innumerable
kEACC 214F22
kFanqie 無販
kFenn 576C
kFennIndex 593.03
kFourCornerCode 4442.7
kGB1 4582
kGradeLevel 4
kGSR 0267a
kHangul 만:0E
kHanYu 53247.080
kHanyuPinlu wàn(1335)
kHanyuPinyin 53247.080:wàn
kHKGlyph 2889
kIICore ATJHKMP
kIRGDaeJaweon 1501.060
kIRGHanyuDaZidian 53247.080
kIRGKangXi 1042.330
kIRG_GSource G1-4D72
kIRG_HSource HB1-B855
kIRG_JSource J0-685F
kIRG_KPSource KP0-DAC6
kIRG_KSource K0-583F
kIRG_TSource T1-655C
kIRG_VSource V1-6538
kJapanese バン マン よろず
kJinmeiyoKanji 2010:U+4E07
kKoreanEducationHanja 2007
kJapaneseKun YOROZU OOKII
kJapaneseOn MAN
kJis0 7263
kKangXi 1042.330
kKorean MAN
kLau 2058
kMandarin wàn
kMatthews 7030
kMeyerWempe 1744
kMojiJoho MJ022254 MJ022254:E0101 MJ022257:E0102 MJ022256:E0103 MJ022255:E0104
kMorohashi 31339:E0103
kNelson 3984
kPhonetic 866
kPrimaryNumeric 10000
kRSAdobe_Japan1_6 C+6408+140.3.9
kRSUnicode 114.8 140.9
kSBGY 397.37
kSemanticVariant U+4E07<kLau,kMatthews,kMeyerWempe U+534D<kFenn
kSimplifiedVariant "万" U+4E07 CJK Unified Ideograph-4E07
kSMSZD2003Index 589.05
kSMSZD2003Readings wàn粵maan6
kTaiwanTelegraph 5502
kTang *miæ̀n
kTotalStrokes 12
kUnihanCore2020 HJKMPT
kVietnamese vạn
kXerox 242:161
kXHC1983 1185.041:wàn
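
The kRSUnicode value above packs one or more radical-stroke pairs into a single space-separated string. The following is a rough sketch of how such a value might be split, assuming each field has the form radical number, optional apostrophes, a period, and a residual stroke count; the function name parse_rs_unicode and the pattern are written for this illustration and are not taken from any Unihan tool.

```python
import re
from typing import List, Tuple

# Assumed field shape: digits, optional apostrophes (variant-radical
# markers), a period, then a possibly negative residual stroke count.
RS_FIELD = re.compile(r"^(\d+)('*)\.(-?\d+)$")


def parse_rs_unicode(value: str) -> List[Tuple[int, int]]:
    """Split a kRSUnicode value into (radical, residual_strokes) pairs."""
    pairs = []
    for field in value.split():
        match = RS_FIELD.match(field)
        if match is None:
            raise ValueError(f"unrecognized kRSUnicode field: {field!r}")
        pairs.append((int(match.group(1)), int(match.group(3))))
    return pairs


# The value listed for U+842C.
print(parse_rs_unicode("114.8 140.9"))  # [(114, 8), (140, 9)]
```

For this character the two pairs place it under radical 114 with 8 residual strokes and under radical 140 with 9 residual strokes.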