U+1894A "𘥊" Tangut Component-331 Unicode Character

Unicode Version 17.0

𘥊

U+1894A "𘥊" Tangut Component-331 is a graphical element used in the construction of Tangut logographs, drawn from the historical Tangut script that was employed for writing the extinct Tangut language of the Western Xia dynasty around 1036 to 1502 CE in present-day northwestern China. This specific component, identified as CBS 331 in the standard component catalog, typically represents a radical or structural part that combines with other components to form complete Tangut characters, which number over six thousand known examples. The character is encoded in the Tangut Components block of Unicode, a section dedicated to these fundamental building blocks to aid in scholarly analysis, digital representation, and research into the script's morphology and historical usage. Its inclusion in the standard supports modern efforts to decipher, digitize, and study Tangut manuscripts and inscriptions.

General Properties

Code Point U+1894A
Version Added 9.0
Name Tangut Component-331
Block Tangut Components
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 𘥊
HTML Hex Encoding 𘥊
UTF-8 Encoding 0xF0 0x98 0xA5 0x8A
UTF-16 Encoding 0xD822 0xDD4A
UTF-32 Encoding 0x0001894A
C/C++/Java Escape \ud822\udd4a

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Tangut
Script Extensions Tangut
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes