U+18781 "𘞁" Tangut Ideograph-# Unicode Character
Unicode Version 17.0
𘞁
U+18781 "𘞁" Tangut Ideograph-# is a specific glyph from the ancient Tangut script, a logographic writing system used to record the extinct Tangut language of the Tangut Empire in northwestern China from approximately 1036 to the 16th century. This particular ideograph represents a single character within the vast Tangut dictionary, which contains over 6,000 known characters, and its precise meaning and pronunciation, like many Tangut characters, are determined through scholarly reconstruction based on bilingual texts and linguistic analysis. The inclusion of this character in the Unicode Standard allows for its digital representation and study, preserving a fragment of a complex and historically significant writing system.
General Properties
| Code Point | U+18781 |
| Version Added | 9.0 |
| Name | Tangut Ideograph-# |
| Block | Tangut |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
Encodings
| HTML Decimal Encoding | 𘞁 |
| HTML Hex Encoding | 𘞁 |
| UTF-8 Encoding | 0xF0 0x98 0x9E 0x81 |
| UTF-16 Encoding | 0xD821 0xDF81 |
| UTF-32 Encoding | 0x00018781 |
| C/C++/Java Escape | \ud821\udf81 |
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Ideographic |
| East Asian Width | Wide |
| Script | Tangut |
| Script Extensions | Tangut |
| Indic Syllabic Category | Other |
| ID Start | Yes |
| XID Start | Yes |
| ID Continue | Yes |
| XID Continue | Yes |
| Alphabetic | Yes |
| Vertical Orientation | Upright |
| Grapheme Base | Yes |
| Grapheme Cluster Break | Other |
| Word Break | Other |
| Sentence Break | OLetter |
| Ideographic | Yes |
| kTGT_RSUnicode | 698.14 |
| kTGT_MergedSrc | L2008-1241 |