U+18001 "𘀁" Tangut Ideograph-18001 Unicode Character
Unicode Version 17.0
𘀁
U+18001 "𘀁" Tangut Ideograph-18001 is a character of the Tangut script, a logographic writing system used to write the extinct Tangut language of the Western Xia state (1038 to 1227) in what is now northwestern China. It is one of several thousand ideographs in the Tangut repertoire, whose decipherment began in the early 20th century and which was added to Unicode in version 9.0 to support digital preservation and scholarly research. Like other Tangut ideographs, "𘀁" has no descriptive Unicode name: its name is derived from its code point, and scholarly dictionaries and databases identify it by index number rather than by a gloss, because cataloging and translating the full script is still in progress.
General Properties
| Code Point | U+18001 |
| Version Added | 9.0 |
| Name | Tangut Ideograph-18001 |
| Block | Tangut |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
Encodings
| HTML Decimal Encoding | &amp;#98305; |
| HTML Hex Encoding | &amp;#x18001; |
| UTF-8 Encoding | 0xF0 0x98 0x80 0x81 |
| UTF-16 Encoding | 0xD820 0xDC01 |
| UTF-32 Encoding | 0x00018001 |
| C/C++ Escape | \U00018001 |
| Java Escape | \ud820\udc01 |
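The encoding values in this table can be reproduced from the code point alone. The following is a minimal Python sketch, not a reference implementation. The variable names are illustrative, and the big-endian codecs without a byte order mark are an assumption chosen to match how the table prints UTF-16 and UTF-32.

```python
import html

CODE_POINT = 0x18001
ch = chr(CODE_POINT)


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values."""
    return " ".join(f"0x{b:02X}" for b in data)


# Byte-level encodings (big-endian, no BOM).
utf8_bytes = hex_bytes(ch.encode("utf-8"))
utf16_bytes = hex_bytes(ch.encode("utf-16-be"))
utf32_bytes = hex_bytes(ch.encode("utf-32-be"))

# UTF-16 surrogate pair computed by hand: subtract 0x10000, then split
# the remaining 20 bits into a high 10-bit half and a low 10-bit half.
offset = CODE_POINT - 0x10000
high_surrogate = 0xD800 + (offset >> 10)
low_surrogate = 0xDC00 + (offset & 0x3FF)

# HTML numeric character references, decimal and hexadecimal.
html_decimal = f"&#{CODE_POINT};"
html_hex = f"&#x{CODE_POINT:X};"

print("UTF-8: ", utf8_bytes)
print("UTF-16:", f"0x{high_surrogate:04X} 0x{low_surrogate:04X}")
print("UTF-32:", f"0x{CODE_POINT:08X}")
print("HTML:  ", html_decimal, html_hex)
```

Both HTML references decode back to the same character, which `html.unescape` can confirm.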
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Ideographic |
| East Asian Width | Wide |
| Script | Tangut |
| Script Extensions | Tangut |
| Indic Syllabic Category | Other |
| ID Start | Yes |
| XID Start | Yes |
| ID Continue | Yes |
| XID Continue | Yes |
| Alphabetic | Yes |
| Vertical Orientation | Upright |
| Grapheme Base | Yes |
| Grapheme Cluster Break | Other |
| Word Break | Other |
| Sentence Break | OLetter |
| Ideographic | Yes |
| kTGT_RSUnicode | 269.16 |
| kTGT_MergedSrc | L2008-2326 |
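Several of the properties listed on this page can be checked with Python's standard `unicodedata` module. This is a sketch that assumes the interpreter bundles Unicode 9.0 or later data (Python 3.6 and newer should); with older data the character is unassigned and the values differ. The module does not expose every property shown here, for example Line Break and the kTGT_* source fields, so only a subset is covered.

```python
import unicodedata

ch = "\U00018001"

# Properties that unicodedata exposes directly.
general_category = unicodedata.category(ch)        # "Lo" means Other Letter
combining_class = unicodedata.combining(ch)        # 0 means Not Reordered
bidi_class = unicodedata.bidirectional(ch)         # "L" means Left To Right
east_asian_width = unicodedata.east_asian_width(ch)  # "W" means Wide
numeric_value = unicodedata.numeric(ch, None)      # None: no numeric value

# The character has no decomposition, so every normalization form
# should return it unchanged (the "Quick Check: Yes" rows).
unchanged = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# XID Start means the character can begin a Python identifier.
can_start_identifier = ch.isidentifier()

print(general_category, combining_class, bidi_class, east_asian_width)
print(numeric_value, unchanged, can_start_identifier)
```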