U+18800 "𘠀" Tangut Component-001 Unicode Character
Unicode Version 17.0
𘠀
U+18800 "𘠀" Tangut Component-001 is the initial graphical element in a set of 512 radicals or building blocks that form the basis of the Tangut script, an ancient writing system used during the Tangut Empire (11th to 16th centuries) in what is now northwestern China. This component serves as a structural part of thousands of complex Tangut logograms, which were inscribed in historical texts and documents, and its inclusion in Unicode enables digital preservation and scholarly study of this extinct language.
General Properties
| Code Point | U+18800 |
| Version Added | 9.0 |
| Name | Tangut Component-001 |
| Block | Tangut Components |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
Encodings
| HTML Decimal Encoding | 𘠀 |
| HTML Hex Encoding | 𘠀 |
| UTF-8 Encoding | 0xF0 0x98 0xA0 0x80 |
| UTF-16 Encoding | 0xD822 0xDC00 |
| UTF-32 Encoding | 0x00018800 |
| C/C++/Java Escape | \ud822\udc00 |