U+18847 "𘡇" Tangut Component-072 Unicode Character
Unicode Version 17.0
𘡇
U+18847 "𘡇" Tangut Component-072 is a graphical element derived from the Tangut script, a complex writing system used for the extinct Tangut language of the Western Xia dynasty (1038–1227). This character specifically represents one of 512 radical-like components that were systematically cataloged by Unicode to facilitate the study and digital encoding of Tangut logograms, which consist of over 6,000 intricate characters. As a component, it serves as a building block for more complex Tangut glyphs rather than forming a complete standalone word, helping researchers and linguists analyze the script's structural composition and historical usage.
General Properties
| Code Point | U+18847 |
| Version Added | 9.0 |
| Name | Tangut Component-072 |
| Block | Tangut Components |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
Encodings
| HTML Decimal Encoding | 𘡇 |
| HTML Hex Encoding | 𘡇 |
| UTF-8 Encoding | 0xF0 0x98 0xA1 0x87 |
| UTF-16 Encoding | 0xD822 0xDC47 |
| UTF-32 Encoding | 0x00018847 |
| C/C++/Java Escape | \ud822\udc47 |