U+18871 "𘡱" Tangut Component-114 Unicode Character
Unicode Version 17.0
𘡱
U+18871 "𘡱" Tangut Component-114 is a component of the Tangut script, a logographic writing system used for the extinct Tangut language of the Western Xia dynasty (1038–1227) in northwestern China. Component 114 is a radical-like building block that occurs as part of larger, more intricate Tangut characters. It is encoded in the Unicode Tangut Components block, which supports the computer representation and analysis of the script's compositional structure. Components such as U+18871 matter to lexicographers, linguists, and digital text researchers who index, analyze, and correctly display the thousands of known Tangut characters.
General Properties
| Code Point | U+18871 |
| Version Added | 9.0 |
| Name | Tangut Component-114 |
| Block | Tangut Components |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
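The properties in the table above can be cross-checked against the Unicode database bundled with a programming language. The following is a minimal sketch in Python using the standard `unicodedata` module; it assumes a Python build whose Unicode data is version 9.0 or later (older builds will not know this character). The helper names `describe` and `component_number` are hypothetical, and the numbering rule in `component_number` is an assumption inferred from this character's name and position in the block, not something stated in the table.

```python
import unicodedata

# First code point of the Tangut Components block.
TANGUT_COMPONENTS_START = 0x18800


def describe(cp):
    """Return the basic Unicode properties of a code point as a dict."""
    ch = chr(cp)
    return {
        "code_point": f"U+{cp:04X}",
        # Fall back to a placeholder if this Python build predates the character.
        "name": unicodedata.name(ch, "<unknown to this Python build>"),
        "general_category": unicodedata.category(ch),   # "Lo" = Other Letter
        "combining_class": unicodedata.combining(ch),   # 0 = Not Reordered
        "bidi_class": unicodedata.bidirectional(ch),    # "L" = Left To Right
    }


def component_number(cp):
    """Derive the component number from the code point.

    Assumption: components are numbered sequentially, starting at 1,
    from the first code point of the block.
    """
    return cp - TANGUT_COMPONENTS_START + 1


if __name__ == "__main__":
    for key, value in describe(0x18871).items():
        print(f"{key}: {value}")
    print("component number:", component_number(0x18871))
```

The abbreviations returned by `unicodedata` ("Lo", "L", and the integer 0) are the short forms of the long property values shown in the table.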
Encodings
| HTML Decimal Encoding | &#100465; |
| HTML Hex Encoding | &#x18871; |
| UTF-8 Encoding | 0xF0 0x98 0xA1 0xB1 |
| UTF-16 Encoding | 0xD822 0xDC71 |
| UTF-32 Encoding | 0x00018871 |
| C/C++/Java Escape | \ud822\udc71 (Java); \U00018871 (C/C++, which do not accept surrogate escapes) |
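The encoded forms in the table above all follow mechanically from the code point value. The sketch below, in Python, shows one way to derive them; the function names `utf16_surrogates` and `encodings` and the dictionary keys are hypothetical names chosen for illustration. Because U+18871 lies above U+FFFF, UTF-16 needs a surrogate pair, which is computed explicitly here rather than taken from a codec so the arithmetic is visible.

```python
def utf16_surrogates(cp):
    """Split a supplementary-plane code point into a UTF-16 surrogate pair."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF use surrogate pairs")
    offset = cp - 0x10000                # 20-bit value to distribute
    high = 0xD800 + (offset >> 10)       # top 10 bits
    low = 0xDC00 + (offset & 0x3FF)      # bottom 10 bits
    return high, low


def encodings(cp):
    """Return the common textual encodings of a supplementary code point."""
    high, low = utf16_surrogates(cp)
    utf8_bytes = chr(cp).encode("utf-8")
    return {
        "html_decimal": f"&#{cp};",
        "html_hex": f"&#x{cp:X};",
        "utf8": " ".join(f"0x{b:02X}" for b in utf8_bytes),
        "utf16": f"0x{high:04X} 0x{low:04X}",
        "utf32": f"0x{cp:08X}",
        # Surrogate-pair escape, in the lowercase form used in the table.
        "surrogate_escape": f"\\u{high:04x}\\u{low:04x}",
    }


if __name__ == "__main__":
    for key, value in encodings(0x18871).items():
        print(f"{key}: {value}")
```

For U+18871 the offset is 0x8871, so the high surrogate is 0xD800 + 0x22 and the low surrogate is 0xDC00 + 0x071, matching the UTF-16 row.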