U+18840 "𘡀" Tangut Component-065 Unicode Character
Unicode Version 17.0
𘡀
U+18840 "𘡀" Tangut Component-065 is a structural element from the Tangut script, a complex logographic writing system used for the extinct Tangut language of the Western Xia dynasty (1038-1227). It represents a specific graphical subunit that, when combined with other components, forms full Tangut characters, which are known for their large number of strokes and intricate designs, similar in function to radicals in Chinese but unique in form. This component was encoded in Unicode as part of the Tangut Supplement block to support the study and digital preservation of the script, aiding linguists and historians in analyzing the textual remains of this lost civilization.
General Properties
| Code Point | U+18840 |
| Version Added | 9.0 |
| Name | Tangut Component-065 |
| Block | Tangut Components |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
Encodings
| HTML Decimal Encoding | 𘡀 |
| HTML Hex Encoding | 𘡀 |
| UTF-8 Encoding | 0xF0 0x98 0xA1 0x80 |
| UTF-16 Encoding | 0xD822 0xDC40 |
| UTF-32 Encoding | 0x00018840 |
| C/C++/Java Escape | \ud822\udc40 |