U+18501 "𘔁" Tangut Ideograph-# Unicode Character
Unicode Version 17.0
U+18501 "𘔁" Tangut Ideograph-# is a specific coded glyph representing a single logogram from the extinct Tangut script, which was used to write the Tangut language of the Western Xia dynasty in China between the 11th and 16th centuries. This character's graphic form consists of a complex arrangement of strokes, typical of Tangut writing, which includes over 6,000 distinct ideographs, many of them structurally intricate. As part of the Tangut block in Unicode, this particular ideograph is assigned a provisional reading and meaning based on scholarly reconstructions of the Tangut lexicon, often cataloged in reference works like the "Tangut-Chinese Dictionary" or the "Homophones" rhyme book. Its inclusion in Unicode ensures that digital representations of ancient Tangut texts can be preserved, studied, and transmitted across modern computer systems without loss of fidelity.
General Properties
Encodings
| HTML Decimal Encoding |
𘔁 |
| HTML Hex Encoding |
𘔁 |
| UTF-8 Encoding |
0xF0 0x98 0x94 0x81 |
| UTF-16 Encoding |
0xD821 0xDD01 |
| UTF-32 Encoding |
0x00018501 |
| C/C++/Java Escape |
\ud821\udd01 |
Unicode Properties