U+183C5 "𘏅" TANGUT IDEOGRAPH-183C5 Unicode Character

Unicode Version 17.0

𘏅

U+183C5 "𘏅" TANGUT IDEOGRAPH-183C5 is a logogram from the Tangut script, which was used to write the extinct Tangut language. The script was created for the Western Xia (Xixia) state in northwestern China, which lasted from 1038 to 1227, and it remained in use in some communities until the early 16th century. The character belongs to the Tangut block of the Unicode Standard, which encodes over 6,000 Tangut ideographs for digital preservation and academic study. Like other Tangut characters, "𘏅" typically represents a single syllable or morpheme and is built from recurring stroke components. Unlike Chinese characters, these components do not reliably signal meaning or sound, so the reading and meaning of an individual character are established from Tangut dictionaries and philological research, not from its shape. Because the character has a standard code point, Tangut texts can be transcribed and exchanged without private-use encodings, although displaying it still requires a font that covers the Tangut block.

General Properties

Code Point	U+183C5
Version Added	9.0
Name	TANGUT IDEOGRAPH-183C5
Block	Tangut
General Category	Other Letter
Canonical Combining Class	Not Reordered
Bidirectional Class	Left To Right
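
The general properties above can be checked against the Unicode Character Database bundled with a language runtime. The following is a minimal sketch using Python's standard unicodedata module; the variable names are illustrative, and the results depend on the Unicode version shipped with the interpreter (Tangut requires Unicode 9.0 or later data).

```python
import unicodedata

# The character under discussion, written with a 32-bit escape.
ch = "\U000183C5"

# Look up the properties listed in the General Properties table.
props = {
    "code_point": f"U+{ord(ch):04X}",
    "general_category": unicodedata.category(ch),   # "Lo" means Other Letter
    "combining_class": unicodedata.combining(ch),   # 0 means Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),    # "L" means Left To Right
}

for key, value in props.items():
    print(f"{key}: {value}")
```

The short codes returned by the module ("Lo", 0, "L") are the abbreviated forms of the long property values shown in the table.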

Encodings

HTML Decimal Encoding	&#99269;
HTML Hex Encoding	&#x183C5;
UTF-8 Encoding	0xF0 0x98 0x8F 0x85
UTF-16 Encoding	0xD820 0xDFC5
UTF-32 Encoding	0x000183C5
Java Escape	\ud820\udfc5
C/C++ Escape	\U000183C5
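
The byte sequences in this table can be reproduced from the code point. The sketch below, in Python, uses the built-in codecs for UTF-8, UTF-16 and UTF-32 (big-endian, no byte order mark) and also derives the UTF-16 surrogate pair by hand; all variable names are illustrative.

```python
cp = 0x183C5
ch = chr(cp)

# Encoded forms, big-endian and without a byte order mark.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# Manual surrogate pair: subtract 0x10000, then split the
# remaining 20 bits into a high and a low 10-bit half.
offset = cp - 0x10000
high = 0xD800 + (offset >> 10)
low = 0xDC00 + (offset & 0x3FF)

# Numeric character references for HTML.
html_decimal = f"&#{cp};"
html_hex = f"&#x{cp:X};"

print(" ".join(f"0x{b:02X}" for b in utf8))
print(f"0x{high:04X} 0x{low:04X}")
print(f"0x{cp:08X}")
print(html_decimal, html_hex)
```

Because U+183C5 lies outside the Basic Multilingual Plane, UTF-16 needs two code units and UTF-8 needs four bytes.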

Unicode Properties

NFC Quick Check	Yes
NFD Quick Check	Yes
NFKC Quick Check	Yes
NFKD Quick Check	Yes
Numeric Type	None
Numeric Value	NaN
Line Break	Ideographic
East Asian Width	Wide
Script	Tangut
Script Extensions	Tangut
Indic Syllabic Category	Other
ID Start	Yes
XID Start	Yes
ID Continue	Yes
XID Continue	Yes
Alphabetic	Yes
Vertical Orientation	Upright
Grapheme Base	Yes
Grapheme Cluster Break	Other
Word Break	Other
Sentence Break	OLetter
Ideographic	Yes
kTGT_RSUnicode	436.11
kTGT_MergedSrc	S1968-2650
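
Several of the properties above have direct consequences in code: the quick-check values mean normalization leaves the character unchanged, XID Start means it can begin an identifier, and Numeric Type None means it has no numeric value. A minimal Python sketch, assuming an interpreter whose Unicode data includes the Tangut block (variable names are illustrative):

```python
import unicodedata

ch = "\U000183C5"

# Quick Check "Yes" for all four forms: normalizing is a no-op.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = {form: unicodedata.normalize(form, ch) == ch for form in forms}

# XID Start: Python identifiers follow this property.
is_identifier = ch.isidentifier()

# East Asian Width, abbreviated ("W" means Wide).
width = unicodedata.east_asian_width(ch)

# Numeric Type None: no numeric value, so the default is returned.
numeric = unicodedata.numeric(ch, None)

print(unchanged, is_identifier, width, numeric)
```

The kTGT_RSUnicode and kTGT_MergedSrc fields are not exposed by unicodedata; they would have to be read from the Unicode data files directly.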