U+11203 "𑈃" Khojki Letter U Unicode Character

Unicode Version 17.0

𑈃

U+11203 "𑈃" Khojki Letter U is a specific glyph used in the Khojki script, a writing system historically employed by the Ismaili Muslim community of South Asia to record religious and literary texts, particularly in Sindhi and Gujarati languages. This character represents the vowel sound "u" and is part of a block that was added to the Unicode Standard in 2012 to preserve and digitally encode the script's distinct aesthetic and phonetic inventory. In practice, the Khojki Letter U appears as a diacritic-like modification to consonant letters, such as the base form of "𑈀" Khojki Letter Ka, to indicate the short or long "u" vowel depending on the context of its use.

General Properties

Code Point U+11203
Version Added 7.0
Name Khojki Letter U
Block Khojki
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 𑈃
HTML Hex Encoding 𑈃
UTF-8 Encoding 0xF0 0x91 0x88 0x83
UTF-16 Encoding 0xD804 0xDE03
UTF-32 Encoding 0x00011203
C/C++/Java Escape \ud804\ude03

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Khojki
Script Extensions Khojki
Indic Syllabic Category Vowel Independent
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter