U+1123A "𑈺" Khojki Word Separator Unicode Character

Unicode Version 17.0

𑈺

U+1123A "𑈺" Khojki Word Separator is a specialized punctuation mark used within the Khojki script, which was historically employed by the Ismaili Muslim community of South Asia for writing religious and literary texts in languages such as Sindhi, Gujarati, and Kutchi. Unlike modern spaces or standard word dividers, this character functions as a visible vertical bar or short stroke that separates words or phrases in Khojki manuscripts, aiding readability in a script where words were traditionally written without consistent spacing. Its inclusion in Unicode allows for accurate digital representation and preservation of historical documents that rely on this distinct separator, ensuring that the unique typographic conventions of Khojki are maintained in modern electronic text.

General Properties

Code Point U+1123A
Version Added 7.0
Name Khojki Word Separator
Block Khojki
General Category Other Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 𑈺
HTML Hex Encoding 𑈺
UTF-8 Encoding 0xF0 0x91 0x88 0xBA
UTF-16 Encoding 0xD804 0xDE3A
UTF-32 Encoding 0x0001123A
C/C++/Java Escape \ud804\ude3a

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Khojki
Script Extensions Khojki
Indic Syllabic Category Other
Terminal Punctuation Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other