U+110E5 "𑃥" Sora Sompeng Letter Uh Unicode Character

Unicode Version 17.0

𑃥

U+110E5 "𑃥" Sora Sompeng Letter Uh is a character used in the Sora Sompeng script, an abugida invented in 1936 for writing the Sora language spoken by the Sora people in eastern India. This specific letter represents the vowel sound "uh," functioning as an independent vowel in the script's writing system. Its inclusion in the Unicode Standard, specifically in the Sora Sompeng block, ensures that digital text can accurately represent this sound for linguistic preservation and communication purposes.

General Properties

Code Point U+110E5
Version Added 6.1
Name Sora Sompeng Letter Uh
Block Sora Sompeng
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 𑃥
HTML Hex Encoding 𑃥
UTF-8 Encoding 0xF0 0x91 0x83 0xA5
UTF-16 Encoding 0xD804 0xDCE5
UTF-32 Encoding 0x000110E5
C/C++/Java Escape \ud804\udce5

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Sora Sompeng
Script Extensions Sora Sompeng
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter