U+11009 "𑀉" Brahmi Letter U Unicode Character

Unicode Version 17.0

𑀉

U+11009 "𑀉" Brahmi Letter U is the independent vowel letter for short "u" in Brahmi, one of the earliest known writing systems of South Asia, attested from the 3rd century BCE and used to write Prakrit and, later, Sanskrit. The character belongs to the Brahmi Unicode block, which was added to the standard in Unicode 6.0 to support the digital preservation and study of historical inscriptions and manuscripts. Brahmi is an abugida: a vowel that follows a consonant is written as a dependent sign attached to the consonant letter, while an independent vowel letter such as this one is used where the vowel stands on its own, for example at the start of a word. In early inscriptions the glyph is a simple angular form, roughly like a Latin "L". Its inclusion in Unicode allows accurate representation in modern text processing, aiding researchers in epigraphy and historical linguistics.

General Properties

Code Point U+11009
Version Added 6.0
Name Brahmi Letter U
Block Brahmi
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
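
The general properties listed above can be cross-checked against the Unicode Character Database that ships with a programming language. The following is a minimal sketch in Python using the standard `unicodedata` module; the dictionary name `info` is only illustrative, and the values returned depend on the Unicode version bundled with the interpreter (any Python 3 release includes Unicode 6.0 data or later).

```python
import unicodedata

ch = "\U00011009"  # BRAHMI LETTER U

# Collect the general properties exposed by the standard library.
info = {
    "code_point": f"U+{ord(ch):04X}",          # formatted code point
    "name": unicodedata.name(ch),               # character name
    "category": unicodedata.category(ch),       # "Lo" = Other Letter
    "combining": unicodedata.combining(ch),     # 0 = Not Reordered
    "bidi": unicodedata.bidirectional(ch),      # "L" = Left To Right
}

for key, value in info.items():
    print(f"{key}: {value}")
```

The abbreviations `Lo` and `L` are the short aliases of the General Category and Bidirectional Class values shown in the table.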

Encodings

HTML Decimal Encoding &#69641;
HTML Hex Encoding &#x11009;
UTF-8 Encoding 0xF0 0x91 0x80 0x89
UTF-16 Encoding 0xD804 0xDC09
UTF-32 Encoding 0x00011009
C/C++ Escape \U00011009
Java Escape \ud804\udc09
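
The encoded forms in this section can be reproduced from the code point alone. The sketch below, in Python, encodes the character with the standard codecs and also derives the UTF-16 surrogate pair by hand; the helper names `hex_bytes` and `hex_units` are hypothetical and exist only for this illustration.

```python
ch = "\U00011009"  # BRAHMI LETTER U

def hex_bytes(data: bytes) -> str:
    """Format a byte string as space-separated 0xNN values."""
    return " ".join(f"0x{b:02X}" for b in data)

def hex_units(data: bytes, width: int) -> str:
    """Format big-endian data as code units of `width` bytes each."""
    units = [int.from_bytes(data[i:i + width], "big")
             for i in range(0, len(data), width)]
    return " ".join(f"0x{u:0{width * 2}X}" for u in units)

utf8 = hex_bytes(ch.encode("utf-8"))
utf16 = hex_units(ch.encode("utf-16-be"), 2)
utf32 = hex_units(ch.encode("utf-32-be"), 4)

# Numeric character references for HTML.
html_dec = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

# Manual surrogate-pair arithmetic for a supplementary-plane code point:
# subtract 0x10000, then split the 20-bit remainder into two 10-bit halves.
offset = ord(ch) - 0x10000
high = 0xD800 + (offset >> 10)
low = 0xDC00 + (offset & 0x3FF)

print(utf8)      # 0xF0 0x91 0x80 0x89
print(utf16)     # 0xD804 0xDC09
print(utf32)     # 0x00011009
print(html_dec)  # &#69641;
print(html_hex)  # &#x11009;
print(f"0x{high:04X} 0x{low:04X}")  # 0xD804 0xDC09
```

Because the code point lies above U+FFFF, UTF-8 needs four bytes and UTF-16 needs a surrogate pair, which is why the Java escape uses two `\u` sequences.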

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Aksara
Script Brahmi
Script Extensions Brahmi
Indic Syllabic Category Vowel Independent
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter
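
Some of the properties above can be observed directly from a programming language. The following Python sketch checks the normalization, identifier, alphabetic, and numeric properties with the standard library; it assumes Python 3.8 or later for `unicodedata.is_normalized`, and it does not cover segmentation properties such as Line Break or Word Break, which the standard library does not expose.

```python
import unicodedata

ch = "\U00011009"  # BRAHMI LETTER U

# A character whose quick-check value is Yes for a form is already
# normalized in that form when it appears on its own.
quick = {form: unicodedata.is_normalized(form, ch)
         for form in ("NFC", "NFD", "NFKC", "NFKD")}

# str.isidentifier() follows the XID_Start / XID_Continue rules,
# so a single XID_Start character is a valid identifier.
is_identifier = ch.isidentifier()

# str.isalpha() is True for characters in the letter categories.
is_alpha = ch.isalpha()

# Numeric Type None: no numeric value, so the default is returned.
numeric = unicodedata.numeric(ch, None)

print(quick)
print(is_identifier, is_alpha, numeric)
```

Note that `str.isalpha()` is based on the General Category rather than the Alphabetic property itself; for this character the two agree.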