U+0A89 "ઉ" Gujarati Letter U Unicode Character

Unicode Version 17.0

U+0A89 "ઉ" Gujarati Letter U is a character in the Gujarati script used to represent the short vowel sound /u/ as in the English word "put". It is the fifth vowel in the Gujarati abugida and appears as an independent letter when it begins a syllable. When combined with a consonant, this vowel is typically represented by a dependent vowel sign rather than the independent form. The character is encoded in the Unicode Standard to enable digital text processing and representation of the Gujarati language, which is primarily spoken in the Indian state of Gujarat.

General Properties

Code Point U+0A89
Version Added 1.1
Name Gujarati Letter U
Block Gujarati
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding ઉ
HTML Hex Encoding ઉ
UTF-8 Encoding 0xE0 0xAA 0x89
UTF-16 Encoding 0x0A89
UTF-32 Encoding 0x00000A89
C/C++/Java Escape \u0a89

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Gujarati
Script Extensions Gujarati
Indic Syllabic Category Vowel Independent
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter