U+0B89 "உ" Tamil Letter U Unicode Character

Unicode Version 17.0

U+0B89 "உ" Tamil Letter U is a vowel in the Tamil script, representing the short vowel sound /u/ as in the English word "put." It is the third letter of the Tamil alphabet and serves as a fundamental building block for forming syllables, often combining with consonants to create consonant-vowel pairs using distinct diacritic marks. This character is used in the Tamil language, primarily spoken in the Indian state of Tamil Nadu, Sri Lanka, and among Tamil diaspora communities worldwide, and it appears in classical literature, modern writing, and digital text across various platforms.

General Properties

Code Point U+0B89
Version Added 1.1
Name Tamil Letter U
Block Tamil
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding உ
HTML Hex Encoding உ
UTF-8 Encoding 0xE0 0xAE 0x89
UTF-16 Encoding 0x0B89
UTF-32 Encoding 0x00000B89
C/C++/Java Escape \u0b89

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Tamil
Script Extensions Tamil
Indic Syllabic Category Vowel Independent
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter