U+0C36 "శ" Telugu Letter Sha Unicode Character

Unicode Version 17.0

U+0C36 "శ" Telugu Letter Sha is a consonant used in the Telugu script, which is an abugida employed for writing the Telugu language primarily in the Indian states of Andhra Pradesh and Telangana. It represents the voiceless postalveolar fricative sound /ʃ/, similar to the English "sh" in "ship". As part of the Telugu block in Unicode, this character is encoded in the range U+0C00 to U+0C7F and is visually distinguished by its curved, flowing shape typical of the script. Telugu, a Dravidian language with a literary tradition spanning over a millennium, uses "శ" in a wide range of words, including the name "శంకర" (Shankara), and it often appears in both common vocabulary and classical texts.

General Properties

Code Point U+0C36
Version Added 1.1
Name Telugu Letter Sha
Block Telugu
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding శ
HTML Hex Encoding శ
UTF-8 Encoding 0xE0 0xB0 0xB6
UTF-16 Encoding 0x0C36
UTF-32 Encoding 0x00000C36
C/C++/Java Escape \u0c36

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Script Telugu
Script Extensions Telugu
Indic Syllabic Category Consonant
Indic Conjunct Break Consonant
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter