U+0C3C "఼" Telugu Sign Nukta Unicode Character

Unicode Version 17.0

U+0C3C "఼" Telugu Sign Nukta is a nonspacing combining mark used in the Telugu script to modify the sound of a base consonant, typically to represent phonemes borrowed from languages such as Urdu, Persian, or English. It is rendered below the consonant as a small dot or circle, which lets Telugu text represent sounds that do not occur in native Telugu vocabulary. It is used mainly to transliterate foreign words and names while preserving their original pronunciation, and it may also appear in regional or scholarly contexts to mark finer phonetic distinctions.
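
As a minimal sketch of how the mark is applied in text, the Python snippet below appends U+0C3C to a base consonant. The choice of U+0C1C (Telugu Letter Ja) as the base and the helper name `with_nukta` are illustrative assumptions, not part of the character data on this page.

```python
NUKTA = "\u0c3c"  # TELUGU SIGN NUKTA


def with_nukta(base: str) -> str:
    """Return the base followed by the nukta (hypothetical helper)."""
    return base + NUKTA


ja = "\u0c1c"  # TELUGU LETTER JA, chosen here only as an example base
marked = with_nukta(ja)

# The combining mark is stored after its base, so the result is two code points.
print([f"U+{ord(ch):04X}" for ch in marked])  # ['U+0C1C', 'U+0C3C']
```

Whether the pair is drawn as a single glyph with the mark below the consonant depends on the font and shaping engine in use.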

General Properties

Code Point U+0C3C
Version Added 14.0
Name Telugu Sign Nukta
Block Telugu
General Category Nonspacing Mark
Canonical Combining Class Nukta (7)
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding &#3132;
HTML Hex Encoding &#x0C3C;
UTF-8 Encoding 0xE0 0xB0 0xBC
UTF-16 Encoding 0x0C3C
UTF-32 Encoding 0x00000C3C
C/C++/Java Escape \u0c3c
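
The encodings listed above can be reproduced with Python's built-in codecs. This is only a sketch for checking the values; the big-endian codec names are chosen so that no byte order mark is emitted, and the variable names are illustrative.

```python
ch = "\u0c3c"  # TELUGU SIGN NUKTA

utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")  # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")  # big-endian, no byte order mark

print(utf8.hex(" ").upper())   # E0 B0 BC
print(utf16.hex().upper())     # 0C3C
print(utf32.hex().upper())     # 00000C3C

# Numeric character references as used in HTML
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):04X};"
print(html_decimal)            # &#3132;
print(html_hex)                # &#x0C3C;
```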

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Script Telugu
Script Extensions Telugu
Indic Syllabic Category Nukta
Indic Positional Category Bottom
Indic Conjunct Break Extend
ID Continue Yes
XID Continue Yes
Diacritic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
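
The Quick Check values of Yes listed above imply that the character passes through all four normalization forms unchanged. The sketch below checks this with Python's `unicodedata` module, again using U+0C1C as an assumed example base. The property lookups at the end are guarded because they depend on the Unicode version bundled with the interpreter; U+0C3C was added in Unicode 14.0, so older builds will not know the character.

```python
import unicodedata

NUKTA = "\u0c3c"
sample = "\u0c1c" + NUKTA  # example base consonant (an assumption) plus nukta

# Quick Check = Yes for every form: normalizing should return the input as-is.
forms = ("NFC", "NFD", "NFKC", "NFKD")
unchanged = all(unicodedata.normalize(form, sample) == sample for form in forms)
print(unchanged)

# Name and property data exist only if the bundled database is 14.0 or newer.
major = int(unicodedata.unidata_version.split(".")[0])
if major >= 14:
    print(unicodedata.name(NUKTA))
    print(unicodedata.category(NUKTA))
    print(unicodedata.combining(NUKTA))
    print(unicodedata.bidirectional(NUKTA))
else:
    print("Bundled Unicode data predates 14.0; U+0C3C is unassigned here.")
```

On a sufficiently recent interpreter, the printed name, general category, combining class, and bidirectional class should correspond to the General Properties listed on this page.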