U+0E4C "์" Thai Character Thanthakhat Unicode Character

Unicode Version 17.0

U+0E4C "์" Thai Character Thanthakhat is a diacritical mark used in the Thai script, known as the thanthakhat or "karan," which functions as a silent or cancellation symbol. When placed above a consonant, it indicates that the consonant should not be pronounced, effectively silencing the letter within a word. This character is commonly applied in Thai to represent the pronunciation of loanwords from languages like Pali, Sanskrit, or English, where certain letters are written historically but are not sounded in modern speech. Its usage is essential for accurate transliteration and proper spelling in the Thai writing system.

General Properties

Code Point U+0E4C
Version Added 1.1
Name Thai Character Thanthakhat
Unicode 1.0 Name Thai Thanthakhat
Block Thai
General Category Nonspacing Mark
Canonical Combining Class Not Reordered
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding ์
HTML Hex Encoding ์
UTF-8 Encoding 0xE0 0xB9 0x8C
UTF-16 Encoding 0x0E4C
UTF-32 Encoding 0x00000E4C
C/C++/Java Escape \u0e4c

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Complex Context Dependent (South East Asian)
Case Ignorable Yes
Script Thai
Script Extensions Thai
Indic Syllabic Category Consonant Killer
Indic Positional Category Top
Indic Conjunct Break Extend
ID Continue Yes
XID Continue Yes
Diacritic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend