U+075C "ݜ" Arabic Letter Seen with Four Dots Above Unicode Character

Unicode Version 17.0

ݜ

U+075C "ݜ" Arabic Letter Seen with Four Dots Above is a modified form of the standard Arabic letter seen (س). It is used most notably in Shina, a language spoken in parts of Pakistan and India, and also in Burushaski. The letter represents a retroflex or palatal fricative that does not occur in classical Arabic, and its four dots above distinguish it from related letters such as the three-dot forms used in other languages. Encoding it in the Unicode standard allows text in these languages to be represented and processed digitally, so that writers can transcribe their full phonetic inventory in electronic documents and communications.

General Properties

Code Point U+075C
Version Added 4.1
Name Arabic Letter Seen with Four Dots Above
Block Arabic Supplement
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Arabic Letter
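
The general properties listed above can be checked programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the variable names `ch` and `props` are illustrative only, and the values returned depend on the Unicode version bundled with the Python interpreter in use.

```python
import unicodedata

# The character under discussion, written with its escape sequence.
ch = "\u075c"

# Collect a few general properties from the interpreter's Unicode database.
props = {
    "code_point": f"U+{ord(ch):04X}",
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),   # short alias, e.g. "Lo" for Other Letter
    "combining_class": unicodedata.combining(ch),   # 0 means Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),    # short alias, e.g. "AL" for Arabic Letter
}

for key, value in props.items():
    print(f"{key}: {value}")
```

Note that `unicodedata` reports the short property aliases (`Lo`, `AL`) rather than the long names used in the listing.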

Encodings

HTML Decimal Encoding &#1884;
HTML Hex Encoding &#x075C;
UTF-8 Encoding 0xDD 0x9C
UTF-16 Encoding 0x075C
UTF-32 Encoding 0x0000075C
C/C++/Java Escape \u075c
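
As a rough illustration of how these encoded forms relate to the code point, the sketch below derives each one in Python. Big-endian codecs are an assumption made here so that no byte order mark is emitted; the variable names are illustrative.

```python
import html

ch = "\u075c"  # same escape syntax as the C/C++/Java form

# Byte-level encodings (big-endian variants chosen to avoid a BOM).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# HTML numeric character references built from the code point.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):04X};"

print(utf8.hex(" "))    # UTF-8 bytes
print(utf16.hex(" "))   # one UTF-16 code unit
print(utf32.hex(" "))   # one UTF-32 code unit
print(html_decimal, html_hex)

# Both references decode back to the original character.
assert html.unescape(html_decimal) == ch
assert html.unescape(html_hex) == ch
```

Because U+075C lies in the Basic Multilingual Plane, it needs a single UTF-16 code unit and no surrogate pair.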

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Dual Joining
Joining Group Seen
Line Break Alphabetic
Script Arabic
Script Extensions Arabic
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter
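
Several of the properties above have observable consequences in code. The following sketch, again assuming Python's standard `unicodedata` module and illustrative variable names, checks that the character is unchanged by all four normalization forms, has no numeric value, and is accepted in identifiers. Python's identifier rules are based on XID_Start and XID_Continue, so `str.isidentifier` serves here as an approximate proxy for those properties.

```python
import unicodedata

ch = "\u075c"

# A "Yes" quick check implies the character is already in that normalization form,
# so normalizing it should return it unchanged.
forms = ("NFC", "NFD", "NFKC", "NFKD")
normalized = {form: unicodedata.normalize(form, ch) == ch for form in forms}

# An empty decomposition mapping is consistent with the quick-check values.
no_decomposition = unicodedata.decomposition(ch) == ""

# Numeric Type None: no numeric value, so the supplied default is returned.
numeric_value = unicodedata.numeric(ch, None)

# Identifier and alphabetic behaviour.
starts_identifier = ch.isidentifier()             # usable as the first character
continues_identifier = ("a" + ch).isidentifier()  # usable in a later position
is_alphabetic = ch.isalpha()

print(normalized, no_decomposition, numeric_value)
print(starts_identifier, continues_identifier, is_alphabetic)
```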