U+08C1 "ࣁ" Arabic Letter Tcheh with Small V Unicode Character

Unicode Version 17.0

U+08C1 "ࣁ" Arabic Letter Tcheh with Small V is a specialized grapheme used in the extended Arabic script, primarily for writing languages such as Sindhi or other South Asian languages where a distinct articulation of the "tcheh" sound (similar to the English "ch" in "church") needs to be modified or clarified. It consists of the base Arabic letter tcheh (U+0686) combined with a small diacritical "v" shape placed above or below it, which indicates a specific phonetic variation, often denoting a retroflex or palatalized pronunciation. This character is part of the Arabic Extended-A block, intended to support linguistic precision in orthographies that require additional marks beyond standard Arabic letters. Its inclusion in Unicode helps ensure accurate digital representation and text processing for lesser-documented languages and dialects that rely on such nuanced script adaptations.

General Properties

Code Point U+08C1
Version Added 13.0
Name Arabic Letter Tcheh with Small V
Block Arabic Extended-A
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Arabic Letter

Encodings

HTML Decimal Encoding ࣁ
HTML Hex Encoding ࣁ
UTF-8 Encoding 0xE0 0xA3 0x81
UTF-16 Encoding 0x08C1
UTF-32 Encoding 0x000008C1
C/C++/Java Escape \u08c1

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Dual Joining
Joining Group Hah
Line Break Alphabetic
Script Arabic
Script Extensions Arabic
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter