U+10F42 "𐽂" Sogdian Letter Taw Unicode Character

Unicode Version 17.0

𐽂

U+10F42 "𐽂" Sogdian Letter Taw is a script symbol from the Sogdian alphabet, which was used to write the Sogdian language, an ancient Eastern Iranian language spoken in Central Asia during the first millennium CE. This particular character represents the sound equivalent to the Latin letter 't' and is part of the Sogdian block in the Unicode Standard, encoded to support the preservation and digital representation of historical texts from the Sogdian civilization, which played a crucial role in trade and cultural exchange along the Silk Road.

General Properties

Code Point U+10F42
Version Added 11.0
Name Sogdian Letter Taw
Block Sogdian
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Arabic Letter

Encodings

HTML Decimal Encoding 𐽂
HTML Hex Encoding 𐽂
UTF-8 Encoding 0xF0 0x90 0xBD 0x82
UTF-16 Encoding 0xD803 0xDF42
UTF-32 Encoding 0x00010F42
C/C++/Java Escape \ud803\udf42

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Dual Joining
Line Break Alphabetic
Script Sogdian
Script Extensions Sogdian
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break OLetter