U+10F4E "𐽎" Sogdian Combining Long Hook Below Unicode Character

Unicode Version 17.0

𐽎

U+10F4E "𐽎" Sogdian Combining Long Hook Below is a diacritical mark used in the Sogdian script, an ancient writing system from Central Asia that was historically employed for the Sogdian language, an Eastern Iranian tongue. This combining character is designed to be placed beneath a base Sogdian letter to modify its phonetic value or to indicate a specific grammatical or orthographic function, such as marking a lengthened or altered consonantal sound. It belongs to the Sogdian block of Unicode, which was added to support the digital encoding of Sogdian texts from the Silk Road era, helping preserve the script for scholarly research and digital typography. Its visual appearance resembles a small, downward hook or curved line attached under the letter, distinguishing it from other combining marks in the same script.

General Properties

Code Point U+10F4E
Version Added 11.0
Name Sogdian Combining Long Hook Below
Block Sogdian
General Category Nonspacing Mark
Canonical Combining Class Below
Bidirectional Class Nonspacing Mark

Encodings

HTML Decimal Encoding 𐽎
HTML Hex Encoding 𐽎
UTF-8 Encoding 0xF0 0x90 0xBD 0x8E
UTF-16 Encoding 0xD803 0xDF4E
UTF-32 Encoding 0x00010F4E
C/C++/Java Escape \ud803\udf4e

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Script Sogdian
Script Extensions Sogdian
Indic Syllabic Category Other
Indic Conjunct Break Extend
ID Continue Yes
XID Continue Yes
Diacritic Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend