U+0F0B "་" Tibetan Mark Intersyllabic Tsheg Unicode Character

Unicode Version 17.0

U+0F0B "་" Tibetan Mark Intersyllabic Tsheg is a fundamental punctuation mark used in the Tibetan script to separate syllables within a word or between words, functioning as a visual break that guides pronunciation and reading flow. It resembles a small dot or comma and appears after each syllable, except for the final syllable in a word, helping to avoid confusion between syllables that could otherwise blend together. Unlike spaces in many other scripts, the tsheg is a mandatory part of Tibetan orthography and is not used to indicate grammatical pauses or sentence boundaries, which are instead marked by the shad character. Its inclusion in Unicode ensures that digital text in Tibetan can be accurately rendered and processed across modern computing systems, preserving the proper structure and readability of the language.

General Properties

Code Point U+0F0B
Version Added 2.0
Name Tibetan Mark Intersyllabic Tsheg
Unicode 1.0 Name Tibetan Tseg
Block Tibetan
General Category Other Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding ་
HTML Hex Encoding ་
UTF-8 Encoding 0xE0 0xBC 0x8B
UTF-16 Encoding 0x0F0B
UTF-32 Encoding 0x00000F0B
C/C++/Java Escape \u0f0b

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Break After
Script Tibetan
Script Extensions Tibetan
Indic Syllabic Category Other
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other