U+0539 "Թ" Armenian Capital Letter To Unicode Character

Unicode Version 17.0

Թ

U+0539 "Թ" Armenian Capital Letter To is the capital form of the 9th letter of the Armenian alphabet. It represents the aspirated voiceless alveolar plosive /tʰ/, similar to the "t" in the English word "top". The letter is used in both the Eastern and Western varieties of Armenian and traces its origins to the classical Armenian script created by Mesrop Mashtots in the 5th century. The character belongs to the Armenian block in Unicode, which supports digital representation and processing of Armenian text. As a capital letter, it appears mainly at the beginning of words and in all-caps writing; its lowercase counterpart is U+0569 "թ".

General Properties

Code Point U+0539
Version Added 1.1
Name Armenian Capital Letter To
Block Armenian
General Category Uppercase Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
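
The general properties listed above can be checked programmatically. The following is a minimal sketch using Python's standard `unicodedata` module; the helper name `general_properties` is hypothetical, and the values returned depend on the Unicode version bundled with the Python build in use.

```python
import unicodedata

ch = "\u0539"  # Armenian Capital Letter To


def general_properties(c):
    """Collect a few core Unicode properties for a single character.

    Hypothetical helper for illustration; it only wraps unicodedata calls.
    """
    return {
        "code_point": f"U+{ord(c):04X}",
        "name": unicodedata.name(c),
        "general_category": unicodedata.category(c),   # "Lu" = Uppercase Letter
        "combining_class": unicodedata.combining(c),   # 0 = Not Reordered
        "bidi_class": unicodedata.bidirectional(c),    # "L" = Left To Right
    }


props = general_properties(ch)
for key, value in props.items():
    print(f"{key}: {value}")
```

The module reports properties by their short aliases ("Lu", "L", 0) rather than the long names used in the table.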

Encodings

HTML Decimal Encoding &#1337;
HTML Hex Encoding &#x0539;
UTF-8 Encoding 0xD4 0xB9
UTF-16 Encoding 0x0539
UTF-32 Encoding 0x00000539
C/C++/Java Escape \u0539
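
As a rough illustration of how these encodings are derived, the sketch below encodes the character with Python's built-in codecs and builds the HTML numeric character references from the code point. The helper `hex_bytes` is a hypothetical name, and big-endian byte order without a byte order mark is an assumption made so the output lines up with the values in the table.

```python
import html

ch = "\u0539"  # Armenian Capital Letter To


def hex_bytes(data):
    """Format a bytes object as space-separated 0xNN values (illustrative helper)."""
    return " ".join(f"0x{b:02X}" for b in data)


# Big-endian codecs are used so that no byte order mark is emitted.
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# HTML numeric character references built from the code point.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):04X};"

print("UTF-8: ", hex_bytes(utf8))
print("UTF-16:", hex_bytes(utf16))
print("UTF-32:", hex_bytes(utf32))
print("HTML decimal:", html_decimal)
print("HTML hex:    ", html_hex)

# Both references decode back to the original character.
assert html.unescape(html_decimal) == ch
assert html.unescape(html_hex) == ch
```

UTF-8 needs two bytes here because the code point lies in the range U+0080 to U+07FF, while UTF-16 fits it in a single 16-bit code unit.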

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Uppercase Yes
Simple Lowercase Code Point "թ" U+0569 Armenian Small Letter To
Lowercase Code Point "թ" U+0569 Armenian Small Letter To
Simple Case Folding "թ" U+0569 Armenian Small Letter To
Case Folding "թ" U+0569 Armenian Small Letter To
Cased Yes
Changes When Casefolded Yes
Changes When Casemapped Yes
Changes When Lowercased Yes
Changes When NFKC Casefolded Yes
NFKC Casefold "թ" U+0569 Armenian Small Letter To
NFKC Simple Casefold "թ" U+0569 Armenian Small Letter To
Script Armenian
Script Extensions Armenian
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break Upper
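
Several of the properties above (case mapping, case folding, normalization stability, identifier status, numeric type) can be spot-checked with Python's standard library. This is a minimal sketch rather than a full property lookup: the variable names are illustrative, and properties such as Line Break, Word Break, or Vertical Orientation are not exposed by `unicodedata` and are not covered here.

```python
import unicodedata

upper = "\u0539"  # Armenian Capital Letter To
lower = "\u0569"  # Armenian Small Letter To

# Case mapping and case folding both yield the small letter.
lowered = upper.lower()
folded = upper.casefold()

# The Quick Check values of "Yes" imply the character is unchanged
# by every normalization form.
stable_under_normalization = all(
    unicodedata.normalize(form, upper) == upper
    for form in ("NFC", "NFD", "NFKC", "NFKD")
)

# Python identifiers follow XID_Start / XID_Continue, so the letter
# is a valid one-character identifier.
is_identifier = upper.isidentifier()

# Numeric Type None: no numeric value is defined, so the default is returned.
numeric_value = unicodedata.numeric(upper, None)

print("lowercase:", lowered, f"U+{ord(lowered):04X}")
print("casefold: ", folded, f"U+{ord(folded):04X}")
print("stable under normalization:", stable_under_normalization)
print("valid identifier:", is_identifier)
print("numeric value:", numeric_value)
```

Because the simple and full mappings are the same single character, `lower()` and `casefold()` agree for this letter.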