U+E0041 "󠁁" Tag Latin Capital Letter A Unicode Character

Unicode Version 17.0

󠁁

U+E0041 "󠁁" Tag Latin Capital Letter A is a component of the Tags block, specifically designed for use in special plain text encoding applications like language tagging within Unicode, where it functions as a formatting character rather than a standalone letter. It represents the uppercase letter A in a tagging context, and when combined with other tag characters, it helps form tags for identifying languages or other metadata without altering visible text. This character is not intended for general written communication and may not render correctly in all software, as its visibility and behavior depend on the application's support for the Tags block, which is primarily used for interoperability in protocols like those for parsing text tagged with language information.

General Properties

Code Point U+E0041
Version Added 3.1
Name Tag Latin Capital Letter A
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠁁
HTML Hex Encoding 󠁁
UTF-8 Encoding 0xF3 0xA0 0x81 0x81
UTF-16 Encoding 0xDB40 0xDC41
UTF-32 Encoding 0x000E0041
C/C++/Java Escape \udb40\udc41

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes