U+E004F "󠁏" Tag Latin Capital Letter O Unicode Character

Unicode Version 17.0

󠁏

U+E004F "󠁏" Tag Latin Capital Letter O is a special-purpose format character in the Tags block. Tag characters mirror the printable ASCII range at an offset of 0xE0000, so this one corresponds to the Latin capital letter O (U+004F). The block was added in Unicode 3.1 to embed language tags in plain text: a sequence of tag characters following U+E0001 Language Tag spelled out a language identifier. That use is now deprecated. Today tag characters appear mainly in emoji tag sequences, such as the subdivision flags for England, Scotland, and Wales, where they follow a base emoji and are terminated by U+E007F Cancel Tag. Like the other characters in the block, this one has no visible glyph and is default ignorable, so software that does not interpret a tag sequence normally displays nothing for it.
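The correspondence between this character and the ordinary letter O can be illustrated with a short Python sketch. It assumes only that tag characters in the range U+E0020 to U+E007E sit at a fixed offset of 0xE0000 from their ASCII counterparts. The helper names `to_tag` and `from_tag` are hypothetical, chosen for illustration.

```python
TAG_OFFSET = 0xE0000  # distance between an ASCII character and its tag counterpart


def to_tag(ch: str) -> str:
    """Map a printable ASCII character (U+0020..U+007E) to its tag character."""
    cp = ord(ch)
    if not 0x20 <= cp <= 0x7E:
        raise ValueError(f"not a printable ASCII character: {ch!r}")
    return chr(cp + TAG_OFFSET)


def from_tag(ch: str) -> str:
    """Map a tag character (U+E0020..U+E007E) back to its ASCII counterpart."""
    cp = ord(ch)
    if not 0xE0020 <= cp <= 0xE007E:
        raise ValueError(f"not a tag character: U+{cp:04X}")
    return chr(cp - TAG_OFFSET)


tag_o = to_tag("O")
print(f"U+{ord(tag_o):05X}")  # U+E004F
print(from_tag(tag_o))        # O
```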

General Properties

Code Point U+E004F
Version Added 3.1
Name Tag Latin Capital Letter O
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
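
Several of the general properties above can be checked against the Unicode data bundled with a language runtime. The following is a minimal sketch using Python's standard `unicodedata` module. The values it reports depend on the Unicode version shipped with the interpreter, although these particular properties have been stable since the character was added.

```python
import unicodedata

ch = "\U000E004F"

name = unicodedata.name(ch)
category = unicodedata.category(ch)
combining = unicodedata.combining(ch)
bidi = unicodedata.bidirectional(ch)

print(name)       # TAG LATIN CAPITAL LETTER O
print(category)   # Cf  (General Category: Format)
print(combining)  # 0   (Canonical Combining Class: Not Reordered)
print(bidi)       # BN  (Bidirectional Class: Boundary Neutral)
```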

Encodings

HTML Decimal Encoding &#917583;
HTML Hex Encoding &#xE004F;
UTF-8 Encoding 0xF3 0xA0 0x81 0x8F
UTF-16 Encoding 0xDB40 0xDC4F
UTF-32 Encoding 0x000E004F
Java Escape \udb40\udc4f
C/C++ Escape \U000E004F
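
The byte sequences listed above can be reproduced with Python's built-in codecs. This sketch uses the big-endian codec variants so that no byte order mark is prepended. The variable names are illustrative only.

```python
ch = "\U000E004F"

utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")  # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")  # big-endian, no byte order mark

print(utf8.hex(" "))   # f3 a0 81 8f
print(utf16.hex(" "))  # db 40 dc 4f  (surrogate pair DB40 DC4F)
print(utf32.hex(" "))  # 00 0e 00 4f

# Numeric character references as they would be written in HTML source.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"
print(html_decimal, html_hex)
```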

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes
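
The four normalization quick-check values of Yes listed above imply that the character passes through every normalization form unchanged. A minimal sketch that checks this with Python's standard `unicodedata` module follows. The `results` dictionary is a hypothetical name used only for this example.

```python
import unicodedata

ch = "\U000E004F"

# For each normalization form, record whether the character survives unchanged.
results = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

for form, unchanged in results.items():
    print(form, unchanged)  # each form prints True
```

This does not exercise the Changes When NFKC Casefolded property. That property describes the separate NFKC_Casefold mapping, which removes default ignorable code points and which, as far as I know, `unicodedata` does not expose directly.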