U+E0052 "󠁒" Tag Latin Capital Letter R Unicode Character

Unicode Version 17.0

󠁒

U+E0052 "󠁒" Tag Latin Capital Letter R is a formatting character that belongs to the Tags block, primarily designed for use in Unicode's language tagging mechanism, such as in the deprecated registration of language tags in plain text. This invisible character does not represent a graphic symbol itself but instead acts as a code point to be combined with other tag characters for encoding language identifiers, specifically resembling the ASCII capital letter R in its shape for tagging purposes. Its primary historical application was in enabling language metadata within text streams, though modern protocols have largely shifted to alternative methods like the Language Subtag Registry or XML-based tags. As a result, U+E0052 is rarely used in contemporary computing and is more of a technical artifact within the Unicode standard.

General Properties

Code Point U+E0052
Version Added 3.1
Name Tag Latin Capital Letter R
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠁒
HTML Hex Encoding 󠁒
UTF-8 Encoding 0xF3 0xA0 0x81 0x92
UTF-16 Encoding 0xDB40 0xDC52
UTF-32 Encoding 0x000E0052
C/C++/Java Escape \udb40\udc52

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes