U+E0055 "󠁕" Tag Latin Capital Letter U Unicode Character

Unicode Version 17.0

󠁕

U+E0055 "󠁕" Tag Latin Capital Letter U is part of the Tags block, a set of invisible format characters that mirror printable ASCII at an offset of 0xE0000. The block was originally introduced to embed language tags in plain text: U+E0001 Language Tag followed by tag characters spelling out a language identifier. That use is deprecated, and Unicode recommends conveying language information through higher-level protocols such as markup instead. Since Unicode 8.0, the tag characters U+E0020 through U+E007F are no longer deprecated and serve as components of emoji tag sequences, which is why this character carries the Emoji Component property. The character corresponds to the Latin capital letter U, but it is not an alphabetic letter and has no visible glyph. It is a default-ignorable code point that is meaningful only alongside other tag characters, as machine-readable metadata rather than displayed text.
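The relationship between tag characters and ASCII can be illustrated with a minimal Python sketch. It assumes only the fixed 0xE0000 offset between printable ASCII (0x20 to 0x7E) and the tag characters U+E0020 to U+E007E; the helper names `to_tag_chars` and `from_tag_chars` are hypothetical and not part of any library.

```python
TAG_OFFSET = 0xE0000  # tag characters mirror printable ASCII at this offset


def to_tag_chars(text: str) -> str:
    """Map printable ASCII characters to their invisible tag counterparts."""
    out = []
    for ch in text:
        cp = ord(ch)
        if not 0x20 <= cp <= 0x7E:
            raise ValueError(f"not printable ASCII: U+{cp:04X}")
        out.append(chr(cp + TAG_OFFSET))
    return "".join(out)


def from_tag_chars(text: str) -> str:
    """Recover the ASCII text from tag characters, skipping anything else."""
    return "".join(
        chr(ord(ch) - TAG_OFFSET)
        for ch in text
        if 0xE0020 <= ord(ch) <= 0xE007E
    )


tag_u = to_tag_chars("U")
print(f"U+{ord(tag_u):04X}")   # code point of the tag form of "U"
print(from_tag_chars(tag_u))   # back to plain ASCII
```

Under this mapping, ASCII "U" (0x55) becomes U+E0055, the character described on this page.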

General Properties

Code Point U+E0055
Version Added 3.1
Name Tag Latin Capital Letter U
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
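Several of these properties can be checked with Python's standard `unicodedata` module. This is a rough sketch: the module reports properties as short aliases, exposes only a subset of the Unicode Character Database, and reflects whichever Unicode version the running Python build ships with.

```python
import unicodedata

ch = "\U000E0055"  # Tag Latin Capital Letter U

props = {
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # short alias of General Category
    "combining": unicodedata.combining(ch),          # Canonical Combining Class as an int
    "bidirectional": unicodedata.bidirectional(ch),  # short alias of Bidirectional Class
}

for key, value in props.items():
    print(f"{key}: {value}")
```

The alias `Cf` corresponds to General Category Format, a combining class of 0 corresponds to Not Reordered, and `BN` corresponds to Boundary Neutral.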

Encodings

HTML Decimal Encoding &#917589;
HTML Hex Encoding &#xE0055;
UTF-8 Encoding 0xF3 0xA0 0x81 0x95
UTF-16 Encoding 0xDB40 0xDC55
UTF-32 Encoding 0x000E0055
C/C++ Escape \U000E0055
Java Escape \udb40\udc55
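The encoded forms listed in this section can be reproduced from the code point alone. The following Python sketch uses only the built-in `str.encode` method; the helpers `hex_bytes` and `utf16_units` are hypothetical names introduced here for formatting.

```python
ch = "\U000E0055"  # Tag Latin Capital Letter U


def hex_bytes(data: bytes) -> str:
    """Format bytes as space-separated 0xNN values."""
    return " ".join(f"0x{b:02X}" for b in data)


def utf16_units(data: bytes) -> str:
    """Format big-endian UTF-16 bytes as 16-bit code units."""
    units = (int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2))
    return " ".join(f"0x{u:04X}" for u in units)


utf8 = hex_bytes(ch.encode("utf-8"))
utf16 = utf16_units(ch.encode("utf-16-be"))  # big-endian, no byte order mark
utf32 = f"0x{ord(ch):08X}"
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

for label, value in [
    ("UTF-8", utf8),
    ("UTF-16", utf16),
    ("UTF-32", utf32),
    ("HTML decimal", html_decimal),
    ("HTML hex", html_hex),
]:
    print(f"{label}: {value}")
```

Because U+E0055 lies outside the Basic Multilingual Plane, UTF-16 represents it as a surrogate pair, and UTF-8 needs four bytes.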

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes