U+E0043 "󠁃" Tag Latin Capital Letter C Unicode Character

Unicode Version 17.0

󠁃

U+E0043 "󠁃" Tag Latin Capital Letter C is an invisible formatting character in Unicode's Tags block. The printable tag characters mirror the printable ASCII characters at a fixed offset of 0xE0000, so this one stands for the Latin capital letter C (U+0043) inside a tag sequence. It is not intended for normal text display or writing; it is meaningful only as one element of a sequence of tag characters that spells out metadata. The original purpose was language tagging in plain-text protocols, a use that Unicode has since deprecated. The Emoji Component property listed below reflects the later reuse of tag characters in emoji tag sequences.
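The relationship between a tag character and the ASCII character it stands for can be sketched in a few lines of Python. This is a minimal illustration, assuming the printable tag range U+E0020..U+E007E mirrors ASCII U+0020..U+007E at an offset of 0xE0000; the helper names `tag_to_ascii` and `ascii_to_tags` are hypothetical and not part of any library.

```python
TAG_OFFSET = 0xE0000  # assumed fixed distance between ASCII and its tag mirror


def tag_to_ascii(ch: str) -> str:
    """Return the ASCII character that a printable tag character stands for."""
    cp = ord(ch)
    if not 0xE0020 <= cp <= 0xE007E:
        raise ValueError(f"U+{cp:04X} is not a printable tag character")
    return chr(cp - TAG_OFFSET)


def ascii_to_tags(text: str) -> str:
    """Map printable ASCII text onto the corresponding tag characters."""
    out = []
    for ch in text:
        cp = ord(ch)
        if not 0x20 <= cp <= 0x7E:
            raise ValueError(f"U+{cp:04X} is not printable ASCII")
        out.append(chr(cp + TAG_OFFSET))
    return "".join(out)


tag_c = "\U000E0043"
print(tag_to_ascii(tag_c))                    # C
print(f"U+{ord(ascii_to_tags('C')):04X}")     # U+E0043
```

The output of `ascii_to_tags` is invisible in most renderers, so inspecting code points, as in the last line, is usually the only practical way to see it.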

General Properties

Code Point U+E0043
Version Added 3.1
Name Tag Latin Capital Letter C
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
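Several of the general properties above can be checked with Python's standard `unicodedata` module. The sketch below assumes the interpreter's bundled Unicode database includes this character, which should hold for any current Python 3 since the character dates from Unicode 3.1; the `describe` helper is a hypothetical name used only for illustration.

```python
import unicodedata


def describe(ch: str) -> dict:
    """Collect a few basic properties of a single character."""
    return {
        "code_point": f"U+{ord(ch):04X}",
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),            # Cf = Format
        "combining": unicodedata.combining(ch),          # 0 = Not Reordered
        "bidirectional": unicodedata.bidirectional(ch),  # BN = Boundary Neutral
    }


for key, value in describe("\U000E0043").items():
    print(f"{key}: {value}")
```

The module reports short property value aliases (`Cf`, `BN`) and a numeric combining class rather than the long names shown in the listing.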

Encodings

HTML Decimal Encoding &#917571;
HTML Hex Encoding &#xE0043;
UTF-8 Encoding 0xF3 0xA0 0x81 0x83
UTF-16 Encoding 0xDB40 0xDC43
UTF-32 Encoding 0x000E0043
Java Escape \udb40\udc43
C/C++ Escape \U000E0043
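The encoded forms in this section can be reproduced from the code point alone. The following is a small sketch using Python's built-in codecs, plus a hand-written surrogate calculation to show where the two UTF-16 code units come from; `utf16_surrogates` is an illustrative helper, not a standard function.

```python
CODE_POINT = 0xE0043
ch = chr(CODE_POINT)


def utf16_surrogates(cp: int) -> tuple:
    """Split a supplementary-plane code point into a UTF-16 surrogate pair."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points above U+FFFF need surrogates")
    v = cp - 0x10000
    high = 0xD800 + (v >> 10)    # top 10 bits
    low = 0xDC00 + (v & 0x3FF)   # bottom 10 bits
    return high, low


encodings = {
    "utf-8": ch.encode("utf-8").hex(),       # f3a08183
    "utf-16": ch.encode("utf-16-be").hex(),  # db40dc43
    "utf-32": ch.encode("utf-32-be").hex(),  # 000e0043
    "html_decimal": f"&#{CODE_POINT};",      # &#917571;
    "html_hex": f"&#x{CODE_POINT:X};",       # &#xE0043;
}

for name, value in encodings.items():
    print(f"{name}: {value}")

high, low = utf16_surrogates(CODE_POINT)
print(f"surrogates: 0x{high:04X} 0x{low:04X}")  # 0xDB40 0xDC43
```

The big-endian codecs are used so that the byte order matches the listing and no byte order mark is prepended.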

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes
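Because the character is a Default Ignorable Code Point, it normally renders as nothing, so text can carry tag characters without the reader noticing. A minimal sketch of how such characters might be located or removed follows; it assumes the Tags block spans U+E0000..U+E007F, and `find_tag_characters` and `strip_tag_characters` are hypothetical helper names.

```python
import re

# Assumed extent of the Tags block: U+E0000..U+E007F.
TAG_CHARS = re.compile("[\U000E0000-\U000E007F]")


def find_tag_characters(text: str) -> list:
    """Return (index, code point label) pairs for each tag character found."""
    return [(m.start(), f"U+{ord(m.group()):04X}") for m in TAG_CHARS.finditer(text)]


def strip_tag_characters(text: str) -> str:
    """Remove every Tags-block character from the text."""
    return TAG_CHARS.sub("", text)


sample = "AB\U000E0043CD"
print(find_tag_characters(sample))   # [(2, 'U+E0043')]
print(strip_tag_characters(sample))  # ABCD
```

Removing tag characters unconditionally also breaks emoji tag sequences such as subdivision flags, so a real filter would likely need to leave well-formed emoji sequences alone.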