U+E0063 "󠁣" Tag Latin Small Letter C Unicode Character

Unicode Version 17.0

󠁣

U+E0063 Tag Latin Small Letter C is an invisible format character in the Unicode Tags block. It has no visible glyph and mirrors ASCII "c" (U+0063) at a fixed offset of 0xE0000. The block was originally designed for language tagging in plain text: U+E0001 LANGUAGE TAG introduces a run of tag characters that spell out a language identifier in the style of RFC 5646, and U+E007F CANCEL TAG ends the scope of the tag. The Unicode Standard has since deprecated that use. Today tag characters appear mainly in emoji tag sequences, where they follow a base emoji and are terminated by U+E007F CANCEL TAG; the flag of Scotland, for example, is U+1F3F4 followed by the tag characters for "gbsct" and CANCEL TAG. Because U+E0063 is a non-printing format character (General Category Cf, not a control character) and a default ignorable code point, most systems and fonts render it as nothing at all. It carries metadata without altering the visible text and has no role in general typography.
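The relationship between tag characters and ASCII can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the helper names `to_tags` and `from_tags` are hypothetical, and the "cs" (Czech) payload is only an example of a language-tag style sequence.

```python
TAG_OFFSET = 0xE0000         # tag characters mirror ASCII at this offset
LANGUAGE_TAG = "\U000E0001"  # U+E0001 LANGUAGE TAG


def to_tags(ascii_text: str) -> str:
    """Map printable ASCII (U+0020..U+007E) to the matching tag characters."""
    for ch in ascii_text:
        if not 0x20 <= ord(ch) <= 0x7E:
            raise ValueError(f"not printable ASCII: {ch!r}")
    return "".join(chr(ord(ch) + TAG_OFFSET) for ch in ascii_text)


def from_tags(tag_text: str) -> str:
    """Map tag characters (U+E0020..U+E007E) back to printable ASCII."""
    for ch in tag_text:
        if not 0xE0020 <= ord(ch) <= 0xE007E:
            raise ValueError(f"not a tag character: U+{ord(ch):04X}")
    return "".join(chr(ord(ch) - TAG_OFFSET) for ch in tag_text)


# U+E0063 is the tag counterpart of ASCII "c".
assert to_tags("c") == "\U000E0063"
assert from_tags("\U000E0063") == "c"

# A language-tag style sequence for "cs": LANGUAGE TAG followed by the payload.
czech = LANGUAGE_TAG + to_tags("cs")
print([f"U+{ord(ch):05X}" for ch in czech])  # ['U+E0001', 'U+E0063', 'U+E0073']
```

The range checks are there so that characters outside the mirrored printable-ASCII range raise an error instead of being silently shifted to unrelated code points.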

General Properties

Code Point U+E0063
Version Added 3.1
Name Tag Latin Small Letter C
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
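Several of these general properties can be read back from Python's standard `unicodedata` module. The snippet below is a small sketch; it assumes only that the interpreter's bundled Unicode database includes this character, which has been encoded since Unicode 3.1.

```python
import unicodedata

ch = "\U000E0063"

print(unicodedata.name(ch))           # TAG LATIN SMALL LETTER C
print(unicodedata.category(ch))       # Cf (Format)
print(unicodedata.combining(ch))      # 0  (Not Reordered)
print(unicodedata.bidirectional(ch))  # BN (Boundary Neutral)
```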

Encodings

HTML Decimal Encoding &#917603;
HTML Hex Encoding &#xE0063;
UTF-8 Encoding 0xF3 0xA0 0x81 0xA3
UTF-16 Encoding 0xDB40 0xDC63
UTF-32 Encoding 0x000E0063
C/C++ Escape \U000E0063
Java Escape \udb40\udc63
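The encoded forms listed above can be reproduced with Python's built-in codecs. This is a quick verification sketch; the variable names are illustrative, and the big-endian codec variants are chosen only so that no byte order mark is emitted.

```python
import html

ch = "\U000E0063"  # TAG LATIN SMALL LETTER C

utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")  # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")

print(utf8.hex(" ").upper())      # F3 A0 81 A3
print(utf16.hex(" ", 2).upper())  # DB40 DC63
print(utf32.hex().upper())        # 000E0063

# The UTF-16 surrogate pair, computed by hand from the code point.
v = ord(ch) - 0x10000
high = 0xD800 + (v >> 10)
low = 0xDC00 + (v & 0x3FF)
assert (high, low) == (0xDB40, 0xDC63)

# HTML numeric character references, decimal and hexadecimal.
decimal_ref = f"&#{ord(ch)};"
hex_ref = f"&#x{ord(ch):X};"
print(decimal_ref, hex_ref)       # &#917603; &#xE0063;
assert html.unescape(decimal_ref) == ch
assert html.unescape(hex_ref) == ch
```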

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes
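The normalization and numeric entries in this list can be spot-checked with the standard `unicodedata` module. The sketch below assumes Python 3.8 or later, since it uses `unicodedata.is_normalized`; it only confirms that the character is left unchanged by each normalization form and has no numeric value.

```python
import unicodedata

ch = "\U000E0063"

# Quick Check "Yes" in every form: normalization leaves the character as-is.
for form in ("NFC", "NFD", "NFKC", "NFKD"):
    assert unicodedata.normalize(form, ch) == ch
    assert unicodedata.is_normalized(form, ch)

# Numeric Type None: no numeric value, so the supplied default is returned.
numeric_value = unicodedata.numeric(ch, None)
assert numeric_value is None
```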