U+E003C "󠀼" Tag Less-than Sign Unicode Character

Unicode Version 17.0

󠀼

U+E003C "󠀼" Tag Less-than Sign is a formatting character classified within the Tags block, specifically used as a component in the Unicode tag mechanism for language and text processing. It does not represent a visible typographic symbol, but rather acts as a coded markup element that can be combined with other tag characters to identify or annotate text segments, primarily in contexts like plain text language tagging or as part of legacy usage from the deprecated "Tag Space" and "Tag Character" sequences. Its function is invisible to the end user in typical rendering, serving as a non graphical identifier rather than a printable glyph.

General Properties

Code Point U+E003C
Version Added 3.1
Name Tag Less-than Sign
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠀼
HTML Hex Encoding 󠀼
UTF-8 Encoding 0xF3 0xA0 0x80 0xBC
UTF-16 Encoding 0xDB40 0xDC3C
UTF-32 Encoding 0x000E003C
C/C++/Java Escape \udb40\udc3c

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes