U+E0030 "󠀰" Tag Digit Zero Unicode Character

Unicode Version 17.0

󠀰

U+E0030 "󠀰" Tag Digit Zero is a formatting character used exclusively within the Unicode “Tags” block, intended to mark language or text processing metadata rather than to represent a visible symbol. It is part of a specialized set of 96 tag characters, ranging from U+E0020 to U+E007E, which correspond to ASCII printable characters and are used in conjunction with the Language Tag character (U+E0001) to indicate the language of a text, typically within plain text environments. This character itself does not produce a visible glyph in standard text rendering and is invisible to users, though it may appear as a placeholder in certain fonts or debugging views. The Tags block was primarily designed for legacy compatibility and is supported by very few modern implementations, making U+E0030 largely obsolete for everyday use.

General Properties

Code Point U+E0030
Version Added 3.1
Name Tag Digit Zero
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠀰
HTML Hex Encoding 󠀰
UTF-8 Encoding 0xF3 0xA0 0x80 0xB0
UTF-16 Encoding 0xDB40 0xDC30
UTF-32 Encoding 0x000E0030
C/C++/Java Escape \udb40\udc30

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes