U+E0031 "󠀱" Tag Digit One Unicode Character

Unicode Version 17.0

󠀱

U+E0031 Tag Digit One belongs to the Tags block, a set of invisible format characters that mirror the printable ASCII range at an offset of 0xE0000. It corresponds to the ASCII digit "1" (U+0031) but is never displayed on its own; it appears only inside a sequence of tag characters that software can read as metadata. The block was introduced for language tagging in plain text, a use that Unicode has since deprecated. Today tag characters appear mainly in emoji tag sequences, such as the subdivision flags, which is why this character carries the Emoji Component property. Outside those sequences it is rarely encountered in everyday text.
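Because each tag character sits exactly 0xE0000 above the ASCII character it stands for, the hidden payload of a tag sequence can be recovered with simple arithmetic. The following is a minimal sketch of that mapping; the names `TAG_BASE` and `decode_tags` are illustrative and not part of any standard API.

```python
# Illustrative helper (hypothetical name): recover the ASCII text that a run
# of tag characters spells out.
TAG_BASE = 0xE0000  # tag characters mirror ASCII at this offset


def decode_tags(text: str) -> str:
    """Return the ASCII characters encoded by tag characters in `text`.

    Only U+E0020..U+E007E are mapped; U+E007F (CANCEL TAG) and all
    non-tag characters are skipped.
    """
    return "".join(
        chr(ord(ch) - TAG_BASE)
        for ch in text
        if 0xE0020 <= ord(ch) <= 0xE007E
    )


print(decode_tags("\U000E0031"))  # prints 1
```

Applied to U+E0031 alone, the function yields the ordinary digit "1" (U+0031).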

General Properties

Code Point U+E0031
Version Added 3.1
Name Tag Digit One
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
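Several of the general properties above can be checked against a local Unicode database. This sketch uses Python's standard `unicodedata` module; the short codes it returns (`Cf`, `BN`) are the abbreviations for Format and Boundary Neutral, and the variable name `ch` is only for illustration.

```python
import unicodedata

ch = "\U000E0031"  # TAG DIGIT ONE

print(unicodedata.name(ch))           # character name
print(unicodedata.category(ch))       # general category, "Cf" = Format
print(unicodedata.combining(ch))      # canonical combining class, 0 = Not Reordered
print(unicodedata.bidirectional(ch))  # bidi class, "BN" = Boundary Neutral
```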

Encodings

HTML Decimal Encoding &#917553;
HTML Hex Encoding &#xE0031;
UTF-8 Encoding 0xF3 0xA0 0x80 0xB1
UTF-16 Encoding 0xDB40 0xDC31
UTF-32 Encoding 0x000E0031
C/C++ Escape \U000E0031
Java Escape \udb40\udc31
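The encoded forms listed above all follow mechanically from the code point. As a rough sketch, the Python below reproduces them with the built-in codecs and derives the UTF-16 surrogate pair by hand; the variable names are illustrative only.

```python
import html

ch = "\U000E0031"
cp = ord(ch)  # 0xE0031

# Byte sequences from the built-in codecs (big-endian, no byte order mark).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")

# UTF-16 surrogate pair by hand: subtract 0x10000, then split the
# remaining 20 bits into a high and a low 10-bit half.
offset = cp - 0x10000
high = 0xD800 + (offset >> 10)
low = 0xDC00 + (offset & 0x3FF)

# HTML numeric character references.
decimal_ref = f"&#{cp};"
hex_ref = f"&#x{cp:X};"

print(" ".join(f"0x{b:02X}" for b in utf8))  # 0xF3 0xA0 0x80 0xB1
print(f"0x{high:04X} 0x{low:04X}")           # 0xDB40 0xDC31
print(f"0x{cp:08X}")                         # 0x000E0031
print(decimal_ref, hex_ref)                  # &#917553; &#xE0031;
```

Decoding either reference with `html.unescape` gives back the original character.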

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes