U+E0068 "󠁨" Tag Latin Small Letter H Unicode Character

Unicode Version 17.0

󠁨

U+E0068 "󠁨" Tag Latin Small Letter H is a format character in the Tags block. It represents the lowercase letter "h" in a tag context: each tag character mirrors an ASCII character at an offset of U+E0000, so U+E0068 corresponds to U+0068 "h". The block was originally introduced so that language tags could be embedded in plain text for certain protocols. A tag began with U+E0001 Language Tag, followed by tag letters spelling out a language code, such as "hi" for Hindi. That use is deprecated; language is now expected to be marked by higher-level mechanisms such as markup. The tag letters themselves remain defined and were later repurposed for emoji tag sequences (for example, subdivision flags), which is why this character carries the Emoji Component property. As a default ignorable code point, it has no visible glyph and is normally rendered as nothing at all.
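The mapping between ASCII and tag characters described above can be sketched in a few lines of Python. This is a minimal illustration, not a recommended practice: the helper names `to_tag_chars` and `from_tag_chars` are hypothetical, and the language-tag sequence it builds uses the deprecated mechanism purely to show how the pieces fit together.

```python
TAG_OFFSET = 0xE0000          # tag characters mirror ASCII at this offset
LANGUAGE_TAG = "\U000E0001"   # U+E0001 LANGUAGE TAG (deprecated)


def to_tag_chars(ascii_text: str) -> str:
    """Map printable ASCII (U+0020..U+007E) to the matching tag characters."""
    for ch in ascii_text:
        if not 0x20 <= ord(ch) <= 0x7E:
            raise ValueError(f"not printable ASCII: {ch!r}")
    return "".join(chr(ord(ch) + TAG_OFFSET) for ch in ascii_text)


def from_tag_chars(text: str) -> str:
    """Recover the ASCII payload from any tag characters found in text."""
    return "".join(
        chr(ord(ch) - TAG_OFFSET)
        for ch in text
        if 0xE0020 <= ord(ch) <= 0xE007E
    )


# "h" maps to U+E0068, the character described on this page.
tag_h = to_tag_chars("h")

# A deprecated-style language tag for Hindi ("hi").
hindi_tag = LANGUAGE_TAG + to_tag_chars("hi")

print(f"U+{ord(tag_h):04X}")
print(from_tag_chars(hindi_tag))
```

Because the characters are invisible, printing `hindi_tag` directly would normally show nothing; decoding it back with `from_tag_chars` is the easiest way to inspect it.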

General Properties

Code Point U+E0068
Version Added 3.1
Name Tag Latin Small Letter H
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
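Several of the general properties listed above can be checked with Python's standard `unicodedata` module. The sketch below assumes a Python build whose bundled Unicode database includes this character, which should be true of any current release since the character dates from Unicode 3.1.

```python
import unicodedata

ch = "\U000E0068"

props = {
    "name": unicodedata.name(ch),                    # character name
    "category": unicodedata.category(ch),            # "Cf" means Format
    "combining": unicodedata.combining(ch),          # 0 means Not Reordered
    "bidirectional": unicodedata.bidirectional(ch),  # "BN" means Boundary Neutral
}

for key, value in props.items():
    print(f"{key}: {value}")
```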

Encodings

HTML Decimal Encoding &#917608;
HTML Hex Encoding &#xE0068;
UTF-8 Encoding 0xF3 0xA0 0x81 0xA8
UTF-16 Encoding 0xDB40 0xDC68
UTF-32 Encoding 0x000E0068
Java Escape \udb40\udc68
C/C++ Escape \U000E0068
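The encodings in this section can be reproduced from the code point alone. The following Python sketch uses the built-in codecs and also derives the UTF-16 surrogate pair by hand; the function name `surrogate_pair` is a hypothetical helper introduced for illustration.

```python
CODE_POINT = 0xE0068
ch = chr(CODE_POINT)

# Byte sequences from the built-in codecs (big-endian, no byte order mark).
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")
utf32 = ch.encode("utf-32-be")


def surrogate_pair(cp: int) -> tuple[int, int]:
    """Split a supplementary-plane code point into UTF-16 surrogates."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("code point is not in a supplementary plane")
    v = cp - 0x10000
    return 0xD800 + (v >> 10), 0xDC00 + (v & 0x3FF)


# HTML numeric character references.
html_decimal = f"&#{CODE_POINT};"
html_hex = f"&#x{CODE_POINT:X};"

print(" ".join(f"0x{b:02X}" for b in utf8))
print(" ".join(f"0x{u:04X}" for u in surrogate_pair(CODE_POINT)))
print(f"0x{int.from_bytes(utf32, 'big'):08X}")
print(html_decimal, html_hex)
```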

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes
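Two of the properties above have practical consequences that are easy to demonstrate. A quick-check value of Yes for all four normalization forms means normalization leaves the character unchanged, and its default ignorable status means it can hide unseen in text. The sketch below checks the first with the standard `unicodedata` module and shows one way to remove tag characters; `strip_tag_block` is a hypothetical helper, and whether stripping is appropriate depends on the application, since it would also break emoji tag sequences.

```python
import unicodedata

ch = "\U000E0068"

# Normalization should be a no-op in every form.
unchanged = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}


def strip_tag_block(text: str) -> str:
    """Remove every code point in the Tags block (U+E0000..U+E007F)."""
    return "".join(c for c in text if not 0xE0000 <= ord(c) <= 0xE007F)


sample = "a" + ch + "b"   # looks like "ab" when rendered
cleaned = strip_tag_block(sample)

print(unchanged)
print(len(sample), len(cleaned))
```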