U+E0067 "󠁧" Tag Latin Small Letter G Unicode Character

Unicode Version 17.0

󠁧

U+E0067 "󠁧" Tag Latin Small Letter G is a special purpose formatting character used within the Unicode tag system, primarily designed for use in invisible tagging applications such as language identification in plain text. It is part of a block of tag characters that are intended to be combined in sequences to encode invisible metadata, and it is functionally distinct from an ordinary visible letter "g". This character is typically used in combination with other tag characters, such as the U+E0001 language tag start, to form a complete tag sequence that can indicate a specific language, as defined by subtags in the IANA registry. When properly rendered, U+E0067 does not produce a visible glyph itself but instead acts as an invisible marker, making it a part of the system's infrastructure for embedding semantic information in text without affecting its visual appearance.

General Properties

Code Point U+E0067
Version Added 3.1
Name Tag Latin Small Letter G
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠁧
HTML Hex Encoding 󠁧
UTF-8 Encoding 0xF3 0xA0 0x81 0xA7
UTF-16 Encoding 0xDB40 0xDC67
UTF-32 Encoding 0x000E0067
C/C++/Java Escape \udb40\udc67

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes