U+E0047 "󠁇" Tag Latin Capital Letter G Unicode Character

Unicode Version 17.0

󠁇

U+E0047 "󠁇" Tag Latin Capital Letter G is an invisible format character in the Tags block (U+E0000 to U+E007F). It mirrors the ASCII letter G (U+0047) at a fixed offset of 0xE0000 and is not intended for display. The block was introduced in Unicode 3.1 so that a language identifier, such as "en" for English, could be embedded in plain text as a run of tag characters following U+E0001 LANGUAGE TAG. That use is deprecated: language metadata is now expected to be carried by higher-level protocols, for example BCP 47 language tags in markup such as the HTML lang attribute. The tag characters U+E0020 through U+E007F are themselves no longer deprecated, because emoji tag sequences reuse them. A subdivision flag such as the flag of England is encoded as U+1F3F4 WAVING BLACK FLAG, followed by tag characters spelling "gbeng", followed by U+E007F CANCEL TAG. Those flag sequences use only tag lowercase letters and digits, so this capital-letter tag does not occur in the currently recommended emoji flag sequences.
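
Each tag character sits at a fixed offset of 0xE0000 from its ASCII counterpart, so a tag sequence can be built with simple arithmetic. The following is a minimal Python sketch. The helper names `to_tags` and `from_tags` are hypothetical, chosen for illustration, and the example assumes the input is restricted to printable ASCII (U+0020 to U+007E).

```python
TAG_OFFSET = 0xE0000        # the Tags block mirrors ASCII at this offset
CANCEL_TAG = "\U000E007F"   # terminates an emoji tag sequence


def to_tags(text: str) -> str:
    """Map printable ASCII characters to their tag counterparts (hypothetical helper)."""
    for ch in text:
        if not 0x20 <= ord(ch) <= 0x7E:
            raise ValueError(f"not printable ASCII: {ch!r}")
    return "".join(chr(ord(ch) + TAG_OFFSET) for ch in text)


def from_tags(text: str) -> str:
    """Recover the ASCII payload from any tag characters in text (hypothetical helper)."""
    return "".join(
        chr(ord(ch) - TAG_OFFSET)
        for ch in text
        if 0xE0020 <= ord(ch) <= 0xE007E
    )


# "G" maps to U+E0047, the character described on this page.
tag_g = to_tags("G")

# An emoji tag sequence: WAVING BLACK FLAG + tag "gbeng" + CANCEL TAG.
england = "\U0001F3F4" + to_tags("gbeng") + CANCEL_TAG
```

Whether `england` renders as a flag depends on font and platform support; without it, only the black flag is shown and the tag characters stay invisible.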

General Properties

Code Point U+E0047
Version Added 3.1
Name Tag Latin Capital Letter G
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
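
Several of the general properties above can be cross-checked with Python's standard `unicodedata` module. This is a small sketch, not an exhaustive property lookup: the module reports the short property aliases (`Cf` for Format, `BN` for Boundary Neutral) and does not expose the block name.

```python
import unicodedata

ch = "\U000E0047"

props = {
    "name": unicodedata.name(ch),                    # formal character name
    "category": unicodedata.category(ch),            # General Category alias
    "combining": unicodedata.combining(ch),          # Canonical Combining Class
    "bidirectional": unicodedata.bidirectional(ch),  # Bidirectional Class alias
}

for key, value in props.items():
    print(f"{key}: {value}")
```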

Encodings

HTML Decimal Encoding &#917575;
HTML Hex Encoding &#xE0047;
UTF-8 Encoding 0xF3 0xA0 0x81 0x87
UTF-16 Encoding 0xDB40 0xDC47
UTF-32 Encoding 0x000E0047
C/C++ Escape \U000E0047
Java Escape \udb40\udc47
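
The encoded forms listed above can be reproduced with Python's built-in codecs. The sketch below uses the big-endian codec variants so that no byte order mark is emitted. The variable names are illustrative only.

```python
ch = "\U000E0047"

utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")   # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")

# Split the UTF-16 bytes into the high and low surrogate code units.
high = int.from_bytes(utf16[0:2], "big")
low = int.from_bytes(utf16[2:4], "big")

# Numeric character references as they would be written in HTML source.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8.hex(), utf16.hex(), utf32.hex())
print(hex(high), hex(low))
print(html_decimal, html_hex)
```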

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes
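
A Quick Check value of Yes for all four normalization forms implies that the character is left unchanged by normalization. A minimal Python sketch of that check, using the standard `unicodedata` module (the name `unchanged` is illustrative):

```python
import unicodedata

ch = "\U000E0047"

# For each normalization form, record whether the character survives unchanged.
unchanged = {
    form: unicodedata.normalize(form, ch) == ch
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

print(unchanged)
```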