U+E004C "󠁌" Tag Latin Capital Letter L Unicode Character

Unicode Version 17.0

󠁌

U+E004C "󠁌" Tag Latin Capital Letter L is a special purpose character belonging to the Tags block, used exclusively within the Unicode standard for language tagging in plain text, where it functions as a formatting tag rather than a displayable letter. It was added in Unicode version 3.1 and is typically employed in conjunction with other tag characters to encode a language identifier. This character is not intended for normal text rendering and is invisible in most contexts, as its purpose is to provide metadata about the language of surrounding text.

General Properties

Code Point U+E004C
Version Added 3.1
Name Tag Latin Capital Letter L
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠁌
HTML Hex Encoding 󠁌
UTF-8 Encoding 0xF3 0xA0 0x81 0x8C
UTF-16 Encoding 0xDB40 0xDC4C
UTF-32 Encoding 0x000E004C
C/C++/Java Escape \udb40\udc4c

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes