U+E006D "󠁭" Tag Latin Small Letter M Unicode Character

Unicode Version 17.0

󠁭

U+E006D "󠁭" Tag Latin Small Letter M is a formatting character belonging to the Tags block, used as part of a sequence for language tagging in plain text. It represents the lowercase letter m in a set of special code points designed to be combined with other tag characters to encode a language identifier, such as for indicating a specific spoken language in text processing. This character is intended for internal use in conjunction with the U+E0001 Language Tag start marker and a terminating U+E007F Cancel Tag, allowing applications to apply linguistic metadata to a string without affecting the visible text. It is not meant for standalone display and is typically invisible in standard rendering, serving only as a structural component in a controlled Unicode mechanism that has been largely superseded by modern language markup protocols.

General Properties

Code Point U+E006D
Version Added 3.1
Name Tag Latin Small Letter M
Block Tags
General Category Format
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral

Encodings

HTML Decimal Encoding 󠁭
HTML Hex Encoding 󠁭
UTF-8 Encoding 0xF3 0xA0 0x81 0xAD
UTF-16 Encoding 0xDB40 0xDC6D
UTF-32 Encoding 0x000E006D
C/C++/Java Escape \udb40\udc6d

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Joining Type Transparent
Line Break Combining Mark
Case Ignorable Yes
Changes When NFKC Casefolded Yes
Script Common
Script Extensions Common
Indic Syllabic Category Other
Indic Conjunct Break Extend
Default Ignorable Code Point Yes
Vertical Orientation Rotated
Grapheme Extend Yes
Other Grapheme Extend Yes
Grapheme Cluster Break Extend
Word Break Extend
Sentence Break Extend
Emoji Component Yes