Unicode Character " " U+2001 Em Quad
Unicode Version 15.1
Summary
The unicode character " " at code point U+2001 is Em Quad. It is a character in the General Punctuation block and is part of the Common script. The character is a space separator. The UTF-8 encoding of " " is 0xE2 0x80 0x81 and the UTF-16 encoding is 0x2001.
General Properties
| Code Point | U+2001 |
| Version Added | 1.1 |
| Name | Em Quad |
| Block | General Punctuation |
| General Category | Space Separator |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | White Space |
| Decomposition Type | Canonical |
| Decomposition Mapping | " " U+2003 Em Space |
Encodings
| HTML Decimal Encoding |   |
| HTML Hex Encoding |   |
| UTF-8 Encoding | 0xE2 0x80 0x81 |
| UTF-16 Encoding | 0x2001 |
| UTF-32 Encoding | 0x00002001 |
| C/C++/Java Escape | \u2001 |
Unicode Properties
| Full Composition Exclusion | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Break After |
| Changes When NFKC Casefolded | Yes |
| NFKC Casefold | "SP" U+0020 Space |
| NFKC Simple Casefold | "SP" U+0020 Space |
| Script | Common |
| Script Extensions | Common |
| Indic Syllabic Category | Other |
| White Space | Yes |
| Vertical Orientation | Rotated |
| Grapheme Base | Yes |
| Grapheme Cluster Break | Other |
| Word Break | WSegSpace |
| Sentence Break | Sp |