U+2001 " " Em Quad Unicode Character

Unicode Version 17.0

U+2001 " " Em Quad is a typographic whitespace character specifically designed to be exactly one em in width, which is a unit of measurement equal to the current font's point size, typically the width of a capital letter M. Unlike smaller spaces such as the en quad, the em quad provides a substantial amount of blank horizontal space and is historically used in typesetting to create large indents or to fill lines in justified text, though in modern digital typography it is often replaced by other spacing methods.

General Properties

Code Point U+2001
Version Added 1.1
Name Em Quad
Block General Punctuation
General Category Space Separator
Canonical Combining Class Not Reordered
Bidirectional Class White Space
Decomposition Type Canonical
Decomposition Mapping " " U+2003 Em Space

Encodings

HTML Decimal Encoding  
HTML Hex Encoding  
UTF-8 Encoding 0xE2 0x80 0x81
UTF-16 Encoding 0x2001
UTF-32 Encoding 0x00002001
C/C++/Java Escape \u2001

Unicode Properties

Full Composition Exclusion Yes
Numeric Type None
Numeric Value NaN
Line Break Break After
Changes When NFKC Casefolded Yes
NFKC Casefold "SP" U+0020 Space
NFKC Simple Casefold "SP" U+0020 Space
Script Common
Script Extensions Common
Indic Syllabic Category Other
White Space Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break WSegSpace
Sentence Break Sp