Unicode Character "—" U+2014 Em Dash
Unicode Version 15.1
—
Summary
The unicode character "—" at code point U+2014 is Em Dash. It is a character in the General Punctuation block and is part of the Common script. The character is a dash punctuation. The UTF-8 encoding of "—" is 0xE2 0x80 0x94 and the UTF-16 encoding is 0x2014.
General Properties
| Code Point | U+2014 |
| Version Added | 1.1 |
| Name | Em Dash |
| Block | General Punctuation |
| General Category | Dash Punctuation |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Other Neutral |
Encodings
| HTML Entity | — |
| HTML Decimal Encoding | — |
| HTML Hex Encoding | — |
| UTF-8 Encoding | 0xE2 0x80 0x94 |
| UTF-16 Encoding | 0x2014 |
| UTF-32 Encoding | 0x00002014 |
| C/C++/Java Escape | \u2014 |