U+2E31 "⸱" Word Separator Middle Dot Unicode Character
Unicode Version 17.0
⸱
U+2E31 "⸱" Word Separator Middle Dot is a punctuation mark designed to clearly indicate a separation between words in certain writing systems, particularly for historical or linguistic texts where spaces might be ambiguous or omitted. Unlike a regular interpunct or period, it is specifically intended to function as a distinct word boundary marker rather than a decimal point or multiplication sign. Its usage helps to delineate lexical units in scripts that lack explicit spacing, such as in ancient inscriptions or transcriptions, ensuring readability without altering the intended phonetic flow.
General Properties
| Code Point | U+2E31 |
| Version Added | 5.2 |
| Name | Word Separator Middle Dot |
| Block | Supplemental Punctuation |
| General Category | Other Punctuation |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Other Neutral |
Encodings
| HTML Decimal Encoding | ⸱ |
| HTML Hex Encoding | ⸱ |
| UTF-8 Encoding | 0xE2 0xB8 0xB1 |
| UTF-16 Encoding | 0x2E31 |
| UTF-32 Encoding | 0x00002E31 |
| C/C++/Java Escape | \u2e31 |
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Break After |
| Script | Common |
| Script Extensions | Avestan Carian Georgian Old Hungarian Kaithi Lydian Samaritan |
| Indic Syllabic Category | Other |
| Pattern Syntax | Yes |
| Vertical Orientation | Rotated |
| Grapheme Base | Yes |
| Grapheme Cluster Break | Other |
| Word Break | Other |
| Sentence Break | Other |