U+2E31 "⸱" Word Separator Middle Dot Unicode Character

Unicode Version 17.0

U+2E31 "⸱" Word Separator Middle Dot is a punctuation mark designed to clearly indicate a separation between words in certain writing systems, particularly for historical or linguistic texts where spaces might be ambiguous or omitted. Unlike a regular interpunct or period, it is specifically intended to function as a distinct word boundary marker rather than a decimal point or multiplication sign. Its usage helps to delineate lexical units in scripts that lack explicit spacing, such as in ancient inscriptions or transcriptions, ensuring readability without altering the intended phonetic flow.

General Properties

Code Point U+2E31
Version Added 5.2
Name Word Separator Middle Dot
Block Supplemental Punctuation
General Category Other Punctuation
Canonical Combining Class Not Reordered
Bidirectional Class Other Neutral

Encodings

HTML Decimal Encoding ⸱
HTML Hex Encoding ⸱
UTF-8 Encoding 0xE2 0xB8 0xB1
UTF-16 Encoding 0x2E31
UTF-32 Encoding 0x00002E31
C/C++/Java Escape \u2e31

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Break After
Script Common
Script Extensions Avestan Carian Georgian Old Hungarian Kaithi Lydian Samaritan
Indic Syllabic Category Other
Pattern Syntax Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break Other