U+2E31 "⸱" Word Separator Middle Dot Unicode Character

Unicode Version 17.0

⸱

U+2E31 "⸱" Word Separator Middle Dot is a punctuation mark designed to clearly indicate a separation between words in certain writing systems, particularly for historical or linguistic texts where spaces might be ambiguous or omitted. Unlike a regular interpunct or period, it is specifically intended to function as a distinct word boundary marker rather than a decimal point or multiplication sign. Its usage helps to delineate lexical units in scripts that lack explicit spacing, such as in ancient inscriptions or transcriptions, ensuring readability without altering the intended phonetic flow.

General Properties

Code Point	U+2E31
Version Added	5.2
Name	Word Separator Middle Dot
Block	Supplemental Punctuation
General Category	Other Punctuation
Canonical Combining Class	Not Reordered
Bidirectional Class	Other Neutral

Encodings

HTML Decimal Encoding	⸱
HTML Hex Encoding	⸱
UTF-8 Encoding	0xE2 0xB8 0xB1
UTF-16 Encoding	0x2E31
UTF-32 Encoding	0x00002E31
C/C++/Java Escape	\u2e31

Unicode Properties

NFC Quick Check	Yes
NFD Quick Check	Yes
NFKC Quick Check	Yes
NFKD Quick Check	Yes
Numeric Type	None
Numeric Value	NaN
Line Break	Break After
Script	Common
Script Extensions	Avestan Carian Georgian Old Hungarian Kaithi Lydian Samaritan
Indic Syllabic Category	Other
Pattern Syntax	Yes
Vertical Orientation	Rotated
Grapheme Base	Yes
Grapheme Cluster Break	Other
Word Break	Other
Sentence Break	Other