U+0003 "ETX" END OF TEXT Unicode Character

Unicode Version 17.0

ETX

U+0003 "ETX" END OF TEXT is a control character originally defined for telegraph and early computer communication protocols to signal the conclusion of a text or data stream, distinguishing it from other control codes like STX (Start of Text). Within the Unicode standard, it occupies the C0 control code range and is primarily used in low level serial protocols, terminal emulation, and telecommunications to denote where transmitted information ends, allowing receiving systems to properly parse or terminate processing. In modern computing, it maintains historical significance and can appear in raw data streams, but it is not typically rendered as a visible glyph in text processing.

General Properties

Code Point U+0003
Version Added 1.1
Unicode 1.0 Name End of Text
Block Basic Latin
General Category Control
Canonical Combining Class Not Reordered
Bidirectional Class Boundary Neutral
Alias END OF TEXT (control)
ETX (abbreviation)

Encodings

HTML Decimal Encoding 
HTML Hex Encoding 
UTF-8 Encoding 0x03
UTF-16 Encoding 0x0003
UTF-32 Encoding 0x00000003
C/C++/Java Escape \u0003

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Combining Mark
Script Common
Script Extensions Common
Indic Syllabic Category Other
Vertical Orientation Rotated
Grapheme Cluster Break Control
Word Break Other
Sentence Break Other