U+001A "SUB" SUBSTITUTE Unicode Character
Unicode Version 17.0
SUB
U+001A "SUB" SUBSTITUTE is a control character originally defined in the ASCII standard, intended to signal that a specific piece of data should be replaced or ignored, often used in communications and file transfer protocols to mark an erroneous or invalid character. In modern computing, it is most famously associated with its use in the CP/M and early MS-DOS operating systems, where it served as the end-of-file (EOF) marker, causing text editors and command-line tools to stop reading a file upon encountering it. Despite its historical importance, the SUB character is rarely used directly in contemporary text processing, as its role as a file terminator has largely been replaced by more explicit file system metadata.
General Properties
| Code Point | U+001A |
| Version Added | 1.1 |
| Unicode 1.0 Name | Substitute |
| Block | Basic Latin |
| General Category | Control |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Boundary Neutral |
| Alias | SUB (abbreviation) SUBSTITUTE (control) |
Encodings
| HTML Decimal Encoding |  |
| HTML Hex Encoding |  |
| UTF-8 Encoding | 0x1A |
| UTF-16 Encoding | 0x001A |
| UTF-32 Encoding | 0x0000001A |
| C/C++/Java Escape | \u001a |
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Combining Mark |
| Script | Common |
| Script Extensions | Common |
| Indic Syllabic Category | Other |
| Vertical Orientation | Rotated |
| Grapheme Cluster Break | Control |
| Word Break | Other |
| Sentence Break | Other |