U+0001 "SOH" START OF HEADING Unicode Character
Unicode Version 17.0
SOH
U+0001 "SOH" START OF HEADING is a control character originally defined in ASCII and later adopted into Unicode, designed to precede and indicate the start of a header block in a data transmission stream. Historically used in telecommunications and early computer protocols, SOH helps structure data by signaling to a receiver that the following bytes contain metadata, such as addressing or routing information, before the main message body begins. Although not visible in typical text output, it remains a fundamental part of legacy serial communication standards and specific modern data formats requiring discrete control sequences.
General Properties
| Code Point | U+0001 |
| Version Added | 1.1 |
| Unicode 1.0 Name | Start of Heading |
| Block | Basic Latin |
| General Category | Control |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Boundary Neutral |
| Alias | SOH (abbreviation) START OF HEADING (control) |
Encodings
| HTML Decimal Encoding |  |
| HTML Hex Encoding |  |
| UTF-8 Encoding | 0x01 |
| UTF-16 Encoding | 0x0001 |
| UTF-32 Encoding | 0x00000001 |
| C/C++/Java Escape | \u0001 |
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Combining Mark |
| Script | Common |
| Script Extensions | Common |
| Indic Syllabic Category | Other |
| Vertical Orientation | Rotated |
| Grapheme Cluster Break | Control |
| Word Break | Other |
| Sentence Break | Other |