U+001E "RS" INFORMATION SEPARATOR TWO Unicode Character
Unicode Version 17.0
RS
U+001E "RS" INFORMATION SEPARATOR TWO, commonly called the record separator, is a control character in the C0 control code set inherited from ASCII. It marks the boundary between records within a hierarchy of information separators, sitting one level below the group separator (U+001D) and one level above the unit separator (U+001F). It originated in data transmission and early computer systems, where a receiving system used it to split a data stream into records, such as database entries or sequential blocks of text. Unicode retains it for compatibility with systems that still rely on these control codes to structure data streams.
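The record-boundary role described above can be sketched in a few lines of Python. This is a minimal illustration, not a standard format: the function names `encode_records` and `decode_records` are hypothetical, and the use of U+001F (unit separator) to delimit fields inside each record is an assumption made for the example.

```python
RS = "\x1e"  # U+001E, record separator
US = "\x1f"  # U+001F, unit separator (assumed here as the field delimiter)


def encode_records(records):
    """Join fields with US and records with RS into one string."""
    return RS.join(US.join(fields) for fields in records)


def decode_records(data):
    """Split a string on RS into records, then each record on US into fields."""
    if not data:
        return []
    return [record.split(US) for record in data.split(RS)]


rows = [["alice", "30"], ["bob", "25"]]
payload = encode_records(rows)
restored = decode_records(payload)
```

Because the separators are control characters that rarely occur in ordinary text, field values can contain commas, tabs, or newlines without escaping. The sketch does assume that no field itself contains U+001E or U+001F.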
General Properties
| Code Point | U+001E |
| Version Added | 1.1 |
| Unicode 1.0 Name | Information Separator Two |
| Block | Basic Latin |
| General Category | Control |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Paragraph Separator |
| Alias | INFORMATION SEPARATOR TWO (control); RECORD SEPARATOR (control); RS (abbreviation) |
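Several of the general properties listed above can be read back from Python's standard `unicodedata` module. The following is a small sketch; the dictionary keys are illustrative labels chosen for this example, and the values depend on the Unicode version bundled with the interpreter.

```python
import unicodedata

RS = "\u001e"

properties = {
    "code_point": f"U+{ord(RS):04X}",
    # Two-letter abbreviation of the general category ("Cc" means Control).
    "general_category": unicodedata.category(RS),
    # Canonical combining class; 0 corresponds to "Not Reordered".
    "combining_class": unicodedata.combining(RS),
    # Bidirectional class abbreviation ("B" means Paragraph Separator).
    "bidi_class": unicodedata.bidirectional(RS),
    # Control characters have no formal character name, so the default is returned.
    "name": unicodedata.name(RS, None),
}
```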
Encodings
| HTML Decimal Encoding | &#30; |
| HTML Hex Encoding | &#x1E; |
| UTF-8 Encoding | 0x1E |
| UTF-16 Encoding | 0x001E |
| UTF-32 Encoding | 0x0000001E |
| C/C++/Java Escape | \u001e |
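The encoded forms in this table can be reproduced with Python's built-in codecs. The sketch below uses the big-endian codec variants so that no byte order mark is prepended; the variable names are illustrative only.

```python
RS = "\u001e"

# Hex dump of the encoded bytes for each encoding form.
encodings = {
    "utf-8": RS.encode("utf-8").hex(),
    "utf-16-be": RS.encode("utf-16-be").hex(),
    "utf-32-be": RS.encode("utf-32-be").hex(),
}

# HTML numeric character references built from the code point value.
html_decimal = f"&#{ord(RS)};"
html_hex = f"&#x{ord(RS):X};"
```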
Unicode Properties
| NFC Quick Check | Yes |
| NFD Quick Check | Yes |
| NFKC Quick Check | Yes |
| NFKD Quick Check | Yes |
| Numeric Type | None |
| Numeric Value | NaN |
| Line Break | Combining Mark |
| Script | Common |
| Script Extensions | Common |
| Indic Syllabic Category | Other |
| Vertical Orientation | Rotated |
| Grapheme Cluster Break | Control |
| Word Break | Other |
| Sentence Break | Other |
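The normalization quick-check and numeric entries above can be spot-checked with `unicodedata`. This is only a sketch: a quick-check value of Yes implies that normalizing the character leaves it unchanged, which is what the code tests, and the variable names are illustrative.

```python
import unicodedata

RS = "\u001e"

# For each normalization form, check that normalizing RS returns it unchanged.
unchanged = {
    form: unicodedata.normalize(form, RS) == RS
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# RS has no numeric value, so the supplied default (None) is returned.
numeric_value = unicodedata.numeric(RS, None)
```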