U+30BD "ソ" Katakana Letter So Unicode Character

Unicode Version 17.0

U+30BD "ソ" Katakana Letter So belongs to the Japanese katakana syllabary and represents the syllable "so". Katakana is used mainly to write loanwords and foreign names, onomatopoeia, and technical or scientific terms. The glyph has two strokes: a short stroke at the upper left and a longer stroke that sweeps from the upper right down to the lower left. It is easily confused with the similarly shaped katakana letter "n" (ン, U+30F3), whose long stroke instead sweeps upward from the lower left. In Unicode, ソ is encoded as a single code point in the Katakana block, so it is represented consistently across platforms and applications that support Japanese text.

General Properties

Code Point U+30BD
Version Added 1.1
Name Katakana Letter So
Block Katakana
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
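
The general properties listed above can be cross-checked with Python's standard unicodedata module. This is a minimal sketch, not an authoritative tool: the dictionary name props is only illustrative, and the Unicode database bundled with a given Python release may be older than 17.0, although these particular values have been stable since the character was added.

```python
import unicodedata

ch = "\u30bd"  # KATAKANA LETTER SO

# Collect the basic properties that unicodedata exposes directly.
props = {
    "code_point": f"U+{ord(ch):04X}",
    "name": unicodedata.name(ch),
    "general_category": unicodedata.category(ch),     # "Lo" = Other Letter
    "combining_class": unicodedata.combining(ch),     # 0 = Not Reordered
    "bidi_class": unicodedata.bidirectional(ch),      # "L" = Left To Right
}

for key, value in props.items():
    print(f"{key}: {value}")
```

The module reports short property aliases ("Lo", "L") and the numeric combining class, rather than the long names shown in the table.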

Encodings

HTML Decimal Encoding &#12477;
HTML Hex Encoding &#x30BD;
UTF-8 Encoding 0xE3 0x82 0xBD
UTF-16 Encoding 0x30BD
UTF-32 Encoding 0x000030BD
C/C++/Java Escape \u30bd
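
As a rough check of the encodings above, the following Python sketch derives each byte sequence from the code point and confirms that both HTML numeric references decode back to the same character. The variable names are illustrative only. The big-endian codecs are used so that no byte order mark is prepended.

```python
import html

ch = "\u30bd"  # the same escape form also works in Python string literals

# Byte-level encodings of the single code point U+30BD.
utf8 = ch.encode("utf-8")        # three bytes: E3 82 BD
utf16 = ch.encode("utf-16-be")   # one 16-bit code unit: 30 BD
utf32 = ch.encode("utf-32-be")   # one 32-bit code unit: 00 00 30 BD

# HTML numeric character references built from the code point.
html_decimal = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8.hex(" ").upper())
print(utf16.hex(" ").upper())
print(utf32.hex(" ").upper())
print(html_decimal, html_hex)

# Both references decode back to the original character.
decoded = (html.unescape(html_decimal), html.unescape(html_hex))
```

Because U+30BD lies in the Basic Multilingual Plane, UTF-16 needs no surrogate pair and the single code unit equals the code point value.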

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Katakana
Script Extensions Katakana
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Katakana
Sentence Break OLetter
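
Several of the properties above can be observed from Python's standard library. This is a sketch under stated assumptions: unicodedata.is_normalized requires Python 3.8 or later, str.isidentifier reflects Python's own identifier rules (which are based on XID_Start and XID_Continue), and segmentation properties such as Line Break, Word Break, and Sentence Break are not exposed by the standard library, so they are not checked here.

```python
import unicodedata

ch = "\u30bd"  # KATAKANA LETTER SO

# East Asian Width: "W" means Wide.
width = unicodedata.east_asian_width(ch)

# The character is already in every normalization form,
# consistent with all four Quick Check values being Yes.
normalized = {
    form: unicodedata.is_normalized(form, ch)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

# Numeric Type is None, so no numeric value is defined;
# the second argument is returned as the default.
numeric = unicodedata.numeric(ch, None)

# Alphabetic, and usable at the start or in the middle of an identifier.
is_alpha = ch.isalpha()
starts_identifier = ch.isidentifier()
continues_identifier = ("x" + ch).isidentifier()

print(width, normalized, numeric)
print(is_alpha, starts_identifier, continues_identifier)
```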