U+00B8 "¸" Cedilla Unicode Character

Unicode Version 17.0

¸

U+00B8 "¸" Cedilla is a small, hook shaped diacritical mark placed beneath certain letters in various languages to alter their pronunciation, most commonly seen under the letter "c" to form "ç" in French, Portuguese, Catalan, and Turkish, where it signals a soft "s" sound rather than a hard "k" sound. It originates from the Spanish word for "little z" and historically evolved from a subscript z used in medieval Latin script. In digital text, this character is distinct from the similarly shaped comma or other diacritics and is encoded as a combining mark when paired with a base letter, though as a standalone symbol it can also appear in mathematical or typographic contexts.

General Properties

Code Point U+00B8
Version Added 1.1
Name Cedilla
Unicode 1.0 Name Spacing Cedilla
Block Latin-1 Supplement
General Category Modifier Symbol
Canonical Combining Class Not Reordered
Bidirectional Class Other Neutral
Decomposition Type Compat
Decomposition Mapping "SP" U+0020 Space
"̧" U+0327 Combining Cedilla

Encodings

HTML Decimal Encoding ¸
HTML Hex Encoding ¸
UTF-8 Encoding 0xC2 0xB8
UTF-16 Encoding 0x00B8
UTF-32 Encoding 0x000000B8
C/C++/Java Escape \u00b8

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ambiguous (Alphabetic or Ideographic)
East Asian Width Ambiguous
Case Ignorable Yes
Changes When NFKC Casefolded Yes
NFKC Casefold "SP" U+0020 Space
"̧" U+0327 Combining Cedilla
NFKC Simple Casefold "SP" U+0020 Space
"̧" U+0327 Combining Cedilla
Script Common
Script Extensions Common
Indic Syllabic Category Other
Diacritic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break Other