U+00EF "ï" Latin Small Letter I with Diaeresis Unicode Character

Unicode Version 17.0

ï

U+00EF "ï" Latin Small Letter I with Diaeresis is a precomposed character that represents the letter "i" modified by a diaeresis, a diacritical mark consisting of two dots placed above it. This character is used in several languages, including French and English, where it indicates that the "i" should be pronounced separately from a preceding vowel, as in the French word "naïf" or the English word "naïve," to show that the two vowels are not part of a digraph or diphthong. In languages like Catalan, Dutch, and Afrikaans, it also appears in loanwords or specific lexical contexts to denote a similar syllabic break or a distinct phonetic value.

General Properties

Code Point U+00EF
Version Added 1.1
Name Latin Small Letter I with Diaeresis
Unicode 1.0 Name Latin Small Letter I Diaeresis
Block Latin-1 Supplement
General Category Lowercase Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
Decomposition Type Canonical
Decomposition Mapping "i" U+0069 Latin Small Letter I
"̈" U+0308 Combining Diaeresis

Encodings

HTML Decimal Encoding ï
HTML Hex Encoding ï
UTF-8 Encoding 0xC3 0xAF
UTF-16 Encoding 0x00EF
UTF-32 Encoding 0x000000EF
C/C++/Java Escape \u00ef

Unicode Properties

NFC Quick Check Yes
NFKC Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Alphabetic
Lowercase Yes
Simple Uppercase Code Point "Ï" U+00CF Latin Capital Letter I with Diaeresis
Simple Titlecase Code Point "Ï" U+00CF Latin Capital Letter I with Diaeresis
Uppercase Code Point "Ï" U+00CF Latin Capital Letter I with Diaeresis
Titlecase Code Point "Ï" U+00CF Latin Capital Letter I with Diaeresis
Cased Yes
Changes When Casemapped Yes
Changes When Titlecased Yes
Changes When Uppercased Yes
Script Latin
Script Extensions Latin
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Rotated
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Alphabetic letter
Sentence Break Lower