U+5572 "啲" CJK Unified Ideograph-# Unicode Character

Unicode Version 17.0

U+5572 "啲" CJK Unified Ideograph-# is a Chinese character used primarily in Cantonese, where it is pronounced as "di" or "dit" in Jyutping. It functions as a versatile particle or quantifier, often meaning "a bit," "a few," or serving as an adverbial suffix in colloquial expressions, similar to the Mandarin "的" or "地" but with distinct tonal and contextual usage. The character is composed of the radical "口" (mouth) on the left and "的" on the right, reflecting its phonetic and semantic connections within written Cantonese. Its inclusion in the Unicode standard supports digital communication and text processing for the complex needs of Chinese dialects beyond standard Mandarin.

General Properties

Code Point U+5572
Version Added 1.1
Name CJK Unified Ideograph-#
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 啲
HTML Hex Encoding 啲
UTF-8 Encoding 0xE5 0x95 0xB2
UTF-16 Encoding 0x5572
UTF-32 Encoding 0x00005572
C/C++/Java Escape \u5572

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes

Unihan Properties

kCNS1986 E-6722
kCNS1992 3-6722
kCangjie RHAI
kCantonese di1
kCheungBauer 030/08;RHAI;di1,di4,dit1,ti4
kCheungBauerIndex 360.02 360.03 360.04 360.05
kCowles 4138
kDefinition (Cant.) a few
kHanYu 10642.091
kIICore BHM
kIRGHanyuDaZidian 10642.091
kIRGKangXi 0196.261
kIRG_GSource GH-1224
kIRG_HSource H-9DF8
kIRG_TSource T3-6722
kKangXi 0196.261
kLau 529
kMandarin
kMeyerWempe 3058
kPhonetic 1325*
kPseudoGB1 9213
kRSUnicode 30.8
kTotalStrokes 11
kUnihanCore2020 HM
kXerox 317:260