U+5454 "呔" CJK Unified Ideograph-# Unicode Character

Unicode Version 17.0

U+5454 "呔" CJK Unified Ideograph-# is a Chinese character used in East Asian writing systems, primarily in traditional and simplified Chinese. It is a phonetic character often employed to represent a sound, such as an exclamation of surprise or reprimand, similar to "tāi" in Mandarin, and can appear in dialectal or literary contexts rather than in everyday standard vocabulary. Its structure consists of the "mouth" radical (口) on the left, indicating its association with sounds or speech, combined with a phonetic component, and it belongs to the CJK Unified Ideographs block, which encompasses a vast range of Chinese, Japanese, and Korean logographs.

General Properties

Code Point U+5454
Version Added 1.1
Name CJK Unified Ideograph-#
Block CJK Unified Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right

Encodings

HTML Decimal Encoding 呔
HTML Hex Encoding 呔
UTF-8 Encoding 0xE5 0x91 0x94
UTF-16 Encoding 0x5454
UTF-32 Encoding 0x00005454
C/C++/Java Escape \u5454

Unicode Properties

NFC Quick Check Yes
NFD Quick Check Yes
NFKC Quick Check Yes
NFKD Quick Check Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
Unified Ideograph Yes

Unihan Properties

kBigFive CA79
kCCCII 216E79
kCNS1986 2-233B
kCNS1992 2-233B
kCangjie RKI
kCantonese taai1
kCheungBauer 030/04;RKI;taai1
kCheungBauerIndex 349.03 349.04
kDefinition (Cant.) a necktie, a tire
kEACC 216E79
kFourCornerCode 6403.0
kGB0 6330
kGB1 6330
kHanYu 10587.030
kHanyuPinyin 10587.030:dāi,tǎi
kIICore BHM
kIRGHanyuDaZidian 10587.030
kIRGKangXi 0181.191
kIRG_GSource G0-5F3E
kIRG_HSource HB2-CA79
kIRG_TSource T2-233B
kTGH 2013:3701
kKangXi 0181.191
kLau 2986
kMandarin dāi
kPhonetic 1289
kRSUnicode 30.4
kSMSZD2003Index 95.02
kSMSZD2003Readings dāi粵taai1
kTGHZ2013 062.100:dāi
kTotalStrokes 7
kUnihanCore2020 GHMT
kXerox 317:173
kXHC1983 0203.080:dāi 1110.020:tǎi