U+F9FD "什" CJK Compatibility Ideograph-F9FD Unicode Character

Unicode Version 17.0

U+F9FD "什" CJK Compatibility Ideograph-F9FD is a compatibility ideograph in the CJK Compatibility Ideographs block. It is a duplicate encoding of the unified ideograph U+4EC0 什, which means "file of ten soldiers" and also appears in the Mandarin word 什么 ("what"). It was encoded for round-trip compatibility with the Korean standard KS X 1001 (source reference K0-727A), so that text converted from that character set to Unicode and back keeps its original code points. Because its decomposition is canonical, every Unicode normalization form (NFC, NFD, NFKC, NFKD) replaces it with U+4EC0. Unlike ordinary CJK Unified Ideographs, it is not intended for new text and is kept mainly for interchange with legacy and archival data.

General Properties

Code Point U+F9FD
Version Added 1.1
Name CJK Compatibility Ideograph-F9FD
Block CJK Compatibility Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
Decomposition Type Canonical
Decomposition Mapping "什" U+4EC0 CJK Unified Ideograph-4EC0
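
The general properties above can be checked with Python's standard `unicodedata` module. This is a minimal sketch, not part of the original listing; the helper name `describe` is hypothetical, and the exact values depend on the Unicode database bundled with the Python build in use.

```python
import unicodedata

compat = "\uf9fd"   # CJK COMPATIBILITY IDEOGRAPH-F9FD
unified = "\u4ec0"  # CJK UNIFIED IDEOGRAPH-4EC0

def describe(ch):
    """Collect a few general properties for one character (hypothetical helper)."""
    return {
        "name": unicodedata.name(ch),
        "category": unicodedata.category(ch),           # "Lo" = Other Letter
        "bidi": unicodedata.bidirectional(ch),          # "L" = Left To Right
        "ccc": unicodedata.combining(ch),               # 0 = Not Reordered
        "decomposition": unicodedata.decomposition(ch), # no <tag> prefix means canonical
    }

info = describe(compat)

# A canonical singleton decomposition is applied by all four normalization forms.
normalized = {
    form: unicodedata.normalize(form, compat)
    for form in ("NFC", "NFD", "NFKC", "NFKD")
}

print(info)
print({form: f"U+{ord(text):04X}" for form, text in normalized.items()})
```

Since even NFC maps U+F9FD to U+4EC0, data that must round-trip to the legacy character set should not be normalized before conversion.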

Encodings

HTML Decimal Encoding &#63997;
HTML Hex Encoding &#xF9FD;
UTF-8 Encoding 0xEF 0xA7 0xBD
UTF-16 Encoding 0xF9FD
UTF-32 Encoding 0x0000F9FD
C/C++/Java Escape \uf9fd
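
The encodings listed above can be reproduced with Python's built-in codecs. The sketch below is illustrative only; the big-endian codec names are chosen so that no byte order mark is emitted, and the variable names are assumptions made for this example.

```python
import html

ch = "\uf9fd"  # the same escape syntax as C/C++/Java works in Python

utf8 = ch.encode("utf-8")       # three bytes for a BMP code point above U+07FF
utf16 = ch.encode("utf-16-be")  # one 16-bit code unit, no surrogate pair
utf32 = ch.encode("utf-32-be")  # one 32-bit code unit

# Numeric character references for HTML, in decimal and hexadecimal.
html_dec = f"&#{ord(ch)};"
html_hex = f"&#x{ord(ch):X};"

print(utf8.hex(), utf16.hex(), utf32.hex())
print(html_dec, html_hex)

# Both references decode back to the original character.
decoded = (html.unescape(html_dec), html.unescape(html_hex))
```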

Unicode Properties

Full Composition Exclusion Yes
Numeric Type Numeric
Numeric Value 10
Line Break Ideographic
East Asian Width Wide
Changes When NFKC Casefolded Yes
NFKC Casefold "什" U+4EC0 CJK Unified Ideograph-4EC0
NFKC Simple Casefold "什" U+4EC0 CJK Unified Ideograph-4EC0
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes
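
A few of the properties above are observable from Python's standard library. This is a rough sketch under stated assumptions: Python does not expose NFKC_Casefold directly, so it is approximated here by NFKC normalization followed by `str.casefold()`, which gives the same result for this character but is not the full Unicode algorithm.

```python
import unicodedata

ch = "\uf9fd"

# East Asian Width: "W" means Wide.
width = unicodedata.east_asian_width(ch)

# Alphabetic letters of category Lo report True for isalpha().
is_alpha = ch.isalpha()

# XID_Start characters can begin a Python identifier.
is_identifier_start = ch.isidentifier()

# Approximation of NFKC_Casefold (assumption: adequate for this character only).
folded = unicodedata.normalize("NFKC", ch).casefold()

print(width, is_alpha, is_identifier_start, f"U+{ord(folded):04X}")
```

Python applies NFKC to identifiers while parsing, so an identifier written with U+F9FD names the same variable as one written with U+4EC0.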

Unihan Properties

kCompatibilityVariant U+4EC0
kDefinition file of ten soldiers
kHangul 집:0
kIRG_KSource K0-727A
kKorean CIP
kRSUnicode 9.2
kTotalStrokes 4