U+FA42 "既" CJK Compatibility Ideograph-# Unicode Character

Unicode Version 17.0

U+FA42 "既" CJK Compatibility Ideograph-# is a compatibility ideograph used in the CJK (Chinese, Japanese, and Korean) block, specifically encoded to represent a variant or simplified form of the standard CJK unified ideograph U+65E2 "既," which means "already," "since," or "previously." This character was included in Unicode primarily to maintain round-trip compatibility with older East Asian character encoding standards, such as certain Japanese industrial standards (JIS), where it appeared as a distinct glyph variant. Its usage is largely historical or specialized, as it is not typically employed in modern writing systems, but it ensures that legacy text data can be correctly represented without loss of information when converted to Unicode.

General Properties

Code Point U+FA42
Version Added 3.2
Name CJK Compatibility Ideograph-#
Block CJK Compatibility Ideographs
General Category Other Letter
Canonical Combining Class Not Reordered
Bidirectional Class Left To Right
Decomposition Type Canonical
Decomposition Mapping "既" U+65E2 CJK Unified Ideograph-#

Encodings

HTML Decimal Encoding 既
HTML Hex Encoding 既
UTF-8 Encoding 0xEF 0xA9 0x82
UTF-16 Encoding 0xFA42
UTF-32 Encoding 0x0000FA42
C/C++/Java Escape \ufa42

Unicode Properties

Full Composition Exclusion Yes
Numeric Type None
Numeric Value NaN
Line Break Ideographic
East Asian Width Wide
Changes When NFKC Casefolded Yes
NFKC Casefold "既" U+65E2 CJK Unified Ideograph-#
NFKC Simple Casefold "既" U+65E2 CJK Unified Ideograph-#
Script Han
Script Extensions Han
Indic Syllabic Category Other
ID Start Yes
XID Start Yes
ID Continue Yes
XID Continue Yes
Alphabetic Yes
Vertical Orientation Upright
Grapheme Base Yes
Grapheme Cluster Break Other
Word Break Other
Sentence Break OLetter
Ideographic Yes

Unihan Properties

kCompatibilityVariant U+65E2
kIRG_JSource J3-752B
kJIS0213 1,85,11
kRSAdobe_Japan1_6 C+13334+71.5.7
kRSUnicode 71.5
kTotalStrokes 11