Unicode Character "臢" U+81E2 CJK Unified Ideograph-81E2
Unicode Version 15.1
臢
Summary
The Unicode character "臢" at code point U+81E2 is a CJK (Chinese, Japanese, Korean) ideograph meaning "dirty; filthy". It belongs to the CJK Unified Ideographs block and is part of the Han script. Its general category is Other Letter (Lo). The UTF-8 encoding of "臢" is 0xE8 0x87 0xA2, and the UTF-16 encoding is 0x81E2.
General Properties
| Code Point | U+81E2 |
| Version Added | 1.1 |
| Name | CJK Unified Ideograph-81E2 |
| Block | CJK Unified Ideographs |
| General Category | Other Letter |
| Canonical Combining Class | Not Reordered |
| Bidirectional Class | Left To Right |
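
The general properties above can be cross-checked against the Unicode database bundled with a programming language. The following is a minimal sketch using Python's standard `unicodedata` module; the `props` dictionary and its key names are illustrative choices, not part of any standard, and the values returned depend on the Unicode version shipped with the interpreter.

```python
import unicodedata

ch = "\u81e2"  # the character described on this page

# Collect a few core properties; the dictionary keys are illustrative names.
props = {
    "code_point": f"U+{ord(ch):04X}",
    "name": unicodedata.name(ch),
    "category": unicodedata.category(ch),            # "Lo" is Other Letter
    "combining": unicodedata.combining(ch),          # 0 is Not Reordered
    "bidirectional": unicodedata.bidirectional(ch),  # "L" is Left To Right
}

for key, value in props.items():
    print(f"{key}: {value}")
```

Note that `unicodedata` reports abbreviated property values (`Lo`, `0`, `L`) rather than the long names shown in the table.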
Encodings
| HTML Decimal Encoding | &#33250; |
| HTML Hex Encoding | &#x81E2; |
| UTF-8 Encoding | 0xE8 0x87 0xA2 |
| UTF-16 Encoding | 0x81E2 |
| UTF-32 Encoding | 0x000081E2 |
| C/C++/Java Escape | \u81e2 |
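
As a rough illustration of where the byte values in the table come from, the sketch below derives the UTF-8 bytes by hand and compares them with Python's built-in codecs. The helper `utf8_three_byte` is a hypothetical name introduced here for illustration, and it only covers the three-byte range (U+0800 to U+FFFF, excluding surrogates) that U+81E2 falls into.

```python
def utf8_three_byte(cp: int) -> bytes:
    """Encode a code point in U+0800..U+FFFF (excluding surrogates) as UTF-8."""
    if not 0x0800 <= cp <= 0xFFFF or 0xD800 <= cp <= 0xDFFF:
        raise ValueError("code point is outside the three-byte UTF-8 range")
    return bytes([
        0xE0 | (cp >> 12),          # 1110xxxx: top 4 bits
        0x80 | ((cp >> 6) & 0x3F),  # 10xxxxxx: middle 6 bits
        0x80 | (cp & 0x3F),         # 10xxxxxx: low 6 bits
    ])


ch = "\u81e2"
cp = ord(ch)

manual_utf8 = utf8_three_byte(cp)
utf8 = ch.encode("utf-8")
utf16 = ch.encode("utf-16-be")   # big-endian, no byte order mark
utf32 = ch.encode("utf-32-be")
html_decimal = f"&#{cp};"
html_hex = f"&#x{cp:X};"

print(manual_utf8.hex(" ").upper())  # E8 87 A2
print(utf16.hex().upper())           # 81E2
print(utf32.hex().upper())           # 000081E2
print(html_decimal, html_hex)        # &#33250; &#x81E2;
```

The big-endian codecs are used so the output matches the table directly; the plain `"utf-16"` and `"utf-32"` codecs would prepend a byte order mark.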
Unicode Properties
Unihan Properties
| kBigFive | C5D8 |
| kCCCII | 227A57 |
| kCNS1986 | 1-7C44 |
| kCNS1992 | 1-7C44 |
| kCangjie | BHUC |
| kCantonese | zim1 |
| kCihaiT | 1106.201 |
| kDefinition | dirty; filthy |
| kEACC | 227A57 |
| kFourCornerCode | 7428.6 |
| kGB3 | 4607 |
| kHanYu | 32128.040 |
| kHanyuPinyin | 32128.040:zā,zān |
| kIICore | AT |
| kIRGHanyuDaZidian | 32128.040 |
| kIRGKangXi | 0999.071 |
| kIRG_GSource | G3-4E27 |
| kIRG_HSource | HB1-C5D8 |
| kIRG_KSource | K2-5638 |
| kIRG_TSource | T1-7C44 |
| kJapanese | サン |
| kKangXi | 0999.071 |
| kMandarin | zā |
| kMatthews | 6680 |
| kMojiJoho | MJ021286 |
| kMorohashi | 30060 |
| kPhonetic | 28 |
| kRSUnicode | 130.19 |
| kSimplifiedVariant | "臜" U+81DC CJK Unified Ideograph-81DC |
| kSMSZD2003Index | 563.09 |
| kSMSZD2003Readings | za粵zim1 |
| kTotalStrokes | 23 |
| kUnihanCore2020 | HMT |
| kXerox | 301:325 |
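
Several Unihan values pack more than one piece of information into a single string. The sketch below shows one way to split two of them: `kRSUnicode` (radical number, then residual stroke count) and `kHanyuPinyin` (dictionary locations, then readings). The function names `parse_rs_unicode` and `parse_hanyu_pinyin` are hypothetical helpers written for this example, and the parsing assumes the field syntax seen in the table above rather than covering every variation found in the Unihan database.

```python
from typing import Dict, List, Tuple


def parse_rs_unicode(value: str) -> Tuple[int, int]:
    """Split a kRSUnicode value such as '130.19' into (radical, residual strokes)."""
    radical, residual = value.split(".")
    # An apostrophe after the radical number marks a simplified radical form;
    # it is dropped here for simplicity.
    return int(radical.rstrip("'")), int(residual)


def parse_hanyu_pinyin(value: str) -> Dict[str, List[str]]:
    """Map each Hanyu Da Zidian location to its list of pinyin readings."""
    result: Dict[str, List[str]] = {}
    for entry in value.split():          # multiple entries are space-separated
        locations, readings = entry.split(":")
        for location in locations.split(","):
            result[location] = readings.split(",")
    return result


radical, residual = parse_rs_unicode("130.19")
readings = parse_hanyu_pinyin("32128.040:zā,zān")

print(radical, residual)  # 130 19
print(readings)           # {'32128.040': ['zā', 'zān']}
```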