- CJK Unified Ideographs
CJK Unified Ideographs is a range of
Unicode code points assigned forideographs used byChinese characters . Since its introduction in Unicode 1.00, the use of CJK ideographs has been extended to multiple blocks.Unicode ranges
These ideographic characters appear in the following blocks:
*CJK Unified Ideographs (4E00–9FFF) ()
*CJK Unified Ideographs Extension A (3400–4DBF) ()
*CJK Unified Ideographs Extension B (20000–2A6DF)
*Enclosed CJK Letters and Months (3200–32FF) ()Unicode includes support of CJKV radicals, strokes, punctuation, marks and symbols. Although some characters have their (decomposable) counterparts in other blocks, the usages can be different:
*
Kangxi Radicals (2F00–2FDF)
*CJK Radicals Supplement (2E80–2EFF)
*CJK Symbols and Punctuation (3000–303F) ()
*CJK Strokes (31C0–31EF)
*Ideographic Description Characters (2FF0–2FFF)Additional compatibility (discouraged use) characters appear in these blocks:
*CJK Compatibility (3300–33FF) ()
*CJK Compatibility Ideographs (F900–FAFF) ()
*CJK Compatibility Ideographs (2F800–2FA1F) ()
*CJK Compatibility Forms (FE30–FE4F) ()These compatibility characters are included for compatibility with legacy text handling system and other legacy character sets. They include forms of characters for vertical text layout and rich text characters that Unicode recommends handling through other means.
CJK Compatibility Ideographs
Usually, compatibility characters are those that would not have been encoded exceptfor compatibility and round-trip convertibility with other standards. However, the amount of CJK ideographs within any non-Unicode standard is too big to fit into Unicode's CJK Compatibility Ideographs blocks. Instead, code points are assigned when the affected characters are approved by Unicode Consortium, but have yet to assign any code points within the CJK Unified Ideographs blocks.
Version history
Japan
Taiwan
Vietnam
CJK Unified Ideographs Extension B
*
Kangxi dictionary
*Hanyu character dictionary
*Ciyuan
*Cihai
*Hanyu word dictionary
*Encyclopedia of China
*Beijing University Founder DTP
*Siku Quanshu
*HKSCS
*JIS X 0213 planes 3 and 4
*
*KPS 9566-97, KPS 10721-2000
*CNS 11643 planes 4-7, 15
*TCVN , ,CJK Unified Ideographs Extension C
Extension C is currently (January 2007) under ballot within the
International Organization for Standardization (ISO) and will be included in some version of Unicode after 5.1. The current allocation is to the code points U+2A6E0 to U+2B77A. The characters are derived from the following:PRC
*Encyclopedia of China
*Beijing University Founder DTP
*Hanyu character dictionary
*Hanyu word dictionary
*Old hanyu word dictionary
*Commercial Press Ideographs
*Xiandaihanyu Cidian
*Cihai
*Kangxi dictionary
*Chinese Academy of Surveying & Mapping
*Modern Chinese Dialect Encyclopedia
*Yanzhou jinwen jicheng yinde (殷周金文集成引得)MacauJapan
*Japanese KOKUJI CollectionSouth Korea
*Korean IRG Hanja Character Set 5th Edition: 2001North Korea
*KPS 10721:2003Vietnam
*Từ điển chữ Nôm (喃字典), Nguyễn Quang Hồng, 2006
*Từ điển chữ Nôm Tày, Hoàng Triều Ân, 2003
*Bảng tra chữ Nôm miền Nam, Vũ Văn Kính, 1994UTC
*ABC Chinese-English Dictionary,John DeFrancis (德范克), et al., eds., 2nd edition. (1998) Honolulu: University of Hawaii Press
*The Church of Jesus Christ of Latter-Day Saints Hong Kong division
*Mathews' Chinese-English Dictionary, Robert H. Mathews (1975) Cambridge; Harvard University Press
*Guangyun
*Chinese bird system index (中国鸟类系统检索), Zheng Zhuoxin (郑作新), et al. (2000), Beijing, 科学出版社 (www.sciencep.com)
*Annotated Shuowen Jiezi,Duan Yucai CJK Unified Ideographs Extension D
According to the CJK editorial group report [http://www.cse.cuhk.edu.hk/~irg/irg/irg27/IRGN1266%20CJK%20Editorial%20Group%20Report.doc ISO/IEC JTC1/SC2/WG2/IRG N1266] , there are at least characters from following:
Taiwan
*TD-454E
*TC-5036
*TD-624C
*TD-5352
*TC-4139
*TC-4A76
*TD-5C26Korea
*K5H00535
*K5H00222
*K5H00297
*KP1-73E1
*KP1-712E
*KP1-70BE
*KP1-6752
*KP1-672B
*KP1-6651
*KP1-4B50
*KP1-487E
*KP1-4731Vietnam
*V04-5073Unicode
*UTC00103Others
*CJK Unified Ideographs Extension C Remainder list
*Macao SAR (IRGN1249 with minor adjustment)
*Unicode (IRGN1256 and IRGN1257, 472 char)
*China (IRGN1264, 57)CJK Compatibility Ideographs
ee also
*
Han Unification
* [http://www.unicode.org/versions/Unicode5.0.0/ch12.pdf Unicode 5.0 Book - Chapter 12: East Asian Scripts]
Wikimedia Foundation. 2010.