CJK Unified Ideographs

CJK Unified Ideographs is a range of Unicode code points assigned to the ideographs (Chinese characters) used in writing Chinese, Japanese, and Korean. Since their introduction in Unicode 1.0, CJK ideographs have been extended across multiple blocks.

Unicode ranges

These ideographic characters appear in the following blocks:

*CJK Unified Ideographs (4E00–9FFF)
*CJK Unified Ideographs Extension A (3400–4DBF)
*CJK Unified Ideographs Extension B (20000–2A6DF)
*Enclosed CJK Letters and Months (3200–32FF)
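
The ranges listed above can be turned into a simple lookup. The following is a minimal sketch, not an official API: the table `CJK_BLOCKS` and the helper `cjk_block_of` are hypothetical names introduced here for illustration, and the table contains only the four blocks named in this list.

```python
# Inclusive code point bounds, copied from the block list above.
# CJK_BLOCKS and cjk_block_of are illustrative names, not part of any library.
CJK_BLOCKS = [
    ("CJK Unified Ideographs", 0x4E00, 0x9FFF),
    ("CJK Unified Ideographs Extension A", 0x3400, 0x4DBF),
    ("CJK Unified Ideographs Extension B", 0x20000, 0x2A6DF),
    ("Enclosed CJK Letters and Months", 0x3200, 0x32FF),
]


def cjk_block_of(char):
    """Return the name of the listed block that contains char, or None."""
    code_point = ord(char)
    for name, first, last in CJK_BLOCKS:
        if first <= code_point <= last:
            return name
    return None


if __name__ == "__main__":
    # U+4E2D, U+3400, U+20000, and a Latin letter outside every listed block.
    for sample in ("\u4E2D", "\u3400", "\U00020000", "A"):
        print(f"U+{ord(sample):04X}: {cjk_block_of(sample)}")
```

A linear scan is enough for a table this small; a longer block list would more likely be sorted and searched with `bisect`.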

Unicode also encodes CJKV radicals, strokes, punctuation, marks, and symbols. Although some of these characters have (decomposable) counterparts in other blocks, their usage can differ:

*Kangxi Radicals (2F00–2FDF)
*CJK Radicals Supplement (2E80–2EFF)
*CJK Symbols and Punctuation (3000–303F)
*CJK Strokes (31C0–31EF)
*Ideographic Description Characters (2FF0–2FFF)

Additional compatibility (discouraged use) characters appear in these blocks:
*CJK Compatibility (3300–33FF)
*CJK Compatibility Ideographs (F900–FAFF)
*CJK Compatibility Ideographs Supplement (2F800–2FA1F)
*CJK Compatibility Forms (FE30–FE4F)

These compatibility characters are included for compatibility with legacy text-handling systems and other legacy character sets. They include forms of characters for vertical text layout and rich-text characters that Unicode recommends handling through other means.
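
One way to see how compatibility characters relate to their ordinary counterparts is Unicode normalization. The sketch below uses Python's standard `unicodedata.normalize` with the NFKC form; choosing NFKC is an assumption made for illustration rather than something the text above prescribes, and the helper name `fold_compatibility` is hypothetical. NFKC is lossy (the vertical-layout distinction is discarded), so it suits searching and comparison better than storage.

```python
import unicodedata

# U+FE35: a vertical-layout parenthesis from the CJK Compatibility Forms block.
VERTICAL_LEFT_PAREN = "\uFE35"
# U+F900: the first code point of the CJK Compatibility Ideographs block.
COMPAT_IDEOGRAPH = "\uF900"


def fold_compatibility(text):
    """Replace compatibility characters with their NFKC equivalents.

    fold_compatibility is an illustrative helper name, not a library function.
    """
    return unicodedata.normalize("NFKC", text)


if __name__ == "__main__":
    for ch in (VERTICAL_LEFT_PAREN, COMPAT_IDEOGRAPH):
        folded = fold_compatibility(ch)
        targets = " ".join(f"U+{ord(c):04X}" for c in folded)
        print(f"U+{ord(ch):04X} -> {targets}")
```

With the Unicode data shipped in current Python releases, U+FE35 folds to the ordinary left parenthesis U+0028, and U+F900 folds to the unified ideograph U+8C48.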

CJK Compatibility Ideographs

Usually, compatibility characters are those that would not have been encoded except for compatibility and round-trip convertibility with other standards. However, the number of CJK ideographs in any non-Unicode standard is too large to fit into Unicode's CJK Compatibility Ideographs blocks. Instead, code points in these blocks are assigned when the affected characters have been approved by the Unicode Consortium but have not yet been assigned code points within the CJK Unified Ideographs blocks.

Version history

Japan

Taiwan

Vietnam

CJK Unified Ideographs Extension B

*Kangxi dictionary
*Hanyu character dictionary
*Ciyuan
*Cihai
*Hanyu word dictionary
*Encyclopedia of China
*Beijing University Founder DTP
*Siku Quanshu
*HKSCS
*JIS X 0213 planes 3 and 4
*
*KPS 9566-97, KPS 10721-2000
*CNS 11643 planes 4-7, 15
*TCVN, ,

CJK Unified Ideographs Extension C

As of January 2007, Extension C was under ballot within the International Organization for Standardization (ISO) and was expected to be included in a version of Unicode after 5.1. The allocation at that time was the code points U+2A6E0 to U+2B77A. The characters are derived from the following sources:

PRC
*Encyclopedia of China
*Beijing University Founder DTP
*Hanyu character dictionary
*Hanyu word dictionary
*Old hanyu word dictionary
*Commercial Press Ideographs
*Xiandaihanyu Cidian
*Cihai
*Kangxi dictionary
*Chinese Academy of Surveying & Mapping
*Modern Chinese Dialect Encyclopedia
*Yinzhou jinwen jicheng yinde (殷周金文集成引得)

Macau

Japan
*Japanese KOKUJI Collection

South Korea
*Korean IRG Hanja Character Set 5th Edition: 2001

North Korea
*KPS 10721:2003

Vietnam
*Từ điển chữ Nôm (喃字典), Nguyễn Quang Hồng, 2006
*Từ điển chữ Nôm Tày, Hoàng Triều Ân, 2003
*Bảng tra chữ Nôm miền Nam, Vũ Văn Kính, 1994

UTC
*ABC Chinese-English Dictionary, John DeFrancis (德范克), et al., eds., 2nd edition (1998), Honolulu: University of Hawaii Press
*The Church of Jesus Christ of Latter-day Saints Hong Kong division
*Mathews' Chinese-English Dictionary, Robert H. Mathews (1975), Cambridge: Harvard University Press
*Guangyun
*Chinese bird system index (中国鸟类系统检索), Zheng Zuoxin (郑作新), et al. (2000), Beijing: 科学出版社 (www.sciencep.com)
*Annotated Shuowen Jiezi, Duan Yucai

CJK Unified Ideographs Extension D

According to the CJK editorial group report [http://www.cse.cuhk.edu.hk/~irg/irg/irg27/IRGN1266%20CJK%20Editorial%20Group%20Report.doc ISO/IEC JTC1/SC2/WG2/IRG N1266], Extension D includes at least characters from the following sources:

Taiwan
*TD-454E
*TC-5036
*TD-624C
*TD-5352
*TC-4139
*TC-4A76
*TD-5C26

Korea
*K5H00535
*K5H00222
*K5H00297
*KP1-73E1
*KP1-712E
*KP1-70BE
*KP1-6752
*KP1-672B
*KP1-6651
*KP1-4B50
*KP1-487E
*KP1-4731

Vietnam
*V04-5073

Unicode
*UTC00103

Others
*CJK Unified Ideographs Extension C Remainder list
*Macao SAR (IRGN1249 with minor adjustment)
*Unicode (IRGN1256 and IRGN1257, 472 characters)
*China (IRGN1264, 57 characters)

See also

*Han Unification
* [http://www.unicode.org/versions/Unicode5.0.0/ch12.pdf Unicode 5.0 Book - Chapter 12: East Asian Scripts]


Wikimedia Foundation. 2010.
