MIK Code page

MIK Code page

MIK is a Cyrillic code page to be used with MS-DOS. It is based on the character set used in the Bulgarian [http://www.old-computers.com/museum/computer.asp?st=1&c=615 Pravetz 16] IBM PC compatible system.

This is the most widespread DOS/OEM code page used in Bulgaria, rather than CP855, CP866 or CP872.

Almost every DOS program created in Bulgaria, which has Bulgarian strings in it, was using MIK as encoding, and many such programs are still in use.

Code page layout

Only the upper half (128–255) of the table is shown, the lower half (0–127) being plain ASCII.

Notes for implementors of mapping tables to Unicode

Implementors of mapping tables to Unicode should note that the MIK Code page unifies some characters:

Binary character manipulations

The MIK code page maintains in alphabetical order all Cyrillic letters which enables very easy character manipulation in binary form:

10xx xxxx - is a Cyrillic Letter

100x xxxx - is an Upper-case Cyrillic Letter

101x xxxx - is a Lower-case Cyrillic Letter

In such case testing and character manipulating functions as:

IsAlpha(), IsUpper(), IsLower(), ToUpper() and ToLower(),

are bit operations and sorting is by simple comparison of character values.

External links

For more information about the origins of the characters unification have a look at [http://www.unicode.org/Public/MAPPINGS/VENDORS/IBM/readme.txt Unicode Consortium's mappings between IBM's code pages and Unicode]

Similar information about the characters unification is given in Markus Kuhn's [http://www.cl.cam.ac.uk/~mgk25/unicode.html#conv UTF-8 and Unicode FAQ for Unix/Linux]

For more information about the range of 0xE0 to 0xFF please see the [http://www.microsoft.com/globaldev/reference/oem/437.htm Microsoft Code Page 437 reference chart]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • Code page — is another term for character encoding. It consists of a table of values that describes the character set for a particular language. The term code page originated from IBM s EBCDIC based mainframe systems,[1] but many vendors use this term… …   Wikipedia

  • Code page 850 — character set with 9×16 glyphs, as it usually rendered by VGA Code page 850 (also known as CP 850, IBM 00850,[1] OEM 850,[2] MS DOS Latin 1[3]) is a …   Wikipedia

  • Code page 437 — Code page 437, as rendered by the IBM PC using a VGA adapter. IBM PC or MS DOS code page 437, often abbreviated CP437 and also known as DOS US, OEM US or sometimes misleadingly referred to as the OEM font, High ASCII or Extended ASCII,[1][2] is… …   Wikipedia

  • Code page 865 — (also known as CP 865, IBM 00865,[1] OEM 865, MS DOS Nordic[2]) is a code page used under MS DOS to write Nordic languages (except Icelandic, for which code page 861 is used). Code page 865 differs from code page 437 in three points: 0x9B (‹ø›… …   Wikipedia

  • Code page 852 — (also known as CP 852, IBM 00852,[1] OEM 852 (Latin II),[2][3] MS DOS Latin 2[4]) is a code page used under MS DOS to write Central European languages that use Latin script (such as Bosnian, Croatian, Czech, Hungarian …   Wikipedia

  • Code page 857 — (also known as CP 857, IBM 00857,[1] OEM 857,[2] MS DOS Turkish[3]) is a code page used under MS DOS to write Turkish. Code page 857 is based on code page 850, but with many changes. It includes all characters from ISO 8859 9. Code page layout… …   Wikipedia

  • Code page 855 — (also known as CP 855, IBM 00855,[1] OEM 855,[2] MS DOS Cyrillic[3]) is a code page used under MS DOS to write Cyrillic script. This code page is not used much. Code page layout The following table shows code page 855.[2] …   Wikipedia

  • Code page 737 — (also known as CP 737, IBM 00737,[1] OEM 737,[2] MS DOS Greek[3]) is a code page used under MS DOS to write Greek language. It was much more popular than code page 869. Code page layout The following table shows code page 737.[2] …   Wikipedia

  • Code page 869 — (CP 869, IBM 869, OEM 869) is a code page used under MS DOS to write Greek language. It is also called MS DOS Greek 2.[1] It was designed to include all characters from ISO 8859 7. Code page 869 was not as popular as code page 737. Code page… …   Wikipedia

  • Code page 861 — (also known as CP 861, IBM 00861,[1] OEM 861, MS DOS Icelandic[2]) is a code page used under MS DOS to write the Icelandic language (as well as other Nordic languages). Code page layout The following table shows Code page 861. Each character is… …   Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”