Indian Script Code for Information Interchange

Indian Script Code for Information Interchange

Indian Script Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Assamese, Bengali, Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. ISCII does not encode the writing systems of India based on Arabic, but its writing system switching codes nonetheless provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic. The Arabic-based writing systems have subsequently been encoded in the PASCII encoding.

The Brahmi-derived writing systems are mostly rather similar in structure, but have different letter shapes, so ISCII encodes letters with the same phonetic value at the same codepoint, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki] . This will be rendered as कि in Devanagari, as ਕਿ in Gurmukhi, and as கி in Tamil. The writing system can be selected in rich text by markup or in plain text by means of the ATR code described below.

One motivation for the use of a single encoding is the idea that it will allow easy transliteration from one writing system to another. However, there are enough incompatibilities that this is not really a practical idea. See [http://acharya.iitm.ac.in/multi_sys/exist_codes.php#Interchange About ISCII] .

ISCII is a fixed-length 8-bit encoding. The lower 128 codepoints are plain ASCII, the upper 128 codepoints are ISCII-specific. In addition to the codepoints representing characters, ISCII makes use of a codepoint with mnemonic ATR that indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or end-of-line. Another set of values select display modes, such as bold and italic. ISCII does not provide a means of indicating the default writing system.

ISCII has not been widely used outside of certain government institutions andhas now been rendered largely obsolete by Unicode. While using a separate block for each Indic writing system, Unicode does, however, largely preserve the ISCII layout within each block.

External links

* [http://varamozhi.sourceforge.net/iscii91.pdf The ISCII 1991 standard (PDF)]
* [http://padma.mozdev.org Padma - Mozilla extension for transforming ISCII to Unicode]
* [http://geocities.com/vnagarjuna/padma.html Padma - Transformer from ISCII to Unicode for Telugu]
* [http://www.phpclasses.org/browse/package/2991.html PHP script for ISCII to and from Unicode]


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • Indian Script Code for Information Interchange — (ISCII) ist die indische nationale Norm für die Kodierung der Zeichen der verschiedenen indischen Schriften, die sämtlich Abkömmlinge der Brahmi Schrift sind. Sie sind prinzipiell sehr ähnlich strukturiert, jedoch sind die Buchstabenformen sehr… …   Deutsch Wikipedia

  • Perso-Arabic Script Code for Information Interchange — (PASCII) is one of the Indian government standards for encoding languages using writing systems based on that of Arabic, in particular Kashmiri, Persian, Sindhi, and Urdu. The ISCII encoding was originally intended to cover both the Brahmi… …   Wikipedia

  • ASCII-Code — American Standard Code for Information Interchange (ASCII, alternativ US ASCII, oft [æski] ausgesprochen) ist eine 7 Bit Zeichenkodierung und bildet die US Variante von ISO 646 sowie die Grundlage für spätere mehrbittige Zeichensätze und… …   Deutsch Wikipedia

  • Ascii-code — American Standard Code for Information Interchange (ASCII, alternativ US ASCII, oft [æski] ausgesprochen) ist eine 7 Bit Zeichenkodierung und bildet die US Variante von ISO 646 sowie die Grundlage für spätere mehrbittige Zeichensätze und… …   Deutsch Wikipedia

  • ISCII — Indian Script Code for Information Interchange (ISCII) ist die indische nationale Norm für die Kodierung der Zeichen der verschiedenen indischen Schriften, die sämtlich Abkömmlinge der Brahmi Schrift sind. Sie sind prinzipiell sehr ähnlich… …   Deutsch Wikipedia

  • Iscii — Indian Script Code for Information Interchange (ISCII) ist die indische nationale Norm für die Kodierung der Zeichen der verschiedenen indischen Schriften, die sämtlich Abkömmlinge der Brahmi Schrift sind. Sie sind prinzipiell sehr ähnlich… …   Deutsch Wikipedia

  • Bengali language — Bangla redirects here. For Bangla speaking people, see Bengali people. Bengali বাংলা Bangla The word Bangla in Bangla Assamese alphabet …   Wikipedia

  • ANSI X3.4-1968 — American Standard Code for Information Interchange (ASCII, alternativ US ASCII, oft [æski] ausgesprochen) ist eine 7 Bit Zeichenkodierung und bildet die US Variante von ISO 646 sowie die Grundlage für spätere mehrbittige Zeichensätze und… …   Deutsch Wikipedia

  • ASCII — American Standard Code for Information Interchange (ASCII, alternativ US ASCII, oft [æski] ausgesprochen) ist eine 7 Bit Zeichenkodierung und bildet die US Variante von ISO 646 sowie die Grundlage für spätere mehrbittige Zeichensätze und… …   Deutsch Wikipedia

  • ASCII-Tabelle — American Standard Code for Information Interchange (ASCII, alternativ US ASCII, oft [æski] ausgesprochen) ist eine 7 Bit Zeichenkodierung und bildet die US Variante von ISO 646 sowie die Grundlage für spätere mehrbittige Zeichensätze und… …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”