Complex text layout

The Devanagari ddhrya ligature of the JanaSanskritSans font, which the layout engine should invoke to render the sequence of seven Unicode characters द + ् + ध + ् + र + ् + य = द्ध्र्य.
The word العربية (al-arabiyyah, "the Arabic [language]" in Arabic) in stages of rendering. The first line shows the letters unprocessed, as an application without complex script rendering would display them. In the second line the bidirectional display mechanism has come into play, and in the third the glyph shaping mechanism has rendered the letters according to context.

Complex text layout (abbreviated CTL), or complex text rendering, is the typesetting of writing systems (known as complex scripts) that require complex transformations between text input and text display for proper rendering on screen or on the printed page. In other words, for these scripts the way text is stored does not map straightforwardly to the way it is displayed. The term is used in the field of software internationalization.
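
To make the gap between stored and displayed text concrete, the following Python sketch lists the code points stored for the Devanagari conjunct द्ध्र्य. It uses only the standard unicodedata module; the names CLUSTER and describe are chosen here for illustration. It shows the storage side only: combining the seven code points into one ligature remains the job of the layout engine and the font.

```python
import unicodedata

# The conjunct as stored: seven code points in logical (typing) order.
CLUSTER = "\u0926\u094d\u0927\u094d\u0930\u094d\u092f"


def describe(text: str) -> list[tuple[str, str]]:
    """Return (code point, Unicode character name) for each stored character."""
    return [(f"U+{ord(ch):04X}", unicodedata.name(ch)) for ch in text]


if __name__ == "__main__":
    for code_point, name in describe(CLUSTER):
        print(code_point, name)
    print(len(CLUSTER), "stored characters, ideally drawn as one ligature")
```

The letters DA, DHA, RA and YA alternate with the VIRAMA sign, which suppresses the inherent vowel and signals that the consonants belong to one cluster.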

Examples of writing systems requiring CTL are the Arabic alphabet and scripts of the Brahmic family such as Devanagari or the Thai alphabet.

CTL is a generalization of the concept of ligature: for the Latin alphabet, ligatures are usually considered a marginal aesthetic concern, but there is no fundamental difference between the ligatures required for acceptable typesetting of the Arabic script and those needed to typeset a Latin cursive.[1] Conversely, most characters of the Chinese script are compositional and could be considered ligatures, but they are encoded as so many individual characters that typesetting requires an enormous typeface rather than sophisticated layout. An example of a contextual variant that is not considered a ligature is the Greek final sigma ς, the word-final variant of the usual σ shape. Unicode encodes the two variants separately, at U+03C2 and U+03C3 respectively. For collation and comparison purposes, however, software should probably treat the string "δῖος Ἀχιλλεύς." as equivalent to "δῖοσ Ἀχιλλεύσ.", even though Unicode does not direct conforming software to treat ς and σ as canonically or compatibility equivalent.
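
The sigma comparison can be sketched in Python as follows. The helper sigma_fold is a hypothetical name introduced here, and the approach (an explicit replacement, or Python's built-in str.casefold) is one possible way to compare such strings, not a prescribed algorithm. The sketch also checks that compatibility normalization (NFKC) alone does not unify the two letters.

```python
import unicodedata

FINAL_SIGMA = "\u03c2"  # ς, word-final form
SIGMA = "\u03c3"        # σ, usual form


def sigma_fold(text: str) -> str:
    """Replace every final sigma with the usual sigma, for comparison only."""
    return text.replace(FINAL_SIGMA, SIGMA)


a = "δῖος Ἀχιλλεύς."
b = "δῖοσ Ἀχιλλεύσ."

# The raw strings differ, and NFKC normalization does not change that,
# because the two sigmas are not compatibility equivalents.
raw_equal = a == b
nfkc_equal = unicodedata.normalize("NFKC", a) == unicodedata.normalize("NFKC", b)

# After folding the sigma variants, the strings compare equal.
folded_equal = sigma_fold(a) == sigma_fold(b)

# str.casefold() also maps final sigma to sigma (and lowercases the rest).
casefold_equal = a.casefold() == b.casefold()

if __name__ == "__main__":
    print(raw_equal, nfkc_equal, folded_equal, casefold_equal)
```

The folded strings are suitable for comparison or as sort keys only; the original text should be kept for display, since the choice between ς and σ matters to readers.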

The main characteristics of CTL language complexity are:

  • Bidirectional text, in which characters may be written either right to left or left to right.
  • Context-sensitive shaping (ligatures), in which a character may change its shape depending on its position and/or the surrounding characters. For example, a character in the Arabic script can have as many as four different shapes, depending on context.
  • Ordering, in which the displayed order of the characters differs from the logical order. For example, in Devanagari, which is written from left to right, the grapheme for "short i" appears to the left of ("before") the consonant it follows: in कि ki, the ि -i should render on the left, with its bow extending to the right over the k-.
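
The three characteristics above can be observed, in a small way, with Python's standard unicodedata module. The sketch below is illustrative only. The function names are invented here, the Arabic presentation-form characters serve merely as a record of the four positional shapes (a real shaping engine selects glyphs from the font instead), and toy_visual_order handles only a single consonant followed by the short-i sign, not full Devanagari clusters.

```python
import unicodedata


def bidi_classes(text: str) -> list[str]:
    """Bidirectional text: return the Unicode bidi class of each character."""
    return [unicodedata.bidirectional(ch) for ch in text]


def positional_forms(first: int, count: int = 4) -> list[str]:
    """Shaping: names of `count` consecutive presentation-form characters."""
    return [unicodedata.name(chr(cp)) for cp in range(first, first + count)]


# Signs drawn to the left of the consonant they follow in storage.
# Assumption: only DEVANAGARI VOWEL SIGN I is considered in this toy.
PRE_BASE_SIGNS = {"\u093f"}


def toy_visual_order(logical: str) -> str:
    """Ordering: move a pre-base sign in front of the preceding character."""
    out = list(logical)
    for i in range(1, len(out)):
        if out[i] in PRE_BASE_SIGNS:
            out[i - 1], out[i] = out[i], out[i - 1]
    return "".join(out)


if __name__ == "__main__":
    # Latin letter, Arabic letter, Hebrew letter, European digit.
    print(bidi_classes("a\u0628\u05d01"))  # ['L', 'AL', 'R', 'EN']
    # The four positional shapes recorded for the Arabic letter beh.
    for name in positional_forms(0xFE8F):
        print(name)
    # कि is stored as KA + VOWEL SIGN I but drawn with the sign first.
    print([f"U+{ord(c):04X}" for c in toy_visual_order("\u0915\u093f")])
```

A renderer never rewrites the stored text this way; the reordering happens on glyphs during layout, and the logical order stays intact for searching and editing.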

Implementations

Some CTL implementations do not encapsulate information about specific scripts. In these implementations, the script-specific CTL information resides in the font files. Therefore, they are able to render any script.

Other CTL implementations encapsulate information about specific scripts. In these implementations, the script-specific CTL information is provided by the CTL implementation itself. Therefore, they can render only the scripts that have already been implemented.
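
The difference between the two designs can be sketched as a toy model in Python. Every name below (ToyFont, FontDrivenEngine, ScriptAwareEngine, the rule tables and the script tags used as keys) is hypothetical and stands in for far more involved machinery in real layout engines. The sketch shows only where the script knowledge lives.

```python
from dataclasses import dataclass, field


@dataclass
class ToyFont:
    """A font that may carry its own shaping rules, keyed by script tag."""
    name: str
    shaping_rules: dict[str, dict[str, str]] = field(default_factory=dict)


class FontDrivenEngine:
    """Knows no scripts itself; relies entirely on rules stored in the font."""

    def supports(self, script: str, font: ToyFont) -> bool:
        return script in font.shaping_rules


class ScriptAwareEngine:
    """Carries its own script rules; the font only supplies glyphs."""

    # Assumption: an illustrative set of scripts built into the engine.
    BUILT_IN_SCRIPTS = frozenset({"arab", "deva"})

    def supports(self, script: str, font: ToyFont) -> bool:
        return script in self.BUILT_IN_SCRIPTS


if __name__ == "__main__":
    thai_font = ToyFont("ToyThai", shaping_rules={"thai": {}})
    print(FontDrivenEngine().supports("thai", thai_font))   # True
    print(ScriptAwareEngine().supports("thai", thai_font))  # False
```

In this model a new script reaches the font-driven engine by installing a font, whereas the script-aware engine needs a new release of the engine itself.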

Notes

  1. Indeed, historically, the Arabic alphabet is simply a cursive form of the Nabataean alphabet, with context-dependent letter shapes that became mandatory from ca. the 4th century AD.

Wikimedia Foundation. 2010.
