Arabic Unicode

Arabic Unicode

As of Unicode 5.0, the following ranges encode Arabic characters:
* (0600–06FF)
* (0750–077F)
* (FB50–FDFF)
* (FE70–FEFF)

The basic Arabic range encodes the standard letters and diacritics, but does not encode contextual forms (U+0621–U+0652 being directly based on ISO 8859-6); and also includes the most common diacritics and Arabic-Indic digits. The Arabic Supplement range encodes letter variants mostly used for writing African (non-Arabic) languages. The Arabic Presentation Forms-A range encodes contextual forms and ligatures of letter variants needed for Persian, Urdu, Sindhi and Central Asian languages. The Arabic Presentation Forms-B range encodes spacing forms of Arabic diacritics, and more contextual letter forms.

Punctuation and ornaments

*U+060C script|Arab|،: "ARABIC COMMA"
*U+060D script|Arab|؍: "ARABIC DATE SEPARATOR"
*U+060E script|Arab|؎: "ARABIC POETIC VERSE BEGIN"
*U+060F script|Arab|؏: "ARABIC SIGN MISRA"
*U+066D script|Arab|٭: "ARABIC FIVE POINTED STAR"
*U+06DD script|Arab|۝: "ARABIC END OF AYAH"
*U+06DE script|Arab|۞: "ARABIC START OF RUB EL HIZB"
*U+06E9 script|Arab|۩: "ARABIC ARABIC PLACE OF SAJDAH"
*U+FD3E script|Arab|﴾: "ARABIC ORNATE LEFT PARENTHESIS"
*U+FD3F script|Arab|﴿: "ARABIC ORNATE RIGHT PARENTHESIS"

Word ligatures

Arabic Presentation Forms-A has a few characters defined as "word ligatures" for terms frequently used in formulaic expressions in Arabic.

*U+FDF0 script|Arab|ﷰ: "SALLA USED AS KORANIC STOP SIGN ISOLATED FORM" _ar. صلے
*U+FDF1 script|Arab|ﷱ: "QALA USED AS KORANIC STOP SIGN ISOLATED FORM" _ar. قلے
*U+FDF2 script|Arab|ﷲ: "ALLAH ISOLATED FORM" -- _ar. الله.
*U+FDF3 script|Arab|ﷳ: "AKBAR ISOLATED FORM" _ar. اكبر
*U+FDF4 script|Arab|ﷴ: "MOHAMMED ISOLATED FORM" _ar. محمد
*U+FDF5 script|Arab|ﷵ: "SALAM ISOLATED FORM" _ar. صلعم "peace be upon him"
*U+FDF6 script|Arab|ﷶ: "RASOUL ISOLATED FORM" _ar. رسول
*U+FDF7 script|Arab|ﷷ: "ALAYHE" _ar. عليه
*U+FDF8 script|Arab|ﷸ: "WASALLAM" _ar. وسلم
*U+FDF9 script|Arab|ﷹ: "SALLA ISLOATED FORM"
*U+FDFA script|Arab|ﷺ: "SALLALLAHOU ALAYHA WASALLAM" _ar. صلى الله عليه وسلم "peace be upon him"
*U+FDFB script|Arab|ﷻ: "JALLAJALALOUHOU" _ar. جل جلاله
*U+FDFC script|Arab|﷼: the Rial currency sign _ar. ريال
*U+FDFD script|Arab|﷽: the Basmala

Character charts

ee also

*Latin Unicode
*Mapping of Unicode characters


Wikimedia Foundation. 2010.

Игры ⚽ Нужен реферат?

Look at other dictionaries:

  • Arabic alphabet — Infobox Writing system name=Arabic abjad type=Abjad languages= Arabic, Persian, Kurdish, Baloch, Urdu, Pashto, Sindhi, Malay (limited usage) and others. time=400 CE to the present fam1=Proto Canaanite fam2=Phoenician fam3=Aramaic fam4=Nabataean… …   Wikipedia

  • Unicode character property — Unicode assigns character properties to each code point.[1] These properties can be used to handle characters (code points) in processes, like in line breaking, script direction right to left or applying controls. Slightly inconsequently, some… …   Wikipedia

  • Arabic grammar — Arabic is a Semitic language. See Arabic language for more information on the language in general. This article describes the grammar of Classical Arabic and Modern Standard Arabic. History The identity of the oldest Arabic grammarian is disputed …   Wikipedia

  • Arabic language — Arabic redirects here. For other uses, see Arabic (disambiguation). For the literary standard, see Modern Standard Arabic. For vernaculars, see varieties of Arabic. For others, see Arabic languages. Arabic العربية/عربي/عربى al ʿarabiyyah/ʿarabī …   Wikipedia

  • Unicode equivalence — is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting standard character… …   Wikipedia

  • Unicode subscripts and superscripts — Unicode has subscripted and superscripted versions of a number of characters including a full set of arabic numerals. These characters allow any polynomial equation and some other equations to be represented in plain text without using any form… …   Wikipedia

  • Unicode — For the 1889 Universal Telegraphic Phrase book, see Commercial code (communications). The Unicode official logo since October 2009 …   Wikipedia

  • Unicode numerals — Numerals (often called numbers in Unicode) are characters that denote a number. The same Arabic Indic numerals are used widely in various writing systems throughout the world and all share the same semantics for denoting numbers, However, the… …   Wikipedia

  • Unicode-Block Arabische Präsentationsformen-B — Der Unicode Block Arabic Presentation Forms B (Arabische Präsentationsformen B) (FE70–FEFF) enthält alle Buchstaben und Vokalzeichen des arabischen Alphabets in isolierter, initialer, medialer und finaler Form. Die Verwendung dieser Zeichen wird… …   Deutsch Wikipedia

  • Unicode-Block Arabisch — Der Unicode Block Arabic (Arabisch) (0600–06FF) enthält die Standardbuchstaben und diakritika sowie weitere Diakritika und Arabisch Indische Ziffern (en). Weitere drei Arabisch Blöcke sind Arabisch, Ergänzung sowie Arabische Präsentationsformen A …   Deutsch Wikipedia

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”