Unicode subscripts and superscripts
- Unicode subscripts and superscripts
Unicode has subscripted and superscripted versions of a number of characters including a full set of arabic numerals. These characters allow any polynomial equation and some other equations to be represented in plain text without using any form of markup like HTML or TEX. With the exception of using "²" and "³" for squared and cubed units it is considered inadvisable to use these characters when other methods of superscript and subscript are available.
Note as well that most fonts that include these characters use them for mathematical numerator and denominator glyphs, which are smaller than normal characters but are aligned with the cap line and the baseline, respectively. When used with the solidus, these glyphs are useful for making arbitrary diagonal fractions (similar to the ½ glyph), but are not substitutes for proper mathematical sub- or superscripts.
The most common superscript digits were in ISO-8859-1 at positions B9HEX, B2HEX and B3HEX for digits 1, 2, and 3 respectively and have therefore been carried over into those positions in the Latin-1 range of Unicode. The rest were placed in a dedicated section of Unicode at U+2070 to U+209F. The two tables below show the characters in this range. Each superscript or subscript character is preceded by a normal "x" to show the subscripting/superscripting. The table on the left contains the actual Unicode characters (which may not render correctly in your browser); the one on the right contains the equivalents using HTML markup for the subscript/superscript.
Unicode | | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | U+207x | x⁰ | xⁱ | | | x⁴ | x⁵ | x⁶ | x⁷ | x⁸ | x⁹ | x⁺ | x⁻ | x⁼ | x⁽ | x⁾ | xⁿ | U+208x | x₀ | x₁ | x₂ | x₃ | x₄ | x₅ | x₆ | x₇ | x₈ | x₉ | x₊ | x₋ | x₌ | x₍ | x₎ | | U+209x | xₐ | xₑ | xₒ | xₓ | xₔ | | | | | | | | | | | | | HTML | | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | U+207x | x0 | xi | | | x4 | x5 | x6 | x7 | x8 | x9 | x+ | x− | x= | x( | x) | xn | U+208x | x0 | x1 | x2 | x3 | x4 | x5 | x6 | x7 | x8 | x9 | x+ | x− | x= | x( | x) | | U+209x | xa | xe | xo | xx | xə | | | | | | | | | | | | |
References
* [http://www.unicode.org/charts/PDF/U2070.pdf "Superscripts and Subscripts"] (PDF file)
Wikimedia Foundation.
2010.
Look at other dictionaries:
Unicode equivalence — is the specification by the Unicode character encoding standard that some sequences of code points represent essentially the same character. This feature was introduced in the standard to allow compatibility with preexisting standard character… … Wikipedia
Unicode compatibility characters — In discussing Unicode and the UCS, many often refer to compatibility characters. Compatibility characters are graphical characters that are discouraged by the Unicode Consortium. As the [http://www.unicode.org/glossary/#compatibility character… … Wikipedia
Unicode character property — Unicode assigns character properties to each code point.[1] These properties can be used to handle characters (code points) in processes, like in line breaking, script direction right to left or applying controls. Slightly inconsequently, some… … Wikipedia
Unicode Phonetic Symbols — Unicode supports several phonetic alphabets and notations through the existing writing systems and the addition of several phonetic extension blocks. *IPA Extensions (0250–02AF); Spacing Modifier Letters (02B0–02FF); Phonetic Extensions… … Wikipedia
Unicode — For the 1889 Universal Telegraphic Phrase book, see Commercial code (communications). The Unicode official logo since October 2009 … Wikipedia
Unicode font — A Unicode font (also known as UCS font and Unicode typeface) is a computer font that contains a wide range of characters, letters, digits, glyphs, symbols, ideograms, logograms, etc., which are collectively mapped into the standard Universal… … Wikipedia
Unicode symbols — v · Character Types Scripts Unihan ideographs, etc. Phonetic characters Punctuation and separators Diacritics and other marks Symbols Numerals Compatibility characters … Wikipedia
Subscript and superscript — This article is about the terms subscript and superscript as used in typography. SuperScript can also refer to a commercially available Reverse transcriptase. A subscript or superscript is a number, figure, symbol, or indicator that appears… … Wikipedia
C0 and C1 control codes — Most character encodings, in addition to representing printable characters, may also represent additional information about the text, such as the position of a cursor, an instruction to start a new line, or a message that the text has been… … Wikipedia
Phonetic symbols in Unicode — Unicode supports several phonetic scripts and notations through the existing writing systems and the addition of extra blocks with phonetic characters. These phonetic extras are derived of an existing script, usually Latin, Greek or Cyrillic. In… … Wikipedia