Mac OS Roman
Mac OS Roman is a character encoding primarily used by Mac OS to represent text. It encodes 256 characters: the first 128 are identical to ASCII, and the remaining 128 include mathematical symbols, letters with diacritics, and additional punctuation marks. It is suitable for English and several other Western languages. Mac OS Roman is a superset of the original Macintosh character set, which was used in System 1.
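The layout described above can be checked with a short Python sketch. It assumes Python's built-in `mac_roman` codec (not mentioned in this article) as a stand-in for the encoding; the variable names are illustrative only.

```python
# Sketch: Mac OS Roman is a single-byte encoding whose lower half matches ASCII.
# Assumption: Python's built-in "mac_roman" codec implements this encoding.

# Bytes 0x00-0x7F should decode exactly as they do in ASCII.
ascii_bytes = bytes(range(128))
lower_half_matches_ascii = (
    ascii_bytes.decode("mac_roman") == ascii_bytes.decode("ascii")
)

# Bytes 0x80-0xFF carry the extra letters, symbols, and punctuation.
sample = bytes([0x43, 0x61, 0x66, 0x8E])  # "Caf" followed by byte 0x8E (142)
decoded = sample.decode("mac_roman")

# Each of the 256 byte values decodes to its own character.
distinct_characters = len(set(bytes(range(256)).decode("mac_roman")))

print(lower_half_matches_ascii, repr(decoded), distinct_characters)
```

Byte 0x8E (decimal 142) is listed as é in the code page table, so the sample decodes to "Café".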
The Internet Assigned Numbers Authority identifies this encoding using the string "macintosh". The MIME Content-Type for this encoding is therefore "text/plain; charset=macintosh". Mac OS Roman is also referred to as MacRoman or the Apple Standard Roman character set.
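As an illustration of how the "macintosh" label might be used in practice, the sketch below reads the charset parameter from a Content-Type value and decodes a body with it. It assumes Python's standard library, where "macintosh" happens to be accepted as an alias of the `mac_roman` codec; the helper `decode_body` is hypothetical and not part of any library.

```python
import codecs
from email.message import Message

# Assumption: Python registers the IANA name "macintosh" as an alias
# of its "mac_roman" codec, so both lookups resolve to the same codec.
same_codec = (
    codecs.lookup("macintosh").name == codecs.lookup("mac_roman").name
)


def decode_body(content_type: str, body: bytes) -> str:
    """Decode body bytes using the charset named in a Content-Type value.

    Hypothetical helper for illustration; falls back to US-ASCII when
    no charset parameter is present.
    """
    msg = Message()
    msg["Content-Type"] = content_type
    charset = msg.get_content_charset() or "us-ascii"
    return body.decode(charset)


text = decode_body("text/plain; charset=macintosh", b"Caf\x8e")
print(same_codec, repr(text))
```

Other environments may use a different label for the same encoding (for example "MacRoman"), so the alias should be checked rather than assumed.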
With the release of Mac OS X, Mac OS Roman and all other "scripts" (as the Mac OS called them) were replaced by UTF-8 as the standard character encoding for the Macintosh operating system. However, the default character encoding in Java for Mac OS X is still MacRoman (see Mac Dev Center), and the keyboard layout, with its combination of control, option, and dead keys, still maps to the original characters in MacRoman.
Codepage layout
The following table shows how characters are encoded in Mac OS Roman. Each cell gives the character, its Unicode code point (hexadecimal), and its decimal code, in that order. Rows correspond to the high hexadecimal digit of the byte and columns to the low digit.
Mac OS Roman | −0 | −1 | −2 | −3 | −4 | −5 | −6 | −7 | −8 | −9 | −A | −B | −C | −D | −E | −F
0− | NUL 0000 0 | SOH 0001 1 | STX 0002 2 | ETX 0003 3 | EOT 0004 4 | ENQ 0005 5 | ACK 0006 6 | BEL 0007 7 | BS 0008 8 | HT 0009 9 | LF 000A 10 | VT 000B 11 | FF 000C 12 | CR 000D 13 | SO 000E 14 | SI 000F 15
1− | DLE 0010 16 | DC1^⌘ 0011 17 | DC2^⌘ 0012 18 | DC3^⌘ 0013 19 | DC4^⌘ 0014 20 | NAK 0015 21 | SYN 0016 22 | ETB 0017 23 | CAN 0018 24 | EM 0019 25 | SUB 001A 26 | ESC 001B 27 | FS 001C 28 | GS 001D 29 | RS 001E 30 | US 001F 31
2− | SP 0020 32 | ! 0021 33 | " 0022 34 | # 0023 35 | $ 0024 36 | % 0025 37 | & 0026 38 | ' 0027 39 | ( 0028 40 | ) 0029 41 | * 002A 42 | + 002B 43 | , 002C 44 | - 002D 45 | . 002E 46 | / 002F 47
3− | 0 0030 48 | 1 0031 49 | 2 0032 50 | 3 0033 51 | 4 0034 52 | 5 0035 53 | 6 0036 54 | 7 0037 55 | 8 0038 56 | 9 0039 57 | : 003A 58 | ; 003B 59 | < 003C 60 | = 003D 61 | > 003E 62 | ? 003F 63
4− | @ 0040 64 | A 0041 65 | B 0042 66 | C 0043 67 | D 0044 68 | E 0045 69 | F 0046 70 | G 0047 71 | H 0048 72 | I 0049 73 | J 004A 74 | K 004B 75 | L 004C 76 | M 004D 77 | N 004E 78 | O 004F 79
5− | P 0050 80 | Q 0051 81 | R 0052 82 | S 0053 83 | T 0054 84 | U 0055 85 | V 0056 86 | W 0057 87 | X 0058 88 | Y 0059 89 | Z 005A 90 | [ 005B 91 | \ 005C 92 | ] 005D 93 | ^ 005E 94 | _ 005F 95
6− | ` 0060 96 | a 0061 97 | b 0062 98 | c 0063 99 | d 0064 100 | e 0065 101 | f 0066 102 | g 0067 103 | h 0068 104 | i 0069 105 | j 006A 106 | k 006B 107 | l 006C 108 | m 006D 109 | n 006E 110 | o 006F 111
7− | p 0070 112 | q 0071 113 | r 0072 114 | s 0073 115 | t 0074 116 | u 0075 117 | v 0076 118 | w 0077 119 | x 0078 120 | y 0079 121 | z 007A 122 | { 007B 123 | | 007C 124 | } 007D 125 | ~ 007E 126 | DEL 007F 127
8− | Ä 00C4 128 | Å 00C5 129 | Ç 00C7 130 | É 00C9 131 | Ñ 00D1 132 | Ö 00D6 133 | Ü 00DC 134 | á 00E1 135 | à 00E0 136 | â 00E2 137 | ä 00E4 138 | ã 00E3 139 | å 00E5 140 | ç 00E7 141 | é 00E9 142 | è 00E8 143
9− | ê 00EA 144 | ë 00EB 145 | í 00ED 146 | ì 00EC 147 | î 00EE 148 | ï 00EF 149 | ñ 00F1 150 | ó 00F3 151 | ò 00F2 152 | ô 00F4 153 | ö 00F6 154 | õ 00F5 155 | ú 00FA 156 | ù 00F9 157 | û 00FB 158 | ü 00FC 159
A− | † 2020 160 | ° 00B0 161 | ¢ 00A2^* 162 | £ 00A3^* 163 | § 00A7 164 | • 2022 165 | ¶ 00B6 166 | ß 00DF 167 | ® 00AE 168 | © 00A9^* 169 | ™ 2122 170 | ´ 00B4 171 | ¨ 00A8 172 | ≠ 2260 173 | Æ 00C6 174 | Ø 00D8 175
B− | ∞ 221E 176 | ± 00B1^* 177 | ≤ 2264 178 | ≥ 2265 179 | ¥ 00A5 180 | µ 00B5^* 181 | ∂ 2202 182 | ∑ 2211 183 | ∏ 220F 184 | π 03C0 185 | ∫ 222B 186 | ª 00AA 187 | º 00BA 188 | Ω 03A9 189 | æ 00E6 190 | ø 00F8 191
C− | ¿ 00BF 192 | ¡ 00A1 193 | ¬ 00AC 194 | √ 221A 195 | ƒ 0192 196 | ≈ 2248 197 | ∆ 2206 198 | « 00AB 199 | » 00BB 200 | … 2026 201 | NBSP 00A0 202 | À 00C0 203 | Ã 00C3 204 | Õ 00D5 205 | Œ 0152 206 | œ 0153 207
D− | – 2013 208 | — 2014 209 | “ 201C 210 | ” 201D 211 | ‘ 2018 212 | ’ 2019 213 | ÷ 00F7 214 | ◊ 25CA 215 | ÿ 00FF 216 | Ÿ 0178 217 | ⁄ 2044 218 | €^¤ 20AC 219 | ‹ 2039 220 | › 203A 221 | ﬁ FB01 222 | ﬂ FB02 223
E− | ‡ 2021 224 | · 00B7 225 | ‚ 201A 226 | „ 201E 227 | ‰ 2030 228 | Â 00C2 229 | Ê 00CA 230 | Á 00C1 231 | Ë 00CB 232 | È 00C8 233 | Í 00CD 234 | Î 00CE 235 | Ï 00CF 236 | Ì 00CC 237 | Ó 00D3 238 | Ô 00D4 239
F− | Apple logo^a F8FF 240 | Ò 00D2 241 | Ú 00DA 242 | Û 00DB 243 | Ù 00D9 244 | ı 0131 245 | ˆ 02C6 246 | ˜ 02DC 247 | ¯ 00AF 248 | ˘ 02D8 249 | ˙ 02D9 250 | ˚ 02DA 251 | ¸ 00B8 252 | ˝ 02DD 253 | ˛ 02DB 254 | ˇ 02C7 255
- ^⌘ The (usually nonprintable) character 0x11 mapped to the Command key glyph (⌘) in many fonts, particularly those intended for use as system fonts. It is mappable to the Unicode character U+2318 "Place of Interest Sign" (⌘). The codes 0x12, 0x13, and 0x14 were also used for icons to indicate the shift, option, and control keys in menu items.
- ^* The codes 0xA2, 0xA3, 0xA9, 0xB1, and 0xB5 coincidentally have the same character assignment as ISO-8859-1 (and thus Unicode).
- ^¤ Before Mac OS 8.5, the character 0xDB mapped to the currency sign (¤), Unicode character U+00A4.
- ^a The character 0xF0 is a solid Apple logo. The Unicode equivalent U+F8FF is in the Corporate Private Use Area, so it is unlikely to be displayed as the Apple logo on non-Apple platforms.
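The notes above can be sketched in Python as follows. This is a minimal illustration rather than a definitive implementation: it assumes Python's built-in `mac_roman` codec follows the newer mapping (0xDB as the euro sign, 0xF0 as U+F8FF), and the function `decode_mac_roman` with its two flags is hypothetical.

```python
import unicodedata


def decode_mac_roman(data: bytes,
                     pre_mac_os_8_5: bool = False,
                     show_command_glyph: bool = False) -> str:
    """Decode Mac OS Roman bytes with optional handling of the noted cases.

    Hypothetical helper; assumes Python's "mac_roman" codec maps
    0xDB to U+20AC (euro sign) and 0x11 to the control character U+0011.
    """
    text = data.decode("mac_roman")
    if pre_mac_os_8_5:
        # Only byte 0xDB decodes to U+20AC, so this swap restores the
        # currency sign (U+00A4) used before Mac OS 8.5.
        text = text.replace("\u20ac", "\u00a4")
    if show_command_glyph:
        # Show byte 0x11 as the Command key glyph (U+2318).
        text = text.replace("\x11", "\u2318")
    return text


euro = decode_mac_roman(b"\xdb")
currency = decode_mac_roman(b"\xdb", pre_mac_os_8_5=True)
command = decode_mac_roman(b"\x11", show_command_glyph=True)

# Byte 0xF0 lands in the Private Use Area (general category "Co").
apple_logo = decode_mac_roman(b"\xf0")
is_private_use = unicodedata.category(apple_logo) == "Co"

print(euro, currency, command, hex(ord(apple_logo)), is_private_use)
```

Doing the substitutions after decoding keeps the standard codec untouched; a custom codec table would be an alternative if many such variants were needed.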
References
- Apple Computer, Inc. (1993). Inside Macintosh: Text. New York: Addison-Wesley. ISBN 0-201-63298-5.
- Apple Computer, Inc. (1985). Inside Macintosh Volume I. New York: Addison-Wesley. ISBN 0-201-17731-5.
External links
- http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ROMAN.TXT
- http://developer.apple.com/documentation/mac/Text/Text-30.html#HEADING30-9 Description of Mac OS Roman