Windows-1251
Windows-1251 (also known as code page CP1251) is a popular 8-bit character encoding designed to cover languages that use the Cyrillic alphabet, such as Russian, Bulgarian and Serbian. It is the most widely used encoding for the Bulgarian, Serbian and Macedonian languages.
In modern applications, Unicode is the preferred character set.
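As a minimal sketch of moving legacy data to Unicode, the Python snippet below decodes Windows-1251 bytes and re-encodes the text as UTF-8. It assumes Python's built-in `cp1251` codec; the function name `cp1251_to_utf8` and the sample bytes are illustrative and not taken from any particular application.

```python
def cp1251_to_utf8(raw: bytes) -> bytes:
    """Decode Windows-1251 bytes and re-encode the text as UTF-8."""
    text = raw.decode("cp1251")   # bytes -> Unicode string
    return text.encode("utf-8")   # Unicode string -> UTF-8 bytes


# The word "Привет" in Windows-1251: one byte per Cyrillic letter.
legacy = bytes([0xCF, 0xF0, 0xE8, 0xE2, 0xE5, 0xF2])
converted = cp1251_to_utf8(legacy)

print(converted.decode("utf-8"))
print(len(legacy), len(converted))  # UTF-8 needs two bytes per Cyrillic letter
```

The single-byte layout is why the Windows-1251 form is half the size of the UTF-8 form for purely Cyrillic text.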
Windows-1251 and KOI8-R (or its Ukrainian variant KOI8-U) are much more commonly used than ISO 8859-5, which never gained wide adoption. In the future, both Windows-1251 and KOI8-R may give way to Unicode.
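The three encodings place the Cyrillic letters at different byte values, so text decoded with the wrong one comes out garbled. The sketch below illustrates this with Python's built-in `cp1251`, `koi8_r` and `iso8859_5` codecs; the sample word and variable names are arbitrary choices for the illustration.

```python
word = "Москва"

# Encode the same word with each of the three 8-bit Cyrillic encodings.
encoded = {name: word.encode(name) for name in ("cp1251", "koi8_r", "iso8859_5")}
for name, data in encoded.items():
    print(f"{name:10}", " ".join(f"{b:02X}" for b in data))

# Reading Windows-1251 bytes as KOI8-R yields Cyrillic letters, but the wrong ones.
garbled = encoded["cp1251"].decode("koi8_r")
print(garbled)

# The damage can be undone as long as every byte survived unchanged:
# re-encode with the wrong codec, then decode with the right one.
repaired = garbled.encode("koi8_r").decode("cp1251")
print(repaired)
```

This round trip works here because KOI8-R assigns a character to every byte value in the upper half; it is not guaranteed for every pair of encodings.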
Codepage layout
The following table shows Windows-1251. Each cell gives the character, its Unicode code point in hexadecimal, and its decimal code.

| Windows-1251 | —0 | —1 | —2 | —3 | —4 | —5 | —6 | —7 | —8 | —9 | —A | —B | —C | —D | —E | —F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0− | NUL 0000 0 | SOH 0001 1 | STX 0002 2 | ETX 0003 3 | EOT 0004 4 | ENQ 0005 5 | ACK 0006 6 | BEL 0007 7 | BS 0008 8 | HT 0009 9 | LF 000A 10 | VT 000B 11 | FF 000C 12 | CR 000D 13 | SO 000E 14 | SI 000F 15 |
| 1− | DLE 0010 16 | DC1 0011 17 | DC2 0012 18 | DC3 0013 19 | DC4 0014 20 | NAK 0015 21 | SYN 0016 22 | ETB 0017 23 | CAN 0018 24 | EM 0019 25 | SUB 001A 26 | ESC 001B 27 | FS 001C 28 | GS 001D 29 | RS 001E 30 | US 001F 31 |
| 2− | SP 0020 32 | ! 0021 33 | " 0022 34 | # 0023 35 | $ 0024 36 | % 0025 37 | & 0026 38 | ' 0027 39 | ( 0028 40 | ) 0029 41 | * 002A 42 | + 002B 43 | , 002C 44 | - 002D 45 | . 002E 46 | / 002F 47 |
| 3− | 0 0030 48 | 1 0031 49 | 2 0032 50 | 3 0033 51 | 4 0034 52 | 5 0035 53 | 6 0036 54 | 7 0037 55 | 8 0038 56 | 9 0039 57 | : 003A 58 | ; 003B 59 | < 003C 60 | = 003D 61 | > 003E 62 | ? 003F 63 |
| 4− | @ 0040 64 | A 0041 65 | B 0042 66 | C 0043 67 | D 0044 68 | E 0045 69 | F 0046 70 | G 0047 71 | H 0048 72 | I 0049 73 | J 004A 74 | K 004B 75 | L 004C 76 | M 004D 77 | N 004E 78 | O 004F 79 |
| 5− | P 0050 80 | Q 0051 81 | R 0052 82 | S 0053 83 | T 0054 84 | U 0055 85 | V 0056 86 | W 0057 87 | X 0058 88 | Y 0059 89 | Z 005A 90 | [ 005B 91 | \ 005C 92 | ] 005D 93 | ^ 005E 94 | _ 005F 95 |
| 6− | ` 0060 96 | a 0061 97 | b 0062 98 | c 0063 99 | d 0064 100 | e 0065 101 | f 0066 102 | g 0067 103 | h 0068 104 | i 0069 105 | j 006A 106 | k 006B 107 | l 006C 108 | m 006D 109 | n 006E 110 | o 006F 111 |
| 7− | p 0070 112 | q 0071 113 | r 0072 114 | s 0073 115 | t 0074 116 | u 0075 117 | v 0076 118 | w 0077 119 | x 0078 120 | y 0079 121 | z 007A 122 | { 007B 123 | \| 007C 124 | } 007D 125 | ~ 007E 126 | DEL 007F 127 |
| 8− | Ђ 0402 128 | Ѓ 0403 129 | ‚ 201A 130 | ѓ 0453 131 | „ 201E 132 | … 2026 133 | † 2020 134 | ‡ 2021 135 | € 20AC 136 | ‰ 2030 137 | Љ 0409 138 | ‹ 2039 139 | Њ 040A 140 | Ќ 040C 141 | Ћ 040B 142 | Џ 040F 143 |
| 9− | ђ 0452 144 | ‘ 2018 145 | ’ 2019 146 | “ 201C 147 | ” 201D 148 | • 2022 149 | – 2013 150 | — 2014 151 | (unused) 152 | ™ 2122 153 | љ 0459 154 | › 203A 155 | њ 045A 156 | ќ 045C 157 | ћ 045B 158 | џ 045F 159 |
| A− | NBSP 00A0 160 | Ў 040E 161 | ў 045E 162 | Ј 0408 163 | ¤ 00A4 164 | Ґ 0490 165 | ¦ 00A6 166 | § 00A7 167 | Ё 0401 168 | © 00A9 169 | Є 0404 170 | « 00AB 171 | ¬ 00AC 172 | SHY 00AD 173 | ® 00AE 174 | Ї 0407 175 |
| B− | ° 00B0 176 | ± 00B1 177 | І 0406 178 | і 0456 179 | ґ 0491 180 | µ 00B5 181 | ¶ 00B6 182 | · 00B7 183 | ё 0451 184 | № 2116 185 | є 0454 186 | » 00BB 187 | ј 0458 188 | Ѕ 0405 189 | ѕ 0455 190 | ї 0457 191 |
| C− | А 0410 192 | Б 0411 193 | В 0412 194 | Г 0413 195 | Д 0414 196 | Е 0415 197 | Ж 0416 198 | З 0417 199 | И 0418 200 | Й 0419 201 | К 041A 202 | Л 041B 203 | М 041C 204 | Н 041D 205 | О 041E 206 | П 041F 207 |
| D− | Р 0420 208 | С 0421 209 | Т 0422 210 | У 0423 211 | Ф 0424 212 | Х 0425 213 | Ц 0426 214 | Ч 0427 215 | Ш 0428 216 | Щ 0429 217 | Ъ 042A 218 | Ы 042B 219 | Ь 042C 220 | Э 042D 221 | Ю 042E 222 | Я 042F 223 |
| E− | а 0430 224 | б 0431 225 | в 0432 226 | г 0433 227 | д 0434 228 | е 0435 229 | ж 0436 230 | з 0437 231 | и 0438 232 | й 0439 233 | к 043A 234 | л 043B 235 | м 043C 236 | н 043D 237 | о 043E 238 | п 043F 239 |
| F− | р 0440 240 | с 0441 241 | т 0442 242 | у 0443 243 | ф 0444 244 | х 0445 245 | ц 0446 246 | ч 0447 247 | ш 0448 248 | щ 0449 249 | ъ 044A 250 | ы 044B 251 | ь 044C 252 | э 044D 253 | ю 044E 254 | я 044F 255 |

In the table above, 20 is the regular SPACE character, A0 is the NO-BREAK SPACE, and AD is SOFT HYPHEN.
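Individual table entries can be spot-checked programmatically. The sketch below assumes Python's built-in `cp1251` codec follows the same mapping; the helper name `describe` is hypothetical. It prints the decimal code, the hexadecimal byte value and the Unicode code point for a few bytes, including the three special characters named above. Python's codec treats byte 152 (hex 98) as having no mapping, which corresponds to the cell without a character in row 9.

```python
def describe(code: int) -> str:
    """Return 'decimal hex Unicode' for one Windows-1251 byte value."""
    try:
        ch = bytes([code]).decode("cp1251")
    except UnicodeDecodeError:
        # Raised for byte values the codec leaves undefined.
        return f"{code:3d} {code:02X} (no mapping)"
    return f"{code:3d} {code:02X} {ord(ch):04X}"


for code in (0x20, 0x98, 0xA0, 0xAD, 0xC0, 0xFF):
    print(describe(code))

# Rows C to F hold the letters А..я in Unicode order, so the byte value
# and the code point differ by the same offset throughout.
letters = bytes(range(0xC0, 0x100)).decode("cp1251")
constant_offset = all(
    ord(ch) - byte == 0x0410 - 0xC0
    for byte, ch in zip(range(0xC0, 0x100), letters)
)
print(constant_offset)
```

The constant offset in rows C to F is what makes conversion of the basic Russian alphabet a simple addition, while the letters in rows 8 to B need a lookup table.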
External links
- Windows 1251 reference chart
- IANA Charset Name Registration
- Unicode mapping table for Windows 1251
- Unicode mappings of Windows 1251 with "best fit"
- Universal Cyrillic decoder, an online program that may help recovering unreadable Cyrillic texts with broken Windows-1251 or other character encodings.
Wikimedia Foundation. 2010.