KOI8-U is an 8-bit character encoding, designed to cover Ukrainian, which uses the Cyrillic alphabet. It is based on KOI8-R, which covers Russian and Bulgarian, but replaces eight graphic characters with four Ukrainian letters Ґ, Є, І, and Ї in both upper case and lower case.
In Microsoft Windows, KOI8-U is assigned the code page number 21866. In IBM, KOI8-U is assigned code page 1168.
KOI8 remains much more commonly used than ISO 8859-5, which never gained wide adoption. Another common Cyrillic character encoding is Windows-1251. Both may eventually give way to Unicode.
In Russian, KOI8 stands for Kod Obmena Informatsiey, 8 bit (Код Обмена Информацией, 8 бит), which means "Code for Information Exchange, 8 bit".
The KOI8 character sets have the property that the Russian Cyrillic letters are in pseudo-Roman order rather than the natural Cyrillic alphabetical order as in ISO 8859-5. Although this may seem unnatural, it has the useful property that if the 8th bit is stripped, the text can still be read (or at least deciphered) in case-reversed transliteration on an ordinary ASCII terminal. For instance, "Русский Текст" in KOI8-U becomes rUSSKIJ tEKST ("Russian Text") if the 8th bit is stripped.
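The bit-stripping property described above can be checked with a short sketch. It assumes Python's built-in koi8_u codec; the helper name strip_high_bit is hypothetical and chosen only for this illustration.

```python
def strip_high_bit(text: str, encoding: str = "koi8_u") -> str:
    """Encode text, clear the 8th (high) bit of every byte, read it as ASCII."""
    raw = text.encode(encoding)
    # Masking with 0x7F maps each byte 0x80-0xFF onto 0x00-0x7F.
    stripped = bytes(b & 0x7F for b in raw)
    return stripped.decode("ascii")


print(strip_high_bit("Русский Текст"))  # rUSSKIJ tEKST
```

The case comes out reversed because the lowercase Cyrillic letters sit in rows C and D, which land on the uppercase Latin rows 4 and 5 once the high bit is cleared, while the uppercase Cyrillic rows E and F land on the lowercase Latin rows 6 and 7.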
Codepage layout
KOI8-U
Each cell gives the character, its Unicode code point (hexadecimal), and the decimal byte value.
   | —0 | —1 | —2 | —3 | —4 | —5 | —6 | —7 | —8 | —9 | —A | —B | —C | —D | —E | —F
0− | (control characters)
1− | (control characters)
2− | SP 0020 32 | ! 0021 33 | " 0022 34 | # 0023 35 | $ 0024 36 | % 0025 37 | & 0026 38 | ' 0027 39 | ( 0028 40 | ) 0029 41 | * 002A 42 | + 002B 43 | , 002C 44 | - 002D 45 | . 002E 46 | / 002F 47
3− | 0 0030 48 | 1 0031 49 | 2 0032 50 | 3 0033 51 | 4 0034 52 | 5 0035 53 | 6 0036 54 | 7 0037 55 | 8 0038 56 | 9 0039 57 | : 003A 58 | ; 003B 59 | < 003C 60 | = 003D 61 | > 003E 62 | ? 003F 63
4− | @ 0040 64 | A 0041 65 | B 0042 66 | C 0043 67 | D 0044 68 | E 0045 69 | F 0046 70 | G 0047 71 | H 0048 72 | I 0049 73 | J 004A 74 | K 004B 75 | L 004C 76 | M 004D 77 | N 004E 78 | O 004F 79
5− | P 0050 80 | Q 0051 81 | R 0052 82 | S 0053 83 | T 0054 84 | U 0055 85 | V 0056 86 | W 0057 87 | X 0058 88 | Y 0059 89 | Z 005A 90 | [ 005B 91 | \ 005C 92 | ] 005D 93 | ^ 005E 94 | _ 005F 95
6− | ` 0060 96 | a 0061 97 | b 0062 98 | c 0063 99 | d 0064 100 | e 0065 101 | f 0066 102 | g 0067 103 | h 0068 104 | i 0069 105 | j 006A 106 | k 006B 107 | l 006C 108 | m 006D 109 | n 006E 110 | o 006F 111
7− | p 0070 112 | q 0071 113 | r 0072 114 | s 0073 115 | t 0074 116 | u 0075 117 | v 0076 118 | w 0077 119 | x 0078 120 | y 0079 121 | z 007A 122 | { 007B 123 | | 007C 124 | } 007D 125 | ~ 007E 126 |
8− | ─ 2500 128 | │ 2502 129 | ┌ 250C 130 | ┐ 2510 131 | └ 2514 132 | ┘ 2518 133 | ├ 251C 134 | ┤ 2524 135 | ┬ 252C 136 | ┴ 2534 137 | ┼ 253C 138 | ▀ 2580 139 | ▄ 2584 140 | █ 2588 141 | ▌ 258C 142 | ▐ 2590 143
9− | ░ 2591 144 | ▒ 2592 145 | ▓ 2593 146 | ⌠ 2320 147 | ■ 25A0 148 | ∙ 2219 149 | √ 221A 150 | ≈ 2248 151 | ≤ 2264 152 | ≥ 2265 153 | NBSP 00A0 154 | ⌡ 2321 155 | ° 00B0 156 | ² 00B2 157 | · 00B7 158 | ÷ 00F7 159
A− | ═ 2550 160 | ║ 2551 161 | ╒ 2552 162 | ё 0451 163 | є 0454 164 | ╔ 2554 165 | і 0456 166 | ї 0457 167 | ╗ 2557 168 | ╘ 2558 169 | ╙ 2559 170 | ╚ 255A 171 | ╛ 255B 172 | ґ 0491 173 | ╝ 255D 174 | ╞ 255E 175
B− | ╟ 255F 176 | ╠ 2560 177 | ╡ 2561 178 | Ё 0401 179 | Є 0404 180 | ╣ 2563 181 | І 0406 182 | Ї 0407 183 | ╦ 2566 184 | ╧ 2567 185 | ╨ 2568 186 | ╩ 2569 187 | ╪ 256A 188 | Ґ 0490 189 | ╬ 256C 190 | © 00A9 191
C− | ю 044E 192 | а 0430 193 | б 0431 194 | ц 0446 195 | д 0434 196 | е 0435 197 | ф 0444 198 | г 0433 199 | х 0445 200 | и 0438 201 | й 0439 202 | к 043A 203 | л 043B 204 | м 043C 205 | н 043D 206 | о 043E 207
D− | п 043F 208 | я 044F 209 | р 0440 210 | с 0441 211 | т 0442 212 | у 0443 213 | ж 0436 214 | в 0432 215 | ь 044C 216 | ы 044B 217 | з 0437 218 | ш 0448 219 | э 044D 220 | щ 0449 221 | ч 0447 222 | ъ 044A 223
E− | Ю 042E 224 | А 0410 225 | Б 0411 226 | Ц 0426 227 | Д 0414 228 | Е 0415 229 | Ф 0424 230 | Г 0413 231 | Х 0425 232 | И 0418 233 | Й 0419 234 | К 041A 235 | Л 041B 236 | М 041C 237 | Н 041D 238 | О 041E 239
F− | П 041F 240 | Я 042F 241 | Р 0420 242 | С 0421 243 | Т 0422 244 | У 0423 245 | Ж 0416 246 | В 0412 247 | Ь 042C 248 | Ы 042B 249 | З 0417 250 | Ш 0428 251 | Э 042D 252 | Щ 0429 253 | Ч 0427 254 | Ъ 042A 255
In the table above, 20 is the regular SPACE character, and 9A is the NO-BREAK SPACE.
KOI8-U differs from KOI8-R at positions 0xA4, 0xA6, 0xA7, and 0xAD (lower case) and 0xB4, 0xB6, 0xB7, and 0xBD (upper case), which hold the Ukrainian letters that do not exist in Russian.
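As a rough check of the positions listed above, the following sketch compares how the two encodings decode each byte in the upper half. It relies on Python's bundled koi8_r and koi8_u codecs, whose tables are assumed here to follow the mappings described in this article; the function name differing_positions is hypothetical.

```python
def differing_positions(a: str = "koi8_r", b: str = "koi8_u") -> list[int]:
    """Return the byte values 0x80-0xFF that decode differently in a and b."""
    diffs = []
    for byte in range(0x80, 0x100):
        raw = bytes([byte])
        if raw.decode(a) != raw.decode(b):
            diffs.append(byte)
    return diffs


for pos in differing_positions():
    raw = bytes([pos])
    # Show the KOI8-R graphic character next to the KOI8-U letter.
    print(f"0x{pos:02X}: KOI8-R {raw.decode('koi8_r')}  KOI8-U {raw.decode('koi8_u')}")
```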
Although RFC 2319 says that character 95 should be U+2219 (∙), it may also be U+2022 (•) to match the bullet character in Windows-1251.
Some references have a typo and incorrectly state that character B4 is U+0403, rather than the correct U+0404. This typo is present in Appendix A of RFC 2319 (but the table in the main text of the RFC gives the correct mapping).
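One way to see which mapping a given implementation uses for the two positions discussed above is to decode them and print the resulting code points. This is only a sketch against Python's built-in koi8_u codec; other libraries may choose differently, and the helper name describe is hypothetical.

```python
import unicodedata


def describe(byte: int, encoding: str = "koi8_u") -> str:
    """Decode a single byte and report its Unicode code point and name."""
    ch = bytes([byte]).decode(encoding)
    return f"0x{byte:02X} -> U+{ord(ch):04X} {unicodedata.name(ch)}"


print(describe(0x95))  # either U+2219 or U+2022, depending on the implementation
print(describe(0xB4))  # expected to be U+0404, not the U+0403 typo
```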
See also
Ukrainian alphabet
External links
RFC 2319
IBM CDRA
IBM codepage 1168
