IS0 IEC on al mater s of electr otec nical stan ar dization... T he c ar acter set is pr imar ily inten ed for the inter chan e of infor mation amon data pr oces in s stems inter na tion
Trang 2For ewor d
IS0 (the Inter national Or ganization for Stan ar dization) is a w or l w ide
f eder ation of national stan ar ds b dies (IS0 memb r b dies) T he w or k of
pr ep r i g Inter national Stan ar ds is nor maly car r ied out thr ou h IS0
an non-gov er nmental, in laison w ith ISO, also ta e p r t in the w or k IS0
(IEC) on al mater s of electr otec nical stan ar dization
cir culated to the memb r b dies f or v otin Publcation as an Inter national
Stan ar d r eq ir es a pr ov al by at le st 7 % of the memb r b dies castin
a v ote
A nnexes A , B an C of this Inter national Stan ar d ar e for inf or mation only
0 IS0 19 6
Al r i hts r es r v ed Unle s oth r w is spe if ie , n p rt of this publcatio may be
r epr od c d or uti z d in an form or by an mean , ele tr onic or me hanical in lu in
ph to opyin a nd microfim, w ith ut permis io in w r itin fom t he publs er
Inter natio al Organizatio f or Stan ar dizatio
Cas Po tale 5 l C -121 Ge ev e 2 l Sw itz rlan
Pr i te in Sw itz r lan
i
Trang 3Information and documentation - Extension of the Cyr il ic
in lu ed T he c ar acter set is pr imar ily inten ed for the inter chan e of infor mation amon data pr oces in s stems
inter na tional r egiste con titute a c ar acter set f or the inter national inter chan e of biblogr aphic citation , in lu in
their a nnotation , in the non-Slav ic Cy l c alpha ets f or the lan uages sp cif ied in 1.3
1.3 This c ar acter set is inten ed to han le infor mation in the f olow in lan uage gr oups:
A bazia n
A bkhasian
A isor
A ltaic
A v ar
A zer baiani
Balkar
Bas kir
Bur yat
Ch kc i
Ch v a sh
Dar gw a
Eskimo
Ev en
Ev enki
Gagau i
ln u h
K ab r dian
K almyk
K ar ac ay
K ar a-K alp k
K ar elan
K aza h
K hanty
K ir g hiz
K omi
K or ya k
K umyk
K ur dis
L k
Lith anian
Man i
Mar i
Moldav ian
Mor dv in
Nenets
Niv kh
Nogai
Os etic
Roman
Sami
Selkup
Shor
T ab sar an
T ajk
T at
T atar
T ur kmen
T uv inian
Udekhe
Udmur t
Uig ur
Uzb k
Yakut
a p ar to b u r epr esented in the c ar acter ta le ar e actualy gr aphic v ar iants Obsolete leter s, those u ed f or
only a br ief p r iod in the late 19th centur y, hav e b en ex lu ed f r om this Inter national Stan ar d T his a ples c iefly
Trang 42 Normativ r efere ces
Inter national Stan ar d At the time of publcation, the edition in icated w er e v ald A ll stan ar ds ar e s bject to
r egister s of c r r ently v ald Inter national Stan ar ds
lSO/l EC 6 6: 19 1, Infor mation tec nolog - IS0 T -bit coded c ar acter set f or infor mation inter chan e
ISO/IEC 2 2 : 19 4, Infor ma tion tec nolog -
Char acter code str uctur e an exten ion tec niq es
Inter national r egister of c a r acter sets to b identified by me n of es a e seq en es 1)
the ne d f or er r or c eckin , is the s bject of other Inter national Stan ar ds (se an ex C)
3.2 T he implementation of this Inter national Stan ar d is in ac or dan e w ith the pr ov ision of ISO/IEC 2 2 2) an
is identif ied by an es a e seq en e (To b as ig ed.)
3
bib1
iogr aphic inf or mation
Switzerla d
2) G : E C Z/8 F; Gl E C Z/9 F; G2: E C Z/IO F; G3: E C 2/l 1 F (F” re rese ts th fin l ch racter of th es a e se ue ce)
2
Trang 54 Co e ta le for exte d d Cyri ic ch racter s of n n-Sla ic la g ages
T able 1 is the code ta le f or exten ed Cy l c c ar acter s of non-Slav ic lan uages
b
7
b
6
b
5
0 1
0 1
0 1
0 1
5
i3
6
B
7
ti
Y
Y
::
_,,;_ _ _ : :::_ ,:: ::_ :
: ‘.‘.‘ :’
: '_ :
1
0
8 : : ~_~_':~.:_~_~._~_',
: ~_~._'_~_~_':._~._' '_'_'_~_~_ ~
,
:.: : : : :
':-_ a ~.~.~.~.~ ~ ''~.' '~ ~.~ ~ '~
'~.~ ~.~.~.'~.~.~.~.~.~.A Z ' '~ n l-b W W
_~.~.~.‘.‘.~.~
1
0
9
f
_‘ : ‘ ;
‘:
._ _, : ‘_I :
‘_ :
.‘ _,_’
1: _,_
,:
:_ ‘ ,._ ,; : : ‘;
.: : ;
: :.:
‘
1 ‘
: _‘; _
‘
:
.:_ _‘_
.: :.: :.‘ :.:
1
0
B
.: ‘ ,._,: _,:.:
.: : ‘ , ‘
:’ ;_ : y :.: :,_,.:
‘ ‘ ,
‘:.:
,.: : :
‘ : :
d
I-._
:._‘
_,.;_., ,_ ‘ _“_ ‘“ ’
_., _ ;._._ : ~.~ :.~ ;,.I 1,.:.;.~
‘ _‘;._._.,_ 1 ,.,.,_ _ :_
,.:.,._, ,_,., ,_ : _’ :
: :
,.,.,._.,
c
‘ _ :
1
,
,
.; _: ,.,.,.,
‘ : : :.:
,.;., : ,
: ,.,_,_ ,
: :~
.:.: 1 : ‘
1 d
: ._,_ _,_,., ,_ _:’ :
:‘1:.: :._ ,
“ : :
” :
‘ _ “ ,., ,
‘, ‘L_ , : :_ ,_,_ :
“_ ‘_
,.I
.:._.:
.:_.:.I :.; :
D :.:_, _ _‘_’ , , _,_I _ 1
.:
: ’ :
‘ : _, ”
:._,_ :.: : .1.1
1 : : _,
:
,:_ :
._
: :
.,: _ :_
,_
: _1 _
‘_._ : : : ._:_, : :
:_ ’_ :
: ‘
:
E
‘: ‘:
~:‘.‘:,‘ _._
,
: ‘:
,_._._
:
._: : : :
: _ ,_.,,_
‘.‘ ‘.‘.: :
‘.‘ : _‘ : :_
._.:
3
e?
R?
se
: :_’ ‘_ : _~_‘.‘_~.‘.~_~_~_‘_‘.‘.~.~.‘
:
Reser v ed for futur e stan ar dizatio
Trang 65 L ge d
Ta le 2 giv es the code, gr aphic an name of e c c ar acter an comments on u age
Code
21
2
2
2
2
2
2
2
2
2A
2
2
2
2E
2F
6
B
P
9
i:j
:-?
i t
;
.A
a
f
5
d
CL
3
3
?J
CO BININ DIAERESIS (Dia lytika )
CO BININ OGONEK
CO BININ RIG T DES EN ER
CO BININ CEDILA
CY RILIC SMAL L T ER A IE
CY RILU SMAL L T ER G E WITH STROK E
CY RILU SMAL L T ER G E WITH MID L HO K
CY RILU SMAL L T ER K OMI DE
CY RILIC SMAL L T ER K OMI DJE
CY RILIC SMAL L T ER A K A IA DZ
CY RIL IC SMAL L T ER K OMI DZ
CY RIUIC SMAL L T ER K OMI ZJE
3
31
3
3
34
35
36
37
38
3
3A
38
3
3D
3E
3F
p
(y
p
P
K
u
Ipi
i.i
f ‘
“
: A
A Z
f
5
d
&
3
7
3.J
(T his psition s al not b u ed)
CO BININ MA CRON
CO BININ CA RON
CY RILU C PlTAL L T ER A IE
CY RIUIC C PITAL L T ER G E WITH STROKE
CY RILU C PITAL L T ER G E WITH MID L HO K
CY RILIC CA FITAL L T ER K OMI DE
CY RILU C PlTAL L T ER K OMI DJE
CY RILIC C PITAL L T ER A K A IA DZ
CY RIUIC C PlTAL L T ER K OMI DZ
CY RILU C PITAL L T ER KO I ZJE
Ta le 2
Trang 7T able 2 (contin ed)
CY RILU SMAL LT ER JE W ITH STROKE
CY RILIC SMAL L T ER K WITH VERTIC L STROK E
CY RILIC SMAL L T ER B S KIR K
CY RILIC SMAL L T ER K WITH STROK E
CY RILU SMAL L T ER C EC EN K
CY RILU SMAL L T ER K UR IS QA
CY RILIC SMAL L T ER A ISOR E
CY RILIC SMAL L T ER K OMI E J
CY RILIC SMAL L T ER E WlTH MID L HO K
CY RILIC SMAL L T ER M R VIN E K
CY RILIC SMAL L T ER A LTA IC N
CY RILIC SMAL L T ER C U A H N
CY RILIC SMAL L T ER K OMI N
CY RILIC SMAL L T ER EN WITH MID L HO K
CY RILIC SMAL L T ER 0 WITH STROKE
41
s
K
x
42
43
A lso in Y akut
4
4
4
4
4
4A
4B
4c
40
4E
4
L
I
CY RILIC C PITAL L T ER JE WITH STROK E
CY RILIC C PITAL L T ER K WITH VERTIC L STROKE
CY RILIC C PITAL L T ER B S KIR K
CY RILIC C PITAL L T ER K WITH STROKE
CY RILIC C PITAL L T ER K UR IS QA
CY RILIC C PlTAL L T ER K OMI EJ
CY RILU C PITAL L T ER E WITH MID L HO K
CY RILIC C PITAL L T ER M R VIN E K
CY RILIC C PITAL L T ER C U A H N
CY RILIC C PITAL L T ER K OMI N
K
52
5
K
ec
of cQ,
L
5
5
5
5
5
5A
5B
5
5D
5E
5
CY RILIC C PITAL
CY RILIC C PITAL
EN W ITH MID L
0 WITH STROKE
Trang 8T able 2 (con lu ed)
Code
60
Q
62
17
63
P(
6
c
6
Y
6
Y
69
rl
6B
h
6
I
70
71
72
73
7
7
7
7
7
79
7A
7B
7c
70
7E
Y
W
‘LI
v
h
e
a
E
CY RlLlC SMAL L T ER A K A IA H
CY RILU SMAL L T ER SE K P 0 IE
CY RlLlC SMAL L T ER A K A IA P E
CY RILU SMAL L T ER ER K
CY RlLlJC SMAL L T ER KO I ESJ
CY RILU SMAL L T ER K OMI TJE
CY RlLlC SMAL L T ER STRA IG T U
CY RILIC SMAL L T ER STRA IG T U W ITH STROK E
CY RILIC SMAL L T ER C E WITH VERTIC L STROKE
CY RILIC SMAL L T ER HE
CY RILIC SMAL L T ER A K A IA C E
CY RILU SMAL L T ER S WA
CY RILIC SMAL L T ER Y IE
CY RILIC A PIR TION OR G T U A L SIG
CY RILIC C PITAL L T ER A K A IA H
CY RlLlC C PlTAL L T ER SE K P 0 IE
CY RILIC C PlTAL L T ER A K A IA P E
CY RIUIC C PITAL L T ER ER K
CY RILU C PITAL L T ER K OMI ESJ
CY RILIC C PITAL L T ER K OMI TJE
CY RILU C PITAL L T ER STRA IG T U
CY RILU C PITAL L T ER STRA IG T U W ITH STROKE
CY RILIC C PITAL L T ER K UR IS WE
CY RILIC C PlTAL L T ER A K A IA THE
CY RlLlC C PlTAL L T ER C E WITH VERTIC L STROKE
CY RlLlC C PlTAL L T ER HE
CY RILIC C PITAL L T ER A K A IA C E
CY RILIC C PITAL L T ER S WA
CY RILU C PITAL L T ER Y IE
Trang 96 Explanator y n tes
Inter national Stan ar d ar e av aia le in the b sic Cy l c set (Registr ation No 3 in the inter national r egister w ith
w hic this set is desig ed f or u e)
this lnter natr onal Stan ar d
In some texts, a hig cor nr na is u ed a ov e leter s in te d of an ac te mar k ( 4 ) Most moder n Cy l c s r ipt
Stan a r d In older texts, an a ostr ophe is oc asionaly u ed to r epr esent modif ied leter s When this c ar acter is
ne ded, the a ostr ophe pr ov ided in the b sic Cy l c set (Registr ation No 3 in the inter national r egister ) s ould b
u ed
b en identified
Char acter s w ith lar ge mid le ho ks or tais ar e defined as se ar ate c ar acter s Sour ces identify these mar ks as
eithe ” hv ostik” (tai) “sedi ” (cedi a) or ” kr j k” (ho k)
6.3 T he g tur al or aspir ation sig (pr i ykhateln j z a ; I mu t not b confu ed w ith the L tin s r ipt ca ital “I”
the case of the other leter s in a w or d (e.g r I n a~) A lthou h tec nicaly a sig (lke a p r cent “%” sig ) this
c ar acter IS giv en as the last leter of most Cy l c-b sed alpha ets The notion of ca italzation is not a pled to this
sig , th s, it is as ig ed only one code in this Inter national Stan ar d
w hic ar e non-sp cin c ar acter s, that is, c ar acter s w hose u e is not folow ed by the f or w ar d mov ement of an
output dev ice In a c ar acter str i g, these non-sp cin c ar acter s ar e input b for e the c ar acter s they modify
le to r i ht or to to b tom T hey ar e inten ed to b combined w ith other sp cin c ar acter s in this Inter national
Stan ar d or c ar acter s f r om the b sic Cy l c set These combinin mar ks (e.g dia r esis) ar e u ed lb r aly in the
6.5 T he r en er i g of gr aphic c ar acter s is inten ed solely to identify u iq ely the ad itional Cy l c s r ipt leter s
for ms
as ig ed in lSO/IEC 10 4 -I
Trang 10A nnex A
T ables A 1 an A 2 s ow the u age by non-Slav ic lan uages of Cy l c s r ipt c ar acter s def i ed in this Inter national
Stan ar d Only low er -case leter s ar e s ow n Var iant gr aphic ar e en losed in p r entheses T able A 1 is a lstin by
T able A 1
A bazian
I
A bkhasian
W ’e’E3!5(K )kQr l”,b W-’
A ltai
jbf ij;
A v ar
I
+XjK C3 h’+i
Bal k ar
Bas kir
Bur yat
ib+h
Y Y
a’Ehphfl~$~ ;l ;j;
Dar gwa
I
I-’ K ’ l-l (4) H’ X’ j;
Even
ye
Ev enki
Y
=v
K ar ac ay
9
Y
K ar a-K alp k
+-K (&-w Y j;ya
K ar elan
=v
K azakh
a f i Y (5) Y 8 Y (9 Y h
K hakas
f i j t 6 j y ($ h
i a 5 n’ y 8 6
’ (a o trophe)
’ (a o trophe)
fy r -y
i-i L
/ -
,-
g ;y
in ~
t;
g ’ (a o trophe)
v
r-Y .r Y
i : ’ - t7
I ,
tJ i “+ 3i i
i I
.x -7 -
if i ;
-3 q
E ’ (a o tr ophe)
:t
i-1 -1
f’
I2
:-L.G
t7
;
zy
: * y-’ r
L.2 + ‘*<
fZ
I i
L.2
fy fj y-j
-3 =I L d
z! ?
id ti r
3
i-r ;:
LJ
g! ’ (a o trophe)
.y
i !
3
8
Trang 11L n uage
K ir ghiz
K omi
K or ya k
K ur dis
L k
Lith anian
Mar i (Mo ntain)
(Mok sha)
Mord in
Nenets
Niv kh
Os etic
Os etic
Sami
Selkup
(Dig r)
(Iro )
Tajk
Tat
Tatar
Tur k men
Udmur t
Uig ur
Y akut
Ta le A 1 - (con lu ed)
By eiqk3
NeY
i
d d q ? 3 j I-L (J-@ L (L) 6 c 7;
i6
Y
;- (I-‘) 9 a’ (3’) I (K’) 6 ; (l)
r ; (p’) ; (T’) h It (h’) ; (U’) q W
I
I
y (H’)
%H6Y
gHayol
%
eY
I-r
~El-KI-~ Gi5
Y
j$ti i
+-K Y (gY eyh
f y (K) y (jd s (5)
Elfl-Ge y
0 p ’ (a o trophe)
q
B
17 q ’ (a o trophe)
B
g ’ (a o trophe)
5i ’ (a o trophe)
fj
tf
ofi
’ (a o trophe)
’ (a o trophe)
0
f i ’ (a o trophe)
f j
?-I
“
ry
‘;a o tr ophe)
G
Trang 12Char acter
Ta le A 2 -
6 5 b/l 9 i 5
ifiG
CO BININ CEDILA
CO BININ DIAERESIS
0
Y
Y
CO BININ RIG T DES EN ER
K hakas , K hanty, K omi, Mar i, Mor dv in,
K urdis
K hakas
Taj k, Y a kut
K ar a-K alp k , Uzb k, Y akut
10
a
Trang 13Annex B
A r abic or Roman alpha ets, w hie other s had their ow n alpha ets In the mid-tw enties, Sov iet ln uists esta ls ed
lan uages in q estion
W ithin a few ye r s, it b came a p r ent that the c oice of u in the Roman alpha et for some lan uages w ithin a
alpha et w as its lack of leter s that r epr esent sou d common to lan uages in Centr al A sia an Ru sia T he b sic
leter s u ed to r epr esent one sou d) had to b u ed exten iv ely
T he diff i ulty of the s r ipt an simiar ity b tw een man leter s in the A r abic alpha et (one leter b in distin uis ed
f r om another o en only by dots) made it ev en les desir able than the Roman alpha et to r ecor d the lan uages of
(pr e-1917) an the moder n Cy l c p r iod (p st-l 9 0) ar e se ar ated by a p r iod of ye r s w her e the Roman alpha et
f av our of Cy l c eq iv alents in most con tituent r epublc of the for mer Sov iet Union
Some lan uages contin ed to u e alpha ets other than Cy l c d r i g the p r iod 1917 to 19 1 Lith anian, L tv ian,
esta ls ed an histor i aly imp r tant alpha ets T he u e of alpha ets an s r ipts is now goin thr ou h a p r iod of
alpha et other than the one b sed on the Cy l c s r ipt Mon ola, for example, decided in 19 2 to r etur n to the
tr aditional Mon olan s r ipt
Trang 14Annex C
Implementation of the 7-bit coded c ar acter set an its 7-bit an 8-bit
Use of lon itu inal p r ity to detect er r or s in inf or mation mes ages
or iented tr an mis ion
[41 IS0 17 5: 19 5, Infor mation pr oces in -
Pr oced r e for r egistr ation of es a e seq en es
lan uages f or biblogr aphic infor mation inter chan e
Par t 1: A r chitectur e an Basic Multi n ual Plane
Slav onic
K S) -
[ 01 K A TZ ER, K T he lan uages of the w or l New Yor k: Fu k & W ag als (c 19 5)
Con r es , 19 1)
12
Trang 16K S 35.040
De criptors: d c me tatio , biblo r aphie , da t a pr oc s ln p Inf or matIo interchan e, gr aphic c aracter s, Cyri c c aracter s, c aracter
s ts, c od d c ar acter s ts, c d d r epr es ntatio , e te sio s
Pric b s d o 12 pa ge