việc dịch thủ công bằng người đòi hỏi thời gian và công sức lớn, đặc biệt là những tri thức chuyên ngành đòi hỏi người dịch phải có chuyên môn trong lĩnh vực mà mình đang dịch. Chính vì vậy như cầu tự động hóa công tác dịch thuật Anh-Việt ngày càng trở nên thiết thực
Trang 1PH{) L{)C
Tu di€n LLOCE (Longman Lexicon Of Contemporary English) duQc xayd1,1'ngd1,1'atren tu di€n LDOCE (xin xem phlf llfC 8.2) Tu di€n LLOCE [32]khong s~p xep cac mlfc tu tieng Anh theo m~u t1,1'abc nhu thong thuong, ma s~pxep thanh cac nhom chu d~, m6i chu d~ duQc chia thanh nhi~u nhom, m6i nhomduQc chia thanh nhi~u lOp (t<:lmgQi la lOp ngu nghla) va m6i lOp geSmcac mlfc
tu co quail h~ ngu nghla voi nhau T6ng so geSm14 chu d~, 127 nhom, 24411O[ingu nghla va 16.000 mlfc tu Cac tu trong m6i lOp d~u la nhung tU co quail h~ v~nghla (nghla bi€u V?t hay nghla bi€u ni~m) voi nhau (nhu: deSng nghla, ggnnghIa,oo) M6i lOp thuong duQc lien ket cheo (cross-reference) voi cac lOp ngunghla khac rhea cac quail h~ logic - ngu nghla Ngoai cac nhan v~ ngu nghianoi tren, LLOCE can chila d1,1'ngra-tnhi~u nhan v~ tu 10<:liva cu pharo Chinh cacnhan ngu phap nay se giup chung ta ra't nhi~u trong vi~c khli' nh?p nhang nghlacua tu VInghla cua tu cling phlf thuQcra-tnhi~u vao vai tra ngu phap cua notrong d.u Vi dlf:tu "can" co nhi~u nghla khac nhau ("co th€", "dong hQp", "caihQp") tuy thuQcvao tu lo~i cua no Tu "ask" tuy thuQcvao m~u cali cu phap maduQcdich nghla khac nhau ("hoi" hay "yell du" ) Duoi day la daub sach 14 cMd~ trong LLOCE:
1 A Life and living things (s1,1'song va V?t th€ song)
2 B The Body: its Functions and Welfare (Co tht chilc Dang va s1,1'cham sac)
3 C People and the Family (Con nguoi va Gia dInh)
4 D Buildings, Houses, Clothes, Belongings, Personal Care (Cong trlnh xaydl1ng, nha cli'a,qu~n £10,deSd~c, ti~n nghi ca nhan)
Trang 25 E Food, Drink and Farming (d6 an, thuc u6ng, va ngh~ nong)
6 F Feelings, Emotions, Attributes and Sensations (earn XllC,XllCearn, theEdQ
va earn giac)
7 G Thought and Communication, Language and Grammar (tu duy va thongtin, ngon ngu va van phq.m)
8 H Substances, Material, Objects and Equipment (cha't li~u, V?t li~u, d6 V?t
va trang thie't bi)
9 I Arts and Craft,Science and Technology,Industry and Education (nghethu?t va ngh~ thli cong, khoa hQc va cong ngh~, cong nghi~p va giao d1,lc)
10 J Numbers, Measurement, Money and Commerce (s6, do luong, ti~n t~ va
j, thuong mq.i)
11 K Entertainment, Sports and Games (giai tri, th~ thao va cac tro choi)
12 L Space and Time (khong gian va thai gian)
13 M Movement, Location, Travel and Transport (dich chuy~n, vi tri, du lich vav?n tai)
14 N General and Abstract Terms (cac thu?t ngu khai quat va truu tu9ng)
Trang moi chu d~ tren, nguoi ta tie'p tllc chia nha tMnh cae nh6m (t6ng s6
129 nh6m), moi nh6m nay chua mQt s6 lOp va cac lOp nay c6 moi lien h~ ngu\nghia vdi cac lOp khac (c6 th~ thuQcehu d~ khac) trong tu di~n
Vi d1,l:chu d~ A g6m 10 nh6m sail:
1 S1,1's6ng va s1,1'che't, co chua cac lOptu Al de'n A20
2 Cae sinh V?t noi chung, co chua cac lOp tu A30 de'n A43
3 DQng V?t va dQng V?t co Vll,co chila cac lOp tu A50 de'n A61
4 Chill, co cae lOp tu A70 de'n A78
5 Bo sat va luong cu, co cae lOptu A90 de'n A94
6 Cd va cae thu)' sinh V?tkhae, co cac lOp tu A100de'nA 104
Trang 4f)~ minh hOil, chung t6i xin trlnh bay danh sach 158 lOp thuQc chu d€ A nhU'
Bang 8.1 dU'di day:
Bang 8.1: Danh s:kh lOp cua nh6m A trong LLOCE
Al V: existina and causina to exist
A2 V: livin and dying
A3 A: living and dead~
A4 N: life and death
AI0 informal V: killing
All IA: killing and death
A12 !poetic & formal A: without death
Al3 N: killing
A 14 IN & A: sexual types
A 15 V: sexual action
A16 N: sexual activities
A!7lA: sex and reproduction
~V: sex and reproduction
~N: early states of life
A20 IN: young creatures generally
A3"QlN: living things
A31 N: emotive expressions for living things
A32 N: kinds of living creature
A33 N: classes of living thin,gs
A34 N: breeds and strains of living things
A35 N: kinds of living things as bred by
mankind
~V: man breeding living things
~IN: very small living things
A38 N: living thin,gs which cause trouble
A39 N: living thing which eat or are eaten
A40 collective N: groups of the same kind of
Ghi chu
~n~iva~orasU'~ntilis6ng va chet
lien quan den s6ng va chetslf s6ng va slf chet
sinh de
dU'Qcsinh ra b~m sinh Igiet 1tf'I:' khu Ithu tieu
jgiet-chet'hinh th«c & tho bit tuslf giet chet
Icac 10iligidi tinhIhoat dQng gidi tinh
I
cac ho(,1t dQng gidi tinhgidi tinh va sinh sanIgiOitinh va sinh sankhdi dftu cua Slfs6ng
cac sinh V?t con n6i chung I
I
III
chan nu6i va gay gi6ng
vi sinh y?tV?t gay h(,1ithu an thitfbi an thitnh6m sinh V?l cling lo(ii
Trang 5living thin s
IA41 N: laces where creatures live and shelter noi sinh v?t~6na/tni ~n
!A42 N: places where creatures are kept by Inoi ng11aigill loai V?t
man
~v I & N: noise made by certain animals
ASO N: man and the monke
AS 1 N: the horse and similar animals
AS2 N: the cow and similar animals
A53lN: the cat and similar animals
AS4 N: the do and similar animals
ASS N: the ig
A5.6 N: the shee and the aoat
A57 IN: deer and similar animals
:AS8 N: sea and amphibious mammals
iA59 N: other large animals
IA60 N: rodents
IA61 N: other smaller animals
A70 N: domestic fowl
A71 N: ducks and geese
iA72 N: seabirds
A73 N: waterbirds
A74 N: common wild birds
A7S N: common pet birds
IA76 N: unusual birds
A77 IN: birds which are hunted as game
A78 IN: birds of prey and scavengers
A90 IN: snakes
A91 IN: lizards
iA92 IN: reptiles with shell
'A93 N: re tiles that donot exist
A94 N: am hibians
IA100 IN: common fish
AlGI N: laraer and danaerous fish
AI02 N: crustaceans
AiO3lN: mollusks (BrE), mollusks (ArnE)
AI04 N: other sea creatures
AII0 N: common insects
tie'ng keu cua mQt so loai V?tng11aiva khl
nglja va cac oQng V?t Wong tlj
bo va cac don£!vat Wong t11. ~. ~.mea va cac oQng V?t Wong t11cho va cac GQngV?t t11ongtljheo/1Qn
cuu va deh11ou va cac oQng V?t t11ong tlj GQng V? t bie'n va 111ong c11,cac oQng V?t IOnkhacloai g~m nham
cac oQng V?t nha khac
jga Ivtt va ngongchim bi~nchim n11dcchim hoang noi chungchim dnh
-chim hie'mchim bt san b~nchim san m5i va an xac che'tr~n
Trang 6AllIIN: stages in the lives of insects
A1l2 N: creatures similar to insects
A1l3 N: worms and similar creatures
A120 N: arts around the head and neck
A121 N: thin s o-rowin from the head and face
A122 N: mouth and related parts of different
creatures
,A123 N: the teeth of different creatures
A124 N: the feet of different creatures
A125 N: special parts of different creatures
A126 N: the skin, hair, and related parts of
different creatures
A127 N: marks on animals
Al28 N: parts of a female animal where milk
can be got by the young
Al30IN: arts of lants
Al31 N, etc: the flower on lants
Al32 N: new o-rowthin lams
Al33 N: li uids in lants
Al34 N, etc: fruit and seed
Al35 N, etc: laro-er lants
Al36 N, etc: o-eneral words for lants
Al37 N: collective words for many plants
together
Al38 N, etc: kinds of o-rassesand similar lants
Al39 N: ferns, heather and other s ecial plants
A140 N: lichens, funo-iand similar lants
Al41 N: climbing lants
Al50 N: kinds of fruits
Al5I N: kinds ofveo-etables
AI52 N: kinds of cereal
Al53 N: kinds of nuts
Al54 N: kind of berries
AI55 N: common kinds of deciduous trees
Al56 N: kinds of coniferous trees
A157 N: some common kinds of smaller trees
cac giai do?n cua dOl song contrung
cac sinh V?t tU'dng11,1'con trunggiun va sinh V?t tU'dngt\1'cac be)ph?n quanh d~u va c6nhung thli m9c tren d~u va m~tmi~ng va cac be)ph?n lien quan
rang cua cae sinh V?t khac nhauchan cua cac sinh V?t khac nhaube)ph?n d~c bi~t cua cac sinh v~t
da, long (toc) va cac be)ph~n lienIquan cua cac sinh v~t khac nhaucac da'u ve't tren can v~t \.-cae bo.ph~n eho bli eua eon V?tcai
cae be)ph?n cua cayhoa tren cay
nhung ea1 moi m9c tren caycha't long trong eay
Iqua va h? tcae lo?i cay IOncac tir t6ng quat chI ca y eoi
tu t~p h<;5pchi nhi~u cay coi h<;5pl?i
lo?i eo va cay tU'dngtl1IdU'dn Xl,th?ch nam va tU'dno-Wdia ,na'm va cay tU'dIlgt\1'leay lea
l(fai caycae Io?i rau cucac Io?i ngfi coeeae Io?i lo?i h? tcae Io?i qua m9ngcae Io~licay I1:mgIa theo muacae 10<;11cay h9 tung bachmQt solo?i cay nho / bui nho
Trang 7Bang 8.2: Th6ng ke s61uQng ml,lctu, nghla va nhan cua LLOCE.
Trang 88.2 Ht THONG NIlAN NGU NGmA LDOCE
Tu di~n LDOCE (Longman Dictionary Of Contemporary English) g6m45.000 ml;lctu voi hdn 65.000nghla M6i ml;lctu dU'<;1cphan bi~t dlfa tren tu lOi;li,
ma Cll phap, ma ngii' nghla, ma chu d~, ma phong cacho LDOCE baa g6mkhoang 100 ma chu d~ chinh, nhU':MD: cho y hQc, VH: cho xe cQ, ON: cho ngh~
,nghi~p, Cac ma chu d~ chinh c6 th~ dU'c;lcke't hc;lpvoi nhau ti;lora cac chu d~
can, nhU':MDON: ngh~ nghi~p-y hQc LDOCE g6m 19 ma ngii' nghIa cd b-an va
13 ma ngii' nghla phai sinh dU'c;lcke't hc;lPtU 19 ma ngii' nghla cd bin tren Cl;lth~nhU'sau:
Bimg 8.3: Bang ma ngii' nghla cua LDOCE
Cac ma ngii'nghla n6i tren c6 th~ du'c;lcs~p xep rhea cay phan loai nhU'sail:
Stt I Mfi ngu nghla co'ban Mfi ngu nghia phai sinh
1 A I Can V?t (animal) E Chit rn / long (S+L)
2 B Can V?t cai (female animal) K NgU'oi/ Can V?t dlfC(D+M)
3 I C I V?t Cl;lth (concrete) 0 NgU'oil Con V?t (A+H)
4 D I Can V?t dlfC(male animal) R NgU'oi/ Con V?t cai (B+F)
5 F NgU'oinii' (female human) U T?p hc;lpngU'oilconvt (Col.+ 0)
7 H I NgU'oi(human) W V?t Cl;ltruu tU'<;1ng/cl;lth (T +1)
8 I V?t C1,1th khong c6 sV s6ng X V? t truu tU'c;lng/NgU'oi (T+ H)
9 J V?t dn di chuyn dU'c;lc Y V? t truu tU'c;lng/c6slf s6ng (T+Q)
10 L Chit long (liquid) 1 NgU'oi/ Chit rn (H+S)
11 11 NgU'oiDam (male human) 2 Trliu tU'<;1ng/ Chit rn (T+S)
12 IN V?t rn khong di chuyn dU'c;lC 6 Chit long / Truu tU'c;lng(L+T)
13 P ThlfCV?t (plant) 7 Chit khi / Chit long (G+L)
14 Q C6 slf s6ng (animate)
16 T Truu tU'c;lng(abstract)
17 Z Khong danh diu (unmarked)
18 4 V?t th truu tU'<;1ng(abs
physical)
19 5 Chit hii'll cd (organic material)
Trang 9Hlnh 8.1 : Cay phan cftp cac ma ngu nghla cua LDOCE.
H~u her cac nhung cua danh tu d~u co ma ngu nghia va cac ma ngu nghianay duQcdung d~ phan lOpngu nghia cho danh tu E>6ivoi dQngtu va tinh tu,cac ma ngu nghIa nay se du'Qcdung d~ lam lieu chi chQnnghIa cho cac d6i s6-(cac vai) cua cac dQngtu hay tinh tu do Vi d\l: dQngtir "an" dn co danh tu 0vai chu th~ la nguoi / hay con v~t (ma 0), hay danh tu ma du'Qctinh tu "mallxanh" b6 nghia phai la danh tu thuQcv~t C\lth~ (ma C),
Bang 8.4: Th6ng ke s6luQng m\lc tir, nghia cua cac tu lo,?i chinh trong LDOCE
M\lCtu so' nghia Mli'c da nghia
Trang 108.3 lIt CO Sa TRI TmJC NGU NGHIA TV VljNG WORDNET
\VordNet la mQt h~ cd so tri thuc kh6ng 16 v€ ngu nghla cua tli v1.,I'ngtieng
Anh, duQc xay d1.,I'ngboi cac nha ligon nguhQc - may tinh, ligon ngu hQc - tam1y va ligon ngu hQc- tri nh~naDq.i hQc Princeton (My) tU d~u th~p DieD 1980.H~ WordNet [110] la mN h~ tr1.,I'ctuyen (on-line) cho phep mQi nguoi akhapmQi ndi (c6 noi mq.ng InterNet) du'Qct1.,I'do (mi~n phi) khai thac hay t,E xuong(download) may ca nhan cua minh cho cac ml,lc dich nghien cuu H~ WordNetduQc th1.,I'chi~n du'di s1,1'tai IrQ kinh phi tli mQt so cd quail, t6 chuc cua My, nhu':ONR (Van phong Nghien cuu Hai qUail), ARPA (Cd quail D1.,1'an Nghien cuuCao ca'p), LDC (T6 hQpDu li~u Ngon ngu), '
\VordNet khong chi ddn thu§n la nh6m cac tli d6ng ngh'ia hay cac tll c6quail h~ ngu nghla vdi nhau thanh tling lOp nhu' mQt so tU di~n LDOCE,LLOCE, ma WordNet con la mQth~ thong cacyni~m c6 quail h~ nhi€u m?tvdi nhau, t~o thanh mQtm~ng lu'diphuc t~p Ml,lclieu cd ban cua WordNet la
chua cac thong tin v€ ngil nghfa cIla tu, ma h~ n6i den khai ni~m hay dinh~ghla
v€ "tli " thi chac ChaD l~i daD den mQt lo~t ykien tranh ci:'iikh6ng dUL Chinh vi
V?y, ngay tll ago, ta phai XclCdinh cach hi~l) v€ ddn vi tu trang WordNet la nhu' the naG, sail d6 ta tim hi~u v€ t?P d6ng nghia (synset) - mQt thanh ph~n cd ban cua Wordnet.
8.3.1 "TV" TRONG 'WORDNET
Tren phudng di~n ngu nghla hQc tli v1,1'ng(lexical semantics), WordNet
xem Htli" La melt 51! ktt hC;p giila melt Y ni~m duc;c tit vl!ng hod (LexicaLized
concept) va meltphdt ng6n (utterance) co melt vai tra cu phdp Trang dinh nghla v€ Htli" nhu v~y, ta tha'y c6 3 va'n d€ dn lam r6 them: thu nha't, lo~i phdt ng6n
naG c6 th~ tham gia vaG trong kef hQp nay; thu hai: ban cha't va t6 chuc cua y
Trang 11ni~m au(fc ta vI!ng h06 ma ta th€ hi~n va thu ba: nhung -vat tro cu ph6p cila cac tit khac nhau Chung ta dn lam r5 cii 3 va'n d~ tren, nhu'ng VI ml,lc tieu ci1aWordNet la t6 chuG ngu nghla cila tu v1,1'ng,chinh VI v~y trong khuon kh6 cilalu~n an nay, chung toi se d~ c~p den va'n d~ thu hai, do la ca'u truc ngu nghlacila tu v1,1'ngtieng Anh.
V.l tu "tu" li;li du<;1cdung chung rho cii phat ng6n (m~t th€ hi~n, m~t hinh
thuG) va rho ciiyni~m dU<;1c ket h<;1ptrong no (m~t ynghla, m~t nQi dung), chinh
VI v~y d€ tninh hi€u nh~m, trong Wordnet se dung thu~t ngu "di;lng tu", hay la
"hinh thuG tu" (word form) d€ chI den m~t hinh thuG, th€ hi~n vat cha't cila "tu",con thu~t ngu "nghla tu" (word meaning) d€ chi den m~t ilQi dupg, )' ni~m duQc
tu v1,1'ngboa cua "tu" Xua't pM t tu hai khai ni~m tren, ta e6 th€ n6i rang "ngunghla hQc tu v1,1'ngla s1,1'anh Xi;lgiua hinh thuGva nghla" va "m6i tu lOi;liCllpM Pkhae nhau, se c6 cae ki€u anh Xi;lkhac nhau"
B?mg 8.5: Ma trQntu vI!ng trong WordNet.
Ta thli xem xet mQt ma tr~n tu v1,1'ng(lexical matrix) nhu trong bang 1 tIeDday M6i hang Mj, M2, , MI11la cae nghla khac nhau cua mQt di;lng tu (wordform) F nao do Cac cQt Fj, Fz, , Fn la cac di;lng th€ hi~n khac nhau eua rungffiQtnghla tu (word meaning) M nao do Giao giua hang M va CQtF rho ta mQtml,lc E c6 nghla la di;lngtu F do dung d€ th€ hi~n nghla M d6 Vi dl,l:E1,2 Ht di;lng
N ghla ci1a tu Hinh thue cua tu (word forms)
Trang 12tu F2dung d6 th6 hi~n nghla MI Ne'u CQtF nao c6 nhi~u bon hai ml,lc E thi ta n6i d~ng tu d6 la da nghla (polysemous) Ne'u hai ml,lc E rung n~m tIeD mQt
hang M thi ta n6i hai d<:tngtu d6 d6ng nghIa (synonym) vdi nhau Vi dl,l: trong
bang Bang 8.5 tIeD, thi F2la da nghla, FI va F21a d6ng nghla.
Anh x<:tgilia d<:tngva nghla cua tu 1a ant x<:tki6u da doi da (many: many),
nghIa 1a: c6 d<:tngtu ma c6 nhi~u nghIa va rung c6 nghla tu duQc th6 hi~n thanh nhi~u d~ng Da nghla va d6ng nghla 1a hai vfin d~ ma ngli nghla h9C tu vt;rng
quail tam, nguoi TIghehay d9C thi duong dgu vdi tu da nghla (dn xac dint xem
trang ngli canh nay thi c6 nghla gi), con nguoi vie't hay n6i thil<:ti dn bie't ch9TI
d<:tngtu nao trang so cac tu d6ng nghla d6 th6 hi~n.
8.3.2 TAp DONG NGHIA (SYNSET) TRONG WORDNET
Tr9ng tam eua Wordnet la cac 9ni~m dil duQc tu vlfng boa (ngli nghla
cua tu, tam goi g9ri la: )' ni~m tu vt;rng),chinh vi v~y Wordnetquail tam c1e'n cach bi6u di~n nhling nghla (hay 9 ni~m) nay Bang8.5afreon dung ma tr~n tu vt;rng d6 th6 hi~n cac d<:tngva cae nghla cua tu Tuy nhien, phuong phap dung k9
hi~u chITvie't ehl c6 th6 dung d6 bi6u di~n d<:tngthuc cua tU (word form) ma thai, chu khong th6 dung d6 bi6u di~n nghla.
Vi~c bi6u di~n 9ni~m tu vt;rng nay phl l thuQc VaG ml,lc tieu phl,lc Vl lcua Wordnet: ne'u dt;r tint dung d6 xay dt;rng Den 9ni~m tu vt;rng thi W9rdnet phai dam bao chua tfit ca cac thong tin ngli nghla c6 lien quail cua tu saGrho chinh tu
\\1ordnet, nguoi ta c6 th6 xay dt;rnglai chinh xac 9ni~m d6 (theo quail di6m cua
19 thuye't xay dlfng nghla) Tuy nhien, 9dint nay kh6 ma Gap ling duQc, vi ngay
d cac nghla chua trang cac tu di6n hi~n nay rung chua Gapling duQcyell du
tai hi~n nghla n6i lien Con ne'u dt;rtint dung Wordnet chi d6 phan bi~t nghla
nay vdi nghla khac, 9ni~m tU vt;rng nay vdi 9 ni~m tu vt;rng khac, thi trong
Trang 13Wardnet chi dn chila cac thong tin duai d:~mgky hi~u chITsaGcho nguoi sU'dl,lTIgc6 th~ dl;!'aVaGd6 d~ phan bi~t du<;Jcnghla nay vai nghla khac cua cling mQt tll
da nghla Vi dl;l: tll "letter" c6 2 nghla la "la thu" va "chITcai" Neu ta lll"uthanh
2 t?P nhu sail: {letter, message, } va {letter, alphabet, } thl nguoi sU'dl;lngl?ptilc bier ngay d~ng tll "letter" naG c6 nghla gi V?y hai t~p d6ng nghla (synset)n6i IreD chinh la each bi~u di~n hai nghla cua d:;mgtll "letter"
Nhung t?P d6ng nghla (SYNonym SET=synset) tl;!'than chung khong giaithich v€ nghla (hayyni~m) ma chungmang la gl, chung chi cho bier la chung c6mang mQt nghla (y ni~m) Guynha't naG d6 ma ta't ca cae tUc6 d~ng tll du<;Jcchilatrong t~p d6 cling mango Vi dl;llOp SSj = {WFjl , WFi2, , WFin} se mang 01\.nghla Guy nha't ma eac tU WI, W2, , Wn cling mango (Luu y: t~p d6ng nghla
trong Wordnet dudc d~t giITa 2 da'u ngo~c m6c: { } ) Vi tieng Anh 1a ngon ngG'
gi~u tll d6ng nghla, Den trong m6i synset thuong c6 nhieu (d~ng) tll Neu trongsynset naG chi c6 mQt (d~ng) tll , thl trong Wardnet nha't thiet phai c6 mo ngo~cgiai thich them ve nghla cua d~ng tll d6 (hi~n nay, da s6 synset deu c6 giaithich) M6i synset trong WordNet du<;Jcgall mQt mil s6 Guynha't (synset id.) d~d~ truy xua't khi xU'1y W dQng b~ng may tinh Mil s6 nay cling chinh la dQ doi(offset) tinh tll ci~u cua t~p (text only) chila synset d6, VI v~y chung ta c6 th~
dinh vi synset ci6mQt each nhanh ch6ng (vi dl;lqua ham fseek cua ngon ngi1' C).
VI tn;mg tam cua WordNet la ngITnghla, Den cac quail h~ trong WardNetcling chu yeu la cac quail h~ lien quail den nghla, nhung VI nghla cua tll trongWordNet thl du<;Jcbi~u di~n boi cac synset (thanh ph~n cd ban trong WordNet),chinh VI v~y quail h~ ngu nghla trong WardNet cling chinh la eac quail h~ giITacac synset Neu giua synset SSj={WFjj, WFi2, , WFin}va synset SSj={WFjl,
Trang 14WFj2, , WFjn} c6 quail h~ Rij vdi.nhau, thl synset SSj ={WFj], WFj2, , WFjn} cling se c6 quail h~ Rji vdi synset SSi ={WFil , WFi2, , WFin}' Tinh cha't nay
cua quail h~ du'Qc g<)i la tint hiS tu'ong (reciprocate) Ngoai ra, ne'u giua hai
synset SSj ;:: {WFil , WFj2, , WFin} va synset SSj ;:: {VlFjl , WFj2, , WFjn} c6
quail h~ R vdi nhau, thl WordNet cling dung quail h~ R d6 d<in6i leD quail h~giua cac d~ng tu (Word Form) WFj E SSjva WFj E SSj vdi nhau Cac quail h~trong WordNet du'Qcdi~n ta tn!c quail b~ng cac con tra (pointer) lien ke't giuasynset nay vdi synset kia Du'di day la cac quail h~ du'Qcs\i'dlfng trong WordNet:8.3.3.1 QUANB~ DONGNGBlA (SYNONYMY)
Quan h~ d5\ng nghla la quail h~ quail tr<)ng nha't trong WordNet, n6 th~hi~n tinh tu'ong d5ng giua cac nghla Tuy nhien, ne'u theo dint nghla ch~t che
cua Leibniz v~ d<3ngnghla "la hai tli du(/c thay the'lan nhau ma klu5ng lam thay d6i ngii:nghfa hay sdc thai cua diu", thl trong th1,1'cte', ta kh6 ma c6 du'Qc d5ng
nghla th1,1'cs1,1' nhu' the' VI V?y, ta phai cha'p nh?n mQt dint nghIa v~ d5ng nghla
mQt cach Wong doi nhu' sau "la hai tli du(/c thay the'lan nhau ma kh6ng lam thay
d6'i ngii: nghfa hay sdc thai cua diu trong mQt ngii: cdnh nh{[t djnh naG d6".
The'o dint nghla tIeD, thl giua hai (nghla) tu, ta se c6 ho~c la d<3ngnghla,
ho~c la khong d6ng nghla (logic c6 di<in:dung ho~c sai, 1 ho~c 0), nhu'ng th1,1'c
s1,1',thl ta't d chung ta d~u tha'y,giil'ahai (nghla) tu ba't ky, se c6 quail h~ d<3ngnghla vdi nhau vdi muc tu 0 de'n 1, trong d6 muc 0 la toaD roan khong d6ngnghla, muc 1 la d<3ngnghla tuy~t doi, con cac khoang giua la dQ do tint tu'ong
d<3nggiua 2 nghla d6, cang cao thl cang d<3ngnghla (logic ma mang tint dung tu
0 de'n 1) Quan h~ d<3ngnghla la quail h~ c6 tint doi xung (symmetry) va b~c
du (transitive), nghla la: x R y =>y R x va xR y /\ Y R z =>x R I).
Trang 158.3.3.2 QUAN H~ TRA.r NGHIA (ANTONYMY)
Doi l~p vdi quail h~ d6ng nghla la quail h~ phan nghla (doi l?p nghla) Vid1,1:"rich" (giiw) doi l~p nghla vdi {poor} (ngheo), {high} (cao) doi l~p nghlavdi {low} (tha'p), {old} (gia) doi l~p nghla vdi {young} (tre), {brother} (anh em)doi l~P vdi {sister} (chi em), Tuy nhien kh6 ma dinh nghIa C1,1thS du'Qc lo<;ii
quail h~ nay, VIkhong phai hie nao phdn nghfa cua x La "khong x" Trong vi d1,1
tren ta tha'y {rich} (giau) doi l~p vdi {poor} (ngheo), v~y ne'u rhea dint nghla,thl "poor" c6 nghla la "not rich" (khong giau), nhu'ng th1.fCte' thl "khong giau"chu'a hill la "ngheo" VI v~y, trong WordNet, quail h~ phan nghla chil ye'u dungcho tint tu va tr<;ingtu Quan h~ ~han nghla c6 tinh doi xung: x R y => y R x
8.3.3.3 QUAN H~ THU(JC CAP VAQUAN H~ BAD HAM
Quan he bao ham (hypernymy) la nhung quail h~ ngu nghla giua cae
nghla eila tu (word meanings), nghla la quail h~ ngu nghla giua cac synset vdi
nhau Vi d1,1:{maple} (cay g6 thiGh) la thuQc cap cila {tree} (cay), {tree} (cay)
la thuQc cap cila {plant} (th1.fCV?t) Quan h~ thuQc caplbao ham c6 thS xem tu'ang du'ang vdi quan h~ cap du'O'i(subordinate) / cap tTen (superordinate), hay
quan h~ t(ip con (subset)'! t(ip mf (superset).
Ta n6i synset SSx = {x, x', } la thuQccap cila synset SSy={y, y', }neu
ngu'oi ban ngu tie'ngAnh c6 thS cha'pnMn cau "x LA MOT (lo<;ii)y", noi each
khac, x c6 quail h~ rS-A vdi y Quan h~ ThuQc cap la quail h~ co tint cha't b~e
du, nghla la ne'u x thuQc cap y va y thuQc cap z thl x se thuQc cap z, n6i rhea ngon ngu roan la: (x c y) 1\ (y ex) =?X C z Quan h~ thuQcdip la quail h~ ba't doi xung (asymmetry), nghla la x thuQccap y thl y khong thuQc cap x, rhea ngon ngu roan la: xc y =? YCLx D1.favao tinh cha't b~c du n6i tren, ta co: mQt SSx la thuQc cap (la can) cila synset cha SSy (cling nhu' cae synset cao bon), SSx se ke'
Trang 16thua tat d cac d~c di~m cua t~p cha (va cac t?P synset cao hdn), nhu'ng d5ngthai no co them it nhat mQt d~c di~m rieng nh~m d~ phan bi~t no voi t?P synset
cha va voi cae t?P synset anh em clia no Vi dlf: {maple} (cay g6 thich) la thuQc dIp cua {tree} (cay), Den no se ke' thua tat d cac d?c di~m cua {tree} (cay),
nhu'ng d5ng thai no cling co them mQt s6 d~c di~I1).rieng khac (nhu': dQ cling cuathan, hlnh d;~mgcua la, ) nh~m phan bi~t,no voi {tree} va voi cac loai cay khac
cling thuQc loiii {tree} E>6il~p voi quail h~ thuQc dip (hyponymy) la quail h~ baa Julm (hypernymy), nghla la: x c y ~ Y:J X.
8.3.3.4 QUAN H$ BQ PRAN (MERONYMY / HOLONYMY)
Cac quail h~ d5ng nghla, phan nghIa, thuQCcap la nhfi'ng quail h~ tu'dngd5ng, chung du'Qcap dlfng cho tu v1,1'ngva ta d~ xac dinh dttQc cac quail h~ do
mQt cach t1,1'nhien Tuy nhien, bell qnh do, trong WordNet co mQt lo<;liquail h~
moi, do la quail h~ fa bQphf)11(meronymy) va co bQplu7-11(holonymy)
(;\
~
Ghi chu: thuQCcap (hyponymy); phan nghla ; la bQph~n (meronymy)
Hlnh 8.2 Mr;ll2g cae synset v6i cae kilu quan hf.
Trang 17Quan h~ la bQ ph4n e6 tint eha't b~e cffu (transitive) va ba't d6i xung
(asymmetry) Quan h~ bQ ph~n eung voi cae quail h~ khae (d6ng nghTa, phan
nghTa, thuQe ea'p, baa ham) t<;lOthanh mQt m<;lngbioi phue t<;lp(H'mh 8.2) vdi nhling nut (node) la nhling synset va cae eung n6i gilia cae nut la nhling quail h~.
8.3.3.5 QUAN m KEG THEG (ENTAILMENT)
Trong,logie, ta e6: P keG rhea Q Q (P => Q) Trang WordNet, quail h~nay du'<jeap d\mg eho dQng tU Ta n6i aQng tli Vi keG rhea ailng tli V2 ne'u va chi' ne'u co ai lam hanh aQng Vi thi chile chdn nguaiay phdi lam hanh dQng V2 Vi
d1,l:"Anh ta ngay" la keG rhea "Anh ta ngu", dQng tu "ngay" keG rhea dQng tU
"ngu", nghla la khi n6i de'n tanh dQng "ngay", th1 ta eh~e eh~n bie't r~ng da c6tanh dQng "ngu", nhung ngu<jc l<;lith1 chua eh~c dung, v1 v~y quail h~ nay laquail h~ ddn huang (unilateral) Quan h~ keG rhea nay cua dQng tU tudng nj nhuquail h~ bQ ph~n cUa daub tu Th1fCchat, ta n6i dQng tu VI keG rhea dQng tu V2,th1 tat nhien tanh dQng VI phai Ia mQtbQ ph~n, mN phgn/giai do<;lncua V2.8.3.3.6 QUANH~ CACHTHUCBAc BI~T (TROPONYMY)
Ta n6i ailng tif Vi c6 quan h? caeh thue Cl:tthl veriaQl1g tli V2 ne'u vi~c th1fC
hi~n dQng tu VI chinh la th1fchi~n dQng tU V2 voi milt each thue Cl:t tld naG d6 V~y ta thay: quail h~ caeh thue age bift cling Ia m.Qtd<;lngd~c bi~t eua quail h~ keG rhea ma trang d6 dQng tu V2 se t6ng quat hdn dQng tu V 1 va V 1 cling keG thea V2 Vi d1,l: "di kh~p khi~ng" la mQt each thuc Clftld cua "di bQ", va "di kh~p khi~ng" cling keG thea "di bQ" Tuy nhien, c6 dQng tu c6 quail h~ keG rhea nhu'ng l<;likhong quail h~ each thue age bift Vi d\1:tanh dQng "ngay" keG rhea
tanh dQng "ngu" (c6 nghla Ia 'c6 ngu moi e6 ngay'), nhu'ng "ngay" khong phaiIii "nail mot cach dae biet".1::>. .,
Trang 188.3.4 Ht THONGNIlAN NGD NGHIA CUA WORDNET
Ngoai vi~c phan thanh cac lOp ngiI nghla nhuLLOCE/LDOCE, ngu'oi tacon xem xet giiIa cac lOp nay co quail h~ vdi nhau nhu the' naG (quan h~ d6ng
nghlal phan nghla, thutlng danh I h~ danh, be)ph?n I chua d1!ng, ) VI v~y, cac
nha ngon ngu hQc - tri nMn tren the' giOi dff xay d1!ngille)th~ th6ng cac yni~m
co quail h~ ngu nghla nhi~u m~t vdi nhau va tlm cach bi~u di€n cae y nghlacling nhu cac qtwn h~ giua cac yni~m do MOt h~ th6ng nhl( the da du'tle xayd1!ng, do ehinh la h~ th6ngyni~m WordNet [75] (xin xem ph\l l\le 8.3).
Cac lOp ngu nghla trang \VordNet ehlnh la cac t~p cac tu d6ng nghla(synonym set, hay dutlc gQi dt la synset) Vi d\l: 2 synset {letter, message, } va{letter, alphabet, } M6i synset th~ hi~n mOt net nghla chung cho cae tU d6ngnghla trang t~p synset do Cac synset nay chinh la cae don vi co ban cilaWordNet va chung dutlc gall nhan boi cac ma so' (cling vdi cae giai thlch b~ngtie'ng Anh) Ta co th~ xem day chinh la cae nhiln ngCi'nghla ciia WordNet
WordNet den nay (phien ban 1.7) dil co khoang 100.000 synset Trong
WordNet, cai dutlc bi~u G<,ltla chung nha't eho ffiQi ngon ngu, con cai bi~u G<,ltthi
co th~ la cac ky hi~u b~ng mil so', ho~c tie'ng Anh, b~ng tie'ng Phap, tie'ng Hoa,
hay tie'ng Vi~t, Chinh vi v~y, ma de'n nay tren the' gidi da co nhi~u Gong trlnhban dia boa (localization) WordNet tu cac ky hi~u b~ng tie'ng Anh sangWordNet ky hi~u b~ng tie'ng Phap, tie'ng Nh~t, [39] Th?m chi, g~n day dil coGong trlnh cila FellBaum [76] nh~m 'dua eel thanh ngG:(idiom) VaG trong
W ordN et
H~ th6ng nhiln ngu nghla (synset) eua WordNet dutle t6 chUGrhea 4 tLtlo<;liehinh: danh tLt,dOng tLt,tinh tLtva tr<;lngtl( M6i tLt10<;lico Ci:lcht6 ch((c khac
nhau.
Trang 19"thtfc v~t" 1a mQt loai "huu co" (organism) Trang Wordnet se dieD ta nhusau:
oak @ -;> tree @ -;>plant @ -;>organism, voi ky hi~u "@-+" d€ tra de'n nut cha,
th€ hi~n quail h~ thuQc cfip (hyponymy) (hay ISA)
Xufit pMt tu g6c 1a mQt y ni~m cha rfit t6ng quat, dtfa rhea quail h~ baa
ham (hypernymy), nguai ta phan (nMnh) thanh cac y ni~m can cu th€ hall, r6icling tu chinh cae yni~m con nay, nguai ta tie'p wc phan nha nua thanh cac yni~m chi tie't hon nua, va cu nhu the' de'n khi kh6ng con'dn thie't phan chia nlia.Cac yni~m con se ke' thua toaD bQ cae thuQCtinh cua )' ni~m cha d6ng thai no
co them cae d~c di€m rieng d€ phan bi~t giua no voi cha no cling nhu giua novoi cae anh em cua no Cac d~c tinh phan bi~t nay g6m 3 lo?i: thuQc tinh, bQph~n va chuc Dang Vi dl}.voi yni~m {rabin} (chim c6 do), no co 3 lo?i d~c tinhsail:
a ThuQc tinh (attributes), (n6i voi tinh tu) [mau =do, kich thuoc=nho]
b BQ ph~n (parts) (n6i voi danh tu) [mo, long, cQnh]
c Chuc Dang (functions) (n6i voi dQng tu)=[h6t, bay]
Dtfa tren tinh C1,1th€ va truu tuQng cua nghla, ma Wordnet chia danh tuthanh 2 lo:;ti IOn: lo:;ti "entity" 1a nhling th1,1'cth€ C1,1th€ (tie'p xuc duQc), nhu:
ngl1C1i,khl, va nhung lo<;iitruu tuQng, kh6ng tie'p xuc duQc, nhu: tam lu5n, vi~c
h9C, sl! thanh cong, Trong m6i lOp IOnnay, Wordnet l?i chia thanh nhung lo?i
nha hon nhu sail:
Trang 20Bang 8.6 Sv phan lOp danh tu trong WordNet.
Cac yni~m trong bang tren day du'c;1cgQi la nhung yni~m nguyen thu)'(primitive semantic component) Tu nhung y ni~m nguyen thuy nay, maWordNet di'i xay dvng nen h~ thong cay phan lOp cho danh tu theo quailMthuQCcap (hyponymy) va bao ham (hypernymy)
entity Organism animal (suc v.;lt)
(thvc (v.;lt c6 sv person (ngu'ai)
plant (thvc v.;lt) tip
xuc object (v.;lt artifact (d6 nhan t<;to)
du'c;1c) th kh6ng natural object(v.;lt th tv nhien) body (co th)
p.ychology cognition (tri nh.;ln)
activity (ho<;ttdQng) Ievent (bin co)
group (nh6m ngu'ai) location (vi tri) possession (sa huu)
shape (hlnh d<;tng) I
state (tr<;tng thai)
I
I