1. Trang chủ
  2. » Luận Văn - Báo Cáo

Báo cáo khoa học: "TRANSFORMING ENGLISH INTERFACESTO OTHERNATURALLANGUAGES: AN EXPERIMENT WITH PORTUGUESE GABRIEL PEREIRA LOPES " potx

3 210 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 3
Dung lượng 277,77 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Logic Programming as a mental aid, and Prolog Coelho, 1983; Clocksin & Melish , 1981 and Extraposition Grammars Pereira, 1983 as practical tools, were adopted to implement a natu- ral la

Trang 1

TRANSFORMING ENGLISH INTERFACES TO OTHER NATURAL LANGUAGES:

AN EXPERIMENT WITH PORTUGUESE GABRIEL PEREIRA LOPES (1) Departamento de Matem~tica

• In s t i t u t o Superior de Agronomia Tapada da Ajuda - 1399 Lisboa Codex, Portugal

ABSTRACT

Nowadays i t is common the construction of

English understanding systems (interfaces) that soo-

ner or l a t e r one has to re-use, adapting and conve~

ting them to other natural languages This is not an

easy task and in many cases the arisen problems are

quite complex In this paper an experiment that was

accomplished f o r Portuguese language is reported

and some conclusions are e x p l i c i t e l y stated A know

ledge information processing system, known as SSIPA,

with natural language comprehension capabilities

that interacts with users in Portuguese through a

Portuguese interface, LUSO, was b u i l t Logic was u-

sed as a mental aid and as a practical tool

I INTRODUCTION

The CHAT-80 program f o r English (Warren &

Pereira, 1981; Pereira, 1983) was transformed and a

dapted to Portuguese Logic Programming as a mental

aid, and Prolog (Coelho, 1983; Clocksin & Melish ,

1981) and Extraposition Grammars (Pereira, 1983) as

practical tools, were adopted to implement a natu-

ral language interface f o r Portuguese The i n t e r f a -

ce here reported, called LUSO, was then coupled to

a knowledge base for geography, an extension of the

CHAT-80 knowledge base In an u l t e r i o r experiment ,

LUSO dictionary was augmented with new vocabulary

and LUSO was coupled to other modules that conside-

rably augmented the expertise capabilities of SSIPA

(Sistema Simulador de um Interlocutor Portugu~s Au-

tom~tico (2))

SSIPA is a complex knowledge information processing

system with natural language comprehension and syn-

thesis c a p a b i l i t i t e s that interacts with users in

Portuguese due to the l i n g u i s t i c knowledge that is

l o g i c a l l y organized and codified in the above men-

tioned SSIPA's interface ca]led LUSO.After the f i r s t

step of i t s development, SSIPA was able to answer

(1) Present Adress: Centro de Inform~tica, Laborat5

r i o Nacional de Engenharia C i v i l , lOl, Av do Bra=

s i l , 1799 Lisboa Codex, Portugal

(2) Simulating System of a Portuguese Automatic In-

terlocutor

questions about geography and could agree or disa- gree with the opinions stated by the users about

i t s geographical knowledge After the second step

of i t s development SSIPA became more powerful and

i n t e l l i g e n t because i t could also perform actions that t r a d i t i o n a l l y were attributes of computer mo- nitors (Lopes & Viccari, 1984).As a matter of f a c t , SSIPA can create and delete f i l e s , f i l l them, change t h e i r names, l i s t and change their, contents; SSIPA receives, keeps and send messaqes answers questions not only about geography but also about the knowledge SSIPA represents; i t a - grees or disagrees with the opinions stated byusers about the Knowledg~ context behind dialogues, reacts when users t r y to cheat i t but, as a rule, SSIPA behaves as a h e l p f u l , deligent and cooperat~

ve i n t e r l o c u t o r w i l l i n g to serve human users, chan ging from one to another topic of conversation and developing i n t e l l i g e n t c l a r i f i c a t i o n dialogues (Lo pes, 1984) A l l these features require a very power

f u l Portuguese language interface whosemain moron~

- s y n t a c t i c features are pointed out in this pa- per

2 FORMALIZATION OF NATURAL LANGUAGE CONSTRUCTS Natural language are complex structured systems d i f f i c u l t to formalize Formalization can

be understood as a step by step construction of a theory to achieve , as an ultimate goal, an axioma

t i c d e f i n i t i o n of natural language constructs I f this descriptive theory can also function as the

l i n g u i s t i c structured knowledge necessary to simu- late a human native using his mother language then, the formalization e f f o r t has acquired and gained a new insight While representing a natural language system, i t may represent a native competence about his mother language and, simultaneously, i t mayper form the role of a native using that competence This dual u n i t y , incorporatingadescription of l i n

g u i s t i c knowledge and incorporating the same l i n -

g u i s t i c knowledge ready to be active, is central to this work.This u n i f i c a t i o n in the same u n i t of two apparently c o n f l i c t i n g and contraditory aspects of natural languages is possible due to the usage of logic as a mental and a practical tool SSIPA enca psulates both views of natural language

Practice demonstrates that, for the cons truction of complex models i t is better to begin with simple model versions to represent the system one intends to simulate This practical conclusion

8

Trang 2

seems reasonable because knowledge about a system

and about i t s representation keeps on augmenting as

far as, to achieve the validation of the simula -

ting model, empirical investigation progresses(Klir,

1975) However one must be aware that while Know -

ledge about a real system keeps on growing so do

the complexitythat one can u n w i l l i n g l y introduce in

to the model Having a l l this in mind, i f we want

to formalize l i n g u i s t i c knowledge about natural fan

guage we must be prepared to use powerful formal-

languages prone to description of complex systems

and able to be used as programming languages Here

i t is subsumed that computers are tools adapted to

deal with complexity, augmenting considerably hu-

man capabilities to handle highly complex represen

tational systems

3 LUSO

LUSO input subsystem is a device that

transforms a sequence of words morfologically, syn

t a c t i c a l l y and semantically s i g n i f i c a n t into a Lo-

gical Form A Logical Form is here understood as a

sequence of predicates, envelopes for knowledge

transportation from users to SSIPA central proces-

sing unit (the EVENT DRIVER) and from this u n i t to

users These predicates generalize and augment the

potencialities of Pereira's equivalent predicates,

(Pereira, 1983) They can also be compared with the

lexical functions of Bresnam (Ig81) However we

don't use case c l a s s i f i c a t i o n In Portuguese, pre-

positions associated to noun semanticfeatures seem

to be enough to i d e n t i f y and d i f f e r e n t i a t e mea-

nings of verbal, noun, adjectival and even prepos~

tional form functions (Lopes, 1984)

LUSO is a natural language interface that

concentrates l i n g u i s t i c expert knowledge about Pot

tuguese language

LUSO input subsystem works sequentially

In a f i r s t step i t performs the syntactical analy-

sis of an input Portuguese sequence of words De-

pending on the task LUSO has been commited to per-

form, a l e x i c a l l y f i l l e d syntagmatic marker or a

f a i l u r e is the result of LUSO eagerness to prove

the above mentioned input sequence of words as a

syntactically correct yes-no question, wh-question,

imperative or declarative sentence, or as a syntac

t i c a l l y correct noun phrase or prepositional phra Z

se When a l e x i c a l l y f i l l e d syntagmatic marker is

obtained, i t is translated to a logical form Fi-

nally this form is planned and simplified accor -

ding to the methodology described by Pereira (1983)

and Warren (1981)

The design of LUSO input subsystem re -

flects the following hypothesis:

• morphological analysis of Portuguese

constructs is syntactically driven;

• l i n g u i s t i c semantic analysis of Portu-

guese constructs is l e x i c a l l y (functio

nally) driven (in a quasi-bresnamian,

sense (Bresnam, 1981; Pereira, 1983;Lo

pes, 1984));

• cognitive semantic analysis of Portu - guese constructs depends on syntacti - cal and l i n g u i s t i c semantic analysis previously achieved for Portuguese cons tructs

This suggests SSIPA as a formal system that already theorizes some aspects of Portuguese language while LUSO specificates the form of for- mal functions whose cognitive content and formal ap titude for transforming system state are defined at the semantic level of the formal system

To complete the formal role wewanted SS !

PA to play, LUSO output subsystem synthesizes Por- tuguese noun phrases, prepositional phrases or se D tences whenever i t receives correspondent requests

to output such constructs To achieve that goal LU

SO transforms any previously l e x i c a l l y f i l l e d syn- tagmatic marker into a sequence of Portuguesewords

in i t s f i n a l forms, ready to be sent to a user

4 MORPHO-SYNTACTICAL ANALYSIS AND SYNTHE - SIS OF PORTUGUESE LANGUAGE CONSTRUCTS The morpho-syntactical analysis of Portu guese language constructs is application indepen - dent and is based on the various concepts develo- ped by Chomsky and followers in the framework of the Extended Standard Theory of Generative Grammar (Chomsky, 1980, 1981a, 1981b; Rouveret, 1983 and many others)• As i t was already mentioned in this paper, one of the crucial hypothesis behind LUSO's design reflects the idea that morphological analy- sis of Portuguese constructs is syntactically d r i - ven This means that when the syntactical parseris waiting for a specific grammatical category, i t ta kes the next word to be analysed from the input se quence of words and searches the dictionary for that category, trying to find the input word I f the i put word does not match any dictionary entry for that particular category, a l l possible input word endings, one after another, starting from the lon- gest towards ths shortest, are matched against the ending entries for that category u n t i l a success - ful match w i l l occur I f such a match does not suc ceed, this means that the input word does not be- long to the foreseen grammatical category As a co) sequence, a f a i l u r e occurs and the Prolog mecha - nism for backtracking is automatically activated When one of the input word possible endings mat - ches an ending entry for the syntactically predic- ted category, a basic form f o r the input word is coined The newly coined basic form f o r that in - put word is then checked against the subdictionary entries for the foreseen grammatical category.A pr~ cess of successes and/or failures proceeds A syn- tagmatic marker for each input Portuguese construct

is f i l l e d with word basic forms and correspon - dingsyntactic features information (person, gender and number f o r noun phrases; tense, mode, aspect , voice and negation for verbs; e t c ) The basic form fora-verb is i t s i n f i n i t i v e form; for a n o u h i s i t s singular form; for a pronoun, a r t i c l e or adjective

is i t s singular masculine form

Trang 3

The morphological synthesis of Portugue-

se constructs is syntactically driven This means

that, departing from a syntagmatic marker l e x i c a l -

lp f i l l e d with basic forms of Portuguese words, u-

sing the syntactic features that are e x p l i c i t e l l y

considered into that marker, LUSO output subsystem

coines the corresponding sequence of Portuguese

words in i t s f i n a l output form ready to be sent to

the user with whom the system is interacting For

this purpose most of the rules that were designed

to consult LUSO's dictionary were reordered Depa~

ting from basic forms of words, their f i n a l forms

are obtained by a process nearly inverse of the

process used for input

Extraposition grammars, the formalism d e

veloped by Pereira (1983), were used to implement

the analyser and the synthesizer f o r Portuguese.It

is worth t e l l i n g that this formalism proved to be

quite adequate for the description of move-alpha ru

le (Chomsky, IgBlb) in complex syntactical environ

ments such as those that frequently occur in Portu

guese As a matter of fact phrase constituents or-

der in Portuguese sentences is quite free LUSO ta

kes into account the same type of problems handled

by CHAT-80 program Additionally, i t analysis syn-

tactical structures involving prepositional phra -

ses and verb headed sentences where there is reor-

dering of noun phrase constituents inside those se~

tences due to the heading process Problems rela-

ted to common nouns followed by the proper nouns

they refer, in the context where they appear,is a !

so handled

5 CONCLUSIONS

I t is wiser to concentrate efforts to o 0

tain more and more powerful morpho-syntactic anal~

sets, l i n g u i s t i c semantic analysers and cognitive,

semantic interpreters f o r the natural language we

are working in Constructing replicants of applica

tion directed interfaces starting from scratch is

unproductive Constructing more and more powerful

interfaces, as the number of applications natural-

l y grows, the natural language analyser, planned to

be application independent, is always under impro-

vement because i t is always incorporating more and

more l i n g u i s t i c knowledge At the same time one is

freed from consideration of morphological and syn-

tactic basic problems and so one can s h i f t his at-

tention to more subtle problems related to tense ,

modality and others and one can concentrate his

mind to the way how concepts related to words are

defined As a consequence, the implementing task

can be organized by areas of specialization

When one has to construct an interface

for a specific language i t is reasonable to look

for interfaces implemented for other languages wh e

re the faced syntactical and morphological prob -

lems have a similar degree of complexity Having

this in mind, Portuguese language seriously compe-

tes with English because i t rises quite important

syntactic, semantic and pragmatic problems similar

to problems risen by l a t i n , slavonic and germanic

languages

6 AKNOWLEDGEMENTS

I would l i k e to thank Helder Coelho for his insightful comments and suggestions throughout this research and the writing of this paper

7 REFERENCES BRESNAM, J., "The passive in lexical theory", Occa sional Paper 7, The Center for Cognitive Science MIT, 1981

CHOMSKY, N., 'bn binding", Linguistic Inquiry,vol

I I , n9 l , 1-46, 1980

CHOMSKY, N., "Lectures on government and binding", Foris Publications, Dordrecht, Holland, I981a CHOMSKY, N., "On the representation of form and function", The Linguistic Review, vol l , n9 l , 30-40, 1981a

COELHO, H., "The art of knowledge engineering with Prolog", INFOLOG PROJ, Faculdade de Ci~ncias, U- nivers~dade Cl~ssica de Lisboa, 1983

KLIR, G., "On the representationof a c t i v i t y arrays~ Int J G e n e r a l Systems, 2, 149-168, 1975 LOPES, G., "Implementing dialogues in a knowledge information system", paper submited to Interna - tional Workshop on Natural Language Understan ding and Logic Programming, Rennes, France, 1984 LOPES, G and VICCARI, R., "An i n t e l l i g e n t monitor interacting in Portuguese language", short paper accepted for ECAI-84, Pisa

PEREIRA, F., "Logic for natural language analysis~ Technical Note 275, SRI International, 1983 ROUVERET, A., unpublished lectures lectured in Lis bon, 1983

WARREN, D., " E f f i c i e n t processing of interactive r e lational data base queries expressed in logic" , Dept of A r t i f i c i a l Intelligence, Univ of Edin- burgh, 1981

WARREN, D and PEREIRA, F., "An e f f i c i e n t e a s i l l y adaptable system for interpreting natural langua

ge queries", DAI research paper nQ 155, Univ of Edinburgh, 1981

10

Ngày đăng: 08/03/2014, 18:20

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN

🧩 Sản phẩm bạn có thể quan tâm