1. Trang chủ
  2. » Luận Văn - Báo Cáo

Báo cáo khoa học: "A Minimalist Head-Corner Parser" ppt

3 142 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Tiêu đề A minimalist head-corner parser
Tác giả Mettina Veenstra
Trường học University of Groningen
Chuyên ngành Informatics
Thể loại báo cáo khoa học
Thành phố Groningen
Định dạng
Số trang 3
Dung lượng 271,5 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

This paper explains why functional heads are not treated as head-corners by the mini- realist head-corner parser described here.. Because head- corner parsing is a bidirectional strategy

Trang 1

A M i n i m a l i s t H e a d - C o r n e r P a r s e r

M e t t i n a V e e n s t r a

v a k g r o e p A l f a - i n f o r m a t i c a , U n i v e r s i t y of G r o n i n g e n

P o s t b u s 716

N L - 9 7 0 0 A S G r o n i n g e n

M e t t i n a @ l e t r u g n l

A b s t r a c t

In the Minimalist Program (Chomsky,

1992) it is assumed that there are different

types of projections (lexical and functional)

and therefore different types of heads This

paper explains why functional heads are

not treated as head-corners by the mini-

realist head-corner parser described here

1 I n t r o d u c t i o n

In the Minimalist Program (Chomsky, 1992) 'sur-

face' word order is determined in a very indirect way

Word order is no longer a property of phrase struc-

ture, because phrase structure is universal Fur-

thermore movements are universal This implies

in principle that when we parse comparable sen-

tences in different languages, we always build the

same tree Word order differences are distinguished

by the choice of the moment of Spell Out (SO)

SO is the point in the derivation where instructions

are given to an interface level called PF (Phonetic

Form) Thus SO yields what was formerly called

surface structure SO determines in which position

in the tree a certain constituent becomes visible and

consequently it determines the relative order of the

constituents of a sentence This is illustrated in the

simplified tree in figure 1 Note that each cluster of

co-indexed positions (i.e a chain) in the figure has

only one visible constituent This is the position in

which the constituent is represented at the moment

of SO This moment is not universal The verb chain

of our English example gives instructions to the in-

terface level P F when the verb is adjoined to AgrS

(head of the agreement phrase of the subject) The

verb chain of a comparable sentence in Dutch 'spells

out' when the verb is in V Thus in Dutch subor-

dinate clauses the movement of the verb to AgrO

(head of the agreement phrase of the object) and

CP

I

u

/ \

I / \ that~ DP AgrS

I / \ she~ A g r S AgrOP / \

V AgrS ej AorO

o(;

V AgrO

[ L

ek c a t s j

F i g u r e 1: A s i m p l i f i e d t r e e f o r a t r a n s i t i v e s u b o r d i n a t e

c l a u s e in E n g l i s h

subsequently AgrS happens 'covertly' The motiva- tion for covert movement can be found in (Chomsky,

1992, pages 38-40)

In the following sections we will show that the structure building operations of the Minimalist Pro- gram are bidirectional operations Because head- corner parsing is a bidirectional strategy, this type

of parser seems more favorable for minimalist pars- ing, than the usual left to right parsing algorithms

2 G T a n d M o v e - c ~ The central operations of the Minimalist Program are Generalized Transformation (GT) and Move-

~ GT is a structure-building operation that builds trees in a bottom-up way as is illustrated in figure 2

Trang 2

V V

F i g u r e 2: G T a p p l i e d t o V a n d D P y i e l d i n g ~'

T w o phrase markers (V and D P ) are combined into

one One of these two is called the target (V) A pro-

jection of the target (V) is added to the target T h e

projection of the target has two daughters: the tar-

get itself and an e m p t y position T h e empty posi-

tion is substituted for by the second phrase marker

(DP) This second phrase marker is itself built up in

other applications of G T a n d / o r Move-a

Move-(~ is a special kind of G T It is an opera-

tion t h a t combines a target with a moved phrase

marker It is assumed t h a t movement is always left-

ward (Kayne, 1994) and t h a t in the universal trees of

the Minimalist P r o g r a m heads and specifiers, which

are the only positions to move to, are always to the

left of the projection line These two assumptions in

combination with the fact t h a t G T and Move-a are

b o t t o m - u p operations, effect t h a t the moved phrase

marker has to be contained in the tree that was built

so far 1

T h e tree in figure 1 illustrates different kinds of

movement In the Minimalist P r o g r a m movement

occurs to check features Elements move from the

lexical domain (VP) to the functional domain (e.g

AgrOP, AgrSP) to compare their features with the

features t h a t are present in the functional domain

3 H e a d - c o r n e r p a r s i n g

T h e main idea behind head-driven parsing (Kay,

1989) is t h a t the lexical entries functioning as heads

contain valuable information for the parsing process

For example, if a verb is intransitive it will not re-

quire a complement, if it is transitive it will require a

complement Therefore the head is parsed before its

sisters in a head-driven parser A head-corner parser

(Kay, 1989; B o u m a and van Noord, 1993) is a spe-

cial type of head-driven parser Its main character-

istic is t h a t it does not work from left to right but in-

stead works bidirectionally T h a t is, first a poten-

tial head of a phrase is located and next the sisters

of the head are parsed T h e head can be in any po-

sition in the string and its sisters can either be to

the right or to the left

A head-corner parser starts the parsing process

with a prediction step This step is completed when

iSee (Veenstra, 1994) for further details

a lexical head is found t h a t is the h e a d - c o r n e r of the goal (i.e the type of constituent t h a t is parsed) T h e head-corner relation is the reflexive and transitive closure of the head relation A is the head of B if there is a rule with B as left hand side (LHS) and A

as the head daughter on the right hand side (RHS) When a (lexical) head-corner is found an X rule is selected in which the (lexical) head is on the RHS

T h e sisters of the head are parsed recursively T h e LHS of the rule contains the m o t h e r of the head

If this mother is a head-corner of the goal, and the mother and the goal are not equal the whole process

is repeated by selecting a rule with the new head- corner (i.e the m o t h e r of the first head-corner) on its RHS

In section 2 it is assumed t h a t movement is invari- ably leftward and that G T and M o v e - a are b o t t o m -

up mechanisms G T builds the V P before other pro- jections Constituents of V P are moved to higher projections by Move-a, which is a special kind of

GT Suppose that the parser should consider AgrS

as the head-corner of AgrSP, which accords with X - Theory T h e n the head (AgrS) t h a t should be filled with an adjoined verb by movement from AgrO (in

a transitive sentence) or V (in an intransitive sen- tence) is created before AgrO and V To avoid mov- ing constituents from a part of the tree t h a t has not been built yet, the head-corner table for the min- imalist head-corner parser is not constructed com- pletely according to X - T h e o r y (see (1))

(1) hc(AgrS,AgrSP), hc(V,VP)

hc(AgrOP, AgrS) hc(V,V)

hc(AgrO,AgrOP), hc(N,NP)

hc(VP, AgrO) hc(N,~)

For example, instead of AgrO, V P is the head- corner of AgrO This solution is compatible with the Minimalist P r o g r a m in the sense t h a t in this way the tree is built up in an absolute b o t t o m - u p way (i.e starting from V) so t h a t a position that should be filled by movement is always created after the position from which the moved element comes

T h e head-corner table in (1) illustrates that func- tional heads like AgrO and AgrS are not processed

as heads Lexical proj_.ections like V P and N P are treated according to X - T h e o r y If we follow (1) in combination with the tree in figure 1 we establish the fact that the parser searches its way down to the verb as soon as possible T h e top-down prediction step moves from t h e g o a l AgrSP to AgrS to A g r O P

to AgrO to V P to V and finally to the lexical head- corner V where the b o t t o m - u p process starts as the Minimalist P r o g r a m requires

T h e head-corner parsing algorithm and the

Trang 3

structure-building operations of the Minimalist Pro-

gram ( G T and Move-a) have much in common In

both cases a tree is built up in a b o t t o m - u p way by

starting with a head (lexical head-corner in the pars-

ing algorithm, target in the structure building op-

erations) and creating the sister of the head recur-

sively, etc 2 By treating only lexical heads as head-

corners we achieved t h a t our parsing algorithm com-

pletely represents G T Only for Move-a we need an

e x t r a predicate to accomplish a movement if there is

a possible movement to the node t h a t has just been

created

In section 3 we chose not to consider functional heads

as head-corners This choice was made because it

allows G T and M o v e - a to start constructing a VP

before the projections to which constituents from V P

are moved are constructed Another motivation to

s t a r t with V P is t h a t V contains information that

is useful for the remainder of the structure building

process For example, if the verb is intransitive we

know t h a t V does not require a complement sister,

and we know t h a t we do not need an A g r O P on top

of VP T h e fact t h a t V contains lexical information

and functional heads like AgrO and AgrS do not,

could be used as a justification for the fact t h a t the

latter are not head-corners T h e main idea of head-

driven parsing is, as was stated before, that heads

contain relevant information for the parsing process,

and t h a t they therefore should be parsed before their

sisters Functional heads obtain their contents via

movement of elements from positions lower in the

tree This special status makes them less useful for

the parsing process

T h e Minimalist P r o g r a m is a generation-oriented

framework Because we are dealing with parsing (as

opposed to generation) in this p a p e r there are cer-

tain discrepancies between the parser and the frame-

work it is based on In the minimalist framework,

lexical information belonging to a chain is available

from the m o m e n t t h a t the first position of the chain

is created, because t h a t is the m o m e n t when the lex-

icon is consulted W h e n parsing a sentence the lexi-

con is not by definition consulted at the beginning of

the chain Figure 1 shows a tree t h a t contains traces

and visible constituents T h e position containing a

visible constituent is the SO position of that chain

T h e parser consults the lexicon at the moment in

which the SO position of a chain is reached Conse-

Sin the minimalist head-corner parser that is de-

scribed here a head always has only one sister because

minimalist trees are at most binary branching

quently, when a trace is created before SO, the fea- tures belonging to that trace are unknown T h e fea- tures of the traces of a certain chain are known as soon as the SO position is reached, because all posi- tions in a chain are linked

It can be concluded t h a t the absolute b o t t o m - u p approach for the building of trees is more useful for generation than for parsing In generation, lexical information can be used as soon as a position t h a t

is the beginning of a chain is created In parsing we will have to wait until the SO position is reached

In spite of this, we chose not to consider functional heads as heads in order to accomplish an absolute bottom-up process T h e reason for this is that, as was mentioned before, otherwise we would be rea- soning backwards with relation to movement This could be inefficient and it is too far removed from the ideas of the minimalist framework

5 F u t u r e P l a n s

T h e parser described here can j u d g e the g r a m m a t - icality of simple declarative transitive and intransi- tive sentences, and of subordinate clauses We will extend the parser in such a way t h a t it will cover more advanced linguistic p h e n o m e n a like anaphors and wh-questions F u r t h e r m o r e o t h e r types of parsers will be built to determine if this 'lexical' head-corner parser is indeed more efficient

I would like to t h a n k Gosse B o u m a , J o h n Nerbonne, Gertjan van Noord and J a n - W o u t e r Zwart for their helpful comments on earlier versions of this paper

R e f e r e n c e s Gosse B o u m a and G e r t j a n van Noord 1993 Head- driven parsing for lexicalist grammars: Experi- mental results In 6th Meeting of the European chapter of the Association for Computational Lin- guistics, Utrecht

Noam Chomsky 1992 A minimalist p r o g r a m for linguistic theory M I T Occasional P a p e r s in Lin- guistics

Martin Kay 1989 Head driven parsing In Proceed- ings of Workshop on Parsing Technologies, Pitts- burg

Richard S Kayne 1994 T h e a n t i s y m m e t r y of syn- tax M I T Press, Cambridge

Mettina J.A Veenstra 1994 Towards a formaliza- tion of generalized transformation In H de Hoop

A de Boer and Henriette de Swart, editors, Lan- guage and Cognition ~, Groningen

Ngày đăng: 17/03/2014, 09:20

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN