1. Trang chủ
  2. » Luận Văn - Báo Cáo

Tài liệu Báo cáo khoa học: "An alternative LR algorithm for TAGs" docx

7 415 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Tiêu đề An alternative LR algorithm for TAGs
Tác giả Mark-Jan Nederhof
Trường học DFKI
Chuyên ngành Computational Linguistics
Thể loại báo cáo khoa học
Thành phố Saarbrücken
Định dạng
Số trang 7
Dung lượng 587,66 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Furthermore, the new algorithm is much sim- pler, being very close to traditional L R parsing for context-free grammars.. sure function for traditional LR parsing.. In the former string

Trang 1

A n alternative LR algorithm for T A G s

M a r k - J a n N e d e r h o f

D F K I

S t u h l s a t z e n h a u s w e g 3 D-66123 Saarbr/icken, G e r m a n y E-marl: n e d e r h o f @ d f k i d e

A b s t r a c t

We present a new LR algorithm for tree-

adjoining grammars It is an alternative to an

existing algorithm t h a t is shown to be incorrect

Furthermore, the new algorithm is much sim-

pler, being very close to traditional L R parsing

for context-free grammars The construction of

derived trees and the c o m p u t a t i o n of features

also become straightforward

1 I n t r o d u c t i o n

The efficiency of LR(k) parsing techniques

(Sippu and Soisalon-Soininen, 1990) a p p e a r s

to be very attractive from the perspective of

natural language processing This has stim-

ulated the computational linguistics commu-

nity to develop extensions of these techniques

to general context-free g r a m m a r parsing The

best-known example is generalized LR parsing

(Tomita, 1986)

A first a t t e m p t to adapt LR parsing to tree-

adjoining grammars (TAGs) was made by Scha-

bes and Vijay-Shanker (1990) T h e description

was very complicated however, and not surpris-

ingly, no implementation of the algorithm seems

to have been made up to now Apart from pre-

sentational difficulties, the algorithm as it was

published is also incorrect Brief indications of

the nature of the incorrectness have been given

before by Kinyon (1997) There seems to be no

straightforward way to correct the algorithm

We therefore developed an alternative to

the algorithm from Schabes and Vijay-Shanker

(1990) This alternative is novel in presenta-

tional aspects, and is fundamentally different in

that it incorporates reductions of subtrees

The new algorithm has the benefit that many

theoretically and practically useful properties

carry over from the context-free case For ex-

ample, by making a straightforward translation

from TAGs to linear indexed grammars, one may identify c o m p u t a t i o n s of the parser with rightmost derivations in reverse Also the ex- tensions needed for construction of parse trees (or "derived trees" as they are often called for TAGs) and the c o m p u t a t i o n of features are al- most identical to the corresponding extensions for context-free LR parsing

Section 2 discusses our notation T h e algo-

r i t h m for constructing the LR table is given in Section 3, and the a u t o m a t o n t h a t operates on these tables is given in Section 4 Section 5 first explains why the algorithm from Schabes and Vijay-Shanker (1990) is incorrect, and then provides an example of how our new algorithm works Some extensions are discussed in Sec- tion 6, and the implementation in Section 7

2 N o t a t i o n For a good introduction to TAGs, the reader

is referred to Joshi (1987) In this section we merely summarize our notation

A tree-adjoining g r a m m a r is a 4-tuple (Z, N T , I, A), where ~ is a finite set of termi- nals, I is a finite set of initial trees and A is

a finite set of auxiliary trees We refer to the trees in I U A as elementary trees T h e set N T ,

a finite set of nonterminals, does not play any role in this paper

Each auxiliary tree has a distinguished leaf, call the foot We refer to the foot of an aux- iliary tree t as Ft We refer to the root of an elementary tree t as Rt T h e set of all nodes

of an elementary tree t is denoted by At(t), and

we define the set of all nodes in the g r a m m a r by

At = U, ruAAt(t)

For each non-leaf node N we define

children(N) as the list of children nodes For other nodes, the function children is undefined

T h e dominance relation <J* is the reflexive and

Trang 2

transitive closure of the parent relation <~ de-

fined by N <~ M if and only if children(N) =

aMf~, for some ~, f~ E A/'*

Each leaf N in an elementary tree, except

when it is a foot, is labelled by either a termi-

nal from Z or the e m p t y string e We identify

such a node N labelled by a terminal with t h a t

terminal Thus, we consider 2: to b e a subset

of Af, I

For now, we will disallow labels to b e e, since

this causes a slight technical problem We will

r e t u r n to this issue in Section 6,

For each node N t h a t is not a leaf or t h a t

is a foot, we define A d j u n c t ( N ) as the set of

auxiliary trees t h a t can be adjoined at N This

set may contain the element nil to indicate t h a t

adjunction at t h a t node is not obligatory

An example of a TAG is given in Figure 1

There are two initial trees, a l and a2, and one

auxiliary tree fL For each node N , Adjunct(N)

has b e e n indicated to the right of t h a t node,

unless A d j u n c t ( N ) = {nil}, in which case t h a t

information is o m i t t e d from the picture

3 C o n s t r u c t i o n o f t h e L R t a b l e

For technical reasons, we assume an additional

node for each elementary tree t, which we de-

note by T This node has only one child, viz

the actual root node Rt We also assume an

additional node for each auxiliary tree t, which

we denote by _L This is the unique child of the

actual foot node Ft T h e d o m a i n of the func-

tion children is e x t e n d e d to include foot nodes,

by defining children(Ft) = _L, for each t E A

For the algorithm, two kinds of tree need to

b e distinguished: elementary trees and subtrees

of elementary trees A s u b t r e e can b e identified

by a pair (t, N ) , where t is an elementary tree

and N is a node in t h a t tree; the pair indicates

the s u b t r e e of t r o o t e d at N T h e set of all trees

needed by our algorithm is given by:

T = I U A U { ( t , N ) I t E I U A , N E A f ( t ) }

From here on, we will use the s y m b o l t exclu-

sively to range over I U A, and r to range over

T in general

1With this convention, we can no longer distinguish

between different leaves in the grammar with the same

terminal label This merging of leaves with identical la-

bels is not an inherent part of our algorithm, but it sim-

plifies the notation considerably

For each ~ E T, we m a y consider a p a r t of the tree consisting of a node N in ~- and the list of its children nodes 7 Analogously to the n o t a t i o n for context-free parsing, we separate the list of children nodes into two lists, s e p a r a t e d by a dot, and write N ~ a • f~, where a/~ = 7, to indicate

t h a t the children nodes in a have already been

m a t c h e d against a part of the input string, and those in fl have as yet not b e e n processed The set of such objects for an elementary tree

t is given by:

Pt = { ( T ~ f l ) I afl=Rt}U { ( N ~ a • f~) I N E Af(t), children(N) = aft} For subtrees (t, M ) we define:

P(t,M) = { ( N ~ (~ • t~) I M <F N , children(N) = a/3}

Such objects are a t t a c h e d to the trees ~ E T to which they pertain, to form the set of items: Items =

{ [ T , g - - ~ a • fl] I r e T , ( Y - + (~ fl) E P~ }

A completed item is an item t h a t indicates a completely recognized e l e m e n t a r y tree or sub- tree Formally, items are c o m p l e t e d if they are of the form [ t , T ~ Rt •] or of the form

[ ( t , N ) , N -+ a •]

T h e main concept needed for the construction

of the L R table is t h a t of L R states These are particular elements from 2 Items to be defined shortly

First, we introduce the function closure from

2 Items t o 2 Items and the functions goto and goto±

from 2 Items x J~f to 2 Items For any q C_ Items, closure(q) is the smallest set such that:

1 q C closure(q);

2 [ r , N ~ o~ • M/~] E closure(q), nil E

A d j u n c t ( M ) and children(M) = 7 implies

[% M -+ • 7] E closure(q);

3 [ r , N -+ (~ • ME] E closure(q) and

t E A d j u n c t ( M ) implies [t,T -~ • Rt] E closure(q);

4 [~-,Ft ~ • _L] E closure(q), t E A d j u n c t ( N ) ,

N E Af(t ~) and children(N) = 7 implies [(t ~, N ) , N -~ • 7] E closure(q); and

5 k , i 7 ] e closure(q) and k , N a M •

8] E Items implies [T,N -+ a M • fl] E

closure (q)

T h e clauses 1 t h r u 4 are reminiscent of the clo-

Trang 3

( a l )

b

(a2)

U

(Z)

Figure 1: A tree-adjoining g r a m m a r

b' Figure 2: A n incorrect

"parse tree" (Section 5)

sure function for traditional LR parsing Note

t h a t in clause 4 we set out to recognize a sub-

tree ( t ' , N ) of e l e m e n t a r y tree tq Clause 5 is

unconventional: we traverse the tree ~- upwards

when t h e dot indicates t h a t all children nodes

of M have been recognized

Next we define the function goto, for any

q C Items, a n d any M E ~7 or M E A f such

t h a t A d j u n c t ( M ) includes at least one auxiliary

tree

goto(q,M) = {[T,N + a M ,, ~] [

[T,N ~ a • Mfl] E closure(q)}

T h e function goto± is similar in t h a t it shifts

the dot over a node, in this case the i m a g i n a r y

node J_ which is the unique child of an actual

foot node Ft However, it only does this if t is a

tree which can be adjoined at the node t h a t is

given as the second argument

goto±(q,M) = {[7, Ft ~ _1_ ] I

[T, Ft "-+ • k] E closure(q) A t E A d j u n c t ( M ) }

T h e initial LR state is the set

qin { [ t , T - + ,,Rt] ] t e I}

We construct t h e set Q of all L R states as the

smallest collection of sets satisfying t h e condi-

tions:

1 qin E 0,;

2 q E Q, M E A/" and q' = goto(q,M) ~ @

imply q~ E Q; a n d

3 q E Q, M E A/" and q' = goto±(q,M) ~ 0

imply q' E Q

An LR state is final if its closure includes a

completed item corresponding to an initial tree:

Q1~n = {q E Q I

closure(q) n {[t, T R, -] I t e Z) # ¢0}

Final LR states indicate recognition of the in- put O t h e r completed items give rise to a re- duction, a t y p e of stack m a n i p u l a t i o n by the

LR a u t o m a t o n to be defined in the next sec- tion As defined below, reductions are uniquely identified by either auxiliary trees t or by nodes

N obtained from the corresponding completed items

reductions (q) = {t e A [ [t, T + Rt ] E closure(q)} U { N E.hf [ [ ( t , N ) , N + a ] E closure(q)}

For each node N in a tree, we consider the set C S ( N ) of strings t h a t represent horizontal cross-sections t h r o u g h t h e s u b t r e e r o o t e d at N

If we do not want to include the cross-section

t h r o u g h N itself, we write C S ( N ) + A cross- section can also be seen as t h e yield of the sub- tree after removal of a certain n u m b e r of its sub- trees

For convenience, each node of an auxiliary tree (or subtree thereof) t h a t d o m i n a t e s a foot node is paired with a stack of nodes T h e intu- ition behind such a stack of nodes [ N 1 , , Arm]

is t h a t it indicates a path, the so called spine,

t h r o u g h t h e derived tree in the direction of the foot nodes, where each Ni, with 1 <_ i < m,

is a node at which a d j u n c t i o n has taken place Such stacks correspond to the stacks of linear indexed g r a m m a r s

T h e set of all stacks of nodes is d e n o t e d by A/'* T h e e m p t y stack is d e n o t e d by [], and stacks consisting of head H a n d tail T are de- noted by [HIT ] We define:

M = • u ( • x 2 ( * ) and we simultaneously define the functions CS

and CS + from Af to 2 "~" as t h e least functions

Trang 4

satisfying:

• CS(N) + C_ CS(N), for each N;

• (N, L) • CS(N), for each N such that N <~*

l , and each L • Af*;

• N • CS(N), for each N such that -~(N<~*l);

and

• for each N, children(N) = M I " " M m and

xl • CS(M1), ,xrn • CS(Mm) implies

z l ' " X m • CS+(N)

4 T h e r e c o g n i z e r

Relying on the functions defined in the previous

section, we now explore the steps of the LR au-

tomaton, which as usual reads input from left

to right and manipulates a stack

We can divide the stack elements into two

classes One class contains the LR states from

Q, the other contains elements of A4 A stack

consists of an alternation of elements from these

two classes More precisely, each stack is an

element from the following set of strings, given

by a regular expression:

S = qi,(.MQ)*

Note that the b o t t o m element of the stack is

always qin We will use the symbol A to range

over stacks and substrings of stacks, and the

symbol X to range over elements from A4

A configuration (A, w) of the a u t o m a t o n con-

sists of a stack A • $ and a remaining input w

T h e steps of the a u t o m a t o n are given by the bi-

nary relation t- on pairs of configurations There

are three kinds of step:

s h i f t (Aq, aw) b ( Aqaq', w), provided q' =

goto(q, a) ¢ 0

r e d u c e s u b t r e e ( AqoXlqlX2q2 Xmqm, w) ~-

(Aq0 (-k, [Y[n])q', w), provided g •

reductions(qm), X 1 Xm • CS+(N) and q' =

goto±(qo, N) ~ 0, where L is determined by the

following If for s o m e j (1 < j <_ m) Xj is of

the form (M, L) then this provides the value of

L, otherwise we set L = [].~

r e d u c e a u x t r e e ( AqoXlqlX2q2 Xrnqm, W)

F- (AqoXq~, w), provided t • reductions(qm),

X 1 Xm • CS(Rt) and q' = goto(qo, N) ~ O,

where we obtain node N from the (unique) Xj

(1 _< j _< m) which is of the form (M, [NIL]),

2Exactly in the case t h a t N dominates a footnote will

(exactly) one of the Xj be of the form (M, L), some M

and set X = N if L [] a n d X = ( N , L )

o t h e r w i s e )

T h e shift step is identical to t h a t for context- free L R parsing There are two reduce steps that must be distinguished T h e first takes place when a subtree of an elementary tree

t has been recognized We t h e n remove the stack symbols corresponding to a cross-section through t h a t subtree, together with the associ- ated LR states We replace these by 2 other symbols, the first of which corresponds to the foot of an auxiliary tree, and the second is the associated L R state In the case t h a t some node

M of the cross-section dominates the foot of t, then we must copy the associated list L to the first of the new stack elements, after pushing N onto t h a t list to reflect that the spine has grown one segment upwards

T h e second type of reduction deals with recognition of an auxiliary tree Here, the head

of the list [NIL], which indicates the node at which the auxiliary tree t has been adjoined according to previous b o t t o m - u p calculations, must m a t c h a node that occurs directly above the root node of the auxiliary tree; this is checked by the test q' = goto(qo, N) ~ 0

I n p u t v is recognized if (qin,v) ~-* (qinAq,¢)

for some A and q E Q/~, T h e n A will be of the form XlqlX2q2"'" qm-lXm, where X1 " Xm E CS(Rt), for some t e I

Up to now, it has been tacitly assumed that the recognizer has some mechanism to its dis- posal to find the strings X I " " X m E CS(Rt)

and X I " " Xm E CS+(N) in the stack We will now explain how this is done

For each N, we construct a deterministic fi- nite a u t o m a t o n t h a t recognizes the strings from

CS+(N) from right to left There is only one final state, which has no outgoing transitions This is related to the fact t h a t CS+(N) is suffix- closed A consequence is that, given any stack that may occur and any N, there is at most one string X I ' " Xm E CS+(N) t h a t can be found from the top of the stack downwards, and this string is found in linear time For each t E I U A

we also construct a deterministic finite automa- ton for CS(Rt) T h e procedure for t E I is given

in Figure 3, and an example of its application

is given in Figure 4 T h e procedure for t E A is 3Exactly in the case that N dominates a footnote will

L ¢ [ ]

Trang 5

let K = 0 , 7 " = { ~ ;

l e t s = fresh_state, f = fresh_state;

make_fa(f , Rt, s)

p r o c e d u r e make_fa(ql, M, q0):

let 7" = 7"U {(qo, M, ql)};

if children(M) is defined

t h e n make_fa_list (ql, children (M), q0)

e n d p r o c

p r o c e d u r e make_fa_list ( ql , M s , q0):

i f ~ = ~

t h e n make_fa(ql, M, qo)

else let q = fresh_state;

make_fa_list(q, a, q0); make_fa(ql, M, q)

e n d p r o c

p r o c e d u r e fresh_state 0:

create some fresh object q;

let K = K t J { q } ; r e t u r n q

e n d p r o c

Figure 3: Producing a finite automaton

(K, N, T, s, {f}) that recognizes CS(Rt), given

some t E I K is the set of states, N acts as

alphabet here, 7" is the set of transitions, s is

the initial state and f is the (only) final state

similar except that it also has to introduce tran-

sitions labelled with pairs (N, L), where N dom-

inates a foot and L is a stack in Af*; it is obvious

that we should not actually construct different

transitions for different L E hf*, but rather one

single transition (N, _), with the placeholder "_"

representing all possible L EAf*

The procedure for CS+(N) can easily be ex-

pressed in terms of those for CS(Rt)

5 E x t e n d e d e x a m p l e

For the TAG presented in Figure 1, the algo-

rithm from Schabes and Vijay-Shanker (1990)

does not work correctly The language de-

scribed by the grammar contains exactly the

strings abc, a'b'c ~, adbec, and a'db'ecq The al-

gorithm from Schabes and Vijay-Shanker (1990)

however also accepts adb'ec' and a~dbec In the

former string, it acts as if it were recognizing

the (ill-formed) tree in Figure 2: it correctly

matches the part to the "south" of the adjunc-

tion to the part to the "north-east" Then, after

reading c', the information that would indicate

/

Figure 4: Example of the construction for

CS(R1), where R1 is the root node of ~1 (Fig-

ure 1)

whether a or a' was read is retrieved from the stack, but this information is merely popped without investigation Thereby, the algorithm fails to perform the necessary matching of the elementary tree with regard to the part to the

"north-west" of the adjunction

Our new algorithm recognizes exactly the strings in the language For the running ex- ample, the set of LR states and some opera- tions on them are shown in Figure 5 Arrows labelled with nodes N represent the goto func-

tion and those labelled with ± ( N ) represent the

goto± function The initial state is 0 The thin

lines separate the items resulting from the goto

and goto± functions from those induced by the closure function (This corresponds with the

distinction between kernel and nonkernel items

as known from context-free LR parsing.) That correct input is recognized is illustrated

by the following:

O a l d 5 b 7 ec reduce N1

0 a l d 5 (±,[N1]) 9 ec s h i f t e

0 a l d 5 (±,IN1]) 9 e l 0 c reduce/3

Note that as soon as all the terminals in the aux- iliary tree have been read, the "south" section of the initial tree is matched to the "north-west" section through the goto function Through subsequent shifts this is then matched to the

"north-east" section

This is in contrast to the situation when in- correct input, such as adb~ec ~, is provided to the

Trang 6

2 1[~2, N2 ~ " b']

1[/3,T ~ Aft]

~o~2, N2 > b' -] ]

[_[~2, R2 ~ a'N2 • c']

~i~2 -* ~'N2 ° c']

1 [o~2, R2 -+ a'N2c' ,1 I

[c~2, T ~ R2 *]

][(o~2,N2),N2 ~ b',

0 [at, T + * RI]

[a2, T + * R21 [ozl, RI -~ * aNlc]

[o~2, R2 -~ * a'N2c

[/3, Rfl ~ d * Fe]

[/3, F + *_1.]

[(cq, N1), NI "-+ • b]

[(a2, N2), N2 -+ * b']

[/3, F ~ _L ]

[/3, R f + dF • e]

[/3, Rf -+ dFe .]

[/3, T ~ R f .]

1

I [OZI,R 1 ~ a * N1 C]

[OZl, N1 "-)" * b]

[/3, T -~ • R~]

/ [/3, R f -~ * dFe]

~b,]

R1 "-~ aNt • c]

[olt, RI + aNt * c] ]|

|

[al, R1 -~ a N l c *]

[O(1, T ) R 1 ]

Figure 5: The set of LR states

automaton:

Stack

0

0 a l

0 a l d 5

O a l d 5 b ' 8

0 a l d 5 ( ± , [ ~ ] ) 9

0 a l d 5 ( ± , [ ~ ] ) 9 e l 0

I n p u t Step

adb' ec I shift a dbl ec I shift d

bl ec I shift b t

ec I reduce N2 ec' shift e

C t

Here, the c o m p u t a t i o n is stuck In particular, a

reduction with auxiliary tree/3 fails due to the

fact that goto(1, N2) - 0

6 E x t e n s i o n s

T h e recognizer can be t u r n e d into a parser

by attaching information to the stack elements

from ~4 At reductions, such information is

gathered and combined, and the resulting data

is attached to the new element from Iv[ that

is pushed onto the stack This can be used

for c o m p u t a t i o n of derived trees or derivation

trees, and for c o m p u t a t i o n of features Since

this technique is almost identical to that for the

context-free case, it suffices to refer to existing

literature, e.g Aho et al (1986, Section 5.3)

We have treated a classical type of TAG,

which has adjunction as the only operation for

composing trees Many m o d e r n types of TAG

also allow tree substitution next to adjunc- tion Our algorithm can be straightforwardly extended to handle tree substitution T h e main changes t h a t are required lie in the closure function, which needs an extra case (much like the corresponding operation in context-free LR parsing), in adding a third type of goto func- tion, and in adding a fourth step, consisting of reduction of initial trees, which is almost iden- tical to the reduction of auxiliary trees T h e main difference is t h a t all Xj are elements from Af; the X t h a t is p u s h e d can be a substitution node or a nonterminal (see also Section 7)

Up to now we have assumed t h a t the gram- mar does not assign the e m p t y string as label

to any of the leaves of the elementary trees

T h e problem introduced by allowing the e m p t y string is that it does not leave any trace on the stack, and therefore CS(Rt) and CS+(N)

are no longer suffix-closed We have solved this

by extending items with a third component E, which is a set of nodes labelled with ¢ that have been traversed by the closure function U p o n encountering a completed item IT, N + ~ *, E],

a reduction is performed according to the sets

CS(Rt, E) or CS+(N, E), which are subsets of

CS(Rt) and CS+(N), respectively, containing only those cross-sections in which the nodes la-

Trang 7

belled with E are exactly those in E An au-

tomaton for such a set is deterministic and has

one final state, without outgoing transitions

7 I m p l e m e n t a t i o n

We have implemented the parser generator,

with the extensions from the previous section

We have assumed that each set Adjunct(N), if

it is not {nil}, depends only on the nonterminal

label of N This allows more compact storage

of the entries goto±(q,M): for a fixed state q

and nonterminal B, several such entries where

M has B as label can be collapsed into a single

entry goto~(q,B) The goto function for tree

substitution is represented similarly

We have constructed the LR table for the En-

glish grammar developed by the XTAG project

at the University of Pennsylvania This gram-

mar contains 286 initial trees and 316 auxiliary

trees, which together have 5950 nodes There

are 9 nonterminals that allow adjunct±on, and

10 that allow substitution There are 21 sym-

bols that function as terminals

Our findings are that for a grammar of this

size, the size of the LR table is prohibitively

large The table represented as a collection of

unit clauses in Prolog takes over 46 MB for stor-

age The majority of this is needed to represent

the three goto functions, which together require

over 2.5 million entries, almost 99% of which is

consumed by goto, and the remainder by gotox

and the goto function for tree substitution The

reduction functions require almost 80 thousand

entries There are 5610 LR states The size of

the automata for recognizing the sets CS(Rt, E)

and CS + (N, E) is negligible: together they con-

tain just over 15 thousand transitions

The time requirements for generation of the

table were acceptable: approximately 25 min-

utes were needed on a standard main frame with

moderate load

Another obstacle to practical use is the equiv-

alent of hidden left recurs±on known from tradi-

tional LR parsing (Nederhof and Sarbo, 1996),

which we have shown to be present in the

grammar for English This phenomenon pre-

cludes realization of nondeterminism by means

of backtracking Tabular realization was inves-

tigated by Nederhof (1998) and will be the sub-

ject of further research

A c k n o w l e d g m e n t s Anoop Sarkar provided generous help with mak- ing the XTAG available for testing purposes Parts of this research were carried out within the framework of the Priority Programme Lan- guage and Speech Technology (TST), while the author was employed at the University of Groningen The TST-Programme is sponsored

by NWO (Dutch Organization for Scientific Re- search) This work was further funded by the German Federal Ministry of Education, Science, Research and Technology (BMBF) in the frame- work of the VERBMOBIL Project under Grant 01

IV 701 V0

R e f e r e n c e s A.V Aho, R Seth±, and J.D Ullman 1986

Compilers: Principles, Techniques, and Tools Addison-Wesley

A.K Josh± 1987 An introduction to tree ad- joining grammars In A Manaster-Ramer, editor, Mathematics o/ Language, pages 87-

114 John Benjamins Publishing Company

A Kinyon 1997 Un algorithme d'analyse LR(0) pour les grammaires d'arbres adjoints lexicalis@es In D Genthial, editor, Qua- tri~me confdrence annuelle sur Le Traitement Automatique du Langage Naturel, Acres,

pages 93-102, Grenoble, June

M.-J Nederhof and J.J Sarbo 1996 In- creasing the applicability of LR parsing In

H Bunt and M Tomita, editors, Recent Advances in Parsing Technology, chapter 3, pages 35-57 Kluwer Academic Publishers M.-J Nederhof 1998 Linear indexed automata and tabulation of TAG parsing In Actes des premikres journdes sur la Tabulation en Ana- lyse Syntaxique et Ddduction (Tabulation in Parsing and Deduction), pages 1-9, Paris, France, April

Y Schabes and K Vijay-Shanker 1990 Deter- ministic left to right parsing of tree adjoin- ing languages In 28th Annual Meeting of the

A CL, pages 276-283

S Sippu and E Soisalon-Soininen 1990

Parsing Theory, Vol II: LR(k) and LL(k) Parsing, volume 20 of EATCS Monographs

on Theoretical Computer Science Springer- Verlag

M Tomita 1986 E]ficient Parsing for Natural Language Kluwer Academic Publishers

Ngày đăng: 20/02/2014, 18:20

TỪ KHÓA LIÊN QUAN

TÀI LIỆU CÙNG NGƯỜI DÙNG

TÀI LIỆU LIÊN QUAN