Tài liệu Báo cáo khoa học: "HORN EXTENDED FEATURE STRUCTURES: FAST UNIFICATION WITH NEGATION AND LIMITED DISJUNCTION" ppt

Following the ideas of Moshier and Rounds 1987 and Langholm 1989, we define an extended fcature structure to be a pair N, K: in which /C is a set of feature structures and N is the le

Trang 1

H O R N E X T E N D E D F E A T U R E S T R U C T U R E S :

F A S T U N I F I C A T I O N W I T H N E G A T I O N A N D L I M I T E D D I S J U N C T I O N t

S t e p h e n J H e g n e r

D e p a r t m e n t o f C o m p u t e r S c i e n c e a n d E l e c t r i c a l E n g i n e e r i n g

V o t e y B u i l d i n g

U n i v e r s i t y o f V e r m o n t

B u r l i n g t o n , V T 05405 U S A

t e l e p h o n e : (802)656-3330

i n t e r n e t : h e g n e r @ u v m e d u

u u c p : u u n e t ! u v m - g e n ! h e g n e r

A B S T R A C T The notion of a Horn extended feature structure

(HoXF) is introduced, which is a feature s t r u c t u r e

constrained so t h a t its only allowable extensions are

those satisfying some set of llorn clauses in feature-

term logic, l l o X F ' s greatly generalize ordinary fea-

ture structures in a d m i t t i n g explicit representation of

negative and implicational constraints In contradis-

tinction to the general case in which a r b i t r a r y logical

constraints are allowed (for which the best known al-

gorithms are exponential), there is a highly tractable

algorithm for the unification of HoXF's

1.1 U n i f i c a t i o n - b a s e d g r a m m a r f o r m a l i s m s

Unification-based g r a m m a r formalisms constitute a

cornerstone of many of the most i m p o r t a n t approaches

to natural-language u n d e r s t a n d i n g (Shieber, 1986),

(Colban, 1988), (Fenstad etal., 1989) T h e basic idea

is t h a t the parser generates a number of partial repre-

sentations of the total parse, which are subsequently

checked for consistency and combined by a second pro-

cess known as a unifier A common form of represen-

tation for the partial representations is t h a t o f / c a t u r e

structures, which are record-like d a t a structures which

are allowed to grow in three distinct ways: by adding

missing values, by adding a t t r i b u t e s , and by coalescing

existing a t t r i b u t e s (forcing them to be the same) T h e

last o p e r a t i o n may lead to cyclic structures, which we

do not exclude If the feature s t r u c t u r e Sz is an ex-

tension of $1 (i.e., $1 grows into $2 by application of

some sequence of the above rules), we write $1 E $2

and say t h a t St subsumes $2 Intuitively, if Sl E: $2,

S~ contains more information than does S l It is easy

to show t h a t E: is a partial order on the class of all

feature structures

Each feature s t r u c t u r e represents partial informa-

tion generated during the parse To o b t a i n the total

picture, these partial components must be combined

t T h e r e s e a r c h r e p o r t e d h e r e i n w a s p e r f o r m e d w h i l e t h e

a u t h o r w a s v i s i t i n g t h e C O S M O S C o m p u t a t i o n a l L i n g u i s t i c s

G r o u p of t h e M a t h e m a t i c s D e p a r t m e n t a t t h e U n i v e r s i t y o f

O s l o l i e w i s h e s t o t h a n k J e n s Erik F e n s t a d a n d t h e m e m b e r s

o f t h a t g r o u p f o r p r o v i d i n g a s t i m u l a t i n g r e s e a r c h e n v i r o n m e n t

P a r t i c u l a r t h a n k s a r e d u e T o r e L a n g h o l m f o r m a n y i n v a l u a b l e

d i s c u s s i o n s r e g a r d i n g t h e i , , t e r p l a y o f logic, f e a t u r e s t r u c t u r e s ,

a n d u n i f i c a t i o n

into one consistent piece of knowledge T h e formal process of unification is precisely this o p e r a t i o n of com- bination T h e most general unifier (mgu) $1 LI $2 of feature structures Sj and Sa is the least feature structure (under E ) which is larger than both Sl and $2 Such an mgu exists if and only if $1 and $2 are con-

sistent; t h a t is, if and only if they subsume a common feature structure

1.2 U n i f i c a t i o n a l g o r i t h m s a n d t h i s p a p e r While the idea of a most general unifier is a pleasing theoretical notion, its real utility rest with the fact

t h a t there are efficient algorithms for its c o m p u t a t i o n

T h e fastest known algorithm, identified by A i t - K a c i (1984), runs in time which is, for all practical pur- poses, linear in the size of the i n p u t (i.e., the combined sizes of the structures to be unified) In proposing any extension to the basic framework, a p r i m a r y considera- tion m u s t be the complexity of the ensuing unification algorithm T h e principal contribution of the research summarized here is to provide an extension of ordinary feature structures, a d m i t t i n g negation and limited disjunction, while at the same time continuing to a d m i t

a provably efficient unification algorithm

Due to space limitations, we must o m i t substan- tial background material from this paper Specifically,

we assume t h a t the reader is familiar with the notation and definitions surrounding feature structures (Shieber, 1986; Fenstad et al., 1989), as well as the

t r a d i t i o n a l unification algorithm (Colban, 1990) We also have been forced to o m i t much detail from the description and verification of our algorithm A full

r e p o r t on this work will be available in the near fu- ture

2 U N I F I C A T I O N I N T H E P R E S E N C E

O F C O N S T R A I N T S 2.1 C o n s t r a i n t s o n f e a t u r e s t r u c t u r e s Not ev- ery feature s t r u c t u r e is a possibility as the u l t i m a t e

o u t p u t of the parsing mechanism Typically, there are constraints which must be observed One way of en- suring this sort of consistency is to build the checks right into the g r a m m a r , so t h a t the feature structures generated are always legitimate s u b s t r u c t u r e s of tile

final o u t p u t T h e C L G formalism (Dumas and Vat- lie, 1989) is an example of such a philosophy |n many ways, this is an a t t r a c t i v e option~ because it provides a

Trang 2

unified context for expressing all aspects of the gram-

mar liowever, this approach has the disadvantage

t h a t it limits the use of i n d e p e n d e n t parsing subalgo-

rithms whose results are subsequently unified, since

the consistency checks nmst be performed before the

feature structures are presented to the unifier There-

fore, to maintain such independence, it would be a

distinct advantage if some of the constraint checking

could be relegated to the unification process

To establish a formal framework in which this is

possible, we must s t a r t by extending our notion of a

feature structure Following the ideas of Moshier and

Rounds (1987) and Langholm (1989), we define an ex-

tended fcature structure to be a pair (N, K:) in which

/C is a set of feature structures and N is the least ele-

ment of/C under the ordering _ (Titus, by definition,

K: has a least element, and K: determines N.) T h i n k of

N a.s the "current" feature structure, a n d / C as the set

of all structures into which N is allowed to grow We

define (N~,K:t) C:~ (N~,/C~) to mean precisely t h a t

K~ C_ /C~ In other words, the set of all structures

which N~ can grow into is a subset of those which N~

can grow into (It follows necessarily t h a t N~ ~_ N2

in this case.) Note t h a t if we identify the ordinary

feature s t r u c t u r e N with the pair (N, IM I N ~ M}),

we precisely recapture o r d i n a r y subsumption Finally,

the notion of unification associated with _~ is given

by

( M r , / C t ) LI= (M~,/C:~) =

( M , / ~ 17/C2) if/C~ n/c2

has a least element M ; undefined oOmrwise

2.2 L o g i c a l f e a t u r e s t r u c t u r e s w i t h c o n -

s t r a i n t s To o p e r a t e on pairs of the form (N~/C) al-

gorithmically, we must have in place an a p p r o p r i a t e

representation for the set g: T h e r e are many possible

choices; ours is to let it be the set of all structures

satisfying a set of sentences iu a particular logic T h e

logic which we use is a simple modification of the lan-

guage of Rounds and Ka.sper (1986) (see also (Kasper

and Rounds, 1990)) a d m i t t i n g negation but only bi-

nary path equivalences Specifically, an atomic feature

term is one of the following

FormltJa

T

±

( ~ : a )

(,~ × f~)

Semantics

T h e identically true term

The identically false term

T h e p a t h (nesting of a t t r i b u t e s ) cz exists

and terminates with label a

T h e paths cr and /? have a common end

point (coalesced end points)

In ( a : a), the label a may be T , denoting a miss-

ing value T h e notation ( a ~ /~) is borrowed from

(Langholm, 1989), and has the same semantics as

{ , , B } o f ( R o u n d s and Kasper, 1986) A (general)fea-

tur~ term is b i l t up from atomic feature terms using

the connectives ^, v, and -., with the usual semantics

In particular, the negation we use is the classical no-

tion; a s t r u c t u r e s a t i s f e s (-,~0) if and only if it does

not satisfy ~ For any set • of feature terms, Mod(&) denotes the set of all feature structures for which each

E r~ is true For a formal definition of satisfaction,

we refer the reader to the above-cited references In- tuitively, any set of terms which defines a consistent rooted, directed graph is satisfiable Ilowever, let us specifically r e m a r k t h a t only nodes with no outgoing edges may have labels other than T, t h a t labels other than T may occur at at most one end point, t h a t no two outgoing edges from the same node may have the same label, and t h a t any term of the form ( a : L) is equivalent to _L, and so inconsistent

Now we define a logical extended feature structure (LoXF) to be an extended feature s t r u c t u r e i N , K:)

in which K: = M o d ( ¢ ) for some consistent finite set ~ of feature terms In particular, M o d ( ~ ) must have a least model We also denote this pair by

Y ( ~ ) = ( g , M o d ( ~ b ) ) Now Y(~b,) E_, ~ ' ( ~ 2 ) re- duces to M o d ( ~ ) C_ Mod(4,a), and

undefined

if Mod(&a U q~)

has a least element under E; otherwise

2.3 R e m a r k o n n e g a t i o n A full discussign of the

n a t u r e of negation in L o X F ' s is complex, and will be the focus of a s e p a r a t e paper IIowever, because this topic has received a great deal of a t t e n t i o n (Moshier and Rounds, 1987), (Langholm, 1989), (Dawar and Vijay-Shanker, 1990), we feel it essential to r e m a r k here t h a t ~'(¢~) does not have the "classical" negation semantics which can be d e t e r m i n e d by looking solely at the least element Indeed, the a p p r o p r i a t e definition is t h a t .~'(~) satisfies -'7' precisely when no

m e m b e r of Mod(&) satisfies ¢; in other words, the

s t r u c t u r e N is not allowed to be extended to satisfy

~o

2.4 U n i f i c a t i o n a l g o r i t h m s f o r l o g i c a l e x -

t e n d e d f e a t u r e s t r u c t u r e s In view of the definition i m m e d i a t e l y above, it is easy to see t h a t t h a t any unification algorithm for L o X F ' s must solve the following two problems in the course of a t t e m p t i n g to unify ~ ' ( ~ i ) and ~'(~2)

( u l ) It must decide whether or not ~ i U q~2 is consistent; i.e., whether or not there is a feature structure satisfying all sentences of b o t h ~ i and cb2

(u2) In case t h a t 4~I U~2 is satisfiable, it must also de- termine if there is a least model, and if so, identify

it

Now it is well known t h a t ( u l ) is an NP-complete problem, even if we disallow negation and path equiva- lence (Rounds and Kasper, 1986, Thin 4) Therefore,

barring the eventuality that P = NP, we cannot ex-

pect to allow ~I and ~2 to be arbitrary finite sets of feature terms and still have a tractable algorithm for unification One solution, which has been taken by a number of authors, such as Kasper (1989) and Eisele and D6rre (1988), is to devise clever algorithms which apply to the general case and appear empirically to work well on "typical" inputs, but still are provably

Trang 3

exponential in the worst case While such work is un-

deniably of great value, we here propose a companion

strategy; namely, we restrict attention to pairs {N, ~ )

such t h a t the very n a t u r e of • guarantees a t r a c t a b l e

algorithm

3 H O R N F E A T U R E L O G I C

In the field of mathematicM logic in general, and

in the c o m p u t a t i o n a l logic relevant to computer sci-

ence in p a r t i c u l a r , Horn clauses play a very special r61e

(Makowsky, 1987) Indeed, they form the basis for the

programming language Prolog (Sterling and Shapiro,

1986) and the d a t a b a s e language Datalog (Ceri et ai.,

1989) This is due to the fact t h a t while they possess

s u b s t a n t i a l representational power, t r a c t a b l e inference

algorithms are well known It is perhaps the main the-

sis of this work t h a t the utility of llorn clauses carries

over to c o m p u t a t i o n a l linguistics as well

3.1 H o r n f e a t u r e c l a u s e s A feature literal is ei-

ther an atomic feature term (e.g., ( ~ : a), (~ ~- /~),

or _L) or its negation A feature clause is a finite

disjunction £ l v t ~ v v l , n of feature literals A fea-

ture clause is florn if at most one of the t i ' s is not

negated A Horn extended feature structure ( lloXF)

is a LoXF ~'(4,) such t h a t • is a finite set of llorn

feature clauses

3.2 A t a x o n o m y o f H o r n f e a t u r e c l a u s e s Be-

fore moving on to a presentation of algorithms on

t I o X F ' s , it is a p p r o p r i a t e to provide a brief sketch of

thc utility and limits of restricting our attention: to col-

lections of lIorn clauses, hnplication here is classical;

in the case of ordinary propositional logic, we use

the notation e t ^ ~ r ~ ^ ^am =~ p to denote the clause

~O'l v-~0r2v V'~O'rnVp Horn feature clauses may then

be t h o u g h t of as falling into one of the following four

Định dạng
Số trang	6
Dung lượng	596,28 KB