T such that μUΣ_T(p, q) is infinite. The class of such theories is 𝔘_∞ (type ∞). (iv) A theory T is of type zero if it is not in one of the above classes. The class of these theories is 𝔘₀. (v) A theory is unification-relevant if it is not of type zero. The class of these theories is 𝔘ᵣ.

Several examples of unitary, finitary and infinitary theories as well as type zero theories are discussed in III.1. A matching problem ⟨s ≤ t⟩_T consists of a pair of terms and a theory T ∈ 𝔗. A substitution μ ∈ Σ is a T-matcher (or one-way unifier) if μs =_T t. MΣ_T is the set of matchers, and a set of most general matchers μMΣ_T is defined similarly to μUΣ_T.
The set μMΣ_T induces classes of matching-relevant theories similar to the classes based on μUΣ_T: a theory T is unitary matching if μMΣ_T always exists and has at most one element. The class of such theories is 𝔐₁. Analogously we define 𝔐_ω, 𝔐_∞, 𝔐₀ and the class 𝔐ᵣ.

A unification algorithm UA_T (a matching algorithm MA_T) for a theory T is an algorithm which takes two terms s and t as input and generates a set cUΣ_T ⊆ UΣ_T (a set cMΣ_T ⊆ MΣ_T) for ⟨s = t⟩_T (for ⟨s ≤ t⟩_T). A minimal algorithm μUA_T (μMA_T) is an algorithm which generates a μUΣ_T (μMΣ_T).
For many practical applications this requirement is not strong enough, since it does not imply that the algorithm terminates for theories T ∈ 𝔘₁ ∪ 𝔘_ω. On the other hand, for T ∈ 𝔘_ω it is sometimes too rigid, since an algorithm which generates a finite superset of μUΣ_T may be far more efficient than the algorithm μUA_T and for that reason preferable. For that reason we define: An algorithm UA_T is type conformal iff:

(i) UA_T generates a set cUΣ_T with UΣ_T ⊇ cUΣ_T ⊇ μUΣ_T for some μUΣ_T;
(ii) UA_T terminates and cUΣ_T is finite if T ∈ 𝔘₁ ∪ 𝔘_ω; and
(iii) if T ∈ 𝔘_∞ then cUΣ_T = μUΣ_T.

Similarly: an algorithm MA_T is type conformal iff (i)-(iii) hold with U replaced by M.
"Howev er to ge ne rali ze . on e n e e ds exp eri e nc e ... " G. Gratze r Un i v er s a l Al g ebr a . 19 68 II I. RESULTS
" a comparative study neces sari ly pr esup pose s some pr ev i ou s se par a t e stu dy. co mpar ison bei ng impo s sible wit hou t k no wledge . " N. Whi t ehead Treat i s e on Univers a L Alg ebra. 18 9 8 1.
Spe c i a l The or i e s
Thi s sec t i o n i s conc er ned with Pr o blem Two a nd Th r ee (the e x i s t e n c e r es p. the enumerati on problem) me n tion e d i n I I .3 : For a give n equationaL
th e or y T. do e s t h ere exis t an a Lgor i thm. whi c h e nume ra t e s any te r ms s and t ?
~ UL T ( S , t )
for
The following table summarizes the results that have been obtained for special theories, which consist of combinations of the following equations:

A (associativity): f(f(x,y),z) = f(x,f(y,z))
C (commutativity): f(x,y) = f(y,x)
D (distributivity):
  D_L: f(g(x,y),z) = g(f(x,z),f(y,z))
  D_R: f(x,g(y,z)) = g(f(x,y),f(x,z))
H, E (homomorphism, endomorphism): φ(x∘y) = φ(x)∘φ(y)
I (idempotence): f(x,x) = x
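Matching modulo even the simplest of these equations already separates the matching classes from the syntactic case. For commutativity C, the problem ⟨f(x,y) ≤ f(a,b)⟩_C has two incomparable most general matchers, so C is not unitary matching. A naive generate-and-test sketch (hypothetical Python encoding, added here for illustration and not part of the original text; terms are binary tuples ('f', left, right), variables are strings on the pattern side only):

```python
# Sketch: matching modulo commutativity (C). Assumes all compound terms are
# binary and that only the pattern s contains variables. With repeated
# variables a full implementation would compare bindings modulo =_C rather
# than syntactically, as done in the `sigma[s] == t` check below.

def c_match(s, t, sigma):
    """Yield substitutions extending sigma that match pattern s against t mod C."""
    if isinstance(s, str):                         # variable: bind or check
        if s in sigma:
            if sigma[s] == t:
                yield sigma
        else:
            yield {**sigma, s: t}
        return
    if isinstance(t, str) or s[0] != t[0]:         # clash
        return
    (_, s1, s2), (_, t1, t2) = s, t
    for order in ((t1, t2), (t2, t1)):             # try both argument orders
        for sig1 in c_match(s1, order[0], sigma):
            yield from c_match(s2, order[1], sig1)

# <f(x, y) <= f(a, b)>_C has the two matchers {x: a, y: b} and {x: b, y: a}.
matchers = list(c_match(('f', 'x', 'y'), ('f', 'a', 'b'), {}))
```

Neither matcher is an instance of the other, which is exactly the situation the class 𝔐_ω (finitary matching) describes.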
Py. An ET-proof for A would then be the tree Q given as:

[tree diagram lost in extraction; its list representation is given in Example 4.2 below]

Here, Dp(Q) = [¬Pv ∨ Pu] ∨ ¬¬[¬Pw ∨ Pv]. The imbedding relation is the pair u < v.
3.14. Soundness and Relative Completeness for ET-Proofs. Let A be a formula. Then ⊢_𝒯 A if and only if A has an ET-proof.

This theorem is what we shall consider our higher-order version of Herbrand's theorem. The reader is referred to [16] for the details of this proof. The relative completeness result, i.e. if ⊢_𝒯 A then A has an ET-proof, is proven by using the Abstract Consistency Property in [1]. The central result concerning Abstract Consistency Properties is based on Takahashi's proof of the cut-elimination theorem for higher-order logic [22]. Since 𝒯 is non-extensional, Henkin-style general models do not correctly characterize derivability in 𝒯. Hence, the completeness result is stated relative to the notion of derivability and is not based on a notion of validity.

3.15. Definition. An expansion tree is grounded if none of its terminal nodes are labeled with formulas of the form ΠB. An ET-proof is a grounded ET-proof if it is also a grounded expansion tree.
3.16. Theorem. A formula has an ET-proof if and only if it has a grounded ET-proof.

4. List Representations of Expansion Trees
We shall now present a representation of expansion trees which is more succinct and more suitable for direct implementation on computer systems. We shall no longer consider the logical connectives ∧ and ⊃ and the quantifiers ∀ and ∃ to be abbreviations. This will help make list representations of expansion trees more compact. The set of all list structures over a given set, S, is defined to be the smallest set which contains S and is closed under building finite tuples. Since expansion and selection nodes in an expansion tree must occur under an odd and even number of occurrences of negations respectively, we need to be careful how we imbed expansion trees under negations when we attempt to build up larger expansion trees from smaller ones. This explains why we need to consider so many cases in the following definition.

4.1. Definition. Let S be the set which contains the labels SEL and EXP and all formulas of 𝒯. Let ℰ be the smallest set of pairs (R, A), where R is a list structure over S and A is a formula, which satisfies the conditions below. We say that a variable y is selected in the list structure R if it occurs in a sublist of the form (SEL y R').

(1) If A is a boolean atom and R is a λ-normal form of A, then (R, A) ∈ ℰ and (¬R, ¬A) ∈ ℰ. Here, ¬R is shorthand for the two element list (¬ R).
(2) If (R, A) ∈ ℰ then (R, B) ∈ ℰ where A conv B.
(3) If (R, A) ∈ ℰ then (¬¬R, ¬¬A) ∈ ℰ.

In cases (4), (5), and (6), we assume that R₁ and R₂ share no selected variables in common and that A₁ (A₂) has no free variable selected in R₂ (R₁).

(4) If (R₁, A₁) ∈ ℰ and (R₂, A₂) ∈ ℰ then ((∧ R₁ R₂), A₁ ∧ A₂) ∈ ℰ and ((∨ R₁ R₂), A₁ ∨ A₂) ∈ ℰ.
(5) If (¬R₁, ¬A₁) ∈ ℰ and (¬R₂, ¬A₂) ∈ ℰ then (¬(∧ R₁ R₂), ¬[A₁ ∧ A₂]) ∈ ℰ and (¬(∨ R₁ R₂), ¬[A₁ ∨ A₂]) ∈ ℰ.
(6) If (¬R₁, ¬A₁) ∈ ℰ and (R₂, A₂) ∈ ℰ then ((⊃ R₁ R₂), A₁ ⊃ A₂) ∈ ℰ and (¬(⊃ R₂ R₁), ¬[A₂ ⊃ A₁]) ∈ ℰ.

In cases (7), (8), and (9), we assume that y is not selected in R and that y is not free in [λx P] or in B.

(7) If (R, [λx P]y) ∈ ℰ then ((SEL y R), ∀x P) ∈ ℰ.
(8) If (¬R, ¬[λx P]y) ∈ ℰ then (¬(SEL y R), ¬∃x P) ∈ ℰ.
(9) If (R, By) ∈ ℰ then ((SEL y R), ΠB) ∈ ℰ.

In cases (10), (11), and (12), we must assume that for distinct i, j such that 1 ≤ i, j ≤ n, Rᵢ and Rⱼ share no selected variables and that no variable free in [λx P]tᵢ is selected in Rⱼ.

(10) If for i = 1, ..., n, (Rᵢ, [λx P]tᵢ) ∈ ℰ then ((EXP (t₁ R₁) ... (tₙ Rₙ)), ∃x P) ∈ ℰ.
(11) If for i = 1, ..., n, (¬Rᵢ, ¬[λx P]tᵢ) ∈ ℰ then (¬(EXP (t₁ R₁) ... (tₙ Rₙ)), ¬∀x P) ∈ ℰ.
(12) If for i = 1, ..., n, (¬Rᵢ, ¬Btᵢ) ∈ ℰ then (¬(EXP (t₁ R₁) ... (tₙ Rₙ)), ¬ΠB) ∈ ℰ.

The pair (R, A) ∈ ℰ represents, in a succinct fashion, an expansion tree. Notice that the only formulas stored in the list structure R are those used for expansions and selections and those which are the leaves of the expansion tree. Expansion trees as defined in §2 contain additional formulas which are used as "shallow formulas" to label expansion and selection nodes. These formulas, however, can be determined up to λ-convertibility if we know what the expansion tree is an "expansion" for. Notice that one list structure alone may represent several expansion trees. For example, (EXP (a Paa)) could represent an expansion tree for ∃x Pxa, ∃x Pax, and ∃x Paa. If we keep this complication in mind, we can informally consider list structures as expansion trees.
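The deep formula Dp of a tree can be read off its list structure directly. The following Python sketch (a hypothetical encoding added for illustration, not part of the paper) handles only the node shapes occurring in Example 4.2 below: leaves, binary connectives, negation, selections, and positive expansion nodes, where an expansion node contributes the disjunction of the deep formulas of its subtrees and a selection node is transparent. A full implementation would track polarity so that a negated EXP node with several expansion terms distributes the negation over its instances.

```python
# Sketch: computing the deep formula of an expansion tree given as a list
# structure. Formulas are opaque leaves (strings); interior nodes are tuples
# headed by 'not', 'and', 'or', 'implies', 'SEL', or 'EXP'.

def deep(R):
    if isinstance(R, str):                  # leaf formula
        return R
    head = R[0]
    if head == 'not':
        return ('not', deep(R[1]))
    if head in ('and', 'or', 'implies'):
        return (head, deep(R[1]), deep(R[2]))
    if head == 'SEL':                       # (SEL y R'): deep formula of R'
        return deep(R[2])
    if head == 'EXP':                       # (EXP (t1 R1) ... (tn Rn))
        disjuncts = [deep(Ri) for (_t, Ri) in R[1:]]
        out = disjuncts[0]
        for d in disjuncts[1:]:             # disjunction of all instances
            out = ('or', out, d)
        return out
    raise ValueError(f'unknown node {head!r}')

# The tree of Example 4.2:
Q = ('EXP',
     ('u', ('SEL', 'v', ('implies', 'Pv', 'Pu'))),
     ('v', ('SEL', 'w', ('implies', 'Pw', 'Pv'))))
# deep(Q) is the disjunction [Pv implies Pu] or [Pw implies Pv]
```

This disjunction is tautologous (take Pv true or false), which is what makes the list structure an ET-proof rather than merely an expansion tree.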
4.2. Example. The expansion tree in Example 3.13 can be written as the list structure:

(EXP (u (SEL v (⊃ Pv Pu))) (v (SEL w (⊃ Pw Pv)))).

5. Natural Deductions
Beyond the fact that ET-proofs are sound and (relatively) complete for 𝒯, they also have several other pleasing properties, for both theoretical and practical concerns. We shall illustrate this claim by showing how ET-proofs can be converted to natural deduction-style proofs. This investigation is an immediate extension of the work described by Andrews in [4]. In that paper, Andrews showed how natural deduction proofs could be constructed by processing incomplete proofs, called outlines, in both a top-down and bottom-up fashion. In these outlines, certain lines, called sponsoring lines, were not justified. To each sponsoring line is associated a (possibly empty) list of justified lines which appear earlier in the proof and which might be required for completing the proof of the sponsoring line. These lines are called supporting lines. Proof lines which are either supporting or sponsoring are called active. Incomplete proofs built in this fashion are such that their assertions are subformulas of the original theorem. (Notice that in higher-order logic, this is stretching the usual meaning of subformulas.) Using this fact, we shall be able to attach to each active line an expansion tree (actually a list representation) for the assertion in that line. These expansion trees, which are essentially subtrees of the ET-proof of the original theorem, provide the information necessary to determine how an active line should be "processed."

Beyond the fact that the conversion process described below works for higher-order logic, this process differs in two other important ways from the process described in [4]. First, Andrews used a structure called a plan to provide the information which would indicate how to process active lines. ET-proofs, when restricted to first-order logic, contain the same kind of information as plans. Plans, however, are defined with respect to several global properties of formulas. This makes it awkward (in theory and practice) to construct new plans for new subproofs. Since subtrees or the negations of subtrees of expansion trees are themselves expansion trees, it is much easier to build new ET-proofs for new subproofs. Secondly, Andrews actually considered subproofs to be based on a sponsoring line and its hypotheses while we consider subproofs to be based on sponsoring lines and their supports. These differences allow us to give a complete analysis of this transformation process. Below we provide formal definitions for the concepts informally discussed above. In the rest of this paper, all ET-proofs will be assumed to be grounded.

5.1. Definition. By a natural deduction proof we mean a Suppes-style proof structure [21]. Such systems emphasize reasoning from hypotheses instead of axioms. An incomplete natural deduction proof is a list of proof lines some of which are justified by NJ, the non-justification label. Such lines represent subproofs which must be completed. The rules of inference in this system are those listed in [4] along with a rule for λ-conversion. The rules of existential generalization and universal instantiation are examples of two rules of inference.

5.2. Example. The following is an example of an incomplete natural deduction proof.

(1)   1    ⊢ ∃c ∀p .[∃u .pu] ⊃ .p.cp                                        Hyp
(2)   2    ⊢ ∀x ∃y .Pxy                                                     Hyp
(3)   3    ⊢ ∀p .[∃u .pu] ⊃ .p.cp                                           Hyp
(16)  2,3  ⊢ ∃f ∀z .Pz.fz                                                   NJ
(17)  1,2  ⊢ ∃f ∀z .Pz.fz                                                   RuleC: 1,16
(18)  1    ⊢ [∀x ∃y .Pxy] ⊃ .∃f ∀z .Pz.fz                                   Deduct: 17
(19)       ⊢ [∃c ∀p .[∃u .pu] ⊃ .p.cp] ⊃ .[∀x ∃y .Pxy] ⊃ .∃f ∀z .Pz.fz      Deduct: 18

Here c is a variable of type ι(οι), p is a variable of type οι, P is a variable of type οιι, f is a variable of type ιι, and x, y, u are variables of type ι.

In what follows, we shall use ⊥ to represent a false statement. It can be treated as an abbreviation for p ∧ ¬p. We shall also let ⊥ stand for both the expansion tree for ⊥ and for the list representation for this expansion tree. If ⊥ occurs as one of the
disjuncts of a formula, we shall assume that that formula is an abbreviation for the formula which results from removing ⊥ as a disjunct.

5.3. Definition. A proof outline, O, is a triple (L, p, {R_l}), where:

(1) L is a list of proof lines which forms an incomplete natural deduction proof. A line with the justification NJ corresponds to a subproof which must be completed. Let L₀ be the set of all line labels in L which have this justification. These are called the sponsoring lines of O.
(2) p is a function defined on L₀ such that whenever z ∈ L₀, p(z) ⊆ L \ L₀ and all the lines in p(z) precede z in the list L. Whenever l ∈ p(z), we say that z sponsors l, l supports z, z is a sponsoring line, and l is a supporting line. A line is active if it is either a supporting line or a sponsoring line which does not assert ⊥. (In the outlines we shall consider, only sponsoring lines may assert ⊥.)
(3) {R_l} represents a set of list structures, one for each active line, such that if l is a supporting line, then (¬R_l, ¬l) ∈ ℰ, and if l is a sponsoring line, then (R_l, l) ∈ ℰ.
(4) If line a supports line z then the hypotheses of a are a subset of the hypotheses of z.

If L₀ is not empty, we define the following formulas and expansion trees. For each z ∈ L₀ set A_z := [∨_{l∈p(z)} ¬l] ∨ z (where line labels stand for their assertions) and let Q_z be the expansion tree for A_z represented by the list structure (∨ (∨_{l∈p(z)} ¬R_l) R_z). The following condition must also be satisfied by an outline.

(5) If L₀ is not empty, then Q_z is a (grounded) ET-proof for A_z for each z ∈ L₀.

It is easy to show that O has an active line if and only if L₀ is not empty. We say that O is an outline for A if the last line in O has no hypotheses and asserts A.
The ET-proof Q_z roughly corresponds to a plan for the sponsoring line z as described in [4].

5.4. Definition. Let A be a formula and R a list representation of an ET-proof for A. Let z be the label for the proof line

(z)  ⊢ A   NJ,

and set L := (z), p(z) := ∅ and R_z := R. Then O₀ := (L, p, {R_z}) is clearly an outline. We call this outline the trivial outline for A based on R.

5.5. Example. An example of a proof outline is given by setting L = (1, 2, 3, 16, 17, 18, 19), p(16) = {2, 3} and

R₂ = (EXP (z (SEL y Pzy)))
R₃ = (EXP (Pz (⊃ (EXP (y Pzy)) Pz.c.Pz)))
R₁₆ = (EXP ([λv .c.Pv] (SEL z Pz.c.Pz)))

where the lines in L are those listed in Example 5.2. It is easy to verify that (∨ ¬R₂ (∨ ¬R₃ R₁₆)) represents an ET-proof of ¬2 ∨ ¬3 ∨ 16 and that (L, p, {R₂, R₃, R₁₆}) is an outline.
5.6. Definition. A formula t is admissible in O if no free variable in t is selected in R_l for any active line l.

The D- and P- (deducing and planning) transformations described in [4] can now be used in this setting if we describe how each such transformation attributes expansion trees to each new active line. We illustrate how this is done with the P-Conj and D-All transformations.

If some sponsoring line z in an outline O = (L, p, {R_l}) is of the form

(z)  ℋ ⊢ A₁ ∧ A₂   NJ

then R_z is of the form (∧ R₁ R₂). Applying P-Conj to line z will result in an outline O' = (L', p', {R'_l}), where L' contains the new sponsoring lines

(x)  ℋ ⊢ A₁   NJ
(y)  ℋ ⊢ A₂   NJ

and line z has its justification changed to RuleP: x, y. Also, p'(x) and p'(y) are set equal to p(z), and R'_x := R₁, R'_y := R₂. p' agrees on all other sponsoring lines of O', and R'_l := R_l for all active lines of O' other than x and y. This application of P-Conj has reduced the subproof based on line z to the two subproofs based on lines x and y.

If the outline O contains a supporting line a of the form

(a)  ℋ ⊢ ∀x P   RuleX

for some justification RuleX (other than NJ), then R_a has the form (EXP (t₁ R₁) ... (tₙ Rₙ)). If any one of the terms t₁, ..., tₙ is admissible within O, say tᵢ, then D-All can be applied to line a by doing a universal instantiation of it with tᵢ. L' is then equal to L with the line b, shown below, inserted after line a.

(b)  ℋ ⊢ [λx P]tᵢ   ∀I: a

Here it is assumed that in this substitution, bound variables are systematically renamed to avoid variable capture. Also, R_b := Rᵢ. If n ≥ 2 then line a must remain active, so R'_a is R_a with the pair (tᵢ Rᵢ) removed, and for each sponsoring line z such that a ∈ p(z), set p'(z) := p(z) ∪ {b} (i.e. b is a co-support with a). If n = 1, then line a is no longer active, so b replaces a as a support; that is, for each sponsoring line z such that a ∈ p(z), set p'(z) := p(z) \ {a} ∪ {b}. In either case, p'(z) := p(z) for all other sponsoring lines of O and R'_l := R_l for all active lines l ≠ a of O. It is straightforward to verify that O' = (L', p', {R'_l}) is an outline.

It is possible to show that at least one expansion term associated with such active lines in O must be admissible, so the requirement that the terms introduced in a universal instantiation (or introduced in a bottom-up fashion by P-Exists) be admissible is always possible to meet. This restriction to admissible terms is necessary to guarantee that
when variables are selected in the P-All and P-Choose transformations, they do not already have a free occurrence in the current proof outline. A simple, naive process of transforming an ET-proof, represented by the list structure R, for the theorem A, would then start by successively applying either D- or P-transformations to the trivial outline for A based on R and finish when all the subproofs generated can be recognized as instances of the RuleP transformation.
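The bookkeeping of the P-Conj transformation described above can be made concrete. The sketch below uses a hypothetical data model (dicts for the line list L, the sponsor function p, and the trees {R_l}; none of these names are from the paper) to split a sponsoring line z asserting A₁ ∧ A₂ into two new sponsoring lines x and y that inherit z's supports and the two halves of R_z.

```python
# Sketch of P-Conj bookkeeping on an outline O = (L, p, {R_l}).
# lines: label -> (assertion, justification); p: sponsor -> set of supports;
# R: label -> list structure. Formulas are tuples like ('and', A1, A2).

def p_conj(lines, p, R, z, x, y):
    """Apply P-Conj to sponsoring line z, creating new lines x and y."""
    assertion = lines[z][0]
    assert assertion[0] == 'and' and lines[z][1] == 'NJ'
    a1, a2 = assertion[1], assertion[2]
    _, r1, r2 = R[z]                        # R_z = ('and', R1, R2)
    lines2 = dict(lines)
    lines2[x] = (a1, 'NJ')                  # new sponsoring lines
    lines2[y] = (a2, 'NJ')
    lines2[z] = (assertion, f'RuleP: {x},{y}')   # z is now justified
    p2 = {k: v for k, v in p.items() if k != z}
    p2[x] = set(p[z])                       # x and y inherit z's supports
    p2[y] = set(p[z])
    R2 = {k: v for k, v in R.items() if k != z}  # z is no longer active
    R2[x], R2[y] = r1, r2
    return lines2, p2, R2
```

The D-All bookkeeping would be analogous: delete one expansion pair from R_a, attach its subtree to the new instantiated line, and adjust p accordingly.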
6. Focused Construction of Proof Outlines
The proof outlines produced by the naive method described above will often turn out to be very inelegant for at least two reasons, which we will examine here. An implementation of this naive algorithm was made in the computer program TPS (see [15]) and it was frequently found that many of the supporting lines for a given sponsoring line were not really needed to prove that sponsoring line. The naive algorithm contained no way of checking for this since it was provided with no ability to "look ahead." Hence, many applications of D- and P- rules were not necessary and the resulting, completed natural deduction proofs were much longer and more redundant than necessary. The naive algorithm was also not equipped to recognize when it could backchain on a supporting line which asserted an implication, since backchaining also requires looking ahead to see if it can actually be applied. Hence, the naive procedure always treated such implicational lines in the most general possible way, by using the equivalent disjunctive form in an argument from cases. Implicational support lines were always used in a very unnatural fashion. The information which would supply a transformation process with the necessary ability to look ahead is contained in a mating which is present in the tautology encoded in the ET-proofs of each subproof of a given outline. We now need several definitions.

6.1. Definition. If 𝒜₁ and 𝒜₂ are sets, define 𝒜₁ ⊎ 𝒜₂ := {e₁ ∪ e₂ | e₁ ∈ 𝒜₁, e₂ ∈ 𝒜₂}. Let D be a λ-normal formula. We shall define two sets, 𝒞_D and 𝒱_D, which are both sets of sets of b-atom subformula occurrences in D, by joint induction on the boolean structure of D. 𝒞_D is the set of clauses in D while 𝒱_D is the set of "dual" clauses in D. Dual clauses have been called vertical paths by Andrews (see [5]).

(1) If D is a b-atom, then 𝒞_D := {{D}} and 𝒱_D := {{D}}.
(2) If D = ¬D₁ then 𝒞_D := 𝒱_{D₁} and 𝒱_D := 𝒞_{D₁}.
(3) If D = D₁ ∨ D₂ then 𝒞_D := 𝒞_{D₁} ⊎ 𝒞_{D₂} and 𝒱_D := 𝒱_{D₁} ∪ 𝒱_{D₂}.
(4) If D = D₁ ∧ D₂ then 𝒞_D := 𝒞_{D₁} ∪ 𝒞_{D₂} and 𝒱_D := 𝒱_{D₁} ⊎ 𝒱_{D₂}.
(5) If D = D₁ ⊃ D₂ then 𝒞_D := 𝒱_{D₁} ⊎ 𝒞_{D₂} and 𝒱_D := 𝒞_{D₁} ∪ 𝒱_{D₂}.

6.2. Definition. Let D be a λ-normal formula. Let ℳ be a set of unordered pairs such that if {H, K} ∈ ℳ and H and K are b-atom subformula occurrences in D, then H and K are contained in a common clause in D, H conv K, and either H occurs positively and K occurs negatively in D, or H occurs negatively and K occurs positively in D. Such a set ℳ is called a mating for D. If {H, K} ∈ ℳ we say that H and K are ℳ-mated, or simply mated if the mating can be determined from context. If it is also the case that for all c ∈ 𝒞_D there is a {H, K} ∈ ℳ such that {H, K} ⊆ c, then we say that ℳ is a clause-spanning mating (cs-mating, for short) for D. In this case, we shall also say that ℳ spans D. If 𝒟 is a set of λ-normal formulas, we say that ℳ is a mating (cs-mating) for 𝒟 if ℳ is a mating (cs-mating) for ∨𝒟. Here, the order by which the disjunction ∨𝒟 is constructed is taken to be arbitrary but fixed.

The notion of a mating used by Andrews in [5] is a bit more general than the one we have defined here. In that paper, a mating, ℳ, is a set of ordered pairs, (H, K), such that there is a substitution θ which makes all such pairs complementary, i.e. θK = ¬θH. Except for this difference, the notion of a cs-mating corresponds very closely to his notion of a p-acceptable mating. Bibel in [7] also exploits matings for various theorem proving and metatheoretical applications.

6.3. Proposition. Let D be in λ-normal form. D is tautologous if and only if D has a cs-mating.
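Definition 6.1 and Proposition 6.3 have a direct computational reading. The Python sketch below (a hypothetical encoding added for illustration, not from the paper) computes 𝒞_D and 𝒱_D by the joint induction of Definition 6.1, tagging each atom occurrence with its polarity in D, and then checks tautology in the spirit of Proposition 6.3 specialized to ground b-atoms, where mated occurrences are occurrences of the same atom with opposite polarity.

```python
# Sketch: clauses and dual clauses of Definition 6.1, with (+) the pairwise
# union operation, and the ground-literal tautology test of Proposition 6.3.
# Formulas: strings are b-atoms; tuples are ('not', D1), ('or', D1, D2),
# ('and', D1, D2), ('implies', D1, D2).

def cross(A1, A2):
    """A1 (+) A2: union every member of A1 with every member of A2."""
    return [c1 | c2 for c1 in A1 for c2 in A2]

def cd(D, pol=True):
    """Return (C_D, V_D); literals are (atom, polarity-in-D) pairs."""
    if isinstance(D, str):                  # b-atom
        unit = [frozenset([(D, pol)])]
        return unit, unit
    head = D[0]
    if head == 'not':
        c1, v1 = cd(D[1], not pol)
        return v1, c1                       # C and V swap under negation
    if head == 'or':
        c1, v1 = cd(D[1], pol); c2, v2 = cd(D[2], pol)
        return cross(c1, c2), v1 + v2
    if head == 'and':
        c1, v1 = cd(D[1], pol); c2, v2 = cd(D[2], pol)
        return c1 + c2, cross(v1, v2)
    if head == 'implies':                   # D1 occurs negatively
        c1, v1 = cd(D[1], not pol); c2, v2 = cd(D[2], pol)
        return cross(v1, c2), c1 + v2
    raise ValueError(head)

def tautologous(D):
    """Every clause contains a complementary (hence matable) pair."""
    C, _ = cd(D)
    return all(any((a, not p) in c for (a, p) in c) for c in C)
```

On the formula ∨𝒟₁₆ of Example 6.7 below, every clause contains a pair of occurrences of the same atom with opposite polarity, so the test succeeds, matching the cs-mating given there.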
6.4. Definition. Let 𝒟 be a finite, nonempty set of formulas, and let ℳ be a mating for 𝒟. With respect to 𝒟 and ℳ, define ~₀ to be the binary relation on 𝒟 such that when D₁, D₂ ∈ 𝒟, D₁ ~₀ D₂ if D₁ contains a b-atom subformula occurrence H and D₂ contains a b-atom subformula occurrence K such that {H, K} ∈ ℳ. Let ~ be the reflexive, transitive closure of ~₀. Clearly ~ is an equivalence relation on 𝒟. If D ∈ 𝒟, we shall write [D]~ to denote the equivalence class (partition) of 𝒟 which contains D. The following proposition is easily proved.

6.5. Proposition. Let 𝒟 be a finite, nonempty set of formulas. If ℳ is a cs-mating for 𝒟 then ℳ spans at least one of the ~-partitions of 𝒟. The converse is trivially true.

6.6. Definition. Let O = (L, p, {R_l}) be an outline. Let D_l be the formula Dp(R_l) if l is a sponsoring line or Dp(¬R_l) if l is a supporting line. Now define 𝒟_z := {D_z} ∪ {D_l | l ∈ p(z)} if z does not assert ⊥ and 𝒟_z := {D_l | l ∈ p(z)} otherwise. Notice that for each z ∈ L₀, Dp(Q_z) = ∨𝒟_z. Now let ℳ_z be a cs-mating for Dp(Q_z) for each z ∈ L₀ and set ℳ := ∪_{z∈L₀} ℳ_z. ℳ is called a cs-mating for O. (Notice that ℳ is also a cs-mating for each Dp(Q_z).) We say that O is ℳ-focused if for each z ∈ L₀, 𝒟_z is composed of exactly one ~-partition.
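The ~-partitions of Definition 6.4 can be computed with a standard union-find pass over the mated pairs. The sketch below (a hypothetical representation added for illustration, not from the paper) indexes the formulas of 𝒟 as 0..n-1 and takes the mating as pairs of (formula index, atom occurrence) entries; only the formula indices matter for building the partitions.

```python
# Sketch: equivalence classes of the relation ~ from Definition 6.4, via
# union-find over the formula indices connected by mated occurrence pairs.

def partitions(n, mating):
    """mating: iterable of ((i, H), (j, K)) mated occurrence pairs."""
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]   # path compression
            i = parent[i]
        return i

    for (i, _H), (j, _K) in mating:
        parent[find(i)] = find(j)           # D_i ~0 D_j, so merge classes

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), set()).add(i)
    return list(groups.values())

# Example 6.7 below: formulas D2, D3, D16 (indices 0, 1, 2); the two mated
# pairs chain all three together, so there is a single ~-partition.
M = [((0, 'Pzy'), (1, 'Pzy')), ((1, 'Pzcz'), (2, 'Pzcz'))]
parts = partitions(3, M)
```

An outline is ℳ-focused exactly when this computation returns a single class for each 𝒟_z; more than one class signals that the thinning transformation described below applies.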
6.7. Example. If O is the outline in Example 5.5, then

D₂ = ¬Pzy
D₃ = ¬[Pzy ⊃ Pz.c.Pz]
D₁₆ = Pz.c.Pz
∨𝒟₁₆ = ¬Pzy ∨ ¬[Pzy ⊃ Pz.c.Pz] ∨ Pz.c.Pz

Notice that ∨𝒟₁₆ is tautologous. If we let A1, A2, A3, A4 represent the four b-atom occurrences in ∨𝒟₁₆ then a cs-mating for ∨𝒟₁₆ would be {{A1, A2}, {A3, A4}}.

Let O = (L, p, {R_l}) be an outline and let ℳ be a cs-mating for O. If O is not ℳ-focused, then there must be a z ∈ L₀ such that 𝒟_z has too many members, i.e. there are at least two ~-partitions of 𝒟_z. What we need is a thinning outline transformation which will permit us to deactivate lines in O, thereby removing elements from 𝒟_z. As long as the resulting 𝒟_z is still spanned by ℳ, the result of the thinning transformation will satisfy the requirements of being an outline.

The thinning transformation works as follows. Let outline O and a cs-mating ℳ for O be such that O is not ℳ-focused. Let z be a sponsoring line such that 𝒟_z contains
more than one ~-partition. By Proposition 6.5, there is at least one ~-partition 𝒫 ⊆ 𝒟_z such that ℳ spans 𝒫. Set 𝒫' := 𝒟_z \ 𝒫. For each supporting line l of z such that D_l ∈ 𝒫', the thinning transformation modifies the value of p(z) by removing l from it. If it is the case that D_z ∈ 𝒫', then the supporting lines in 𝒫 are strong enough to prove ⊥, from which the assertion in line z follows immediately. In this case, the thinning transformation must add the new sponsoring line

(y)  ℋ ⊢ ⊥   NJ,

where ℋ is the set of hypotheses for line z. The justification for line z is changed to RuleP: y. The supports for line y are those lines which were supporting line z and were not thinned out as described above.
7. Backchaining
Using the mating information contained in the Dp-values of the expansion trees associated with each active line of an outline provides the outline transformation process with enough information to look ahead and identify unnecessary supporting and sponsoring lines. This same look ahead will help us determine when we should backchain on an implicational support line. Consider the outline fragment

(a)  ℋ ⊢ ¬A₁ ∨ A₂   RuleX
(z)  ℋ ⊢ B          NJ

(O')

where we have already determined that line a is a necessary support of line z, and RuleX is the justification for line a. One way to use line a in proving line z is to apply P-Cases (see [4]) to the lines in (O'), which would then yield the following lines.
(a)  ℋ    ⊢ ¬A₁ ∨ A₂   RuleX
(b)  b    ⊢ ¬A₁        Hyp
(m)  ℋ,b  ⊢ B          NJ
(n)  n    ⊢ A₂         Hyp
(y)  ℋ,n  ⊢ B          NJ
(z)  ℋ    ⊢ B          Cases: a, m, y

(T₁)

It may turn out that this new outline is no longer focused for at least two reasons. First, line m may be proved indirectly from its supports, which now include line b. In other words, 𝒟_m may contain a partition 𝒫 such that D_b ∈ 𝒫 but D_m ∉ 𝒫. Hence, ¬A₁ is used to prove ⊥. The proof could, therefore, be reorganized so that we instead try to prove A₁ directly. In this case, we should apply the new D-ModusPonens transformation to the lines in (O') to yield the following lines.
(a)  ℋ ⊢ ¬A₁ ∨ A₂   RuleX
(m)  ℋ ⊢ A₁         NJ
(n)  ℋ ⊢ A₂         RuleP: a, m
(y)  ℋ ⊢ A₂ ⊃ B     NJ
(z)  ℋ ⊢ B          RuleP: y, n

(T₂)

Lines m and y are new sponsoring lines and they share the supports which z had, less line a. Notice that R_a has the form (∨ ¬R₁ R₂) for some list structures R₁ and R₂. In the new outline, we set R'_m := R₁ and R'_y := (⊃ R₂ R_z). The new outline will be focused. Another way the outline containing the lines in (T₁) may not be focused is that line y is proved indirectly from its supports. In this case, we need to backchain on the contrapositive form of line a, i.e. we should apply the new D-ModusTollens on the lines in (O') to yield the following lines.
(a)  ℋ ⊢ ¬A₁ ∨ A₂   RuleX
(m)  ℋ ⊢ ¬A₂        NJ
(n)  ℋ ⊢ ¬A₁        RuleP: a, m
(y)  ℋ ⊢ ¬A₁ ⊃ B    NJ
(z)  ℋ ⊢ B          RuleP: y, n

If in fact the outline containing the lines in (T₁) was focused, then neither D-ModusPonens nor D-ModusTollens could be used on line a, and we actually needed to treat line a as a disjunction by applying P-Cases. Of course, all these comments apply equally well when line a asserts a formula of the form A₁ ⊃ A₂.
8. Other Forms of Natural Deduction
There are several different formats of proofs which have been called natural deduction, and, at first glance, the problems encountered in converting ET-proofs to these other proof formats might appear to be quite different from the problems encountered in building the Suppes-style proofs of the previous sections. This is generally not the case. For example, the transformation process already described produces, in a sense, proofs in Gentzen's LK format [13]. For each sponsoring line z in a given outline, consider the sequent p(z) → z, where line labels are used to refer to their assertions. Hence, to each outline there corresponds a set of sequents which represent the unfinished subproofs of that outline. The D- and P- transformations can then be seen as ways of taking the sequents of one outline and replacing some of them with logically simpler sequents. These simpler sequents can then be joined using derived rules of the LK-calculus to yield the sequents they replace. In this fashion, an entire LK derivation can be built. Of course, for this to work in higher-order logic, we would need to add an inference rule for λ-conversion, but this is the only essential addition needed for this accommodation. LK derivations built in this fashion will contain no instances of the cut inference rule. Thus, by using our relative completeness result for ET-proofs, if A is a theorem of 𝒯, A has an ET-proof which can be converted to a cut-free LK derivation. Via the transformation process, our version of Herbrand's theorem can thus be used to prove Gentzen's Hauptsatz. See [16] for a complete account of how ET-proofs can be converted to LK derivations.
9. Acknowledgements

I would like to thank Peter Andrews and Frank Pfenning for many valuable comments concerning this paper and the work reported in it.
10. Bibliography

[1] Peter B. Andrews, "Resolution in Type Theory," Journal of Symbolic Logic 36 (1971), 414-432.

[2] Peter B. Andrews, "Provability in Elementary Type Theory," Zeitschrift für Mathematische Logik und Grundlagen der Mathematik 20 (1974), 411-418.

[3] Peter B. Andrews and Eve Longini Cohen, "Theorem Proving in Type Theory," Proceedings of the Fifth International Joint Conference on Artificial Intelligence, 1977, 566.

[4] Peter B. Andrews, "Transforming Matings into Natural Deduction Proofs," Fifth Conference on Automated Deduction, Les Arcs, France, edited by W. Bibel and R. Kowalski, Lecture Notes in Computer Science, No. 87, Springer-Verlag, 1980, 281-292.

[5] Peter B. Andrews, "Theorem Proving via General Matings," Journal of the Association for Computing Machinery 28 (1981), 193-214.

[6] Maria Virginia Aponte, Jose Alberto Fernandez, and Philippe Roussel, "Editing First-order Proofs: Programmed Rules vs. Derived Rules," Proceedings of the 1984 International Symposium on Logic Programming, 92-97.

[7] Wolfgang Bibel, "Matrices with Connections," Journal of the Association for Computing Machinery 28 (1981), 633-645.

[8] W. W. Bledsoe, "A Maximal Method for Set Variables in Automatic Theorem Proving," in Machine Intelligence 9, edited by J. E. Hayes, Donald Michie, and L. I. Mikulich, Ellis Horwood Ltd., 1979, 53-100.

[9] W. W. Bledsoe, "Using Examples to Generate Instantiations for Set Variables," University of Texas at Austin Technical Report ATP-67, July 1982.

[10] Alonzo Church, "A Formulation of the Simple Theory of Types," Journal of Symbolic Logic 5 (1940), 56-68.

[11] Gerard P. Huet, "A Mechanization of Type Theory," Proceedings of the Third International Joint Conference on Artificial Intelligence, 1973, 139-146.

[12] Gerard P. Huet, "A Unification Algorithm for Typed λ-calculus," Theoretical Computer Science 1 (1975), 27-57.

[13] Gerhard Gentzen, "Investigations into Logical Deductions," in The Collected Papers of Gerhard Gentzen, edited by M. E. Szabo, North-Holland Publishing Co., Amsterdam, 1969, 68-131.

[14] D. C. Jensen and T. Pietrzykowski, "Mechanizing ω-Order Type Theory Through Unification," Theoretical Computer Science 3 (1976), 123-171.

[15] Dale A. Miller, Eve Longini Cohen, and Peter B. Andrews, "A Look at TPS," 6th Conference on Automated Deduction, New York, edited by Donald W. Loveland, Lecture Notes in Computer Science, No. 138, Springer-Verlag, 1982, 50-69.

[16] Dale A. Miller, "Proofs in Higher-order Logic," Ph.D. Dissertation, Carnegie-Mellon University, August 1983. Available as Technical Report MS-CIS-83-37 from the Department of Computer and Information Science, University of Pennsylvania.

[17] Frank Pfenning, "Analytic and Non-analytic Proofs," elsewhere in these proceedings.

[18] T. Pietrzykowski and D. C. Jensen, "A Complete Mechanization of ω-Order Type Theory," Proceedings of the ACM Annual Conference, Volume 1, 1972, 82-92.

[19] Tomasz Pietrzykowski, "A Complete Mechanization of Second-Order Type Theory," Journal of the Association for Computing Machinery 20 (1973), 333-364.

[20] J. A. Robinson, "Mechanizing Higher-Order Logic," Machine Intelligence 4, Edinburgh University Press, 1969, 151-170.

[21] Patrick Suppes, Introduction to Logic, D. Van Nostrand Company Ltd., Princeton, 1957.

[22] Moto-o Takahashi, "A Proof of Cut-elimination Theorem in Simple Type-theory," Journal of the Mathematical Society of Japan 19 (1967), 399-410.

[23] Alfred Tarski, "A Lattice-theoretical Fixpoint Theorem and Its Applications," Pacific Journal of Mathematics 5 (1955), 285-309.
Analytic and Non-analytic Proofs

Frank Pfenning
Department of Mathematics
Carnegie-Mellon University
Pittsburgh, PA 15213
0.
Abstract
In automated theorem proving different kinds of proof systems have been used. Traditional proof systems, such as Hilbert-style proofs or natural deduction, we call non-analytic, while resolution or mating proof systems we call analytic. There are many good reasons to study the connections between analytic and non-analytic proofs. We would like a theorem prover to make efficient use of both analytic and non-analytic methods to get the best of both worlds.
In this paper we present an algorithm for translating from a particular non-analytic proof system to analytic proofs. Moreover, some results about the translation in the other direction are reformulated and known algorithms improved. Implementation of the algorithms presented for use in research and teaching logic is under way at Carnegie-Mellon University in the framework of TPS and its educational counterpart ETPS. Finally we show how to obtain non-analytic proofs from resolution refutations. As an application, resolution refutations can be translated into comprehensible natural deduction proofs.
1.
Introduction
In automated theorem proving different kinds of proof systems have been used. Traditional proof systems, such as Hilbert-style proofs or natural deduction, we call non-analytic, while resolution or mating proof systems we call analytic. There are many good reasons to study the connections between analytic and non-analytic proofs. We would like a theorem prover to make efficient use of both analytic and non-analytic methods to get the best of both worlds. The advantages of analytic proofs are well known. One of the most important advantages is that they seem to be ideally suited for an efficient automatic search for a proof on the computer. On the other hand there is much to gain from the use of non-analytic proof systems in addition to analytic methods. Non-analytic proofs can be presented in a comprehensible and pleasing format. If we can translate, say, resolution refutations into legible non-analytic proofs, we can help the mathematician understand the automatically generated proof. Valuable work here has been done by Miller [10]. The natural deduction proofs obtained from mating refutations are often elegant and easy to understand; they use such mathematically common concepts as proof by contradiction and case analysis, and make use of intuitive operations such as backchaining. Better translations, which are the object of current research, would make this even more useful for a wider class of theorems. The ability to freely translate between analytic and non-analytic proofs also gives us a tool for creating a more elegant natural deduction style proof from a given one. We would
translate a given proof into an analytic proof, possibly transform this analytic proof into a shorter one, and then build a new natural deduction style proof from it in a canonical fashion. Good translation procedures can also serve as a valuable research tool. Heuristics and lemmas of use to a theorem prover can often be discovered and formulated naturally in some non-analytic proof style. The ability to translate these into an analytic format may help to incorporate them into a theorem prover. Moreover, if we can translate automatic proofs obtained with and without a certain heuristic, we may gain deeper insight into the nature and performance of the heuristics. Another, perhaps more immediately important, application is in the use of these procedures in computer-aided instruction in logic. The student will attempt his proof in a deductive format, e.g. in a natural deduction style, on the computer. The analytic proof of the exercise can be found beforehand by an automated theorem prover employing resolution or a mating procedure, or even constructed from a sample natural deduction proof given by the teacher. This analytic proof can then be used to guide the student through his own attempts to prove the theorem by suggesting which inference rules may be appropriate when the student asks for help. Moreover, when the student is done, a "normalizing" procedure like the one described above can demonstrate to the student how he might have proven the theorem more elegantly or efficiently. A system called ETPS, which will contain all these features, is currently under development at Carnegie-Mellon University. There is also a very good complexity-theoretic reason why a theorem prover may want to make use of non-analytic as well as analytic methods. A result by Statman [14] shows that there are theorems which have "short" non-analytic proofs, but no "short" analytic proofs whatsoever. He exhibits a sequence of theorems (from the theory of combinators) whose
shortest possible analytic proof has length at least 2^2^···^l, a tower of 2's of height d (here d is the number of connectives and quantifiers of a theorem X, and l the length of a non-analytic proof for X). This lower bound is not Kalmar-elementary, and there are therefore theorems with short non-analytic proofs which cannot be practically proven by purely analytic methods.
Let us now try to make more precise the distinction between analytic and non-analytic proof systems. The term "analytic" was introduced by Smullyan in [13] and conveys the idea that the proof (or refutation) procedure analyzes the given formula. An analytic proof has a very strong subformula property: only subformulas of the theorem and their instances will appear in an analytic proof. In the field of automated deduction the discovery of analytic proof systems such as resolution [12] went hand in hand with the beginning of research. The mating approach [3] and a similar method by Bibel [4] are other examples of analytic proof systems. Examples of non-analytic methods in automated theorem proving can be found in Bledsoe's survey [6] of non-resolution theorem proving. This includes approaches like term-rewriting, built-in inequalities, forward-chaining, models, and even counterexamples. Some of these approaches may be called non-analytic, since they sometimes consider formulas not part of the proposed theorem. Many of the stimuli here come from mathematics rather than pure logic. Hilbert-style, Gentzen-style [7], or natural deduction style systems are all examples of traditional non-analytic proof systems. In general they do not obey the subformula property. Usually Cut or Modus Ponens is used to eliminate the helpful formulas, which are not part of the theorem, but substitutivity of equivalence or equality may be used as well. The use of Cut itself does not characterize non-analytic proof systems, as can be seen from the case of resolution, where the cut formulas are all subformulas of the given theorem.
Andrews has shown in [2] how to convert matings into natural deduction proofs. Miller [9] took this work further by generalizing it to higher-order logic and also addressing questions of style in these proofs. Some related work was also done by Bibel in [5]. An algorithm translating in the other direction is the main contribution of this paper. The ability to readily translate in either direction between analytic and non-analytic proofs (in the case of the implementation in TPS between expansion proofs and natural deduction style proofs) gives us all the aforementioned advantages. As a representative of non-analytic proof systems we pick I*, mainly for its conceptual clarity and simplicity of cut-elimination. I*, which is described in section 2, is closely related to the system LK of Gentzen [7] and a related system of Smullyan [13]. Following Miller in [9], who works in the setting of higher-order logic, we define a purely analytic proof system in section 3. Expansion proofs, as they are called, are very natural and convenient and very concisely represent the information contained in an analytic proof. In section 4 we give a new exposition of part of Miller's work in terms of our analytic and non-analytic first-order proof systems. This exposition provides the reader with a self-contained and unified treatment of the translations between the various proof styles. We also handle conjunction in a new way, thus creating stylistically different proofs. As the main part of this paper, we give an explicit algorithm which translates I*-proofs into expansion proofs in sections 5, 6, and 7. Expansion proofs are very much different from the kind of analytic proofs Gentzen or Smullyan considered, though some of their ideas, in particular for cut-elimination, are used.
Our merge algorithm, which deals with the inference rule Contraction, is a significantly improved version of Miller's [9] MERGE, which generally produces much larger expansion trees. Andrews in [1] has given an algorithm which computes a mating from a resolution refutation. In section 8 we state and prove the correctness of a different algorithm which translates resolution refutations into expansion proofs; these make no use of Skolem functions or conjunctive normal forms and satisfy a quite different acceptability criterion from Andrews'. We thus give a two-step procedure by which resolution refutations can be translated into I*-proofs, or, in one more step, into natural deduction proofs. Space does not permit to include here non-trivial examples illustrating the various algorithms. Detailed examples for all the translation procedures presented here are given by the author in [11].
2.
The Systems I and I*
Our non-analytic proof system is I*, which builds upon similar systems of Gentzen [7] and Smullyan [13]. I* is particularly well suited for the description of our algorithms. Notice, for instance, that any theorem derived in I* is automatically in negation normal form. The work done here can easily be generalized to other superficially richer systems of first-order logic. To simplify some of our exposition we introduce a system I which is identical to I* but does not contain the rule of Mix (a variant of Cut). Our formulation of first-order logic includes the propositional connectives ∨, ∧, ¬, the quantifiers ∃ and ∀, and an infinite number of individual variables and constants. Function constants of arbitrary finite arity are also permitted. An atomic formula is of the form P t_1 ... t_n for an n-ary predicate P and terms t_1, ..., t_n. A literal is of the form A or ¬A for an atomic formula A. A formula is in negation normal form if the scope of each negation is
atomic. Each first-order formula has a classically equivalent formula in negation normal form, and we generally assume our formulas to be in negation normal form. X[v/a] is our notation for the result of substituting a for the free occurrences of v in X. We write nnformula for a formula in negation normal form. We do not assume that formulas are alphabetically normal, except in section 8, where we talk about resolution refutations. Sometimes we write ∧∨ to indicate that an equation is valid for both conjunction and disjunction. Nodes in a proof-tree in I we call lines. A line in I is a multi-set of formulas. This formulation is halfway between Gentzen's (sequents) and Smullyan's (sets). The reason for choosing this particular representation lies in the fact that contraction is an extremely powerful inference rule of our system. When we try to analyze how the effect of a contraction induces a change in an associated expansion tree, we will see that the transformation is really quite complicated. Thus we cannot leave contraction implicit, as Smullyan did when he introduced sets of formulas as objects in the proof. Structural rules like exchange, however, have no impact on the logical contents of the formula or proof line. We therefore leave them implicit in the multi-set notation. In general we let U and V stand for multi-sets of formulas, i.e. sets where we allow the same formula to appear more than once as a member. We often write U, X to mean U ∪ {X} if U is a multi-set. The axioms of I are of the form

    U, A, ¬A

where A is an atomic formula. The inference rules can be divided into structural rules, propositional rules, and quantificational rules. The only structural rule in I is contraction (C). There is one propositional rule for each propositional connective: ∨-introduction (∨I) and ∧-introduction (∧I). There is also exactly one rule for each of the quantifiers: ∃-introduction (∃I) and ∀-introduction (∀I).

Structural rules

Contraction:
    U, X, X
    ------- C
    U, X
Propositional rules
    U, X, Y
    -------- ∨I
    U, X ∨ Y

    U, X    V, Y
    ------------- ∧I
    U, V, X ∧ Y
Quantificational rules
    U, X[v/t]
    ---------- ∃I,  t a term free for v in X.
    U, ∃vX

    U, X[v/a]
    ---------- ∀I,  a not free in U, ∀vX.
    U, ∀vX

U, V contain the side-formulas of an inference rule. They may be empty. The propositional and quantificational inference rules correspond to Smullyan's [13] rules α, β, γ, δ. System I is complete in the sense that we can derive the negation normal form of every valid formula in classical first-order logic. This follows almost immediately from Smullyan's
form of the completeness result for Gentzen systems and we will not repeat the argument here. We shall also use the system I*, which contains the rule of Mix:

    U, X    V, ¬X
    -------------- Mix
    U, V

Here ¬X is understood as the negation normal form of the negation of X. There must be at least one occurrence of X, the mix formula, in the left premise and at least one occurrence of ¬X in the right premise. Mix was introduced by Gentzen and is a variant of the rule of Cut; the two are easily shown to be equivalent.
3.
Expansion Trees
Analytic proofs in this paper are presented as expansion trees. Expansion trees very concisely and naturally represent the information contained in an analytic proof, as we hope to show. They were first introduced by Miller [9] and are somewhat similar to Herbrand expansions [8]. Some redundancies can easily be eliminated for an actual implementation, as done by Miller in the context of higher-order logic. The shallow formula of an expansion tree will correspond to the theorem; the deep formula is akin to a Herbrand expansion proving the theorem. Our formulation of expansion trees differs only trivially from Miller's in [10], if restricted to first-order logic. At several places it is convenient to allow n-ary conjunction and disjunction instead of treating them as binary operations.

3.1. Definition.
We define expansion trees inductively. Simultaneously, we also define Q^D, the deep formula of an expansion tree, which is always quantifier-free, and Q^S, the shallow formula of an expansion tree. We furthermore place the restriction that no variable in an expansion tree may be selected more than once.

(i) A literal l (signed atom) is an expansion tree. Q^D(l) = Q^S(l) = l. Literals form the leaves of expansion trees.

(ii) If Q_1, ..., Q_n, n ≥ 2, are expansion trees, so is the tree Q with root ∧ (respectively ∨) and immediate subtrees Q_1, ..., Q_n. Then Q^D = Q_1^D ∧ ... ∧ Q_n^D and Q^S = Q_1^S ∧ ... ∧ Q_n^S (respectively with ∨).

(iii) If Q_1, ..., Q_n are expansion trees such that Q_1^S = S[v/t_1], ..., Q_n^S = S[v/t_n], t_i a term free for v in S for 1 ≤ i ≤ n, n ≥ 1, then the tree Q with root ∃vS and immediate subtrees Q_1, ..., Q_n is an expansion tree. Then Q^D = Q_1^D ∨ ... ∨ Q_n^D and Q^S = ∃vS. ∃vS is called an expansion node; v is the expanded variable; t_1, ..., t_n are the expansion terms.

(iv) If Q_0 is an expansion tree such that Q_0^S = S[v/a] for a variable a, then the tree Q with root ∀vS and single immediate subtree Q_0 is an expansion tree. Then Q^D = Q_0^D and Q^S = ∀vS. ∀vS is called a selection node; a is the variable selected for this occurrence of v.

To improve legibility of our diagrams we will frequently draw just the shallow formula X for an expansion tree with Q^S = X.
Since traditional proof systems do not contain Skolem functions, we need a different mechanism to insure the soundness of our proofs. Following an idea of Bibel [4], which was picked up by Miller [9], we introduce a relation <_Q on occurrences of expansion terms. The condition that <_Q be acyclic will insure soundness.

3.3. Definition. Let X be a quantifier-free nnformula. The clauses in X are defined inductively:

(i) X = l, a literal. Then C = (l) is the only clause in X.

(ii) X = A ∨ B. Then for all clauses (a_1, ..., a_n) in A and (b_1, ..., b_m) in B, C = (a_1, ..., a_n, b_1, ..., b_m) is a clause in A ∨ B.

(iii) X = A ∧ B. Then all clauses in A and all clauses in B are clauses in A ∧ B.
3.4. Definition. A relation on literal occurrences in a quantifier-free nnformula X is a mating M if ¬l = k for every pair (l, k) ∈ M and there is at least one clause in X containing both l and k. If (l, k) ∈ M, l and k are said to be M-mated.

3.5. Definition. A mating M is said to span a clause C if there are literals l, k ∈ C such that (l, k) ∈ M. A mating M is said to be clause-spanning on a quantifier-free nnformula X if every clause in X is spanned by M. The significance of this definition is of course that a quantifier-free nnformula X is tautologous iff there is a mating clause-spanning on X (see Andrews [3], [1], and Miller [9]).

3.6. Definition. A pair (Q, M) is called an expansion tree proof for an nnformula X if
(i) Q^S = X.

(ii) No selected variable is free in Q^S.

(iii) M is a clause-spanning mating on Q^D.

(iv) The relation <_Q is acyclic.
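Definitions 3.3 to 3.5 can be animated directly. The sketch below is our own illustration (the list-of-tuples encoding is an assumption, not the paper's): it computes the clauses of a quantifier-free nnformula and checks the tautology criterion of 3.5 by asking whether every clause contains a complementary, M-matable pair of literals.

```python
def clauses(x):
    """Clauses of a quantifier-free nnformula, following the inductive definition."""
    tag = x[0]
    if tag == 'lit':                      # (i) a literal forms its own clause
        return [[x]]
    if tag == 'or':                       # (ii) concatenate clauses pairwise
        return [a + b for a in clauses(x[1]) for b in clauses(x[2])]
    return clauses(x[1]) + clauses(x[2])  # (iii) 'and': union of the clauses

def spanned(clause):
    """A clause is spanned iff it contains a complementary pair of literals."""
    return any(l[1] == k[1] and l[2] != k[2]
               for i, l in enumerate(clause) for k in clause[i + 1:])

def tautologous(x):
    """X is tautologous iff some mating is clause-spanning on X."""
    return all(spanned(c) for c in clauses(x))
```

With A = ('lit', 'A', True) and its negation, `tautologous` accepts A ∨ ¬A but rejects A ∧ ¬A, as the mating criterion demands.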
4.
Building I-Proofs from Expansion Tree Proofs
The algorithm follows ideas of Miller [9], but we provide a different treatment of conjunction. Our algorithm results in shorter proofs than the more naive algorithm that always applies case (vii) below for a conjunction, but we do not achieve the full power of Miller's focusing method. In return, our method is computationally faster. In the exposition below we sometimes assume that there is a unique correspondence between the formulas in a line and an associated expansion tree, even though we like to think of the line as a multi-set where several identical members are indistinguishable. In general it is sufficient to pick any correspondence between those multiple occurrences of a formula in a line and the unique subtrees of the associated expansion tree.

4.1. Definition. A pair (Q, M) is an expansion tree proof for a line L = X_1, ..., X_n in an I-proof iff (Q, M) is an expansion tree proof for X_1 ∨ ··· ∨ X_n.
4.2. Definition. Let (Q, M) be an expansion tree proof for a line L in an I-proof, and let X be a subformula of an element in L. Then Q|_X is the part of the expansion tree Q representing X (Q|_X^S = X), and M|_X is the restriction of M to pairs both of whose elements lie in Q|_X. We will sometimes talk about X^D instead of Q|_X^D, if the expansion tree Q is clear from the context.

We shall describe an algorithm which constructs an I-proof from an expansion tree proof, starting with the nnformula to be proven and working upwards until every branch in the proof tree begins with an axiom. The cases given below can in principle be applied in any order. The ordering below will often, but not in general, result in the shortest proof that can be constructed with this algorithm. If an X ∈ L is such that Q|_X has no literal in a pair in M, then X is to be ignored and can only be part of a side-formula in an inference above L. Now assume L is a given line in an I-proof, and (Q, M) is an expansion tree proof for L.

(i) L = U, A, ¬A. Then L is an axiom.
(ii) L = U, X ∨ Y. Infer L by

    U, X, Y
    -------- ∨I
    U, X ∨ Y

(Q, M) is an expansion tree proof for U, X, Y.
(iii) L = U, ∀vS. Infer L by

    U, S[v/a]
    ---------- ∀I
    U, ∀vS

where a is the variable selected for this occurrence of ∀vS. In Q we replace the corresponding subtree rooted at ∀vS by its immediate subtree with shallow formula S[v/a]. By definition 3.6 and the inductive assumption that (Q, M) forms an expansion tree proof for U, ∀vS, a cannot be free in U or ∀vS, since a is a selected variable in Q.
(iv) L = U, ∃vS and ∃vS has n, n ≥ 2, successors in Q. Infer L by

    U, ∃vS, ..., ∃vS
    ----------------- (n-1) × C
    U, ∃vS

Change Q by splitting the expansion node into n expansion nodes, one for each occurrence of ∃vS in the premise and each carrying one of the original successors; call the result R. Since Q^D = R^D, (R, M) is again an expansion tree proof for U, ∃vS, ..., ∃vS.
(v) L = U, ∃vS, and ∃vS has exactly one successor S[v/t], and no free variable in t is a variable to be selected in Q. Infer L by

    U, S[v/t]
    ---------- ∃I
    U, ∃vS

and replace the node ∃vS in Q by its successor with shallow formula S[v/t]. From the restriction on t it is clear that no variable to be selected will be free in S[v/t], and therefore by inductive hypothesis in U, S[v/t].
(vi) L = U, V, X ∧ Y such that M = M|_{U,X} ∪ M|_{V,Y}, i.e. no literal in U^D or X^D is M-mated to any literal in V^D or Y^D. Here we have to consider three subcases.

(a) M|_U is clause-spanning on U^D. Then restrict the mating to N = M|_U. Then no literal in V, X ∧ Y is involved in the mating, and these formulas will only appear as side formulas in any inference above L.

(b) M|_V is clause-spanning on V^D. This case is symmetric to case (a): let N = M|_V.

(c) Neither case (a) nor case (b) applies. Then infer L by

    U, X    V, Y
    ------------- ∧I
    U, V, X ∧ Y

Since the problem is symmetric, we will simply show that (Q|_{U,X}, M|_{U,X}) is an expansion tree proof for U, X. It then follows analogously that (Q|_{V,Y}, M|_{V,Y}) is an expansion tree proof for V, Y. The only condition we have to test is whether M|_{U,X} is clause-spanning on Q|_{U,X}^D. Let P be a clause in Q|_{U,X}^D. Since neither case (a) nor case (b) applies, there is a clause O in V^D not spanned by M. Let P' be the extension of P to a clause in Q^D such that P'|_V = O and P'|_{U,X} = P. By inductive assumption, P' is spanned by some (l, k) ∈ M. Not both l and k are in V^D, since M does not span O. We also assumed M = M|_{U,X} ∪ M|_{V,Y} and hence (l, k) ∈ M|_{U,X}.
(vii) L = U, X ∧ Y and case (vi) does not apply. Then infer L by

    U, U, X ∧ Y
    ------------ C
    U, X ∧ Y

with one contraction for each formula in U. Modify Q so that the subtree associated with U is duplicated, one copy for each of the two occurrences of U in the premise; call the result R. For every occurrence of a literal l in U, there are then two occurrences of l in U, U. Call these l_1 and l_2 for the occurrences in the left and right copies of U, respectively. Let
M|_{U_1,X} [M|_{U_2,Y}] be the result of replacing every occurrence of a literal l from U^D in M|_{U,X} [M|_{U,Y}] by l_1 [l_2]. Then N = M|_{U_1,X} ∪ M|_{U_2,Y} spans every clause in R^D. To see this, let P be a clause in R^D. Then P contains literals from either X or Y, but not both. Without loss of generality, assume P contains literals in X, and let O be the clause in Q^D which agrees with P on X and contains a literal l in U^D iff l_1 is in P. By inductive assumption, O is spanned by a pair (k, m) ∈ M. But then also (k_1, m) ∈ M|_{U_1,X} ⊆ N (if m is in Q|_X^D), or (k_1, m_1) ∈ M|_{U_1,X} ⊆ N (if m is in Q|_U^D). Thus P is spanned by N. Since P was arbitrary, N spans every clause in R^D.

Now case (vi) can be applied immediately, thus reducing the complexity of the line U, X ∧ Y to the complexities of the lines U, X and U, Y.
Since the size of connected subformulas of the unjustified lines in the I-proof is diminished in each step, all we need to show to prove correctness is that at least one of the cases always applies. One can see that only one problem may arise: all top-level nnformulas are existentially quantified, each of them has just one substitution term, and all of the substitution terms contain a free variable which is still to be selected. Since <_Q has no cycles, there is a term t such that for no s, s <_Q t. If t contained a free variable a which were still to be selected, then the node where a is selected has to lie below one of the top-level existential quantifiers in Q. But if s is the substitution term for this node, then by definition 3.2, s <_Q t, a contradiction.
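For the propositional fragment, the case analysis above can be animated by a naive driver that always copies side formulas at a conjunction, in the style of case (vii), instead of using the mating-guided split of case (vi). This is our own simplified illustration, not the paper's full algorithm, and it omits the quantifier cases entirely.

```python
def prove(line):
    """Build an I-proof tree for a line (a list of quantifier-free nnformulas),
    or return None if the line is not provable."""
    # Case (i): the line is an axiom U, A, not-A.
    lits = [x for x in line if x[0] == 'lit']
    for i, l in enumerate(lits):
        if any(l[1] == k[1] and l[2] != k[2] for k in lits[i + 1:]):
            return ('axiom', line)
    # Case (ii): replace X v Y by X, Y (OR-introduction, read upwards).
    for i, x in enumerate(line):
        if x[0] == 'or':
            sub = prove(line[:i] + line[i + 1:] + [x[1], x[2]])
            return ('orI', line, sub) if sub else None
    # Case (vii)-style conjunction: copy the side formulas into both premises
    # of AND-introduction, implicitly followed by contractions.
    for i, x in enumerate(line):
        if x[0] == 'and':
            rest = line[:i] + line[i + 1:]
            left, right = prove(rest + [x[1]]), prove(rest + [x[2]])
            return ('andI', line, left, right) if left and right else None
    return None
```

For instance, `prove([('or', ('lit', 'A', True), ('lit', 'A', False))])` returns an 'orI' inference topped by an axiom, while an unprovable line yields None.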
5.
Building Expansion Tree Proofs from I-proofs
In this section we show how to construct an expansion tree proof from a proof in I. This translation plays an important role in giving a translation procedure from I* into expansion tree proofs. Some ideas of Miller [9] are used, but we proceed entirely constructively. Also, the procedure for merge presented in case (vi) below results in much smaller expansion trees than the ones obtained by Miller's MERGE algorithm. Moreover, because of the way we set up I*, a merge is necessary only for contraction and not inherently tied to any quantifier or logical connective. This allows a clearer exposition of the ideas which underly the translation from I-proofs into expansion tree proofs. The construction proceeds by induction on the I-proof tree. Note that all cases except for Contraction are very simple. This supports our claim that the expansion tree proof induced by an I-proof corresponds to the I-proof "in a natural way". The basic "idea" underlying the original proof is retained. We now assume we are given an inference (or axiom) in I, and we have already constructed expansion tree proofs for the premise. We shall call this expansion tree proof (Q, M) ((Q_1, M_1) and (Q_2, M_2) in the case of ∧I). The expansion tree proof for the conclusion will be (R, N).

(i)
We have an axiom U, A, ¬A. Then N = {(¬A, A)}, and R is an expansion tree with shallow formula the disjunction of U, A, ¬A. In Q|_U, let each existentially quantified variable expand to itself, and select a new unique variable for each universally quantified variable.
(ii) ∨I:

    U, X, Y
    -------- ∨I
    U, X ∨ Y

Here (R, N) = (Q, M).
(iii) ∧I:

    U, X    V, Y
    ------------- ∧I
    U, V, X ∧ Y

Here N = M_1 ∪ M_2, and from Q_1 and Q_2 we build R by joining the subtrees for X and Y under a new ∧-node. In the new tree we may have to rename the selections for some universal variables, to make sure that no free or selected variable from one branch of the I-proof tree is selected in the other branch.
(iv) ∃I:

    U, S[v/t]
    ---------- ∃I,  t free for v in S.
    U, ∃vS

From Q we pass to R by introducing an expansion node with expansion term t above the subtree for S[v/t]. If v does not appear in S, we pick a new variable a to be t, a not selected in Q and not free in U, S. Since R^D = Q^D, we can take N = M. What remains to be shown in this case is that <_R is still acyclic.
(v) ∀I:

    U, S[v/a]
    ---------- ∀I,  a a variable not free in U or ∀vS.
    U, ∀vS

From Q we pass to R by introducing a selection node, with a the selected variable, above the subtree for S[v/a]. If v does not appear in S, we pick a new variable a not free in U or S or selected in Q. Since R^D = Q^D, we can take N = M. Moreover, since a is not free in U, ∀vS, a is a valid selection. Moreover, a could not have been selected in Q, since a occurs free in S[v/a] or had been chosen not to be selected in Q. Thus a is selected in R only once.
(vi) C: Let Q_1, Q_2 be the subtrees of Q with the root node being the left and right occurrences of X in the premise, respectively. We apply a recursive merging algorithm to obtain an expansion tree Q_1 ⊕ Q_2 for the single occurrence of X in the conclusion. We thus pass from Q to R, in which the two subtrees Q_1 and Q_2 are replaced by the single subtree Q_1 ⊕ Q_2. In order to apply ⊕ to two expansion trees P_1, P_2, we require P_1^S = P_2^S, which is certainly true of Q_1 and Q_2.
(a) P_1 = l_1 = l = l_2 = P_2. Then P_1 ⊕ P_2 = l. We say we identify the distinct occurrences of the literal l.

(b) P_1 and P_2 carry the same propositional connective at the root, with immediate subtrees Y_1, ..., Y_n and Z_1, ..., Z_n, respectively. Then P_1 ⊕ P_2 carries the same connective at the root and has immediate subtrees Y_1 ⊕ Z_1, ..., Y_n ⊕ Z_n.

(c) P_1 and P_2 are selection nodes ∀vS with selected variables a and b and immediate subtrees Y_1 and Y_2, respectively. Then P_1 ⊕ P_2 is the selection node ∀vS with selected variable a and immediate subtree Y_1 ⊕ Y_2[b/a]. Y_2[b/a] is the result of replacing every occurrence of b in the expansion tree Y_2 by a. But not only do we have to apply this change of names in Y_2, but in the whole expansion tree in which our merge takes place.

(d) P_1 and P_2 are expansion nodes ∃vS with expansion terms t_1, ..., t_n and s_1, ..., s_m, respectively. Then P_1 ⊕ P_2 is an expansion node ∃vS with expansion terms T_1, ..., T_{k+h}. Here T_1, ..., T_k are the expansion terms which appear in only one of t_1, ..., t_n and s_1, ..., s_m; T_{k+1}, ..., T_{k+h} are the expansion terms which appear in both. The subtree below T_i is the corresponding subtree S_1 of P_1 or S_2 of P_2 if T_i appears in only one of them, and S_1 ⊕ S_2 if it appears in both; S_1 [S_2] stands for the occurrence of a subtree in P_1 [P_2]. If T_{k+h} = t_i = s_j we say that T_{k+h} is the result of identifying the distinct occurrences of the expansion terms t_i and s_j.
We now show by induction on the number of identifications of expansion terms in Q_1 ⊕ Q_2 that
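The merge cases above can be sketched as follows. This is our own simplified illustration (the tree encodings, the whole-tree handling of renaming, and the omission of <_Q bookkeeping are all assumptions): it identifies literals, merges propositional subtrees pairwise, renames the second selected variable, and takes the union of expansion terms, merging subtrees for terms that occur in both.

```python
def rename(x, b, a):
    """Replace variable b by a throughout an expansion tree (sketch:
    variables and terms are plain strings, trees are nested tuples)."""
    if isinstance(x, tuple):
        return tuple(rename(t, b, a) for t in x)
    return a if x == b else x

def merge(p, q):
    """Q1 (+) Q2 for two expansion trees with equal shallow formulas.
    Encodings: ('lit', name, pos), ('and'/'or', sub...), ('sel', var, sub),
    ('exp', ((term, sub), ...))."""
    tag = p[0]
    if tag == 'lit':                       # (a) identify identical literals
        return p
    if tag in ('and', 'or'):               # (b) merge subtrees pairwise
        return (tag,) + tuple(merge(a, b) for a, b in zip(p[1:], q[1:]))
    if tag == 'sel':                       # (c) rename q's selection to p's
        a, b = p[1], q[1]
        return ('sel', a, merge(p[2], rename(q[2], b, a)))
    # (d) expansion nodes: union of expansion terms, merging shared ones
    d1, d2 = dict(p[1]), dict(q[1])
    out = [(t, merge(s, d2[t]) if t in d2 else s) for t, s in d1.items()]
    out += [(t, s) for t, s in d2.items() if t not in d1]
    return ('exp', tuple(out))
```

Merging two expansion nodes with disjoint term sets simply collects both terms; merging two selection nodes keeps a single selected variable, as the uniqueness restriction on selections requires.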
6.
Cut Elimination in I*
Our cut elimination algorithm is based on similar algorithms of Gentzen [7] and Smullyan [13]. We reformulate these algorithms in terms of the system I* in order to give a completely self-contained and unified treatment of all the translations between analytic and non-analytic proofs. If one wanted to write out the details of a procedure which computes an expansion tree proof for a formula B, given those for A and ¬A ∨ B, directly in terms of expansion tree proofs, one could use the cases below in an inductive proof to show that such a direct procedure will result in the same expansion tree proof for B as the less direct procedure described in section 7. The proof of termination relies on a double induction argument: at each step we transform one mix (which has no other mixes above it) into one or several mixes with lower degree, or, if the degree stays the same, with smaller rank. The degree of a mix is the number of quantifiers and connectives in the mix formula (the formula being eliminated). The left [right] rank of a mix is the number of lines in the left [right] premise of a mix which contain the mix formula. The rank of a mix is the sum of left and right rank. For many of the following cases there is an obvious symmetric case which can be treated completely analogously. It is to be understood that there could be more occurrences of the mix formula in the premises of a mix, but we do not write this out to keep the diagrams as simple as possible. First we consider the case that one of the premises of the mix is an axiom.

(i)
The mix formula is the side-formula of the axiom. Then we eliminate the mix immediately:

    U, A, ¬A, X    V, ¬X
    --------------------- Mix
    U, V, A, ¬A
(ii) The mix formula is not the side-formula of the axiom. Then we also eliminate the mix

    U, A    V, A, ¬A
    ----------------- Mix
    U, V, A

by adding V as a side-formula to every inference above U, A, which yields a proof of U, V, A.
We will now treat the case that the rank of the mix (which contains no other mix above it) is 2.

(i) The mix formula is a literal A. Since the rank of the mix is 2, one of the previous two cases must apply.
(ii) C = X ∨ Y, ¬C = ¬X ∧ ¬Y. The mix is replaced by

    U, X, Y    V_1, ¬X
    ------------------- Mix
    U, V_1, Y    V_2, ¬Y
    --------------------- Mix
    U, V_1, V_2

Each of the two new mixes has smaller degree.
(iii) C = ∀vS, ¬C = ∃v¬S. The mix

    U, S[v/a]
    ---------- ∀I
    U, ∀vS    V, ∃v¬S
    ------------------ Mix
    U, V

where the right premise is inferred by ∃I from V, ¬S[v/t], becomes

    U, S[v/t]    V, ¬S[v/t]
    ------------------------ Mix
    U, V

where the proof of U, S[v/t] is obtained from the proof of U, S[v/a] by substituting t for a throughout. The new mix has smaller degree.
Now we consider the case where the rank is greater than 2. We treat the case where the left rank is greater than 1. The case where the right rank is greater than 1 can be treated analogously. This case again breaks up into two subcases. The new formula on the left-hand side of the premise may or may not be the same as the mix formula. First we show how to reduce a mix in case the new formula is not the same as the mix formula. Here we generally reduce the mix to a mix with the same degree but lower rank.

(i)

    U, A, B, X
    ----------- ∨I
    U, A∨B, X    V, ¬X
    ------------------- Mix
    U, V, A∨B

becomes

    U, A, B, X    V, ¬X
    -------------------- Mix
    U, V, A, B
    ----------- ∨I
    U, V, A∨B
(ii) The analogous reduction for ∧I. If X appears in only one premise of the ∧I, this case simplifies in the obvious way.

(iii)

    U, A[v/t], X
    ------------- ∃I
    U, ∃vA, X    V, ¬X
    ------------------- Mix
    U, V, ∃vA

becomes

    U, A[v/t], X    V, ¬X
    ---------------------- Mix
    U, V, A[v/t]
    ------------- ∃I
    U, V, ∃vA

(iv)

    U, A[v/a], X
    ------------- ∀I
    U, ∀vA, X    V, ¬X
    ------------------- Mix
    U, V, ∀vA

becomes

    U, A[v/a], X    V, ¬X
    ---------------------- Mix
    U, V, A[v/a]
    ------------- ∀I
    U, V, ∀vA

If a happens to be free in V, replace a by a new variable b everywhere above U, A[v/a], X.

(v)

    U, A, A, X
    ----------- C
    U, A, X    V, ¬X
    ----------------- Mix
    U, V, A

becomes

    U, A, A, X    V, ¬X
    -------------------- Mix
    U, V, A, A
    ----------- C
    U, V, A
The last case remaining occurs when the mix formula is also the formula introduced by the last inference rule on the left-hand side. The cases are analogous to the previous ones, except that one mix is now reduced to one mix of lower rank and another mix of left rank 1.
(i)

    U, A, B, A∨B
    ------------- ∨I
    U, A∨B, A∨B    V, ¬A∧¬B
    ------------------------- Mix
    U, V

becomes

    U, A, B, A∨B    V, ¬A∧¬B
    -------------------------- Mix
    U, V, A, B
    ----------- ∨I
    U, V, A∨B    V, ¬A∧¬B
    ---------------------- Mix
    U, V, V
    -------- C
    U, V

(ii)

    U_1, A, A∧B    U_2, B, A∧B
    ---------------------------- ∧I
    U_1, U_2, A∧B, A∧B, A∧B    V, ¬A∨¬B
    ------------------------------------- Mix
    U_1, U_2, V

is reduced analogously. This case simplifies if the mix formula does not appear in both premises of the ∧I.
(iii)

    U, ∃vS, S[v/t]
    --------------- ∃I
    U, ∃vS, ∃vS    V, ∀v¬S
    ----------------------- Mix
    U, V

becomes

    U, ∃vS, S[v/t]    V, ∀v¬S
    --------------------------- Mix
    U, V, S[v/t]
    ------------- ∃I
    U, V, ∃vS    V, ∀v¬S
    ---------------------- Mix
    U, V, V
    -------- C
    U, V

(iv)

    U, ∀vS, S[v/a]
    --------------- ∀I
    U, ∀vS, ∀vS    V, ∃v¬S
    ----------------------- Mix
    U, V

becomes

    U, ∀vS, S[v/a]    V, ∃v¬S
    --------------------------- Mix
    U, V, S[v/a]
    ------------- ∀I
    U, V, ∀vS    V, ∃v¬S
    ---------------------- Mix
    U, V, V
    -------- C
    U, V

(v)

    U, X, X, X
    ----------- C
    U, X, X    V, ¬X
    ----------------- Mix
    U, V

becomes

    U, X, X, X    V, ¬X
    -------------------- Mix
    U, V
7.
Building Expansion Tree Proofs from I*-proofs
Since we already showed how to construct expansion tree proofs from I-proofs, we only have to show how to construct an expansion tree proof, given expansion tree proofs for the two premises of a mix. We emphasize the constructiveness of our approach. Of course we could simply use any theorem proving procedure and arrive at a proof, since we already know we are dealing with a theorem. Our goal, however, is to construct an expansion tree proof which most closely reflects the structure of the two given original proofs, and moreover can be explicitly obtained from them. Here is our procedure: If we do not already have mix-free I-proofs for both premises, construct them with the algorithm described in section 1. Eliminate the mix from the resulting proof in I* to obtain a proof in I using the algorithm in section 6. Finally, construct an expansion tree proof from this I-proof using the procedure given in section 5. In practice we do not have to explicitly construct these I-proofs. The procedure may be reformulated in terms of the expansion tree proofs themselves, but space does not permit to write out the rather laborious details here.

By looking at one of the critical cases, case (i) where a mix of rank 1 is eliminated, one can see the following: If d is the number of quantifiers and connectives in the mix formula (the degree of the mix), l is the length of the proof (say, above the left premise), and f(d,l) is a worst case lower bound of the length of the resulting mix-free proof, the following relation must hold:

    f(d,l) >= f(d/2, f(d/2, l)).

Thus we get f(d,l) >= 2^2^...^2, a tower of exponentials of height d.
Since an I-proof is at most exponentially bigger than a corresponding expansion tree proof, the lower bound remains non-Kalmar-elementary when the resulting I-proof is translated into an expansion tree proof. A result by Statman [14] mentioned in the introduction tells us that this cannot be significantly improved. There cannot be a Kalmar-elementary translation from I*-proofs into I-proofs. In practice, however, the translation is often feasible and it is not clear which class of theorems will actually blow up the size of the proof by as much as f(d,l).
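To get a feel for the growth implied by the recurrence, here is a small numeric sketch in Python. The base case f(1,l) = 2^l is our own illustrative assumption; the text only asserts the recurrence as a lower bound.

```python
def f(d, l):
    # Lower-bound recurrence from the text: f(d, l) >= f(d/2, f(d/2, l)).
    # The base case f(1, l) = 2**l is an assumption made here for illustration.
    if d <= 1:
        return 2 ** l
    return f(d // 2, f(d // 2, l))

# For d a power of two this yields a tower of exponentials of height d:
print(f(1, 1))  # 2
print(f(2, 1))  # 4  (= 2**2)
print(f(4, 1))  # 65536  (= 2**2**2**2)
```

Already at d = 8 the value is a tower of height 8 and far exceeds anything Kalmar-elementary in l.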
8. Building Expansion Tree Proofs from Resolution Refutations
When describing the translation procedure from resolution refutations into expansion tree proofs, care must be taken to avoid confusion between the different nnformulas and the clauses in them. Resolution refutations are stated for the negation of a theorem; expansion tree proofs are defined for the theorem itself. In both cases clauses play a central role. Thus we will call clauses in an expansion tree paths, while clauses in a resolution refutation will be called clauses. We say a path intersects a clause if they have a literal occurrence in common. Notice that our definition of a clause is slightly different from the customary definition as a set. Since matings are relations on literal occurrences, we cannot afford to regard different occurrences of the same literal as identical. During a resolution of two clauses we delete all occurrences of the literal resolved upon. Generally in this section we will assume nnformulas also to be αβ-normal, i.e. no variable occurs both free and bound and each variable is bound at most once. Andrews [1] described an algorithm which translates resolution refutations into matings, but the setting here is essentially different. We do not work with conjunctive normal forms or Skolem-terms in expansion tree proofs, and the condition that matings in expansion tree
proofs must be clause-spanning is also quite different from Andrews' condition that every cycle in a mating must have a merge. With the aid of this algorithm a resolution refutation can be translated into a non-analytic proof by first translating it into an expansion tree proof and then into a proof in I* using the algorithm in section 5. This can be carried even further by translating the I*-proof into a proof in natural deduction style. A procedure for this translation is given by Miller in [10]. This can help a mathematician understand a proof by a resolution theorem prover since he can study it in a familiar format. It may also be a valuable research tool as indicated in the introduction.

8.1. Definition. Let X be an αβ-normal nnformula. Then X*, the Skolem-form of X, is the result of replacing every subformula of the form ∃vS by S[v/f_v(w_1,...,w_n)], where w_1,...,w_n are all the universally quantified variables in whose scope ∃vS lies, and then deleting all the universal quantifiers. f_v(w_1,...,w_n) and instances thereof are called Skolem-terms, f_v the Skolem-function for v.
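Definition 8.1 can be rendered directly as a small recursive procedure. The following Python sketch uses a tuple encoding of nnformulas of our own devising, not the paper's: ('lit', ...), ('and', ...), ('or', ...), ('forall', v, S), ('exists', v, S). It assumes αβ-normality, so no renaming or capture checks are needed.

```python
def subst_term(t, v, term):
    # Terms are variable names (strings) or tuples (fn, arg1, ..., argn).
    if isinstance(t, str):
        return term if t == v else t
    return (t[0],) + tuple(subst_term(a, v, term) for a in t[1:])

def subst(formula, v, term):
    tag = formula[0]
    if tag == 'lit':
        return ('lit', formula[1]) + tuple(subst_term(a, v, term) for a in formula[2:])
    if tag in ('and', 'or'):
        return (tag,) + tuple(subst(s, v, term) for s in formula[1:])
    return (tag, formula[1], subst(formula[2], v, term))  # quantifier node

def skolem_form(formula, univ=()):
    # X*: replace each exists-v-S by S[v/f_v(w1,...,wn)] for the universally
    # quantified w's in whose scope it lies, then delete the universal quantifiers.
    tag = formula[0]
    if tag == 'lit':
        return formula
    if tag in ('and', 'or'):
        return (tag,) + tuple(skolem_form(s, univ) for s in formula[1:])
    if tag == 'forall':
        return skolem_form(formula[2], univ + (formula[1],))
    if tag == 'exists':
        v, s = formula[1], formula[2]
        return skolem_form(subst(s, v, ('f_' + v,) + univ), univ)
    raise ValueError(tag)

# Skolemizing  forall w. exists v. P(w, v)  gives  P(w, f_v(w)):
x = ('forall', 'w', ('exists', 'v', ('lit', 'P', 'w', 'v')))
print(skolem_form(x))  # ('lit', 'P', 'w', ('f_v', 'w'))
```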
8.2. Definition. Let X be an αβ-normal nnformula. A resolution refutation of X is a list of clauses c_1,...,c_n such that

(i) there is an m such that {c_j : 1 ≤ j ≤ m} is a subset of the set of clauses of X*,

(ii) for each j > m either
    (a) c_j is a substitution instance φc_i for some i < j, or
    (b) c_j is the resolvent of c_{a_j} and c_{b_j}, where a_j, b_j < j, and c_j is formed by appending the results of deleting all occurrences of a literal l_j from c_{a_j} and ¬l_j from c_{b_j},

(iii) c_n = □ (the empty clause).
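The occurrence-based reading of clauses in Definition 8.2 (clauses are lists of literal occurrences, not sets, and resolution deletes all occurrences of the literal resolved upon) can be sketched as follows; the string encoding of literals is our own.

```python
def neg(lit):
    # Literals are strings; a leading '-' marks negation.
    return lit[1:] if lit.startswith('-') else '-' + lit

def resolvent(ca, cb, lit):
    # Definition 8.2(ii)(b): append the results of deleting all occurrences
    # of lit from ca and of its negation from cb.  Clauses are tuples of
    # literal occurrences, so duplicate occurrences are kept distinct.
    return tuple(l for l in ca if l != lit) + tuple(l for l in cb if l != neg(lit))

# A toy ground refutation: note that both occurrences of P in c1 disappear at once.
c1 = ('P', 'P')
c2 = ('-P', 'Q')
c3 = ('-Q',)
c4 = resolvent(c1, c2, 'P')   # ('Q',)
c5 = resolvent(c4, c3, 'Q')   # ()  -- the empty clause
print(c4, c5)
```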
In our translation we will have to select unique variables for Skolem-functions and their arguments. In general, if f(w_1,...,w_n) is a Skolem-term for arbitrary terms w_1,...,w_n, then f(w_1,...,w_n) is a unique corresponding variable. Note that this is just a notational convenience in our metalanguage. We must also occasionally model the effect of a substitution into a Skolem-term on the corresponding variables.

8.3. Definition. Let f(w_1,...,w_n) be a variable, φ a substitution for variables which do not come from Skolem-terms. We extend φ to terms and formulas in the usual way, but also extend it to act on variables which come from Skolem-terms. Recursively define φf(w_1,...,w_n) := f(φw_1,...,φw_n).

We are now ready to define what it means to apply a substitution to an expansion tree. Note that (φQ)^S = φ(Q^S).

8.4. Definition. Let Q be an expansion tree. Then we define φQ inductively.
(i) Q is a literal l. Then φQ = φl.

(ii) Q is a conjunction or disjunction node with subtrees Q_1,...,Q_n. Then φQ is the corresponding node with subtrees φQ_1,...,φQ_n.

(iii) Q is an expansion node ∃vS with expansion terms t_1,...,t_n and subtrees Q_1,...,Q_n. We leave the original expansions intact, and add all terms which change under the substitution as new expansion terms. Let t_{i_1},...,t_{i_m} be all the expansion terms t_i such that φt_i ≠ t_i. Then φQ is the expansion node ∃vS with expansion terms t_1,...,t_n,φt_{i_1},...,φt_{i_m} and subtrees Q_1,...,Q_n,φQ_{i_1},...,φQ_{i_m}.

(iv) Q is a selection node ∀vS with selected variable f(w_1,...,w_n) and subtree Q_0. Then φQ is the selection node ∀vS with selected variable f(φw_1,...,φw_n) and subtree φQ_0.
During the translation from resolution refutations to expansion tree proofs we associate an expansion tree and a mating with each line in the resolution refutation. These expansion trees have to satisfy all of the conditions of expansion tree proofs except that the mating does not have to be clause-spanning. We therefore define:

8.5. Definition. A partial expansion tree proof (Q,M) for an nnformula X is an ordered pair consisting of an expansion tree Q and a mating M on Q^D such that
(i) Q^S = X.
(ii) No selected variable is free in Q^S.
(iii) The relation < (where t_1 < t_2 iff a variable selected below t_1 is free in t_2) is acyclic.
A particular partial expansion tree will correspond to the part of the resolution proof which is constructed solely from the clauses in the negated and Skolemized theorem.

8.6. Definition. Let X be an αβ-normal nnformula. The initial expansion tree Q(X) for X is inductively defined for parts Y of X by

(i) Y = l for a literal l. Then Q(Y) = l.

(ii) Y is a conjunction or disjunction Y_1 ∘ ··· ∘ Y_n. Then Q(Y) is the corresponding node with subtrees Q(Y_1),...,Q(Y_n).

(iii) Y = ∃vS. Then Q(Y) is the expansion node ∃vS with the single expansion term v and subtree Q(S).
(iv) Y = ∀vS. Then Q(Y) is the selection node ∀vS with selected variable f_v(w_1,...,w_n) and subtree Q(S[v/f_v(w_1,...,w_n)]), where f_v(w_1,...,w_n) is the Skolem-term for v in X.

Now we construct an expansion tree proof from a resolution refutation. Let a resolution refutation c_1,...,c_m,c_{m+1},...,c_n = □ be given. For each clause c_j, j ≥ m we will recursively construct a partial expansion tree proof (Q_j,M_j) with the following property:
(*)_j: Let c_i, i ≤ j, be a clause in the resolution refutation. Then every path through Q_j^D which does not intersect c_i contains a pair of M_j-mated literals.

If we can show that (*)_j holds for all m ≤ j ≤ n, the correctness of our translation is proven, since c_n = □ and therefore no path through Q_n^D intersects c_n by (*)_n. Hence every path through Q_n^D must be spanned by M_n and (Q_n,M_n) is an expansion tree proof for X.
Now we come to the construction of (Q_j,M_j). Let (Q_m,M_m) = (Q(X),∅). Since every path in Q(X)^D intersects every clause in X*, (Q_m,M_m) is a partial expansion tree proof for X and satisfies (*)_m. Now assume (Q_m,M_m),...,(Q_{j-1},M_{j-1}) are partial expansion tree proofs for X and (*)_i is satisfied for m ≤ i ≤ j−1. We have to distinguish cases, since c_j could either be a substitution instance or a resolvent of earlier clauses.

(i)
Assume c_j is a substitution instance φc_i for some 1 ≤ i ≤ j−1, φ a substitution for the free variables in c_i. If a variable is free in c_i it must be existentially quantified in X. Now we pass to a substitution θ such that θ agrees with φ if the substituent is not a Skolem-term, and θv is the unique variable corresponding to f(w_1,...,w_n) if φv = f(w_1,...,w_n). Let Q_j = θQ_{j-1}. (Q_j,M_j) is a partial expansion tree proof for X (M_j to be constructed later):

(a) Q_j^S = Q_{j-1}^S = X by inductive assumption.

(b) From the way selections for universal variables in X are chosen and from the fact that X was αβ-normal, it is clear that every variable is selected at most once and that no selected variable is free in Q_j^S.

(c) Suppose the relation < on Q_j were not acyclic, say t_1 < t_2 < ··· < t_n = t_1. The first relation means that there is a variable selected below t_1 which is free in t_2. Since the variable is selected below t_1 in the expansion tree, it has the form of a variable corresponding to a Skolem-term which contains t_1. Thus t_2 contains a term of the form f_1(...,t_1,...). Hence in the Skolem-form φ of the substitution, t_1 is free in t_2. The next relation would say that there is a variable selected below t_2 which is free in t_3. Thus a term of the form f_2(...,t_2,...) is free in t_3. Combined with the previous conclusion this gives us that t_1 is free in t_3. Iterating this process we finally arrive at the conclusion that t_1 is free in t_n = t_1. But this would mean that the original substitution φ was not legal, which is a contradiction. Therefore < is acyclic.
Now we show how to construct M_j. First note that because of Definition 8.4 any literal occurrence in Q_{j-1}^D is still present in Q_j^D. Each new literal occurrence in Q_j^D is of the form θl for some l in Q_{j-1}^D. Then we simply let M_j = M_{j-1} ∪ {(θl,θk) : (l,k) ∈ M_{j-1}}.

(a) Consider c_h, h < j, and P a path through Q_j^D not intersecting c_h. Since paths in Q_j^D can only be longer than paths in Q_{j-1}^D, there is a projection P' of P in Q_{j-1}^D; P' may be obtained by deleting all the new literals from P. Then P' is spanned by M_{j-1} by inductive hypothesis and hence P by M_j ⊇ M_{j-1}.
(b) Consider c_j, and P a path through Q_j^D not intersecting c_j. Construct a path P' through Q_{j-1}^D as follows: every literal occurrence l in Q_{j-1}^D such that there is a new literal occurrence θl ∈ P is included. Furthermore all literal occurrences l such that there is no new literal occurrence θl in Q_j^D, but l ∈ P, are also included. Then P' does not intersect c_i and is therefore spanned by a pair (l,k) ∈ M_{j-1}. But then θl,θk ∈ P (neither necessarily new) and (θl,θk) ∈ M_j. Hence P is spanned by M_j.
(ii) Assume c_j is the resolvent of c_{a_j} and c_{b_j} upon the literal l_j ∈ c_{a_j}, ¬l_j ∈ c_{b_j}, where a_j, b_j < j. Define Q_j = Q_{j-1} and let M_j = M_{j-1} ∪ {(l,k) : l an occurrence of l_j in c_{a_j}, k an occurrence of ¬l_j in c_{b_j}}.

Since Q_j = Q_{j-1}, Q_j is a partial expansion tree proof for X. What remains to be shown is that M_j spans every path through Q_j^D which does not intersect c_i, for all i ≤ j. For i < j this is obvious by the inductive hypothesis and the fact that M_j ⊇ M_{j-1}. Now consider a path P through Q_j^D not intersecting c_j. There are three cases:

(a) P does not intersect c_{a_j}. By inductive hypothesis M_{j-1} ⊆ M_j spans P.

(b) P does not intersect c_{b_j}. By inductive hypothesis M_{j-1} ⊆ M_j spans P.

(c) P intersects both c_{a_j} and c_{b_j}. Since P does not intersect c_j, P must intersect c_{a_j} in one of the literal occurrences l_j resolved upon, and c_{b_j} in one of the literal occurrences ¬l_j. But then M_j spans P since (l_j,¬l_j) ∈ M_j.
9. References
[1] Peter B. Andrews, Refutations by Matings, IEEE Transactions on Computers C-25 (1976), 801-807.
[2] Peter B. Andrews, Transforming Matings into Natural Deduction Proofs, in: 5th Conference on Automated Deduction, Les Arcs, France, edited by W. Bibel and R. Kowalski, Lecture Notes in Computer Science 87, Springer-Verlag, 1980, 281-292.
[3] Peter B. Andrews, Theorem Proving via General Matings, Journal of the Association for Computing Machinery 28 (1981), 193-214.
[4] Wolfgang Bibel, Automatic Theorem Proving, Vieweg, Braunschweig, 1982.
[5] W. Bibel and J. Schreiber, Proof search in a Gentzen-like system of first-order logic, Proceedings of the International Computing Symposium, 1975, 205-212.
[6] W. W. Bledsoe, Non-resolution Theorem Proving, Artificial Intelligence 9 (1977), 1-35.
[7] G. Gentzen, Investigations into Logical Deduction, in: The Collected Papers of Gerhard Gentzen, M. E. Szabo (ed.), North-Holland, Amsterdam, 1969, 68-131.
[8] J. Herbrand, Logical Writings, Harvard University Press, 1972.
[9] Dale A. Miller, Proofs in Higher-Order Logic, Ph.D. Thesis, Carnegie-Mellon University, August 1983.
[10] Dale A. Miller, Expansion Tree Proofs and Their Conversion to Natural Deduction Proofs, 7th Conference on Automated Deduction, Napa, May 1984.
[11] Frank Pfenning, Conversions between Analytic and Non-analytic Proofs, Technical Report, Carnegie-Mellon University, 1984 (to appear).
[12] J. A. Robinson, A machine-oriented logic based on the resolution principle, Journal of the Association for Computing Machinery 12 (1965), 23-41.
[13] R. M. Smullyan, First-Order Logic, Springer-Verlag, Berlin, 1968.
[14] R. Statman, Lower Bounds on Herbrand's Theorem, Proceedings of the American Mathematical Society 75 (1979), 104-107.
Applications of Protected Circumscription

Jack Minker and Donald Perlis
Computer Science Department
University of Maryland
College Park, MD 20742

Abstract

We examine applications of an extension of circumscription that allows protection of certain objects against being included in the circumscription process. We show that this allows a clean handling of incomplete information in problems from artificial intelligence and databases.
1. Introduction
This paper amplifies on results proven elsewhere (see Minker & Perlis [1984]), in which we extended the idea of circumscription to allow prescription of what objects are or are not to be included in the circumscription process, broadening the applicability of the technique. A way to view circumscription is that it characterizes what it means for a set to be specified by means of various assertions. We review briefly the idea of circumscription, before discussing the extended version. We begin with a suggestive example.
Suppose a precious red sapphire, s, is purchased in India and brought to Denver, only to be lost. Then years later a youngster is found living alone in the Rocky Mountain wilderness wearing a red sapphire ring, r. The reader of the mystery is supposed to immediately think, Aha! That's the red sapphire that disappeared earlier! In fact, only one
red sapphire exists, one presumes, at least as far as we need consider.
Yet such has not been stated, and to state it is to go further than we wish. Somehow we have great use for jumping to conclusions of this sort, although we realize they need not be true. Still, in order to get ideas to begin reasoning at all, we need to do some such associating, and often it is useful to use these associations as conclusions for immediate acceptance (at least until forced to alter them by weight of later evidence). How then are we to do this? It clearly is a kind of default problem, and one addressed recently by several workers in artificial intelligence (McDermott & Doyle [1980], McCarthy [1980], Reiter [1980]). The approach of McCarthy, predicate circumscription, applies particularly well to the above problem. In another paper (Minker & Perlis [1984]) we have extended McCarthy's formalism; here we are concerned with specific applications of the extension.
McCarthy's approach then is as follows: Given a predicate symbol P and a formula A[P] containing P, the circumscription of P by A[P] can be thought of as saying that the P-things consist of certain ones as needed to satisfy A[P] and no more, in the sense that any P-things Z satisfying A[Z] already include ALL P-things:

    C[Z]:  [A[Z] & (x)(Z(x) -> P(x))] -> (x)(P(x) -> Z(x))

for all formulas Z.
To see how this 'solves' the sapphire problem, let P(x) say x is a red sapphire. We decide to circumscribe on P since red sapphires are, as far as we can judge, quite unusual and unlikely to be present without being recognized and well-known. Once mentioned, the gem becomes 'the' red sapphire s of the story until further notice. So, the property of being a red sapphire becomes the only contextual information needed: A[P] is P(s). As long as it remains our judgement that red-sapphired-ness is appropriate to circumscribe, we will conclude that this red sapphire is also the one and only red sapphire, namely, the lost one. Thus we
will be able to prove that r = s.

In detail, circumscription of P by P(s) (as the only information A[P] that initially pertains) can be applied by taking the predicate Z(x) to be x = s. Then A[Z] will be Z(s), i.e., s = s, resulting from replacing P by Z in A[P]. It follows by the above circumscription schema that P(x) -> Z(x), i.e., that the only red sapphire is s. This is seen as follows: first, Z(s) is obvious; and Z(x) -> P(x) follows from P(s). So the schema yields P(x) -> Z(x).

If we retain this conclusion on hearing about the sapphire r, then of course we must conclude that r = s, which is automatic.
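For ground theories over a fixed finite domain, the effect of circumscription can be simulated by enumerating the possible extensions of P and keeping only the minimal ones. The following Python sketch is our own finite-domain rendering for illustration, not McCarthy's second-order schema itself; it recovers the conclusion that s is the only red sapphire.

```python
from itertools import chain, combinations

domain = ['s', 'r']   # the lost sapphire s and the ring r

def extensions(dom):
    # All candidate extensions of P, as sets of objects.
    return [set(c) for c in chain.from_iterable(
        combinations(dom, k) for k in range(len(dom) + 1))]

# A[P] is just P(s):
models = [P for P in extensions(domain) if 's' in P]
minimal = [P for P in models if not any(Q < P for Q in models)]
print(minimal)   # [{'s'}] -- in the minimal model nothing but s is a red sapphire
```

Since the unique minimal extension is {s}, the formula P(x) -> x = s holds there, matching the schema-based derivation above.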
Of course, we have made two significant judgements here, neither of them a matter of logic: that red sapphires are things to circumscribe on, and that new data of the sort presented (the existence of g) does not alter the first judgement. We are not tackling this issue here, but simply the one of how to formally represent such reasoning.
2. Circumscription with Protected Terms

Here we discuss a simple syntactic device from Minker & Perlis [1984]. There we suggested that once A has been selected as appropriate for circumscribing P, and if (perhaps later) it is desired to protect S-things from this process so that circumscription will not be used to show S-things are not P-things, we can keep the same criteria A, but alter the form of the schema itself. Starting with P(x) & -S(x), which we write P/S(x) (and more generally T/U(x) for T(x) & -U(x)), we alter the circumscription schema to read as follows:

    C[Z]:  [A[Z] & (x)(Z/S(x) -> P(x))] -> (x)(P/S(x) -> Z(x))

for all formulas Z.
Intuitively, we are saying that conclusions are drawn only about non-S-things, as far as ruling out possible P-things goes. We refer to this schema as 'protected circumscription'; unless so indicated, circumscription will refer to McCarthy's schema. We write C[Z] when context makes clear what the A, P, and S (if protected) are.

It may appear that by circumscribing on the formula P(x) & -S(x) the same effect is achieved. Indeed intuitively this should be the case. However, circumscription, as defined by McCarthy, applies only for single predicate letters. It is not obvious how to extend it to general formulas. John McCarthy has communicated to us that he is currently pursuing this extension.
To return to our sapphire example, suppose in addition to the red sapphire that is lost, another precious stone has been brought from India by another Denver resident, but its precise gemology has not been revealed. In fact, we may suppose for the sake of story-line, that the two gem buyers are in fact obtaining gifts for their (one and the same) admiree, a third Denver resident whose birthday anniversary is to be celebrated soon. The reader may already feel a tingling sense of worry that the two gems may be identical in type and bound to produce embarrassment.

How then can we represent the reasoning that there are one and possibly two red sapphires, but no more, and that s is one, and the other stone, say g, may or may not be, in such a way that we still can conclude later that r = s (supposing g not to be lost)? Our schema will do this if we again let P(x) say x is a red sapphire, P(s) being the only information that is needed to circumscribe that very property (i.e., the axiom A[P] is simply P(s) itself) except that now we also state S(g) to protect g from being squeezed out of possible red-sapphired-ness. Again we let Z(x) be x = s, and further simply take S(g) as an axiom. S(x) will have no special meaning other than that x is 'selected' for protection from circumscription.
Then much as before we can conclude P(x) -> Z(x) v S(x), i.e., any red sapphire either is the first one (s) or is the new untyped stone (g). Then on learning of the red sapphire ring r, it follows that either r = s or r = g. If further it is known that g is not lost, indeed is in the firm possession of its owner, then we know r ≠ g, hence r = s.

Notice the apparent non-monotonicity present in such a line of reasoning. Before we have heard of the second stone g, we conclude r = s; later, with further information but (apparently) no loss of what was previously known, we no longer can make such a strong conclusion but instead have only (r = s) v (r = g). In fact, of course, information has been retracted, namely our original unprotected treatment of red sapphires: now A[P] is {P(s), S(g)} whereas before it was just {P(s)}, so the previous schema has been replaced by a new one that in fact is not logically stronger.
3. Using Model-Theory

In McCarthy [1980] the concept of minimal model was discussed in the context of circumscription. In Minker & Perlis [1984] we re-defined minimal model in a manner appropriate to the new version of circumscription as follows: Let M and N be models of A[P]. We say M ≤ N if the atomic truths of M are contained in those of N, if those atomic truths of M not using P are precisely those of N, and if the extension of P&S in M is also that in N, i.e., if {x | P(x) and S(x) hold in M} = {x | P(x) and S(x) hold in N}. Then M is a P/S-minimal model of A[P] if M is a model of A[P] minimal with respect to the relation ≤.

As an example, suppose P(a)&P(b)&P(c)&-P(d)&Q(d) is the sentence A[P], and we wish to protect the constant c: S(c). Then the only model is {P(a) P(b) P(c) Q(d) S(c)}. (Here we indicate a model by writing the positive ground clauses that hold in it.) This model is the only minimal model. In this case protection is superfluous since P(c) is required to hold.
Now consider the sentence P(a)&P(b) where c is still a protected constant (S(c)) and d is an unprotected constant. Here we obtain four models:

M1 = {P(a) P(b) P(c) P(d) S(c)}
M2 = {P(a) P(b) P(c) S(c)}
M3 = {P(a) P(b) P(d) S(c)}
M4 = {P(a) P(b) S(c)}.

Of these only M2 and M4 are minimal, M2 being a P/S-minimal submodel of M1, and M4 of M3.
Finally, consider P(a) v P(b) v P(c) with S(a) and S(c). Then the models are

M1 = {P(a) P(b) P(c) S(a) S(c)}
M2 = {P(a) P(b) S(a) S(c)}
M3 = {P(a) P(c) S(a) S(c)}
M4 = {P(b) P(c) S(a) S(c)}
M5 = {P(a) S(a) S(c)}
M6 = {P(b) S(a) S(c)}
M7 = {P(c) S(a) S(c)}.

The minimal ones are M3, M5, M6, M7.
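This last example is small enough to check mechanically. The Python sketch below uses an encoding of our own: since the S-atoms S(a), S(c) are fixed, a model is identified with the extension of P. It enumerates the models of P(a) v P(b) v P(c) and keeps the P/S-minimal ones.

```python
from itertools import chain, combinations

domain = ['a', 'b', 'c']
S = {'a', 'c'}          # protected: S(a) and S(c)

def ps_leq(M, N):
    # M <= N in the P/S-ordering: the P&S-extension is the same,
    # and the P-truths of M are contained in those of N.
    return M & S == N & S and M <= N

candidates = [set(c) for c in chain.from_iterable(
    combinations(domain, k) for k in range(len(domain) + 1))]
models = [P for P in candidates if P]          # P(a) v P(b) v P(c)
minimal = [M for M in models
           if not any(N != M and ps_leq(N, M) for N in models)]
print(sorted(sorted(M) for M in minimal))
# [['a'], ['a', 'c'], ['b'], ['c']] -- the P-extensions of M5, M3, M6, M7
```

Note that M3 = {P(a) P(c)} survives even though M5 and M7 are smaller: their P&S-extensions differ, so they are incomparable under the P/S-ordering.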
Using models to draw conclusions about derivability relies on having appropriate soundness and completeness theorems tying model-theoretic truth to syntactic proof. McCarthy [1980] provides the soundness half of such a result for circumscription, but not the completeness part. As noted by Davis [1980] the fully general completeness result would be false. Nonetheless, Minker & Perlis [1984] have a soundness and completeness result that applies to cases of 'ground' theories (among others), i.e., ones with no variables, such as we are considering here: for such theories A[P], and for any ground formula B, we have

    A[P] |=_{P/S} B  iff  A[P] |-_{P/S} B,

i.e., B holds in all P/S-minimal models of A[P] iff B follows from A[P] by protected circumscription.
It is instructive to consider the following example: Let A[P] consist of the data P(a), -P(b) v -P(c). Then there are three models of A[P]:

1. {P(a)}
2. {P(a), P(b)}
3. {P(a), P(c)}

Of these, only 1 is minimal, and so the formulas true in 1 are the circumscriptive theorems of A[P], for all choices of Z at once! Notice that the theory A'[P] having ONLY P(a) as axiom also has these three models as well as:

4. {P(a), P(b), P(c)}

which still is not minimal. So A and A' have the same minimal models and hence the same circumscriptive theorems. In fact in both theories we have the theorems -P(b) and -P(c), so that the axiom -P(b) v -P(c) in A is circumscriptively redundant.
Now suppose we wish to protect b and c in A so that ALL we know about P(b) and P(c) is that they are not jointly true, i.e., -P(b) v -P(c) represents real uncertainty. Then we find that 1, 2, and 3 are the only models and all are minimal. Furthermore, although -P(b) v -P(c) holds in each, neither -P(b) nor -P(c) does, so that the protection has really worked. But now if we pass to A' and protect b and c, we find still all four models as before and all are minimal, so that not even -P(b) v -P(c) holds.

Although the completeness result has shown us what the ground theorems of these four theories are, we see from this example that negative data (-P(b) v -P(c)) can have a non-redundant effect when there are protected constants. This shows a strong distinction from the situation for ordinary circumscription.
4. Applications to Databases

We believe that protected circumscription is applicable to belief systems, databases, and many other areas. We give here an application to databases.

Suppose a database DB contains the information P(a) and P(b) and neither P(c) nor P(d). Traditional database approaches would take this to mean that P(c) and P(d) are false, i.e., there is an assumption of complete data, often referred to as the 'closed world assumption' (Reiter [1978]). This is not to say that the closed world assumption is logically valid; rather, that in certain data sets, it happens to hold. This of course is a very limiting situation. For instance it does not allow for the possibility that some data simply has not yet been gathered, surely an extremely frequent occurrence in real-world databases.

A more dramatic version of this is 'indefinite' data of the form P(c) or P(d). Here it is not simply that we do not know about c and d. We know that at least one of them has
property P, but we do not know which. McCarthy's circumscription (among other approaches) provides a solution to this, in which from P(x) one can conclude, for instance, x=a v x=b v x=c v x=d, given the database DB = {P(a), P(b), P(c) v P(d)}. Thus there is in force a kind of closed world assumption, but broadened so as to deal with indefinite data, what Minker [1982] calls 'the generalized closed world assumption'.

Indeed, we can regard the incomplete database as a special kind of indefinite database, in which the lack of information about P(c) is represented as an indefiniteness between P(c) and -P(c). Yet no assertion of the form P(c) v P(x) will do what is required. If x is different from c, then we are asserting more than is wanted, for now we are committing x also to be indefinite, not to mention that x and c are also being bound together in a special relation not part of our intention. If on the other hand we let x be c itself, then P(c) v P(c) tells us too much, namely that c definitely has property P.

Other ideas in this vein include P(c) v -P(c) (a tautology which achieves nothing), and P(c) v P(ind) where ind is a new constant introduced for this purpose. The latter has some promise, but leaves us with the undesirable feature that now we can prove that something (either c or ind) has property P, this again not being the intended outcome.
With this background we then look at protected circumscription for a solution to this difficulty. Let S(x) be the predicate x=c. This will serve to protect c. Now if we use protected circumscription on P by the database DB = {P(a) P(b) Q(c) Q(d)} with S as stated, we find as expected that -P(c) cannot be concluded, although -P(d) can be concluded. In terms of minimal models, we first consider all objects that do not have property S (this is only c here) and that also must have property P in each model. These objects are only a and b, so these are the only ones we can conclude to have property P. On the other hand, we also examine all objects which do not have property S (again just c here) and which must fail to have property P in each minimal model. In this case the only such object is d. Hence we
conclude -P(d).
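The database reasoning just described can also be checked by brute force over the minimal models. Again this is a finite-domain sketch with an encoding of our own (a model is the extension of P, with S(c) fixed and the Q-atoms ignored, since they play no role in the P/S-ordering):

```python
from itertools import chain, combinations

domain = ['a', 'b', 'c', 'd']
S = {'c'}               # S(x) is x = c

def ps_leq(M, N):
    # Same P&S-extension, and fewer (or equal) P-truths.
    return M & S == N & S and M <= N

candidates = [set(c) for c in chain.from_iterable(
    combinations(domain, k) for k in range(len(domain) + 1))]
models = [P for P in candidates if {'a', 'b'} <= P]   # the DB facts P(a), P(b)
minimal = [M for M in models
           if not any(N != M and ps_leq(N, M) for N in models)]

entailed = lambda q: all(q(M) for M in minimal)
print(entailed(lambda P: 'd' not in P))   # True:  -P(d) is concluded
print(entailed(lambda P: 'c' not in P))   # False: -P(c) is blocked by protection
```

The two minimal models have P-extensions {a,b} and {a,b,c}: the unprotected d is squeezed out everywhere, while c remains undetermined.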
In the case of (Horn) databases we have a generalization of the idea of Clark [1978] who, when discussing negation as failure, showed that an 'if and only if' condition was its analogue. For example, if P(a) and P(b) are known and we do not care about c or d, then we would write

    (x = a) v (x = b)  <->  P(x).

Now, if one wants to protect c while leaving d unprotected, our solution is simply to place (x = c) on both the right and left hand sides of the above formula, to obtain

    (x = a) v (x = b) v (x = c)  <->  P(x) v (x = c).
Relating this to our protected circumscription schema, we can re-write this as a conjunction of two formulas and then remove tautologies:

(1) (x=a) v (x=b) v (x=c)  ->  P(x) v (x=c)
(2) (x=a) v (x=b) v (x=c)  <-  P(x) v (x=c)

and then

(3) (x=a) v (x=b)  ->  P(x)
(4) P(x)  ->  (x=a) v (x=b) v (x=c)

(here we assume distinct constants stand for distinct entities). Let Z(x) be (x=a) v (x=b);
then

(5) P(x) & (x≠c)  ->  Z(x)    from (4)

and finally, letting S(x) be (x=c), we have

(6) P/S(x)  ->  Z(x)    from (5)
(7) Z/S(x)  ->  P(x)    from (3).

Hence, the generalization of Clark's idea is simply achieved for databases by the modified formula, which is equivalent to our protected circumscription.
Acknowledgements

Our work obviously depends greatly on that of John McCarthy. We have also been influenced by work of and discussions with Ray Reiter. This paper was written with support from the following grants: AFOSR-82-0303, for J. Minker and D. Perlis; NSF MCS 79 19418, for J. Minker; U. of Md. Summer Research Award for D. Perlis.
Bibliography

Clark, K. [1978] "Negation as Failure". In: Logic and Data Bases (Gallaire, H. and Minker, J., Eds.), Plenum Press, NY, 1978, 293-322.

Davis, M. [1980] "The Mathematics of Non-Monotonic Reasoning". Artificial Intelligence 13 (1980), 73-80.

McCarthy, J. [1980] "Circumscription - A Form of Non-Monotonic Reasoning". Artificial Intelligence 13 (1980), 27-39.

McDermott, D., and Doyle, J. [1980] "Non-Monotonic Logic I". Artificial Intelligence 13 (1980), 41-72.

Minker, J. [1982] "On Indefinite Databases and the Closed-World Assumption". Sixth Conference on Automated Deduction, New York, NY, 1982; Springer-Verlag Lecture Notes in Computer Science, v. 138, 292-308.

Minker, J., and Perlis, D. [1984] "On the Semantics of Circumscription". Technical Report, Univ. of Maryland, 1984.

Reiter, R. [1980] "A Logic for Default Reasoning". Artificial Intelligence 13 (1980), 81-132.

Reiter, R. [1978] "On Closed World Databases". In: Logic and Data Bases (Gallaire, H. and Minker, J., Eds.), Plenum, 1978, 55-76.

Reiter, R. [1982] "Circumscription Implies Predicate Completion (Sometimes)". Proceedings of AAAI-82, 418-420.
IMPLEMENTATION STRATEGIES FOR PLAN-BASED DEDUCTION

Kenneth Forsythe and Stanislaw Matwin
Dept. of Computer Science
University of Ottawa
Ottawa, Ontario K1N 6N5

ABSTRACT

This paper discusses some results of experimentation with a plan-based deduction system. The system incorporates an efficient intelligent backtracking strategy. During implementation, several important questions concerning different strategies to control the deduction process arose. These questions are answered in the paper, with special emphasis on the problem of generating redundant solutions.

1. INTRODUCTION

This paper presents different implementation strategies for a plan-based deduction method. The method, presented in [Pietrzykowski & Matwin 82] and further developed in [Matwin & Pietrzykowski 83], forms the basis of a logic programming system using intelligent backtracking. Given an initial set of clauses with a goal statement, a mechanical theorem prover will attempt to refute the goal statement via resolution. There are many different algorithms upon which to base the resolution process (for example see [Chang and Lee 73]) but most of these incorporate a linear backtracking strategy or do not address the backtracking implementation at all. By linear
This work has been supported by Natural Sciences and Engineering Research Council of Canada grant No. A2480.
backtracking we mean a strategy which backtracks through applicative goals in exactly the reverse order they were encountered, starting with the current goal. Unfortunately, this strategy will blindly sequentially explore every path until a solution is found. Since the number of paths grows exponentially with the number of clauses, it is advantageous to eliminate paths which cannot lead to a solution before they are tried. This is the concept behind plan based deduction. In plan based deduction, backtracking is not restricted to the most current goal but can be applied to any goal. In practice, we limit this to those goals (called conflicts) whose removal from the plan will restore unifiability to it. The structure of the plan is such that backtracking terminates when the original goal statement is encountered. This property speeds up the worst case of linear backtracking by an exponential factor [Pietrzykowski & Matwin 82]. There are a number of other deduction algorithms in which graphs are used [Sickel 76], [Kowalski 75], [Chang, Slagle 79], [Bibel 83]. However, the
Slagle 79] connection
graphs represent the search
space and rewriting rules are obtained from it. Connection graphs determine sequences of substitutions, leading possibly to a refutation.
Consistency of these substitutions is only checked after
a whole sequence has been generated. If it turns out to be inconsistent, another sequence is tried, tracking.
which is equivalent to back-
The problem of avoiding backtracking is not addressed,
neither is the problem of redundancy as understood here. In [Sickel 76], tation of the 79],
clause interconnection graphs are a represen-
total search space.
Similarly
this representation is traversed
set of substitutions. crementally,
~his
Slagle
set of substitutions is generated in-
as opposed[to [Chang,
approach does not, however,
to [Chang,
in search of a consistent
Slagle 79].
The incremental
prevent the method from a backtrack-
428
ing behavior: the issue of the action, appropriate when inconsistency is detected, is not discussed. Yet another, comprehensive approach is presented in [Kowalski 75] • Although the redundancy problem is discussed, it is presented differently from our approach, i.e. on the propositional calculus level. It is not obvious how the method suggested by [Kowalski 75] for deduction in prepositional calculus generalizes for predicate calculus. The approach, presented in this paper, follows suggestions in [Kowalski 75] and develops them into a full redundancy removal algorithm for predicate calculus. Another method, presented in [Bibel 83], is different from all the ones mentioned above because of its non-clausal representation. It also uses graphs to represent the solution space, similarly to [Kowalski 75]. However, as in [Sickel 76], backtracking may occur, but this issue is not addressed at length. In our plan based deduction system, the plan is a graph containing all the clauses currently involved in the resolution process where each clause is a node in the graph. A node consists of a key: the complementary literal selected for resolution, and its goals: all the remaining literals in the clause. The root of the graph is the original goal statement which consists of all goals. The deduction process consists of selecting clauses with complementary literals to resolve all the goals in the plan (i.e. the plan is closed). If this is accomplished and a most general unifier for the plan exists then a refutation for the goal statement has been found. If the plan is nonunifiable then conflicts are determined and removed from the plan so that the deduction process can be resumed and new clauses selected. Unfortunately, the problem of generating redundant plans is inherent to this type of deduction system. 
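To make the plan structure concrete, the graph just described might be represented as in the following minimal Python sketch. The class and field names are our own illustration; the authors' actual system is written in PASCAL and carries additional bookkeeping (most notably the graph of dynamic constraints), which is omitted here.

```python
class Node:
    """One clause in the plan: the key is the complementary literal
    selected for resolution; the goals are the clause's remaining literals."""
    def __init__(self, key, goals=()):
        self.key = key              # resolvent literal (None for the root)
        self.goals = list(goals)    # remaining literals, still to be resolved
        self.children = []          # nodes resolving this node's goals

    def add_child(self, node):
        self.children.append(node)
        return node

# The root of the plan is the original goal statement: all literals are goals.
root = Node(key=None, goals=["-P(x)", "-Q(x)"])
root.add_child(Node(key="P(a)"))    # unit clause resolving -P(x)
root.add_child(Node(key="Q(c)"))    # unit clause resolving -Q(x)
```

A unit clause contributes a key but no goals, so a plan is closed exactly when every goal in every node has a child node resolving it.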
In other words, unless some kind of restriction is placed on the conflict selection process, duplicate plans will be generated although the paths leading to these plans are unique. This overlapping of paths results from generating new plans that produce the same conflicts as those they were developed from. As there may be several different
clauses from which to resolve a goal, it is inevitable that different paths may derive the same plan. This problem is further compounded when one wishes to obtain all the refutations possible for a given goal statement and set of clauses. This is important in many Logic Programming applications, particularly when the "generate and test" paradigm [Clocksin, Mellish 82] is applied. A solution to a problem, specified by a set of clauses, is obtained by first generating a superset of solutions. Each of them is then tested for satisfiability of conditions, which extract a true solution from the superset. To accomplish this, we introduce the concept of artificial conflicts. The deduction process in our system, as previously mentioned, consists of resolving goals introduced into the plan via nonunit clauses or by processing conflicts. To reactivate the deduction process on a closed unifiable plan we have to artificially induce conflicts into this plan in such a way that all solutions will eventually be generated. However, unless some restrictions are placed on the method of selecting artificial conflicts, redundant plans will also be generated.

The problem of generating duplicate plans is the main theme of this paper, but when these problems were realized a separate but related problem concerning the efficiency of developing plans was also encountered. It was found that sometimes nodes are added to a plan which later becomes nonunifiable, but do not influence the selection of conflicts and later get deleted through the backtracking mechanism. All of these problems were encountered during the implementation of a plan based deduction system. This paper describes more completely the nature of these problems and the strategies used to solve them.
In summary, we can paraphrase these problems as the following questions: 1) Removing redundancy - how to avoid generating redundant solutions while maintaining completeness? 2) Processing criteria - how to develop a plan efficiently so that only nodes relevant to the selection of conflicts are created? 3) Artificial conflicts - how to introduce artificial conflicts on a unifiable plan so that a complete solution set can be generated with minimal redundancy? These three questions are discussed individually in sections 3, 4, and 5. In section 2, a preliminary review of terms and concepts is presented, and section 6 contains our concluding remarks.

2. TERMS AND CONCEPTS

This section deals with introducing notation used in [Pietrzykowski & Matwin 82] for a plan-based deduction system. This system acts on a given goal statement and a set of clauses, collectively called the base, and attempts to find a refutation for the goal statement via resolution.

In a preprocessing phase, a list of all the other potentially unifiable complementary literals in the base becomes associated with every literal in the base. This list is referred to as the literal's potentials, and it also contains the corresponding most general unifier for the two literals.

In the second phase of the resolution process, the dynamic processing phase, a refutation for the goal statement is attempted. This phase builds and maintains two structures: the plan, which is a graph depicting which clauses have been selected for resolution, and the graph of dynamic constraints, which records the most general unifier for the plan (i.e., records all the substitutions which have occurred).

The plan is constructed of nodes, each of which corresponds to a clause in the base and contains a key and zero or more goals. The key is the resolvent literal and the goals are the remaining literals in the clause. The only exception to this is the top node, which consists of all goals and is derived from the goal statement. The list of potentials associated with each literal is accessible by the goal representing that literal in a node.

To begin the dynamic processing phase, the original goal statement is inserted into the plan as the top node, which is considered to be the root of the graph; all other nodes become descendants of it. As nodes are inserted into the plan, every goal in them is classified as open. The resolution process consists of repeatedly selecting open goals to be processed. This is accomplished by choosing one of the goal's potentials as a resolvent, inserting the potential's clause into the plan, updating the most general unifier for the plan, and changing the status of the goal from open to closed. This process continues until there are no more goals to resolve or all of the open goals have an empty list of potentials. In the first case, if the plan is unifiable then a refutation for the goal statement has been found; otherwise a clash is said to have occurred and the conflict checker phase is activated. In the second case there is no refutation for the goal statement.

The conflict checker phase removes the conflicting nodes from the plan, restores the graph of dynamic constraints and returns back to the dynamic processing phase. This is accomplished by determining which sets of nodes can be removed so that unifiability will be restored to the plan. These sets of nodes are represented via the goals they are resolvents of, where each set of goals is called a clash and each goal in a clash is called a conflict. Each of these clashes is processed individually so that completeness will be retained. To ensure that each of these clashes is processed on the correct plan and corresponding graph of dynamic constraints, the current state of the search space is copied to disk so that it can be retrieved later [Forsythe and Matwin 83]. Processing a clash consists of selecting each goal in turn and backtracking up the graph through successive father goals until a goal with a nonempty set of potentials is found. If no such goal is encountered then the process fails and another clash is tried. If the search is successful then all descendant nodes of that goal are pruned from the plan and the graph of dynamic constraints is updated to reflect the most general unifier for the modified plan. The goals are then marked as open and dynamic processing resumed.
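The dynamic processing loop just described can be paraphrased as follows. This is a deliberately simplified Python sketch: a real plan carries a graph of dynamic constraints and full unifiers, which we replace with a toy unifiability test, and all names are our own.

```python
def dynamic_processing(open_goals, potentials, unifiable):
    """Close each open goal with its first untried potential.
    Returns the chosen resolvents, "clash" if the closed plan is
    not unifiable, or None if some goal ran out of potentials."""
    chosen = {}
    for goal in open_goals:
        if not potentials[goal]:          # empty potentials list:
            return None                   # no refutation on this path
        chosen[goal] = potentials[goal].pop(0)
    return chosen if unifiable(chosen) else "clash"

# Toy model of example 1 (section 3): the plan unifies iff both goals
# bind x to the same constant (the character between the parentheses).
unif = lambda ch: len({lit[2] for lit in ch.values()}) == 1
pots = {"-P(x)": ["P(a)", "P(b)", "P(e)"],
        "-Q(x)": ["Q(c)", "Q(d)", "Q(e)"]}
first = dynamic_processing(["-P(x)", "-Q(x)"], pots, unif)  # P(a)/Q(c): clash
```

The "clash" outcome is the trigger for the conflict checker phase; the None outcome corresponds to exhausting a goal's potentials.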
3. REMOVING REDUNDANCY
Consider the following set of clauses in example 1:

-P(x)-Q(x)
P(a)
P(b)
P(e)
Q(c)
Q(d)
Q(e)

Example 1.

Figure 1 shows a trace of the deduction process for the base in example 1, of which the first clause is the goal statement. In this figure each plan is represented by the name of the goals above the constants of the complementary literals to which the variables of each goal are bound. In braces alongside the constant is a list of constants belonging to the list of potential complementary literals which could have been chosen instead. Each line under a goal-complementary literal pair leads to a new plan generated by replacing the complementary literal with the first potential inside the braces. If the set of potentials is empty then this leads to a failure, indicated by a bar at the end of the line. If a line extends from more than one goal-complementary literal pair it means that both goals belong to the same clash and the two literals are replaced simultaneously. This shorthand notation is used as we are interested in the development of the total search space rather than the individual plan.

From this figure, we can see that the first plan is nonunifiable. The set of clashes associated with it consists of two elements, each containing one conflict. One element consists of the goal which introduced P(a) and the other consists of the goal which introduced Q(c). The left branch indicates that the conflict introducing P(a) was chosen and replaced with a new node containing P(b). The right branch indicates that Q(d) replaces Q(c) as the new node. Both of the two new plans contain similar conflict sets which when resolved lead to further plans. All the possible plans which could be developed for this set of clauses are as shown. Notice that in figure 1, six branches lead to the same refutation. In fact the left subtree of every right branch duplicates the right subtree of the corresponding left branch. The inefficiency of this strategy (although it is complete) is unacceptable for any practical implementation, and this section presents an algorithm to remove this inefficiency.
Figure 1. A trace of the deduction process for example 1.

A possible solution to this problem, suggested in [Bruynooghe 83], is to give each predicate an ordering which controls which alternatives to that predicate are permitted to be selected. This ordering restricts lower order predicates from generating solutions already obtained by a higher ordered one. We have employed a similar strategy, but instead of giving each individual predicate an ordering we have given each clash in the set of clashes an ordering (it is possible that a conflict can occur in more than one clash though the clashes, themselves, are distinct). Initially each clash in a set is given a unique order number, which becomes the current order number as each particular clash is processed. If the resulting plan produces a new set of clashes then all the similar elements in the new set are given the same order number. By similar we mean any clash which contains the same conflicts as a clash in the generating plan. All the clashes in the new set which obtain an order number greater than the current order number are discarded. As shown in figure 1, it is common for most of the resulting clashes to be similar to the clashes of the plan it was derived from. In the special case where nonsimilar clashes are generated, each of these is given an order number (bounded by the current order number) which guarantees completeness of the search space. The result of applying this strategy to the base of example 1 is shown in figure 2. The two clashes of one conflict each are initially ordered as 1 and 2. The figure shows that the set of clashes of any plan derived from processing a clash with an order number of 1 is restricted to elements whose resulting order number is not greater than one. By censoring the paths leading from any given plan through this algorithm, we can eliminate the overlapping of the search space. Graphically, this can be interpreted as removal of the redundant left subtree of every right branch.

In order to measure the amount of improvement by this strategy we introduce the idea of counters for the number of arcs in the graph we traverse, delete and insert. By arcs we mean every goal-key pair in the graph. For the search space using the original strategy as shown in figure 1 we determined that the number of traversals was 46, the number of insertions 20 and the number of deletions 18. Comparatively, for figure 2, the numbers of traversals, insertions and deletions are 22, 10 and 8, respectively. Also, the number of identical refutations found in figure 2 is zero as opposed to five in figure 1.
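The pruning rule just measured can be stated compactly: a new clash inherits the order number of the similar clash (the one with the same conflicts) in the generating plan, and it is discarded when that number exceeds the order number currently being processed. The following Python sketch is our own formulation of that filter, not the authors' code:

```python
def surviving_clashes(new_clashes, generating_orders, current_order):
    """Keep a new clash only if the order number it inherits from the
    similar clash of the generating plan does not exceed the current one.
    Nonsimilar clashes default to the current order number (kept)."""
    kept = []
    for clash in new_clashes:
        order = generating_orders.get(frozenset(clash), current_order)
        if order <= current_order:
            kept.append((sorted(clash), order))
    return kept

# Example 1's first plan: two one-conflict clashes, ordered 1 and 2.
orders = {frozenset({"-P(x)"}): 1, frozenset({"-Q(x)"}): 2}
# Processing the order-1 clash yields a plan with similar clashes;
# the clash inheriting order 2 is pruned, removing the duplicate subtree.
kept = surviving_clashes([{"-P(x)"}, {"-Q(x)"}], orders, current_order=1)
```

Processing the order-2 clash, by contrast, keeps both inherited clashes, which is exactly the asymmetry that removes the left subtree of every right branch.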
Figure 2. A trace for example 1 using the improved algorithm.

We now explain the ordering strategy for situations where a nonsimilar set of clashes is generated. There are essentially two types of these situations: when a new open goal is developed, thus introducing a totally new conflict set, and when the same conflicts get rearranged into different clashes.
When a new open goal is developed, clashes are given an order number dictated only by the way in which they are arranged. This method of ordering is obviously used on the original plan, as the previous conflict set is empty and all goals are initially open. This situation can also occur when a clash has just been resolved which creates a unifiable plan, causing the deduction process to choose a new open goal. This phenomenon occurs because as soon as a conflict is determined the plan development process is interrupted and the conflict-checker phase initiated. As a result, the development of all the other open goals is suspended until the conflict is resolved (see section 4).

It is possible that a new set of clashes can be generated which cannot be paralleled to the previous set for an identical ordering. If this happens, any element in the new set for which there is no corresponding clash in the old set is given an order number high enough to maintain completeness. In practice this usually means giving these elements an order number equal to the current order number. This approach may lead to generating redundant solutions, but we have not yet discovered a more efficient algorithm that will still generate a complete solution set. However, it may be possible to minimize the inefficiency by ordering the elements in the conflict set using heuristic strategies which would reduce the number of redundant solutions generated. For an example of how this strategy is implemented consider the clauses of example 2:

P(x)Q(x)R(x)
-P(a)
-P(e)
-Q(a)
-Q(b)
-Q(e)
-R(b)
-R(e)

Example 2.
An outline of the resolution process for the clauses in example 2 is given in figure 3. In the first plan of figure 3, predicates P and Q are in conflict with predicate R. Suppose the goals belonging to P and Q are given an order number of 2 and the goal belonging to R an order number of 1. New potentials are selected for P and Q and the second plan is developed. In this second plan the predicates Q and R are in conflict with predicate P, thus there is no parallel ordering between the goals in the two plans. Consequently, both elements in the new conflict set are given an order number of 2, the current order number. Resolving these two conflicts leads to a failure and a refutation. If we resolve the second conflict of the first plan we find that after generating a nonunifiable plan this path leads to a failure. It can be shown that the work done by this strategy in this example is less than half of what a linear backtracking algorithm would do.
Figure 3. A trace for example 2 using the improved algorithm.
4. PROCESSING CRITERIA

In this section we would like to address the question of when to interrupt the processing of open goals and begin resolving clashes. Our initial strategy was to allow the resolution process to resolve all the open goals (i.e. generate a closed plan) before it began processing conflicts. However, analysis of this
approach showed that there was much work that was wasted by developing goals that were later discarded. As an example consider the following base of clauses (this example shows only one of many situations where developing a closed plan is inefficient):

-P(x)-Q(x,y)
P(a)
Q(c,z)M(z)
Q(b,z)M(z)
Q(a,z)M(z)
-M(z)R(x,z)S(y,z)T(x,y)
-R(c,e)
-S(d,e)
-T(c,d)

Example 3.

If we consider the resolution process applied to this set of clauses we would find that the first attempt to refute the goal statement leads to a conflict between the term a in literal P and the term c in literal Q. If we develop the whole plan we would also resolve literals M, R, S, and T. However, to resolve the conflict we need only to replace the clause containing c with another alternative (the clause containing literal Q(b,z)). In doing this we would remove the arc from -Q(x,y) to Q(c,z), effectively removing the arcs containing the complementary pairs of literals M, R, S and T, and replace Q(c,z) with Q(b,z). This in turn leads to a similar situation where literals M, R, S and T are again resolved and deleted from the plan by replacing Q(b,z) with Q(a,z). Once more, literals M, R, S and T are resolved. This gives a closed plan without any conflicts (i.e. a refutation), so the resolution process is finished. If we determine the work done using this strategy we find that 13 insertions and 2 deletions were made before a refutation was found.
However, if we interrupt the plan development process as soon as a conflict is encountered we would have avoided resolving literals M, R, S and T. In the third attempt, there are no conflicts generated, so the literals M, R, S and T are resolved. Using this approach we find that only 7 insertions and 2 deletions are made, which is roughly half of the work that our original strategy does. If in example 3 we replace the literal Q(a,z) with Q(d,z), we would find this new set of clauses has no solution. In this case the process would terminate after finding that Q(d,z) leads to a conflict in which there are no more alternatives. Consequently, the literals M, R, S and T would never be resolved and the total number of insertions would be 4 and the number of deletions 2.
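The two processing criteria can be contrasted with a toy depth-first development loop. This sketch uses our own highly simplified model of example 3, where resolving Q(c,z) immediately conflicts with P(a); the insertion counts it produces illustrate the idea only and do not reproduce the paper's exact figures.

```python
def develop(goals, subgoals, conflicting, interrupt):
    """Count node insertions while developing a plan depth-first.
    With `interrupt` set, development stops at the first conflicting
    resolvent, so its would-be descendants are never inserted."""
    insertions, stack = 0, list(goals)
    while stack:
        g = stack.pop()
        insertions += 1
        if g in conflicting and interrupt:
            break                         # hand over to the conflict checker
        stack.extend(subgoals.get(g, []))
    return insertions

subs = {"Q(c,z)": ["M", "R", "S", "T"]}   # resolving Q(c,z) opens M, R, S, T
eager = develop(["P(a)", "Q(c,z)"], subs, {"Q(c,z)"}, interrupt=False)
lazy = develop(["P(a)", "Q(c,z)"], subs, {"Q(c,z)"}, interrupt=True)
# eager develops the whole M-R-S-T subtree; lazy stops at the conflict
```

The saving grows with the size of the subtree hanging below the conflicting goal, which is the point of the interruption criterion.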
5. ARTIFICIAL CONFLICTS

This section deals with the question of how to generate a complete solution set. The deduction algorithm is designed to process open goals, obtained by resolving literals contained in non-unit clauses or by removing conflicts, until a refutation is found. The idea behind artificial conflicts is that by designating specific goals in a closed unifiable plan as conflicts, the deduction process can be continually reactivated until all the solutions are found. The problem we would like to address is how to arrange the goals which are selected as artificial conflicts into clashes so that a complete solution set with minimal redundancy can be generated.

To obtain artificial conflicts from a closed unifiable plan we simply label each goal that introduces a unit clause as a conflict. This ensures that all goals with potentials will eventually be considered, because of the nature of the backtracking strategy, which checks each goal on the path from the selected goal to the root of the plan. The problem is how to arrange these conflicts into clashes so that every possible solution will be derived. The first approach which comes to mind is to put each conflict into a separate clash. This strategy will obviously guarantee completeness, as each path will eventually be backtracked. The effect of applying this strategy to the clauses of example 4, below, is shown in figure 4. (In this figure only the plans which represent solutions are shown.)

-P(x)-Q(x)-R(y)
P(a)
P(b)
Q(a)
Q(b)
R(c)
R(d)

Example 4.

In figure 4 we see that by selecting every goal introducing a unit clause as the only conflict in a clash, we obtain three clashes for the first solution space generated. Resolving each of these clashes gives three more solutions of which only two are unique. Applying this strategy to each of these solutions results in four more solutions of which only one is unique. So out of eight solutions generated four are redundant.
Figure 4. A complete solution set for example 4.
To derive a more efficient algorithm for this clash selection process we must first consider the sources of redundancy. The major cause of duplication is the lack of consideration for bound constants. If we examine figure 4 more closely we see that in the first solution the predicates P(a) and Q(a) are bound together through the variable x in the clause -P(x)-Q(x)-R(y). Resolving the clash containing P leads to the same solution as resolving the clash containing Q. This is because the binding between P and Q implies that to replace P one must also replace Q and vice-versa. Thus the first improvement we can make to the clash selection algorithm is to place all the goals bound through a variable together into one clash. This ensures that both predicates will be replaced at the same time. If a goal is bound to two different goals through two different bindings, which are not themselves bound together, then two separate clashes must be created with the common goal contained in both. This is necessary to maintain completeness by allowing each binding to be processed individually. The second cause of redundant solutions is the same as that described in section 3. Essentially, other duplications can occur because all the potentials associated with one clash are systematically processed with all the potentials of a second clash. Then the potentials for this second clash are processed with all the potentials of the first clash. (This is the duplication of subtrees phenomenon described above.) In section 3, we described a solution to this problem by introducing the idea of ordering the clashes in the set. This caused the deduction process to keep from choosing combinations of potentials and goals that had already been resolved. Applying this concept of ordering clashes to artificial conflicts allows the deduction procedure to prevent duplicate solutions from being generated.
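The first improvement, putting all goals linked by a shared variable binding into one clash, amounts to computing connected components over the binding relation. The Python sketch below is our own union-find formulation; note that it deliberately ignores the refinement, mentioned above, where a goal shared by two independent bindings must appear in two clashes.

```python
def clashes_from_bindings(goals, bindings):
    """Group artificially conflicting goals so that goals bound
    through a common variable are always replaced together."""
    parent = {g: g for g in goals}

    def find(g):                      # root of g's group
        while parent[g] != g:
            g = parent[g]
        return g

    for a, b in bindings:             # each binding merges two groups
        parent[find(a)] = find(b)
    groups = {}
    for g in goals:
        groups.setdefault(find(g), []).append(g)
    return sorted(sorted(members) for members in groups.values())

# Example 4: P(a) and Q(a) share x; R(c) binds y independently.
clashes = clashes_from_bindings(["P(a)", "Q(a)", "R(c)"],
                                [("P(a)", "Q(a)")])
```

For example 4 this yields two clashes instead of three, which is where the reduction from eight generated solutions toward the four unique ones begins.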
In other words, we wish to extend this concept of ordering clashes within a single solution search space to apply to the complete search space. That is, as new clashes of artificial conflicts are generated they are compared with the previous set of artificial conflicts and given the appropriate ordering. Any clash in the new set given an order number greater than the element which generated the set is discarded.

Adding the above two strategies to the clash selection process for artificial conflicts produces an algorithm which will generate a complete solution set with minimal redundancy. The result of applying this new strategy to the clauses of example 4 is shown in figure 5. One can see that a complete solution set is generated but without the redundancy of the original algorithm.
Figure 5. A complete nonredundant solution set for example 4.
CONCLUSION

An implementation of a plan based deduction system has now been completed. It involves some 6000 lines of PASCAL code and runs under CMS on an AMDAHL 470/V5.

Three important problems, encountered during early experimentation with this plan-based deduction system, have been presented. We have shown, using simplified examples, solutions to these problems. However, the simplicity of the examples should not be misleading. They extract important experience gained during the usage of our system on larger logic programs, such as the graph-coloring problem used as illustration in [Pereira & Porto 80], or the Huffman-Clowes theory of polyhedral scenes, suggested to us by M. van Emden.
Reviewing the results of experimentation and suggested implementation strategies, we feel that our research will lead to a practical and efficient deduction system. Our emphasis on attempting to control the system so that it avoids generating redundant solutions is particularly significant. Without some kind of constraint, the system tends to generate an unacceptably large number of identical solutions, which makes it impractically slow and inflates its memory requirements. The problem lies with imposing a constraint which does not restrict the system from generating a complete solution set. Completeness is an important consideration when applying an automated deduction system to cope with the intensional clauses of a large data base. We believe that the strategy outlined in this paper provides such a constraint, where completeness (of [Matwin & Pietrzykowski 83]) is preserved and redundancy is significantly decreased.

An open and interesting question, however, remains. During deduction the system may proceed either depth-first or breadth-first when developing open goals. Is the choice of either strategy relevant for efficiency (if yes, in what way?) or is this dependent upon some topological properties of the plan being asserted?

REFERENCES

[Bibel 83] Bibel, W., "Matings in Matrices", Communications of the ACM, Vol 26, pp. 844-852, 1983.

[Bruynooghe 83] Bruynooghe, M., "Deduction Revision by Intelligent Backtracking", Universidade Nova de Lisboa, Research Report, July 1983.

[Chang and Slagle 79] Chang, C.L. and Slagle, J.R., "Using Rewriting Rules for Connection Graphs to Prove Theorems", Artificial Intelligence, Vol 12, pp. 159-180, 1979.

[Clocksin and Mellish 82] Clocksin, W.F. and Mellish, C.S., "Programming in Prolog", Springer Verlag, 1982.

[Forsythe & Matwin 83] Forsythe, K. and Matwin, S., "Copying of Multi-level Structures in a PASCAL Environment", submitted to Software - Practice and Experience, 1983.

[Kowalski 75] Kowalski, R., "A Proof Procedure Using Connection Graphs", Journal of the ACM, Vol 22, No 4, pp. 572-595, 1975.

[Matwin & Pietrzykowski 83] Matwin, S. and Pietrzykowski, T., "Intelligent Backtracking in Plan-Based Deduction", submitted to IEEE Trans. on Pattern Analysis and Machine Intelligence.

[Pereira & Porto 80] Pereira, L.M. and Porto, A., "Selective Backtracking for Logic Programs", Procs. of CADE-5, pp. 306-317.

[Pietrzykowski & Matwin 82] Pietrzykowski, T. and Matwin, S., "Exponential Improvement of Exhaustive Backtracking: A Strategy for Plan-Based Deduction", Procs. of CADE-6, pp. 223-239.

[Sickel 76] Sickel, S., "A Search Technique for Clause Interconnectivity Graphs", IEEE Trans. on Computers, Vol 25, No 8, pp. 823-835, 1976.
A Programming Notation for Tactical Reasoning

David A. Schmidt
Computer Science Department, Edinburgh University, Edinburgh, Scotland*
Abstract:
A notation for expressing the control algorithms (subgoaling strate-
gies) of natural deduction theorem provers is presented.
The language provides
tools for building widely known, fundamental theorem proving strategies and is independent of the problem area and inference rule system chosen, facilitating formulation of high level algorithms that can be compared, analyzed, and even ported across theorem proving systems.
The notation is a simplification and
generalization of the tactic language of Edinburgh LCF.
Examples using a
natural deduction system for propositional logic are given.
0. Introduction

Logical systems of natural deduction (Pra) have demonstrated their useful-
ness in the development of traditional problem areas in formal logic and mathematics.
Their application to computing related areas such as formal semantics
(Hoa,Plo), data type specification (Gut), and program development (Cos,Nor) emphasizes the importance of understanding the notion of derivation and the strategies available for constructing proofs.
Traditionally, these concerns
have fallen in the realm of automated theorem proving (Ble,Boy,Coh,Gor), but the emphasis in this mechanized world often falls upon the number or difficulty of the theorems proved, rather than the style in which they are proved. Further, the boundaries between the kind of logical system (natural deduction versus axiomatic), problem area of interest (first order logic, group theory, set theory, etc.), and the proof discovery strategy are often poorly delineated.
If the
theorem proving art is to be advanced and its most elegant ideas applied to new problem areas such as program development, the distinctions between these levels must be made clear, and the methodologies underlying proof discovery need to be expressed in a machine independent, understandable way. This paper describes an initial version of a notation for expressing control algorithms for natural deduction theorem provers.
The notation is independent of the specific problem area and rule system chosen, but it supplies the fundamental tools for building useful subgoaling strategies from the inference rules supplied. The addition of control structures and a scoping mechanism allows definition of realistic algorithms. The language is a simplification and generalization of the tactic language of Edinburgh LCF (Gor).

*Present address: Computer Science Department, Kansas State University, Manhattan, Kansas 66506
After a brief review of natural deduction in section 1, section 2 outlines the basic features of Edinburgh LCF. The new notation is described in section 3, and section 4 presents an example algorithm for theorem proving in a subset of propositional logic.

1. Background

The form of natural deduction used is described in Prawitz (Pra); the examples in this paper use propositional logic (Lem), although any other problem area would do as well. Given a language L built up from propositions P, Q, R, ..., and logical symbols ∧, ∨, ⊃, ¬, use the usual syntax rules to build the language of propositional logic. Arbitrary propositions (also called formulas) are denoted by A, B, C, ..., and lists of propositions are represented by Γ, Δ, Σ, ... . The rule schemes for inferring new facts from already established ones are:
    ∧I:   from A and B, infer A ∧ B
    ∧El:  from A ∧ B, infer A
    ∧Er:  from A ∧ B, infer B
    ∨Il:  from A, infer A ∨ B
    ∨Ir:  from B, infer A ∨ B
    ∨E:   from A ∨ B, a derivation of C from (A), and a derivation of C from (B), infer C
    ⊃I:   from a derivation of B from (A), infer A ⊃ B
    ⊃E:   from A ⊃ B and A, infer B
    ¬I:   from a derivation of ff from (A), infer ¬A
    ¬E:   from a derivation of ff from (¬A), infer A
where ff abbreviates any formula D ∧ ¬D. A proof of a proposition C from Δ is a tree whose leaves are the members of Δ and whose root is C. Each internal connection between parent and children nodes is justified by one of the inference rules. A rule with a parenthesized formula (such as ⊃I) causes the removal (discharge) of that parenthesized leaf node when applied to the tree. As an example,
[proof tree: a deduction with leaves (P∧Q) ⊃ R and P and root P ∧ (Q ⊃ R)]

is a proof that (P∧Q) ⊃ R and P infer P ∧ (Q ⊃ R). Write

    (P∧Q) ⊃ R, P ⊢ P ∧ (Q ⊃ R)

to abbreviate the tree; this expression is called a theorem.
Note that the order in which the nodes of the tree were added does not affect the final result. The following two results hold for all natural deduction systems:

    i)  if Γ ⊢ B and B, Δ ⊢ C, then Γ, Δ ⊢ C.
    ii) if Γ ⊢ C, then Γ, A ⊢ C.

Call i) the cut principle and ii) the weakening principle. Since the proofs of both are constructive, there exist associated functions cut and weaken which build the deduction tree of the consequent theorem from the deduction tree(s) of the antecedent theorem(s).
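As a concrete illustration, the two principles can be sketched as operations on sequents. The Theorem representation and the function names below are illustrative assumptions for this sketch, not the paper's actual definitions:

```python
# Sketch only: a theorem as a sequent (assumption set, conclusion).
# Theorem, cut, and weaken are hypothetical names for illustration.
from typing import NamedTuple


class Theorem(NamedTuple):
    assumptions: frozenset  # the assumption formulas (Gamma)
    conclusion: str         # the formula proved from them


def cut(t1: "Theorem", t2: "Theorem") -> "Theorem":
    """Cut principle: from Gamma |- B and B, Delta |- C build Gamma, Delta |- C."""
    assert t1.conclusion in t2.assumptions, "cut formula must appear in t2's assumptions"
    return Theorem(t1.assumptions | (t2.assumptions - {t1.conclusion}),
                   t2.conclusion)


def weaken(t: "Theorem", a: str) -> "Theorem":
    """Weakening principle: from Gamma |- C build Gamma, A |- C."""
    return Theorem(t.assumptions | {a}, t.conclusion)
```
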
These functions will be useful for tree assembly.

2. LCF

Logic for Computable Functions (Gor) is a software tool for developing natural deduction style proofs. A notable feature of the system is its functional depiction of deduction. Formulas are assigned the data type form and are built using formulas and logical connectives. Those formulas which are taken as axioms are given type thm (theorem). Inference rule schemes are functions which produce results of type thm from their arguments. For example,

    P                           has type form
    P ∧ Q                       has type form
    ⊢                           has type form → thm
    ⊢(P ∧ Q)                    has type thm
    ⊃E                          has type thm × thm → thm
    ⊃E(⊢(P ⊃ (P∧Q)), ⊢(P))      has type thm.
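The functional reading of rules can be made concrete with a small sketch in which formulas are encoded as nested tuples. Every name here (Thm, axiom, imp_e) and the tuple encoding are illustrative stand-ins, not LCF's own definitions:

```python
# Sketch: formulas as tuples, e.g. ("imp", A, B) for A => B and
# ("and", A, B) for A /\ B.  All names are illustrative.

class Thm:
    """A theorem: a sequent with assumptions and a conclusion."""
    def __init__(self, assumptions, conclusion):
        self.assumptions = frozenset(assumptions)
        self.conclusion = conclusion


def axiom(formula):
    """form -> thm: admit a formula, yielding the trivial sequent A |- A."""
    return Thm([formula], formula)


def imp_e(major: Thm, minor: Thm) -> Thm:
    """=>E as a function thm x thm -> thm: from A => B and A, infer B."""
    op, a, b = major.conclusion
    assert op == "imp" and minor.conclusion == a, "rule does not apply"
    return Thm(major.assumptions | minor.assumptions, b)


# A forwards proof is a nested application of rule functions:
pq = ("and", "P", "Q")
t = imp_e(axiom(("imp", "P", pq)), axiom("P"))
# t represents the sequent  P => (P /\ Q), P |- P /\ Q
```
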
Expressions of type thm are written in their sequent form, e.g., P ⊃ (P∧Q), P ⊢ P∧Q. The two expressions of type thm seen above are short examples of (forwards) LCF proofs, which are constructed by nested applications of the inference rule functions. This provides an element of security to the system, for only through the use of the axioms and the inference rules can new theorems be created.

The LCF system rises above its role as a mere proof checker due to its ability to perform goal directed (backwards) proofs, building a deduction tree from its root (its goal) to its leaves, i.e., assumptions. The basic approach is to take a goal, Δ ⊢? C, where Δ is a set of assumptions and C is the desired conclusion, and decompose C into a list of subgoals Δ ⊢? C1, ..., Δ ⊢? Cn, such that if proofs exist for C1, ..., Cn then a proof of Δ ⊢? C is also constructable. A function which decomposes a goal is called a tactic.
The functional formalization of goal directed proof is defined in LCF as:

    goal:        form list × form
                 -- the assumption formula set Δ and the desired conclusion C;
    tactic:      goal → (goal list × validation)
                 -- the decomposition step of goal Δ ⊢? C into its subgoals, plus a thm producing function;
    validation:  thm list → thm
                 -- the thm producing function which produces Δ ⊢ C from Δ ⊢ C1, ..., Δ ⊢ Cn, thus justifying the decomposition.
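Rendered as Python type aliases, these three types (plus the identity tactic as a first example) look roughly as follows; the concrete encodings, such as Form being a bare string, are assumptions made for this sketch:

```python
# Sketch of the LCF goal/tactic/validation types; encodings are assumed.
from typing import Callable, List, Tuple

Form = str                               # a formula, kept abstract here
Goal = Tuple[List[Form], Form]           # (assumption list Delta, conclusion C)
Thm = Tuple[List[Form], Form]            # a proved sequent Delta |- C
Validation = Callable[[List[Thm]], Thm]  # rebuilds Delta |- C from the subproofs
Tactic = Callable[[Goal], Tuple[List[Goal], Validation]]


def idtac(goal: Goal) -> Tuple[List[Goal], Validation]:
    """Identity tactic: one subgoal (the goal itself), identity validation."""
    return [goal], lambda thms: thms[0]
```
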
Using angle brackets to enclose lists and parentheses to bind pairs, here are the definitions of some LCF-style tactics:

    IMPTAC:  Δ ⊢? A ⊃ B  ↦  (⟨Δ,A ⊢? B⟩, ⊃I)
    ANDTAC:  Δ ⊢? A ∧ B  ↦  (⟨Δ ⊢? A; Δ ⊢? B⟩, ∧I)
    TRIV:    Δ,A ⊢? A    ↦  (⟨⟩, λ⟨⟩. triv⟨Δ; A⟩)
    IDTAC:   Δ ⊢? A      ↦  (⟨Δ ⊢? A⟩, λ⟨t⟩. t)

For example, ANDTAC accepts as an argument a goal Δ ⊢? C. If C has structure A ∧ B, the result is the list of subgoals ⟨Δ ⊢? A; Δ ⊢? B⟩. This decomposition is justified by the rule ∧I. Note that if C is not a conjunction, the tactic fails (generates an exception). The validation λ⟨⟩. triv⟨Δ; A⟩ maps an empty list of theorems to the axiom Δ,A ⊢ A. IDTAC is the identity map for goals.
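ANDTAC and TRIV can be sketched in the same Python style as above, with conjunctions encoded as tuples ("and", A, B); the names, encodings, and use of exceptions for failure are illustrative choices, not LCF's code:

```python
# Sketch of two tactics; a goal is an (assumption list, conclusion) pair.

def andtac(goal):
    """Decompose Delta |-? A /\ B into Delta |-? A and Delta |-? B (rule /\I)."""
    delta, c = goal
    if not (isinstance(c, tuple) and c[0] == "and"):
        raise ValueError("ANDTAC: goal is not a conjunction")
    a, b = c[1], c[2]

    def validation(thms):
        # From Delta |- A and Delta |- B, produce Delta |- A /\ B.
        (_, ta), (_, tb) = thms
        return (delta, ("and", ta, tb))

    return [(delta, a), (delta, b)], validation


def triv(goal):
    """Close a goal Delta, A |-? A: no subgoals; validation yields the axiom."""
    delta, c = goal
    if c not in delta:
        raise ValueError("TRIV: conclusion is not an assumption")
    return [], lambda thms: (delta, c)
```

Running andtac on a goal Δ ⊢? P∧Q, closing both subgoals with triv, and feeding the resulting theorems back through andtac's validation reconstructs the forwards proof of Δ ⊢ P∧Q, mirroring the decomposition-then-justification cycle described in the text.
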
Systematic decomposition of goals is performed by composing tactical steps such as these until all subgoals reduce to empty subgoal lists. The forwards proof corresponding to the decompositions is obtained by applying the validation functions in the order inverse to that of the composition of the corresponding tactics. To aid in tactic (and validation) composition, tactic combinators known as tacticals are used. In the descriptions to follow, let g be a goal and @ denote the list append operator. The four tacticals used most frequently are:

i) THENL: tactic × tactic list → tactic

THENL performs sequencing of tactics. An expression t THENL ⟨t1, ..., tn⟩ applies t to its input goal g, then applies each ti to the corresponding goal gi