This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
has t for */#c-value while K ...Kr' ...&\ where r', ...,r(r) are respectively p' or Np'', ...,p(v) or Np{v) but the set / , ...,rW is different from the set qf, ...,#(j;), h a s / for *JKC-value, on the same replacement. Now form the disjunction of all K ...Kq' ...q(v) for just those sets q', ...,q(v) for which PROP. } there may have been several cancellations of (Eg) $'{£}. K . lid' we can find a ^K-proof of (p. by Ncj), where " be the conjunction of the following three /^-statements: {A%",..., gW) (A^') (EC, C) W, £2] = * if and only if £ is the place number of the ordered pair <£1? £2) m o u r ordering of all ordered pairs of natural numbers, then p',p", ...,p^K+2) clearly satisfy <j>". We now show that if are the same. If o) is A00-true then ({){OL} v OJ and 0{/?} v &> are Aoo-true. Since a = /? v ^ is A00-true then one of a = /? or w is A00-true, so in either case A 00 -truth is preserved. Clearly R 2 preserves A 00 -truth. If a 4= /? v co is A00-true then either o) is Aoo-true or oc and /? determine distinct numerals. In either case Sot 4= S/3 v a; is A00-true. It is clear that the remodelling rules preserve A 00 -truth from (iv) of the definition of A 00 -truth. Similarly for the building rules using (iii) and (iv) of the definition of A 00 -truth. Thus finally an Aoo-theorem is Aoo-true. The importance of the result that a closed numerical term determines a unique numeral is shown by the following: suppose that ot determines v and K where v 4= K and that j3 determines v only and y determines K only. Consider _ ^ _ a is an AoO-theorem by Prop. 3. If , for 1 ^ n < /c, and ^ | > , K , K ] = 0M, in the above notation, if K is positive. We have pt[v, K, TT] = 0 for n > K, also jp£[O, A:, TT] = pt[v, 0, TT] = ^^[P, AC, 0] = 0. pt is called a coordinate function. This formalizes the informal discussion of coordinate functions given earlier in this chapter. D154 = <9, 9, 9, 9, 9, 9>. D157 (vV)
2.13 Consistency and completeness of 0>c
57
T[H', ..., JS^] is t. This is a ^-statement which has J(c-value t just in case F reduces to t. Thus ^ c is functionally complete with respect to J(c, 2.14 Decidability 15. &c is decidable It is an effective process to decide whether a ^-statement is Jtc-valid, hence it is an effective process to decide whether a ^-statement is an ^#c-theorem. Furthermore if a ^-statement is found to be a 0>ctheorem by the test of .^-validity then it is an effective process to supply the ^a-proof. For all we need do is to put the ^-statement into conjunctive normal form, this is easily tested for .^-validity and if the test is affirmative it is a routine matter to supply a ^c-proof. We can obtain this result in another way. We note that apart from l a the length of a ^-statement increases as we proceed down a ^ o proof-tree and la leaves the length unaltered, repeated use of l a will reproduce a previous formula so use of l a is limited. Thus given a ^^-statement <J) we can construct all possible deduction-trees with base 0. We can then decide if any of these trees are ^o-proof-trees of (j) or whether each fails to be a ^ c -proof-tree of (j). Thus we can decide if ^ is a ^-theorem and if so we can find a ^ c -proof for it. For instance DNDDNpp'Npp', call it #, fails to be a ^-theorem. It is of the form DNDcfrtyo) with DNpp' for (j) and Np for i/r and p' for 0). PROP.
This can arise (i) . , or (iii) from v from I a ^ , ^ , , or v(ii) ; from I l a ^^^ DNDcfiiJfG) DND(f>i/ra) v ' —J;
,.
only. Case (i) DcoNDcfriJr can arise from v by l a
which brings us back to where we started or from ^ .-T^ , . I l a only. can arise from
^
- 116 only. Now iV^ is NDNpp' which
can only arise from NNp Npr ^rt^^r—7- Ho. l h e second upper formula fails to arise from any other ^-statement by the ^ o -rules and fails to be a ^ c -axiom. Thus case (i) fails to provide a ^ o -proof of %. Case (ii) o) isp' and so fails to provide a ^ o -proof of x- Case (iii) DNcjxx) can only arise from ^ T .
DN(j)(o
l a or -^AT, I l a this we can reject as before since o) is rp'. J DNcpoj
58
Ch. 2 Propositional calculi
Do)N(j) is Dp'NDNpp' and this can only arise from DNtpoj by la, which we started vwith, or from
NDNpp'
^ — , II a, NDNpp' can only arise from
NNp Np'
, 116 and this we can reject as before. Lastly DNcfra) is
DNDNpp'p' and, apart from cases already considered, can only arise from
DNNpp' DNp'p' DNDNpp'p' DNp'p' can only arise from an axiom by la, but DNNpp' can only arise from Dpp' or Dp'p each of which fails to be a ^-axiom. Hence case (iii) fails to produce a ^ o -proof of x- Case (iii) is more easily dealt with if we consider DNi/roj which is DNNpp'. A system with 16 as an independent rule would fail to be decidable in this way. We can add any ^-statement as an extra axiom without causing every ^-statement to be a ^-theorem, provided the statement contains a connective. A variable would then fail to be a ^-theorem. The resulting system would however fail to have many of the properties of ^ c . 2.15 Truth-tables The simplest way of testing a ^-statement for being a tautology is by the use of truth-tables. Suppose that a .^-statement (j) is written in terms of D, N, K, C and B; we replace D, K, C and B by two place functions, d, k, c and 6 respectively, and replace N by a one-place function n. We then replace the propositional variables by elements t a n d / in any manner, always replacing different occurrences of the same propositional variable by the same element. We then evaluate the resulting composition of functions by using the following truth-tables: p t t
f f
st
dpp' t t t
f
f
p' t
V t t
P' kpp' t t
f
f f
f
t
f f f
V t t
f f
P cpp' t t
f
t
f
f
t t
p t t
f f
P' bpp> t t
P t
f
f
t
f
f f
np
st
t
These tables give the values of the five functions for values of the arguments on the same line.
2.15 Truth-tables
59
For example let us test CCKpp'p"CpCp'p" for tautology. This Restatement is a conditional, let us see if it is possible to make it take the value/ by suitable values, t or/, given to p, p' and p". The only way of making a conditional take the value/ is to make the first component t and the second component/. Thus we want to make GKpp'p" take the value t and CpGp'p" take the value/. Both of these are again conditionals so we want p to have the value t and Cp'p" have the value/, this requires p' to have the value t and p" to have the value/. This then fixes the values of p, p' and p" in order that our original statement take the value/. Putting these values for p, p' a>ndpff in the original statement we easily calculate that it is t, thus it must always be t, and so is a tautology. We put the working down as follows: CCKpp'p'VpCp'p"
f t
f
tf ttf t
f The first line with / under the first occurrence of C indicates that we want to make the whole statement take the value/. To do this we must make the first component take the value t and the second component take the value/, this is indicated by placing in the second line t under the second occurrence of C and/ under the third occurrence of 0. Since the third occurrence of C is to take the value/ then its first component must take the value t and its second component must take the value/; this is indicated by placing t in the third line under the second occurrence of p, and/ in the third line under the fourth occurrence of C. We have now found values that p, p' and p" must have if our statement is to take the value/. Because p' must have the value t and p" must have the value/ since Cp'p" is to have the value /, this is centred in the fourth line. In thefifthline we enter the values of p, p' and jp" under the first occurrence of these symbols. In the sixth line we evaluate Kpp1 andfindthat it is t, in the seventh line we evaluate the first component of our original condi-
60
Ch. 2 Propositional calculi
tional and find that it i s / , but we had found in the second line that it should be t, hence it is impossible to make the original conditional take the value/, hence it always takes the value t and so is a tautology. The working can be put down on one line, since except for the last entry we have always used different columns. Thus: CCKpp'p"CpCp'p"
fttttfftft
f
f 176 555 2334 4 The numerals indicate the order in which the letters t a n d / are put in. The final line puts both t a n d / under K indicating an impossibility. Since SPC is decidable we could have another formulation for it, namely: the axiom-scheme 'a tautology is an axiom5 and dispense with rules. The conditions for being a formal system are satisfied because we have a test for being an axiom. Sometimes it is simpler to evaluate directly for all possible argument values, we would then put down the work as follows: p t t t t f f f f
p' t t f f t t f f
p" t f t f t f t f
BBpp'BBpp'fBp'ptf t t t t t t t t
Thus we have a tautology. Independence of axioms and rules may also be shown by means of models. We find a model J(' for the formal system <£' obtained from the formal system jSf by omitting one axiom, axiom-scheme or rule and such that the omitted axiom or axiom-scheme fails to be valid in the model J(' or the omitted rule fails to preserve JS?'-validity. For instance the following three axiom-schemes and Modus Ponens give a Propositional Calculus 0*x equivalent to ^ c , / is a constant.
2.15 Truth-tables
61
(i) dt
(ii) a
3C6S'C(i
(iii) G( Consider the following truth-tables for the connective M.P. Cpp'
v'
V t t t
g
t
t t
f
f
t
g g g
g
t t
f
f
g
g
f
t
t
f f f
t
(i)
Cpp'
(ii) Cpp'
c. (iii) Cpp'
t
t
t
f f f
g
g
f
f
g
f
t t
g t t t t
t t t
t t
t t t
The constant/ is undesignated. The heading of the various columns denotes that the corresponding rule or axiom-scheme fails for that truth-table, but all the other rules and axiom-schemes hold for that truth-table. In this model we have three elements, t is designated,/and g are undesignated. We leave the checking of this table as an exercise for the reader.
2.16 Boolean Algebra A Boolean Algebra is a formal system with the symbols: nooi a
type I Oii Oii
0 1
i
u n
iii
i
iii
ii
( )
name
variable for an element equality inequality null element unit element union intersection complement generating symbol left parenthesis right parenthesis
62
Ch. 2 Propositional calculi
The axioms are given by the following schemes: We have written (a U /?) for ((U a)/?), etc. a = a 0+ 1 0=1
1=0 au 1 = 1 au 0 = a
a no = o a ni = a
ft cz ftl II >-
na = o a = a
ft cz ft II ft
a na = a (a f) /?) = a II/?
(a u B) = a n B
a n (/? n 7) = (a n /?) n 7
a u (/? U 7) = (a u /?) U y
a n (/? U 7) = (a n /?) U (a n 7)
a U (/? n 7) = (oc U /?) n (a U 7)
Note the duality. We have neglected independence. We have written equalities in the customary manner, a, /?, 7 stand for arbitrary elements. We have omitted parentheses wholesale. The rules are:
We could add the symbols and rules of the Propositional Calculus and so get compound statements, but this is unnecessary. Note that if we replace n by K u by D
—
0 1 = *
by by by by by
N
p&Np py Np B NB
and the variables for elements by propositional variables, then the axioms become tautologies of 0*c. Again if we replace the elements by variables for subsets of a given set X, then the axioms become axioms for elementary set theory, as regards union, intersection and complementation, 0 becomes the null set and 1 the given set X. The simplest Boolean
2.16 Boolean Algebra
63
Algebra consists of the two elements {0,1}. This is called the two-valued Boolean Algebra and corresponds to the pair {/, t}. PROP.
16. We have
(i) a U (a n ft) = a
a n (a U /?) = a
(ii) (any3)U/? = aU/? (iii) aU^ = a if and only if We have
(a U ^) H/tf = a n/? a n /? = /?.
a (J (a n /?) = (a n 1) U (a n A) = an(lU/?) = an 1 = a.
Again
(oc(]fi)[)j3=(oc[)/i)(](^[j^) = (a U /?) H 1
Lastly if a U /? = a then
an/?=(au/?)n/?
= /? by (i). D6.
a - > / ? for aUyff a<->y5 for (a->/?) n ( y ) a ^ J3 for a n /? = a.
Notice the correspondence with ^ c on replacing -> by O and <-> by S . PROP.
17. We have
If
a ^ /? a^cZ ft ^ a
//
a ^ /? a^cZ J3 ^ y 0^ a
//
then
a = /?.
then a ^ y. a ^ 1.
a ^ /? tf^w a n y ^ y? n y.
64
Ch. 2 Propositional calculi
//
a ^ /? then afl/J^a
a u y ^ /? U y. a < all/?
a < /? i / cmd owfo/ ^/ fi ^&>
a < /? if and only if a ->/? = 1 a = /? if and only if a <-> /? = 1. These are very easy and are left as exercises for the reader. 2.17 Normal forms Using the commutative, associative, distributive, de Morgan laws and double negation we can express any J^-statement in an equivalent form, either as a conjunction of disjunctions of propositional variables and negated propositional variables or as a disjunction of conjunctions of propositional variables and negated propositional variables. The first case is called the conjunctive normal form (c.n.f.), and the second case is called the disjunctive normal form (d.n.f.). We push the negations to the right as far as they will go so that they act only on propositional variables or negated propositional variables then use double negation as long as possible and lastly the distributive laws. It is like multiplying out an algebraic formula. We can further ensure that in the first case each propositional variable which occurs in the original J^-statement occurs negated or unnegated in each disjunctand in the c.n.f. and dually in the second case in each conjunctand in the d.n.f. This is achieved by disjuncting KpNp in the first case and conjuncting DpNp in the second case, if p is a relevant variable. Clearly we are left with an equivalent J^-statement in each case. The N. and S.C. that an J^-statement be a tautology is that each disjunctand of its c.n.f. contain a variable and the same variable negated. Dually the N. and S.C. that an J^-statement be refutable is that each conjunction of its d.n.f. contain a variable and the same variable negated. We can further ensure that in the c.n.f. each disjunction contains each variable exactly once either negated or unnegated, unless the c.n.f. is a tautology. For we can omit any disjunction which contains a variable and the same variable negated and obtain an equivalent J^-statement. If the original ^r-statement was a tautology by this removal we would finally remove all the disjunctands. Dually for the d.n.f.
Historical remarks to Chapter 2
65
H I S T O R I C A L R E M A R K S TO C H A P T E R 2
Propositional calculi have a long history. Aristotle's syllogistic, when expressed in modern symbology, amounts to a form of singularly predicate calculus or class theory. This is discussed by Lukasiewicz (1951) in detail. The early history of the classical propositional calculus is given in great detail by Bochenski (1951, 1961), his account covers all the middle ages up to Frege and then on to the present time. Church (1956) gives the history of propositional calculi since Boole. Lewis (1918) gives the history from Leibniz to Schroder in great detail. Other works on the early history of logic are Bochenski (1951) which deals with preAristotlean times, the old Peripatetics, the Stoic-Megara school and the last period after Chrysippus. Another work is Moody (1953) which deals with the Medieval period, and lastly Diirr (1951) which deals with Boethius. The definition we have given for a propositional calculus seems to cover all systems which are usually called propositional calculi. The problem of the independence of symbols, rules, axioms, etc. is of quite modern origin. Huntingdon (1904) and Bernays (1926) were the earliest to discuss these matters. Models for propositional calculi came in with truth-tables. Lukasiewicz (1920, 1941) was the first to formalize a three-valued propositional calculus. Post (1921) also considered many-valued propositional calculi. Independence of axioms, etc. was demonstrated by Bernays (1926) using many-valued models, this is called the matrix method. Extensions of formal systems were studied by Lukasiewicz and Tarski (1930). Modus Ponens first appears in Scholastic Logic. In treating the classical propositional calculus we use the notation of Lukasiewicz (1920). This is easily read in conversational English if one reads as follows: read D(j)f as either (j) or j/r, read N(j) as not <j), read K
66
Ch. 2 Propositional calculi
The form we have taken for the classical propositional calculus is due to Gentzen (1934, 1955) (see also Anderson & Johnson (1962) and J. Dorp (1962)), who used the terms 'remodelling scheme' and 'building scheme'. The type of proof used in Prop. 2 is due to Gentzen (1934). Prop. 3 was pointed out to me by A. H. Lachlan. The direct method has been further developed by Schutte (1950, 1960). The de Morgan laws were stated by de Morgan (1867), but were known long before that, and Prop. 5 probably first appeared in connection with Boolean Algebras. The deduction theorem first appears in Herbrand (1930) and has been much used since, notably by Hilbert-Bernays (1934-6) and Church (1956). Tertium non datur or the law of the excluded middle goes back a long way and was first called in question by Brouwer (1908) in the case of infinite classes. He maintained that rules that apply to finite classes might fail for infinite classes. Thus he invented Intuitionism, a form of mathematics which does not accept T.N.D. It gives rise to a propositional calculus. But it is far more complicated than the classical propositional calculus, and so is his form of analysis, much more complicated than classical analysis (see Heyting (1934, 1955, 1956) and Kleene-Vesley (1965)). But his methods go some way to clarifying ones ideas about effectiveness, finiteness, constructiveness, etc. which are conditions that Hilbert (1904) insisted should apply to metamathematical demonstrations. Prop. 7, the elimination of Modus Ponens, is Gentzen'sHauptsatz (1955). We thought that since it can be eliminated then why have it at all. The word' syzygy' is due to the algebraist Sylvester who used it in connection with non-linear relations between algebraic invariants, it means a yoking together. Many proofs of the decision problem for the classical propositional calculus have been given, notably by Church (1956), Kalmar (1935), etc. A correct truth-table for implication was given by Philo of Megara about 300 B.C. Truth-tables were informally used by Frege in special cases, six years later Peirce stated them as a general decision method for the classical propositional calculus. Much of the recent development is due to Lukasiewicz and Post. The term 'tautology' is due to Wittgenstein (1922). D4, 5 are due to Sheffer (1913). The modern treatment of propositional calculi stems from Boole and
Historical remarks to Chapter 2
67
de Morgan in 1847. This was the algebra of logic. MacColl (1877) was probably the first to deal with a true propositional calculus. Frege gave the first formulation of the classical propositional calculus as a formal system in its own right. But his work was for long neglected and so the propositional calculus developed in the older form as in the work of Peirce, Schroder and Peano. Whitehead and Russell appreciated the work of Frege and gave the classical propositional calculus a formulation with negation and implication as primitives, Modus Ponens and substitution as rules. But substitution was not explicitly mentioned, though this omission was noted later. One of their axioms was found to be redundant by Bernays (1926). Nicod (1916) found a formulation of the classical propositional calculus with only one connective and only one axiom and only one rule apart from substitution. Since then a variety of formulations have been discovered. One by Hilbert (Hilbert-Bernays (1934-6) vol. 1) has 12 axioms in 4 groups of 3 axioms each, this was designed to separate out the roles of the connectives N, C, K, D. If in this formulation we omit one axiom then we get a formulation of the intuitional propositional calculus. Between the two world wars the Poles were very active in reasearch on the propositional calculus, see Jordan (1945), Storrs McCall (1969) and H.Skolimowski (1969). Another study is to formalize a partial system of the classical propositional calculus which has only the implication sign, and the object is to find axioms so that exactly all tautologies which only contain the implication sign and propositional variables are theorems of the system. The chief interest of such studies is to find a formulation of the classical propositional calculus which is an extension of this partial system. For instance CCCpp'p"CCp"pCpfffp, with Modus Ponens and substitution is an elegant formulation of the implicational propositional calculus. If to this we add the axiom Cfp, where / is a constant, we get a formulation of the classical propositional calculus. Implication in the classical propositional calculus allows as true an implication with a false antecedent, this seems in some sense unnatural. This has given rise at the hands of Lewis (1918, 1920, 1932) of various other propositional calculi designed to rectify this. He also considered other connectives such as 'possible' and 'necessary'. These give rise to what are called modal logics. We do not discuss them in this book. Much 3-2
68
Ch. 2 Propositional calculi
work has been done on these systems, finding decision procedures, formulations, etc. Gentzen (1955) used Sequenzen, that is figures of the form
where we have used# and q as variables. This behaves in the same way as CK ...Kp'p" ...jP>D ...Dq'q" -.tf*We have written the sequenzen as p' p"...p{v) qfq"-..q^ but have only used it with one lower formula. Gentzen allows the cases when either upper or lower formula may be void. The idea of using axiom schemes is due to v. Neumann (1925), a substitution rule is then unnecessary. Prop. 8, the substitutivity of equivalent statements, is due to Post (1921). Conjunctive and disjunctive normal forms derive from Boolean Algebra. A large number of examples on the propositional calculus are found in Church (1956). Boolean Algebra was invented by George Boole (1847, 1854, 1916). He noticed the resemblance between the behaviour of K and D and + and x in arithmetic. The history of modern symbolic logic can be traced back to Boole. More will be said about Boolean Algebra at the end of Chapters 3 and 12, there we shall consider Boolean-valued settheory as an extension of the classical two-valued set-theory. Sheffer (1913) gave a set of independent axioms for Boolean Algebra.
EXAMPLES 2
1. Complete the demonstration of Prop. 5 for the cases other than 4*. 2. Complete the demonstration of Prop. 8, Cor. (ii) for the first and third cases. 3. Obtain ^-proofs for BBpp'Bp'p, CCpp'CNp'Np. 4. Give a ^-proof for Cpp. 5. State and prove the Deduction Theorem in £PV
Examples 2
69
6. Obtain a ^ - s t a t e m e n t ^ which has the following table: P
p'
Ptf
t t t t
t t
f
Jt
t
t
f
t
st
f
f
f
f
J f f f
f f
t t
f J
t
t
t
7. Define D, B, K in terms of C, N. Define C, D, B in terms of K, N. Define C, D, B, K, N in terms of S and in terms of S'. 8. Show that SSpSqrSpSSrpSSsqSSpsSps is a tautology, where p, q, r, and s are of type o. 9. A Propositional calculus ^ 2 n a s ^ n e &xiom schemes: (i) (ii) (iii) And the rule Modus Ponens. State and prove the deduction Theorem for ^ 2 . 10. Prove the following theorems of ^ 2 : CNpCpp',
CNNpp,
CpNNp,
CCpp'CNpNp'.
11. Check the table giving the independences of the axioms of 3PX. 12. Give definitions of C and / of SPX in terms of D and N of &c, and give definitions of D and JV of £PC in terms of G and fo{^v 13. Show that theorems of 0*x translate into theorems of ^ c by the definitions found in Ex. 12 but that there are theorems of £PC which are unobtainable in this way, show also that there are theorems of ^ which fail to be translations of theorems of ^ c . 14. Show that the axioms of SPC are theorems of ^ and that the rules of 8PC are derived rules of SPX. 15. Prove BNBNpp'Bpp' in 0>c.
16. Show that if- and t then 4 ^ and f
is a ^-theorem.
70
Ch. 2 Propositional calculi
18. Define t for Cff in the system 8PV Define K of &c in terms of G and / of ^ . Write out DKptNp without definitional abbreviation in terms of C and/of ^ . 19. Write $ = ijr if and only if B^xjr is a ^-theorem. Show that the result is a Boolean Algebra when U 0 are suitably defined for Restatements. 20. Suppose that a Boolean Algebra 88 has an additional functor * of type u with the axiom schemes: a ^ a*, a** = a*,
0* = 0.
Show that the elements of 88 which satisfy a** = a form a Boolean subalgebra «^* of ^ . When the functions in 88* are defined as follows: write a®
for
a**,
then
a&fi
for
a n /?,
a^jS
for
(all/?)®,
a
for
a®.
(If the elements of 88 are sets of points in a topological space then the specified sets are the regular open sets-open sets which have no pin holes or cracks.) 21. Define for ( a n ^ a x /? for a n /?. Show that with these definitions every Boolean Algebra becomes a Boolean Ring, i.e. a ring in which = 0, if
a + /? = 0 then
a = /?,
Examples 2
71
Conversely show that with the definitions all/? for
a + /3+(axjS),
an/?
for
a x /?,
a
for
1 + a,
every Boolean Ring becomes a Boolean Algebra. In both cases the algebraic zero and unit coincide with the Boolean zero and unit respectively.
Chapter 3 Predicate calculi
3.1 Definition of a predicate calculus A predicate or functional calculus of the first order is a formal system J5" which is a primary extension of a propositional calculus SP, it is obtained from a propositional calculus by adding additional symbols for variables or constants of type 1, calledindividual variables or constants respectively, and adding variables or constants of some or all the types 01, oil, out,..., called predicate variables or constant predicates respectively; there may also be variables or constants of types u, u,..., called variables for functions or constant functions respectively, and there may also be constants of some of the types 0(01), 0(011), ..., (ot) 0(01) (on), ... called quantifiers There must be individual variables given by a scheme of generation, and there must be some predicates. If quantifiers are present then the abstraction symbol is required in order to provide arguments of types ot, oil,... for the quantifiers. If propositional variables and constants are excluded the resulting system (which fails to be a primary extension of a propositional calculus) is also called a predictate calculus. If quantifiers are absent the resulting system is called a free variable predicate calculus of the first order with or without functions as the case may be. If individual variables are discarded so that all well-formed formulae of type 1 are constants then quantifiers are useless and the resulting system reduces to a propositional calculus when propositional variables and constants are written in the form
3.1 Definition of a predicate calculus
73
have a diadic predicate calculus of the first order, and so on. Quantifiers of type O(OL) are called simple quantifiers, quantifiers of other types are called compound quantifiers. A predicate or functional calculus of the second order, J^2, is a primary extension of a predicate calculus of the first order obtained by adding quantifiers (simple or compound) for predicates. Simple predicate quantifiers are of types: read' for at least one',' for an unbounded set', 'for all' etc. 0(00),
O(O(OL)),
0(0(011)),...
generally o(oa), where a is a type of a predicate or of a propositional variable, and compound quantifiers are of types: read 'for all pairs', 'for all except a bounded set of pairs', etc. 0(000),
0(00(01)),
0(0(01)0),
0(0(01) (01)),
0(00(011)),
...
generally o(ootfi),o(ooc/3y),..., where a,fl,y,... are types of predicates or of propositional variables. A predicate or functional calculus of the third order, ^ 3 , is a primary extension of a predicate calculus of the second order obtained by adding variables or constants of types 00,
o(ot),
O(OU),
...
generally ooc where a is the type of a predicate or of a propositional variable, and oa/3 where a, J3 are types of predicates or of propositional variables, ..., these are called predicates of predicates, we could also add some mixed predicates requiring for some arguments predicates and for others requiring individuals, these would have types: 001,
010, 0(01) 1,
01(01),
...
and generally oou,otot, ofiyt,..., where a,/?,7 are types of predicates or propositional variables. A predicate or functional calculus of the fourth order, ^"4, is a primary extension of a predicate calculus of the third order obtained by adding quantifiers of various kinds over predicates of predicates or over mixed predicates. And so on. A predicate calculus of the first order without constant individuals or constant functions or constant predicates or constant propositions is called a pure predicate or functional calculus of the first order with or without functions as the case may be. A pure predicate calculus of the
74
Ch. 3 Predicate calculi
second order is defined similarly. A predicate calculus of the third or higher order is called pure if it is an extension of a pure predicate calculus of one lower order and if only variables (for calculi of odd order) or quantifiers (for calculi of even order) are adjoined. For instance a pure calculus of the third order may have variables and constants of type oo, O(OL), because there may be constants of these types in the pure predicate calculus of the second order of which it is a primary extension, but in the extension only variables of these types are adjoined. A predicate calculus of the first order which is without propositional variables and predicate variables is an applied predicate or functional calculus of the first order, with or without functions as the case may be. Similarly a predicate calculus of odd order is applied if it is obtained from a predicate calculus of one order less by adding only constants and is mixed if both constants and variables are added. A predicate calculus of even order with some constant predicates is mixed, it must have variable predicates of appropriate types. A formal system £P is based on a pure predicate calculus of the first order
J^ if the constants of ZF are constants ofSP and if ^ has symbols of each type for which there are variables in SF and has individual variables (we may use the same symbols for individual variables in SP and in 3F) and if whenever
3.1 Definition of a predicate calculus
75
types from among, i\ i",... and possibly individual constants of some of these types, and possibly functions with values and arguments from these types. The order of a many-sorted predicate calculus is defined as for a one-sorted predicate calculus. A many-sorted predicate calculus IF' is based on a pure one-sorted predicate calculus IF when the constants of J^ are constants of IF' and when the axioms and rules of J^ apply to each sort of individual variable in IF'. We shall show later that it is possible to reduce a many-sorted predicate calculus IF' of the first order to a one-sorted predicate calculus IF of the first order by adjoining to IF some constant one-argument predicates. Each such predicate plays the part of saying that its argument is of a certain sort, distinct predicates referring to distinct sorts. Similarly we could consider many sorts of predicates in predicate calculi of higher orders. A predicate calculus is a formal system hence it must be constructive in accordance with the definition of a formal system which was given in Ch. 1. If two predicate calculi have variables of the same types then by a trivial adjustment of notation we may use the same symbols for these variables in both systems. A predicate calculus IF' is weaker than a predicate calculus IF under the following circumstances: (i) IF' is without variables of type oc if IF is without variables of type a. (ii) The constants of IF' can be defined in terms of the constants of IF. (iii) An J^'-theorem
76 3.2
Ch. 3 Predicate calculi Models
A model Jfofa, predicate calculus of the first order &> with only simple quantifiers is a formal system which satisfies the following conditions: (i) Jl has exactly the same constants as !F of types other than i, ii, iii, . . . , O, Oi, OLi,
(ii) Jl is without variables but has constants of types 1 and o and of any of the types a, m, ...,oi,ou,... which occur in 3F. The ^-constants of type 0 are designated or undesignated, at least one is designated and at least one is undesignated, the ^-constants of type o are called elements, the constants of type 1 are called individuals, the constants of types u, in,... are called functions and the constants of types oi, on,... are called predicates. (iii) Jl is without axioms. (iv) The rules of t e n a b l e us to replace any ^-statement by a unique element, called the Jf-value of the ^-statement. (v) If ®{
(
3.2 Models
77
For instance if O ^ X T / . Y ! £,?/}} is an ^'-statement whose sole free variable is £, then J( is to contain a constant of type 01, 3 such that Sa has the same J(-value as 0{oc,A7j.x¥{a,/)]}} for each ^-individual a, where again there is to be an ^-constant H' of type oi such that Y{a, /?} has the same J(-value as S'yff for each ^-individual /?. S' will in general vary as a varies. If J^ contains compound quantifiers then clauses (vi) c, (v) require amendment. Suppose J^ contains a compound quantifier of type o(ott) then if X££'.
78
Ch. 3 Predicate calculi
3.4 The classical predicate calculus of thefirstorder A pure predicate calculus of the first order is classical if it is equivalent to the following predicate calculus of the first order, SFC. Symbols: those of £PC together with: type X
name
I
Voc >Pou>~-
01,011,
E A
o(pi)
individual variable predicate variables existential quantifier abstraction symbol
further variables are obtained by superscripting primes, as for propositional variables. Axioms DnNn, where n is a propositional variable, DTT^NTT^, where n is a one-place predicate and £ is an individual variable, etc., for two or more place predicates. Rules Those for £PC together with: Remodelling scheme \b DDtfxfxi) cancellation, D0G) Building schemes II d D(f>{y} o) existential dilution, DE(kt;. ${£>}) o) £ fails to occur free in ^{I\}, TJ free in lie
DN(p{7j}o) generalization, DNE(k£. {£)) (o, where TJ fails to occur free in OJ, 0{I\} and fails to occur free in Here £, TJ denote individual variables, (f>{r)}, 0) denote ^"^-statements, o) is a subsidiary formula and may be absent, (f>, (j>{7)), N
D7.
(mm ') for
for
E{u-m)-
#(X£.#(X£\$H£, £'}))>
etc.
3.4 The classical predicate calculus of the first order
79
Note that an #~c-proof is direct, the only formulae that can be omitted are duplicates as in 1b. This rule causes the undecidability of 3FC, as we shall see later. 3-5 Properties of the system ^c 1. The schemes la, b, 116, c, e are reversible. In Prop. 2, Ch. 2 we showed the reversibility of la, 116, c for the system 8?c. Similar demonstrations hold for J r c . 16 is reversible II a, l a . PROP.
l i e is reversible. Suppose we have an J^-proof of In this J^-proof corresponding occurrences of N(EE)) ^{£} can only be introduced at II a, e. If a corresponding occurrence of N(E£)(j){£) is introduced at I I a then introduce N
would become Dn^NnTj which fails to be an J^-theorem. This completes the demonstration of the proposition.
80
Ch. 3 Predicate calculi
2. D$N
* *—ii-LI
n e, conditions on variables are satisfied,
Thus the result holds for (E£) i/r{g} if it holds for i/r{£,}. This completes the demonstration of the proposition. Thus Tertium non datur holds in the system !FC. P E O P . 3 . The Deduction Theorem holds in 3^G with the following restriction. If <j>', ...,(f)(s®\-jr ijr, where l i e fails to be applied to any variable which occurs in ftSd\ then f , . . . , ^f^^JDN^f. When lie fails to be applied to any variable in ^ ^ we say that the variables in
f
for
9
""^9
in J V
The demonstration is similar to that of Prop. 6, Ch. 2. We have just shown that (i) (tertium non datur) holds in 3FC, (ii), (iii) follows exactly as in Prop. 6, Ch. 2 and similarly for (iv) except that if the rule used is I I e>
y DN
..., D provided that the variables in D^SK^CJ are held constant.
3.5 Properties of the system
81
We have DND^SK)o)Di/ro) from Prop. 3 whence the result by permutation and cancellation.
UXi6) for x'>
0=1
D8.
SK
n
3= 1
6= 1
d)
tfthenDN] 6
COR. (ii). If<j>\ ...,
X{6)X{SK)-
fr, where (/> is the closure of (j).
1
We proceed as in Prop. 3 except that if lie is used, say — lie, where a X, variable £ free in an hypothesis $ is generalized then we replace it by
lie, condition on variables is satisfied lie. In this way from $\ ...,
lid
Now repeat for
D9.
for
SK
x', K
for 6 =I
X^6)X(SK).
COR. (iii). / / <$> is quantifier-free and is in conjunctive normal form, so that K
(f> is of the form J\ ft6) where ftd) is a disjunction of atomic statements or 6= 1
negations of atomic statements and if C(jx]r is an &c-theorem then We may suppose that in the J^-proof of C^
the free variables in (j>
82
Ch. 3 Predicate calculi
are held constant. Because if one of them, say £, was generalized then this generalization must occur in the J^-proof of C
K
(T{d)
Let ^ be n S 0(*'*°, this distributed becomes £ n fte>r{6)\ call it 0*, 6 = ld' = l
0= 1
r
where 1 ^ r{6) ^ o-{6}, and the summation is over all such r. From 'if N6 DNdilr =?= then , , ' and DN6\jr we obtain DN £ II (j>{e>rmf, whence by ivp* DN(j)*yf T e=i K
the reversibility of 116 repeatedly we obtain DN ]J ^e>T^i/r, for each r. All these J^-proofs are obtained from one tree by replacing part disK
K
junctions of S II
we have })
K
D 2 ^ ^ ' 0=1
6=1 T {
^
for each r. Now consider the places where
enters the J^-proof of D 2 Nftd'T{6})i/r. It will do so either at an 0=1
J^-axiom (T.N.D.) or at IIa or at He only. If it enters by an J^-axiom T{0})^(0,T{0}) replace t h i s b y j]AT?(d 7{e\)M6 TW}\^La' T h i s c o n v e r t s t h e
J^-proof of D S Nfte>TW>ft into an ^-deduction of D S Nftd>T^ft from 0=1
0=1
certain hypotheses <j^e*r^. Once Nfte>r{d)) has entered this deduction it thereafter remains in the subsidiary formulae of building rules because it fails to be governed by a quantifier or by ND or by N in the theorem. We are only considering occurrences of Nft0>T{6}) which correspond to the occurrence of Nfte>T{6}) in ^Nft0'7^. 1
If Nftd>r{d)) enters by l i e then
0=1
is NN(f>(d>T{d}\ where
consider the places where corresponding occurrences of <j>(e>rW> enter the deduction, 00»T0» is an atomic statement so it enters by II a or by (T.N.D.) only. If it enters by (T.N.D.) replace this by
Ila.
3.5 Properties of the system !FC
83
These two cases altogether leave us with an ^-deduction of D 6=1
from hypotheses (jfl>7W\ 1 < 6 < K. In this deduction omit all occurrences of N(jP'r{6)) which correspond to the occurrences of Nft6'7^ we have been considering, so that Nftd>7{6}) fails to be introduced by IIa. We are left with an J^-deduction of ^ from hypotheses 0<^T0», 1 < 6 ^ K, because we have only altered the subsidiary formulae of building rules by omitting from the upper and lower formulae of a rule the same disjunctands and this converts an application of a rule into another application of the same rule, also the parts of upper and lower formulae of remodelling rules have been altered in the same way. These cancellations fail to effect i/r because an occurrence of Nfte>T{d}) in ^N^d'T{6}) fails 6=1
to correspond to any occurrence of Nfte>r{e)) in ^ . The final result when repetitions have been removed, is an ^'-deduction of xjr from hypotheses fto,T{0})i i ^ Q ^ K, this holds for each r we require. an
w
LEMMA. If o),x\ • • • > X^&cfr ^ '> X' • • • > A ^ h ^ ^ where the variables (o, ojr are held constant, then DGJG)', X, • • • > ^ ( l c ) hjr 0 ^. We have Do)G)',x', ...,X{K)\~^CD^O)', by replacing all occurrences of co which correspond to the occurrence of OJ in the first hypothesis by DGJG)'
in
and then putting o)' into the subsidiary formulae, by remodelling, of any rules used: the conditions on variables are satisfied because the variables in co' are held constant. Again we have Di/rco', x'> • • • > y^&JDxlriJr, by replacing o)r by Di/raj' as before in the second hypothesis and then putting ^ into the subsidiary formulae, the condition on variables is satisfied because the free variables in i/r fail to be quantified. Thus we have: (K) D u s i n Ib w e DUG)',/, ...,X ^c ft«>'-> Di/r(o',x'> —itf^rflWr* § §et t h e required result. Returning to Cor. (iii) we have ^ (1>T(1)) ,..., ^(ACjTW)hjr ^ for each r. K
(7(6)
Now the hypotheses arise from distributing n S ^e>e)
so
that we shall
0 = 16' = l
have sets of hypotheses which differ only in their first members; we have already noticed that we may assume that the variables in the hypotheses are kept constant so that we may apply the lemma repeatedly (T(l)
and obtain 2
C
result when the hypotheses ft2>TW for all possible r are the same,
84
Ch. 3 Predicate calculi o-(i)
whencebythelemmarepeatedlyweget £ 0 (MO , S Continuing in this manner we finally arrive at 0', ...j^H-jr^, as desired. C O R . (iv). / / ^ is £Ae closure of
a
^ d where
6= 1
fte) is a disjunction of atomic statements or negations of atomic statements, and if C^xjr is an &c-theorem then $',..., <^K>>V&C ft-
Consider the proof-tree ofC^i/r, if lid is applied to a variable in <j) and if this variable is held constant in \[r or is absent from i/r then omit that application of lid, the result is an J^-proof-tree of C$*^r where $* differs from (j) by omission of generalizations. Every other variable in (f> is first restricted in ^ and later on generalized in i/r. Variables in (j) fail to get generalized. Now consider the portion of the J^-proof-tree of C
—-—"K b/ —>—>-— I I c , e t) for i]',...,r D(Et)) (Eg) N<j>DN{Eri) Nf'o) replace the first piece by <J>',...,
as before but without I I d on N>,
fr"
ft
as before but without lid on
3.6 Modus Ponens 4. Modus Ponens is a derived We have to show
PROP.
Given the J^>-proofs of the upper formulae we have to show how to obtain an J^-proof of the lower formula. The demonstration is by formula induction on the cut formula $. With one of a), x non-null.
3.6 Modus Ponens
85
(a)
D 2 N(j>x (this is to account for all the cancellations of N
formula immediately above D 2 N
mula. We have already shown that the result holds if the right upper forSK
mula is an axiom. If 2 N<j> is in the subsidiary formula of a rule and if the 6=1
result holds for the upper formula or formulae of that rule then it follows at once that it holds for the lower formula of that rule. Thus if we have SK D S N<j>X' 6=1 SK
then we have SK
Daxf> D 2 =r—j^^1 £
* by induction hypothesis
deduction as before,
and similarly for a two-premiss rule. SK
If the whole or part of 2 N<j) is in the main formula of a building rule 6=1 SK
then this building rule can only be I I a since <j> is atomic. By la D 2 Ncfrx 6=1
could become DNifriJr, which since ^ is atomic is different from any of the forms of the lower formulae of building rules other than IIa. Or, by l a SK
D 2 N
an(
SK
*S ^
occurs in i/r,
6=1
this could be of the form of the lower formula of the building rules SK
IIa, 6, c,d,e, but then 2 N
contrary to the case considered. Thus the rule can only be II a. In this
86
Ch. 3 Predicate calculi SK
case the formula immediately above D 2 Ntfix is either x i*1 which case 6= 1
the result is trivial or is of the form we are considering. Thus the only SK
non-trivial case is when 2 N(f>x is in the subsidiary formula. By our 0= 1
supposition 16 with N<j> as main formula fails to be the rule immediately SK
SK
above D 2 N<j>X* If the rule is II a with part or all of 2 N6 in the main 6= 1
formula then we have
6=1
f
^
N,
,
6= 1 6=1
where there may be more Nfts in the lower formula so that we have V
diluted with a formula of the form D 2 N<j>x" (x" m a y be null) or a per6=1
mutation of this, and if the result holds for the upper formula then: Dco
^
Dux' then by dilution we easily get DGJX* This completes this case. It is impossible for x! to be null, otherwise 2iV0 would arise from an axiom, which is absurd. (b) <j) is D(f)f<})" and the result holds for $' and $". We have ^-proofs of DOJDC})'(j)" and DND(j)'(j)"x,fromthe reversibility of II b we then have ^Qproofs ofDN
1 DDGJ6'6"
* by induction hypothesis, la, 6. (c) (f> is N
3.6 Modus Ponens
If a) is absent we have
87
N(j)' 16
X (d)
Do)f{7j}
K
D N < f >w ' { 7 ) } X , . , . . , . . . - * by induction hypothesis;
we wish to demonstrate
It suffices to demonstrate
because from the reversibility of l i e we can obtain an ^>-proof of DN(j)'{rj}x for any variable r\ which fails to occur free in DN
If 2 (Eg) $'{£} fails to occur in the left upper formula then the result 0=1
follows at once by dilution. Otherwise we use theorem induction on the left upper formula. If the left upper formula is an axiom then (Eg) >'{£} is absent. If the left upper formula is the lower formula of any building rule other than the introduction of (Eg)
same rule because D 2 (Eg) $'{£} must be in the subsidiary formula 6=1
of that rule. We had an almost similar situation under (a). But if the rule is an introduction of (Eg)
bymductxonhypothesis, by formula induction,
From the demonstration of Prop. 1 we see that the variable 7} in DN
88
Ch. 3 Predicate calculi
variable £ which is restricted by lid! in the upper left formula might be any variable. If we change all free occurrences of £ in the J^-proof of the upper left formula to a new variable then we obtain an J^-proof of a formula which differs from the upper left formula in that now £ is a new variable, but o) and {EE,) <j)'{£] may have suffered a change of variable to this new one. In this case we should end up with DGJ'X, where co' differs from o) in that one free variable in o) has been changed to a new variable (which fails to occur in o)). By change of variable in the J^-proof of Dcox we can get back to Dcox- This completes the demonstration of the proposition. Note that we have given an effective method for eliminating the cut. This consists in taking a highest cut in the proof-tree and either eliminating it outright or replacing it by a cut higher up the proof-tree or by a cut or cuts with simpler cut formulae. Thus applying the process a highest cut ultimately gets replaced by cuts with atomic cut formulae and these can be made to disappear altogether. 3.7 Regularity PROP.
5. !FC is regular.
We have to show
— , /f , ,, *.
We show from this the result and
B6xlr y{6\ r ;C *
follow, the first by the reversibility of 116' and the last follows easily by Modus Ponens which can be eliminated. We proceed by formula induction on x{n}' The cases when ^{77-} is n or is Nx'{rr} or is DX'{TT}x'i71} are dealt with as in Cor. (i), Prop. 8, Ch. 2. If x{n} is (E£) x{n> £} a n ( i ^ n e result holds for ^{TT, £} then we have
whence by IId, la we get
, £} (Eg) xtt,
3.7 Regularity
89
and by lie DN (Eg) X{>, £} (Eg) X{f, £} DN(Eg) X{f, Z) (Eg) X& as desired. The variable £ can occur in (j) or in i/r or in both. CoR.(i).
DB^o) Dx{
We proceed by formula induction on xi17}- The details are left to the reader. D10 (^ §)#{£} for N(Eg)N${Q. A is called the universal quantifier. PROP.
6. 1-5 and l*-5* and 6 of Prop. 5, Ch. 2 hold in ^c, also
7. B(Eg)Df{QW!£)t{&fc
7
*-
8. B(^g)D^{g}^2)(ilg)^{g}^; 8.* distributivity of quantifiers. In 7-8* incl. the variable E, fails to occur free in i/r or 0{I\}. 9. BN{E£)
\r\lfn
9 where £ fails to occur free in co or free in
' is a derived rule in ^c. The rule l i d ' is reversible. For 7-9* consider 7. We have from Prop. 2 T TT, la, Ha, DND
DftQtDftQ^
on d a t u r r e v e r s i b m t y
1 a, 11 a, in
DNf{E£) Dm f the result now follows by life and definition of B.
90
Ch. 3 Predicate calculi
7*, 8, 8* follow similarly and are left as exercises to the reader. 9. We have DN(Eg) N(j>{^} (Eg) N
DN(E£)N>{£}(E£)N
O>
Tertiumnondatur, ° b
the result now follows by 116' and the definition of B. 9* follows similarly and is left to the reader. 10, 10* are trivial. 3.8 The system ^"c The system 8F"Q is like the system !FC except that we add the symbol A of type 0(01) instead of E and replace the building rules II d, e by I I d' D(fi{y} (o rj is absent from 0) and <j) {I\}, D(A £) (j){£) o) £ is absent from
DN(J){rj] OJ
g is absent from
we then replace D 10 by:
D10'
(mm
for
The systems !FC and tF"G are equivalent via the definitions D 10 and D10'. A convenient system which is equivalent to SFC and to 3f"c is the system IF'Q where we use the building rules lid, l i d ' and one of the definitions D 10, D10'. All our results so far hold for &"c. 7. / / §5 is an ^c-iheorem then so is N<j>, where (j> is obtained from
o b t a i n B
COR. (i). If= without use of l i e then ——.
3.8 The system &"c
We verify that if % is a rule other than He then -~
91
is a derived rule.
This fails for He. P R O P . 8.
Consider the first, we have Tertium non datur
Ila, la
IIdIa
Again DN
Tertium non datur
Ila,la
TT,T 11 a, la
DN(E£) Pm
TTp
the result follows from these by 116'. The remainder follow in a similar manner and are left to the reader. 3.9 Prenex normal forms An e^-statement is said to be in prenex normal form if it is of the form (Qic)x{%}y where (Qj) is a sequence of quantifiers, existential or universal or both, and x{%} is an J^-statement void of quantifiers. (Q%) is called the prefix and x{%} is called the matrix. By repeated application of 7-9* incl. of Prop. 6 and Cor. (i) of Prop. 5 we see that each ^,-statement
92
Ch. 3 Predicate calculi
closed J^-statement in prenex normal form with prefix (Qj) and matrix X{ic}. Let y. be the sequence of distinct variables £',..., Qe) in t h a t order and {Q%) a sequence of quantifiers on these variables in the same order. A variable QK\ 1 ^ K ^ 0 is called general if (AE,M) occurs in (Qj) and is called restricted if (#£<*>) occurs in (Qj). I f l ^ y < / c < # then £<"> is called superior to QK\ and £(/c) inferior to £(y).
An ^-statement ^ without bound variables is said to be tautologous under the following circumstances: ^ is built up from other ^ - s t a t e ments joined together by N and D; we replace each of these part statements b y / or by t, but make the same replacement at each occurrence of a variant, we then have a formula built up from/and t (regarded as of type o) by N and D, we then calculate the value of the statement as follows: replace Nf by t, Nt by/; Dffbyf, Dtf, Dft, Dtt by t. The final result is either t or/. If the final result is always t however we make the initial replacements of the part statements then the ^-statement is said to be tautologous. It is easily seen from Ch. 2 that a ^-statement is a ^-theorem if and only if it is tautologous. An J^-statement
K
We shall find that the J^-statement 0 is a disjunction 2 fr{d\ where 01
3.9 Prenex normal forms
93
the disjunctands are of the same logical structure but merely differ by choice of individual variables, they are variants of a common form. l a is used to bring one of the disjunctands \Jrf, ..., i/r^ to the left when we apply lid or lid' to it. 16 is used to discard duplicates as they occur. The final ^ - t h e o r e m is of the form (#'£')... (Qf*>gM) jjr{£',..., £<*>} where each QM is either E or A and ^{£',..., £(7r)} differs from any of the disjunctands ft',..., i/rM merely by change of individual variables. Suppose we have an ^"^-proof of x m prenex normal form. We first modify applications of II a, if necessary, so that they fail to introduce quantifiers. Suppose that Dx'x" i s introduced by I I a then introduce X' x" o n e after the other, suppose NDx'x" *s introduced at II a then introduce Nx',Nx" separately and apply 116, (this forms two branches), if NNx is introduced then introduce x a n ( i a PPty Hc> suppose that (2?£) x! {£} is introduced at II a then introduce #'{£} instead and apply II d, suppose that {A£) #'{£} is introduced at I I a then introduce x'{v} instead, where 7/ is new, and apply II d'. Repeat this process as long as possible and we shall have used II a only with atomic statements or negations of atomic statements as main formulae. Thus we suppose that the ^>-proof of (Qi) ^{j} only uses II a with main formulae which are atomic formulae or negations of atomic formulae. Note that we do this without using 16. Instead of using the system tF'c we shall use an equivalent system which has the rules IId*, d'* in place of rules lid, d\ where D % >{&*>}
lid'* in lid'* a) is free for £',..., Qd) and so is 0{I\}, and £',..., £(^ are distinct. An ^t-proof will use rules lid*, d'* whenever possible, so it will be without a sequence of applications of II d followed by 16, etc. We now want to show how to modify an J^-proof of a prenex formula so that applications of 16, IId*, d'* come after applications of IIa, 6, c. Clearly the systems tF'c and ^"c are equivalent. We define the rank of an J^-proof of an J^-formula (Qj) ^{j} in prenex normal form, where % stands for £', ...,£(77), as the ordered Sntuplet {v, v',..., v(n)}9 where v is the number of occurrences of applications
94
Ch. 3 Predicate calculi
of rules II a, 6, c beneath applications of rule 16, v* is the number of applications of rules IIa, 6, c beneath rules IId*, d'* which bind a variable standing in the first argument place in ^{r.},..., v^n) is the number of applications of rules Ila, 6, c beneath applications of rules lid*, d'* which bind variables standing in the 77th argument place in ${%}. These are calculated as follows: take for instance rule 16. Mark applications of 16 in the J^-proof o f (Qj) 0{j}, let these be denoted by K\ ...,#<*>, let /i', ...,/^M be respectively the number of applications of rules Ila, 6, c beneath K', ...,K(n\ then /i' +/i"+... +JLC{K) = v, similarly for the other cases. Ranks are ordered lexicographically. We now show how to modify the J^-proof of the prenex ^-formula {Ql)
where ^r is atomic or the negation of an atomic formula. Replace this by
j. a.
DDDjfr
DNi/ra)
DND
3.9 Prenex normal forms
Replace this by DDN
95
116, II a, to introduce N(j) in right upper formula, DNi/rco TTl^ TT . , _T_ . , . . , ^ 1 A l ' ' " +.o introduce ND
Call this case (i). Case (ii) is
Replace by Lj DDNi/rx«> ^— 116, II a to introduce another x i n right
upper formula,
Again we are left with an ^e-proof of (#£) ^6{j} of lesser rank provided we are using a highest case of 16 above 116. The use of rule II a, as already observed, is without any use of 16, so that if we have a highest case of 16 above 116 then the rank has fallen. Suppose we have DD^f 10,
^
He,
DNN<j>(o. Replace this by
DD
Again we are left with an J^-proof of (Q%) <j>{%\ of lower rank if we are dealing with a highest case of 16 above lie. Call this case (i). Case (ii) is DD<j><j> Dijra)
DD
96
Ch. 3 Predicate calculi
Replace this by
DDcjxjyD^o) lie, DD
Again we are left with an J^-proof of (Q%) ^{j} which is of lower rank. Now consider rule lid*. Suppose we have
Replace this by
2)2^{9/} o)
TT
Again we are left with an ^"^-proof of (Q%) ^{j} of lower rank. Rule l i d ' is dealt with similarly, but may require a change of variable. Suppose we have
where o)' differs from co by having NN placed over a disjunctand. The other case where NN is placed over (E£) <£{£} is impossible because the theorem is in prenex normal form. Replace this by
Again we are left with an J^o-proof of (Qj) >{$.} of lower rank. Rule lid'* is dealt with similarly. Suppose we have
) ${§ Nfco
3.9 Prenex normal forms
97
Replace this by
where (E£)${£} is introduced into the branch above the right upper formula in (a) at applications
Nowwehave
JTO^^-DTO^Q^ DD(E£)f{£\N/
where (i?g) 0{£} is in the subsidiary formula. Hence we shall have the same figure with these occurrences of (EE) <£{£} everywhere replaced by
Now add applications of I I a as follows:
Now from (6), (c) and (d) we obtain _
IIa,etc....
r c
J
IIa, etc.
Use this as the right upper part of (6), then finish up as in (b) and we have placed the application of II & above the application of IIeZ*. In doing this we have had to introduce various variants of <£{£}, this is done without using 16 or any applications of II eZ*, d'* that bind variables earlier in the list than the variables we are binding in (a). The effect of this is that in the rank {v,v',..., v^} the first component is unaltered because we have made our alteration without use of 16, and if the variables we are binding is £f®9 then v',..., j/^-1) are unaltered while if® is decreased by one, the other components may be increased. The total result is a reduction in rank.
98
Ch. 3 Predicate calculi
Suppose we have
a*
Replace this by DDX<j>{V}NXG>
DDXttQNfu '
'
'
by the reversibilityof lid'*,
In the reversibility of lid'* we may take the variables £ to be new and distinct from the variables r\. This allows us to apply lie?'*. As in the case of 16 below II d* we have decreased the rank. The reversibility of II cZ'* is performed without use of 16. For completeness we add: Rule lid'* is reversible. By this we mean that if we have an J^'-proof of D(Ag) ^{£} o) then we can find an J^-proof of JD2^{?/} OJ for some 2^{?/}. In the J^-proof-tree of D(A£,)(J){E]G) note the places where corresponding occurrences of (A£,)^{^} are introduced by lid'*. These will be of the form j>s^}ft/ DXtfrf»}<,fi» LEMMA.
J
In the JF^-proof from these places to D(AE>)^>{£)}(0 the part will remain in the subsidiary formulae everywhere. Hence we may replace all these occurrences of (A£) <j){£] by the disjunction of These can enter by IIa, etc., applied to the upper formulae of (/). In this way we obtain an ^"^-proof of instead of one of D(AE,)(f>{^}o). IIa, etc., as before observed, has been done without use of 16, and the only use of lid*, d'* has been on variables later in the list £',..., g(7r) than the variable £, so that the rank of the J^^-proof of (e) has decreased.
3.9 Prenex normal forms
99
To conplete the demonstration of Prop. 9 we make the alterations discussed above starting from the highest available places. Each time the rank is reduced, and as long as the rank is greater than the lowest rank we can always reduce it. This completes the demonstration of the proposition. 3.10
Let
H-disjunctions
(QV)..>W"Vn))n?>-''>^
(!)
be a closed ^^-statement in prenex normal form where the matrix i/r{£', ...,£(7r)} is quantifier-free. Here each Q^ is either A or E. If Q^d) is E then Qe) is called a restricted variable, if Q^e) is A then £(^ is called a general variable. Let there be n' restricted variables in (1) and let there be n" general variables in (1), then n' + n" = n. Form a list of all ordered nrtuplets of natural numbers {*/,..., *>(7r)} ordered by the sum v' + ...+ v^r) and lexicographically for those of equal sum. Take the initial segment consisting of the first K members. Now write down the list
£'
(2)
£(ff) )
b / o • • • ? Z>K
•
J
where the restricted variables in the yth line are x^v'\..., x^n}), {v*',..., v^} being the vih Tr'-tuplet in our list of Tr'-tuplets, and where the general variables in the first line are in order from left to right if £' is general in (1) or
SiT f)
x",..., x^ '
if £' is restricted in (1).
Suppose that exactly the first A restricted variables in line 6\ 6' < 6 are from left to right the same as in line 6, then the general variables are the same from left to right in these two lines up to and including the general variable immediately following the Ath restricted variable. The remaining general variables in line 6 are in order from left to right the next new variables in the alphabetical list x,x',x",... of variables. An example will make this clear. Let (1) be: (Ex') (Ax") (Ex'") (Ex*) (A&) DNpx^x"x^px'x'"x^,
(3)
where p is a three-place predicate. Let K be 12. For greater clarity we write xv instead of x'...' with v superscript primes. 4-2
100
Ch. 3 Predicate calculi
H-scheme of order 12 variables r
g
r
r
x2 ~xx x3 JX±
xx x2
x2
XQ
_x3 x13
x2 _x3
xx x2 _x3
g x3 x±
x8 x15 x5 x9 Xu
x10 ~xx x7 V'X1 _x2 xl± \_x2 xx x12 X,
x
l
xr
xu
line triplet sum line r 1 a? 1 [1, 1, 1] 3 2 [1, 1, 2]i 2 x 5 [1, 2, 1] 4 3 # 11 3 6 12
[2, 1, 1]. [1, 1, 3]" [1, 2, 2] 7 [1, 3, 1] 4 [2, 1, 2] 5 8 [2, 2, 1] 9 [3, 1, 1] [1, 1, 4]" R 10 [1, 2, 3L u
4
x2
g x2 x2 x2 XQ
x2 6 #! x2 7 # x2 5
iCj
8
tf2
9 10 11 12
XQ
x2 XQ #3 x13 #! x2 x. x2
r xY x± x2 xx xx x2 x3 xx x2 xx xx x2
r
xx x2 xx xx x3 x2 x1 x2 x1 x
l
g x3 x4 x5 x7 x8 x9 x10 xxl x12 xu X
15
x3
x1Q
The first twelve triplets have been written down in the prescribed order. In the column headed ' variables' the restricted variables occur in the first, third and fourth places and the general variables in the second and fifth places. The suffices of the restricted variables agree in order from left to right with the members of the ordered triplet in the same row. The general variables are then put in, x% and xz in the first line, x2 and #4 in the second line, since the first line begins with xx and is followed with x2 and the second line begins with xx then the general variable in the second place in the second line is also x2. Generally the second variable, which is a general variable, is x2 whenever the first variable, which is a restricted variable, is xv In the fourth line the second variable (the first general variable in that line) is x6 because this is the first available new variable in the alphabetical list of variables and this is the first time that x2 has occurred in the first place. Generally whenever the first variable is x2 then the second variable is x6. The H-scheme is obtained by writing down line 1 followed by those lines whose initial segment is the same as in line 1 for as long as possible. Thus lines 1, 2, 5 and 11 agree in having initial segments xxx2xx\ These are followed by lines 3, 6 and 12 which agree in having initial segments x1x2x2. This in turn is followed by line 7 which agrees with the above in having initial segments x±x2. The agreement of initial segments is denoted by bracketing. The 17-seheme of order 12 for (1) is the list (2) arranged by bracketing together lines with equal initial segments and ordering lexicographically within the brackets.
3.10 H-disjunctions Write
<&£2&«5
for
qe
for
101
q&°>&°> &»&>&*>
where ($», Qfi, QpQp and #»> are variables in line 6 of (2). We note that the H-disjunction of order 12 namely: 12
(4)
P
n is a tautology, in fact Dq±q12 is a tautology. 2 #0 fails to be a tautology, be0=1
cause we can only have a tautology when fflffiffl is the same as £f)£f )£f) a n d for 1 ^ e,df < 12 this only occurs when (9=1 and #' = 12. Thus the i?-scheme of order 12 for the statement (3) makes the ^-disjunction of order 12 a tautology. Now consider (Ex±) (Ax2) (Ex3, a?4) (Ax5) DNpx5xAx1px1x2x3. (5) Any £T-scheme for the statement (5) fails to make an iZ-disjunction a tautology because Qd) i s alphabetically later than £i0) while ^2e) is alphabetically later than $*\ hence &e)QP£e) fails to agree with g f ) ^ ' ) ^ ' ) for any 6, dr. Consider again the statement (5), an i7-scheme of order K for (5) gives rise to an if-disjunction which fails to be a tautology for any numeral K. A disjunctand of the if-disjunction of (5) is ,3> V4] xVl xVipxVi xa[vi] xVa,
(6)
where (T[v^\ > vx and p\yx, vs, v^\ > vly v3, v±. Consider the 2-valued model e/T in which the individuals are the natural numbers. We can make (6)
take the Jf-value/ by taking: pv1v2v3=f
for
v1
and pvxv2v3 = t
for
v1 > v2,
the values for v1 = v2 are immaterial. Consider the negation of (5) (Axx) (Ex2) (AxSi xA) (Ex5) Kpxhx±xxls[pxxx2xz.
(7)
Thus we can obtain a satisfaction of (7) over the model *A^ We shall demonstrate later the general proposition that if the Hdisjunctions of (1) all fail to be tautologies then there is a satisfaction of the negation of (1) over the 2-valued model in which the individuals are the natural numbers. Note that we lack a method for deciding whether
102
Ch. 3 Predicate calculi
there is a numeral K such that the if-disjunction of order K is a tautology. If we had such a method then the system &c would be decidable, we show later on that the system ^c is undecidable. From the tautology (4) we may obtain an ^^-proof in normal form of the statement (3). Apply universal quantification to the variables x5, x7, x8, xQ, x10, xll9 x12, xu, x15, x16 successively, these variables occur at one place only in (4) so the condition on variables in the rule for universal quantification is satisfied. Delete these variables from the H-scheme of order 12 this leaves: line 1 2 5 11 3 6 12 7 4 8 9 10
x6
The tautology (4) has become 12
(4.1) where q'd is qe if d = 1,2 otherwise q$ is (Ax5) q£{d)&>^Q0)x5, Now apply existential quantification to the fourth variable in every disjunction of (4.1) except q1 and q2 and delete those variables from the J?-scheme of order 12. The disjunction (4.1) becomes 12
(4.2) 0= 1
where qf{ = ql9 & = q2 otherwise q" is {Ex^){Axb)q^^^x^xb.
In
(4.2) ql is the same as q[x so cancel qu, also ql, qfQ, q[2 are the same so cancel ql and ql%, also q% is the same as ql so cancel q%. Thus (4.2) becomes (4.3) D...Dq1q2qlqlq';qlqlql0. 7-times
3.10 if-disjunctions
103
The deleted H -scheme of order 12 has become: line 1
X2
xz
x±z
x2 xz "x x2 x^
3 7 4 9 10
In the disjunction (4.3) the variable x4 occurs only in q2 hence we may apply universal quantification to it, we can then apply existential quantification to the fourth variable in q2 this makes q2 the same as ql so cancel ql and (4.3) becomes: (4.4) 6-times Delete these variables from the ZT-scheme. This is indicated above by a stroke through them and through 5. Now apply existential quantifiers to the third variable in each disjunction of (4.4) except qx and cross these variables out of the ^-scheme. The disjunction (4.4) becomes Z>...Z>
(4.5)
A = 2, 3, 4, 7, 9, 10.
f
In the disjunction (4.5) q 2", q%, q? are the same, so are q± and qfg. Cancel duplicates and we obtain the disjunction (4.6)
Uio and the deleted i/-scheme: r
g
xx
x2
xz
xlz
r
r
g
JUX
JUX
JUZ
L
line i
2 10
In the disjunction (4.6) the variable xls occurs only in q±Q and the variable XQ occurs only in q% hence we may apply universal quantification to them, we can then apply existential quantification to the first variables in q'l
104
Ch. 3 Predicate calculi
and q^Q. Cross out these variables from the deleted //-scheme. q*l and qx0 have now become the same, so omit q'i0. The disjunction (4.6) has become: Vxj) {Ax2) (Exs, x4) (Ax6)
qxxx2xzx4xh,
and the deleted //-scheme is: r
g
r
r
g
X-^
X%
I X^
X^
Xg
L
line JL
2 4
The variable xz occurs free only in qx so we may apply universal quantification to it, we can then apply existential quantification to the third and fourth variables in qv This makes q± the same as q2, so cancel q2. We then obtain the disjunction D(Ex3, x4) (Ax5) qQV^x3x4xb{Exx)
{Ax2) (Exs, x4) (Ax5) qxxx2xzx4xb. (4.7)
In the disjunction (4.7) the variable x2 occurs free only in the first disjunctand, so we may apply universal quantification to it, we can then apply existential quantification to the first variable. This makes the disjunctands the same, cancel one of them, and we are left with (3). The //-scheme for (1) of order K can be written down on a fixed plan for any numeral K, hence the place number of a general variable is uniquely determined by the place number of the superior restricted variables. Thus for the statement (3) the place number of the second variable is uniquely determined by the place number of the first variable, and the place number of the fifth variable is uniquely determined by the triplet of the place numbers of the three superior restricted variables. P R O P . 10. If for some numeral K the H-disjunction of order K of a closed ^Q-statement
The method of demonstration is the same as that given in the worked example. We apply quantifications to the various disjunctands of the //-disjunction of order K and delete the corresponding variables from the //-scheme, and cancel duplicates as they occur. At any stage in the proceedings a variable in the //-scheme is available if it is at the end of its line in a deleted //-scheme and is a restricted variable or is a similarly
3.10 H-disjunctions
105
situated general variable which fails to occur elsewhere in the deleted H-scheme. If there is always an available variable until all the variables are deleted from the H-scheme then we obtain an J^-proof of (j). Suppose that at some stage there fails to be an available variable, then in the deleted if-scheme at that stage the variable at the end of each line is a general variable and each such variable occurs elsewhere in the deleted H-scheme. The lines in the deleted if-scheme are always distinct because identical lines get deleted as soon as they arise by cancelling duplicates. A variable can only occur once as a general variable in a deleted if-scheme but it can occur again as a restricted variable. For instance the variable x2 occurs once as a general variable and six times as a restricted variable in the complete if-scheme of order 12 for the statement (3). The restricted variables which precede a general variable in a line of an if-scheme are alphabetically earlier variables. Thus if there fails to be an available variable then each general variable £ at the end of a line occurs again as a restricted variable in another line which ends in an alphabetically later general variable TJ. In turn there is another general variable alphabetically later than TJ and so on without end. This is absurd because the if-scheme of order K is displayed. Thus there is always an available variable and we may continue to quantify and remove duplicates until we obtain an J^-proof of 0. This demonstrates the proposition. P R O P . 11. If cj) is an ^'c-theorem in prenex normal form then there is a numeral K such that the H-disjunction of order K is a tautology. According to Prop. 9 the J^-proof of (j) can be modified to one in normal form. We then have a tautology (8) 9= 1
where each i/r^ differs from ${£,',..., £(77)} by change of individual variables. From (8) we can obtain 0 by l a , 6, IId, d'. Let 3F'Q be the same as ^'Q except that the individual variables are x*, x*', #*",..., and let ^'c (J 3F'* be the same as the system tF'c except that the variables are those of !F'C and those of 3F'Q . We now change the individual variables in (8) to those of ^'Q by superscripting an asterisk to each variable. In this way let ^(A) become ^(A)* and (8) become (8*). We will give a method of changing the individual variables in (8*) to J ^ variables in such a way that (8*)
106
Ch. 3 Predicate calculi
is changed into part of an //-disjunction. Let (8*) become (9) by this change. Clearly if we change an individual variable at all its occurrences to another one then a tautology remains a tautology. Thus (9) will be a tautology and part of an //-disjunction. Thus there will be a numeral K such that the H-disjunction of order of K is a tautology. Let (j> be
where if ft is zero the initial set of universal quantifiers is absent. Let the variables in ijr'*,..., ^JrOO* be (10) We now replace the ^f-variables by J^-variables in (10) from left to right. We first replace through (8*), (10) the first [i variables in each line of (10) by x',..., x^ respectively, that is ££*, ...,££* are replaced by x' at all their occurrences,..., ^ * , . . . , ffl* a r e aU replaced by x^ at all their -statement, occurrences. Let (8*) then become (8'), it is an ^'Q^^'Q clearly it is a tautology and we can obtain (8) from it. This is because all the other general variables in (8*) are distinct from x\ ...,x^\ By this change (10) becomes (10'). Secondly if the lines V and v" of (10) agree in having the same initial segment and if the next variable is different and is a general variable then we may alter this general variable in one of the lines so that they are both the same. Suppose these general variables are $*>* and $?A>* so that the segments £*, ...,£* and £*, ...,£<*>* are the same. In passing from (8') to
3.10 ^-disjunctions
107
initial segments of the two lines are the same) can take place immediately after the generalization of £££A)*, any such variable is distinct from £*£A)*, again because the initial segments are the same. Having made these modifications so that we generalize £^A)* immediately after (except for permutations) generalizing £*?A)* we now everywhere replace £$?A) * by £
108
Ch. 3 Predicate calculi
variable £p[0] is the same as a general variable in line p[p[0]], say p2[d], and so on. Since the scheme (10) is displayed we must have for some fi' < ju,"'.
Thus we have a general variable in line p^Xff] is equal to a restricted variable (which is available) in line p^'^ld] and so it is impossible to generalize the former until the latter has been restricted, but this restricted variable is inferior to a general variable which must be generalized first, this in turn is the same as a restricted (available) variable in line pfl"'1'2[6] which must be restricted first, and so on until, a general variable in line ps^'[d] is the same as a restricted (available) variable in line p^'id], but this variable is £a itself. Thus finally, we are to restrict a restricted variable £a in line a before we generalize an inferior general variable. This is absurd. Thus there is always an available restricted variable distinct from any general variable. Replace the alphabetically earliest such restricted variable (which is an 3F'* -variable) at all its occurrences by the first as yet unused 3F'Qvariable. This leaves the general variables unaffected. Thus an available variable (which is an J^^f-variable) can always be replaced by an SF'G- variable without upsetting our build-up of an //-scheme by renaming of general variables. Finally each 3F'£-variable is replaced by an tF'cvariable in such a way that the resulting disjunction is a part disjunction of an if-disjunction, because we have chosen the general variables so that this should be so. This completes the demonstration of Prop. 11. 3.11 Validity and satisfaction An J^-statement
3.11 Validity and satisfaction
109
(d) similarly for many-place predicate variables, (iii) We replace the free individual variables by numerals. (iv) We replace a part (A!;) i]r{£) of $ by t if and only if i/r{v} is replaced by t for v = 0,1,2,... otherwise we replace {AE) ^{£} b y / . (v) We replace Dff b y / and Dft, Dtf, Dtt by t. (vi) We replace Nt b y / and Nf by t. (vii) $ reduces to t however the replacements (ii) (iii) are carried out. We lack a test for general validity. This will be demonstrated in Ch. 7. A closed J^-statement is said to be satisfiable over J^ if it can be shown to reduce to t for at least one replacement under (ii), (iii). For example consider: DN(A^DN^}x{QDN{A^^{Q(A^x{^ ( n ) We show that it is impossible for (11) to reduce to/when the above process is carried out. If (11) reduces t o / then N(AE)DNi/r{!;}x{£} and DN(AE) i/r{Q (Ag) x{£} must both reduce t o / . In order that this happen DN\lr{v}x{v} a n ( i ^ M must both reduce to t for each numeral v, and x{v] must reduce to / for at least one numeral, say A:. Then DNJ/T{K} X{K} reduces to t hence ^{K} must reduce t o / , this is absurd. Thus (11) always reduces to t no matter how the replacements are carried out. Consider (3), give t, / to jpX/jiV in any manner, if pv'v"v'" is t for some set v', v'\ v'" of numerals then (3) is t, iipv'v"v'" is always/then (3) is t. 12. A closed ^c-statement is an ^c-theorem if and only if it is generally valid over JV. The J^-axioms are generally valid over JV*. The J^-rules preserve general validity over JV*. Thus J^-theorems are generally valid over Jf. Now suppose that the J^-statement
110
Ch. 3 Predicate calculi
the value / for some assignment of values t, f to each TTV' ... y(A) for each predicate variable which occurs in <j) and for each set of arguments which occurs in an H-disjunction. If a set of arguments fails to occur in any if-disjunction then we give nv'..MX) the value t. Let i^K be an assignment of values t, f to each TTV' ... y(A) which occurs in HK, the JBT-disjunction of order K. irK will only give values to TTV'' ...v^ for those argument sets v' ...v^ which occur in HK. Let J(K be the set of assignments i^K. Now HK is a part disjunction of He for K < 6, hence one i^e will contain all the values given by at least one i^K. We can express this by saying that at least one irK can be extended to become a ^ or that a *Ve with domain of definition restricted to that of i^K becomes a i^K. Each *JlK contains at least one ^ Hence there is a valuation i^ which defines TTV'.. Mx) for each argument set which occurs in some HK and which gives the value / to (j). Now this valuation says that for any values given to the restricted variables there are values that can be given to the general variables, which values depend on the values given to the superior restricted variables, in such a manner that ^ takes the value/. Then N<j) takes the value t and for any values given to the general variables in N
dity over J^K is decidable, we need only replace (J57£) ${£} by 2 0{#} 0=0 K
(and (A £)${!;} by n
and evaluate by truth-tables. COR. (ii). An ^-statement which is valid over JVK for each natural number K but which fails to be valid over JV can be found. Consider the conjunction P of the following J^-statements: {Ax) Npxx (Ax, x', x") CK
3.11 Validity and satisfaction
111
It is clear that P fails to be satisfiable over any JVK. Hence NP is valid over every JVK. But P is satisfiable over Jf (let pxx' be x < x'), hence NP fails to be valid over J^. Another example is the negation of the conjunction Q of the following ^-statements: (Ex)(Ax')Npx'x {Ax, x', x", x'") GKKpxx"px'x"pxulxpx'"x' (Ax) (Ex1) pxx'. Again it is clear that Q fails to be satisfiable over any ^ fiable over JV (let pxx' be Sx = x').
but is satis-
3.12 Independence 13. The symbols, axioms and rules of J ^ are independent. We have to show that N, D, E, p^x\ #(A) are independent. Clearly p is independent otherwise ^-theorems would be without occurrences of p, similarly for x and the other variables. But note that if we omit p we get an equivalent system, similarly for x and the other variables. The only closed J^-formulae of type 00 formed from D, E, X and variables are: PROP.
\p.p, *kp.E(\x.p), \p.E(kx.DpE(kx'.p)),
Xp.Dpp, Xp.DpE(Xx.p), etc.
but these all using A-rule (i) give J3A00, where A stands for any one of the above formula. Hence if we took A as a definition of N then BNtfxfi Whence
DpNp
* 16 P and this is absurd because an ^"o-theorem must contain D. Thus the symbol N is independent. The only closed ^-formulae of type 000 without occurrences of D are: Xpp'.
112
Ch. 3 Predicate calculi
E is independent because the only J^-formula of type 0(01) that we can construct from N, D and variables is Xp0L. (j>, where (j) is of type o and fails to contain E, but this fails to be closed and so violates the conditions for a definition. The demonstration that the ^-axioms are independent is the same as for &>c. The ^,-rules are independent. First, rule l a is independent, because if we omit rule la then we are unable to obtain the J^-theorem DNpp. Any J^>-proof of DNpp fails to use IId,e, because once E enters an .^-proof then it remains in that #^-proof from that place till the base. Thus any J^-proof of DNpp will be a ^ c -proof possibly using 16. Any J^-proof of DNpp will fail to use 116, c because it is without occurrence of NN or of ND, hence an J^-proof of DNpp which omits la proceeds from the axiom DpNp using 16, II a only. We are unable to apply 16 to DpNp so we can only apply II a obtaining DtfiDpNp where (j) is an ^cstatement built up from D, N and p only, other variables and E must be absent because if they were introduced into the ,^c-proof by II a then they would remain in the ^ o -proof from that place to the base. We can only use 16 on Dcj)DpNp if
D(Ex)pxNpx and
DN(Ex)px(Ex)px.
For the rule 16 we note that if we are denied the use of rule 16 then we are unable to obtain: (Ex') (Ax") (Ex" Any J^-proof of (3) which fails to use 16 also fails to use IIa, 6, c (these
3.12 Independence
113
lengthen the formula) and must start with the axiom px'x"xmNxfx"xr" it must then use la, lie, lid twice l i e and lastly lid. But it is impossible to generalize only the first occurrence from the left of x'". 3.13 Consistency 14. The system &c is model consistent. We show that fFc has a model with a sole individual a and two elements t,f of which t is designated a n d / is undesignated. It has the constants N, D, E. N and D obey the same rules as in the model J(c for 8PC. The PROP.
rulefor2?is
It is easily verified that these rules give a model for ^c. 15. The system !FC is consistent with respect to negation. We have to show that if ^ is a closed ^-statement then at least one of
3.14 ^Q with functors The system J ^ with functors or constant individuals or both can be dealt with as the system ZFC. We have the additional rule: v free in
^'
a b s e n t
from
114
Ch. 3 Predicate calculi
This is called the rule of substitution. Here a is a term of type i and is free in ^{a}, i.e. if g is a variable which is free in a then corresponding occurrences of £ are free in
Now suppose that
of atomic statements or negations of atomic statements, from the J^>theorem C(J)DOJX we obtain by Prop. 3, Cor. (iv) <j)r, ...,ftK)\r#r>Da)x, thus is a ^"-theorem. 3, COR. (V). Modus Ponens can be eliminated from a theory whose axioms are disjunctions of atomic statements or negations of atomic statements. Suppose that we have 6',..., ^Y^' Doj^Jr and ^',...,ftK))r&'DNftx then we
PROP.
c
c
have C$DG)X as above whence we have 0', ...,^K)\-^DO)X,
by Prop. 3,
3.15 Theories
115
Cor. (iv). A theory whose axioms contain free variables would normally be based on &"'c rather than on ^c. The rule for substitution of variables merely converts an axiom scheme (as used in ^c) into a set of particular axioms. A theory whose axioms are disjunctions of atomic statements or negations of atomic statements is called a theory in free disjunctive form, free variables are allowed. COR. (vi). If jJ.
is a rule in a theory in free disjunctive form then
^— *,
where co is subsidiary.
We have ^Sffi taking N<j) for to. Thus Cffl, now is a ^-theorem, whence by Modus Ponens twice we get DNtfxo if we have DNi/ro). But Modus Ponens can be eliminated. COR. (vii). The deduction theorem holds in a theory whose special rules are without restrictions on variables. The demonstration is the same as before, the extra rules of the theory behave just like the e^c-rules other than l i e . COR. (viii). Modus Ponens can be eliminated from a theory without axioms and whose special rules are without restrictions on variables or iutroductions of E. We proceed as in Prop. 4. The case when $ is atomic and DN
3.16 Many-sorted predicate calculi A K-sorted classical predicate calculus of the first order is formed from the symbols: type
xc • XL{K)
C : L(K)
pOL>,...,pol(K)
OL', ..., OL(K)
name
individual variable of the first sort individual variable of the /cth sort one-place predicate variable
116
Ch. 3 Predicate calculi
type name es as Poi'i > Voi'i"> - • • 9 PoiWiW tyP shown two-place predicate variables generally Poe\..eW> where e^, types as shown A-place predicate variables for 1 ^ 6 ^ A, is one of *',..., *M E',..., EM O(OL'), ..., O(OL{K)) existential quantifiers of types shown X abstraction symbol ' generating symbol N oo negation symbol D ooo disjunction symbol ( ) parentheses The axioms are T.N.D. for all atomic statements. The rules are those of fFc with restriction and generalization for each type of individual variable, denoted by l i d ' , ...,IId{K\ He', ...,IIe (lc) . We denote the /csorted classical predicate calculus by ^CK. The situation is just as if in $FC we labelled the variables as x^-K+d\ 1 ^ Q ^ K, and stated lid, e separately for each 6,1 < 6 ^ A:. But the main difference is in the argument places of the predicates. Let J r ( J ) be !FC plus constant one-place predicates 8\ ...,$ (AC) and additional axioms DNS'^S'Z,..., DNSMgSMg. We give a method for translating J^-statements into J ^ - s t a t e m e n t s in such a way that 3^CK' theorems translate into jF^-theorems. The translation of an ^CKstatement is obtained as follows: f
(a) <j) is atomic, say P01{B')^L^)X%I). (A)
K n S^x^'^+^p^e')
. .a^&o), the translation is l(0 (A))^-/+^...^^
(A)
+«.
v=l
If two of the individual variables are the same then we omit an occurrence of S followed by that variable, (6) ^ is Ni/r, its translation is Ni/r\ where ^r' is the translation of ^ , (c) ^ is J D ^ X its translation is Di/r'x', where i/r', %' are the translations oii/r\X respectively. (d) $ is {E0g0)f{!;e)9 its translation is (Eg') f'{£'}, translation of
where ^'{g'} is the
3.16 Many-sorted predicate calculi
117
(e) (j)r is the translation of
if and only if its transla-
tion into 3~ is a 3~ -theorem, (ii) If 3~Kis consistent with respect to negation then so is 3T. (iii) If ^ is consistent with respect to negation then so is 2TK. (iv) There is an effective method whereby given a 3~K-proof of a &~Kstatement (j) we can find a & -proof of the translation of
(ii) follows from (i), so does (iii). Ad. (ii) if y is inconsistent with respect to
118
Ch. 3 Predicate calculi
negation then we can ^"-prove (j> and N(j) for any ^"-statement 0, for the case when
^/J; f —, £ fails to occur free in
£ is a variable of sort 6, and where 0'{F} is the translation of 0{F} and o)f is the translation of a). But this is still a case of lid. Similarly for lie. An axiom DnNn translates into Dn'Nn', where n' is the translation of zr, this is a case of T.N.D. and so is a ^""-theorem, add its ^"-proof. The other ^-axioms translate into ^"-axioms by definition of 3~. Thus half of (iv) is demonstrated. For the second half of (iv) first suppose that 2TK is ^CK\ omit each occurrence of S^iv))af-K'^v)+dlv)) and replace the remaining occurrences of &'/*»•&»> by dp. Omit K and II in (A), also replace (Eg) by (EV>£0) whenever S<®£ occurs in the scope of (E£). All this converts the ^"-statement x[r into a ^-statement
Thus
(Exi)pxi by (Ex)(KSixpx). (Axi)pxi becomes N(Exi)Npxi N(Ex) (KStxNpx) (Ax)NKpSxNpx (Ax) CSxpx.
3.16 Many-sorted predicate calculi
119
Let O be the conjunction of those ^-axioms used in the ^-proof of <j>. Let Y be the translation of O into 3T. By the Deduction Theorem we have CO
I(i)
Iaa
and the rule
o) is subsidiary and may be omitted. 0{/?} is called a variant of
120
Ch. 3 Predicate calculi
Prop. 9 goes through because we can push all applications of I(ii) back into the axioms and so above rules la, b, lid, d'. In the definition of tautology a = a must be given t. Prop. 9becomes: P R O P . / 9 . An I^c-proof can be modified so that all applications of rules IIa, b, c, I(ii) occur above all applications of rules 16, lie?, df. An / ^ - p r o o f then divides into two parts; the first part is a free variable J^-proof of a disjunction. The second part consists of applying quantifiers. In Prop. 10, 11 we need to change 'tautology' to ' / ^ - t h e o r e m ' . In the definition of 'generally over e/T' we add: v = v is given t and v = fi, where v is different from pi, is given/. Prop. 12 and 14 carry over without difficulty.
P R O P. / 1 3 . Tine rule I(ii) is redundant otherwise the symbols, axioms and rules of IZFC are independent. We need only show that the symbol / is independent of the other symbols and that I(i) is independent of the other axioms. The symbol / is independent of the other symbols of I!FC because if there was definition of / in terms of the other symbols o£IcFc, i.e. in terms of the symbols of £FC then this will be without use of any free variables, because a free variable in the definiendum must be present as a free variable in the definiens, this leaves only N, D and E and bound variables and with these we are unable to construct a formula of type OIL Similarly the axiom I(i) is unobtainable from the axioms of ^c. P R O P. /14.1!FC is consistent with respect to negation. If we could /J^-prove <j) and N
obtain an ^ - p r o o f of D 2 Nla^oc^KNrfxf) where Io^0)oc{e) were the cases of I(i) used in the original proofs In this replace Io&e)aSe) by Dn^Nn^ and K
we are left with an^ o -proof of D021 NDn^Nn^KN^, but this is absurd e 0 1 because D 2 NDn{d)Nnid)KN
Dll.
V!£)#£} for
read 'there is exactly one thing with the property
3.17 Equality
121
Given a theory 2T there is another theory 2T1 effectively obtainable from 3~ such that ^""-theorems are ^"'-theorems and the ^"'-axioms are disjunctions of atomic statements of their negations. To obtain &' we replace any ^""-axiom of the form K
Dpoca) Dlotfio) n , ,. , ——jT—n——, for a one-place predicate,
Dpocyo) Dlafiaj Dpfiyco '
Dpyocco Dlafia) DpyfSo) '
for a two-place predicate, including / itself,
and so on, and similar rules for Npoc, Npocy, Npyoc, etc. We now show that rule I(ii) is obtainable from these special cases. We proceed by formula induction. (a) 0{a} is atomic; the result holds by hypothesis. (6)
DDIa/3aj<j)'{/3}
DD
122
Ch. 3 Predicate calculi
(c) (j){a) is N(j>'{a) and the result holds for
DNN$"{(}} o) as described. (c'")
\
*s a n i^roduction of E, replace this by Di/r{y,ot}(o
D(E£) f{£, /?} o) and the result follows. This completes the demonstration of the proposition. #«}
PKOP.18
(Aj)CI&4{Q
ft*) {
Ad. (i)
^a} DNIoc£6(a} r<> }
Ila r / .. x
I(ii)
. ,, . T MTr „ usmg the axiom nDIocgNIotg,
3.17 Equality
Ad. (i')
123
(AMI&M* reversibilityofllc*', ^-L- * substitution, M.P. using axiom I(i),
Ad. (ii)
—i-i-2—
lift
and axiom/(i),
Ad. (no 0=1 (
one /? ^, 1 < d ^ /c, must be a otherwise lower formula is/, this is absurd if upper formula is t, hence KIaa
DIoc/So) DI/3yoj
— and The first comes from Iaoc DIaocco DI/SOCG).
The axiom expresses the reflexiveness of equality, the first of the derived rules expresses the symmetry of equality and the second derived rule expresses the transitivity of equality. A relation (a formula of type ou) is called an equivalence relation if it is reflexive, symmetric and transitive. 3.18 The predicate calculus with equality and functors We showed before that !FC with functors is virtually the same as ^c without functors in that we can easily translate from the one into the other. We now demonstrate a similar result for / J ^ in a different way. Let I^a be I^c with functors and possibly with constant individuals (functors without argument places). We translate I^Cf into a theory y without functors or constant individuals in such a way that I^cf *ne~ orems translate into ^theorems and ^"-theorems which are translations of /J^-statements are translations of /J^-theorems. If we had started
124
Ch. 3 Predicate calculi
with a theory based on I^ct then a similar translation is obtained. The translation is as follows. Replace lag Iga Igrj Iafi ^{a}
by lag, where a is a constant and g is a variable, by lag, where a is a constant and g is a variable, by Igyj, where g and rj are variables, by (Eg) Klaglfig, where a and /? are constants, by (Eg) K
where a', ...,a(/c) are constants, but if any of the a's are variables then omit the corresponding Ia^Tj^ and the corresponding quantifier, use different predicates F for different functors / .
*
(1)
for all F and a that we have introduced. If we had started with a theory 3" based on l^Ci then we should also require the translations of the «^"axioms as axioms in the translated system, similarly for ^~-rules. 19. An !Ffstatement is a ^-theorem if and only if its translation is a ^-theorem. Let
3.18 The predicate calculus with equality and functors
125
Let $ be the translation of
the translation of this is We have
8<*
Prop ? 8? M R
* revers lbl htyofll6'. For the rule I(ii)
D
the translation is
We have
D(E^Ka{^{Q(o
D(E£,)Ka{£}b{g}a) t
as desired. For the substitution rule this becomes
(
D(E£)Ka{® ${
Prop
126
Ch. 3 Predicate calculi
We have the tautology CxCDcfri/rDKxfift, hence we have
5>
)> M p
as desired. Thus the ^-rules and the J^^-rules translate into or derived ^"-rules, and the result follows. 3.19 Elimination of axiom schemes P R O P . 20. A theory ^ without functors and based on I^c and with axiom schemes can be replaced by an equivalent 2-sorted theory £f with a terminating sequence of axioms. We are thinking of axiom schemes involving arbitrary statements; for instance Mathematical Induction. Axiom schemes which contain arbitrary terms but are without arbitrary statements can be replaced by axioms provided we add the rule of substitution 11/. To demonstrate the proposition we introduce two new symbols: s and e of types at and 0(01) t respectively, where 01 is the type of a second sort of variable. The construction will be clearer if the reader interprets
Jx for N(Ex', x") (x = sx'x") read (x is an individual', sxx' is the ordered pair xx', sxsx'x" is the ordered triplet xx'x", etc. The first sort of variable is a variable for sequences of individuals. We frequently write (xeX) instead of exX and read it as 'x has the property X\ where X is a variable of the second sort, we call them properties. We now translate the theory 2T into a two-sorted theory £f as follows. 7r£ by 7T& by by
(£e77-), where n is atomic and of type 01, (sggen), where n is atomic and of type ott, («£«£TOT), etc.
3.19 Elimination of axiom schemes
127
We shall frequently write j ^ for sgsg's... sg^K)g^8K\ £nt) is the sequence made up from the members of the sequence £ in their proper order followed by the members of the sequence t) in their proper order. We frequently omit the suffix 8K in %^K) and just write £. We now give some definitions: D12 D13 D14 D15 D16 D17 D18 D19
(£e3 U 3') (£eS n 3') eZ x V)
(geSE)
KJglg^g'eldS)
complement
for for for for for
K(£eE)(£eS)
for
(rieS)
for
iL/£(£n£/ngeS)
D(ieS)(ieE)
(E£)(KJ£'{£"&•£))
union intersection direct product domain inverse permutation identification.
We now show that by means of these definitions a statement ^{£',...,
£^SK)}
becomes
(jr^eA),
where A is constructed from atomic properties by means of complement, union, intersection, domain, direct product, inverse, permutation and identification. Our second sort of variable is a variable for predicates or properties so we must ensure that we can perform operations on these variables corresponding to the operations we can perform on predicates by means of logical connectives. To do this we adopt the axioms: (1) D20
B(geE)N(geE)
(V)
r
X =X
(EX){Ag)B(geX)N(geE).
for (Ag) B(£eX) (geX')
(sxyel)
for x — y,
then (1') becomes (EX) (X = E), i.e. S is a thing of the second sort. We want similar axioms for (2)-(8) below. (2) (3) (4) (5) (6) (')
(8)
£jJ^J\.J c, J c, (£ £
£ EKjnV^Hi) r^J\.o (^ J t, (
128
Ch. 3 Predicate calculi
We want ~ U fl x 2 Cnv1 Cnv2 Id as operations on properties satisfying the above axioms, then we can show that we have J3(£eA) ${£} for any statement §J and some property A. We proceed by formula induction on 0. (a) (j) is atomic, say fi is TTE,' ... gK\ replace as described above. (b) (j) is N(j>' and the result holds for 0', so $'{£,',..., QSK)} has been replaced by fec^A'), we replace N
(3(/^A").
We first replace (t)(A)eA') by (fe^eA^) and (j^eA") by (E^eA^) where £',..., £ ( ^ are the variables 9/',..., 9/(A) and £',..., £(^ in some order, say in alphabetical order without repetitions. We have for individuals £, £' and sequences V:
Any permutation of a sequence can be brought about by repeatedly interchanging a pair of consecutive members. Thus a sequence of natural numbers can be brought into a sequence in order of magnitude by repeatedly interchanging consecutive members. Repeated application of Cnv1 will bring any given member of a sequence to the front then applications of Cnv2 will interchange the first two members then repeated applications of Cnv1 will bring all the members back to their original places except that two consecutive members have been interchanged. Thus by repeated applications of Cnv1 and Cnv2 we can bring any sequence into a given permutation. Thus by applications of Cnv1 and Cnv2 we can bring 7i'...^ into gr>mmmgm where 6' < 6" <...< if 80™ < 6^ apply Cnv1 until g^(7r)> is at the end then apply the direct product d^-Sd^times. This will give us the sequence
Now apply repeatedly until we have filled in all the gaps. Then apply Cnvx repeatedly until % is in front. The new variables introduced can be
3.19 Elimination of axiom schemes
129
K
relabelled so that the whole sequence is £'... Q \ If there were any duplicates in 7]'... rffi then permute until one is in front and its duplicate at the rear then apply Id to eliminate the rear one. Finally replace ^ by (ite^e/i!" U Aiv). This completes case (c). (d) is (E£)^{g,t)} and we have replaced $${£, §} by !?t)eE, we replace {Eg) 0'{£, t)} by tye^E. This completes the description of the replacement. For the full development we require some more axioms: (9) xey Xex
XeY
(10) {Ax,y){Ez){z = sxy) (11) GKJxJyLxy (12) CKKLzu(x = szv) (y = suw) BLxyLvw (13) GK(x = szu) (y = suz)Lxy (14)
(AX)(Ax,y)CK(xeX)(yeX)Lxy,
where Lxy expresses that the sequences x and y are of equal length. For instance the last of these axioms says that two sequences having the same property have the same length. This should suffice to complete the demonstration of Prop. 20. In any system the rules are normally rule schemes. Our method allows us to replace rule schemes by single rules provided we have a substitution rule. For instance the / ^ - r u l e s become: xeU' u l u f u P ~xeU'\)Y\)X\)U' TT
LLa
xeX —==—=; xeX\jY xe@(X u U)
__lib
xeX[jX\jU xeX[)U~xeX U U xeY u U —— ; xeX[)Y[)U xe2X
TT lie
xeX U U —= ; xeX u U
U U
la, ...,IIc give the rules of Boolean Algebra, if we add IId, e we get a Boolean Algebra with a projection operation superimposed on it. In the 2-sorted theory © quantification is allowed over one sort of individual only. A predicate calculus of the second order is obtained from a predicate calculus of the first order by adding quantifiers over predicate variables. It can be shown that a theory ST based on the pure predicate calculus of the second order with axiom schemes fails to be replaceable by a 2-sorted theory without axiom schemes.
130
Ch. 3 Predicate calculi
3.20 Special cases of the decision problem We are as yet unable to show that J ^ has an unsolvable decision problem, namely the problem of finding a uniform method of deciding of any ^cstatement whether it is an ^-theorem, because we are as yet without a precise definition of what we mean by 'solvable'. Later on we shall identify 'solvable' by 'calculable by a Turing machine'. We shall then show that there fails to be a Turing machine which will tell us whether an c^-statement is an ^^-theorem. We have to postpone the undecidability of !FC (which is entirely due to I b) until we have given the theory of Turing machines. Meanwhile we can give some methods for deciding certain types of J^-statements. When we come to the theory of Turing machines it will be seen that these methods can be set up on a suitable Turing machine. P R O P . 21. The monadic !FC is solvable. More explicitly an ^-statement containing only one-place predicates is an ^c-theorem if and only if it is valid in a domain of 2K elements, where K is the number of one-place predicates in the statement. Let ^ be a monadic ^-statement containing exactly K distinct oneplace predicates
22. The ^-statement
is an fFc-theorem if and only if it is valid over a domain with A elements (one element if A = 0). Thus it is solvable.
3.20 Special cases of the decision problem
131
First consider (Erf,..., T/W)
Thus if (Erf,..., 7/W)
If the disjunction 2 ^ w is a tautology then ^ is an ^-theorem. We have:
By taking w to be 2 * ( ^ (A) , where the asterisk indicates the omission of fte), and repeating and then discarding duplicates by 16 we obtain
but the upper formula is an ^r-theorem. Apply l i e ' repeatedly to the lower formula and we obtain the ^ o -theorem i/r. K
If jr is valid over a A-element domain then the disjunction 2
takes the value t no matter how the values t, f are given to its atomic formulae n^... £(d(v)), 1 < 6', ...,6^v) ^ A because one disjunctand will then take the value t, namely that disjunctand that gives the satisfying set £', ...,£(/M). Thus the disjunctand is a tautology and so jr is an !FCtheorem. Thus if jr is valid over a A-element domain then jr is an !FCtheorem. If jr is an .^,-theorem then it is valid over any *WK in particular it is valid over a A-element domain. This completes the demonstration of the proposition. The decision problem for #~o can be framed in several ways: (i) To decide if an J^-statement is valid over JV', this is the same as to decide if an ^.-statement is an ^.-theorem. 5-2
132
Ch. 3 Predicate calculi
(ii) To find the natural numbers K such that an ^-statement is valid over JfK and to decide if it is valid over Jf. (iii) To decide if an .^-statement
COR. (i) A closed &c-statement in prenex normal form with a prefix of the form can be decided as regards validity. We first reduce the problem to the case of a closed ^-statement in prenex normal form with a prefix of the form (Aij, ijf) (E£'9..., £(A)). We give the demonstration for the case ju, = A = 1 and where the matrix is built up from a single binary predicate variable n. (EO(AV,V')(EQ>{n;£,V,V',Q.
(1)
The case when A > 1 is dealt with similarly, the case when ji > 1 follows by repetition of the case fi = 1. We show that (1) is satisfiable if and only if
{AT,, r,') m,
r, r, n
KKK^-,
'?,V,V', Q 4,^- 'e,v/e,
n
#r;T,T,9.n#»;T,T,T,n (2) is satisfiable, where the variable g has been eliminated by writing TT'V for ngv, n"v for nut; and a for TT££, a is a propositional variable. This we have indicated by writing '£' instead of £. We have, as it were, taken the resolved form as far as the first restricted variable is concerned. First suppose that (1) is satisfiable over «^#, where J( is JfK or JV*. Then we show that (2) is similarly satisfiable. L e t / b e a function over c^#2 which satisfies (1), if g is in J( then for arbitrary ij,ijf in ^# we have for suitable
; g, ij, g, H
3.20 Special cases of the decision problem
133
all take the value t. Hence the conjunction
(EC, r, r, n KKKW, Z, v, n1, Q M Z, % z, n w> £> z, v, n
v,z,z,z,n (3) also takes the value t. Now define/'f for f£,v,f"v forfvg, a for/££ so a is t or/. Put these in (3) and we have a satisfaction of (2) over JK, satisfying functions being We now show that if (2) is satisfied over ^#, where JK is JfK or df, then (1) is satisfied over a domain obtained from Jl by adjoining a new element. Suppose then that the functions/,/',/" and a satisfy (2) over J( and that £ = g[v,v'], £' = g'[vl T = g'Wl C = * give the values of £, £', ^", ^7// in the model over J(. It is easily seen, by distribution of quantifiers, that £, £', £" and Q" depend only on the variables shown. Now let a be a new element and define
g[aoc] = a.
Then g is defined over <J( u {oc}. The conjunction (3) takes the value whence so do: (EQ ji{/, a, ?/, 9/', £} = ^ for arbitrary 7/, r/f in ^ , /, a, ?/, a, £'} = t oc, a, V', £"} = t
for arbitrary 7/ in JK, for arbitrary if in uT,
but this says that (1) is satisfied over ^ u {oc} as desired. We now show that there is an effective method of deciding whether a closed ^.-statement, (A£9 £') (EV', ...9^)^{n; £, g',^, . . . , ^ } (0) containing exactly one predicate variable and that binary, can be satisfied. The method with more predicates and with various place numbers is similar. A table of order v for a binary logical function / is a v x v array of fs and/'s giving the value offA/i for 1 ^ A, ju, < v. A table T^ of order (ju, + 2) for a binary logical function/, is said to satisfy (fi{n; £, £',y f ,...,T/^} if $5{/;l,2,3,...,(/* + 2)} = *. 7
(4)
Denote by [I //c/c'], 1 ^ K, K' < /£, that table of order 2 which is obtained
134
Ch. 3 Predicate calculi
from the table T^ when we extract the 2 x 2 principal minor (AC, K'), i.e. the intersection of the K, K' rows and columns. We show that N. and S.C. that (0) be satisfied is that there is a nonempty set S of tables of order ([i + 2) which satisfy (4) and which have the following properties: (A) If TQ is a table of 2 and 1 ^ K, K' < [i + 2, K =f= K', then there is a table T of S such that [T/12] = [TOJKK']. (B) If To is a table of S then there is a table 21' of 2 such that [T'/l] = [T0/l] and T' = [77//1134 (ji + 2)]. i.e., the first two rows are columns are the same. (C) If Tl9 T2 are tables of S then there is a table I7 of S such that If (0) is satisfied then clearly there is a set S of tables of order which satisfy (4) and which have the properties (A), (B) and (C). If (0) is satisfiable then it is satisfiable over Jf and there will be a table over JV x ./K giving the value of//c/c' for all 1 ^ K, K' . In this satisfaction we replace £, £' by any two members of Jf and there will be fi other members of JV for if,..., 9/(^, call these 3,..., (ji + 2), where £, £' are 1,2; there will be other /i members of JV*, if £, £' are K, K', 1 ^ AC, K' ^ /^ + 2 which gives (A). Similarly (B) if we replace £, £' by the same element ofJ^, namely 1. If we replace £, % by d, 6' and again by zr, TT' then there will be a satisfaction when we replace £, £' by #, TT which gives us (C). Let ^ be a table over ^ which satisfies (0) then we choose any two members of N for £, £', say K, K', and there will be fi other members of JV for ?/', ...,9/ ( ^, say y', ...,v ( ^, then the (/^ + 2) minor of &" formed from the K, K\ V', ..., v^ rows and columns is a table I7 which satisfies (4). The set S of tables which satisfy (4) and are of order (/i + 2) is bounded because there are at most 2(/*+2)2 of them and these can be written down and tested to see if they satisfy (4). We now show that if there is a non-empty set S of tables of order (/£ + 2) which satisfy (4) and have properties (A), (B) and (C) then (0) is satisfiable over JK It is an effective process to decide if a set 2 of tables of order (ju, + 2) which satisfy (4) also satisfy (A), (B) and (C). Let
co[v9 K] = |(/c 2 + 2KV + v2 - 3K - v + 2), this is the place number of the ordered pair
3.20 Special cases of the decision problem
135
Let 2 be a set of tables of order (fi + 2) which satisfy (4) and (A), (B) and (C). Denote by S(TV T2) a table of 2 such that
such a table exists by (C). By some numbering of all possible tables we can easily arrange that S(Tl9 T2) is uniquely defined. Let 2X be the set of all tables of order 1 which are minors of the tables of 2. Let 2 2 be the set of all tables of order 2 which are minors of tables of 2. Then 2X has at most 2 members and 2 2 has at most 16 members. We now define a set Z0,O = 0,1,2, ...,of tables of order 1 + 0./i as follows: Zo is an arbitrary member of 2X. If P2 is a member of 2 2 we denote by H(P2) a table of 2 such that [H(P2)/1, 2] = P2, by (A) there is such a table. When ZQ has been determined in such a way that \_Z0\K, K'] is in 2 2 , 1 ^ K, K' < 1 + O.JLC, then we determine Z0+1 as follows : (for Zo this condition becomes [ZQ/1, 1] is in 2 2 , this is satisfied by (B)). Let 0 = OJ[K, K'], then [Ze+1jl, 2,..., 1 + 0./i] = Z6, i.e. Z0+1is the same as Ze whenever Z0 is defined. [Z0+1/K, K', 2 + 0./I, ..., l + (d+ l)./i] = E[Z0/K, K'], this follows from (A) and the hypothesis that \Z0\K, K'] is in 2 2 . Altogether this gives the values for Z6+1 at the points (y, v') where v,v' = K,K',2 + 0./i,...,l
+ (0+l).fi
or
v,v' =
1,2,...,1+0./
We also want this is possible by (C), here A = 0,1,2,...,1 + 0. fi except K, K',
This gives the values of Z0+1 at the points for 1 < v,v' ^ l + (0+l)./i. It remains to show that [Ze+1jv, v'] is in 2 2 for 1 ^ v, v' < 1 + (0 + 1) ./i, so that we can proceed as above to determine Z0+2 from Z0+1. Clearly this already holds for 1 < v < 1+0.ju, and 2 + 0 < v' ^ l + (0+l)./i by our construction. It also holds for 2 + O.ji ^ v ^ l + (0+l)./i because then [Z0+1/v, vf] = [£([Z0IK, K'])/V, V'] which is in 2 2 . Thus it holds generally.
136
Ch. 3 Predicate calculi
We now see that (0) is satisfied over Jf, for let AC, K' be a pair of natural numbers, 6 = G)[K9K']. Then 0{/;#c, Kf,2 + d./i,..., 1 + (# + l)./i} = t, where / is the function such that /[>,*/] is given by
E([Z0JK9 K*]).
This completes the demonstration of the proposition. The same procedure can be applied to
or we can treat this as (Ag, g') (EV',...,rjW)Ktfn;
£,?', ...,^>) (Dng'g'Nng'g'),
to which it is equivalent. But if we apply the procedure to
then it fails, because though we can find conditions analogous to (A), (B) and (C) yet some are necessary and others are sufficient. We are unable to find necessary and sufficient conditions. 3.21 The reduction problem Since we are unable to solve the decision problem for 3FC it is of some interest to find types of J^-statements to which any J^-statement is equivalent as regards satisfiability or as regards validity. We have just shown that any J^-statement of the form (0) can be decided as regards satisfiability, in this paragraph we show that any J^-statement is equivalent as regards satisfiability to an effectively constructible J^-statement of similar form to (0) but with three universal quantifiers at the head of the prefix instead of only two. We thus have achieved some finality in the decision problem as regards satisfaction in the sense that statements of the form (0) are solvable as regards satisfaction, but those like (0) but with 3 universal quantifiers are unsolvable, (in fact equivalent to the general case). Again if the binary predicate is replaced by a unary predicate then we again have solvability. Thus we have gone as far as we can in this direction. There are a variety of other directions in which we could try to get similar results. These come about by considering different types of prefix.
3.21 The reduction problem
137
P R O P . 24. A closed ^c statement $ is satisfiable over J^ if and only if a certain binary 3Fc-statement of the form
(Ag, g', g") {Ei,', ...,ifi»W{p; g, g', g',v', ->V(li)}
(5)
is satisfiable over JV, where p stands for nf, ...,7T^K)9 where 7Tr,...,7T^K) are binary predicates and there is an effective method of finding ft given <j>. (i). If (f> is a closed ^c-statement then there is an effective process for finding a closed binary ^-statement ft such that $ is an £Fc-theorem if and only if ft is an ^-theorem. A binary ^Q -statement is an .^-statement which contains only binary and singularly predicates. Let n', ...,7r(A) be exactly all the predicate variables which occur in the J^rstatement 0. Let n^ have /i^ argument places. Let /i = Max \ji\ ..., /^(A)]. Let n[,..., n[X) be new and distinct oneplace predicate variables, let 7r'2, ...,7r^) be new and distinct two-place predicate variables. Let v be a new individual variable. We form ty from $ by replacing uP>g...gW9f> by LEMMA
(Av)K n TT^^vn^v
(6)
6' = l
throughout (p. Clearly i/r is binary. If (j> is an J^-theorem then cj> is generally valid over Jf. If we replace n'2,..., n^6^, 7r[e) by logical functions over Jf then (6) gives a logical function over JV but if in
138
Ch. 3 Predicate calculi
ing. We define one-place logical functions q', ...,g(A) and two-place logical functions r\ ...,r ( ^ over JVP as follows:
v' = t if and only if vx = v'% (1 < 6 < A). Since ^ is generally valid over Jfp (since «/f^ is enumerated) then it is satisfied over «#> by replacing n^ by g(0) and n^ by r(^. Let \jr become \Jr after this substitution. In particular (6) becomes:
(Av)C U r^^vq^v.
(7)
0'i
Let AA/£ be that logical function over J^ which has the value t if and only if A = [i. Then (7) becomes (Av',...,itf*(n>)C II
0'1 0'=1
this has the same truth-value as p<®£'... g(/*(0)), hence
An ^ o -statement is said to be in Skolem V-normal form if it is closed and in prenex normal form and has a prefix in which each existential quantifier preceeds each universal quantifier, i.e. if it is of the form
where the matrix
Let <j) be a closed binary J^-statement, let $' be a prenex normal form of it. Then ^ is satisfiable if and only if
3.21 The reduction problem 139
J^-theorem. If $' begins with an existential quantifier then conjunct to it (AQDNTT&T^ and reduce the result to prenex normal form beginning with the universal quantifier (AQ. Call the result $". Then B<j>'<j)" is an J^-theorem so $' is satisfiable if and only if $" is satisfiable. Thus 0 is satisfiable if and only if <j)rr is satisfiable. Let <j)" be (Q) % where (Q) is (A1c1)(Et)1)(Qf) and (Qf) is (A$2)(Et)2)(Q"), here (Q), (Qf), (Q") denote sequences of quantifiers. Write: <»[j,tj] for ( ^ ^ n ^ ^ n ^ ^ w ^ . 0=1
(8)
0'=1
Also write (E£)(o0[£, £,t)] for (8). Here n\ ...,TT
Let ^ i v be
(AQ (Eh) (E\)2) (E£) (Q")Ka>0 K', Sl , where the sets of variables j 1 ? t)l9 j , t), J2? ^2 a r e distinct. If ^ i v is satisfiable then so is (Q) #, i.e. $" is satisfiable. We have B(j)"'(])iw is an ^-theorem, because 0 i v is a prenex normal form of <j)'". Also
because we have the J^^-theorems: C(At)) CG>[
whence by Modus Ponens: C(Ai) (At)) COJ [jr, whence, using which is
C(j)'"(Q)x
(10)
as desired, hence C
140
Ch. 3 Predicate calculi
such that (Q) x takes the value t when the predicates in x are replaced by the logical functions just mentioned. Call the set of natural numbers vr, ...,v(A), K',...,KW a T-set, since (Q)x is satisfiable then there are T-sets. Enumerate the ordered (A +/^)-tuplets and let the 0th be <#l>--->#U+/*)>-
Define logical functions £l±[v, K], ..., Q(A+/t)[>, K] such t h a t an[69Onl = t
for
I^TT^A, (A+/0
if and only if 0 X ,..., du+/l) is a T-set. Then (EQ JJ Qw[£, 0 J takes the value 7T=1
t for T-sets otherwise the value/. Thus for these logical functions 0 l v takes the value t because (A%)(Et))(EQ fl o)[£,g®] then takes the 0=1
value taken by (Q)x
so
^ na ^ ^0)[%^](Q)X takes the value f. Thus
K(Ai)(Et))oj[^t)](Ai){At))Co)[i,t)](Q)x takes the value t. Thus 0 iv is satisfiable as desired. Repeat the process until all the existential quantifiers succeed all the universal quantifiers. This completes the demonstration of the lemma. LEMMA, (iii). If (pis a closed binary ^-statement then there is an effective method of finding a closed binary ^-statement i]r in Sholem V-normal form such that $ is generally valid if and only if \jr is generally valid. The demonstration is similar to that of lemma (ii). These are similar lemmas omitting the word 'binary'. L E M M A (iv). If (j) is a closed binary ^-statement in Skolem S-normal form then there is an effective method of finding a closed binary !Fc-statement i/r in Skolem S-normal form and containing exactly three universal quantifiers such that $ is satisfiable if and only if \[r is satisfiable.
Let 0 be (4^...,gW)(^.---.^ ) )^{^--^£ ( A ) »^---,^ ) } where $' is binary and is without quantifiers and
..., ZCn'%%'",
(AC, C, £"') KOK7r%'CTr'CC'K%"'OKn"CC7r"CC'ICC',
(12) (13)
3.21 The reduction problem
141
where £', £" are new. Let the predicate variables which occur in $' be nm9 ...,7T^K+2\ then these are binary. We show that if
F
and (Ag)m, (A£,V)(CU7)Cn£n7i, (A£,V,QCI&iCnKmi£ and (A£tV,QCI&iChrgn&i,
for all singularity and binary predicates which occur in $'" including the equality relation itself. The prenex normal form of
25.f^4 binary ^-statement
{A?, £",£'")(W, ...,*«)#*', -Mx\ g', ...,y^} in Skolem 8-normal form is satisfiable if and only if a binary <^c-statement (A £', £", £") (Ey',..., rf») ir{n; £', £", £", y'9..., ^ \ also in Skolem 8-normal t Prop. 25 follows closely the proof given by Kalmar and Suranyi (1947), though the symbolism has been changed to that of the present author. It is reproduced by permission of the publisher, the Association for Symbolic Logic.
142
Ch. 3 Predicate calculi
form and containing only a single binary predicate, is satisfiable. Also there is an effective method for finding xjr given (j).
Let
(A?, £", nW, ...,^ ) )^K....,»«; £', i", Z",y', ..
be the given binary ^ - s t a t e m e n t in Skolem ^-normal form, where
(15)
where (j) is what ^ becomes when n', ...,TT^ are replaced by p', ...,p (A) respectively and the propositional connectives are replaced by their respective truth-functions.
(17)
where T denotes the logical function which arises from
which consists of the natural numbers J¥', triads of the forms (y{6'\ v«n9 0> and <^'>, v^, 1> and2/, ...,^ A ) . The triads (v^, v^"\ 0> will play the role of the pairs (v^'\ it0"*} while the triads (y^\ v^e"\ 1> serve merely to express the coincidence of the first and second components of pairs. Thusfor
T
= <„>',()>
3.21 The reduction problem
143
we shall define Qrcr = t if and only if v' = v", QCTT = t if and only if K' = AC". v is the first component of r is expressed by QVT, K is the second component of r is expressed by QTK. p' is characterized as the only element of J for which Qp'p' = t, p" is characterized as the only element of J for which NQp"p' = t, pW is characterized as the only element of J for which Qp^pW-V = t, 3 ^ 6 < A. The triads r = (y\ K', 0) are characterized by QrpW = t, we distinguish between the natural numbers and the triads (y", K", 1} = cr by Q^V = * and Qp'cr = / , then the triads or can be characterized by' o* is different from p', ...,#(A) and o-is different from a triad (y'9 K',0) and Q^'cr = / ' . Now if (15) holds then, for numerical functions />(iv), ...,/o(/*+3),
Let
^" denote the set of triads (J/, /c', 0>, ^ denote the set of triads (vf, K',1), & denote the set of {p\ ..., #(A)}, ,/K* denote the set of natural numbers,
We define a binary logical function Q over f by the following table, in this Q has the same truth-value at an entry in the table as the statement standing at that entry.
Qxx'
x' = K xf = X' = {K\ K", 1 >
x' = pW 1 ^ 6f < A [K(d =t= 2)(6>r = 1)
x = pi®
x =x
X*X
D
\ [K(0 = d'+l)(d'* 1 <6> D(0' = 1) (6>' = A)
1) x 3F x v = K' v" = K x 4= a
v — K' */ = /c'
144
Ch. 3 Predicate calculi
Consulting the table we see that (a2) (a6) (6) (c)
Qxx = t if and only if (iff) x = p\ NQxp' = t iff x = p", QxpW-» = t iff x = pW (3 < 6 < A), QxpM = t iff xe^, KKNQxxNQxpWQp'x = t iff
(d) KQxp'Y[NQxpWNQp'x
=t
iff
0= 2
Further supposing that r = (yr, v", 0), a = then (e) (/) (g) (h) (i)
Qvr = t QW = ^ QTV = t Qcrv = t QTO- = t
iff iff iff iff iff
(j)
Q(TT = t
iff
v = v\ V = K\
v" = v, /c" = p, / = K', ACr/ = ^ .
Also s u p p o s i n g t h a t rx =
if
(Z) for
KKKQT1orQT2orQ(rT1Qo'r2
1 ^ 6 < A,
/[^;2/',2/(A)]
Write
y[>;*/',...,*/]
for for
= tf t h e n
rx = r2,
KKNQxxNQxy^Qy'x' KQxy' \{
KNQxy^NQy'x.
e=2
Then for arbitrary elements a/, x", x'" of ^ / and if y' = p',...,y(x) we have (a)
by
KKKQy'y'NQy"y'
U
(ae),
= p{X)
1 < 6> < A,
Qy{e)yV-1)CDDKQx'x'Qx"x"KNQx'y'NQx"y'
6= 3
2 KQxfy^-^Qxffy^-^KBQxfxf"Qx"xmBQx'"xfQx'"xfr.
(18)
This says that if #' = a" = #<*>, 1 < 6 ^ A, then BQx'x'"Qx"x"' and BQxmx'Qx'"x\ and that ^ = ^ , 1 < 6 ^ A. (6) By(c),(d),(/)and(A), CKI[x';y'9
y™] I[x"; y\ y™] (Eu) KK7[u;
y',...,
y™] Qx'uQux".
(19)
3.21 The reduction problem
145
This says that if x', X"EJV then there is a triad (xr, x", 1 >. (c) By (c), (b) (d), (e), (/), (»), (fir), (A) and (j)
CKCKQx'x"Qx'x'"Qx"x'"CKQx"x'Qx'"x'Qx'"x". (20) This says that iix'ejV, x"e$~, x'" e <%, y' = p',..., y™ = p™ then if x' is the first component of x" and also is the first component of x'" then x" and x'" have the samefirstcomponents, and similarly for the second components. (d) By (b), (d) and (k), CK... KQx'^Qx"^y{x'";y',
...,yw]
6 times
Qx'x'"Qx"xr"Qx"lx'Qx"lx" n BQy^x'Qy^x".
(21)
0=1
This says that if x\ xfre^ and o^^e^/" and if x', x" have the same first and the same second components then Bp^x^p^xlxl, 1 < 6 ^ A, where Finally by (c), (6), (e), (g), (I) and the definition of T the fact that for x',x\xmejr and x* = p^x'xW, ...,^+ 3 ) = j W / / (15) holds, 3 C
0 = 4:
n
7 [
^)
y 9
^
( i 7 % i s B m myU
)
0 1
fi+3
n 0,0=1
(22)
This says that for any #', a;", X'^JV there are a;iv,..., x(^+%/T such that if ^ ^ r =
(23)
We have just shown that if (14) can be satisfied then so can (23), namely by the logical function Q given by the table. Now suppose that (23) can be satisfied over JV*, we wish to show that (14) can also be satisfied over JVI
146
Ch. 3 Predicate calculi
By hypothesis we can find a binary logical function over JV*, say Q, and ternary numerical functions over JV
such that on replacing q by Q, y(0) u
tftox'x"^,
by
by a)xrx"x'\ "x'"
(4
in (23) and #', x'\ x'" by ^', ^ , v1" respectively we obtain the value t. Consequently we have CKI[x';y',<»]I[x";y',yW]KKy[a>x'x"x'";y',...,y™] QX'O)X'X"X'"Q
(20') and (21') are obtained by writing p^x', x"x'" for xf-e\ 1 ^ d < 3 p+3
3
/t+3
e=i
0=1
e,6'=i
K n I[p^x'x"x'";y', t/W]0 n I&»;y',ym]K
n
QpWx'x"x'"Te, e, x'x"x'"QTe> g. x'x"x'"pWx'x"x'"W{Q ru^"
KKQT6 e,x'x"x"Y»
;y',...,
y™, (22")
V3,,«^1
for any x',x",x'"ejV and for y«» = ^x'x"x'", 1 < 6> < A. Choose a fixed member of Jr, say 1. Write a(B) for ^ 1 1 1 . Then by (18) for any x',x",x'"eJr,
Qx'x'x"x'"x'x'x"x'",
(23')
NQx"x'xV'x'x'x"x"',
(23")
'/f-iVa;"/
(23^)
(3 ^ 6 < A)
CKQv'v'Qv"v"KBQv'v"lQv"v'"BQv'"v'Qv"'v",
and
CKNQv'x'v'v"v"lNQv"x'v'v"v"'KBQv'v'"Qv"v"lBQv"'v'Qvl"v", e l)
l
6 1)
l
GKQvY - v'v"v" Qv'Y - v'v"v'"KBQv'v'"Qv"v" BQv'"v'Qv'"v".
(24') (24")
3.21 The reduction problem
147
From (23<*>), 1 ^ 6 ^ A, we obtain for v' = v" = v"' = 1, Qa'a',
(25')
NQa"a\
(25")
QaWaP-*,
3 < d < A.
(25<«)
w
Now we show that for any v\ v", v eJV* and 1 < 6 ^ A,
<9 = 1 in (24') put
^WV"
for
i/,
a'
for
/',
^
for
^,
detach (23') and (25'), and we get (26J) and (262). For 6> = 2 in (26^) put X"v'v"vw for i; and use (23") we get %WW, In (24") put
/WV"
(27") for
v\
a"
for
J/',
j;
for
^w,
and replace I W V " by a' in virtue of (26^), then detach (27") and we are left with (26J) and (262). Supposing that we have shown (262(^~1)) for some 3 ^ 6 ^ A then in (262(^-1)) put F W V " for v use the resulting equivalence in (23(^) and we get
in (24<*>) p u t
a«» v
for
v\
for
v\
for
v"\
replace ^-^v'vnvm by a^"1* in virtue of (262^(-1)) then detach (25<*>) and (27^) and we are left with (26f >) and (26^). Consequently we may replace y^v'v"^" by a(^ for 1 ^ 6 ^ A whenever it occurs as an argument of Q. In particular (18'), (20'), (21'), (22'), (23") hold for any v', v\ v^eJf and y^ = dd\ 1 ^ 6 < A.
148
Ch. 3 Predicate calculi
Let JV' denote the set of elements of N for which I\y\ a', a(A)] holds. In virtue of (22'), Jf' has a member. Choose a fixed element b o{J^\ say b = p l v l l l (see (22')). We define predicates^?', ...,^ (A) overJ^' by pWVK = Qddh1>2VKb
(1 ^ 6 ^ A, v9 KeJT').
We now show that these satisfy (14) over^T'. Let y,/ce^'and^ = r1}2VKb. By (22") we have ^(A) ^ = ft)> (28)
LEMMA.
Qvp
(v = v',d=l),
(29)
Q^
(/c = v",0' = 2).
(30)
For any element qe^V' for which Qqa,
(28')
Qvq,
(29') (30')
, we have BQa^qp^vKfor v, mjV', 1 s? 6 < A, i.e. Indeed, for r = U>VKI we have by (19') y[r;a',... ,a«],
BQa^qQa^p. (28")
Q^r,
(29")
Qr/c.
(30")
In (20') put v for )/, ^ for v", r for i/" detach l[v; a', oW], (28) (28"), (29), (29") and we obtain ^ (31) In (20) put v for v', q for v", r for v'" detach i[v; a', a], (28'), (28"), (29'), (29") and we obtain ^ ,„,,, In (20) put K for v', p for v", r for v'" detach /[i^; a', a], (28), (28"), (30), (30") and we obtain ^ m"^ In (20) put K for v', q for v", r for v'" detach /[/c; a', a], (28)', (28"), (30'), (28"), (30'), (30") and we obtain Qrq.
(31'")
3.21 The reduction problem
149
Finally in (21') put p for x\ q for x\ r for x"' detach (28), (28'), (28"), (31), (30'), (31"), (31"') and we obtain which is the lemma. Now let v1', v", v1" be elements of J^\ and let
(this holds also for 1 ^ 6 ^ 3 by definition). By (28') v*, ..., i^+3) also belong to JV'. Let te e, = Td^v'v"v"\ 1 ^ 0, 6' ^ ji + 3. By (28") we obtain,
i
'
^
(32)
>,
(33)
\
(34)
e
{
A,
...,1^^}.
(35)
The lemma now gives from (32), (33), (34) BQa<%,t r: p<W>">,
1 ^ 6 < A, 1 ^ 6', 6" < /i + 3.
Thus by the definition of T we infer from (35)
i.e. the binary logical functions p',..., p^ over ^T' satisfy (14). This completes the demonstration of the proposition. 3.22
Method of semantic tableaux
We shall show in Ch. 7 that ^Q is undecidable, that is to say that any proposed method of deciding whether an J^-statement is an fFctheorem or otherwise will fail to give a result in some cases. We now give a method for deciding whether an J^-statement is generally valid or whether its negation can be satisfied. Of course the method will fail to give a result in some cases. The method consists in trying to find a counter example by the use of semantic tableaux. We try to make a given J^statement
150
Ch. 3 Predicate calculi
Consider an J^-statement (j>. We form two columns and call one an f-column and the other one a t-column, each column will be called opposite to the other and they will be said to correspond. We place
3.22 Method of semantic tableaux
151
f. The method always comes to a definite conclusion, if the first or the last case holds, but fails if the second case arises. If all the atomic formulae are singulary we can obtain a counter example, if that case arises, over a bounded domain. But if there are binary predicates then we have the general case (see Prop. 25), and unbounded domains may be required and the functions introduced in dealing with a general statement in an /-column may introduce such complications that we are unable to tell whether certain atomic formulae occur in both of two opposite corresponding columns. Consider GKN(Ex) Kpxp'x(Ax) Cp"xpxN(Ex) Kp'x
(i) (ii) (iii) (iv) (v) (vi) (vii) (viii) (ix)
KN(Ex) Kpxp'x(Ax) Gp"xpx N(Ex)Kpxp'x (Ax) Gp"xpx (Ex)Kp'xp"x Cp"apa we have substituted for a variable
Kp'ap"af p'a p"a Np"a 1
:
(x) pa
2
:
(0) (xi) N(Ex) Kprxp"x (xii) (Ex) Kpxp'x (xiii) Kpap'a we have substituted for a
variable
(xiv) (xvi)
pa
1 : (xv) p'a
2
p"a
Below entry (xiii) the/-column splits into two columns indicating the two ways in which (xiii) can be made/. The second of these two columns can be closed because (xv) agrees with (vii). Now there is only one/-column and the f-column splits indicating the two ways in which (v) can be made t. This gives us the pair of opposite columns labelled 1, 1 and the pair of opposite columns labelled 2, 1. Both these columns can be closed because the last member of t-2,1 (x) agrees with (xiv) and the last member of/-1,1 (xvi) agrees with (viii). Thus our attempt to make (0)/has failed. Hence (0) is generally valid and so is an ^^-theorem. From the tableau we can find an ^,-proof of (0), as follows: We start with the hypotheses: (ii), (iii), (iv) and (vi)x from these we deduce pa as shown in the tableau but we had already produced p 'a so that we get Kpap'a and hence (xii), thus we have: N(xii), (iii), (iv),
152
Ch. 3 Predicate calculi
where (vi)^ is (vi) with x instead of a. By the Deduction Theorem we have (iii),(iv)
\-C(vi)xCN(xii)(xii),
(iii), (iv) f-C(iv) (xii), Prop. 6 (iii) (iii) (iii) \-CN(xii)N(iv), hC(iii) CiV(xii) N(iv), which is equivalent to (0). Now consider CK(Ex) KpxNp'x(Ex) Kp'xNp"x{Ex) KpxNp"x
(0)
Our tableau is: t (i) (ii) (iii) (iv) (v) (vi) (vii) (viii) (xv)
(Ex)KpxNp'x (Ex) Kp'xNp"x KpaNp'a pa Np'a Kp'bNp"b p'b Np"b p"a
f (ix) (x) (xi) (xii)
(0) (Ex)KpxNp"x KpxNp"x KpaNp"a pa 1 : (xiii) Np"a 2 : (xiv) KpbNp"b
(xvi) pb 21 : (xvii) Np"b 22 (xviii) p'a (xix) p"b
In this case the tableau terminates without the tableau being closed, and we are able to read off a counter example, namely the domain consists of two elements a and b, pa, p"a and p'b are t while pb, p'a and p"b are/. We take pa to be t because as far as making the entry (ix)/we need only take (xiii)/independently of the value of (xii), but to make (i) and (ii) t we must have (xii), i.e. (iv) t. In the ^-column (i) gives rise to (iii), (iv) and (v) and (ii) gives rise to (vi), (vii) and (viii). In the/-column (xii) and (xiii) are the two alternative ways in which (xi) can be made/ of these (xii) closes with (iv). We still have (xiv) and this gives rise to two columns 21 and 22, of these 22 closes with (viii). This column consists of (ix), (x), (xiii), (xiv), (xvii).
3.22 Method of semantic tableaux
153
But column 1 consisting of (ix), (x), (xiii), (xiv), (xvi) can be continued with (xviii) and (xix) from (v) and (viii) respectively, and this column is still open, we could go on with KpcNp"c but it is pointless to do so. The final ^-column consists of (i), (ii), (iii), (iv), (v), (vi), (vii), (viii), (xv) and thefinal/-column consists of (0), (ix), (x), (xi), (xiii), (xiv), (xvi), (xviii), (xix). We read off the counter example by the values given for the predicates p,p',p" at a, 6, in these two final columns. Now consider: C(Ax)(Ex')KDpxx'p'xx'Dpx'xp'xx(Ex) {Ax^BKpxxp'x'x'Kpxx'p'xx' (0) t Dpxap'xa Dpaxp'xx
(0)
1 pxb 11 pxx paa pab pbb
f Kpxxp'bb Kpxbp'xb 2p'xb 12 p'bb 21 pxx pab pbb
paa pbb
22 p'bb p'ab
p'ab p'bb
This gives 4/-columns each of which correspond to the single ^-column at present consisting of only two entries. Now the ^-column similarly splits up into 4 columns, thus 1 pxa LI pax paa pba pab
I2p'xx p'aa
2 p xa 21 pax paa
paa pba
pab
p'bb
p'aa
p'ba
22 p'xx p'bb p'ba p'aa
Each of these four ^-columns corresponds to each of the four/-columns. To make (0) take the value / we require that a pair of corresponding columns be open. We have entered in the columns all the results of substitution over a two-element domain. Thus we require to test 16 cases, viz. any ^-column with any/-column, we tabulate the results:
154
Ch. 3 Predicate calculi /-ll /-12 /-21 /-22 /-ll /-12 /-21 /-22 /-ll /-12 /-21 /-22 /-ll /-12 /-21 /-22
t-ll t-ll t-ll t-ll t-12 t-12 t-12 t-12 t-2l t-2l t-2l t-2l t-22 t-22 t-22 t-22
closed by closed by closed by open closed by closed by closed by closed by closed by closed by closed by open open closed by closed by closed by
paa pab paa paa p'bb paa p'bb paa pab paa p'bb pfbb p'bb p'bb
Thus we have found three open pairs Each of these gives a way of making (0)/over a two-element domain. Namely /-22 and *-ll t paa pab pba
/ p'bb p'ab
S-22 and t-2l
/-ll t
p'bb p'ab
p'bb p'ab p'ba
t paa
p'ba
/
pab
and *-22 / paa pbb pab
p'aa
the values omitted can be chosen arbitrarily.
3.23 An application of the method of semantic tableaux As an application of the use of semantic tableaux we demonstrate the following proposition: 26. Let Cifiijr be a closed ^-theorem where neither N
3.23 An application of the method of semantic tableaux
155
both are constructed from atomic statements as described in the K
n 0=1'
enunciation of the proposition. Suppose also that (f>0 is of the form
while i/rQ is of the form 2 ^o6) a n d that $f} is of the form 2 $0% while 0 = 1
7T = 1
$ f * is of the form n ^ofl where 0O^ and i/rff], are atomic statements or 77=1
negations of atomic statements. Now form the semantic tableau for C
156
Ch. 3 Predicate calculi
Thus each side of the final tableau has l ( ^x ... x A<^x 1<">X ... x i6v) columns which correspond in pairs, we continue these columns with substitutions on the free variables, then if we make all the atoms in one / - column/and all the atoms in the corresponding ^-column t then we shall make C
3.23 An application of the method of semantic tableaux
157
Similarly the entry t can be replaced by Dy'Ny', this in an/-column becomes y' with Ny' below it; this gives y' in that column and in the corresponding column, hence the column is closed as before. Thus columns which were originally closed by a's or /?'s are now closed entirely by y's. We have the closed tableau for Cx* ^o which is an ^.-statement free of quantifiers. We call Xo the sentential power of (p. Similarly if we interchange the treatment of the two sides we get the tableau for C(j)*x*X* is the same in both cases, the structure is the same in both cases, if one divides so does the other and for the same reason. We want to put the quantifiers back so as to get the quantificational power of
Note that if x* i s / then N<j) is an ^-theorem because the tableau for Ncp is closed on its own, and if Xo is t then \jr is an J^-theorem because then the tableau for i/r is closed on its own. Note t h a t / and t can be omitted from x because in the formation of x we can replace KSf and KfS by / , DSt and DtS by t, DSf and DfS by 8, lastly KSt and KtS by S. S o / and t will disappear unless the final result is / or t, but we have discarded this case. Note also that only those y's are used in x which occur positively in both (j) and ^r or occur negatively in both (j> and i/r. Where an atom is said to occur positively in co if it is on the same side of the tableau as a), otherwise it is said to occur negatively in o). Thus only some of the y's are used in x* We restrict a variable which replaces a term which originated from treatment of (f> because terms which at their first occurrence arise from 0 do so by treatment of existential quantifiers. We generalize a variable which originated from treatment of T/T because terms which at their first occurrence arise from i/r do so by treatment of universal quantifiers. Let us return to case (ii) where we were unable to decide whether the tableau closed or was open. A tableau might show that certain compositions of functions were the same, i.e. that f...gx = l...mx for all x, we
158
Ch. 3 Predicate calculi
only need on the t side an .^-statement which has this as an interpretation. By substitutions for variables we can arrive at great complications as regards composition of functions. A composition of singulary functions can be written as a word, i.e. a string of letters or primed letters from a given set called the alphabet, then if we have/. ..gx = Z.. .mx for all x a consecutive part/... of a word may be replaced by a corresponding part l...m. The word problem, which is undecidable, is to find a method which will tell us whether a given word in a given alphabet can be transformed into another given word by using a bounded set of replacements of the type; replace a consecutive part/...*/ by Z...m. In forming tableau we break down formulae until we come to atomic formulae we then use substitutions on the variables and try to close columns thereby. Thus supposing all functions are singulary the problem becomes: can be made the same as b'x'*, ...,UKWK)* respectively by substitutions on the variables using the replacements allowed? This is rather more complicated than the word problem itself. As an example of Prop. 26 consider:
CKKDNy'yDay'DKyNay'DDKPyKyy'KNyp. The tableau is given on facing page. There are 64 pairs of corresponding columns, viz. v, v' where v, v' — 1,... ,8. In the next table we show that the tableau is closed by giving the value of the associated ^-statement.
In many cases corresponding columns can be closed in several different ways. Thus 55' can be closed in three different ways, viz. by oc and NOL being in column 5 which gives/, by y and Ny being in column 5' which gives us t, and lastly by y being in both columns. We have now associated an ^-statement with each column. As we go up the tableau these columns unite by treatment of the *-side. Accord-
3.23 An application of the method of semantic tableaux
159
KKDNy'yDay'DKyNay' DNy'y Day' DKyNay' Ny' a
7
a
KyNa
/
7
KyNa
7'
7 Na
Not 1
2
7
KyNa
r'
4
7'
7 Na
7 Na
3
KyNa
5
8
6
7
/?
Ny
n
V
8'
DDKfiyKyy'KNyfi Kfiy Kyy' KNyfi
7 Ny
r
7
y i
/?
12'
Ny
Ny 4'
5'
Y
ing to our rules for finding x we take the disjunction of the ^-statements associated with lines: 8. lvf and 2v\ 3 / and 4J/, 5v' and 6 / , 7^' and 8J/, The results we show in the following table: ll'/,',7 21/, * 31^,y 41'*, y
12'/,y 22/ 32'y 42'y
13'/,/ 23/, y' 33y 43y
14'/,yf 24/, y' 34'y' 44y
I5y,tfy 25/, t 35/,t,y 45't,y
16'/,y 26/ 36'y 46'y
lVf,t,y,y',Dyy' 18/,y,/ 2f7J,t,y/ 28/, y' 3Vf,t,y,y',Dyy' 3&'y,y'9Dyy' 47% y, y', Dyy' 48'y, y', Z>yy'
Here ^ v' indicates the disjunction of (2v— 1) v' and (2p) p'. In some cases
there are several different ways in which the disjunction can be taken. Thus 47' can give either y, y', Dyy' or t according as to which entry in 77' and 87' we use. The next stage is the disjunction of the pairs (2v— 1) v' and (2y) v'. The result is put down in the next table: 12/,yl3/,y' 14/,y' 15/,t,y 16/,y 17/,t,y,y',Dyy' 26'y 2Tt,y,y\Dyy' 28'y,y',Dyy' 22'y 23'y' 24'y' 25'r,y
160
Ch. 3 Predicate calculi
Again we take the disjunction of the two members in each column. l't,y
2'y 3'y',
4'y' 5%y9 6'y Tt,y,y',Dyy'
%'y,y
We now form the conjunctions of consecutive pairs corresponding to the junction of columns on the/-side. This gives
y; y'; KyDyy',y;
KyDyy',Ky'Dyy',Kyy',y,y',Dyy'.
We have to do this a second time, getting
Kyy;
y, KyDyy', KKyy'Byy', Ky'Dyy', Kyy1
and lastly we require the conjunction of these, however we do this we get something equivalent to Kyy'; this then is the sentential power. We can easily verify that
CKyy'D*K/3yKyy'KNyj3 and CKZDNy'yDay'DKyNay'Kyy' are tautologies. 3.24
Resolved ^Q
The resolved !FC or the resolved I^c is obtained as follows: 7} is a new symbol of type t(ot), the axioms and rules are the same as for ?FC or I^c which ever is the case, except that the symbol E is omitted and the rules lid, e are discarded and the following rule replaces them: H where we have written D21 nM&
for
Here f\^(j>{^) plays the part of a thing which has the property ( It is a term of type 1, it can only be used after ){oc} has been proved for some term oc of type t. The resulting system is called HtFc or B.I^C according as it is without or has the equality symbol. These systems fail to be formal systems because it is impossible to decide whether a term f\^
3.24 Resolved ^c
161
tion for these new functions. The system IHFC is without quantifiers but it is sometimes convenient to have a system using both lid, e and rule H, we then have a system like SFO with functors and constant individuals. A related system is the following system based on I^c\ it is together with the symbol i of type t(ot), we write as usual D22
h
m
for
We have the rule
is read 'the thing with the property (X£^{£})'. The formula is of type i and can be used only after the premisses of J have been proved. The resulting system, which again fails to be a formal system, is denoted by J^Q. From J we obtain r,
From the hypotheses of J' we obtain by !FC (Ag, n) CKK<j>{g)
whence from J and (Eg) K
whence
162
Ch. 3 Predicate calculi
this is called an e-term. £ is called the bound variable of the e-term }. The symbol E and the rules lid, e are omitted and replaced by E where co is subsidiary and can be omitted, this is called the e-rule or the E-rule, we say that this application of the e-rule belongs to the e-term
§ fails to occur free in o) and in If we allow the rule of substitution then rule E' becomes a case of the rule of substitution and so can be dispensed with. If we write D24 then rules E and E' translate respectively into rules lid, e. If we write D25 then we have a definition for the universal quantifier. Thus in the systems E3FC and EISFQ we have definitions for both quantifiers, if we have the rule of substitution then we need only rule E. These two systems are formal because the e-term e^{£} can be used without restriction. The e-term, €
£0{£}> i s r e a d 'the most ^-like thing'; then
un-0-like thing is §J-like', if this is the case then everything is ^-like. The term e^{£} can be used when {A^)N(j){^ is an J^-theorem. For instance if r is a variable for a rational number then er(r2 = 2) is read' the rational number whose square is most nearly equal to 2' but we have (er(r2 = 2))2 =f= 2, so that e(r2 = 2) might be any rational number (see Ch. 12). It is readily seen that if ^ is an i?J^-theorem without occurrences of any e-term then the jE/«^-proof of
3.24 Resolved &c
163
particular term for which 0 holds. If however
Suppose that we have j^\) {
U(p\oCj 0)
in an iJ/J^-proof-tree, where £ fails to
occur free in D0{Tt} o). In this tree replace all corresponding occurrences of £ by a. Axioms remain axioms, applications of rules remain applications of the same rules or repetitions and we are left with an EI^FCproof-tree of D${(X}G). In other words, substitutions may be pushed back to the axioms. Note in particular that rule E is preserved. 27. Rules 116, c, e are reversible in 116 is reversible. We have to show PROP.
Consider the iJ/J^-proof-tree of DNDfrfi'a) and note the places at which a related occurrence oiNDficj)" enters this tree. This will occur at IIa, 6 only. If a related occurrence NDtfi^cfil enters the tree at I I a then replace it by Nfa and if at II6 then strike out the upper right formula and the branch above it and replace ND^>[ (j>[ byN
If in this we strike out related occurrences o£D—<j>" then it becomes
164
Ch. 3 Predicate calculi
this fails to be a case of any rule. We shall have to call the occurrence of ND—(j>" in the c-term related to the occurrence of ND—$" in the upper formula. By this device we preserve the proof. This completes this case. l i e is reversible. We have to show —^ . *. Dj In the U/J^-proof-tree oiDNN^o) consider the corresponding or related occurrences of NN<j>. These will be introduced at II a, c only, omit NN at each of these occurrences throughout the proof-tree and we are left with a tree with Drfxo at its base. Applications of rules remain applications of the same rules or repetitions except that rule E can become upset if a related occurrence of NN
DN
±
Again in the U/J^-proof-tree oiDN(Eg) ${g\ o) consider the corresponding or related occurrences of N(Eg) <£{£} and in these omit (Eg) or the part related to it. These occurrences can only be introduced into the tree at II a, e, when we omit (Eg) applications of rules remain applications of the same rules except that II a, e can become repetitions, axioms remain axioms, but rule E can be upset if (Eg) occurs in the main formula of that rule. We use the same device as before, and the result follows. 28. Modus Ponens is a derived rule o We have to show
PROP.
D(j)(j) DN<j>x
*' Prom the 2?/J^-proofs of the upper formulae we require to find proof of the lower formula. We proceed as in Prop. 4, by formula induction on the cut formula. There are only three cases, namely when the cut formula is atomic, a disjunction or a negation. The last two cases are dealt with exactly as before. The first case is also dealt with as before but the theorem induction
3.24 Resolved ^c
165
requires the additional consideration of rule E. Thus suppose that in our theorem induction we have a case of rule E:
by the reversibility of l i e we obtain D
Here ${£} is atomic, say it is: n<x'...(x(K) where some of the a's contain g or are £. Then <j>{e^N${£}} is 7ra'*...a(/c)s|:, where a(A)* is the result of replacing all free occurrences of £ in a(A) by a, where a is egNnot'... a(A). Thus 0{£} contains fewer occurrences of e than
i
I{U)
'
where a contains e-terms but J3 is without e-terms. If o) and 0{I\} are without c-terms then Dcj){^}(i) is without e-terms. Consider the highest such case of rule I(ii), follow the related occurrences oflocfi up the tree until we come to the highest case of/a'/?' where a! contains e-terms but /?' is without them. Then in the tree above this each IyS must be such that y, S both contain e or neither do. Equations of the type of IyS just mentioned are incapable of producing equations of the type of Iccfl above. Hence the equation IyS must arise from an axiom T.N.D., but this introduces NlyS as well. The descend-
166
Ch. 3 Predicate calculi
ants of NlyS will be in the subsidiary formula of the application of I (ii) which eliminated the e-term This leaves an e-term in the lower formula of that application of I(ii) which will have to be eliminated lower down the tree. This, in turn, in the same manner, will introduce another eterm, which in turn will have to be eliminated still further down the tree and so on without end. This is absurd. Hence the deduction must be without e-terms altogether. Prop. 29 enables us to show the consistency with respect to negation of theories based on EI^C whose axioms are without the e-symbol and which are verifiable. That is to say: a closed statement of the theory without €-terms can be decided using a suitable definition of truth for the theory. If the axioms ^',..., <j>^ of the theory are valid then so is every theorem (j) of the theory which lacks e-terms because a proof of
(£ = ?) for
K(AQBpttp£V(AQBp&p£V.
= is a symbol of type on, we use the more familiar way of writing equalities. More generally equality can always be defined in a theory 2T
3.25 The systems 3&!FC
167
which contains a terminating sequence of predicates, but is without functions. We define (£ = v) for the conjunction of
taken over all the predicates contained in the theory ^ . It is more usual to define D26
(£ = ?,) for
(AQBpZZp&i,
(g * V)
for
and take C(AQ Bp^pv^E, = v) as an axiom, or to define D26'
(£ = ?) for
{AQBp&pr,^
(£ * V)
for
N(£ = V),
and take O(AQBp^p^v (£ = v) as an axiom. In either case if (£ = TJ) we have {AQ Bp^ptyj and (AQBpg^pw^ so that by 3FC pat; may be replaced by par/ and#£a by pyoc in any ^ - s t a t e ment, in other words we have -a) (a (a-a)
aand nd
*>(
Dcj>{fi}oj
by regularity <j) being built up from the sole predicate p, this is the same axiom and rule as for the equality symbol / . D27
(asyff)
for
(A£)Cp£ap£fi9
(ac/?)
for
K(oc c /?) (a # fi).
Read (a c y?) as 'a is contained in /?'. The things which stand in the relation p to oc are less extensive than the things which stand in the relation pto/3. <= is called the inclusion symbol. D28
Smoi for
{A£)Np£a.
'oc is without ^-predecessors', or oc is empty. We can regard the predicate p as being an ordering relation. If pocfi we shall call oc an immediate p-predecessor to /? and /? an immediate psuccessor to oc. We can then identify fi with the class of its immediate ppredecessors, and so identify the relation# with the membership relation. This amounts to reading pocfl as 'oc stands in the relation p to /?' or as c a is a member of the class of things which stand in the relation ptofi'. Now /? is of type i and (Agpg/3) is of type (01). Let A be a symbol of type t(ot) then A(A£p£/?) is of type £ and this term is uniquely fixed by fi. Thus
] 68
Ch. 3 Predicate calculi
if we identity /? with A(Xgp£/?) then we have formalized the above informal exposition. To make this identification we require (/? = A(X|p£/?)) more fully:
BptfpgVkZpgfi)
and J3p/?gp*(Xgp&ff) £
In conformity with previous uses of (X£^{£}) we define D29 If we have a set ^ of things oc, /?,..., and are given a truth-value for each oipococ, pa/3, pfiot, pfifi,..., then for fixed /? we can form the class of things oc for which pafi istf.This gives us another set, say Sf*, whose members are classes of the members of £?, and there is a (1-1)-correspondence between £f and 5^*. Every member of ^ * is a class of members of SP and there is a (1-1)-correspondence between £f and ^ * whereby oc of S? corresponds to the member a* of £?*, where a* is the class of members of £f which stand in the relation^ to oc, call this class (£pz). The relation poc/3 translates into p*oc*fi* or (oc*e/3*), read 'a* is a member of the class /?*'. With this interpretation ^ * is a set of classes whose members are also classes whose members in turn are again classes, and so on, stopping, if at all, only at the null or empty class which is without members. The things a*, /?*, as defined, are subclasses of £f*, and between these subclasses we define the relation p* which we interpret as the membership relation. The members of ^ * are classes of members of «$^*. Classes can be combined by certain operations to yield new classes. Thus two classes can combine to form their union or their intersection. From a single class we form its complement, and so on. Thus we can extend ^ * if necessary so as to contain these and other combinations. We define D30
(#0/?) for
gKpgocplzfl intersection,
D31
(<*U/?) for
^Dpfop^p
union,
D32
oc
^Np^oc
complement.
for
We shall require as axioms or theorems
Bpg(a<\fi)Kp&p£P, Bp£(a{jp)Dpfrpgp,
Bp&Npga.
This suggests that we have in general but this leads to an absurdity. Take NpijT} for ${>)}, then the suggested
3.25 The systems 0^^c
169
axiom becomes: Bpr/^Np^NpT/i], now substitute the term £Np£t; of type i for y and we get: Bp(£Npgi;)(£Npgg)Np(£Npgg)(£Npgli) which is absurd. If instead we take: C(EQpy^Bpy^{^}<j){rj}, and proceed as before we merely arrive at (AQ Np(£Npg£) £, i.e. the term £Np£l; fails to stand in the relation^ to anything. It will be a maximal element in the ordering given by the relation p. D33 @£ for D 34 *g£ for iV(3£,
£ is a proper class.
Thus if we adjoin the symbolA we obtain constant terms of type t of two kinds, sets and proper classes. Let us then change the notation for variables of type t and take them to be: X, X', X",.... Then we define another sort of variable, called set variables, by relativization, thus: D35
(Ex)
D36
$
These give
(Ax)
(EX) K(SX
for
XK(SX(f>{X}. (AX)C&X
We shall want some information as to whether a given class is a set or is a proper class. For convenience we use the letters x, y, z, u, v, w with subscripts or superscripts as variables for sets and X, Y, Z, U, V, W with subscripts or superscripts as variables for classes. We define: D 37
{X, Y}
for
uD(u = X)(u=
Y), pair class,
T> 38 SX
for
il(Ev) KpuvpvX,
D 39 PX
for
H(u c X),
power class,
D40
{X}
for
{X,X},
unit class,
D41
(X,Yy
for
{{X}, {X, F}},
ordered pair,
D42
(X, Y,Z) for (X, (Y,Z)), ordered triplet, etc., in vtuplets, (xr, ...,x^v)}, pointed brackets are put back by association to the right,
D43
V
for &(u — u),
universal class,
D 44
A
for H(u 4= u),
null class,
union class,
170
Ch. 3 Predicate calculi
D45 D46
for p(UV)X, U stands in the relation X to F. 2 u)(Eu, v) K (w = (u, v}) puXpv Y, direct product. 2 X (XxX), 3 X (X2xX),etc, / ®x H(Ev)p(v, u) X, domain, %(Eu)p(y, u)X, range, mx u X or or CnvxX u)(Eu, v) K(w = (u, v}) p(v u) X, converse, X BelX (X c F2), relation, X"Y H(Ev) Kpv Yp(u, Vs) X, transform of Y by X, (Au, v, w) CKp(u, v} Xp(w, v) X(u = w), oneUnX valued relation, $ ib(Eu,v)K(w = (u,v})puv, membership relation, / fi)(Eu, v) K(w = (u, v}) (u = v), identity relation, %(Eu, v, w) K\z =
D47 D48 D49 D 50 D50 D51 D52 D53 D54 D 55 D56 D57 D58 D59 D60 D61
D62 D63 D64 D65
(UXV) (XxY)
3.26 Set theory
171
3.26 Set theory The system £8!FC consists of the binary !FC with exactly one predicate p and the class-forming symbolA of type t{ot), so far we have only given definitions, if we wish to use the system as a set theory we shall have to add some axioms and rules. For instance we want some information as to which classes are sets, and conditions when a member of a class satisfies the defining statement of that class, i.e. for which ^ ^ - s t a t e m e n t s <j) do we have: Bp7/£
(X=Y)
for
(Au)B{ueX) (ueY), and (aeyff)
for N(aej3).
This is the extensional approach, common in mathematics. Two classes are called equal if they have exactly the same members. We add the rule: D(AU)K(XeU)(YeU)o) D(X= T)co " As axioms to tell us which classes are sets we take: Ax. 1. @2#, Ax. 2. <S{x,y}, XX.X.. O.
\£)± JU)
Ax. 4.
CUnX(SX"y,
these say that the union class of a set is a set, the pair class of two sets is a set, the power class of a set is a set and lastly the transform of a set by a one-valued relation (which may be a class) is a set. The free set variables in these axioms mean that we may only quantify with set quantifiers. We add the following rules to tell us when a class satisfies the defining statement of that class: D(xey)(o Jrv ^-1 . •——~~
~——~ .
DD(ueX)(ueY)(o D(ueXuY)oj '
DN(xey)o) xv & .
•
DND(ueX)(ueY)(o DN(ueX{jY)
172
Ch. 3 Predicate calculi
DN(ueX) o) D{ueX)a) '
D(ueX) co ' DN(ueX)co'
D(Ev)((v,u)eX)G>
DN(Ev)((v,u)eX)o)
D(ueX)u> KiO . -^rr,
r
T^z
DN(ueX)a>
i^r—.
±4
D((v,u)e(VxX))(oD«u,v)eX)(o
0.
D((v,u)eX)o) D((u,v,w}eCnv2X)o)'
R8".'
D((u,w,v)eX)
'
DN«v,u)e{VxX))o)DN((u,v)eX)(o DN«y,u)eX)a) DN«v,w,u)eX)a) DN((u,v,w}eCnv2X)o)m DN((u,w,v)eX)(o DN({u9v,wyeCnv3X)co'
30. The rules R2', ...,R9" are reversible. Take for instance rule R 4'. We have an ^-proof of D(ueX) o), we wish to find an^-proof of DN(ueX) co. In the y-proof of D(ueX)o) note the places where related occurrences of (ueX) enter the e9^-proof. These will be at R4', I l a or T.N.D., viz. DN(ueX) (ueX), in the first two cases replace (ueX) by N(ueX) and similarly for all related occurrences, in the third case add above T.N.D. DN(ueX)N(ueX). This is an ^-theorem, from R4" with N(ueX) for o). Now replace related occurrences of (ueX) by N(ueX) and we have an ^-proof of DN(ueX). Similarly for R4" and the other rules. PROP.
D67.
U{i}
for
p(Ex',...,xM)K(y=(1c))
where £ stands for x'9..., a^^ and <j> for (x',..., x^v)}. 31. / / all the bound variables in $ {t)} are set variables and ift) contains the complete list of free variables in
PROP.
and
DN
and conversely.
Prop. 31 says that we can replace the suggested general rule (*) applied to ^-statements without bound class variables by 8 particular cases
3.26 Set theory
173
R2', ...,R9". If all the bound variables in
(xex) by
(Ey){x = y)(xey).
Suppose t) is <#',...,x^} and 0{t)} is x^X)ex^\ We have: R 5', 5" (a) (z#i/> cX ~ (xy) e@X with 2; for v and <#?/> for w, R 9', 9"
<#z2/> cX ~ <#2/z> eCnv3X,
R8', 8" R 5', 5"
~
R 8', 8" R5', 5"
- <#2/> e@ Cnv2CnvsX, (xyz) eX - (zxy) eCnv2X.
(c)
- <#2/> e@Cnv2X.
From these we get: R 5', 5"
(d) <2/z'... BM> eX - <»'... ^ > e^X,
from (a) with ?/ for z, x' for a; and (xr ...x(p)) for 1/, hence by repetition:
(( /<-times
(/) (x'j/Y... *W> eX ~ <«/'a;V... «w> df 2 ?,.Z, from (6) with ?/' for a;, a;' for z and (x" ... x w ) for «/. (/') is equivalent to <x' ...xfir>)e@'gi&sX, (writing ^ 2 for Cnv2 and ^3 for Cnv3), hence by repetition: (g) <»y ... 2 3
(
/t-times
< z V y ... 2/^> 6X ~ {x'x") € ^ ^ 2 X , from (c) with x' fora;, x" for«/ and ($'... t/^) for 2.
> el Lastly
(j) (i7x') {(x'x"... z(">> eX) ~ >
174
Ch. 3 Predicate calculi
(i) (j) is atomic. Thus (j> is either #(A) ex^ or #(A) €x
If A ^ JLL we have If /£ < A we have By (A)
Hence altogether:
(x'... ^ > cX is equivalent to ( ( ( (
2
/i—A — 1- times
=X
Nowwrite
3
2
A —1-times
and
= X.
7 x ("^3 1 ••• (^3^11 x ( F x . . . ( F x « f ) ...))•••)) for X and we obtain: fi—A — 1-times
>
A —1-times
l x( (
3
| (
/t—A —1-times
Since
V x (Fx...(Vx«f)...))•••))• A —1-times
it follows from the definitions of F, x , <^2,
Thus any two ^-statements may be put into equivalent forms with exactly the same free variables. (ii)
?? as desired, (iii) ^{x} is
By induction hypothesis we have and
3.26 Set theory
175
as desired. (iv) ^{r.} is (Ey) (j)'{y, £}. By induction hypothesis we have:
We have-
Wlfr.S}
yP>
This completes the demonstration of the proposition. 3.27 Ordinals The system £? is very useful for model making. For instance we can define the natural numbers thus: 0 for A,
1 for {0}, 2 for {0, {0}}, 3 for {0, {0}, {0{0}}},...
so that Sv is defined as the set of lesser natural numbers. The first transfinite number co is then defined as the class of all natural numbers. To carry this through we have to^-prove that the natural numbers defined as above are sets, otherwise they are debarred from membership of other sets and the process breaks down. @A is easily ^-proved so are: (&(xn y), &(x u y), ©(# x y), (S&x, (&&x, (&x, etc.
and these suffice to show that the natural numbers as defined above are sets. But to show that 0) is a set so that the process can continue into the transfinite we require another axiom. We could take So itself as an extra axiom, but it is usual to take: Ax. 5.
(Ex) KN^mx(Ay) C(yex) (Ez) K(zex) (y c z).
This ensures the existence of a set containing an unending strictly increasing sequence of sets. It is called the axiom of infinity. An ordinal is defined as the class of lesser ordinals and is well-ordered
176
Ch. 3 Predicate calculi
by the membership relation #. The successor of an ordinal x is then x U {#}, and the limit of a class of ordinals X is 2X. D68 Xi^eY
for K(X* c 7 u Y U I)(AU)CKN
i.e. X is well-ordered by Y; for any two members #, x1 of X we have DD(xYx') (x' Yx) (x = x'), and every non-empty subclass of X has a least member in the ordering Y. The official definition of an ordinal is D69
OrdX
for KX1Te£{X c PX),
i.e. X is well-ordered by the membership relation and members of members of X are members of X. It is easy to show Ord 0, i.e. OrdA. On for ftOrdx. On is the class of all set ordinals. D70
X< Y
for
XeY,
X^Y
for D(X < Y) (X = Y).
We shall use a, 6, c, d, a',..., as variables for set ordinals. We have _ ,Tr// l w , x , aea, NK(aeb) (pea), etc.
Any member of an ordinal is an ordinal. In fact an ordinal is the class of all lesser ordinals. The tricotomy holds DD{a < b) (a = b) (b < a). a c: On any ordinal is a subset of and a member of any larger ordinal. On itself is an ordinal but is a proper class SPr On. Thus we are unable to form the successor of On and so the antinomy of the greatest ordinal is avoided. An ordinal is either a member of On or is On itself. D71
LimX
D72
X+l
D73
1 for 0 + 1 , 2 for
or MaxX for
Xu{X}.
1 + 1, etc.
for SX.
3.27 Ordinals
177
We have G(X c: On) OrdHX. The limit of a class of ordinals is an ordinal. We have
C(X <= &n) KOrd I>X(Aa) C(aeX) (a ^ SZ), C(X c &n)KOrd?:X(Aa)C(X
c a)(2Z ^ a),
the properties of the limit ordinal of a class of ordinals. W e haVe
(Ax) B(x + leffn) (xe&n), N(a < b < a+1).
D 74 Kz
for
&(Ea) D(x = a + 1) (x = 0), ordinals of the first kind,
D 75 Kn
for
0w — iTj,
We have
ordinals of the second kind.
C(aeKn) K(a = Sa) (a 4= 0), C(aeKj)D(a = S a + l)(a = 0).
D76 w for the members of w and the members of the members of co are all of the first kind. We have Ord OJ, SOJ and a) e Ku. o) is a set ordinal of the second kind. Members of a) are called intergers. We use i,j,i',... as variables for integers. The principle of Mathematical Induction can be obtained in the form
provided that (j) is without bound class variables. D77
(X ~ Y)
for
(Ez)KKKUn2zRelz(@z
= X)(0tz = Y).
X is similar to Y and both are sets. D78
Sq
for
&$(x ~ y).
The similarity relation for sets. Then
D 79
Fin
for
^(JJ/i) (i ~ x)
InFin
for
S(^4i) j^(i d^ x)
the class of finite sets. D 80
the class of infinite sets.
178
Ch. 3 Predicate calculi
Having defined the integers we can then define rational numbers as triplets of integers, then real numbers as Dedekind sections of rational numbers and lastly complex numbers as ordered pairs of real numbers. This is further discussed in Ch. 7, § 2 8. We are then ready to develop analysis and as explained in § 32 of this chapter we can introduce all topological concepts. An ordinal is either of the first kind or of the second kind or the ordinal is On. If X is a non-void class of ordinals then IIX is the least member of X. Thus for I c f e w e have TlXeX and £m(X{\ IIZ). 3.28 Transfinite induction The principle of transfinite induction is (Aa) C
(Aa) provided that
NimX X^On, (Ev)K{veX)£m{X(\v)9 X^On
N(Ev) K(veOn - X) Sm((0n -X)(]v) _____ ^ X^On
(Aa) C(a c X) (aeX) X = On
We also want to define functions by transfinite induction. It makes for easier reading if we use F, G, H, F',..., as variables for functions, corresponding small letters if they are sets, and B,S,T,R',... as variables for relations, corresponding small letters if they are sets. We want then to define F'a by means of the behaviour of F for arguments less than a. Now F [ a is the function F with arguments restricted to a. Hence the induction should have the form
3.28 Transfinite induction
179
where G is a previously defined function. Thus we shall have (AG) (E\F) (KF^n6n(Aa) (F'a = G\F [a)). The method of demonstrating this is to take the union of all partial solutions. Thus //
for f(Eb) (Kf^nb(Aa) C(a < b) (f'a = G'(f [ a)),
then show that 2 i / has the required property. Something like this is done in detail in Ch. 11. The addition, multiplication and exponentiation of ordinals are defined by transfinite induction thus: D81
+6
for
(
(
)( (Ea) K(v = a = 2a) (v = 2 +1'a),
D82
xb
for tiv(DDK(u = O)(v = 0)(Ea)K(u = a+l)(v
=+bx'ba)
(Ea) K(u = a = 2a) (v = 2 x b'a), D 83
expb for
uv(DDK(u = 0) (v = 1) (Ea) K(u = a + 1) (v = x b exp'ba) (Ea) K(u = a = 2a) (v = 2 e ^ ^a).
The first of these defines the function +bi the second the function x b and the last the function expb. They are all of the form F(a = Gl(F [a). Weusuallywrite (6xa) a
b
for
x^a
for exp'b a.
(a + b),(axb) and ab are all set ordinals. They satisfy some of the usual rules of addition, multiplication and exponentiation. But some rules fail, e.g. l+(o = a). It can be verified that the ordinal (a + b) is isomorphic as regards order to the order type obtained when we stick the order type b at the end of the order type a, and that the ordinal (a x b) is isomorphic as regards order to the order type (c,d),cea, deb ordered by last differences, i.e. (c,d) < (c',df) ifd < d'ovd = d'andc < c\ The ordinal ab is isomorphic to as regards order to the ordering by last differences of functions over b with values in a, but with only a bounded number of non-zero values, i.e. if/and g are two such functions then/ < g if f'd < g'd, where d is the greatest ordinal for which
180
Ch. 3 Predicate calculi
f'd =f= g'd. In fact we could have taken these properties as definitions of addition, multiplication and exponentiation of ordinals provided we have shown yTT m xWeT where
D84
RIsom(*'^\
for
KKKKUn2RRelRSJR = X01R = Y(Au,v)C(u,veX)B(uSv)(RcuTR'v), i.e. R is a (1-1 ^correspondence between X and Y such that two members of X stand in the relation S if and only if their images in Y by R stand in the relation T. Any class of ordinals is well-ordered by £, hence a decreasing sequence of ordinals terminates, otherwise the sequence would be without first member and so would violate the condition of being well-ordered. 3.29 Cardinals A cardinal number is frequently defined as the class of classes similar to a given class. We shall define a cardinal as an ordinal which is dissimilar to any lesser ordinal. This is less general than the usual definition because there may be classes dissimilar to any ordinal. D85
f
for
t(Aa) (C{a ~ x) (bea),
then f is the least ordinal which is similar to the set x. f is called an initial ordinal or the cardinal integer of the set x in case x is finite. The class of ordinals similar to a given ordinal is non-void, because a is similar to itself. Hence the class of ordinals similar to a given ordinal exists and being a class of ordinals has a least member U. But if x is any set then set theory as we are developing it may turn out to be so poor in modes of expression that the (1-1)-correspondence required to show that x is similar to an ordinal may be missing. Thus the concept of cardinal is relative, that is relative to the set theory used. We have U = ti. We divide ordinals into classes, the members of one class being similar to each other, except that the first class is to consist of all the integers. The ordinals of the second class are similar to co, they are the denumerable
3.29 Cardinals
181
ordinals. The least member of class III is denoted by ti, it is the least nondenumerable ordinal. Q for 26(6 ~ o)). D 86 Note that this use of the word' class' is distinct from ' class' as opposed to 'set 5 . D87 JT for &(Eb)(a = $), JV* is the class of integers and initial ordinals. D88
JT'
for
JV-G),
J/*' is the class of initial ordinals. Jf' being a class of ordinals is wellordered, hence there is an isomorphism between the initial ordinals and the ordinals. Let X be this isomorphism. D89 G)a or Ka for K'a. Then GJ0 = G) = No, fi = K1? etc. These cardinals are called alephs. We can define addition, multiplication and exponentiation for cardinals. If otj9 je J is a class of cardinals then 2 % is the cardinal of the union of classes of cardinals otj for jeJ. In forming the union we require that the representative classes be distinct. This is achieved by using ordered pairs {a,j}, aeAjSOCj. Then the ordered pairs are distinct for different jeJ. If the cardinals are initial ordinals & then 5 itself is a representative class of that cardinal. The cardinal of the product of the class of cardinals ocj,jeJ is the cardinal of the class of functions / over J such that/'Jea^-. This amounts to picking out one member from each otj and doing this in all possible ways. This raises the question as to whether there is any such function at all. The statement of the existence of such a function is known as the Multiplicative Axiom, Axiom of Choice (A.C.) or Zermelo's Axiom, If it failed then the product of an unbounded set of cardinals would be zero. The axiom is: Ax. 6.
(EF) (Ax) KF^n VBSmx(Fixex\
This is a very strong form of the axiom of choice because it allows for the simultaneous choice from each set of an element of that set. The axiom of choice occurs frequently in mathematics, sometimes it is possible to avoid it by a more elaborate proof. At the end of Ch. 12 we sketch a demonstration of the independence of the axiom of choice from the other axioms of set theory.
182
Ch. 3 Predicate calculi
Using the axiom of choice we can show that every set can be wellordered, conversely if every set can be well-ordered then A.C. (Ax) (EaJ)KK(f^na) Un2f{x = /"a), so that f'b for b < a well-orders x. By transfinite induction we define a function G so that G^nOn and (Aa) (G'a = F'(x-@(G [a))), where F is the function postulated in Ax. 6. Then G'O = F(x, the member chosen from x by F, G'l = F\x — {G'O}), the member chosen from x-{GiO) by F, etc. Then G'b for b < a well-orders x. If we use Ax. 6 then every set can be well-ordered, hence every set is similar to an ordinal and so all cardinals are alephs, and hence the tricotamy will hold for cardinals. But without the axiom of choice there may be cardinals without any order relationship with any aleph. The exponentiation of cardinals is defined by: a^ is the cardinal of the class of functions over /? with values in a. Thus 2so is the cardinal of the real numbers. The equation 2**o = Nx is known as the Continuum Hypothesis (C.H.). It is now known to be independent of the other axioms of set theory and a brief sketch of this is given at the end of Ch. 12. It can be shown that the sum, product and exponent of alephs is an aleph. The equation 2Na = XSoc is known as the Generalized Continuum Hypothesis (G.C.H.). Again it is now known to be independent of the other axioms of set theory and a brief sketch of this is given at the end of Ch. 12. Many statements about cardinals are now known to be independent of the axioms of set theory. But there are some important theorems about cardinals. D90 <x
We shall show £ < Px. Each member y of x gives rise to a subset {y} otx, hence x can be put into (1-1 ^correspondence with a proper subset of Px, thus f ^ Px. Note that x and Px are sets. Suppose that % = Px, then there is a (1-1)-correspondence between x and Px. Let o-{y} be the correlate of y by this correspondence. Let a be the class of members of y such that y1cr{y\. Let a be the correlate of z so thata = cr{z}. If 2;ea then 2:ecr{2:} by definition of a and Prop. 31, but cr{2;} = a
3.29 Cardinals
183
andsozea. Again if zea then by definition of a ze(x{z} and Prop. 31,i.e. zeoc. We have an absurdity in either case, hence X #= Px. 33. If a ~ /?' and ft c ft and J3 ~ a' and a,' c a, then a ~ /?. Let / map a (1-1) onto /?' <= y? and gr map /? (1-1) onto a' c: a. We can clearly assume that oc(]j3 = A. Now (oc\Jj3) is the disjoint union of sequences PROP.
cr':(b,g%rg'b,...) (be/S),
LEMMA
(i). For any set a there is a well-ordered set w, such that w ^ P4a
and w ^ U.
Consider the class w of well-orderings of a and subsets of a. A well-ordering of a is a class of ordered pairs hence is in P 3 a. Thus w c= P^cc.w is isomorphic to a class of ordinals and so is well-ordered and is isomorphic to an ordinal. If w ^fiethen w would be order isomorphic to a well-ordered subset of a and so w would be order isomorphic to a proper subset of itself, which is impossible. Hence lemma (i). For the moment we assume that 2 P a = Pa. LEMMA
(ii). If y and 8 are disjoint sets such that y u S = P(2y) then
$>Py. 2y denotes the union of two disjoint copies of y, say yx and y2 If/maps y U S onto P{yx U y2) ^ Pji x Py2, then the image of y projected into
184
Ch. 3 Predicate calculi
Pyx is only a proper part of Pyv since y1 < Pyv and hence if £ is outside the projection, / must map some subset of S onto £ x Py2, which means that 8 ^ Py. Whence lemma (ii). Now we have P 3 a ^ w + Psa ^ P 4 a + P 3 a ^ P 4 a by the assumption. Thus by G.C.H. either: w + Pzoc = P 4 a
or
Consider the first case. w + Pzoc = P 4 a = P(2P 3 a), by the assumption. We have, by lemma (ii), w ^ P 4 a, but w ^ P^oc, hence w = P±oc. Thus there is a (1-1 ^correspondence between P^oc and w, thus jF^a is wellordered, but a can be embedded in P^a, hence a can be well-ordered, and we are done. In the other case we have w + P>a = P,a then we have w ^ P 3 a, hence w = P 3 a, and we are done, as before, or w < P3oc, and whence by G.C.H. w < P 2 a, but then ^ = P 2 a, and we are done as before, or W < P x a, now by lemma (i) w ^ a, whence by G.C.H. P = P x a, and we are done as before. It remains to show 2Pioc = P^, for 0 < i < 4. If we put /? = P(a U &> where af]o) = A, then easily 2iy? = Pip, for 0 < i < 4. Let y be new. 2/? = P(au^u{y}) = P(a U w) = /?. Also yff^ y^uy ^ 2/?, so /?Uy= Now 2.2/* = 2/*uW = 20, and similarly 2P^/? = P ^ , for all i. Hence our argument can be applied to /?, and so ft can be well-ordered. But a can be embedded in /? in a natural manner, hence a can be well-ordered. Thus Prop. 33 is demonstrated. 3.30 Elimination of the e-symbol The choice effected by A.C. can also be effected by the e-symbol, thus (Ax)CN£>mx(ey(yex)ex), then tivCNdfrnviu = ey(yev)), is the required function that picks out a member from each set. We could also have the rule. C
provided that
3.30 Elimination of the e-symbol
185
The system Sf with the e-symbol and rule C is called the system CSP. We could also, as in D 24, define the existential quantifier in terms of the e-symbol. The system SP with rule II d replaced by rule E and rule II e' replaced by rule E' is called the system ESf. We could then dispense with rule E' in favour of a rule of substitution. The system CE£f is the system ESf plus rule C. 35. If an £f-statement is a CSf-theroem then it is an £f-theorem. We shall show that if the e-symbol is used in a C^-proof of an Sfstatement ^ then it can be eliminated leaving an e-free proof of ^ , which is thus an ^-proof. Thus (J) is an ^-theorem. This proposition says that if CSP is inconsistent with respect to negation then so is Sf. For if we can C^-prove the ^-statements
this, consider rule C, say, _ ,,
,}J^
, and consider the places where
related occurrences of (EE) <j){£) entered the C^-proof, these will be of the form
n/grrx^/m /> w n e r e 0'{£} i s
a
variant of ^{g} by I(ii), replace
the lower formula by D^'{e^'{^}} GJ', and similarly for all descendents. Entrance by II a can be replaced by entrance by II d, the special ^-rules fail to introduce D(E£) $${£} OJ. The original Cc^-proof-tree becomes an E£fproof-tree of the original statement, because these related occurrences of (E£)(j){Q all occur in the subsidiary formulae, except I(ii), from their introduction to the application of the C-rule under discussion. Thus we require to eliminate the e-symbol from an E^-proof of an ^-statement, free from the e-symbol. We next replace the special ^rules by axioms, thus: if _ ,
is an
Dyrco
^-rule replace it by CD^o)Dj/rco, we recover the rule by Modus Ponens, which can be eliminated from a theory in free disjunctive form. To make use of this result we replace rule R1 by its free variable form viz. DK(XeU)(YeU)a>. ,,, .,.r, ,TT,., —Y^i— because by the reversibility of lie if we can ^-prove n v
186
Ch. 3 Predicate calculi
the upper formula of R1 then we can ^-prove the upper formula of its free variable form. We similarly replace R 5 ' and R5" by D((oc,u)eX)oj —^7—'v.
and
D((v9u)eX)o) " \'v,
,. . respectively.
This assures the reversibility of l i e ' because A can then only be introduced at l i e ' and so the demonstration of reversibility of l i e ' goes through as before, The axioms can be put into resolved form, viz.: Ax. 1. (LxeiXx}), Ax. 2. ({x,y}e{{x,y}}), Ax. 3. (Pxe{Px}) Ax. 4. CCK{(u, v)eX) ((w, v)eX) B{yeu) {yew) (X"ye{X"y}). From these the original axioms may be recovered. The system SP can now be put into free disjunctive form, so that Modus Ponens can be eliminated. If we retain the axiom of infinity then we replace it by: (£2e{Q}), (Aefi), C{ueQ)({u}e£l), then Q, contains the unending set: {A}, {{A}}, {{{A}}},.... Call the resulting system Sf\ then Modus Ponens can be eliminated from Sf'. Let ^ be an E^-theorem then 0 is an ESP'theorem, let x be the closure of the ^'-axioms used in the ES^'-"proof of <j), then by the Deduction Theorem G^r^> is an iJ^-theorem. From its .EJ^-proof we wish to eliminate the e-symbol. By hypothesis the e-symbol is absent from <j> also the ^'-axioms are without the e-symbol, hence Cx
,*
is an E-rule used in the l£^'-proof of 6,
where \jr is e-free and closed; in this replace the e-term e^{Q by a, we are left with a tree with (f> at its base because this is €-free. The above E-rule becomes a repetition, all applications of other rules are preserved except other applications of E-rules, which by supposition belong& to the same e-term. For instance -^ , , ,1^—, E becomes Z)^{e^{£}}6/ n
,\{
, which fails to be a case of any rule. But if we add ^{a} as an extra
axiom we can obtain Di/r{x} o)f by II a, each case of rule E in the j&^'-proof of $ can be replaced by a case of II a; hence we obtain an ^-deduction of (j) from the hypothesis ^{a}. Similarly we get an ^-deduction of
3.30 Elimination of the e-symbol
187
for all d for which i/r{a{6)} occurs as the main upper formula of an E-rule. Since we are supposing that all E-rules belong to the same e-term then ^r is the same in each case. Thus we get an ^-deduction of (j) from the hypotheses ^{a'}, ifr{oc"}, ...,i/r{oc(p)}, where this is the complete list of main formulae in the upper formulae of E-rules used in the V
proof of (j). Hence we obtain an J^-deduction of $ from 2 ^{#(7!r n= l
On the other hand we have Dft{aP>}at*> -i.V_L. _L •
)
TT
But Modus Ponens can be eliminated, and so we get an .^-deduction of (f> from the hypotheses Ni/r{ocf}, Ni/r{ot"},..., Ni/rfaM}, where we have the same set of a',..., a^ as before. Thus by the Deduction Theorem we obtain the ^-theorems CM<x'}$,...,CMoiM}$, and ON 2 {<*<«>}0. Whence 0= 1 v
d)
(e)
C 2 ^{o^ }(j> and CNHi/r{x }^) are J^-theorems and so by Modus Ponens 0=1
is
188
Ch. 3 Predicate calculi
in the €-term j3 then the replacement of oc by another e-term lacking the binding variable of fi has no effect on the rank of/?. And if a is an e-term occurring in the e-term /? and containing the binding variable of /? then a is of lower rank than /?. Also the rank of an e-term is unaltered when we change a free variable to a new variable. We propose to eliminate the e-terms by replacing e-terms by other terms as we did in the simple case when all E-rules belonged to the same e-term, thus in the E-rule
., *. j ^
we replace the e-term e^t/r{Q
wherever it occurs by a, then the above application of the E-rule becomes a repetition; the application of the E-rule ^ , c , i>>^—; becomes Dj/r{e^{Q}ojf -r^. \ ^I0) , which fails to be an application of any rule if (a 4= fi), but we can obtain the lower formula from the hypothesis i/r{(x}. The subsitution of a for €^{£} may alter the end formula
, f* —^,
1 < 0 ^ K. By the
Di/r{ei/r{^}}(i)^)3
J
deduction theorem and &c this would give us: C 2 ft{ote)} 2 ^=1
6=1
without using the E-rule belonging to the e-term e^{^}. On the other hand we have by Modus Ponens
but Modus Ponens can be eliminated. Thus if we introduce Ni/r{oc'}, Ni/r{a"},..., Ni/r{a^K)} as hypotheses we obtain an iJ^-deduction of
the e-term e^{^}.
of C Yl Ni/r{oL(e)}
This gives us an E^c--proof
6= 1
C 2 ifr{ot^} 2 ^{cii^} and 6=1
CNYii]f{a!&)} ^{ci}
6=1 K
we obtain by Modus Ponens 2
can be eliminated so we obtain an JS/J^-proof of 2 ^{cti^}. The reason for
3.30 Elimination of the e-symbol
189
replacing rule II d by rule E is to avoid the existential quantifier binding an e-term, thus:
will fail to occur instead, we shall have in the lower formula
which has an e-term of higher rank. But if there are other applications of the E-rule in the jE/J^-proof of 0{ct} then the substitution of af® for €^{^} may destroy them. The following cases can arise for the E-rule
(i) eMB occurs at most only in fi, the E-rule is then 1 this becomes _ } *• / {. which is the E-rule.
Dfai}} y (ii) €j^{^} occurs in x{v} a n d possibly in fi as well the E-rule is then DM*},*}* this becomes w h i c h is t h e E . r u l D a a x{*x{v h ) ^ here y stands for (iii) One or both of/?, # where S = €£#{£} is contained in y = (so that in the first case y is of the form y'{/?}), and x{v} * the form X'{y'{V}}; the E-rule is then comes •=- /f
/r
^./ / r ^ , ^
D
^
{
^ ^ ^
this } } } a >
be
"
which fails to be an E-rule, f is new to
Dx{?M{v{Q)}}<* X'{y'{^i}}' Similarly if y is of the form y'{8}, and x(v)^s of the form ^'{y'{9/}}, the E-rule is XxrxPss
^ j c h f a ii s to be an E-rule. Similarly if y is of the form
y'{(3,8} and x is of one of the forms x'{Y{y, *}}5 X'{7'{P> y}}> X'fy'fy* V}}>etc-
In all these cases the e-term ^x{j'{Q} ^ °f higher rank than the term y'{/?} for which a substitution is being made.
190
Ch. 3 Predicate calculi
(iv) The only remaining case is when one or both of /?, S are contained in y so that y = y'{/?}, etc., but the variable 7/ in x{v} fau"s to occur in a part y'{ij} of x{v}- I*1 this case the E-rule remains correct as can be seen from case (iii) by omitting y' in xij'iv)} m the lower formulae. In case (ii) the e-term belonging to the E-rule is of higher order than y but is of the same rank as y, but in case (iii) it is of higher rank than y. Thus if we make the substitution for an €-term of highest rank and among those of highest rank we choose one of highest order then cases (ii), (iii) will fail to arise; we can proceed as in the first case where all the E-rules belonged to the same €-term. The result of the elimination of one K
€-term is an UJ^-proof of a disjunction 2 ^ {ct(^}. A second application 0=0
will produce a similar disjunction of these disjunctions which is merely a longer disjunction of the same kind. Finally we can eliminate all the V
E-rules and are left with an ^-proof of a disjunction 2 <j>{oSe)}. If there e=i
are any €-terms left in this disjunction then replace them all with the same new free variable, the result is an J^-proof of a free variable disjunction which is €-free. This is possible because all substitutions had been pushed back into the axioms before we started so that an e-term V
in the J^-proof of the disjunction 2
into the J^-proof at an axiom and if it is replaced by a new variable axioms remain axioms and rules are preserved. From the free variable disjunction we easily obtain (E%) ^{j}. This completes the case when $ is without universal quantifiers. We may suppose that (f> is in Skolem F-normal form, i.e. is of the form: (E$) (At)) fr{£,t)} where the matrix ifr{%,t)} is quantifier-free. Since
3.30 Elimination of the e-symbol
191
hypotheses dc into an J^-deduction of (j) from hypotheses X. Thus a C^-proof of (j) can be converted into an ^-proof of
C{E$ (At)) <%, t)} (Ed flj, fj}.
Thus from the iJJ^-proof of (E%)(Ati))
a'9a",...,a«>
are sequences of terms of type t, €-free but whichmay contain/',/", ...,/(7r). Moreover the ^ c -proof of this disjunction is a free variable ^ c -proof free from the substitution rule, because all substitutions had been pushed K
back into the axioms before we began. If in 2
proof we replace each atomic formula by a propositional variable, distinct formulae by distinct propositional variables, identical formulae by the same propositional variable, we are left with a ^-proof. Thus K
2
mulae of a theory for propositional variables. The same will hold if we replace the terms f^a(fl) by new free individual variables using distinct variables for distinct terms, the same variable for different occurrences of the same term. Consider the term / ^ a ( ^ and the number of distinct occurrences of /',/", ...,/(7r) which are contained in it. Call this number the complexity of f(d)a{fl). Then the complexity of/'a ( ^,... ,/(7r)a(^ are all the same. We associate this complexity number with K
Now arrange the disjunctive terms of 2 (f>{a(e\ \a{6)} in order of increasing 0=0
192
Ch. 3 Predicate calculi
complexity from left to right. We suppose that duplicates have been omitted from the disjunction. Then f(6)a(/l) is distinct from f^aW if and only if 6 4= d' or JJL =)= /JL'. Also f^a(fl) can only occur in a(/O if [i < /if. Now replace /'a', ...,/ (7 %',/a", ...,f(n)a(lc) by new individual variables £',..., £<™>, so that /%<> is replaced by £«/-«*+«>, then £ 0{a<*>, fa<*>} 0=0
becomes: 2 ^{b^, ^'
7r+1)
, ...,^'
7r+7r)
}. In this disjunction the variables
0=0
, ...,Qv'n) are absent from the first (v—1) disjunctions because can only be a member of the sequence a(A) or be contained in a member of the sequence a(A) when JLC < A, also f^ai/l) fails to occur in or be any member of the sequence a^\ Now we can generalize the variables QK~1)rr+1, ...,£,K'n then apply lid repeatedly to the terms in b(/c) this converts the last disjunctand to (E%) (At))^^,^}. We can proceed similarly with each disjunctand obtaining a disjunction of K disjunctions all being (EAjc)(At))
3.31 Complete Boolean Algebras In some Boolean Algebras any subset has an l.u.b. that is, if 9£ is a subset of a Boolean Algebra 88 then there is an element a of 88 such that /? ^ a for each element /? of 9C, and if /? ^ y for each element ft of 9£ then a < 7. We denote a by l.u.b. 9£. Similarly a g.l.b. can exist; in any case if the l.u.b. exists then we call the Boolean Algebra complete. PROP.
36. If 88 is a complete Boolean Algebra then l.u.b. & = g.l.b.3?,
where 2£ is the set of complements of members of SC. We sometimes write U at for l.u.b. {aj, and similarly f|
PROP.
iel
35. l.u.b. l.u.b. a^ = l.u.b. aij9 iel
jej
iel,jej
«nu A = UMA), iel iel
iel
A
iel
iel
iel
3.31 Complete Boolean Algebras
Uai
=
0 iff
cx>i = 0 for
193
iel,
iel
fl oci = 1 iff
a* = 1 for
ie/.
The V is the membership symbol. A Boolean Algebra is said to satisfy the countable chain condition if every disjoint set of non-zero elements is countable. Two elements of a Boolean Algebra are said to be disjoint if their intersection is zero. The l.u.b. acts like the union of an unbounded set and the g.l.b. acts like their intersection. These correspond to the Existential and Universal Quantifiers respectively. Distributive Laws In some complete Boolean Algebras there are extensions to the distributive laws corresponding to unbounded sets. Thus: PI U ay = U n aiT(i) and ieljej
re J1 iel
(J D % = fl U ocT(j)j. jej
iel
lfjj
Here J1 denotes the set of functions with domain / and range in J. But these laws can fail in some complete Boolean Algebras. 3.32 Truth-definitions for set theory A truth-definition for a formal system can be given by formula induction. First a truth-definition is given for closed atomic statements, then the truth-definition for closed compound statements are obtained in the usual manner by truth tables, if we are seeking a standard two-valued truth-definition. If closed atomic formulae are lacking as in ^Q then we usually give a definition of validity. In set theory the closed atomic statements are of the form a closed statement of this form will be true if and only if this in turn will be true if and only if t{x<j){x}} & ^ { a ? } } & (Ey) (x
for some x> but this will be true if and only if which is what we had before, so we must abandon this method.
194
Ch. 3 Predicate calculi
Another way of giving a truth-definition is to construct a model. By this we mean a class of elements V and truth-values for all atomic statements of the form (aeb), where a and b are two elements of F. But we can easily be more general because we can take the truth-values to be members of a Boolean Algebra ^ . Let then ||ae6|| and \\a = b\\ be the members of the Boolean Algebra associated with the statements (aeb) and (a = b) respectively. From the Boolean values of these statements we can find the Boolean values of compound statements thus:
= ¥\l aeV
from these we obtain
\\D
U
||##
aeV
We write \=$ for ||^j| = 1, and h^ for §1 is an ^-theorem. We want to show that, if h^ then |= <j>, i.e. all ^-theorems take the Boolean value 1. First of all we easily show that f= DNcjxj), and, if \= i/r then = |
It is impracticable to introduce all the members of V at once. We proceed by a transfinite process. We start with Vo = A. The elements to be added to Va to produce Va+1 will be functions over Va with values in 38. These will correspond to new ' sets'. Thus/{#} defined over Va with values in £8 will be the 'characteristic function' of a new set. Clearly Vx = {A}. If a is a limit ordinal Va= \J ¥#. p<*
The step from Va to Va+1 is defined as follows: we assume that the
3.32 Truth-definitions for set theory
195
members of Va have been defined and that Vp ^ Vy for fi < y ^ a and that for a, b e Va we have
\\aeb\\=JJJ\K(xeb)(a = x)\\,
(1)
||a = 6||= n \\C(xea)(xeb)\\n fl \\C(xeb)(xea)\\, II
II
' ' I I
V
/ V
xeBa
' II
' '
II
*
' *
(2)
' II '
^ '
xe&b
and that for a, b, c, eVa we have^ a = a,
(3)
(a = b)(b = c)(a = c),
(5)
(a = 6) (bee) (aec),
(6)
eb) (b = c) (aec),
(7)
and we also assume that every member of Va is a function whose values are in the Boolean Algebra 3$. aeVfi+1, aeVp, fi < ot then
if if
aeT^,a;e^a then x,ye@a then
2a = Vfi9
(8)
\\xea\\ = a{o;},
a{#}n |a? = y\\ ^ «{i/},
(9) (10)
a function which satisfies (10) is called extensional. We first put into Va+1 every member of Va. Next we generate each function / from Va to 0$. As @f — Va for each new /, the value of \\x = y\\ has been determined for each x,ye3>f. Hence for each x,ye@f the value of/{#}n ||» = y|| is determined, we discard all functions/for which this value is ^ f{y}. Thus we restrict our choice to extensional functions. For xeVa we define \xea\ to be a{x] for each new a. We now define ||a = 6|| by (2). If a, beVa this duplicates a known result. If aeVa, beVa+1, beVa, this is an acceptable definition, since Q)a c Va 3)b = Va, hence ||#ea|| is determined by (1) and ||#e&|| by b{x}, so ||G(a;ea)(aje6)|| and ||C(a;e6)(icea)|| are both determined. Similarly in the other cases. So (2) holds for Va+V Now if ae@b, beVa+1, beVa we define: ||ae6|| for U \\K(xeb)(a = x)\\ xe9b
this is an acceptable definition since for each xe3)b \K(xeb) {a = x)\\ is determined by (9) and (2). 7-2
196
Ch. 3 Predicate calculi
We have to show that (1) holds in F a + 1 . The only case to consider is beVa+1, beVa, aeSb. In this case S)b = Va, so that aeVa, then we have \\aeb\\ = \\K(aeb)(a = a)\\ ^ U \\K{xeb){a = x)\\.
(11)
xe9b
Now take xeQ)b = Va. Then by (9) \\K(xeb)(a = x)\\ < b{a] = \\aeb\\, thus
U \\K(xeb)(a = x)\\ ^ \\aeb\\, xe9b
then by (11) we infer (1) for F a + 1 . It remains to check that (3)-(7) inclusive hold in Va+1. By (2) we conclude (3), since \\C(xea) (xea)\\ = 1. Also (2) gives (4). LEMMA
We have thus whence
(i). IfxeSb then \\xeb\\ n ||6 = c\\ ^ \\xec\\. ^ ^ ^ ^C(pceh) ( ^ c ) | | = ^ n n ^xec^ ? ||ae&|| n \\C(xeb) (xec)\\ ^ \\xec\\, ||a;e6||n 0 ||O(#e&)(#ec)|| ^ ||a;ec|| if xeQ)b. xe2b
By (2) the lemma follows. (ii). If a, beVa and ceVa+1 then (7) holds. If ceVa then by assumption (7) holds. Thus suppose that ceVa. Then LEMMA
2c = Va. By (8) and Va c Va+1 we have Sib c Sc. Then by lemma (i) \\K{xeb) {a = x)\\ n \\b = c\\ < \\K{xec) {a = x)\\9
whence using (1) ||ae6|| n ||6 = c\\ ^ U K\\(xec) (a = x)\\ xe@c
= \\aec\\ by (1), which is (7). We now verify (5) for oc + 1. Let xeQ)a. Then, by lemma (i) we have ||a?ea|| n ||a = 6|| ^ \xeb\. So \\K(a = 6) (6 = o)H n ||ajea|| ^ ||aje6|| n ||6 = c\\. Case (i) beVa, by lemma (i) ||ge&||n||& = c|| ^ ||^ec||.
(12) (13)
3.32 Truth-definitions for set theory
197
Case (ii) beVa. Then 3)b = Va9 so that xe3)b. Then by lemma (i) we again have (13). Therefore (13) holds in either case. By (12), (13) we obtain = c)\\n\\xea\\ < \\xec\\. So
\\K(a = 6) (6 = c)\\ U \\xea\\ ^ ||a;ea|| -> \\xec\\
and then
\\K(a = 6) (6 = c)\\ ^ \\G(xea) (xec)\\
whence
\\K(a = b)(b = c)\\ < fl ||C(sea) (n;ec)||.
(14)
One can start with xe@c and go through a similar argument to obtain \\K(c = 6) (6 = a)|| ^ n ||0(sec) (sea)||.
(15)
xeQic
By (4), (2), (14) and (15) \\K(a = 6) (6 = c)|| ^ ||a = c||, which gives (5). We next verify (6) for a + 1. Let xeQJc, By (5) we have \\K(a = b)(b = x)\\ <\\a = x\\. Therefore \\a = b\\ n \\K(xec) (b = x)\\ < \\K(xec) (a = «)||, summing both sides over xe3)c and using (1) we get (6). Finally we verify (7) for oc + 1. Let xeQsb. By lemma (i) ||a;e6||n ||6 = c\\ ^ \\xec\\. Therefore Then by (6)
\\K(xeb) (a = x)\\ n ||6 = c\\ ^ \\K(a = x) (xec)\\. \\K{xeb) {a = x)\\ n ||6 = c\\ ^ \\aec\\.
Sum on the left over xeS)b and we obtain (7) by (1). We have now obtained a universe V and a Boolean value for each atomic statement (aeb), where a,beV. So we have given a generalized truth-definition for set theory. Now write ^ ^ _ 1; for h (j> for
fi is an 5^-theorem.
It is possible to ^-prove that if h^ then t=^, i.e. all ^-theorems take the value 1. Thus C becomes a model for SP. The proof must take place in some system which can deal with ordinals because they are essential to the construction of V. The system £P is such a system.
198
Ch. 3 Predicate calculi
An interesting application of this method for denning truth for set theory is that by suitable choice of the Boolean Algebra 0&, we can show that A.C., G.C.H., etc. take values different from 1 and hence are non-theorems of SP. Thus it is impossible to ^-prove them. On the other hand we can find another type of model, namely an 'inner model9, constructed as follows: we start with the null set and by a process of transfinite induction we define all the sets which can be obtained from it by repeatedly performing the operations allowed for the construction of new sets from old ones, e.g. union, complementation, etc., In this way it seems clear that all sets are given an ordinal and so V is well-ordered, so that A.C. holds. It can also be shown that G.C.H. holds in this model. All this can be done in the system SP. Thus A.C. and G.C.H. are consistent with set theory if this is itself consistent. Altogether A.C. and G.C.H. are independent of the other axioms of set theory. The full details are lengthy. 3.33 Predicative and impredicative properties In a second order predicate calculus we can have bound predicate variables and hence we can form properties Ax
property variables of the first order,
X2, X'2,...
property variables of the second order, etc.
The order of a property is the greatest of (i) the order of a free property variable in it, (ii) the successor of the order of a bound property variable in it. Then \x
r>/A/A\—
*' ^ e
revers
ibility of
3.33 Predicative and impredicative properties
199
lie', here A is to be a property of order the same or less than that of the variable X. We obtain this result from l i e ' by everywhere replacing related occurrences of X in the part of the tree above the upper formula of l i e ' by the property A which must be of the same or less order as X, A property is called predicative if it fails to be defined in terms of itself, otherwise it is called impredicative. Thus in ^^ we only have predicative properties. If we try to give a definition of validity to ^l?\ which contains impredicative properties, then we get into trouble because we should require (AX) (/>{X} to be valid if and only if ^{A} is valid for all properties A, but one of these is 3£
D91 Topx for (Au,v) (u, vex.-^.un vex) & (Ay) (y <= x-^Hyex); read lx is a topology'. D 92 xTopy for Topx & y — 2#; read 'x is a topology for y\ D93 xOpy for Topx &y ex; read ' y is an open set in the topology x\
D94 xCly for
Topx&Op(x-y);
read 'y is a closed set in the topology x\ D 95 TopSpx for (Ey) (Topy & x = Sy); read 6x is a topological space'. D 96 IndisTopx for (Ey) (x = {y, 0}); read 'a; is an indiscrete topology'. D 97 DisTopx for (Ey) (x = Py)\ read (x is a discrete topology'.
200
Ch. 3 Predicate calculi
D98 uTopxNeighy for (Ev) (v ^ u&yev &xOpv) &u c x\ read 'u is a neighbourhood of?/ in the topology x\ D 99 yTopxLimz for 2 c= a; & Topx & (^4w) (uTopxNeighy -> (i£#) (# e w n z & # 4= ^/)); read '?/ is a limit point of the subset z of the topological space x\ D100 Z*TOPX for $(yTopxLimz) [) z; read 'the closure of the subset z of the topological space x\ D101 IntzTopx for y(zTopxNeighy); read 'the interior of the subset 2 of the topological space x\ D102 BdyzTopx for z* T o ^ n ( # - z ) * T o ^ ; read 'the boundary of the subset z of the topological space x\ D 103 zBaseTopx for z ^ x & (^y, u)(yex& uTopxNeighy ->(Ev)(yev&vez&y c ^)); read '2 is a base for the topology a?'. D 104 Se^a; for Topx & (#z, ti;) (zBaseTopx & w"z = w); read ' x is separable', x is separable if a; has a countable base. D105 zDenseTopx for z* To ^ = X; read'2 is dense in the topological space x\ D106 uGovw for w ^ Su; read '^ covers w\ D 107 Sep [y, z] Topx for y^opx n s = A & z*T°vx n y = A; read '^/ and z are separated in the topology x\ D 108 GonnTopx for .4 (y, z) (a? = y u z &fifep[y,z] T o ^ . -».y = A v z = A); read 'a; is a connected topological space'. D 109 HausSpx for Topo; & (^4^, v) (^, v e 2# & ^ 4= v. -> (.E/^, ^') (^n wr = A & wTopxNeighu & w'TopxNeighv)); read '# is a Hausdorff space'. (,4y) (fe) (?/(7ora & z(7ora &z^y& (Ew) (w"z c
Compx for
and so on. If we are dealing with a single topology for a topological space
3.34 Topology
201
then we can omit Topx in the above definitions. The whole of general topology can now be formalized without undue difficulty. H I S T O R I C A L REMARKS TO C H A P T E R 3
Predicate calculi differ from propositional calculi by the adjunction of quantifiers, whose intended meaning always has something to do with the cardinal number of things which satisfy a certain statement. Quantifiers were first introduced by Frege (1879). Somewhat later and independently quantifiers were used by Pierce who introduced the term 'quantifier'. Thereafter their use becomes general, though the notation for them varies. The various orders of predicate calculi is due to Russells' theory of types and perhaps to Frege's Stufen and Schroder's Mannigfaltigkeiten. Lowenheim and Skolem in effect gave a treatment of the first order predicate calculus with equality. But the first explicit formulation of the classical predicate calculus of the first order as a formal system in its own right is in the first edition of the book by Hilbert and Ackermann (1928). Thereafter it was much studied as a formal system. Many-sorted predicate calculi were discussed by Schmidt (1938) and Wang (1952). Models by Kemeny (1949) among others. Predicative and impredicative predicate calculi arises from the unqualified use of the concept of' all'. Russell's (1906) P.M. vol. 1, Ch. II, vicious-circle principle, designed to avoid paradoxes, was 'no totality can contain members defined in terms of that totality'. The term 'impredicative is due to Poincare (1905) who condemned impredicative definitions, as did Weyl (1918). In developing the classical predicate calculus of the first order we again use the direct formulation due to Gentzen (1934) and further studied by Schiitte, (1950-1,1960). Prop. 4, the elimination of M.P., is Gentzen's Hauptsatz, the demonstration we give is due to Lorenzen (1951). The prenex normal form is due to Skolem who also found other normal forms. Props. 9,10 and 11 are due to Herbrand (1930) as is the discussion on -ff-disjunctions, hence their name. Much has been contributed to the concepts of validity and satisfaction by Tarski (1933). Prop. 12, the completeness theorem of the classical predicate calculus of the first order is due to Herbrand (1930), Godel (1930), Lowenheim (1915) and Skolem (1920). Cor (ii) is due to Lowenheim (1915). Prop. 13, the independence of the axioms and rules of the classical predicate calculus of the first order was first considered by Godel
202
Ch. 3 Predicate calculi
(1930) and the consistency by Hilbert-Ackermann (1928). The discussion of theories is due to Tarski (1935-6). The classical predicate calculus with equality is implicit in the work of Pierce and Schroder, but its first treatment as a system in its own right is in Hilbert and Ackermann (1928) and again in Hilbert and Bernays (1934-6). Prop. 19, the elimination of axiom schemes, is due to Skolem (1959) who used a formulation of set theory due to Godel (1940). Early attempts at finding a decision procedure for the classical predicate calculus of the first order were unsuccessful (because as we shall see in a later chapter there is none), so research turned to finding decision procedures for special classes of statements. One of the earliest of these was by Lowenheim (1915). He gave a decision procedure for the monadic predicate calculus of the first order. This was followed by work by Skolem (1919, 1920) and Behmann (1922) on the monadic predicate calculus of the second order and of the first order with equality. Prop. 22 is due to Bernays and Schonfinkel (1928) and Prop. 23 is due to Godel (1933), Kalmar (1933) and Schiitte (1934) A detailed account of all known decision procedures for special classes of statements has been given by Ackermann (1954), see also Church (1956). A related type of problem is the reduction problem. Here we try to find special classes of statements such that any statement of the predicate calculus of the first order is equivalent as regards validity to one in the special class, there is a corresponding problem for satisfiability. A simple case is the class of statements in prenex normal form. These special classes are called reduction classes. Prop. 24 is due to Lowenheim (1915) and Godel (1933). The Skolem V- and ^-normal forms are of course due to Skolem (1925) and lemma (iv), the restriction to exactly three universal quantifiers is due to Godel (1933). Prop 25, where we have only one predicate and that one binary is due to Suranyi (1943) and Kalmar (1947) whom we follow closely. Many reduction types with a variety of prefixes have been found by Kalmar and Suranyi, and account of them is given by Suranyi (1959), see also Church (1956). The method of semantic tableau is due to Beth (1955, 1959) and Hintikka (1953, 1955) and Prop. 26 is due to Craig (1953) and Kleene (1952, 1967). The idea of a resolved predicate calculus is due to Hilbert [H-B, II], but the 'T/' symbol (written i) and its use is due to Peano (1897), Frege (1893, 1962) and (1905, 1956). The V symbol is due to Hilbert [H-B].
Historical remarks to Chapter 3
203
He demonstrated two theorems about the elimination of the e-symbol, the first is virtually Prop. 35. An historical account of the predicate calculus has been given by Hermes & Scholz (1932). Many examples are to be found in Church (1956). The system 88!FC is suggested by Kalmar & Suranyi's reduction type whose only predicate is a single binary one. Definitions D26 and 27 go back to Leibniz He took two classes to be the same if they contained exactly the same members. This is the extensional approach, and is used in classical mathematics. But one might consider two classes to be distinct, even if they contained exactly the same members on the ground that the rules for membership were different. This is called the intensional approach, we shall have to speak about it again in later chapters. The concept of set and class as defined in D 33 and 34 is due to v. Neumann (1925) who used it to avoid the syntactic paradoxes which had crept into set theory since Frege and Cantor. The definitions D 37-57 are largely from P.M. But the definition of an ordered pair has received simplification at the hands of Wiener (1912) and Kuratowski (1921). The rules R2'-9" are modifications of the axioms used by Godel (1940) in his account of set theory. Prop. 31 on normal classes is due to Godel (1940). When P.M. first appeared it was considered to have a few blemishes. One was the complicated type theory. Several writers have tried various ways of simplifying this, notably Leon Chwistek (1921, 1927) and F.P.Ramsey (1926), with his simple theory of types. Another thing thought by some to be a blemish is the axiom of infinity. Perhaps the reason for this may be compared to the reasons for considering Euclid's axiom of parallels to be a blemish on his work in geometry (first formalized by Hilbert (1922)), namely, one might think that it should follow from the other axioms. But apparently this is not the case. In fact in Godel's (1940) formulation of set theory the axioms previous to the axiom of infinity are consistent, this is seen by taking ae/3 to be always false. But with the axiom of infinity this is no longer the case, because this axiom postulates the existence of a set. There are many ways of formulating an axiom of infinity, all one requires is the existence of some set with an infinity of members. The last thing thought by some to be a blemish in P.M. is the axiom of reduction. Here this is avoided by the distinction between classes and sets. Set theory is very useful for making models of various mathematical conceptions. Thus we easily get a model for ordinal numbers. So we give
204
Ch. 3 Predicate calculi
a short account of them. Fuller accounts are given in Cantor (1895, 7), Godel (1940), Bernays-Fraenkel (1958), Sierspinski (1928, 1958), Bachmann (1955). We have indicated in Ex. 41-4 inclusive how to develop the algebra of ordinals. Ordinals and cardinals were first invented by Cantor, but finite ordinals and cardinals were known to the Greeks. Ex. 43 (xi) is known as Cantors normal form for ordinals. When we introduce ordinals we immediately require a new axiom, namely the axiom of infinity, its object is to ensure that the ordinals we define are sets, so the process of ordinal construction can proceed. We largely follow Godel's (1940) account of ordinals and cardinals. The alephs are defined as certain ordinals, but there may be other cardinals that are incompatible with the alephs, if we define cardinals as classes of similar classes. It is at this point that we come across A.C., first introduced by Zermelo (1904). Prop. 32 is due to Cantor and Prop. 33 to Cantor and Bernstein (1905). Cantor's theorem immediately gives rise to C.H. and G.C.H. For many years the logical position of these and A.C. was unknown. Then Godel (1940), by constructing the constructible universe, was able to find an 'inner model' in which A.C, G.H. and G.C.H. were all satisfied provided that set theory itself is consistent. Thus A.C, C H . and G.C.H. are consistent with set theory provided set theory is consistent. The proof of this takes place in set theory. Godel's method of inner models, as shown by Shepherdson (1951,2,3), is incapable of showing that the negations of A.C, CH. or G.C.H. are consistent with set theory. Many years were to pass before Cohen (1963, 4, 5) developed an entirely new method for dealing with independence proofs, this was based on the denumerable model found by Skolem. This was a remarkable breakthrough, the original paper was couched in such strange terms that only the most resolute of professional logicians could understand it. Now however, monographs are appearing and the method is explained so that it is available to the general mathematician. There is a monograph by Cohen (1965), Another method of doing the same things was discovered by Solovay and Scott (1970). This proceeds by forming Boolean valued models and gives a generalization of twovalued truth. This method arose because it was noticed that the main feature of Cohen's 'forcing' method was the semi-order it gave rise to. Accounts of this method are given by Rosser (1969), Scott (1966a, 6), Jensen (1967). By suitable choice of Boolean Algebra models can be found in which A.C or C.H. or G.C.H. fail. The complete Boolean Alge-
Historical remarks to Chapter 3
205
bras required came in after Boole, they are discussed by Halmos (1963) and Sikorski (1960). We just show how to set up the model. The full proof that it is a model for set theory together with the independence proofs is given by Rosser (1969). Thus with Godel's result that A.C., C.H. and G.C.H. are consistent with set theory the final result is that A.C., C.H. and G.C.H. are independent of the other axioms of set theory. The proof takes place in set theory so that the result just stated only holds if set theory is itself consistent. This is parallelled in mathematical history by Caley's proof in Euclidean geometry that the axiom of parallels is independent of the other axioms of Euclidean geometry provided these themselves are consistent. The elimination of the e-symbol is due to Hilbert and two theorems about it are given in H-B (1934-6), we give one of these because it is tantamount to the consistency of A.C. with the other axioms of set theory. That is we give an effective method of converting a contradiction in set theory plus A.C. into a contradiction in set theory itself. The chapter closes with some topological definitions, many of which occur in the latter parts of P.M., they show again how useful the system S? is for talking about all sorts of mathematical concepts. Other works on set theory are: Suppes (1960), Halmos (1960), Skolem (1962), Sierpinski (1951), Fraenkel and Bar-Hillel (1958), Fraenkel (1953) with a complete bibliography to 1953 and Fraenkel (1946). EXAMPLES 3
1. Complete the demonstration of Prop. 6. 2. Complete the demonstration of Prop. 8. 3. Put into prenex normal form (Ax) CpxC(Ax')p'x'x(Ax")p"x% B(Ex) (Ex')pxx'(Ex')
(Ex)pxxf.
4. Put the «^>-proof of the prenex normal forms of C(Ax)
CpxpfxC(Ex)px(Ex)p'x,
C(Ex)
NNpxNN(Ex)px,
BK(Ax) Cpxpfx(Ax) Cp"xp'x(Ax) CDpxp"xp'x, into normal form.
206
Ch. 3 Predicate calculi
5. Obtain H-disj unctions for (Ex) (Ax') (Ex") DDpxx'xpxx'x"Npx"xx', (Ax) (Ex') (Ex") (Ax")DDpxx'x"Npxxx"px"x'x, (Ex) (Ax') (Ex") (Ex'") DDpx"x'xNpx"'x'x"px"'x'x. 6. Obtain semantic tableau for the statements of Ex. 5. 7. Find the equivalent of (Ax, x', x") (Ey) KDpxx'p'x'yCKp"x"xp'x'yDpyx"p"xx" according to Prop. 25. 8. Find which of the following are ^-theorems: (Ax, x', x") (Ex®\ x®>, x®)
DCpxx^x^px'x"x^Cpxx^x^ px"x'x®\ (Ax, x') (Ex", x'") CCpxx"px'x"'Cpx"xpx'"x'.
9. Find which of the following are J^-theorems: (Ex) (Ax', x") (Ex"1) GKpxx'px'x"Gpx'"xpx"x', (Ex) (Ax', x") (Ex1") CCpxx'px'x"Cpx'"x"px'x. 10. Reduce (Ex) (Ax', x") (Ex"1) (Axir) CCpxx'xCpx"x"'x'Gpx"xiYxf"pxiYx'x to the form given in Prop. 24. 11. Put in resolved form, using the e-terms (Ax)(Ex')(Ax")
Show that:
for
$ft'$"(x = x'),
&e
for
SAr&"(x = x").
9t?X = MX, 2«X= VxX, S {x, x'} = x U x', X{x} = x, £"{x} = x, C(X c X') (Q)X c CK(X c X') (X" c Xm)CX«X* c X'«XXm,
Examples 3
207
13. Show that (X x X')n {X" x Z i v ) = ( I n X") x (X'n X iv ), U X') = S Z U S I ' , X = SPX, CN£mX{X' c SS(Z x X')), X c P2X,
2 2 / = V,
14. Construct an applied predicate calculus of the second order which has exactly one constant predicate R which is binary. State axioms for order and express 'every non-empty class has a least member in the ordering R\ 15. Set up axioms for a group using one ternary predicate. 16. Find the Skolem $-normal forms and the Skolem F-normal forms for the following: C(Ax)px(Ex)px9 C(Ax) Cpxp'xC(Ex)px{Ex)p'x, C(Ex) (Ex')pxx'(Ex')
(Ex)pxx'.
17. Give the demonstration of Prop. 24, lemma (iii). 18. Show that the conditions (A), (B), (C) and (D) below are necessary in order that (Ag, £', £") (Er/) ^{£, £', g", TJ} be satisfiable. There is a nonempty set S of 4 x 4 tables which satisfy <j) such that: (A) If To is a table of S, v, v\ v" = 1, 2, 3, 4 then there is a table T of S such that: [T/1] = [TO/1] and [T/123] = [T0/v', v', v"]. (B) If To is a table of S then there is a table T of S such that: [T/1] = [TO1] and
T = [T/1114].
(C) If T 1; T 2 are tables of 2 then there are tables T, T", T" of S such that:
[T/l] = [T^l],
[T/2] = [T 2 /l],
T = [T/1224],
[T'/2] = p y i ] ,
[T'/l] = p y i ] ,
T' = [T/1214],
[T"/2] = [Tj/1],
[T'/l] = [T 2 /l],
T" = [T"/1134].
208
Ch. 3 Predicate calculi
(D) If Tl9 T2, T 3 are tables of S then there is a table T of such that: [T/l] = p y i ] ,
[T/2] = [T 2 /l],
[T/3] = [T 3 /l].
19. Apply Prop. 20 to decide which of the following are J^-theorems. (i) (Ex) (Ax') GBp'xpBp'x'p, (ii) B(Ex)Gpxp'x(Ex,x')Cpxp'x, (iii) (Ax') G(Ax) CpxCpx'p'xCpC(Ax)pxp'xf. 20. Apply Prop. 22 to decide (Ax, x', x") (Ey,yf)
Cpxx'Cpyx"DKpyx'py'x'Kpxx"py'x".
21. Apply Prop. 22 to decide (Ax, x') (Ex")
KCpx"xKCpxx'pxx"Cpxx"GNpxx'Kpx"xpx'x".
22. Find the binary SFC statement equivalent to (Ex) (Ax') (Ex") Cpxx'x"px'x"x as regards satisfiability according to Prop. 24. 23. Continue Ex. 22 to find a binary J^-statement with exactly one binary predicate which is equivalent to (22) as regards satisfiability according to Prop. 25. 24. Ditto for (Ex) (Ax')(Ex")\(Ax'") Cpxx'x"px'x"x'", (Ex) (Ax')(Ex", x"1) Cpxx'x"x'"x"px"xx'x"xm. 25. Demonstrate Prop. 12 for the system I^Q. 26. Demonstrate Prop. 14 for the system I^c. 27. Show that the singularly I!FC is decidable. The only predicates in the singulary I!FC other than / are singulary. 28. Show that B(X= Y)(Au)B(ueX)(ueY) and B(X = Y) (A U) (XeU) (YeU) are independent. [See Robinsohn, J.S.L. 4, 69.] 29. Show by formula induction X= Y B where F is free for X, Y in
Examples 3
30. Show and
209
B<j>{ Y}(AX) C(X = Y) 4>{X} B
31. Show
B(XeY) (Eu) (u = X) K(u = X) (weY).
32. Show
X = y(yeX).
33. Show
(A£,',...,^)B
where S is like y except for containing ijr at some places where y contains ^ and £', ...,£,(v) exhaust the variables with respect to which those occurrences of 0, i/r are bound in y, S.
where \]r is like 0, except for containing 8 at some places where $ contains y and £',..., £^p) exhaust the variables with respect to which there occurrences of y, S are bound in
jy 7 / V
T7"\
Rel (X, Y)
/ A
\ ID V
Tr
(Au, v) BuXvu Yv X=Y
36. Show
{u, v} = {x, y) DK(u = x)(v = y) K(u = y) (v = x)
37. Show
(u,v) = (x}y) K(u = x)(v = yY
38. Show
(Ay) (y = Fy).
39. Show
(Av) B(v = y) vXx y = X'x
40. Show (Ax)BKUnX{xeSiX){Eu)(Av)B{u = v {Ax)BKVnX{xe9X) (Ay)B(y = X'x)yXx, Y&nX (Aw)B(weY) (Eu) K{ueX) (w = Y&nX
Z&nX
(Y'u,u)y
(Au)C(ueX)(Y'u = Z'u) Y =Z
210
Ch. 3 Predicate calculi
41. Defining the sum a + b of two ordinals a, b as the ordinal isomorphic as regards order to the order type obtained by sticking the order type b at the end of the order type a, show that: (i) If 0 < b then a < a + b. (ii) b ^ a + b. (iii) If b < a then there is a unique ordinal c such that b + c = a. (iv) If b ^ a and d < c, then b + d ^ a + c. (v) a = b1 + c1 = 62 + c2> c i < c 2J then6 2 < 6X. (vi) If a = 6 + c, c is called a remainder of a and 6 is called a segment of a. Show that the number of remainders of an ordinal is finite. (vii) An ordinal a is called decomposable if a = b + c, 0 < b, c < a. Otherwise indecomposable. Show that the least positive remainder of a positive ordinal is indecomposable. (viii) If cx < c2 are both remainders of an ordinal a, then cx is a remainder of c2. (ix) The least positive remainder of an ordinal a is a remainder of every other remainder of a. (x) The only positive remainder of an indecomposable ordinal is itself. (xi) The only positive indecomposable remainder of an ordinal is its least positive remainder. (xii) If c is indecomposable and b < c then b + c = c. (xiii) If c > 0 and b + c = c whenever b < c then c is indecomposable. (xiv) Any ordinal a is the sum of a finite decreasing sequence of decreasing indecomposable ordinals. (xv) If
a = c1 + c2+...+cn,c1
> c2 > ... > cn
and
cl9c29...,cn
in-
decomposable then cx is the greatest indecomposable ordinal < a. (xvi) If A is a set of indecomposable ordinals then HA is indecomposable. (xvii) Every ordinal can be uniquely represented as a finite sum of non-decreasing indecomposable ordinals. 42. Defining the product ab of two ordinals a, b as the ordinal order isomorphic to the set of ordered pairs (c9d} c < a,d < b ordered by last differences, show that (i) If b± < b2 and 0 < a then abx < ab%. (ii) If ax < a2, b± ^ b2 then axbx ^ a262. (iii) We can have ax < a2, b > 0 and axb = a2b. (iv) If a = be, 0 < 6,1 < c, then b < a and c ^ a.
Examples 3 (v) If abx < ab2 then bx < b2. (vi) If axb < a2b then ax < a2, (vii) If ab± = ab2, 0 < a then bx = b2. (viii) lie < ab t h e n u n i q u e l y c = ab1 + d,b1
211
< a.
(ix) If 0 < a then b = ac + d, d < a, uniquely. (x) If c is indecomposable, 1 < c then ac is indecomposable, (xi) If 0 < a then the least indecomposable ordinal greater than a is aco.
(xii) If c is indecomposable and positive then the next greater indecomposable ordinal is ca). (xiii) Every indecomposable ordinal is divisible on the left by every lesser positive ordinal and the quotient is indecomposable. (xiv) An ordinal is prime if it is greater than unity and is different from the product of any two lesser ordinals. Show that every ordinal > 1 is the product of a finite number of primes. (xv) Show that the number of right divisors of an ordinal is finite. 43. Defining ba for ordinals a, b as the ordinal order isomorphic to the order type of functions over a with a finite number of non-zero values in b ordered by last differences, show that: (i) This order type is that of an ordinal. (ii) If 0 < a < 6, 1 < c then ca < cb. (iii) If 0 < a < 6, 0 < c then ac < bc. (iv) ca+b = ca.cb, 0 < a, 6, c. (v) If b is a limit ordinal, 1 < a then ab is the limit of all ordinals 0 a for c < b. (vi) (aa)c = abc, 0 < a, 6, c. (vii) o)a is indecomposable for a > 0. (viii) If 0 < a, 1 < c then a < ca. (ix) If 0 < d, 1 < c then there is exactly one ordinal a such that ca ^ d < ca+1.
(x) Every indecomposable ordinal ^ co is of the form a)a for some a > 0. (xi) Every ordinal a > 0 can be uniquely expressed in the following normal form: „ d = fc)ci. 71^ + (i) 2 . 7l2 + . . . + (if™. 7lm
w h e r e cx> c2> ... > cm a n d nx,n2, ...,nm a r e i n t e g e r s . 44. A n e-ordinal is a n o r d i n a l e w h i c h satisfies e = co6. S h o w t h a t : (i) I f e0 = co + a)*0 + w(6>CU) + ... t h e n e 0 = 6>e«.
212
Ch. 3 Predicate calculi
(ii) e0 is the least e-ordinal. (iii) If 1 < c < e0 then e0 = ce«. (iv) If 0 < c, c0 = c, cn+1 = G)cn then limcn is the least e-ordinal not less than c. 45. Show that there exist ordinals which satisfy a = o)a. [Consider c0 = ^> cn+i = Mcn>
c
= limcn.]
46. Show that: (Ax, x') D(x = x') K
--->z{n\ x',...,x(0)}
can be expressed in the form
A
2 HK{z',..., z(7r)} FK{cj), ..., = , J, ?}, where 5 stands for 2',..., z(7r) and j for /c = l
x',...,xf®. Hence show that (E%)(A%)F{$,..., = , J , j } is equivalent to one of (Z?s) (^l j) HKFK for some /c, 1 ^ K ^ A. 48. Show that DN(Ax) (Ey) <j>{x, y] (Ex, x', x", x'") (DKKKKK
x")
is generally valid. Ackermann J.8.L. ai (1956), p. 197. 49. Show that (i) (Ex) DN4>{x, x) {Ay) NK
are generally valid. Oglesby (1962).
Chapter 4 A complete, decidable arithmetic. The system Ao
4.1 The system Aoo In this chapter we construct the formal system Aoo. It is a very simple arithmetic with familiar fundamental concepts. These are: the natural number zero, the successor function, the operation of repeatedly applying a function, the operation of forming functions by abstraction, equality and inequality between numerical expressions, and the logical connectives, conjunction and disjunction. The atomic statements are equations and inequations between numerical terms, compound statements are built up from atomic statements by conjunction and disjunction. Negation, material implication and material equivalence are definable, but existential quantification and universal quantification are unrepresentable. We give definitions of AQ0-truth and of A00-falsity for closed AOostatements, and show that they are exclusive properties. We also show that a closed A00-statement is A00-true if and only if it is an Aoo-theorem. Thus the system Aoo is consistent in the sense that Aoo-theorems are Aoo-true; and is complete in the sense that A00-true AOo-statements are Aoo-theorems. We give a procedure which applied to a closed A00-statement will terminate and tell us whether it is A00-true or is A00-false. Thus the system Aoo is decidable. 4.2 The Aoo-rules of formation To construct the system Aoo we first list the A00-signs and attach a type to each proper A00-symbol and give each Aoo-symbol a name which will assist the reader in understanding how the system was first conceived. Parentheses round type symbols are usually omitted by association to the left as explained in Ch. 1. (See table overleaf.) The required unending sequence of variables is obtained by repeatedly attaching primes, thus: x, x', x", .... The table order of the proper and improper symbols is called their lexicographic order. In the system Aoo [ 213 ]
214
Ch. 4 A complete, decidable arithmetic. The system Aoo symbol 0 8 = =1= & V
name
type I
11 Oil
oil 000 000
J
ui(iu)
X
i
zero successor function equality inequality conjunction disjunction iterator operator variable
The improper isymbols are, as usual: X ( )
abstraction operator left parenthesis right parenthesis generating sign
the natural numbers are represented by certain formulae of type i called numerals. These formulae are defined by the following rules: (i) 0 is a numeral, (ii) if v is a numeral, then so is (Sv), (iii) these are the only numerals. Thus 0, (SO), (8(80)), (8(8(80))), ... are the numerals. Note that a numeral is a formula, distinct numerals are distinct formulae. An Aoostatement is an A00-formula of type o, according to the universal rules as given in Ch. 1. The only abstracts allowed are of types u, ui, uu, etc., i.e. (kx
to denote undetermined numerals, to denote undetermined formulae of type i, called numerical terms, to denote undetermined variables of type i, to denote undetermined statements, to denote undetermined functors of type a, ui, etc.
4.2 The Aoo-rules of formation
215
In the last case the type can be introduced as a subscript if desired. From now on we usually omit the concatenation sign. Thus = otfi stands for an undetermined equation between numerical terms, &
(<* = /?) for
D113
(>&>ft)for
D114
(?W^) for
= a/?,
(a 4= fi) for
+ a/?,
&$ft,
v#\
We shall frequently use the parenthesis convention, thus: ^ Vftv x stands for ((<}) vft)vx)
an
d hence for ((V ((V (j>) ft))x)>
The AoO-rules of consequence
The system Aoo has the following axiom schemes: Ax 00 .1
(oc = a),
Axoo.2.1
Offa#0)f
216
Ch. 4 A complete, decidable arithmetic. The system Aoo
Ax 00 . 2.2
(0 4= Sot),
Axoo.3.1 Axoo. 3.2
where a, ft are closed numerical terms and p is a closed function of type in. i... i is the result of striking out each parenthesis and the zero in the Sn- times
formula (Sn) and then replacing each occurrence of S by a corresponding occurrence of i and replacing the parentheses by association to the left. Ax 00 .4.1 Ax oo .4.2
(X£. p{$) PP'... P* = p{p}pf... /?<*>, (7
where /){£} is of type i... i and both sides are closed and £, £' fail to occur Sn-timea
free in p{TL] and I \ is free for £, £' in p{Tt}. When written in full the parentheses are put back by association to the left, viz.: This axiom allows us to apply an argument to a function. We really only need functions of at most two arguments but it is sometimes useful to have functions of any number of arguments. This makes these two axioms more complicated. In them p{£,} is of type i... i so it must be of the form: where a{£, £',£", ...,£(7r)} is a numerical term, or it could be; (Xg'(Xg*(...(Xg<'-»5)...)) or or
(H'(H"
with appropriate modifications if n = 0,1,2. By repeatedly applying Axoo- 4.1, 4.2 we can change any Ef® to a new variable. The first axiom scheme states a familiar property of equality and the two parts of the second axiom scheme state a familiar property of the successor function. We require both parts because if we worked with only one part then some properties of inequality which we require would fail. The third axiom scheme shows how the iterator operator acts. This may become clear if Ax00. 3.2 is written in ordinary mathematical notation: suppose that / is a function of two arguments, then {{Jf)m{n+l)=f{n,{Jf)mn)).
4.3 The A00-rules of consequence
217
If we write g[n, m] for ^ffmn we obtain: g[n + l,m]
=f[n,g[n,m]].
g[3,m]=f[2,g[2,mj\
Thus
since g[0,m] = m by Axoo. 3.1. In general g[n,m] = / [ » , / [ » - 1 , [ f [ » - 2 , ...,/[2,/[l,/[0,m]]]... ]]]]. The system Ao0 has the following rules of procedure: ^{a} v
p
where a and /? are closed numerical terms and the A00-statement form
where a and J3 are closed numerical terms and co is a closed A00-statement and is subsidiary as in R 1, (a #= /?) and (Sot 4= #/?) are the main formulae. Note that since a and /? are closed in R 1 then a variable whether free or bound is unaffected by applications of R 1. The remaining rules are labelled in a different manner because they are some of the rules of &c. In listing them, except in one case, we omit the condition that the A00-statements in them be closed because they can only be used in A00-proofs when this is so. In the exceptional case the condition must be stated otherwise free variables could be introduced into an Aoo-proof. I.
Remodelling rules 0)' V
w'vfv^vw' II.
permutation
Building rules (a) -X— dilution
(b'\ ^yC°
^V0)
composition
218
Ch. 4 A complete, decidable arithmetic. The system Aoo
In II (a) (j) is closed. In II (&') the order of the premisses is immaterial. The Aoo-statements a>, a)' are subsidiary and may be omitted, the other Aoo-statements are the main formulae. The A00-statement x is secondary and must be present. We have omitted parentheses by association to the left and the outer pair is usually omitted. The rules are known by the names beneath them. 4.4 Definition of A00-truth We say that a closed numerical term 7 determines a numeral v under the following conditions: (i) 7 is v. (ii) v can be obtained from 7 by replacements of the following kinds: (a) replace an occurrence of ^paO by a corresponding occurrence of a; (b) replace an occurrence of Jrpoc(Sj3) by a corresponding occurrence of (c) replace an occurrence of X£.p{£}/?/?'.../?(7r) by a corresponding occurrence of p{/?}/?'... /?(7r); (d) replace an occurrence of X£./?{£}/?/?'... /?(7r) by a corresponding occurrence of X£' ./ where £ and £' fail to occur free in p{TL} and TL is free for £ and £' in p{Tt}. If 7 determines the numeral v then 7 = v is an Aoo-theorem and the above replacements give a special type of Aoo-proof of 7 = v. We say that an A00-statement is A00-true if and only if it satisfies the following conditions ^ 0 . (i)
4.5 Definition of Aoo-falsity
219
4.5 Definition of A00-falsity A closed Aoo-statement
4.6 Exclusiveness of A00-truth and A00-falsity We shall see later that A 00 -truth and A00-falsity are exclusive properties. First we show that a closed numerical term determines a numeral. Let the rank of a closed numerical term a be the numeral defined as follows: erase all symbols in a except the iterator and abstraction symbols, then replace each iterator and abstraction symbol by a successor symbol then place zero on the right and add parentheses by association to the right. If a is without iterator or abstraction symbols then the rank of a is to be zero. Let the order of a closed numerical term a be the numeral defined as follows: erase all symbols of a except the successor symbol then place zero at the end and add parentheses by association to the right. If a is without successor symbols then the order of a is to be zero. If a is a closed numerical term of rank zero then a is a numeral and this numeral is the order of a. We now give a method for finding a numeral determined by a closed numerical term a by reference to terms of lesser rank (i.e. whose rank is a proper part of the rank of a) or to terms of the same rank but lesser order. Thus continued application of the method will ultimately stop. If the closed numerical term a is of rank zero then it is a numeral and by condition (i) for determining a numeral we have found a numeral which a determines. Let a be of rank 8n, where n is a numeral, first suppose that a is of one of the forms Jpfly or (X£.£{£})/?/?'...fi(6\ where 7>fi,P'>...,/ff(^ &re closed numerical terms, p is a closed function of two arguments and 8{g\ is a function of type t... 1 with at most £ as free >S0-tiines
220
Ch. 4 A complete, decidable arithmetic. The system Aoo
variable, and £ fails to occur free in #{I\}. The ranks of J3 and y are less than that of a, suppose that we have found that they determine numerals K and v respectively. Then we require to find a numeral determined by J>pKV or 8{K)/?'... fi^ respectively. These are both of lower rank than a except that in the first case if both /? and y are numerals then Jpfly and its rank are unaltered. In this case if v is zero then a determines K, while if v is Sv' then we require to find the numeral determined by pv'{JpKv'). This is of higher rank than a but J>pKv' is of the same rank but lower order than a. Suppose that JpKv' determines the numeral 6 then we require to find a numeral determined by pv'd, but this is of lower rank than a. Thus in each case we are ultimately referred to terms of lower rank than oc or to terms of the same rank but of lower order, thus the process will ultimately stop at a term of rank zero and so produces a numeral which <x determines. Secondly if a is of the form S... Soc', where S ...8 stands for a sequence of successor symbols (in full notation parentheses should be inserted by association to the right) and oc' is a closed numerical term of one of the forms already considered, then we deal with en! as in the previous cases. Note that the reduction can be performed in several different ways so that it might happen that a closed numerical term determined several distinct numerals according to the method of reduction. We now show that a closed numerical term determines a unique numeral. 1. A00-truth and A00-falsity are exclusive properties We have to show that a closed A00-statement fails to be both A00-true and Aoo-false. We first show that a closed numerical term determines a unique numeral. Suppose that the closed numerical term y determines the numerals K, V. Then according to a previous remark we obtain A00-proofs of the equations y = K and y = v and hence by R 1 of K = v. It suffices then to show that if an equation K = v, where K and v are numerals, is Aooprovable using only Aoo-axiom schemes 1, 3.1, 3.2, 4.1, 4, 2 and rule R 1 then K is the same numeral as v. Given a closed numerical term oc we define the standard determination (s.d.) and the standard Aoo-proof (s.p.) of the equation a = v inductively on the rank and order of a. (i) a is zero, the s.d. of a is 0, the s.p. of a = 0 is 0 = 0. (ii) a is #/?, the s.d. of a is Sv, where v is the s.d. of /?. PROP.
4.6 Exclusiveness of A00-truth and A00-falsity
If B fails to be a numeral the s.p. of a = Sv is
,; a = Sv
221
R 1: where
jS = vis the s.p. of /? = y. If/? is a numeral y the s.p. of oc = Sv is Sv = $*>. In this case a is $*>. (iii) a is */p/?y; J3, y are of lower rank than oc let their s.d.'s be K, n respectively. JpKn is of lower rank than oc unless /?, y are /c, n respectively. (a) In the first case let v be the s.d. of JpKn and let /? = K y = n = vbe the s.p.'s of ft = /c, y = n and ,//)/c77 = v respectively, then:
a = Jpfiy
J3 = K y = v
rri
/
. R 1 twice
JpKTT = V
a =v is the s.p. of a = v. In this case if only one of /?, y is a numeral then the rank of the other fails to be zero. If ft or y is a numeral then we omit J3 = K or y = 77 as the case may be. (6) If/?, y are /c, 0 respectively then the s.d. of a is K and the s.p. of a = K is a = /c (3.1). (c) If /?, y are K, Sn respectively then JpKTt is of the same rank but lesser order than a let its s.d. be 6 and let the s.p. of JpKir — 6 be = 6 also pnd is of lesser rank than a let its s.d. be v and let the s.p. of pnd = v be pTrtf = y. Then the s.d. of a is y and the s.p. of a = P is J^/CTT = a = pn(JpKn) (3.2) .—
a — pnd
pnd = v
si) 1.
a = v (iv) a is (X£./?{£})/?/?'... /?(7r), /? is of lower rank than a let its s.d. be K
222
Ch. 4 A complete, decidable arithmetic. The system Aoo
and let the s.p. of /? = K be f} = K. P{K} /?'... /?(7r) is of lower rank than oc let its s.d. be vand the s.p. of />{*}/?'.../?(*> = v be /){*:}/?'.../?<*> = (9. Then the s.d. of a is v and the s.p. of a = v is
a =/>{/?}/?'.../^>(4.)
/
,
.
a = y
Clearly the s.d. v of a and the s.p. of a = v are unique. We now show that a closed numerical term determines a unique numeral. Suppose that the closed numerical term a determines the numerals v and K, then we have A00-proofs of a = v and of a = /c and hence of p = K. The A00-proof of y = K uses Aoo-axiom schemes 1, 3.1, 3.2, 4.1, 4.2 and rule R 1 only. LEMMA. / / the closed numerical terms y and y' have the same s.d.'s then so do the closed numerical terms a{y} and oc{y'}. If I\ fails to occur in a{I\} the result is trivial. The result is also trivial if y is the same as yr. Thus we may suppose that one of y, y' is distinct from a numeral and that TL has an occurrence in oc{TL}. We use induction on the rank and order of a{v] where v is the common s.d. of y and yr. a{I\} is I\, the result is trivial. oc{TL} is $/?{rt} then fl{v} is of the same rank and lesser order than a{y}, hence by our hypothesis /?{y} and P{y'} have the same s.d. By (ii) oc{y} and ot{y'} have the same s.d. a{FJ is Jp{Y}p{T]8{Y} then p{v) and 8{v} are of lower rank than a{v}, if I\ occurs in /?{I\} then by our hypothesis fi{y} and /?{y'} both have the same s.d., say v\ otherwise they trivially have the same s.d., similarly 8{y} and 8{y'} both have the same s.d., say v". By (iii) a, the s.d. of oc{y} is the same as that of «/p{y} v'v", and the s.d. of <x{y'} is the same as that of Jp{y'} v'v\ this is trivial if /2{Tt} is v' and 8{TL} is v". (Note that by the lemma of Ch. 1 y fails to overlap p, /? or S.) If v" is 0 then J*p{y) v'O and •//>{/} v'O, by (iii) 6, both have s.d. v'. If v" is Sv'" then by (iii) c the s.d. of .//){y} v\Sv'") is the same as that of p{y) v'"0 and the s.d. oiJp{y'}v\Sv'") is the same as that of p{y'} v'"d\ where 6 is the s.d. of *//>{y} v'v'" and d' is the s.d. oiJfp{y'} v'v1". Now ^ { P } ^ V is of lower rank or of the same rank and lesser order than a{v}, (this occurs also if /?{I\} and S{Tt} are
4.6 Exclusiveness of A00-truth and A00-falsity
223
both I\ or are both numerals) hence by our hypothesis «//>{y} v'vm and Jp{y'}v'vf/f both have the same s.d., thus 6 = 6'. By (iii)c the s.d.'s of Jp{y}v'{Svr") and Jp{y'}vf(8v'") are the same as those of p{v}v"'d and p{y'} v'"0 respectively. Now the rank of p{v} v'"6 is lower than that of oc{v}, hence by our hypothesis p{y} v'"6 and p{y'} v'"0 both have the same s.d., say v. By (iii) the s.d.'s of a{y} and ot{y'} are both v. a{FJ is
where 8{TL,£} is of type t... t and has at most £ as free variable and SSn -times
j3{Tt}, ...,/?(7r){rt} are closed numerical term-forms. fi{v) is of lower rank than a, hence by our hypothesis /?{y} and fi{y'} both have the same s.d., say K. By (iv) a{y} and a{y'} have the same s.d.'s as ${y, K}/3'{y}... /?(7r){y} and 8{y',K}P'{y'}...p<*\y'} respectively, but 8{V,K}/3'{V}.../3W{V} has lower rank than cc{y] hence by our hypothesis S{y,K}ft'{y}... fi^{y} and 8{y',K}fi'{Y'}:.Pn){y'} both have the same s.d., say 0. By (iv) 6 is the s.d. of a{y} and of a{y'}. This completes the demonstration of the lemma. We now show that if the upper formulae of an application of R 1 are equations and if both sides of each equation have the same s.d.'s then the same holds for the lower formula. Let the application of Roo 1 be
By our supposition and the lemma oc{y} and <x{y'} have the same s.d. and so do /?{y} and /?{y'}. But by hypothesis a{y} and J3{y] have the same s.d., hence a{y'} and fi{y'} have the same s.d. To complete the demonstration that v is the same as K we require to showthat both sides of an Aoo-axiom 1, 3.1,3.2,4.1,4.2 have the same s.d. For Ax. 1 this follows at once because the s.d. is unique. For Ax. 3.1 the result follows from (iii) a, b and the uniqueness of the s.d. For Ax. 3.2, let the axiom be ypa(Sfi) = pfi^pafi). If K, n are the s.d.'s of a, /3 respectively then by the lemma the l.h.s. has the same s.d. as J"pK(Sn) and the r.h.s. has the same s.d. as pn^pKn). IT is the s.d. of/? and 8 is the s.d. of (
224
Ch. 4 A complete, decidable arithmetic. The system Aoo
(X£. S) fi^pocfi) is by (iv) the same as that of S{J>pcif}) which in turn is 86, and the s.d. of (X£. S) nd is similarly Sd, thus in the second case both sides of Ax. 3.2 have the same s.d. In the third case the s.d. of the l.h.s. is the same as that oipnd and the s.d. of the r.h.s. viz. J
By (iv) the s.d. of the l.h.s. is the same as that of Y{K} fi'... fi^ where K is the s.d. of /?. By the lemma the s.d.'s of y{0}/3'...fi(n) and y{*}/?'.../?<*> are the same. This completes the demonstration that if v = K is an Aootheorem then v is K and hence a closed numerical term determines a unique numeral. 4.7 Consistency of Aoo with respect to A00-truth It now easily follows that a closed Aoo-equation is either A00-true or A00-false and fails to be both. Similarly for closed A00-inequations. It then easily follows that a closed AOo-statement has the same property. 2. The system Aoo is consistent with respect to A00-truth. We have to show that each Aoo-theorem is A00-true. We show that the Aoo-axioms are AOo-true and that the A00-rules preserve A00-truth. Since a closed numerical term a determines a unique numeral then both sides of Ax. 1 determine the same numeral, hence by &~00 (i) Ax. 1 is A00-true. Each Aoo-axiom of the schemes 2.1, 2.2 is A00-true because a closed numerical term determines a unique numeral and Sec determines a numeral distinct from zero and zero determines zero, hence by ^00 (ii) the Aoo-axioms 2.1, 2.2 are A00-true. Each Aoo-axiom of the schemes 3.1, 3.2 is Aoo-true by the definition of determining a numeral, for both sides of 3.1, 3.2 then determine the same numeral. Similarly for Aoo-axioms 4.1, 4.2. PROP.
4.7 Consistency of Aoo with respect to Aoo-truth
225
We now show that the A00-rules preserve A 00 -truth. Rule R 1 preserves Aoo-truth. Suppose a = ft V o) is A00-true, where a and /? are closed numerical terms, and that
-pj**-y
R1
The premisses are A00-true because a, fi determine the same numeral. And so do a, y. But the conclusion is false. 4.8 Completeness and decidability of Aoo with respect to A00-truth 3. The system Aoo is complete with respect to A00-truth. We have to show that each A00-statement which is A00-true is an Aootheorem. We show this by formula induction. Let ^ be an A00-true Aoo-statement then <j) is closed. First suppose that (j) is atomic, then PROP.
226
Ch. 4 A complete, decidable arithmetic. The system Aoo
^ is of one of the forms: a = ft or a 4= /?, where a and /? are closed numerical terms. In the first case a and /? determine the same numeral, say 6 and 6 is unique. Then a — 6 and /? = 6 are both A00-provable; from R 1 we obtain a = J3. In the second case a and /? determine distinct numerals which are unique, say they are v and K respectively. Since v and K are distinct numerals then one will be a proper part of the other, let v be S... SK, where S ...S denotes a non-null succession of successor symbols so that S ...8K with appropriate parentheses stands for a numeral. We first Aoo-prove: 8 ...8K 4= K. We have
8(S... 80)^0 S***S(S...S0)*S***S0
Axoo2.1 ^peatedly,
R2
where £***£ acts like S...S. If K is £***£0 then £***(£... £)0 is i>. Thus v 4= /c is an Aoo-theorem. Similarly if K is S ...Sv we have the Aoo-theorem y == j /c using Ax 2.2 instead of Ax 2.1. But we also have a = v and fi = K and so by R 1 we obtain a =)= /?, as desired. Now suppose that
5. If an Aoo-conjunctand is Aoo-provable then so is each conjunctand. This follows similarly.
PROP.
4.8 Completeness and decidability of Aoo with respect to Aoo-truth
227
6. The system Aoo is decidable. Let (j) be a closed A00-statement then we can decide whether (j) is A00-true or is Aoo-false, hence we can decide whether
4.9 Negation in the system Aoo The system Aoo lacks a negation sign, nevertheless negation can be Aoo-defined as follows: D 115 N[(' 0/'] for the result of replacing: = * & V throughout 0. Similarly material can be A00-defined: D116
["^->f"]
T> 117
["><-> f " ]
by * by = by v by & implication and material equivalence
for for
We use square brackets and inverted commas in this type of definition to denote that (j) fails to occur in N["
228
Ch. 4 A complete, decidable arithmetic. The system Aoo
We can divide the closed A00-statements into two classes so that one class consists exactly of the negations of the other class. To do this we enumerate the Aoo-formulae by length and order those of equal length lexicographically. We then run through the list testing for being an Aoo-statement, when we come across a closed A00-statement we put it in the first list provided its negation is absent from the segment of that list so far obtained, otherwise we place it in the second list. We could obtain other definitions of AOo~truth and AOo-falsity which preserve the property of exclusiveness and such that N["
rule
o)r &
Destruction rules l l
a
v
)&>X' concentration
,
(<j> V i/r) & a) dispersion
a 4= /? & a)
R2' £^ In these rules OJ and cof are subsidiary and can be absent, x is secondary and must be present. The Boo-axioms are A00-false and the B00-rules preserve A00-falsity. Thus the Boo-theorems are A00-false and so B oo is consistent with respect to A00-falsity. On the other hand a closed Aoo-false Boo-statement is a Boo-theorem. Thus B oo is complete with respect to A00-falsity. To show that B oo is complete with respect to Aoo-falsity we use formula induction.
4.10 The system Boo (the anti-A00-system)
229
oc =f= v /? =f= K
R 1' repeatedly.
Similarly if oc =)= y5 is A00-false then a = /? is A00-true and so a and /? determine the same numeral, say v. We then have:
" +" ^
Rl'. Note ^ + ^ 4 + 2 '
Rr.
a + /? v # ft If ^4 is ^' & ^" and is A00-false then ^' or
A complete arithmetic was first given by Myhill (1950) who built on the basic logic of Fitch (1942). Subsequently Lob (1953) gave an equivalent system. The system Aoo is a weaker but simpler system, and the system Ao, which follows in Chs. 6 and 7 serves much the same purpose as the systems of Myhill and Lob. It is simpler in that it deals with the natural numbers rather than with sequences or chains. Myhill uses the ancestral
230
Ch. 4 A complete, decidable arithmetic. The system Aoo
(first occurring in P.M.) and Lob uses a limited universal quantifier in order to obtain primitive recursive functions. We use the iterator symbol for this purpose, this had previously been used by Goodstein (1957). Church's (1941, 1936) X-conversion and Curry's (1958) combinatory logic serve somewhat the same purpose as the three systems just mentioned, but they are very differently constructed and were conceived for very different reasons. The successor symbol goes back to Peano (1897), the others we have mentioned before except the generating symbol, this has been frequently used in formal definitions of systems. It is easier reading to have many different letters for variables such as x y z u v w and their superscripted and subscripted varieties, but this makes the definition of the system longer; later on in the book we use several different letters. Greek or Gothic letters in the metalanguage probably go back to Carnap (1937) at least. The system Aoo is set up after the fashion of Gentzen (1934, 1955). As we proceed we try to define a truth-definition for each of our systems, this gives it meaning whereas without this it is purely symbolic. Throughout the book we keep inequality as a primitive, until we have to adopt a negation symbol if we wish to express negation in the system. The proof of Prop. 1 is due to Rowbottom [by letter]. Prop. 4 is the sort of thing the intuitionists like. In fact we try to satisfy their wishes as long as possible, see Heyting (1956). A formalization of recursive arithmetic was given by Curry (1941). EXAMPLES 4
1. Find the numerals determined by: Jf(kX.S)K7T9
J{\xx'. J(\x". S) KX') On, J{\XX'. J$sX*Xm . Ji^X^S)
KX'") OX') lTT,
for K, n = 0, 1, 2, 3, 4. 2. Show that Modus Ponens is a derived rule in Aoo. 3. Similarly for the rule I b. 4. Show that if p[v, 6] = Jf{\xx'.
Examples 4
then
p[v, 0] = K9 p(v,Sd)
where
= cr[v ^ d,
v -^ K = . / ( X ^ ' . J^x"x'".
x") 0^') VK.
5. Show that the Deduction Theorem holds in Aoo. 6. Give Aoo-proofs of a)' y
wWjiv
231
Chapter 5 A00-Definable functions
5. i Calculable functions An Aoo-function is an Aoo-formula of type u, tu9..., these are called one-, two-, ...place A00-functions. If p is a one-place A00-function and if v is a numeral then (pv) is a numerical term and determines a unique numeral called the value of the function p for the argument v. Similarly if p is a two-place A00-function and if v and K are numerals then ((pv)K) is a numerical term and so determines a unique numeral called the value of the function p for the arguments v and K, in that order. Similarly for many-place functions. A calculable function of natural numbers is a rule or set of rules such that given the ordered argument set we can by following the rules effectively find a natural number called the value of the function for that argument set, the value must be unique and a given function always requires the same number of arguments. Thus the Aoofunctions are a particular kind of calculable function in that the rules for finding the value given the argument set is of a particular kind. For example a set of instructions to find the value of a function of natural numbers might be: replace (S(Sv)) by v as long as possible, this would replace an even number by 0 and an odd number by (SO). Clearly we could always choose or alter our notation so that the natural numbers are represented as we have done in the system AOo- We shall say that a calculable function / of natural numbers is AOo~definable if there is an Aoo-formula p of type t... i, where n is the number of arguments of the Sn-times
function, such that pv'... v^n) determines the numeral v when and only when this is the value of the function / for the arguments v'... v^n\ We want to investigate the kind of calculable functions that are Aoodefinable. There are various known classes of calculable functions defined according to the kind of rules allowed in the calculation of the value of the function. Many of these arose from attempts to make a rigorous definition of 'calculable'. One of the earliest such definitions [ 232 ]
5.1 Calculable functions
233
is the definition of primitive recursive function. It was, for many years, thought that the class of primitive recursive functions was co-extensive with the class of calculable functions, but Ackermann produced an example of a calculable function outside the class of primitive recursive functions. Another method due to Turing produces a class of calculable functions more extensive than the class of primitive recursive functions and including the example of Ackermann; this method is thought to form an exact definition of the vague intuitive concept of calculable function. But how does one demonstrate that an exactly defined concept is the same as a vague intuitive concept ? 5.2 Primitive recursive functions The primitive recursive functions are obtained from certain initial functions by applications of certain schemes which produce new functions from already acquired functions. The initial functions are: (i) the successor function S; (ii) the constant function zero. D118
0(Sn) for
Xa'...a^0:
(iii) the identity functions: D119 U{Sn) for \x'...x^\x',
...,U\%Z] for Xx'... x^. <xP"\
Then U{%x'... x^ = x&\ The schemes are: (i) the scheme of substitution:
D120
l^d^Sn.
(a-/[r',...,r^)])^),for 7^'... x^.O-{T'X'
... x^)...
( ^ x ' . . . aP">);
where the type of 0* is 1... L Thus SSd -times
Note that we substitute the undetermined values of functions with the same number of arguments; this is quite general because otherwise, by use of the identity functions this requirement can always be satisfied. Thus
px'x" = pU'(Sn)x' ...x®"W;Sri)x'... «W»), etc.
234
Ch. 5 A00-Definable functions
(ii) the scheme of primitive recursion: D 121
Ta-iw) for Sir
h'...
af*>) {
where cr is of type i ... i and the type of r is i ... i; if n = 0 then replace #7r-times
SSSn-timea
(ax' ...x(n)) by a. From this we obtain by Ax. 3.1, 3.2: ra{n)Ov'... iW = ffrzx'.rxx'v'...
v^) (avf... v^) 0
= oV...i», T*U8V)
(3.1)
v'... iW = J{\xx'.rxx'v'...
iK»>) (ov'... iW) (Sv)
= Tv(J(kxx'. raseV ... v^) (ov'... v^) v) = TV(TO-{7T)VV'
...v^).
(3.2)
This is the scheme of primitive recursion with n parameters. The scheme of primitive recursion without parameters is: r?(0) for so that
7iX.Jr(Ax'x".Tx'x")ax,
f<7(0) 0 = oc and
Thus starting with the initial functions and applying the schemes of substitution and primitive recursion repeatedly we generate a class of functions. Each primitive recursive function has a construction sequence showing how it is built up from the initial functions by the schemes. 1. A primitive recursive function is A00-definable. We have just given the required details and definitions, i.e. we have Aoo-defined the initial functions: successor, constant and identity functions and have A00-defined functions obtained from already acquired functions by the schemes of substitution and primitive recursion. PROP.
2. A closed Aoo-function is primitive recursive. I.e. a closed Aoo-function can be obtained from the initial functions by the schemes of substitution and of primitive recursion. A closed Aoo-function /?{!} is called 7^-0-reduced if the A00-functionform p{&} is without occurrences of zero and without parts of the form If we had allowed other occurrences of the abstraction PROP.
5.2 Primitive recursive functions
235
symbol then we could apply the X-axiom and eliminate them. The order of a X-O-reduced A00-function is defined as follows: strike out every symbol in the term except the iterator symbol, then replace each iterator symbol by the successor symbol add zero on the right and parentheses by association to the right, the resulting numeral is the required order. If a X-O-reduced A00-function is without occurrences of the iterator symbol then its order is to be zero, and the function is either the successor function, a constant function zero or an identity function or the result of applying the successor function to either of these functions. These functions are primitive recursive being either initial functions or functions obtained from the initial functions by the scheme of substitution. Any other X-O-reduced A00-function is of the form: where Xr./?{£}, Xjy{r.} and p{t} are X-O-reduced A00-functions. Clearly if p{t} is a recursive function then
236
Ch. 5 A00-Definable functions
In ordinary mathematical notation the connexion between these orderings is given by {
where + n-
N2.(N2+l)...(N2 Nx = i/ + . . . + i^»>,
and
N2 =
In particular {v} = 2. j> + 1. It is easily seen that each natural number v is uniquely expressible in the form {v\ ...,v{Sn)} since ..,^)}
=
_L
for
x
^
l
Thus a numeral Aoo represents either a natural number or an ($77-)-tuplet of natural numbers or an ordered set of natural numbers in a numbering of all terminating sequences of natural numbers. The context will make it clear which is intended. Also given a natural number v we define primitive recursive functions, called coordinate functions', pt and Pt such that pt[v,Sn,SK] = I/8K\
where
K
Pt[v,SK] = if$ \ where
v = {/,..., v^Sn)} and Sn)
v = (v'}..., v^ ) and
K ^ n, K < n,
in the last case n is the greatest power of 2 which divides v. We have y and
v=
(Pt[v,l],...}Pt[v,S7r]),
in this case TT is as described above. D 122 D123
1 for (SO), 2 for (8(80)),
...,
9 for (£8).
(a + ytf) for
the result of adding /? on the right to a is the result of applying j3-tim.es the successor function to a. + is called the addition function. Instead of D123 we could have had + for (8/Ul)3U{. Then
5.3 Definitions of particular primitive recursive functions
237
Ax. 3.1, 3.2 give
(a + 0) = a, (a + Sfi) = D124
(ax/?)
or
(a./?)
for
The result of multiplying a by /? on the right is the result of applying yff-times addition of a on the left to zero, x is called the multiplication function. Ax. 3.1, 3.2 give (axO) = 0, (a Instead of D 124 we could have had x for (+ jU\ Z7§)8 Ov D125
afi or
(otexpfi)
for
The result of raising a to the power /? is the result of applying /?-times multiplication by a on the left to 1. exp is called the exponentiation or power function. Ax. 3.1, 3.2 give * n 1 c/? 5 a0 = 1, a ^ = a x ^ , D126 !a for ,/(X£i/.0S£)xiy) la. In these definitions a and /? are numerical terms in which £ and TJ fail to occur free. ! is called the factorial function. D127 then
P
for
X
P0 = 0, P(£a) = a.
P is called the predecessor function. D128 then
(<*-=-/?) for (a -L. 0) = a,
./(X^.P^a/?,
(a - /S/?) = P(a ^- /?),
this is a limited form of subtraction. We have (a — 1) = P(a — 0) = Pa. D 129
(a < /?) for (/? ^- a) * 0,
(a > /?) for (/? < a).
D 130
(a < /?) for (a-^ fi) = 0,
(a ^ yff) for (/? ^ a).
Instead of (/? JS- a) 4= 0 in D 129 we could have had (Soc - /3) = 0. In this manner the concept of order is A00-defined. Using these definitions
238
Ch. 5 A00-Definable functions
we can Aoo-prove the familiar properties of addition, multiplication and exponentiation and of order thus: ji:
etc
(a < y) We omit the details because we have given more general results in Ch. 4 from which they can be obtained. Namely an A00-true A00-statement is an Aoo-theorem. For instance the closed numerical terms a and /? determine unique numerals, say S...S0 and 8 80 respectively. Hence (a+/3) = S...S0 + 8 SO, by repeatedly using (y + 88) = S(y + 8) and then (y + 0) = y we get (a+/?) = 8 SS ...80, similarly (yff + a) = S ...88 SO, hence, clearly (a + /?) = (fi + a). To complement rule R 2 we have g g = gfi
ta oc = p We also have for closed numerical terms oc and /?:
When we define primitive recursive functions we need only show how they are defined from previously defined primitive recursive functions by substitution and the scheme of primitive recursion because we have already given complete instructions to obtain their explicit definition by an A00-function. Thus D131
A1[0] = 0, AJUSv)]*: 1.
This is easily put in the form of the scheme of recursion without parameters, thus ^[Q] = Q Alternative definitions for A^v] are (1 ^ (1 — v))9 {v -^- Pv), D132
BJy]
for
(1-*-AJ[v]).
5.3 Definitions of particular primitive recursive functions
Alternative definitions for Bj[v] are D133
Ajjc,v\
D134 BJjc.v] We have:
for
for
J^X^J/.O)
lv, (1 — v).
AX[(K -^ v) + {v - K)].
(1 - At[K,v\),
i.e. Bt[(K - v) + (v - *)].
* [/c, i;] = 0 He
1
239
B2[K, V] = 1
AJJK, v] = 1 H«,
4
# ,
* B2[K,V]
= 0
^
~—* because a determines a unique numeral, suppose a = Sv for
some numeral v, then
A r
-i
*
a
but this is absurd by Prop. 2, Ch. 3, hence a determines 0 and so a = 0 is Aoo-provable. Suppose the function r of type u has already been defined then we define: D 135 (Sr) 0 = TO, (Sr) (/SV) = (Sr) v + r(/S^). This gives
(Sr) i; = ^(Xg^. (r(S£) + ^)) (rO) v.
We shall sometimes write: S rg for (2r)^, and so that
( 2
T£)
2 ^ = 0 if
for ^[/c ^. i;] x
S
T
v < K.
Note that S acts like a symbol of type u(a), it is called the summation function. (UT) 0 = TO,
D 136 This gives
(IIT) (SV) = (Ilr) v x r(Sv).
(Ilr) v = J^Jcfl. (r(Sg) x ?/)) (rO) v.
We shall sometimes write (rg) for and
n (r£) for
£x[/c -i- v] x ( n
II is called the product function.
(nr)p T)(K + £) + AX[K
^ v\.
240
Ch. 5 A00-Definable functions
D137
Max[v,K]
for
D 138
Min [v, K] for
D139
(Maxp)0 = 0,
(V^-K) + K. (K - (K ^-
(Maxp)Sv =
v)).
Max[(Maxp)v,p(Sv)],
where p is a one-place function. Then (Maxp) v = (Ag7/. Max [p(Sv, rj)]) (pO) v. We often write DUO
Max [/)£] for
(Maxp) v.
(Minp)0 = p0, (Minp) Sv = Min [(Minp) v, p{8v)].
We often write
Min [pE] for
(Minp) v.
D 141
Max (a{£} = 0) for
D142
Jfw(a{g} = 0) for
D143
Jfaa;(a{g}^/c)
D144
Min (a{g} < /c) for
Max
for
We sometimes write Max[p£,] for
Jfa^^.^^/S/c ^ a{g}]]. Jfcfw
i f ax
^
0
£
(
and similarly for other cases from D 141... 144 inclusive. D145
[K/V] for
Maxfe.B^g.v
- /c]].
The quotient on dividing J> into /c. D 146
toj; for
A^v]. AS i f r a (2«. (2. |>/2sq +1) = v). 0g
The value of tui> is the successor of the greatest power of 2 which divides v, butTOO= 0. D147 D[V,K] for i f ax [JB2[i; x £, /c]]. 0 ^
5.3 Definitions of particular primitive recursive functions
241
Then D[v, K] = 1 if and only if v divides K. We have D[Sv,0] = 1,
D[0, 8K] = 0. Bern [v, K]
D 148
for
(K -^ \jcjv]. v).
The remainder on dividing v into K. We have Bern [v, 0] = 0, Bern [0, K] = K and K = \KJV] . v + ifem [>, K]. D149
A|>,K]
for
[!((*; + *) - l)/\{v-1).
l^.A^.A^v].
We have A[0, K] = A[J;, 0] = 0, A[v, 1] = v. D 150
cr2[^, K] for
i f ax (Ag, /c] < v). ^^/c].
Then
for positive K.
As J^ increases CT2[V,K] increases by unity each time v is the value of A[TT, V] for some numeral n, provided t h a t K is positive. We have D151
rN[v,K,0] = v, rN[v,K, lN[v,
D 152
K,, Sn]
= N[v, K, n] ^ A[(r a [^[i;, K, V\9 (K ^- n)], (K JL TT)].
cr 3 [>, K, TT]
for
(T2[N[v, K, TT],{K^-
TT)].
We have (T3[v, K, TT] = 0 for TT ^ K, also o*3[0, /c, TT] = cz[v, 0, TT] = 0. In ordinary mathematical notation, if U ^ f/
^ . . . ^ t/^ '
^ = 6>'.(6>' + l)...(<9 / + /c-
and
(
l
I
then i^[^/C,77] = 6>(^>. (6><^> + 1).. . ( ^ ^ > + /C - ^77-)/ !(/C - 7T) + ... + 6>^>/!!.
also Thus if
- ^ - ( f r ' + l ) —(fr' + K - l )
then
TT'
=
O-2\V,TT], TT"
o*3[^? K, 0] = cr2[^, A:] = 6'.
77r/. (77r/ + 1 ) . . > (7T/7 + /C — 2)
so TT' = 6'. Similarly
= 6" =
7T < K,
242
Ch. 5 A00-Definable functions
and so on. Thus n' = 0', TT" = 6\ ...,ifr) = #<*>. Thus if {/,...,/^} = {A',...,A<*>} then / = A', ...,/*<*> = A« D 153
pt[v, K,n]
for
(cr3|>, K,(TT^I)]^
Then pt[v,K,n] = 0<*>-0
Pt[v,n]
for ptUyfi™], wv,n].
Iiv = 2^ 1 >.(2.0+1) then Pt[v,n] =pt[6,K9n]. We have Pt[v,n] = 0 if n > mv. Also Pt[0, n] = P ^ , 0] = 0. P£is also called a coordinate function. If the one-place function /> has already been defined then: D 155 {pQ for £ A [ 2 ^ , (* ^ ( ^ 1))]. l
In ordinary mathematical notation, for 1 ^ /c,
+ D156
for
2^«.(
Then {pE} =
for
(n is distinct from the concatenation sign of Ch. 1.) This is the place number of the ordered sequence of natural numbers obtained by adding the /cth ordered sequence to the right of the yth ordered sequence. We have <0V> = <*n0> = K. D 158 Ji = 0,
5.3 Definition of particular primitive recursive functions
We write
243
p£ for P((K
Definition by cases We wish to define a function p[v, K] such that
(
or[v, K] if TV = 0,
if
rV = 0,
cr"[v, K] if r"v = 0, where < rV + one T'V. r'V = r'V, of0TV, < T'V, r'V +T"V r^,is zero. 0 < TV + T'V, The p[v, K]TVis. defined by0, so that0 exactly cr[v, K] . B^TV] + or'[v, tc]. BJT'V] + cr"[v, K] . B^T'V]. Similarly when there are more cases. 5.4
Characteristic functions
3. Let ${£>', ...,£(7r)) be an A00'Statement whose free variables are exactly £', ...,£(7r), then there is a numerical term ot{£', ...,£(7r)}, with exactly the same free variables which only takes the values 0 and 1, such that for numerals v',..., v^ PROP.
W,->**i .,
and
and conversely in both cases. The Aoo-function \x' ...x(nKa{x', ...,x^} is called the characteristic function of the A 00 -statement ${£', ...,^ (7r) }. Then a{v'9...,it")} = 0 iff
tfv'9...9it«>}
is an Aoo-rule. An A 00 -statement is called primitive recursive if its characteristic function is primitive recursive. Since each Aoo-function is primitive recursive then each A 00 -statement is primitive recursive. The system Aoo is then a formalization of primitive recursion. An Aoo-statement is built up from equations and inequations by means of the logical connectives conjunction and disjunction. In the Aoostatement <}>{£,', ...,£(7r)} make as many replacements as possible of the following sorts.
244
Ch. 5 Aoo-Definable functions
Replace
fi
=y
by
A2[fi,y] = 0,
/?*y
by
5 2 [/?,y] = 0.
If ft has been replaced by /? = 0 and # by y = 0 then replace: i/r&x by •4 1 [/?+y] = 0
and ^ V % b
The result now follows from Props. 1, 2, Ch. 4 (consistency and completeness of Aoo) and remarks after D 134. Note that an A00-statement can have distinct but, of course, equivalent characteristic functions. For instance pv = pv + (crv — crv) for each numeral v. The unique characteristic function which we have defined is called the principal characteristic function. It is unique because the construction of an Aoostatement from equations and inequations is unique. Similarly we define the principal equivalent equation.
D159
for
m,m
0=0
D160
(A&4® for h
In this manner limited existential quantifiers and limited universal quantifiers can be A00-defined. We frequently write {EEnrj)K instead of (E£)K(Er))K and similarly in other cases. We have
and
D161
(vlSegic)
for
(E£,)K{K
= vn£),
v is an initial segment of K or the vth sequence of natural numbers is an initial segment of the /cth sequence of natural numbers. D162
{vESegK)
for
(E^K^V),
v is an end segment of K or the vth sequence of natural numbers is an end segment of the /cth sequence of natural numbers. D163
(vPartK)
for
{E£,I})K{K
= £,nvnn),
5.4 Characteristic functions
245
v is part of K or the *>th sequence of natural numbers is a consecutive part of the Kth sequence of natural numbers. D164
The last clause is a case of definition by cases. Then ((*£)„ ["$K£}"] remains zero until v becomes the least numeral for which
The calculation perpetuates if pn is always distinct from zero. Otherwise the least numeral n such that pn = 0 is the value of g[pO, 0]. If the calculation fails to terminate then g fails to be a calculable function. (X is called the least number operator.
Statements can also be defined by recursion schemes. Let $ and xjr be already A00-defined, let X = X* denote that x arL d x' have the same Aoo-truth values when all free variables are replaced by numerals (same variable by same numeral at all occurrences). Consider:
Replace the predicates by their principal equivalent equations. Replace cj) by oc = 0 and P{v} by pv = 0 and \Jr{v, pv = 0} by or[v,pv] = 0. Then the recursion is ^ .a x .. r pO = cc, p(Sv) = or[v,pv]9
here a and a[v,pv] take only the values 0 and 1, hence so does p\y\. Note that P{v} is free in ifr{v, P{v}} because we lack means of binding any variable that occurs in P{v} by anything that occurs in ifr{v, FJ. In Ch. 8 we shall come across an important case where a variable free in P{v) is bound in ijr{v, P{v}}.
246
Ch. 5 A00-Definable functions
5.5
Other schemes for generating calculable functions
In addition to the scheme for generating calculable functions of natural numbers, which we have called the scheme of primitive recursion, there are several other schemes for generating calculable functions of natural numbers from previously acquired functions. At first sight some of these schemes appear to be more general than the scheme of primitive recursion. We now give some schemes for generating calculable functions of natural numbers and show that they only produce functions equivalent to primitive recursive functions. Then we shall give an example of a calculable function outside the class of primitive recursive functions. A calculable function which fails to be primitive recursive requires for its calculation a search through the natural numbers for one having a certain property knowing that there is such a number, so that if we only continue the search sufficiently long we shall find the required number, but we are without a primitive recursive bound to the length of the search. Consider the scheme p[0, f] = erf, p[8v, f] = T[V, S o-'te^g, I],!], I], 0
where or, cr' andr are primitive recursive, and I denotes K',..., i6d) for some numeral 0 (if 6 = 0 then I is absent). Let p'[v, I] be the primitive recursive function defined by p'[0, f] = cr'[O, &t, 1], p'[8v9 f] = p'[v, I] + [8i>, r[v,p'\y91], I]]. Then
p[v,l] = at XB^ + TIV^-l,p'[v-
1,1],I] xAxv9
thus p[v, I] is primitive recursive. Similarly for [ p [ ] ] < [ P &
«> (\4), m i
etc.
in place of The above scheme is a form of course of values recursion when the value at SP depends on a variable number of previous values.
5.6 Course of value recursion
247
5.6 Course of values recursion We say that a function p is defined by a course of values recursion from functions r, a,
of values recursion becomes p'[Sv,I] = p\v,
1]\T[V,Pt[p'[v,!],Sa'v],...,Pt[p'[p,I],
= T'[v,p'[v,l],t],
say,
this defines the function p' by primitive recursion from the functions T,CT,(T', ....cr^. The required function is defined in terms of p' by
Thus the function p is obtained by the scheme of primitive recursion from the functions T,
p«>XSv, I] = 7<«[v, p'[v, I],..., p«»[v,!],!]. Consider {/o'[^,l], ...,p^\v,t]} call thisp[^,I]. Then P[O)f] = Ktl],... ! ^)[I]} = (r[I] say p[8v,t] = { T ' [ ^ « M M ] , « , 1 ]
^[^,!] > ^J
i<\v,pt[p\y,lld, 1], ..., = T[v,p[v,l],f]
say.
248
Ch. 5 A00-Definable functions
This defines p[v,l] by the scheme of primitive recursion from the functions cr[f] and T[V9/I,1].
The functions / / , . . . , p{6) are defined from p by p'[p,K]=pt[p[v,K],d,ll
We can combine the last two results to cover the case of simultaneous course of values recursion. For example if p[y, t] and/o'jV,!] are defined
p[8v,t] = T[v,p[
p'"iv,t]=(p'U,l]>
we can reduce the definitions to a case of primitive recursion.
5.8
Recursion with substitution in parameter
We say that the function p is defined by recursion with substitution in parameter when: p[Sv,t] = T[v,p[v9l],p[v,i[v9l\], ...,p[v9 where $W[v, |] for o&\v9!],...,
cr£n[v, I],
! for ic',..., /c^>
1^ A^/
We first use ordered tuplets and reduce to P'[O,K] =
p'[Sv,K] = where
*'[K]9
r\v,p\v,Klp\v,a\viK^l..,,pt[via^\viK\lKl
pr\y, K] = p[v, I]
with
K = {K\ . . . , K^S7T)},
a'[v,K] = {cri[^,f], ...,cr^[V,f]}, etc.,
5.8 Recursion with substitution in parameter so t h a t K' = pt [K, Sn, 1],..., K®n) = pt [K, Sn, 8n] and similarly o"i, ...,cr'SJT. The value of p'[8v,K] depends on the values of
Let
249 for
Thus the value of (p'[Sv, £]> depends on the value of (p'[v>£])- Let p"[v,K] = {p'[v,£\)y then p'[v,K] = Pt[p"[v,KlSK] and 0
"[^
Hence
p'[0, K] = (7w[/c], p " [ ^ , /c] = r"[», p^i;, cr^i;, AC]], k],
for suitable o*w and T". We can easily recover p[v, I] from p'^^,!]. Consider then the recursion with substitution in parameter p[0, K] = CT'[K],
p[Sv,
K] = T[V, p[v,
The first few values of p[v, K] are
p[2,K] = r[l,T[0,( p[3, /c] = T[2, r [ l , T[0,
d]
for
T[V,
6,
K],
that is if we put the parameter in the second argument place instead of in the third argument place. We then have p[8v,
K]
= r'[v, cr[p, 0], r\v - 1, cr[v, 1], r'\y ^ 2, a[v, 2 ] , . . . , r'[0, *[v, v],
250
Ch. 5 A00-Definable functions
for suitable cr (cr contains K). This function can be A00-defined: since by Ax 3.2 we have p[8v, K] = T'[V,
The function cf has its first $$y values (all that are relevant) given by W[v, 0] =
K,
O=[>,1] = CT[V,K],
0=0,2] = erO-i. l,(rO,/c]], 0=0,^] = cr[l,cr[2, ...,0-0,^:]...]], 0=0,^] = CT[0,O-[1,O-[2,...,O-0,AC]...]]].
The function o= can be A00-defined by o=0, d] = ./(Xgy. o-O - £', ^']) ^ . Then
o=0,0] = K,
o=0, Sd] = o-O - ^, S(H'V' • ^ - S'I V']) KOI = o-O— (9,crO— (0 — !)> •••> tr[v,/c]...]], as desired. Thus altogether P[V,K]
=
where we have replaced r' by its definition in terms of r. According to Prop. 2 of this chapter a numerical term is a primitive recursive function of all its arguments, thus p[v, K] is a primitive recursive function of v, K. 5.9 Double recursion We say that the function p is defined by double recursion from the functions or, &', o*", r when p[0, *, p] =
p[8v, 0, p] = cr*[v, pi
p[Sp,SK,p] = TO,/c,/oO,o-[v,ic,p],t}],/
5.9 Double recursion
251
Then the values of p are given on the two coordinate axes, while the value at {Sv, SK} is given in terms of the value of p[v'9 K' , p] where {v'9 K'} lies to the left or below {Sv, SK}. Clearly the process of calculating the value of p[v, K, p'\ will terminate. We shall have to find p[8v, 6, p] for 6 < SK and p[v,(r[v,6,p],p]
for Maxa[v,£,p]
for d < SK, and so on. Write
then the value of p[Sv, SK, p] depends on the value
of {p\y, £, p]). Similarly the value o{(p[Sv,g,p]) depends on the value of >!>,£, p]>. Write p'[v9K9p] for (p[v,£,p]). 0
'
>
Then the value of
0
p'[Sv,SK,p] depends on the value of p'[v,& \y,K,p\p\
Thus
p[8v,
£,plpy>
= <>, Pli>\r[v, i, Pt [R'[v,
g, tf],
1£<
where K' > K. Denote the r.h.s. of the last equation by r'[v,K,p'[v,
er>,K',p], p],
p'[Sv,K,p],p]
= T'[v,K,P'[v,cr'"[v,K',plPlr'[v,KP'[V,
1, P], pi P]
if
K> 0.
The reason for introducing K' with K' ^ K is so that the second occurrence of &" can have the same arguments as the first occurrence of tr'". Now take K' = K and denote the resulting expression by T"[V, K,P'[V,
p], pi p'[Sv, K-l,p],
pi
Suppose that for a certain 6, with 0 ^ 6 < K we have denned a function T ^ so that p'[Sv, 8K, p] = T«»[>, K, p'[v,
for
p'\8v, SK, p] = T^[V, K, p'[v,
0 < d < K, K-6, d,p]p],
p],
252
Ch. 5 Aoo-Definable functions
(we have again taken K' = K in (i)). We denote the r.h.s. of this equation by V, K,P'[V, CT'"[V, K, pi plp'[Sv,
K-d,
PI p].
We have thus denned a sequence of functions T',T", ...,7^1 Now define by recursion with substitution in parameter
f f[0, v, K9 d\ 0\ p] = T'[V, K, 6\ 0\ p\ \f[Sd9 v, K, e\ e\ p] = f[d, v} *, e\ T'[V, K^6, e\ e\ pi pi r[d, v, K, df, d", p] = T^[V, K, 0', d\ p \
so that Then p'[Sv,8K,p]
= T[K, V,K,P'[V9CT'"[V,K,p],p],T'[V, = criv[p, K, p'[v, &"\y, K, p], p], pi
0,
say.
Thus altogether [p'[Sv,SK,p] =
o**[v,K,p'[v,
This is a recursion with substitution in parameter. Finally p[v,K,p] =
Pt[pf[v,Kip],SK].A1K
We can obtain an explicit expression for p because we can do so for p1 by the method used for recursion with substitution in parameter. 5.10
Simple nested recursion
The scheme of simple nested recursion is: f p[O,f] =
The scheme of substitution in parameter is a special case of this scheme, and this scheme in turn is a special case of a similar scheme but with the omission of the second argument of r and replacement of 0 by 80. Define a[ic] for cr[l], where {!} = K, ] for {8W|>,A,I]}, V]
for
r[>,A',..
p[v,K] for p[v,f\.
5.10 Simple nested recursion
253
Then ] _ 9p[v9
[p[8v9 K] = T[v,p[v,
Define
P(V,K]
K]9 K]]9 K].
for <£[>,£]>,
then f
0£
where ^ *'], g], g> where
cr[>, A, AC] for
Jfaa; [Max \a^r)[v9 A,
and /c' can be any number greater than or equal to K. = Tj[v9p[v,K']9p[v9*"[v9p[v,K],K]]9K]
say,
where a" is any function dominating cr. Now consider P[V,K]
for
{v9K9p[v9K]}.
Then p will satisfy a similar first equation and also P[SV9K] = {Sv.K.f^V.plv.K^plv^lv.plv.K^K^K]},
(i)
here ^"[P, p[^, /c], /c] may be replaced by {^^9K]9p[v9K]9p[v9K]]9p[v9K]}
= ^[Pt^/c]]
say.
Clearly each of i^, /SP, AT, jo[V,/c], ^[^, &'"\p[v, K]]] is a primitive recursive function of p[v9 (rf/'\j5[v, K]]].
Thus we can take the r.h.s. of (i) to be f\p[v, crm\p[v, K]]]]. Thus it suffices to consider the scheme p[0, K] = or'M, p[Sv, K] = r[/>[i;, cr|>[i;, /c]]]], where a has a left inverse o-L, i.e. crL[cr[i/|] = j;. Therefore it is sufficient to consider the scheme for p*, where p*[v, K] = (r[p[v, K]], p*[09 K] =
p*[Sv, K] =
Therefore we now consider the scheme p[0, K] =
p[8v,
K] = T[p[v9p[v9
K]]].
/c]]]].
254
Ch. 5 A00-Definable functions
For each v, the function \xp\y, x] is a composition of cr's and r's. Thus p[0,
K] =
OT[K],
3 , K] = T3aHo-2T2(T2T(r2[K]
etc.
Define a function x by \X[Sv, K] = (< (
if Sv is odd,
l^^ljf[v, K]]
if Sv is even.
This is a primitive recursive function and p[0,
K]
=
p[Sv, K] = ^ [ 2 ^ , /c].
*[K],
Thus simple nested recursion has been reduced to primitive recursion. 5.11
A Iternative definitions of primitive recursive functions
D 165
BtO = 0, Bt[Sv] = Rtv + B^SRtv)2 JL 5^],
then Rtv is the greatest natural number whose square is less than or equal to v, it is called the square root function. P R O P . 4. All primitive recursive functions are obtainable by adding to the initial functions the four functions; addition, multiplication, limited subtraction and square root and using the schemes of substitution and pure iteration without parameter. The scheme of pure iteration without parameter is pO = a, p[Sv] = cr[pv], so that
pv — ^(k^v.
C[T)])
av.
We first reduce the set of parameters to a single parameter. D166
Ev
for
v-(Rtv)2,
the difference between v and the greatest square less than or equal to v. D167
J[V,K]
for
(V + K)2 + V,
D168
Uv
for
Ev,
D169
Vv
for
Rtv^-Uv.
5.11 Alternative definitions of primitive recursive functions Then
UJ[v, K] = v,
255
VJ[v, K] = K, J[Ud, V0] = 0.
Thus if J[v, K] = J[v', K'] then v = v' and K = K'.J is a pairing function, it gives the number of an ordered pair in a numbering of all ordered pairs. It is sufficient to show how two parameters may be reduced to one parameter, because the repetition of the process will reduce any number of parameters to one parameter. Suppose then P[V, K, 0] = CT[V, K],
p[v, K, Sd] = T[V, K, 0,p[V, K, ff\]9
so that p is a primitive recursive function with two parameters. Let p'[v',0] =
Then
Thus we have eliminated one parameter, hence SA parameters can be reduced to one parameter. Now suppose that P[K,
0] = cr[K], p[K, S0] = T[K, 0,P[K, ff]]9
so that p is a primitive recursive function with one parameter. Let p'[K, 0] for J[K,P[K, 0]], then p'[tc, 0] = o-'[/c],
p'[K, 80] = T'[0,P'[K9 0]],
(i)
where
also
for
J[K,CT[K]]
and
T'[K,V\
p[/c,0]=
for
J[UV9T[UV,K9VV]],
Vp'[K90].
Then p is defined by substitution and iteration with one parameter as in (i), using addition, multiplication, limited subtraction and square root as additional initial functions. Take p"[cr'[g\, Q instead of />'[£, f] and we have
Then
p'{U]=p"[
In the construction of p' from initial functions by schemes the two functions or' and r' will have been previously constructed. So that if they can be constructed as in the proposition then so can p'. Thus
256
Gh. 5 A00-Definable functions
We can replace (i) by a scheme of pure iteration with one parameter, this we now give, though it is independent of what is to follow. Write p"[g, 0] = J[0, g , where
T"[T)]
p"U, SQ = r"lp'U, £]],
(ii)
= J[SU?i,T'[UV,Vr}]] and p'\g, £] = Vp"U,Q-
Thus p'" and hence p is defined by substitution and pure iteration with one parameter as in (ii), using addition, multiplication, limited subtraction and square root as additional initial functions. We now wish to remove the single parameter. Write ] for
K£ Then Also if
for
UR%,
for
U£
UlJ1\£,V] = £, V1JM,v] = VVj.SZ + 0 then V^^U^
and
V^
= 8V^.
For if U8£ 4= 0 then 8£ fails to be a square and so Bt(SQ = Rt£, whence
and
For this set of pairing functions the equations TJX £ = a, P^ ^ = /? have many solutions for £, and J x [[^ ^, V± Q = ^ can fail. Suppose
as in (i). Write />'[£] for p[Ux £, Fx Q then
where
.
L D ,. f J
, „
„ _ if
Whence
I^L
^'K?7/] = ^i[^S] • - ^ I K ^?] + T K C ^] • ^ i K ^G*
Note that ^[O] = 0, ^[/S^] = \[A± Q, where 1 is the constant function 1,
5.11 Alternative definitions of primitive recursive functions
257
i.e. S0v And B±[Q = 1 -^- A^Q. Thus recursion without parameter suffices for the definition of Ax and Bv Now write p'% for [£,//£] then p'0-0,
P"[SQ
= T"[p'n
where r" for JISU^^'I^TJ, 1^]], thus //' satisfies a scheme of pure iteration without parameters, also p% = V±p'%Thus altogether if we adjoin addition, multiplication, limited subtraction and square root to the initial functions then the schemes of substitution and pure iteration without parameters suffice for the construction of all primitive recursive functions with any number of parameters. COR. All primitive recursive functions are obtainable by adding to the initial functions the two functions addition and E and using only the schemes of substitution and pure iteration without parameter. We have to define limited subtraction, multiplication and square root in terms of addition and E and the initial functions and the schemes of substitution and pure iteration without parameter. We first define multiplication and square root in terms of addition, limited subtraction and Q and the schemes of substitution and pure iteration without parameter. Here (1 v is a perfect square, Qv = \ [0 otherwise. Let
F0 = 0, FSv = SFv + 2. Q(Sv).
Now Sv is a square if and only if v = K2 + 2. K, if and only if Fv = K2 + 4/c, if and only if Fv + 4 is a square, thus QSv = Q(Fv + 4). Thus FSv = S(Fv) + 2. Q(Fv + 4) where
B6
for
Sd + 2.Q(d + 4) and
Fv = v + 2[Etv].
Thus F is obtained by pure iteration without parameter from the functions Q, S and addition. Again (Sv)2 = SF(v2) so that the function 'square of is obtained by pure iteration without parameter from the initial functions Q, S and addition. Let TO = 0, TSv = B1[Tv] + 2.B1[(Tv^
1) + (1 - Tv)].
Then Tv is the remainder when v is divided by 3 and is obtained from
258
Ch. 5 A00-Definable functions
addition and limited subtraction by pure iteration without parameter. Now let O0 0 so that G is defined from 89 T and addition by pure iteration without parameter. Then we have
~-PM-
Thus we have denned multiplication and square root in terms of addition, limited subtraction and Q using the initial functions and the schemes of substitution and pure iteration without parameter. Lastly we define limited subtraction and Q in terms of addition, E and the initial functions and the schemes of substitution and pure iteration without parameter. We have Qv = B1[Ev],
(v-=- K) =
since the preceding square is(y + /c)2 This completes the demonstration of the Corollary. 5.12 Existence of a calculable function which fails to be primitive recursive At the beginning of this chapter we defined a calculable function of natural numbers to be a rule or set of rules such that given a natural number we can, using the rules, effectively find another unique natural number called the value of that function for that natural number as argument. We then gave a method for obtaining rules of this type, the resulting functions were called primitive recursive functions. We also gave some other methods for obtaining calculable functions which at first sight appeared to be more general than the rules previously given, but we showed that they only produced primitive recursive functions. There are however other methods for obtaining calculable functions of natural numbers which lead outside the class of primitive recursive functions. We have already mentioned the calculable function constructed by Ackermann which failed to be primitive recursive. Ackermann constructed a calculable function of natural numbers which
5.12 Existence of a calculable function which fails to be primitive recursive
259
increased faster than any primitive recursive function. If we examine D 123, 124, 125, we see that they follow a pattern; we can continue the series by defining a sequence of functions for v = 0,1,2,... as follows S) KIT
= K + 7T,
A^/c, n] =
. A0[/c, 9/]) On = KXTT,
A2[/C,TT] =
. AJ*,?]) In = K\
and generally c, n] = (X£T/ . AV[K, T/]) Kn
Then
\[K,TT]
= K+
(
0
for
> 1.
n,
for
v = 0, (i)
1
for
i/=l,
K
for
^ > 1,
continues the series. This is like double recursion with K as parameter except that the unknown function A is nested in itself instead of having a known function nested in the unknown function. It is easily seen that this function is calculable because the values are given outright on the co-ordinate axes and the value at {Sv, Sn} is given in terms of values with lesser v or same v and lesser n. In simple nested recursion the unknown function is nested in itself but then the recursion is on one variable only while here it is a double recursion. A scheme such as (i) is called double nested recursion. This scheme can lead outside the class of primitive recursive functions, this is why we have put v as a subscript rather than as an argument. If the scheme (i) gave a primitive recursive function of v, K, n then by the scheme of substitution the function X^.A^[^,^] would also be primitive recursive. But this is the function which Ackermann showed to increase faster than any primitive recursive function. The reader might like to evaluate on a wet afternoon, this should convince him of the reasonableness of Ackermann's assertion! We will, however, obtain another example of a calculable function which is outside the class of primitive recursive functions. The gist of the argument is this: we first give an effective enumeration of 9-2
260
Ch. 5 A00-Definable functions
Aoo-functions of type u. This amounts to an effective enumeration of all primitive recursive functions. Let the enumeration be: P\>P\\>P\\\>—>P\K\>—> where \K\ denotes a sequence of tallies obtained by replacing each 8 in K by a tally and omitting 0 and parentheses. We now form the sequence: S(p\ l),S(p\\ 2),S(p\\\ 3),..., as far as we please. Clearly this gives us the successive values of a calculable function of natural numbers. Suppose that this function is equivalent to a primitive recursive function, then by Prop. 1 of this chapter it is A00-represented by a closed A00-function of type u. This A00-function will then appear in the above enumeration, say it is plKl, then plKl v — S(plv{ v) for each numeral v, in particular if v is K we obtain: pllcl/c = S(P1K1K) as an Aoo-theorem, hence Aoo will be inconsistent, which is absurd. Hence this set of rules leads outside the class of primitive recursive functions. In the above we wrote plKl v, where \K\ stands for a sequence of tallies, rather than/)[/c, v], because the latter means that there is an Aoo-functionform p[TL, v] from which p[K, v] arises on replacing I\ everywhere by /c. But the numeral K may fail to occur in plKl v so that plTil v is nonsense. This actually occurs, we have defined a sequence of one-place functions rather than a two-place function, it is in this way that the absurdity is avoided. Aoo is so poor in modes of expression that the enumeration fails to be Aoo-definable, that is to say that there fails to be an A00-function or of type m such that
£ = # = 1 2 3
& 4
v 5
/
t
6
X 7
( 8
) 9
These numerals will act like names of the symbols above them, they will be called the Godel numerals (g.n. for short) of the A00-symbols. Each numeral denotes a unique Aoo-symbol and each Aoo-symbol is denoted
5.13 Enumeration of primitive recursive functions
261
by exactly one numeral. We now replace the Aoo-formulae by numerals as follows: The null formula is replaced by zero. The Aoo-formula consisting of the single Aoo-symbol 2 is replaced by the numeral determined by (v) where v is the g.n. of 2. If the Aoo-formula O has been replaced by the numeral v and the Aoo-formula W by the numeral K then the Aooformula OnxP is replaced by the numeral determined by vnK. These numerals are called the g.n.'s of the corresponding Aoo-formulae. Similarly the g.n. of a sequence of A00-formulae whose g.n.'s are respectively v',..., v(7r) is the numeral determined by (vf,..., v(7r)). These numerals act as the names of the Aoo-symbols, formulae or sequences of A00-formulae as the case may be. The context will make it clear which is intended. Thus a numeral may just represent a natural number or act as the name of an Aoo-symbol or as the name of an Aooformula or as the name of a sequence of Aoo-formulae. Given a numeral v we can find the Aoo-symbol, the Aoo-formula and the sequence of Aooformulae of which it is the name. The g.n.'s have been so chosen that each numeral has been employed in each of the three cases. A numerical term a is called a normal numerical term of kind K if it is either (i) 0 or (ii) one of x,x'.x",..., .x®•K) or (iii) Sfi where J3 is a normal numerical term of kind K or (iv) {{{J{\x®-K+1\'kxV-K+®p)))y)8), where 8 and y are normal numerical terms of kind K and /? is a normal numerical term of kind SK. TO test if a numerical term a is a normal numerical term of kind K we have to test if proper parts of a are normal numerical terms of kinds K or SK and so on until we have to test whether a single symbol is a normal numerical term of kind K' for some calculable K'. For instance if oc satisfies case (iv) we have to test the proper part fl of a to see if it is a normal numerical term of kind SK, if /? is ({{J(\xV'K+*\\x®'K+*>P'))) y') 8'), then we have to test the proper part /?' of ft to see if it is a normal numerical term of kind SSK, and we have to test the proper parts y', 8' of fi to see if they are normal numerical terms of type SK. The process is easily seen to terminate because we are referred to shorter and shorter formulae. The kind required is easily found at each stage of the testing. Hence to be a normal numerical of kind K is a calculable property of a numerical term, and hence of a numeral considered as the g.n. of a numerical term. A normal numerical term of kind K has at most x, x', x", ...,x{2-K) as free variables. This is correct if the term consists of a single symbol, if it holds for terms with v symbols and any kind then it follows at once for terms with Sv symbols and any kind.
262
Ch. 5 A00-Definable functions
An Aoo-function of type u is called a normal function of type u if it is (Ax. a) where a is a normal numerical term of kind 0. Note that if v is the g.n. of T and/{y} is equal to the g.n. of 0{T}, then f{v} is of the form A'VA" n ... nA<*> VA^> where is o ^ and T fails to occur in <J>',..., $<£*>, and A' is the g.n. of O',..., X{Se) is the g.n. of O«w. The function p\K\\y\ qua function of K and v is called an enumerating function for the class of primitive recursive functions. The one-place Aoo-functions p and p' are called equivalent ifpv = p'v is an A00-theorem for each numeral v. We wish to enumerate the one-place Aoo-functions; to do so we need only enumerate the one-place normal functions, because an Aoo-function is equivalent to a normal function by change of bound variables. In enumerating only the normal functions we merely eliminate some duplicates. The normal functions differ from the other functions merely by choice of bound variable.
(
0 if v is the g.n. of a normal numerical term of kind/c, 1 otherwise.
Then ^[0, K] = 1, (0 is the g.n. of the null formula), and t-^Sv, K] = 0 is equal to the principal characteristic function of Sv = 1V(D[2,8V]
=
^ j ^ 10 + 2.
V (E£)v [Sv = <8,1>T<9> & y & K] = 0] V (JE£,£',£")y[8v = <8, 8, 8, 6, 8, 7,11 + 2.K, 8, 7,12 + 2.K) = tjg.k] = m\K\
= 0].
This is a course of values recursion with substitution in parameter. We have given sufficient instructions to find an explicit Aoo-definition for h[v> *]• The four clauses in the last statement are: Sv = 1; i.e. 8v is the g.n. of an A00-formula consisting of the single symbol 0, i.e. Sv = (0) = 1. The second clause says that Sv is the g.n. of a formula consisting of a single variable from among x,x\x", ...,x®'K\ i.e. Sv = <10 + #>, 0 ^ 6 < 2.K. The third clause says that Sv is the g.n. of an Aoo-formula Sfi where ft has g.n. 6 and t±[d, K] = 0. The fourth clause says that Sv is the g.n. of an
5.13 Enumeration of primitive recursive functions
263
AOo-formula (((S(%zP'K+1\teP-K+*>0)))y)S), where y and S have gr.n.'s 6 and 6' respectively and t^d, K] = t^d', K] = 0 and JS has g.n. d" and t-^d", SK] = 0. Notice that when we take the g.n. of an Aoo-formula we must first put the formula into full primitive notation without any definitional abbreviations. Also here we are dealing with Aoo-formulae even if they consist of a single symbol. D 171 tn[v] is the characteristic function of (E£)v{v =<8, 7,10)T<9> & *!& 0] = 0). (0 if v is the g.n. of a normal function, Then tn[v] = \ [l otherwise. D172
S±[v] for
then Si[v] increases by unity each time tn[v] = 0, i.e. each time v is the g.n. of a normal function. Thus S-^v] is the number of normal functions with gr.w.'s less than or equal to v. We can find an unending sequence of normal functions, namely: (T^x.x), (TiX.(Sx)), (Xx.(8(Sx))),.... Thus for each numeral K there is a least numeral v such that /S^IV] = K, and v will be less than or equal to the g.n. 1C[K] of (7<x.(8(8...(Sx)...))). K-times
I A[0] = <8,7,10,10,9), k[8v] = (Pt[k[v], £])\8,1,10>n<9>. Hence the least numeral v such that 8-^y\ = K is a primitive recursive function, &[V] is the primitive recursive bound for the least number operator. D174 f[K] for (\4)ut,a(81[g} = K). Then/is a primitive recursive function which enumerates the g.n.'s of normal functions. Another method of enumeration is given in Ch. 12. D 175
Num [0] = 1, Num [Sv] = <8, ifNum Mn<9>.
Then iV^m [i^] is equal to the g.n. of the numeral v. Note iVwm [0] = (0> = 1. The numeral v is considered as a formula even if it consists of a single symbol. From these definitions we see that
264
Ch. 5 Aoo-Definable functions
is equal to the g.n. of (plKl v), where plKl is the Kth normal one-place function, it is a primitive recursive function of K and v. Again
is equal to the g.n. of (S(plKlv)), it is a primitive recursive function of K and v. Note that if the suffix K in plK] were an argument of an Aoofunction rather than the place number of an Aoo-function in a list of Aoo-functions then it would have appeared in the g.n. of (S(plKlv)) as Num [K] rather than just as K in the argument of/. Let Val be that calculable function of natural numbers whose value for a numeral 6 is the numeral determined by the closed numerical term whose g.n. is d provided that 6 is the g.n. of a closed numeral term and whose value is zero otherwise. Given 6 we can decide whether it is the g.n. of a closed numerical term or otherwise, and if it is the g.n. of a closed numerical term then we can find the numeral which it determines. Hence given a numeral 6 we can find a numeral v such that Val [6] has the value v. If Val were a primitive recursive function then by the scheme of substitution: Val [(8, l,8)nf[/