This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
r)
O. Since 13 € l-sc(F) and CLF satisfies (1)' such an n will be reached. The reqUired value is then CLF(S(n» - 1. This concludes our sketch of the proof of 3.93. 4.
~-computability of ~
fan-functional
The problem solved in this section was first posed in Kreisel 1959. It was solved by Tait (unpublished) around 1962; the solution presented here is a somewhat streamlined version of one which was presented by Gandy at the 1965 Leicester summer school and colloquium. 4.1.
Any y
€
Cl determines a compact subset K of Cl defined by y K = {S: 13 s y}; y
every compact subset of C I is included in some Ky' tional ~ is defined by ~(F,y) = (un)
(IV
13,13'
€
Ky )[13
=n S·
.... F(S)
=
The fan funcF(B')];
it gives a uniform modulus of continuity for F on K • Y
4.2.
Proposition.
~
is recursively countable.
Proof. Let CL F be any associate for F. By Kanig's lemma there is a finite subset, X say, of {u: CLF(U) > O} which secures every 13 in Ky ; X can be computed from CL F, y. And then, for any u on the tree for Ky above the bar X, F is constant on Vu n Ky iff eLF takes the same value on all those points of X which lie below u. Hence ~(F,y) can be computed from CL , y. QED. F 4.3. Let us modify the limit space structure introduced in 3.5 by associating with each F € C2 the collection A(F) ={[F CL ]
:
CL
is a restricted associate of F}
of filters. Then, by 3.93, if {e}(F)+, then the functional AG. {e} (G) is eventually defined and constant on every filter of A(F). Thus
FUNCTIONS OF HIGHER TYPE
417
4.31. Theorem Any computable functional of type 3 over C is con2 tinuous on the limit space structure defined by A. On the other hand we shall prove 4.4.
Theorem
For any y which is not eventually zero, and any F
there is a restricted associate a of F such that
AG~(G,y)
is not
eventually constant mn Fa.
B "K
Let
~'
-
y
G so that G
=
(i) n >
(ii) u
VA and
EO
€
Let a be a restricted associate of a 0 for all n. Let VA " F. We shall construct
- I-sc(F}.
F such that a(B(n»
~(G,y)
'I HF,y).
Choose n so that
~(F,y)
Dom A
-+
9,h(u) < n.
The choice of a and condition (ii) on n mean that Vs(n) is disjoint from the support of A. Vs(n) then G
€
VA'
Hence if G differs from F only on
And if, for example, we set G(e) = F(e)+l i f S(n+l)
= F(e)
then, by condition (i),
~(G,y)
'I
c
e
otherwise QBD.
~(F,y).
4.5.
Corollary
4.6.
Remarks (1) The above proof shows what is sufficiently obvious,
~
is not computable.
that Fa converges to F more rapidly than any filter of the structure considered in 3.5. (2) A very short proof of 4.5 is given in 5.3 below. (3) Theorems 4.31, '4.4 remains true if we consider computations in a parameter G € C 2 and understand by 'restricted associate of F' one which secures every B € I-sc(F,G). Thus ~ is irreducible. (4) Another, particularly simple, example of a recursively continuous non-computable functional of level 3 is: '¥ (F ,G, y)
o if
VB"
Ky' F
(B)
=
G (B)
,
1 otherwise
§5.
~
revealing counter-example.
We give a simple construction due to Hyland for an irreducible type 2 continuous functional. following familiar lemma.
The basis of the construction is the
R.O. GANDY, J.M.E. HYLAND
418
5.1.
There is a recursive predicate R which secures every recursive
element, but not every element, of 2 w •
For example take:
R(u) iff (3 x < R..h(u) )[T(x,x,R..h(a»
U(th(u»
A
3y T(e,x,y)} be the e-th r.e. subset of w.
Let We = {x: 5.2. Theorem
For each e there is an Fe
C 2 such that:-
€
(i) Fe is computable from We; (ii) We is computable from Fe and
~;
(iii) there is a partial recursive functional ¢> such that ¢> (d) = Fe (a) for all recursive a; (iv) there is a partial recursive ~ such that if {z}(~, Fe) then {1IJ (z)} (~) = n for any list ~ of numerical arguments;
n,
(v) l-sc (F e) consists of the recursive functions.
~.
For any X
2
€
w, any n
o if
€
w set
3y[T(e,n,y)
A
(V
x,; y)
1R(X(x»],
1 otherwise,
with R as in 5.1.
Fe can of course be coded by an object in C
Then (i) and (iii) are obVious;
2•
(iv) follows from (iii) by an
argument similar to our proof of 3.93, and (v) is an immediate consequence of (iv).
And to prove (ii) observe that if n
f
We then
AX.Fe(X,n) (= G say) is constantly 1, while if n € We then n, Gn(X) = 0 if X is not secured by R; using ~ we can compute the values taken by G on ZW and so decide whether n n Corollaries 1. C
5.3.
z
€
We or not.
contains irreducible objects.
For suppose
Fe - a; by (v) a is recursive, anq so, by 3.91, Fe is recursively countable.
But then, by 4.2 and (iii), We is recursive, which is
false for suitable e. 2.
~
is not computable.
For if it were We would be computable from
Fe by (ii) and so recursive by (v). 5.4.
Remarks (1) Hinman (1973) first proved corollary 1 by a rather
elaborate 'spoiling' construction.
We see that this is not necessary.
But Hinman's argument is not wasted since it can be used to construct an F which does not have the same countable degree as a function. (2) Unlike the
pro~f
in the previous section one cannot immediately
relativise the argument given here to prove that from any G € C Z'
~
is not computable
FUNCTIONS OF HIGHER TYPE
419
(3) For each e, Fe is an extension to Cl of an effective operation1 that is why (v) holds. Harrington has shown that there are objects F € C 2 which are not extensions of effective operations, hut which do satisfy (v).
(4)
If one modifies the definition of Fe to: Fe(X,n)
y+l if T(e,n,y)A Vx < y r"'l'(e,x,y)A R(X(x»), o otherwise, and chooses an e with We not recursive, then the functionals Gn have the following properties: (a) one can compute a modulus of continuity of Gn at X (Viz. Gn(X»1 (h) one cannot compute, uniformly in n, a bound for the value of Gn• This shows that one can hardly hope to find a constructive proof of the classical theorem that if a function is pointwise continuous on a compact set K then it is bounded on K. 5.5. Bergstra 1975 gives a significant extension of Hyland's construction. Let Ra be defined by Ra(u) iff (3x
S
9.h(u»)[T(a,x,9.h(u»
A (u)x < 9.h(u»),
so that if Wa is not recursive then R secures every recursive a but not every a € C l • We define B~ Cl + {O,l} by
°
Bb(a) = if 3y[T(b,a(O),y) A (\Ix s y) a 1 otherwise.
€
C
.,Ra(a(x»),
We state without proof the fundamental properties of B~. 5.51
Theorem
Let a,b be chosen so that Q <
~a
s
~b1
then
(i) B~ is computable from Wb, (ii) Wb is computable from Wa and ~, (iii) l-SC(B~, ~) consists of the recursive functions. 5.52 Remarks (1) By (ii), use of B~ allows one to jump from Wa up to Wbf interesting results are obtained by constructing functionals which allow a sequence of such jumps; see Bergstra 1976 Bergstra & Wainer 1976 and Norman 1976. (2) We conjecture that only the recursive functions are recursively countable in B~ - this would give a significant strengthening of (iii) of the theorem. A disproof of the conjecture would show that countable recursion at type 3 is more powerful than computation using~. We now turn to an entirely different proof of this fact.
l
420
R.O. GANDY, J.M.E. HYLAND
§6.
~
functional r
Gandy spent some time trying to prove the conjecture: every recursively countable object of type 3 can be computed from~. He then discovered the object r which made the conjecture implausible. Hyland proved that it is false.
6.1.
Definitions (A) Let n,u
n*a, u ....u
€
C
and P*n, p .... u
l
(n*u) (0)
=
(n*a) (i+l)
n,
(P*n)
P(n*S),
W)
F(u~S)
€
(1)
2
for all l
We define
as follows. u if i < 9.h(u), i a(i-9.h(u» otherwise;
= u (L) ,
(p....u )
r
C
(u .... a) (L)
This p .... u mirrors on C
(B)
seq, a, P be given.
€
€
B.
the behaviour of P on V u'
C 3 is defined by
r (P) =
(F·O) (An.
r
(p* (n-t-L)
•
6.2. Theorem r is uniquely specified by the above equation, and is recursively countable. ~.
(2)
If r satisfies (1) then
=
r(p .... u)
(P"u·O) (An. r(F"u*(n+l») for all u ,
Given B
and an associate up for F we can compute an n such that F ..... ii (n ) is constant and so (3)
r (P"" u)
F(S) i f B(n) .:. u ,
Since every path B is secured at some point Sen) where (3) applies, equation (2) defines r(F"u) at all points of the well-founded tree of non-past-secured sequence numbers u. y(u)
= r(F~u)
y (u)
Further if we set
then y satisfies
=
(P~
u*O) (An. y (u* (n+l) ) ,
so that y is recursive in any associate a p•
6.3.
Remarks (1) The computation of rep)
bar-recursion.
The determination of rep)
Pinally rep)
=
y«
QED
».
from a p is an instance of from P differs from bar-
recursion in that there is a single equation (2)
which applies at all
points u, rather than a division into cases according as to whether u is secured or merely securable.
(2) Another way of seeing that r is well-defined is to regard it as defined by recursion with respect to Brouwer's inductive definition of C
2:
FUNCTIONS OF HIGHER TYPE
421
(L) f(ACl. k) = k (11) If F (o ) GCl (0) (An. Cl (n+l», then rtF) Go (An. f(G n+ I » . (3) The functionals say, defined by ~(F)
~
and f can be combined into one functional, 6
= ~(F*O,
An.
~(F*(n+l»).
(4) For a given associate Cl of F there will be an n such that the F computation of f(F) does not require knowledge of F(Cl) for any Cl with Cl(O) > n. But, by suitably choosing Cl F' we can make n arbitrarily large. Thus we cannot determine in an associate-invariant way a compact set K such that f(F) depends only on the behavious of F on K. So it is implausible that we could compute r from ~.
Before we prove that r cannot be computed from ~ we introduce a device due to Bergstra by which computations from ~ can be replaced by computations using only arguments of type 2. For any F let H be F defined by: {: R.h(u) s ~(F,y) 1\
F (V
u
1\
(lfi < R.h(u»)(u i s y(i» = {p}}.
n K )
Y
Thus HF(y) codes, in effect, the behaviour of F on the compact set {a: 'Vi. a (i) s y(i)}. Theorem (Bergstra) (i) H is computable from F, ~. F (ii) F(y) and ~(F,y) are computable from HF(y). (iii) There is a partial recursive ¢ such that if {e} (F,~) = y, then {¢(e)}(HF) = y. (i) and' (ii) are obvious from the definitions. The proof of (iii) is by a fairly tedious induction on the computation and can be found in,Bergstra 1975. We also need to connect the filters associated with HF with those associated with F. Put 9.h(u) 1\ (V i < 9.h(u» (vi s u i» i. ~h(v) Lu = U {vv: v £ L (u)}. For any finite set A of ordered pairs (u,p) let F A = { (v, F (b .... g» : ( '3u e Dom A) (v £ L(u»} L(u) = {v:
where 0
An.O.
R.O. GANDY, J.M.E. HYLAND
422
6.33 Lenuna
V
(i) HF is constant on V iff ( v £ L(u» (F is constant on V u v). (ii) If G = L F and HF(V ) = {p}, then HG(V = {pl. U u) u (iii) F
V F iff HF
£
VA'
£
A
(iv) If G
V F then HG
£
£
A
VA'
All these facts are immediate consequences of the definitions involved. We may note in passing that although for simplicity we used F, F to define A in fact it can be defined (so as to satisfy (iii) and (iv»
recursively from A.
Now we are ready to prove the main
result of this section. 6.4.
Theorem (Hyland) r cannot be computed from
Given any defined computation {e}(F, so that {e}(G,
~)
= {e}(F, ~)
~)
but r (G)
~.
we shall construct G
t- r tr},
We consider the computation {¢(e)} (H of 6.32(iii) and choose a F) restricted associate a for H which is fine enough to ensure that if F a W £ 6 then W F contains a G satisfying r(G) t- r(F). A A
Choose 6 to satisfy: (i) i f 6' " 6 then 6 ~ I-sC(H F); V x 6 (x) > O.
(11)
Let
Y:
An.
r(F-6(m)*(n+l», and let
(6- (m) *0) '" YF ;
then
m
(11 rtF
6(m»
= F(sF) m
for each m.
-F Let v m = Sm(m + 6(m», and let a be a restricted associate for H F
which satisfies: (a) a(6'(m»
=0
for all m, all 6' " 6;
(b) a(u) = 0 if 3m,u'. u ~ u' By 4.31 there is an A such that VA
v
1\
£
6
a
m
£
L(u').
and AH. {¢(e)}(H) is con-
stant on VAl then, by 6.33, (2) i f G
£
V F' then {e} (G, ~) = {e} (F, ~). A
By (a) we can find M such that if 6' " 6 then V (M) does not inter6' sect the support of A; therefore F
(3) V6' (M) n support (A )
=
~.
FUNCTIONS OF HIGHER TYPE
423
For m < M define and
Pm
max{ (u)m+l+o(m): u
wm
v m* (Pm+l) •
Now by (b) i f v
€
Dom(A
F)
€
Dom A}
then v
i
F) n Support (A
Hence
v m'
(4) Vw = ¢. m Finally let G agree with F except as required by: G(V 5(M» G(Vw ) G(V
m
wo )
{PM- l + I}, + I} for 0 < m < M, m- l
{P
{f(F)+l}.
Then, by (3) and (4), G satisfies the premise of (2). f(G~5(M»
Also Hence
G
YM_l(n)
G
Therefore BM-l
r
= P M l
+ 1.
F
1
YM_l{n) if n < o (M-l)
PM-l +1
ifn
o (M-l)
- 1.
and, by (1), M-l (G ...... 6 (M-l» = Pm-2 + 1. €
Vw
Repeating this argument for rna M-2, ••• ,O we find that hence that f(G)
= r(F)+l.
G BO
€
V
wo
and
This concludes the proof. Remark. Hyland's original proof proceeded directly by an induction on the computation of {$(e)}(Hp).
In the context of this paper it
seemed more appropriate to appeal to Theorem 4.31. §7
Effective Operations In a thoroughgoing constructivist theory of mathematics all
objects considered will be, in some way, recursively presented.
In
this section we investigate the lower levels of the type structure R Rl
= {Ra:a a type symbol} of recursive = {a: a is recursive}. For higher
We set Ro = w, types we require an object to
objects.
be presented by a recursive associate which defines a value when-
ever the arguments are recursively presented. of the argument in 3.1-3.3 we define 'a associate for a' follows.
(abbreviated to 'a
€
Following the pattern
R ' and 'a is a quasin GA(a»' by induction on n as €
424 7.1
R.O. GANDY, J.M.E. HYLAND Definition (i) If a € Rl, then a € QA(a) iff a = a. (ii) If a:R + w then a € QA(a) iff: n+l (a) (lib € R (Ii 8 € QA(b» [8 is recursive n+ l) 3m.a
+
(b)
(Vb €
R ) ( \J 8 € QA(b» n+ l [a(B(m» f 0
(iii) a € R iff n+2
(B (m) ] ( V 111) +
f
OJ;
a(B(m»
= a(b)+lJ.
:::la.[a € QA(a)A a is recursive].
7.2
Remarks (1) R2 consists of just those partial recursive functionals from Rl into w which are defined on R and which are conl tinuous on R l considered as a subspace of C l• (2) If F € C 2 is recursively courrt.ab Le then F ~ R € R l 2• (3) But the converse of (2) is false. For example, if P secures j
every recursive but not every sequence and F
= Aa.
(vm)P(a(m»
F € R 2, but F cannot be extended to an element of C 2•
R2
i
{F
Rl: F
€
then
Thus
C 2},
and R is not a sub-structure of C. (4) The significance of the clause '8 is recursive will be clarified below (see 8.3(2». (5) R can also be defined as a category of limit spaces. the definition for R
2(cf.3.5).
VA = {F: F is a continuous map from R A
If
u,p.[ (u,p)€A
Let be a numerical code for A.
l
+
into w F(V
u)
= {p}J}.
We say that the filter
rated by {VA: € X} is recursive just in case X is r.e. easy to see that if ciate iff
6x
6x
+
6x
gene-
It is
converges to G then G has a recursive asso-
is recursive.
limit space R
We discuss
Let
Thus R Z consists of those points of the
w which can be approximated to by recursive filters;
l and for the filters on RZ we take precisely these. tion is readily extended to all types.
This construc-
There is a partial recursive function ¢ such {e}(~) = z and; are quasi-associates (not necessarily recursive ones) for ~, then 7.3 Theorem (Kleene)
that if ~ € R,
{¢(e) }(;)
= z,
The proof of the theorem is similar to the proof of 3.91 and we omit it, and the proofs of the following. Corollary I
There is a partial recursive function
Wsuch
that if
FUNCTIONS OF HIGHER TYPE
425
f
are indices for recursive quasi-associates for ~, and if {e}(~) = z, then {1jJ(e)}(f) = z ,
Corollary 2 The I-section of any a cursive functions.
€
R consists exactly of the re-
Corollary 3 If F € R {e}(F) = z and a is any quasi-associate for 2, F, then AG.{e}(G) is eventually constant on the filter
6a
= {VA: A is a finite subset of {(u,p): o Iu)
p+l} }.
7.4 Remarks (1) Corollary 1 shows that R is closed under computation. (2) Corollary 2 shows that for objects in R 'quasi-associate' and 'restricted associate' are interchangeable. (3) It was natural for us, in defining R, to presuppose the existence of higher type objects and to think in terms of extensions. Corollary 2 suggests that one can also think entiT~ly in terms
of intentions (indices) and recursive functions. how R can be approached in this way.
We now indicate
7.5 We first define a type structure 0 = {Qn: n € w} and a relation 'e € In(a)' (to be read as 'e is an index for the object a € On') as follows: (i) (11)
Io(k) = {k}. o = Wi + (a) = {e: a maps Qn into w
Q
I
n l
A(V b € Q ) (V f € In(b))[{e}(f) = {a(b)}J}, n where {e} is the partial function w ~ w with index e of aRT.
(iii) Qn+l = {a: In+l(a) ~ ¢}. Thus Ql = Rl and II (a) = {e:a = {ell. One can also define the indices for members of Q without refn erence to higher type objects as follows: 7.6
Definition (L) Eo = Wi (ii) En + l
{e: En
~
(11)
j =ok <-> j = k.
Dom({e}) A ('if f,f' € En)[f =nf' .. {e} (f)
(iii) e =n+le' <-> (liff
€
{e} (F')].
En)[{e}(f) = {e'}(f)].
From the definitions one easily proves:
En
= U {In (a):
a
€
Un}'
One can, of course, define the members Qn in terms of the equiva-. lence classes of the relation =n' (See, for example, Kreisel 1959).
R.O. GANDY, J.M.E. HYLAND
426
We shall call the members operations. Observe that weaker definition, due to with hereditarily partial 7.7
of 0 (rather than their indices) effective if e € E then {e} is total on En' A n+ l Myhill and Shepherdson 1955, is concerned functions.
Theorem (Kreisel-Lacombe-Shoenfield) For each n, On = Rn• The difficult part of the proof (which we omit) is to show that
Qn ~ R • Details (at least for the crucial case n = 2) may be found n in Kreisel, Lacombe and Shoenfield 1959, Gandy 1962 (a very succint
proof), or Rogers 1967.
The proof is effective in the sense that it
provides a primitive recursive function $ such that if e € In(a) then $(e) is an index for a recursive quasi-associate for a. For n > 2 one needs an effectively indexed dense basis for R and the n_ l decidability of various facts about the filter bases in R The n_l• necessary proofs are sketched in Kreisel 1959 and given in detail in Tait 1963 and in Hyland's thesis. Hyland also gives the definition of R and Q for arbitrary type symbols 0 and proves R = 00' He o o o observes that a ~ proof of R = Q seems to require that 0 has o o the form 1 ~ O. The proof that Rn+ l ~ Qn+l follows readily from Qn ~ Rn by using corollary 2 of 7.4. Finally it should be observed that the proof of 7.4 is not constructive - it requires, unavoidably an application of Markov's principle. ation see Beeson 1975. §8
~
For an exact description of the situ-
effective operation which is not computable
In this section we construct a ~ € R 3 which is not computable. Let S be a recursive predicate of sequence numbers which does not secure every sequence, but which does secure every sequence which is dominated by some recursive sequence. E.g. S(u) <--> (3
x<~h(u»[T(x,x,~h(u}}A(u)x< ~h(u}J.
(I) If a s B € R1, then (3m}S(~(m}). Consider the continuous map of Baire space into Cantor space (binary sequences) given by: a
B
0 ••• 010 ••• 01 •••••• 10 ••• 01 ••••
a(O}
a(l}
a(n}
This also defines a map of seq onto BiSeq: u
B
0 ••• 010 ••• 01 •••••• 10 ••• 0,
FUNCTIONS OF HIGHER TYPE
(where u
(3u(m)
(u
~m'
For u
427
Seq we define
€
for m < R.h( uB) ,
1
for m
o
otherwise.
R.h(u
B),
Thus (3u is an eventually zero binary sequence which codes u. Let T(= {u:
Vv
secured sequences.
~
u. ,
SlY)}) be the tree of all non-past-
Now we define
A(F) = Max{F«(3u): u 8.1
T}.
€
Lemma A € R 3• Let a be any recursive quasi-associate for some F
€
R2•
must show that A(F) is defined and can be computed from a. = (um) (a (su (mj )
4> (u)
1"
0) ...:... (1
+ R.h(uB
We Set
»;
is the number of O's (if any) needed at the end of (3u to en-
~(u)
sure that it is secured by a. (A)
Thus
is computable,
~
(B) if P
~ ~(u)
and v
u*p, then
~
F«(3V) = F«(3u*~(u»' Now let $ be recursively defined by: $(r) =Max{~(u): R.h(u) =rA('tJi
W
So
is computable.
Let X be the tree of sequence numbers bounded
by 1/1; Le.
x
= {u:
(V i <
(u
R.h(u»
i
S 1/I(i»}.
Now by (1) every path through X is secured by S.
Hence by KOnig's
lemma X nTis a finite set which can be computed from a. Finally we show (*) if v Let v u
€
T-X.
€
€
T, then for some u
Then there is u
X and u*p
c
c
€
X n T, F«(3v) = F«(3u)'
v, with R.h(u) = r say, such that
v for some p > 1/I(r).
But $(r)
~
~(u),
and so by
(B) ,
Now v is not past-secured by 5, so u is not secured by 5 and thus u*~(u)
€
T.
And eVidently
u*~(u)
€
X.
This proves (*).
So A(F) = MaX{F«(3u): u
€
X n T}
and thus is defined and computable from a.
QED.
428
R.O. GANDY, J.M.E. HYLAND
8.2
fi is not computable.
~
The argument is similar to those used in 4.4 and 6.4.
Suppose
is defined, where F £ R • Let y be not secured by S and let 2 be a restricted associate for F which does not secure yB. By
{e} (F) u
F Corollary 3 to Theorem 7.3 there is an A such that: (i) G
VA .... {e } (G)
£
(ii) yB
f
=
{e} (F);
Support (VA)'
Hence there is a v
y such that
c
V B n Support (VA) Define G £ R
2
by:
G(B)
fi(F)+l if B F(B)
Then, by (i), {e}(G) fi(G) > fi(F). Remarks.
~.
v
{e}(F).
£
V B' v
otherwise. But v
Thus fi ~ AG.{e}(G).
£
T, and B £V B' So v v QED.
(1) This counter-example is due to Gandy; it was first pre-
sented at the Logic Summer School, Leicester 1965. (2) The example emphasises the significance of the clause 'B is recursive' in 7.l(iia).
fi is eventually constant on any recursive
filter converging to some F stant on
~
filter
£ R while AG.{e}(G) is eventually con2, (determined) by a restricted associate which
need not be recursive) converging to some F
£ R 2• (3) Does the continuity condition of 7.3 Corollary 3 characterise
the computable functionals at type 3?
More precisely suppose
R .... w has a recursive associate and satisfies the continuity 2 condition: is f computable? f:
§9
Partial Objects Many of the objects and constructions which occur in construc-
tive mathematics are required to be total; for example, both Bishop and Brouwer treat a real number generator as a function whose domain is w.
That is one of the reasons why we have so far only considered
functions which are total over the chosen domains.
Another reason
is that if total objects are to be treated only as extensions, then functions with partial arguments will appear as intentions. However, one's understanding of the continuous functionals is a certainly enriched by a knowledge of the partial ones; and, since recursion naturally produces partial functions it is not surprising
FUNCTIONS OF HIGHER TYPE
429
that it is easier to produce a satisfactory recursion theory for partial than for total objects of higher type. This was first realised and exploited by Platek; it is discussed by Feferman in his paper in this volume. In this section we discuss the relation between the type structure C and the type structure CU of hereditarily partial continuous functions which was first explicitly introduced by Ershov (1972). [We say 'explicitly' because CU consists of the lower parts of the lattices defined by Scott (1970). Despite the
a,
r,
c
e
.L v
X
= y.
9.1 Cl consists of all the partial functions a: Co ~> w. partially ordered by the relation of extension: c S <--> V x,y. a(x) = y + Sex) = y.
It is
a
A formal neighbourhood d is a finite (possibly empty) set of We say it is consistent (and write < k}.
the form {(xi'Yi): i 'd € FCN ) i f l
=
. irreduntant i f x , Yi
Yj
+
(Xi
.L
v
=
xj
.L
v
Xi
Xj )
(i,j < k)
c x . + i = j (i,j < k ) • 1 J Each d € r'CN l d~termines a unique finite (or, in the terminology of lattice theory, compact) element d by: It is
d(x)
Yi if Xi
c
x
for some i < k, is undefinpd cthe~lise
Conversely each finite element determines a unique irredundant neighbourhood. Each also determines a 'neighbourhood':
Ud = {a: d .:. a}. The set of neighbourhoods {Ud: d € PCN 1} on Cl•
generates a TO topology
430
R.O. GANDY, J.M.E. HYLAND
9.2. C2 consists of all partial with respect to this topology. lation of extension. ite set {di'Yi): d i
FCN t
'
i
dj
~ i
If A
€
=
j
i < k} such that
Yj ~ (d i u d j)
It is irredundant iff each C
w which are continuous
~>
is partially ordered by the re-
A formal consistent neighbourhood A is a fin€
Yi d
F:C l
C2
(L, j
< k).
di
FCN l• is irredundant and if, further, €
FCN then we set 2
A(u)
{P:A
Yi if
di
c
a for some i,
is undefined otherwise,
F}.
and U = ~ As before each A (or UA) determines a unique A irredundant neighbourhood. The neighbourhoods generate a To topology on C 9.3.
2•
It will now be obvious how to extend these definitions to all
pure types (and, indeed, to arbitrary types).
Further the following
facts are readily verified.
a
(1) An € C+ is completely specified by its values on the finite n l elements of en' (2) The predicates 'is a consistent formal neighbourhood' and 'is
an irredundant consistent formal neighbourhood' and the relation
c
between finite elements are all decidable. 9.4.
We now turn to the connection between
C and
C.
We start by
making some definitions and stating some simple propositions which follow from them.
Then we shall discuss their significance and the
significance of C itself. 9.5.
We are primarily interested in those an
C
€
n
which correspond
to (hereditarily) total objects: following Ershov (1974) we call them everywhere defined ('ED~').
For a recursion theory we shall
also be interested in partial functions of total objects, so we define C; to be the set of partial maps
w.
We set ED o (1)
t
(u) (x) l -1
fro~
We define a map tl:C '"
u(x) •
l
Cn ~
i~to
C
l
w.
by
Then we set ED = t (C Observe that t is not one-.to-one. For l). I l l -1 t (Ax.k) has two members; namely the finite element Ax.k, and the l function Ax.k (which is undefined at ~) which is extended by Ax.k. -1 We shall think of t (a) as an equivalence class, denoted by [a], l for the equivalence relation:
FUNCTIONS OF HIGHER TYPE a ~ S <-->Df ~,8
€
We wish now to define a t 2 t
(2)
2
(F) (t
l
(~»
C*
+
2
2
F (~) for all a
e
=
EDl A tl(a)
• :C
431
which will satisfy €
ED •
We accomplish this by the definition:
t 2 {F ) (a) = y i f
(3)
V~
€
tl(S).
1
[aJ.F(a) = y,
is undefined otherwise.
(C 2), or, equivalently = t -1 2 = {F: V ~ € EDl.F(a)t}.
Then we define ED 2 ED2
(4)
As before we think of ED classes [F)
2
= t -1 (F). 2
as being partitional into equivalence
Roughly speaking, different members of [F)
correspond to different associates of F; the consideration of 3.24 show that [F) has the cardinality of the continuum, has no minimal elements (w.r.t.
~)
but does have a greatest element (corresponding
to the principal associate for F). lattice without least element.
In fact,
([F],~)
is a complete
If we define
(5) m2 (F ) = Sup[F], • then m is the embedding of C in C mentioned by Feferman; it is a 2 2 2 right inverse of t • 2 The extension of these definitions to higher pure types is immediate: (6)
t n+ l: Cn + l
+
C~+l
is defined by t n+ l (il n+ l) (b n) '" (IlY)( (il + n l
(7)
{il n+ l:
(8) 9.6.
(9) mn(a
n)
(b n )
= y);
EDn.iln+l (bn).j.};
€
= Sup[a n].
Hyland in (.1975) stresses, in effect, that to get a true view
of the relation between map t
Vb n
V bndb n])
C and
C one must fix one's attention on the
and the equivalence classes of its inverse, rather than on
the embedding m.
Ershov (1974) provides a number of examples which
emphasise the naturalness of t and the artificiality of m.
We
state here some of the relevant facts.
C
C,
(1) The topology of, say, a limit space: with each 2 makes is associated with the filter {X: 3 A S F. U ~ X} generated by A neighbourhoods of If we apply t then the filter associated 2 each F € [F] is taken to a filter; the set of all these filters
F.
a given F 3.5.
€
C
is precisely the
limit~space
F the with for
structure described in
2 This structure is thus the finest which makes t 2 continuous.
432
R.O. GANDY, J.M.E. HYLAND
(2) To make t 2 continuous in the topological sense we may define a topology T on C2 by taking as the open sets those X ~ C2 such that -1 • t (X) is open in C This is the 'induced topology' mentioned in 2 2• 3.6.
C
(3) Neither m nor any other embedding of C in is continuous 2 2, 2 in the sense of limit spaces. Even m is not continuous since i t l takes Ax.k into the isolated point Ax.k of • l (4) Ershov shows (1974 Example 3) that m does not preserve composi-
C
tion, nor (for appropriate, not pure, types) application.
For ex-
ample
9.7.
In §3 and in this section we have given a number of reasons
for adopting C as the correct definition of 'hereditarily total continuous object of finite type'.
We are now in a position to argue
that the correctness of the definition follows from the following two premises. (A) Hereditarily total objects are extensions. (B) They can be approximated by finitely presented hereditarily partial objects. (B) means that the degree of approximation will be judged by the topology of C.
As we have seen already at type 1 and certainly at
type 2 there will be many different processes of approximation to a given object, F say; the course of such a process of approximation is described by an object (F) of
c
• Since there are many of these, 2 they must be thought of as different intensions corresponding to
the single extension F.
Since we are positing extensions, the pas-
sage from an intension to an extension is a fundamental and natural one, while the choice of a particular intension to correspond to a given extension must be, at least to some extent, artificial.
Fur-
ther, if we have some continuous operation on extensions, then we must be able to approximate to its value at a given extension F by using any of the processes which approximate to F. we require of a continuous operation.
And this is all
Thus the map t:C
~
C is the
fundamental one, and the structure for defining continuity on C must be the finest which makes t continuous.
But, as indicated in
9.6, these facts completely determine C and its limit-space or topological structure. An argument is never so good as when it is against something. The first thought of several investigators in this field has been to topologise C
I
+
w by taking as a basis the sets
FUNCTIONS OF HIGHER TYPE VA = {t 2 (F) :
A ~ F}
(A
= t 2(UA),
We therefore call it the naive topology.
433 €
F~2).
[See, for example, the
definitions at the bottom of pages 223 and 228 of Ershov (1972). This led him to claim (page 241), erroneously, that the class thus obtained coincided with the Kleene-Kreisel continuous functions. (1974) he corrects - without acknowledging - his error]. topology is easy to handle, and makes m continuous.
In
The naive
We have already
seen in 3.6 that it is incompatible with the ideas which motivated Kleene's work.
Our argument shows exactly what is wrong with it -
the topology is not extensional.
One may be able to recognise from
a particular definition of F that it is constantly zero (and so belongs to V(0,O»' but this recognition is not extensional and does not apply to total objects.
And, finally, if one is not interested
in total objects as extensions then there is no reason to construct C at all - everything can be done inside C. 9.8.
We close this section with two remarks
(1) The effective operations are adequately represented in C. can mimic the definitions of 9.5 so as to get a subset RED sively everywhere defined) of en and a many-one map r onto R n•
One
(recur-
n from RED n n
(2) One might suppose (indeed both Kreisel in (1958) and Hyland in (1975) do seem to suppose) that a formal consistent neighbourhood such as A would appear as one of many possible intensions for the set VA of total functions. any type, we have: if VA so that a given VA with VA'
=
But this is so only trivially.
For, at
.
=
VB then A
c
C determines a unique irredundant A' n
€
FCN
VA·
The proof is an immediate consequence of the conditions given in the literature (Tait (1963), Hyland (1975» VA
~
VB.
for deciding whether
It is also stated, in effect but without comment, by
Ershov (1974) Remark 3, page 218). §lO.
Recursion on C and C In this section we discuss the work described by Feferman in
his paper in our own terms. Because every function C + is determined by its values on the n l finite elements of Cn' there can be no doubt about which elements are to be regarded as recursive.
R.O. GANDY, J.M.E. HYLAND
434
10.1. ('
€
.
.
Definition A member a of C is partial recursive n+ l PRe') iff there is a partial recursive function ~:w ~> w such
that ~
«A» ,
where is the nwnerical code for A. Because 'A € FCN n' and 'A ~ B' are decidable there are a number of different natural definitions, all of which are equivalent to the above. It should be observed that the usual Post-Smullyan type inductive definitions for the graphs of partial recursive functions do not provide a monotonic inciuctive defi.l".ition of PRC; for the" proceed via auxilIary relations ( e.g. {(e,x,y): T(e,x,v)}). Hence the interest of definitions by SChSJ'1Clt.a as discussed by F'eferIT1i\n. The obvious way to set up a definition of 'partial recursive' in C is to use the maps t n of 9.5(6). 10.2. Definition We say that a partial function a:c ~> w is n partial recursive and write 'a € PRC iff there is an € PRC l n+ n+ l such that a = tn+l(a). It is easy to see that this definition coincides with the definition given by Feferman (his section 7) and, when a is total, with Kleene's definition of recursively countable (cf. 3.8). It is also equivalent to an extension of our notion of associate-invariant re-
a
cursion. a(b n)
For a
=
PRC iff there is an index e such that n+ l y iff V8.[B is an associate for b ~ {e}(B) = y]. €
All the stated equivalences follow readily from the fact that there is a natural map from the associates of b onto the equivalence class [b]
= t~l (b)
•
10.3. Because of the universal quantifier over ~nn' the definition of PRC l is very st~ong. For example, there is a partial recursive n+ 6: c 2 x w ~> w such that, for any F € RC {x: 6(F,x)+} is a com2 2, plete rr~ predicate. Unlike PRt or RC, PRC i~ thus not closed under substitution for higher type arguments. In this it resembles the computable functionals and the example used in Kleene 1963 also applies to PRC. Let (1)
~m (x)
(uz ) (z=O "" 3 y < xT (m,m,y»
•
Let 6(F,m) F(a m). Then 6 € PRC since 6 = t 3(AFAm.F(am» . But 3, the domain of Am.6(F,m) is {m: If Y» ,T(m,m,y)} and so, for any given F € RC2, Am.6(F,m) f PRCI•
FUNCTIONS OF HIGHER TYPE
435
10.4. Feferman states that PRC 3 is not closed under S.8. We shall disprove this statement, but will first point out why the matter is problematic.
Let ~: C xC ~> w belong to PRe and let ~ = t 3 (¢ ) (with the o 2 obvious modification required by the presence of the numerical argument). Let
f
AF.F
and
r
AF.F(AX.~(x,F».
Suppose that
is not total; it could nevertheless be the
Ax.~(x,F)
case that AX.;(x,F)
£
Dom
F
for all F
[F]
£
(although we have not constructed an example for this). t but t 3 (fJ (F) +, so that from this that r f PRC.
r (F)
10.5.
Theorem
r '!
t 3 (f).
is closed under S.8.
~~C3
W shall show that there is 0/ (1)
I;j
P
£
[F].~
And then
But one cannot conclude
(F) =
£
PRC 3 such that
1 iffAx.~ (x,F) is total.
r
Then
t,(F) '" ~(F). F(AX.¢(X,F», and this establishes the theorem. Let Om be defined by: 0m(X) We will construct ~
¥(P) =
(2)
'or').
£
=0
if x < m, undefined otherwise.
PRe 3 such that, for any F
1 <-> \iX[F
x.
If F(Ox)t
the~ ~(P)t.
pI
(~)
_
F
ED2
(where 'v' is the strong
EVidently if Ax.~(x,F) is total then
Conversely suppose that ¢(X,F)t for some
£
£
V
F [P].V(F) = 1. £
[F] and least possible
If P(Ox)+' let F ' be defined by F(a) i f
::I
x , ~(x)
'! 0,
or if a.:. 0x+l' is undefined otherwise. EVidently pI c F, s~' ~(X,PI)t; ~lso F:(Ox)t. Thud, by (2), ~(F')t, and F' £ [F]. We have shown that (1) follows from (2). To show that there is a partial recursive
~l tA) =
1 if
3y[A (oY+1H " A(Oy) t "
satisfying (2), set
~
Vx
s
y.~ (x,A)],
undefined otherwise. Since A(d)+ is decidable for any A, d, Vl(A) is a partial recursive function of ; and since if ~l(A)
=1
and
A~
B then ~l(B)
= 1,
we
R.O. GANDY, J.M.E. HYLAND
436
And if ~l(A) = 1 and A c F then F satisfies 3• the RH5 of (2). Finally if F € ED and satisfies the RH5 of (2) 2 let y be the least number such that F(O +1)+' Then for each x ~ y • • y. • there must be a B such that ~(X,Bx)+ and B ~ F. Pick one such Bx x x for each x ~ y and take see that ~l
A
=
{(OY+l,F«\+l»}U LJ{B x: x s yL
= 1,
~l(A)
Then
Thus
PRC
€
~l
= 1.
so ~l(F)
satisfies (2) and the theorem is proved.
Corollary
PRC is closed under 51-59. 3 schemes is easily verified).
10.6.
(Closure under the other
Throughout this paper we have emphasised our interest in
total objects.
It therefore seems appropriate to end with a ques-
tion that differs from that asked by Feferman. Question
Does there exist a natural monotonic inductive process
which generates RC (instead of PRC)? 5uch a process would be interesting for at least two reasons. Firstly it should enable one to prove things about RC without detours through PRC or PRC.
5econdly one would expect to get a
method of generating the continuous functionals by relativising the process to arbitrary functions.
The problematic type is 3.
answer to the question at type 1 is, roughly, negative.
The
The answer
at type 2 (where one can
~ RC is positive and given by l) Brouwer's definition of the constructive functionals. The rather
strong properties of PRC exhibited in this section suggest that it may be easier to answer our question than Feferman's. References: M.J. Beeson 1975, The underivability in intuitionistic formal systems of theorems on the continuity of effective operations, J.5.L.
iQ
321-346.
J.A. Bergstra 1976, Computability and continuity in finite types, Dissertation, Utrecht. J.A. Bergstra and 5.5. Wainer 1976, The "real" ordinal of the 1section of a continuous functional, paper presented at the Oxford Logic Colloquium. YU.L. Ershov 1972, Computable functionals of finite type, Algebra and Logic
II
203-242 (367-437 in Russian).
Yu.L. Ershov 1974, Maximal and everywhere defined functionals. Algebra and Logic
II
210-225 (374-397 in Russian).
FUNCTIONS OF HIGHER TYPE
437
R.O. Gandy 1962, Effective operations and recursive functionals (abstract), J.S.L.
£1
378-379.
R.O. Gandy 1967, Computable functionals of finite type I, in: Sets, Models and Recursion Theory, North-Holland, Amsterdam 1967. P.G. Hinman 1973, Degrees of continuous functionals, J.S.L. 38 393-395. J.M.E. Hyland 1975, Recursion theory on the countable functionals, D.Phil. Thesis, Oxford. J.M.E. Hyland 1977, Filter spaces and continuous functionpls, to appear. S.C. Kleene 1959a, Recursive functionals and quantifiers of finite types I, T.A.M.S. 91 1-52. S.C. Kleene 1959b, Countable functionals, in: Constructivity in Mathematics, North-Holland, Amsterdam 1959. S.C. Kleene 1962a, Turing machine computable functionals of finite types I, in: Logic, Methodology and Philosophy of Science, Stanford Univ. Press, Stanford 1962. S.C. Kleene 1962b, Lambda definable functionals of finite types, Fund. Math. 50 281-303. S.C. Kleene 1962c,
Herbrand-C~del
style recursive functionals of
finite types, Proc. Symp. Pure Math. vol.y 49-75. S.C. Kleene 1963, Recursive functionals and quantifiers of finite types II, T.A.M.S. 108 106-142. G. Kreisel 1959, Interpretation of Analysis by means of functionals of finite type, in: Constructivity in Mathematics, NorthHolland, Amsterdam 1959. G. Kreisel, D. Lacombe and J.R. Shoenfield 1959, Partial recursive functionals and. effective operations, in: Constructivity in Mathematics, North-Holland, Amsterdam 1959. J. Myhill and J.C. Shepherdson 1955, Effective operations on partial recursive functions, Ziet. Math. Log. Grund. Math.
l
310-317.
D. Norman 1976, On a pr?blem of S. Wainer, Oslo preprint. H. Rogers Jr. 1967, Theory of Recursive Functions and Effective Computability, McGraw-Hill, New York 1967. B. Scarpellini 1971, A model for bar recursion of higher types, Compo Math.
II
123-153.
D. Scott 1970, Outline of a mathematical theory of computation, Proc. 4th Annual Princeton Conference on Information Science and Systems, 169-176. D. Scott 1976, Data types as lattices, SIAM Journal on Computing 5 522-587.
438
w.w.
R.O. GANDY, J.M.E. HYLAND Tait 1963, A second order theory of functionals of higher type, Stanford Seminar on the foundations of Analysis.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
Aspects of constructivity in mathematics by J.M.E. Hyland §1
'1'he aim of this paper is to indicate points of contact between
the followin g three topics whd ch fall under the general heading of constructivity in mathematics: (a)
continuity of solutions in parameters;
(b)
topological models for intuitionistic analysis;
(c)
functional interpretations of analysis.
My interest in these matters arose from two sources. I} In Kreisel 1959, a classical (or not specifically intuitionistic) notion of constructive result is discussed and specifically related to the notion of continuity in parameters cally real number generators}.
there tllought of as typi-
'l'here VIas a clear need for a system-
atic treatment of the way in which constructive proofs give rise to continuity in parameters, though the main tool Vlhich is used in this paper to make the connection (viz. topic (b) above) did not begin to emerge until Scott 1968, 1970. 2} In a seminar on sheaves and logic organized by Scott in Oxford 1975-76, s orne time was spent proving results constructively and interpreting them in sheaf models over topological spaces.
It be-
came apparent to me that if one was interested only in truth in the topological models (as opposed to ones over arbitrary complete Heyting algebras), then one could dispense with the, at times, rather complex constructive proofs; the truth in topological models depended on simple considerations of continuity. The main results of,this paper are contained in §§7,8: those related to 2} above in §7 and those related to I}
in §8.
§§2-6 contain a
variety of preliminary material ideas; it seems likely that some of these may have a rele to play in othe! areas, for example in the development of constructive analogues to classical model theory. The overall level of detail in this paper is very low.
This will
certainly frustrate some, but I did not want the ideas to become obscured by the presentation. of the three topics in constructivity mentioned above, the one Vlhich receives least explicit discussion is ta~t
(c).
Here there is con-
with an issue alluded to by Kreisel in his contribution to this
volume: the lack of significant returns for functional interpretations. The inconclusive remarks about Dialectica and modified realizability interpretations at the close of §7 and §8, indicate at least why it 439
440
J.M.E. HYLAND
is that the classical mathematician has not used functional interpretations to formalize his constructive intuitions.
Interesting
results can be obtained using less subtle ideas: specifically without using constructive information from premises of implications. It would certainly be of interest to find branches of mathematics where applications of functional
interpretations along the lines of
§§7,8 were needed. We apply notions of continuity in parameters to parameters of various types: natural numbers (N), reals (R), continuous maps R
to
R (R
~
R), continuous maps from R
~
R to R,
and so on.
All
this takes place in a suitable cartesian closed category (FIL of Hyland 1977 say). types.
We describe the topological models for these
However sheaves on topological spaces model much more than
finite types; there is a cumulative hierarchy of sheaves which models intuitionistic Zermelo-Fraenkel set theory (IZF) together with Zorn's Lemma
(ZL).
Then the topological models which we introduce can all
be defined in the intuitionistic set theory; in particular R is modelled as the Dedekind (not Cauchy) reals.
Rather than introduce
any particular constructive theory for the types which we do consider, we take as our basic notion of intuitionistic or constructive proof whatever may be proved in IZF + ZL. The direct use of R (as opposed to a zero-dimensional space of real number
generators) is an innovation not only for topic (c),
but also (for logicians) for topic (a).
But it certainly coincides
with the usual interest of mathematicians. §2.
Some important classes of formulae.
we are considering in this two basic types
Nand
pap~r
R.
'l'he basic language which
is that of finite type theory over
'rhus we have ti'pes for all products and
mapping spaces over Nand R, together with application, functional abstraction, pairing and unpairing of all appropraite types.
In
addition we may allow constant function and relation symbols for arbitrary elements of, continuous functions on and open relations on the basic types.
Also in accordance with usual mathematical prac-
tice, we have maps to allow elements of N to be taken as elements of
R.
Particular classes of formulae frOD\ this language will play an
important rele in this paper. First we describe two ways of defining new classes of formulae. Let f and A be classes of formulae.
We define PR(f;A) to be the
CONSTRUCTIVITY IN MATHEMATICS
441
least class of forI'lulae containing all of fl., closed under and such that if
~
E
r and
~
E
PR(r;fI.) then
(~
~)
+
V,A,
V, 3,
is in PR(r;fI.).
Similarly we define HPR(r;fI.) to be the least class of forI'lulae containing all of ts , c Los ed under 1jJ
E
HPR(r;fI.), then
(~
+
1jJ)
A,
v,
and such that if ~
is in HPR(r;fI.).
E
rand
In case fI. is the collec-
tion of atoI'lic forI'lulae, we write (H)PR(r) for (H)PR(r;fI.).
PR(r;fI.)
is the collection of formulae built up from fI. with Qremises restricted to rand HPR(rifl.) the collection of garrop formulae so built up. These ways of defining collections of formulae have alfeady been used in Troelstra 1973 (§3.6.3). We define the class COH of coherent formulae to be the closure of the class of atomic formulae under
A,V,
3.
For some applications
one can extend this definition: for example if all atomic formulae are decidable, then quantifier-free formulae may be treated as atomic in the definition of COHo
There is a precise sense in which
eOH is the intuitionistic analogue of the existential formulae of classical Model theory.
We define the class POS of positive formu-
lae to be the closure of the class of atomic formulae under
A,V,
Again under some circumstances one can extend this definiWe define STRICT the class of and LaC the class of
~
~
formulae to be PR(COH),
formulae to be HPR(STRICTi COH).
The
intersection of STRICT and LOC is SLOC, the class of strictly local formulae; SLoe is HPR(COH; COH). We now discuss an important way of extending the language which we introduced at the beginning of this section.
This extension is
needed for many of,the applications which we give in §7 and §8. ~(x)
is strictly local with
If
just x free then we can introduce a
new type to our language, for {xl~(x)}.
This means in particular
that universal quantification over {xl~(x)} is no longer regarded as involving an implication.
This essentially expands the various
classes of formulae defined above.
In particular it expands SLOC;
and that allows yet further expansion. into detail about all this.
It would be tedious to go
We sirnply stipulate that there be some
convention for the introduction of strictly local
~.
§3 Sheaf models for intuitionism. Sheaves over topological spaces generalize both the Kripke and Beth models for intuitionism: they are themselves a special case of sheaves over a site.
J~et
T be a topological space: we use
442
J.I~.
E. HYLAND
its complete Heyting algebra 1(T) of open sets as truth values. have the usual definitions of the propositional operators (here denotes the value of
~
Propositional logic
We r~l
in 0 (T»: [~
f1J!] f1J!] Interior «T\ [~1 ) the empty set.
1J!]
A
[¢ v 1J!] [~
ljJ]
->-
[ .d
r~]
n
r~]
u
u
r1J!] )
For ease of exposition we restrict attention to structures of the form
XT
= {xl x maps U
€
0 (T) continuously into X}.
X is the sheaf of continuous X-valued functions on T. The case when T X is N the natural numbers with the discrete topology is atypical. more typical case is when X is IR the reals with the usual topology. But X need not he topological: our higher types will be modelled using X's which are non-topological filter spaces (see §S).
A
Define an existence predicate (or predicate of [Exl
=
dom tx)
for all
x
€
~xtent)
E by,
X'r
The quantifiers V and 3 , respectively presuppose and imply existence, so we bring E into their definition: Predicate logic
Interior (() tr Ex ->- ~ (x)] Ix U { [Ex A ~ (xll I x € x.
[(I/x) ~ (xl] [
(3x)~(x)l
€
X'l.,})
r}.
The relations and functions on our structures are required to be extensional not just with respect to the ordinary equality in the sheaf, but with respect to strong equivalence oo. For us, [x = y] = Interior {tlx{t) y(t)}, while [x :: y] = [Ex v Ey ->- x = y ] ; thus [x oo" includes the open sets where neith2r x nor y aI'e defined. Then 1-ary relatia,s R : X ->- ? (T) rnust satisfy T R (x)
[X
n
while a 1-ary function f ; XT
->-
~,
r x = y ] .=.
=Y
D
.=.
R (y) ,
must s a t Ls fy [ f (x )
=
f (y) ] •
of particular interest for us are relations and functions n; ar1.sing in the f o Ll.ovzinq way. Let R be an open set in X then we also denote by R the n-ary relation on R(Xl'''''Xn)
XT
defined by,
= {tl(xl(t), ... ,xn(t»
€
R}.
CONSTRUCTIVITY IN MATHEMATICS
443
Let f be a continuous function from Xn to X; then we also denote by f the n-ary function on X,!' determined by the stipulation (f (xl' •• .,xn) ) (t) = f (xl (t.) , ••• ,x n (t.) • (In the above definitions, on the right hand side we have the original and on the left the defined meaning of the symbols Rand f respectively) • We can now begin to describe the interpretation of the basic language of §2 over a topological space T. The basic types ~ and ~ are interpreted by the structures ~T and R T. Then we interpret the function and relation symbols in the way described above. Thus for example Kx
<
y D
= {tlx(t)
<
y(t)}.
l'Ie call the models we have just introduced the ~ .2i sheaves topological spaces, or the topological~. The reader can now interpret any formula of the basic language which does not involve higher types, in the topological models. For a sentence ¢,
~
we shall say that ¢ holds over Tiff
r¢D
= T.
Remarks 1) A full exposition of the theory of sheaf models sketched above is to appear in the eagerly awaited paper of Scott, Fourman 197? The reader should also consult the pioneering papers Scott 1968, 1970, though the structures there are not quite sheaves but n-sets for ~ = O(T). 2) There is a formulation of the above semantics (so-called Kripke-Joyal semantics - a misnomer as the notion of covering is used in the Beth models) closer to the familiar intuitionistic semantics.
But the treatment we have sketched is far better for
higher order logic. 3) Note that NT is the standard model of the natural numbers in the topos of sheaves on T: an arithmetical sentence is true in the model iff it is true. But for analysis the standard structures differ from their classical counterparts. §4 Truth in fibres and truth in the model In this section we consider only the language of the intuitionistic first order predicate calculus for our base type R; we allow constants for elements of R The discussion would be trivial for T• N. For higher types even over N, the discussion becomes interesting, and §5 is devoted to shOWing how the discussion can be made to go through unchanged for higher types.
444
ited.
J.M.E. HYLAND Suppose cjl(~) is closed with the constants ~ xl"",x n exhibWe say that cjl(~)is true (in the fibre) at t ( Tiff
cjl(xl(t), ••• ,xn(t» (henceforth written cjl(x(t») is true. (Note the convention used without comment in §3, that if ep(~(t» is true then t E r J;;X ~ f EXI A • • • A EXn n i . The relation between [Ex A cjl (x) TI and {tlcjl(x(t»} can be complex. But the following theorem is easy to establish (refer to §2 for definitions) • Theorem 4.l. J,et guage for lR. Then (a) i f ¢ is (b) i f ¢ is (c) i f ¢ is (d) if ep is
.... coherent, [ Ex A [EX A strict, .... local, [Ex A strictly local,
¢ (~) cjl (x)
{tl¢(~(t»}; {t!¢(x(t»}; ¢ (x) TI o Interior ({tl¢ (x(t»}); [Ex A ep (x) D = Interior ({tlep(x(t»}). c
Proof:- By a routine induction on the definition of coherent, strict, local and strictly local formulae. (Needless to say, the same proof goes through for any X in place of lR so long as the atoInic formulae behave as the coherent ones do in (a) above. (4.1) (d) shows that strictly local types {xl¢(x)} are interpreted as the sheaf of continuous maps from T to {xlep(x)}. Thus (4.1) goes throu']'h even \-lith the convention for the introduction of strictly local types. 'i.'his will a qa Ln be true of results which use (4.1) and we will not always comment on this. It follws from (4.1) (c) that if a local formula is locally true in the fibres, then it is locally true in the sheaf model. In particular if a local sentence is true, it is valid in all sheaf models over topolo']'ical spaces. Not many interesting formulae are local so '::his result is rather uninspirin']'. (It should not be underestimated however: for example the principle that for any Dedekind real it is not the case that it is apart from every Cauchy real, is local when properly formulated in the hi']'her types. However (as observed by Fourman) it is provable in the internal logic). Fortunately constructive interpretations of formulae are much more likely to be local than the formulae themselves, and this fact can be exploited to give interesting results. §5 Models for the higher types In this section we give a uniform definition higher types over our basic types ~ and lR. We do place because we wish to consider applications to meters. of secondary importance is the fact that
of sheaves at all this in the first higher type parathe Dialectica and
CONSTRUCTIVITY IN MATHEMATICS
445
and modified realizability interpretations involve higher types even for simple formulae; we wish to make sense of these interpretations in the sheaf models even though vIe do not seem to be able to make much use of them. The main problem which we solve here is that of ensuring that the considerations of §4 go over to the higher types.
In the first
place this means that we must take as sheaves at higher types, sheaves of maps from (open sets in) T to suitable spaces of higher types. example.
To see what these should in general be, consider a simple Let f be in our (still to be defined) sheaf of type JR
-+
JR,
x and y of type [{ (Le. members of «<'1') and suppose that all of f, x and y have their full extent.
=
H{x) < y 1
Now we know that
{t!{f{x)) It) < y{t)},
and we want that to be equal to {t!{f{t))(x(t)) < y(t)}. Thus f must map T to IR
IR,
and for x in ~"f(x) is defined by
(f(x)){t)
=
(f{t)){x(t)).
But f cannot be an arbitrary map from T to IR to be in
~,
IR,
as fIx) is required
that is to say fIx) must be continuous.
The obvious
way to ensure this is to insist that At.f(t) and application (or the evaluation map) be continuous.
In general,
though not in this par-
ticular example, this will take us outside the category TOP of topological spaces; we need to consider a cartesian closed category in which TOP embeds full and faithfully.
Many such categories are
known; the convergence spaces of Choquet, limit spaces or more generally filter spaces FIL (Hyland lY77).
It makes no difference here
which the reader chooses to consider. Our sheaves at higher types over N and
If{
are defined as follows:
we take sheaf of continuous (in the sense of FIL) maps from T to be the space of appropriate type over Nand JR in the category FIL. This ensures that all the considerations of §4 go through unchanged when higher types are introduced. Remarks 1) We are interpreting the finite types over
~
by sheaves of
continuous (in the sense of FIL) maps into the continuous functionals of Kleene 1959 and Kreisel 1959.
For an account of the relation
between these original treatments and FIL see Hyland 1977.
Occasion-
ally hereafter we use the phrase continuous functionals to refer to the higher types in FII, over both IN and IR.
J.M.E. HYLAND
446
2) Though we don't need this fact, it is amusing to note that the spaces we are defining externally at higher types are also the spaces internally defined using FIL, using the intuitionistic set theory (IZP) valid in our topological models.
§6 Functional Interpretations Both the modified realizability and Dialectica interpretations (¢MR and
qP
respectively) of a formula ¢ can be regarded as being
derived from a crude constructive interpretation (¢C)
by simply
varying the treatment of implication in the inductive definition. All these interpretations can be given in the finite type structure (of total objects) over the basic t.ype s e for us the basic types are Nand R, the types of natural and real numbers respectively.
Let I
be an arhitrary one of our interpretations C, MR, D; and assume conI 3~ v... ... + I + 1..1+ -+ + is ( x) ( v y) ¢I (x,y), and W is ("3 s) ( v t) WI (s,t),
ventionally that ¢
whe r e ~,y,~,t are strings of variables.
'l'h en the interpretations are
defined by the following inductive clauses:
¢ is atomic ¢I is cp, Vi)I is ("3~)(3~)(Vy)(Vt)(
(i) if (Lf.)
(iii) (Lv)
(v) (vi)
(¢
/I
«,fI
or zl o)I I
('3r) (Tx) (3~)
(ep (b)
." -+
I = C:
=
is
(3
s) (V t)
ntO -+ WI)'
IJ
-+
+
-+
(ep
-+ W
c)'
(=JS)(\t~)(\tt)((V'Y)¢gH+ wl.rn(S(X),t»,
I = D:
(¢ ... W)
/I
r
MH:
(¢ + VJ)MR is (c)
/I
is (h)(.Jx)(vY)¢I(x,y),
-+ 1)i)C I
('v't) (r>O +
+ ... is (3-+"' X) ( v z) ( Vv -+ y)I (X(z) ,y),
«3z)¢) (a)
(V'y)
D
is
-:17:
(:I~)(
3+ v+ w+ ... -+...... Y)(vx)(vt)(¢D(x,Y(x,t»+
+ ......
V)D(~;(x),t».
'l'hroughout the above, variables are supposed to be sensibly typed. In (iii), n is of type Nand r of type R;
the interpretation simply
uses the definability of v with respect to elementary intuitionistic theories of natural or real nUMbers.
X(z),
Sex), Y(x,t)
are inter-
preted in the obvious way; e.g. S(~) stands for some sequence Sl (x), ••• 'Sn (~).
Finally (vi) (b) is an intuitionistically equivalent
variant on the usual formulation of }ffi; i t brings out the analogy with the other two interpretations. Giving an inductive definition of ¢C is rather artificial; essentially it can be obtained using the notion of the strictly positive parts (s.p.p.'s) of a formula as follows:
CONSTRUCTIVITY IN MATHEMATICS
447
(i) replace v's in the s.p.p.'s of the formula by a definition
V,
using
1\,
....
;
(ii) systematically move all quantifiers acting on the s.p.p.'s of the formula to the front; (iii) bring to the required form by replacing n(Vx)(Jy)( ••• x,y ••• )n by n("3Y)(V'x)( ••• x,Y(xl ••• l"
(Le. usinc;
This brings out the fact that ~c does not make C ........ much use of higher types; if ¢ is (:J x) ( 'V y l ¢ c then the,> maximum Skolem functions).
x is
level (in the usual sense) of the types of
at most one greater
than the maximum level of the types appearing in ¢. We close this section by giving some information about the relation between, ¢, ~C, ¢Iffi and ¢D.
We define classes rand b of
the formulae to be the least classes such that (i) r (ii) b
~
HPR(f).
a system of
~
PR(b) and
In the following theorem f- denotes derivation in
incu~cionistic
logic, which can be much weaker than the
consequences in our basic language, of IZF. Theorem 6.1.
Remark
(a) For all ¢, f- ¢C .... ¢.
llR . . ~ .
(b l I f ¢
E
r, then I- ¢ Iffi .... ¢C and I-
(c) I f ¢
E
PR(HPR(COH», then I- ~[) .... ~c and f-
Some of (6.1) is a simple extension of
~roelstra
cj,D .... c ,
1973 (see
his §3.6.5). §7 From continuity in parameters to truth in topological models In this section we consider various notions of continuity in parameters, and use them to establish that certain propositions hold in all topological models. •
The crudest notion of continuity in
parameters used, depends on the interpretation ¢
C
introduced in §6.
We consider briefly why as yet we have not found a use for ~Iffi (in a case where it differs from ¢C)
to estahlish results about topolo-
gical models. The possibility of applying interpretations to the stUdy of sheaf models arises out of the idea that the interpretation ~I or
(3
x) (if Y) ¢I
expresses more explicitly the constructive content of
The result of this is that ~
~I
is much more likely to be local
is.
Lemma 7.1.
(a) If
~
E
PR(STRICT), then
(b) If ¢ E PR(PR(POS», then tically eqUivalent to local formulae.
~C
is local.
~Iffi
and
D
are intuitionis-
J.M.E. HYLAND
448
Proof:-
Straightforward.
For (b),
~MR
and
~D
need not be local;
hut they can be mad e so by replacing some v's wh.i ch have been defined away by the interpretation. Given a sentence
~,
where
~
C
is ( 3'" x)
(V ...y)~c,
the string ... y is
the string of parameters (or more exactly positive parameters) of
~.
These are the parameters arising in our various notions of continuity in parameters. We discuss our crudest notion first.
He say that ¢ holds w Lt.h
global continuity in paraneters iff ¢ C is true when the higher t~'pes are interpreted as spaces of appropriate type in FIL (see §5). corresponds to saying not only that
~
This
is true but that in transfor-
ming ¢ to epC (as described in §6), the Skolem functions needed can be chosen continuous. Proposition 7.'2. parameters, and
If a sentence ~
£
~
holds vlith global continuity in
PR(S'l'RICT) then
~
holds in all sheaf models over
topological spaces. Proof:-
By assumption (V~)¢(,(~,y) is true for a fixed choice of
continuous~, and so by (4.1)-and (7.1), i t holds over
a represents
't'
(where now
the sequence of constant Flaps from '.i' with values
Thus ¢C holds over T and since
a).
¢C ... ~ (by (6.1)), ep holds over T.
It is an immediate result of (7.1) that suitable formulations of the following are valid in sheaves over any topological space: (i) every continuous function has a least upper bound and is uniformly continuous on closed intervals; (ii) the fan theorem (expre"ssing compactness of Cantor space). The above examples express pure compactness phenomena in analysis, and can be extended to many others (in general one will need to make heavy use of the convention described in §2 for the introduction of strictly local types).
It appears that such propositions
cannot be proved in IZF together with Zorn's Lernma ,
Realizabili ty
interpretations do not seem to have been extended to such strong systems, but I believe that I have a complete Heyting algebra over which the fan theorem fails. One rather obvious defect of the rot ion of global continuity in parameters, is that (even with the use of the convention concerning the use of definable strictly local types), vIe are asking for continuity over an unnecessarily wide range of the parameters. Given a sentence
~,
we can (up to trivial intuitionistic equivalence)
take epc to be a conjunction of the form
CONSTRUCTIVITY IN MATHEMATICS
449
... Xl)A ••.••••• A(l!J n ... Xn), where the Xi are atomic. Then we can give a definition of the (~Il
parameter space of
~
as
->-
{yl for some i, Ls Ls n , where as usual 1/!i are strict.
<jJ
C
(
3.... ->- .... x)1jJi (x,y)},
-+-+
is (;:j x) ( V y) ~C.
I f ¢ is in PR (STRIC'l'), then the
Hence using the property (4.1) (b) of strict, one
can readily see that to make (7.1) go through, it is sufficient that the Skolem functions ~ be defined (and continuousf on enough of their domain to ensure that ¢c(x,y) is true (and so in particu-
c.
lar has a truth value) for all y in the p a r ame t.er space for
What we have just sketched is a notion of global continuity in the parameter space.
We do not pursue it further but turn to our
weakest notion local continuity in parameters.
To show the need
for this notion consider the sentence expressing the existence of 3-3x-y, a root for the cubic x
( Vy)
(
3 x)
(x 3-3x = y) •
Obviously there ia no total continuous function giving x in terms But for any y there will be an open neighbourhood U of y and
of y.
a continuous function giving a root x(z) for each z in U.
So the
sentence does hold with local continuity but not global continuity in parameters.
Another simple but
illuminating example of a sen-
tence of this kind is
(v x) (x
> 0 ... (3 n) [nx > 1 J) ,
where x is of type Rand n is of type N. First we define ¢(x) is (locally) continuous in parameters from A eRn, whe re all free variables in ¢ are indicated, and the length of x is equal to n (for the induction to be smooth we must allow dummy free variables - the restriction to real parameters is one of convenience): (i) if ¢ is at.onu c-, ¢ (x) is continuous in parameters from A iff A is included in {xl~(x)}; (ii) ¢A1jJ is continuous in parameters from A iff both
cj>
and
~I
are
continuous in parameters from A; (iii)
(V x)¢(x,y)
is continuous in parameters from A iff ¢(x,y) is
continuous in parameters from RxA; (iv) ¢(~) ... 1jJ(~) is continuous in parameters from A iff 1/! is continuous in parameters from An{~I¢(x)}; (v) ¢v1/! is continuous in parameters from A iff there are relatively open R,C such that A
=
RuC
and ¢,1/! are continuous in para-
J.M.E. HYLAND
450
meters from H, respectively C; Ivi)
13x)¢lx,;) is continuous in parameters from A iff there are
continuous maps fi:A i for each i ¢Ifily),y)
+
R on relatively open Ai covering A such that
is continuous in parameters in Ai. O
If n sentence ¢ is (locally) continuous in parameters from R
li.e.
the one point space), then we say that ¢ is continuous in parameters. This is our notion of local continuity in parameters - but we drop the "local". Unfortunately the above definition is cwm)ersome.
Despite this
I claim that it does represent the natural notion of Ilocal) continuity in parameters.
Hhat is more, if a sentence is sufficiently
simple for one to be able to read through and understand it, then with little further effort, one can read through and understand what it is for it to be continuous in parameters. Clearly if
¢
holds with global continuity in parameters, then
¢ is continuous in parameters.
Further, we can extend 17.2).
Proposition 7.3. If a sentence ¢ is continuous in parameters and
¢
€
then ¢ holds in all sheaf models over topological
PRISTRIC1~),
spaces. Proof:-
Use induction on the definition of continuity in para-
meters. 17.3) allows us to improve the formulation of the compactness properties which follow from (7.1).
One can use it to show many
other things for example that an appropriate formulation of Dini's Theorem holds in all topological models.
One can also use (7.3) to
analyxe the general question of the existence of solutions to odd degree polynomials in one variable, in the topological models.
One
cannot find even a locally continuous solution in the neighbourhood of a point in the parameter space which gives rise to repeated roots; so by
(8.~)
in general such polynomials are not soluble in
the topological models.
But it is possible to write down a coherent
fornula in terms of the coefficients of a polynomial, which expresses the fact that the polynomial has at least one non-repeated real root.
Thus there is a non-trivial formula which expresses that
separable polynomials of odd degree have roots; this does hold for all
topolo~!cal
models in virtue of 17.3).
We close this section by considering the possibility of using more usual functional interpretations to establish facts about the sheaf models.
The following proposition is an analogue of 17.2).
CONSTRUCTIVITY IN MATHEMATICS Proposi tion 7.4. (a)
451
If the sentence ¢ is in PR (HPR (POS))
(which is
the intersection of r of §6 and PR(PR(POS))) and ¢~ffi holds (i.e. classically over the continuous functionals) then ¢ holds in all sheaf models over topological spaces. (b) I f the sentence ¢ is in PR{HPR{COH)}
(and so
in PR(PR(POS))) and ¢D holds (i.e. classically over the continuous functionals)
then ¢ holds in all sheaf models over topological
spaces. Proof:-
As for (7.2) using (6.1) and (7.1).
Let me say first that I know of no example interesting or otherwise where (7.4) can he used to show that a proposition ¢ of analysis holds in the topological models, and where this could not be done by applying (7.2) or (7.3)
to some
~
which trivially implies
¢.
In fact for the Dialectica interpretation no such example could exist using (7.4) (b) in its present form: one would need significantly to strengthen the result by weakening the hypotheses.
How-
ever the present position is not so hopeless for modified realizability, and it seems reasonable to raise the following open problems.
1) If ¢ is in
r,
then by (6.1) if ¢11J~ holds so does ¢C;
are there
propositions ¢ such that ¢HR implies ¢ ~lith intuitionistic logic, but ¢}ffi does not imply ¢C classically over the continuous functionals? 2) Even if the answer to I} is negative, there is still a difference be tween the apparent range of applicability of (7.2) and (7.4) (a): PR(HPR(POS)) neither includes nor is included in PR(STRICT).
Are
there any interesting formulae in PR{HPR{POS)} not in PR(STRICT)? §8
From truth in topological models to continuity in parameters In this section we describe the most significant aspect of
this vror k for classical mathematics: a simple way of obtaining continuity in parameters as a direct result of the constructivity of proofs.
As
~Ie
shall indicate, for strict formulae, truth over
the parameter space (see §7) implies continuity in parameters; in fact by (7.3) it amounts to the s arne thing.
Of course for differ-
ent propositions the parameter space will be different.
However,
if a proposition has a constructive proof (see §l) then it is valid over any topological space, in particular over the parameter space; if in addition it is strict, it vlill thus be continuous in parameters.
If, as I claim, what it means for a proposition to be con-
J.M.E. HYLAND
452
tinuous in paraMeters can be easily read of from the proposition, then the classical (or otherwise) Mathematician has information immediately available to him arising out of the constructive nature of his proof.
Particular features of a problem may enable him to
improve this information to obtain some form of global continuity in parameters. In order to formulate the basic proposition of this section we define the parameter space of a formula ¢ (i) vii til free variables i -> c -> v-> ->->-> and with bound parameters y (i.e. where ¢ is (3 x) (\ y)¢C(x,y,z». He already described this for sentences ¢ in §7, and in general we may take the parameter space of ¢(~) to be that of the universal closure of
This parameter space can be defined inductively
on the structure of ¢, in an obvious way; but the details are messy and we omit them here. Proposi tion 8.1-
Let ¢(i) be in STRICT, and let A be an open sub-
set of the range of the parameters ~i suppose that over the parameter space of
r ¢ (;;)
I
5. {t Ii (t) <:A}; then
parameters in A. Indication of proof:-
By induction on the structure of ¢.
The only
real interest is in the steps for v and '3 where use can be made of "generic" free parameters, and the homogeneity of the parameter space. Our main interest is in the immediate corollary: Proposition 8.2. If a sentence ¢ is strict and ¢ holds over its parameter space, then ¢ is continuous in parameters. Remark.
As the reader will realize in (8.1) and (8.2) we are treat-
ing the parameter space not just as a set but as a topological space. For higher type parameters of level two and above, this space will not immediately have the structure of a topological space; it will be a more general filter space.
So to make sense of (8.1) and (8.2)
one has to take the induced topology or what here amounts to the same thing, regard the continuous functionals as being in the category of sequential spaces (see Hyland 1977).
As a result there are some
subtle points in the proofs for higher type parameters, but we are not going into details about that here. More interesting than a general proof of (8.1), is the consideration of a special case.
(V
x)
("3
y) (¢ (x,y) -> 1!J (x,y».
Let ¢ and
~)
be coherent and consider
For this formula, the parameter space
X is the space over which x ranges.
The generic element
x of
X is
CONSTRUCTIVITY IN MATHEMATICS
453
the identity on X (considered as an element of XX). considered holds over X, then so does ("3 y) (¢ (x,y) X is covered by sets U and elements yu of_Y But yu:u
-+
Y is then such that ¢ (x,yuX»
for all x in U (since ¢
is strict).
-+ ~j
(x,y».
Thus
x (where_the variable y
ranges over Y) with extent U such that ¢(x,yu) U.
If the formula -+ ~I
-+
-+ ~(x,yu
~
holds over
(x,yu(X»
is true
If this information is
unravelled it amounts to the fact that (V x )
(:3
y) (¢ (x,y)
-+
~) (x,y»
is continuous in parameters. The way (8.2) is applied is expressed in the following proposition. Proposition 8.3. If the sentence ¢ is strict and has a constructive proof (possihly using special axioPls valid over the parameter space), then ¢ is continuous in parameters. Proof:-
By (8.2) and the remark in §l that IZF+ZL holds over any
topological space (and constitutes our notion of constructive proof). Remark. Among the special axioms Vlhich can often he used are bar induction
(wh i ch is valid over all complete metric and compact
Hausdorff spaces) and the axiom of choice from nUPlhers to sets (vRlid if pRrRmeters Rre restricted to the higher types over N). (8.3) has immediate appl.Lca t.Lon to differential and integral e~uations.
This is because
(i) such statements as "g is the derivative of f" can be expressed by strict formulae, and so statements that particular
e~uations
are
soluble turn out to be strict, and (ii) a large body of elementary work on differential and integral equations (the use of contraction mappings and Arzela's theorem for example) is constructive. tial and integral
So continuity of solutions to differen-
e~uations
in parameters, is an immediRte conse-
quence of the constructive way in which the existence of solutions is established.
Of course where we have
uni~ue
existence of a sol-
ution, we get global continuity. One advantRge of the method sketched ahove, over possible realizability methods, seems worth mentioning.
In the constructive
proofs considered, propositions (about sets of reals for example) for which we have no notion of continuity in parameters may occur. But this does not matter at all.
In contrast with realizability,
we do not need to establish continuity in parameters at each stage in a proof. It is clearly not possible to use the modified realizability of Dialectica interpretations to establish continuity of real
454
J.M.E. HYLAND
parameters.
Too many propositions which we will meet in the course
of proofs have only local continuity in parameters, and modified realizability and Dialectica interpretations use total not partial realizing obj ects.
('l'his is not true for the higher types over N,
but we do not discuss the possibility for them here).
An approp-
riate form of realizability using partial realizing objects needs to be developed over R, before a proper general comparison between the traditional ideas of interpretations and the sheaf theoretic methods of this paper, can be made. !3ibliography .J.M.E. Hyland, Filter spaces and continuous functionals
(to appear,
1977) • S.C. Kleene, Countable functionals, in Constructivity in Mathematics, North-Holland (1959). G. Kreisel, Interpretation of analysis by means of functionals of finite type, in Constructivity in Mathematics, North-Holland (1959) • D. Scott, Extending the topological interpretation to intuitionistic analysis I, Compo Math. 20
(1968).
D. Scott, Extending the topological interpretation to intuitionistic analysis II, in Buffalo conference on proof theory and intuitionism, North-Holland (1970). D. Scott and M. Fourman, The logic of sheaves (to appear 197?). A.S. Troelstra, Metamathematical investigation of intuitionistic Arithmetic and Analysis, Springer Lecture Notes in Mathematics 344
(1973).
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing' Company (1977)
MODEL
OF PARTIAL CONTINUOUS FlrnCTIONALS
Yu.L.ERSHOV
Institute of Mathematics the USSR Academy of Sciences Siberian Division Novosibirsk In the last years(most fully in the thesis by M.Hyland [6J ) the coincidence of a large number of models of everywhere defined functionals of the finite type has been proved. This coincidence demonstrates the defini te significance of the model
([{ ,
which is the model of the bar-recursive functionals theory as well. However, I am convinced that the most natural way of this model determination is in determining it by the natural model ([; of partial continuous functionals, and that it is model
a:
that is
more fundamental than model (fJ , in the general mathematical sense. The most essential point is that partial objects may be built by the successive expansion of finite objects, which are the elements of model (L: • The history of recursive functions theory, in particular, shows that only after the introduction of the notion "partially recursive function" into usage this theory acquired well-composed structure. 455
456
Yu. L. ERSHOV
Further account is divided into two parts. The first and larger part contains the definitions and the properties of some classes of topological spaces, the efficience of which, I presume, is not covered by the construction of model ([; or the models of
[2].
A-calculus
The second part contains the main results concerning model
(f;'.
SPACES.
In this part we shall consider (in survey) the general mathematical notions which can be used for the determining of the functionals of higher types. We shall describe three
approaches
(growing more and more constructive), which (very relatively) may be called "finite", "effective" and "constructive".
These
adjectives are referred to the objects (being determined) only, and not to the means of discourse, which are absolutely classical. 1•
t
Let
-SPACES
)(
(~)space. The topology on
be a topological
determines the partial order
X
for
:x:,~E
Let
X ~ t c.; Ide X ~ cc ~
~
for any open UCX(-::x:.EU--?~EU)
:X::G.'J ==r
I' -element
is called
'(f ~
i f set
element, then the open set ~
X
Topological space of all )(
itself is
Let
X
be
:c
eX.
Element
is open. If
X
X E'
is
X
1-
~-neighbourhood.
is called
is called
-i-space, if the family o
X
~-neighbOUrhood.
-4-space,
X o £ X - the
,then the triple
conditions: 1.
?.x
yP-neighbOUrhoods form the basis of topology on )( ,
and
elements of
)(
3xo Vx (xo
~.x-).
<X
)
set of all
X0,~
>
f!-
satisfies the
MODEL C OF PARTIAL CONTINUOUS FUNCTIONALS 2.
457
<~, ~ > is the partial upper semi-lattice in <X.~ ) I this
means that if
:Xo,x.l €
X
and
x" ~:x-
is, such X ~)( exists that
there exists the exact upper bound in
X
') x" ~ .xj
.£ =; .[ Xc \ ~"E;' Xo '" ~ d A ~~ d ~ x If
,x.1'
<: X • Xl),:!(. >' such that X, s: X
and
X
if e X .
,satisfying the conditions ./!-spaces on o
is called the basic subspace (or~)
-t
X
X .
;)(0 itself
-epace and coincides with its
with the induced topology is own basis.
.f;;
>
then
for BD:Y tx 6
1-3, arises from some(unique) structure of
Let
then
X •
is the partial order on
Xo
),
of elements x o
for every cc,
On the contrary, any triple
~
and::X.1 ~.:::l:::
)(0 •
and this upper bound lies in
3.
are consistent (that
.:::("'o,X".:t
-t-spaces,
be the category, the objects of which are
and the morphisms are the continuous mappings. THEOREM 1. The cate
0
-.t
of
-s aces is Cartesian closed.
In particular, this means that in the category the direct product
Xy Y ,
tinuous mappings from
t
may give the structure of spaces
C(yx Y, Z)
The topology on
J:;;
there exists
and that on the set C(~ Y) of all con-
-space
X
-t-space
to
Y
~-spaces in such a way that
C (X, CO:,,..?J) CCX, Y) is given and
we
~
are naturally homeomorphic. as the topology of pointwise
convergence, i.e. by the prebasis of the sets of form
U is an open subset Y ,and <~, U) ~ ~{/(/EC(X,Y),I(.x)e0.l--elementsof the space CCX,Y)
where .x~
X,
<x, U) ,
may be described in the following way: let elements of Dy
X ; 1 ?"" ?;z. 0
- the least element of
condition is satisfied
be
x"., ... >~...
.I-elements of
Y ; suppose
Y
be
1-
,and
that the following
458
Yu. L. ERSHOV
will be designated like
Q::::: .n <;x;.. , ~t' >
just be an equality
(.::ct',y,>
~
cs r«
'7i >). X and Y
like set <,.:r,
(J ""'"
will be called basically equivaI:-spaces o lent, if their basic subspaces are homeomorphic. We shall call
X
The;:' -space
basic equivalent to
)(
,any homeomorphism of the basis
continued to the homeomorphic imbedding of The following theorem characterizes complete
THEOREM 2.
mappine; For any
X
A .;:,-space
cal space Y --subspace Z
Y
Y
~f ~
)( .
-t -space
it the complete
Y
into
is
X .
~-spaces.
is complete iff for any topologi-
anv continuous mapping
)~
~
-t-space Y ,
to be complete if for any
~
~f everywhere dense
2-s continued up to the continuous
This continuation is unique.
X
there exists basically equivalent to
~-space(which is obviously unique, to within
homeomorphism). Let us show in brief the corresponding operation of completion. The simplest way of describing it is in the language of the triple
<X
)
X
~>.
0 ) -..;;;
The ideal of a partial semilattice set a)
j
of set
Xo
<X ~ > is any non-empty sub0 >
' satisfying the following conditions:
MODEL b)
Let
~o, x
JeX
E'
1
i
~
OF PARTIAL CONTINUOUS FUNCTIONALS
--> .:9ex; E,j (.:x-o::i.:r..? &.
be the set of all i!ieals of
0 )
oX.t
Xo
459
~ :::Jc:2). '
do (X
g )
the
-
)(0 , the ideals of form ~o = ::: £::x;, IX.1 E: Yo x.1 .$' .:;('"S ,:XoE' ><;. Then the triPle(J(Xo),J/Xo)F)
set of all principle ideals of J
.=t -space
corresponds to the structure of complete The mapping
Xo~
X" gives
.::c t--> ~ ., XE;
mapping
morphic imbedding EXAMPLE 1
Let
X
the homeomorphism
X()
on
and:
X , where ~ :::toco\ro€Xofoa]'will into ~
J (Xo )
be homeo-
x.') .
; \ / be the set of all natural numbers with dis-
*
to
It/'
and give
on ;1/"- ~ /!/u{ ~ ~ the topology with the basis [;V"'~ u Then /\/""'" is a complete basis,
EXAMPLE 2
A/';'. /V
Let j)(/\/) be the set of all subsets
topology, determined by the basis of sets in form
\ M ~ j1/) F ~ /'-1
P(N)
3 ' where
r
is the complete
The elements of
with the
~ ~{ H
;:. -space with the basis
A/.
i
t
then
e 1:IV)
,
'f: -SPACES
t
-spaces may be regarded as the ideals
partial semi-lattice of completion of
;i/
is a finite subset of;V
which is the family of all finite subsets 2. EFFECTIVE
Hm: ~\...~
";;-space, coinciding with its own
is the homeomorphic subspace
/v
i
C
crete topology. Let us join a new element
A /
"lCX.,).
of
~-elements(see the definition of the
~ -space), which gives us some "finite" inter-
pretation. However, purely topological notions are not sufficient for the defining
the notion of "constructive" elements of the
space. For this it is necessary that the basis of the space should be given effectively. Let us make use of the notion "enumeration" for the introduction of the suitable definition.
Yu. L. ERSHOV
460
Let
)<
-t -space,
be
X
X0
Xo
~
- its basic subspace,
- the
v.... be the partial two-place operation of the exact upper bound on X o order on
(and
determined by the topology, and
),
determined on the consistent pairs. The effectivization of the basis of the space
N - ? Xo
meration )}o:
X
we call any enu-
of the basic subspace, such that:
a) The predicate ~ (:::c. ?i) ~ b) The predicate ~ (:::x:>~)
!:==;
Y" X
:!f
v",¥ is recursive;
[Vo':X:',v",f are consistentJ is recursive;
c) There exists the two-place partially recursive function
R,1 (:x,~;-)
such that
The t-space
X
<x, i>
-'>
Jid J<.
E
va cc
U-)'Vo
~
'! = 'l7o d" co:::, d)
.
with the effectivization vo:;V~ X o will be
~ -space ;the
called the effective
~
~ -space
70
X
,having at
least one effectivization of the basis, is called effectivazible
-to -space. THEOREM 1'. The effectivizable subcategory
r;! the category
-s aces form the Cartesian-closed
~
More than that, by the effectivization of the bases of
X
and
Xx Y
-t-spaces
Y
,natural effectivizations of the bases of
and
CC><"Y)
It-spaces
are effectively being constructed.
In the latter case the main role is played by the following lemma, concerning the inclusion of the'?-neighbourhoods of the space
C(><, Y) for any LEMMA. Let -
10 -spac~s
U:=.n <.:x-,' , 1 > cs a
i
I-neighbOUrhoods C(X)
Y),
X and
and
V
=.
~
Y
n<
Uj .1Jj
J~m.
>
be two ~-empty
U~V ~ \lls{O, ... ,n).:J'1<;;;{o,···,/Tl3(n:x,<;;n1Z.&..n~.~nU;)
o
It should be noted that condition
cer
(~J
j"(f
,,;J
of non-emptiness of
neighbourhood is effectively verified in the effective Let )lo;;V~ Xo be the effectivization of the then we shall call any element XE
X,
t -space
such that set l n
X
Jt;J
.
1-
~-space.
I Yo 1"2.
basis,
* zx: 5
MODEL
~
OF PARTIAL CONTINUOUS FUNCTIONALS
461
be recursively enumerable, the constructive element in set of all constructive el.ements)( If
is the effectivization of basis
).{,
Y
tivization of basis sis
we denote as
,and
/0
X
)(
• The
X~ x;,c;;- X!:X.
, /0
is the effec-
is the effectivization of ba-
C Cx, Y), constructed while proving Theorem 1', then
~E;' X ~) EXAMPLE 1. Let enumeration Yo(n+-')~n
)}o:
CC X,
'-f' E'
,/\/'" be
y)
~ ~
':PC::>::.) E
y1.
-t-space, introduced in the previous item;
;V-) /V~ determined in the following way: 'l1J co) ~
*' »
, nE"N) is the effectivization of basis /1/""(coin-
cidingwith N
"*' ), (.A/i')J rv =/1/.
-I/<
EXAMPLE 2. The canonical enumeration
Y: /11' -7' e
eN)
of the family
of all finite subsets of;1/
is the effectivization of the basis
Pc/V},
of the constructive elements
of space
family
P(I'V) 4
consists, precisely, of the recursively enumerable sets.
,/!,-SPACES The effectivization of the basis allows to define the notion of the
C ONSTRUOTlVE
constructive element of the effective
~-space. However, this
is rather a "local" property of the constructiveness than a "global" one. The foll'owing definition is the "global" property of the constructiveness. Let
X
X
ration
~ -space, v.,: tV 7 Xo be the effectivization of basis
be
,then we shall call a constructivization of
v;
,tV --
X
X any
enume-
, such that the following conditions are satis-
fied: a) The imbedding
( Xo,Yo)
and (
Xo
X, v) ,
recursive function b) The predicate
~
X
is the morphism of the enumerated sets
L, e. there exists a one-place generally
R (x,~)~
~o
~X~))/I
= 'V'j-is recursively enumerable.
462
Yu. L. ERSHOV
,X
;:,-space
with the effectivization of basis
constructivization
)}
and the
is called constructive.
The following property is obvious:
.!! X
is the constructive
Xi = X
-t-space, then
in other
words, aQY element of the constructive space is constructive. Let (X,V",)}) and C'y,.J'f0'~) be constructive we shall call the contanuoue morphism from any continuous mapping tfJ:
X
-';>
Y,
J:-spaces, then
(X,vo,")?)
to (Y,/fO,/f)
which is the morphism of the
enumerated set ~ = (X,)}) to the enumerated set:l =: (~ /f), i. e. such a mapping, that for some one-place generally recursive function
g
'1')) =/
J-
takes place.
The family of all the continuous morphisms from (X,Yo»)J) to CY,/'*>,j') we shall denote as Hot;;(.~"~), this is subset oflfor(~l)n('(x,Y). Therefore, on this set we may assign the structure of the topological space(as the subspace of
C (X, Y)),
and try to find the "cor-
rect" enumeration of this set, using the notion of the computable enumeration of the morphisms [1] • Unfortunately, generally speaking,
Mo~ tE,~)
won It contain even the basic subspace of
('(X, Y) I
and won't have the principal. computable enumeration. Therefore, it is not always possible to turn the set structive
N0'C. (]f,
lj) into
the con-
1C-space. Nevertheless, for the most significant case,
all this is possible. We shall call a constructive space over the basis ( in other terms for any constructive basis to
(X",vo)
(X, v
)
-t-space
(X, Yo, v)
[1J -
to be complete
over the approximation), if
(Y:'Vo ,;4) with
the same effective
there exists the continuous morphism from ,identical over
Xo
I t should be noted that for any effective basis
(Xo ,!5, ,v"',
('Y,/)
Yo) there
exists (and the only one, in the reasonable sense) constructive
.fa-space (X,Yo ,))) ,
complete over basis
cs.,Y,,)
• In
brief, it
may be interpreted as follows: as in the case of the construction
MODEL
~
OF PARTIAL CONTINUOUS FUNCTIONALS
463
~-space, we consider the ideals of the partial
of the complete
semilattice (Xo , u..) , and not all the ideals, but only the ~
j
X o ' that })o-.t(I) is recursively enumerable. As for the enumeration V , i t is just the sively enumerable ones, i.e. such ideals
~
principle computable enumeration of the family of-the recursively enumerable sets:
3.
( X 0, u~.,v.. )
£v~j(d) \
THEOREM 3. I f (X,Yo,Y) and
J
is recursively enumerable ideal of
(Y:/"0)
complete ~ their ~,then
Ee. constructive
Ito~ (X, ~) = /'tor {X ,~) ;
~ exists the principal computable enumeration
mily
::!
-10 -spaces,
)}~
!!!..
the~
all morphisms Nor (X', ~) , and this enumeration is such
~ at the natural effectivization Yo'*; /l/ ~C(X, Y)o ~ the basis
of ~ C ex,
y) , v"" E
the constructivization of
-t-space
f(o/C{X',y), ~ (lfo'C(~:t),J}:,ll")iS the constructive complete ~ its basis (CCXX)"
~-space,
,Y:) •
This theorem can naturally be considered as a rather broad generalization of the well-known theorem of Myhill-Shepherdson. Let
(X, Xo,v
-t
o )
-space,
be the effective
CX~v",v)- be
over basis (Xo , v o ) space, then
• If
..;: -space, which is complete
-I: -space,
the constructive
X*
is imbedded into
plete over their bases, is ---------
va) ,
-/!, -spaces,
com-
-
Cartesian-closed one.
EXAMPLE 1. If we consider a constructive same
as the sub-
X* = X~.
THEOREM 1". ~ category ~f the constructive
basis (11/~
X
complete
~-space, complete over
then, as the topological space, it will be the
/1/* , but the constructivization )): ;t.!-/V""will not be
eqUivalent to
Yo
the completion [1 ]
• And more exactly, (/V~
f = ~ (;IV)
y)
is equivalent to
of the enumerated set A/= {~,~
EXAMPLE 2. Let f:;V~.£(t1-JI, as in Example 2 of the previous item,
464
Yu. L. ERSHOV
Y? ~ PctV).f
be the family of all recursively enumerable subsets
of ;\/ ,7T;;V~!R.. - be the Post "s enumeration, i.e. the principal computable enumeration of family
5t.
then
(!.R, r, rr) is the con-
structive
~-space, complete over the basis.
II. MODEL
AND ITS PROPERTIES
~
The above consideration allows now to easily define the model of the continuous partial functionals of the finite types. We shall first define the set a)
0
T
E
b) i f
T ,
(), 't' E
7"'
c)
0
,i.e.
T
of types:
is the type;
then
(»)")
, ( ) \ '?::-)
T
E
is the least set, satisfying the conditions a) and b).
Now for any
6'
following way:
E;
'Z"
Co!=;
we shall define
/\/*, /l/*' is the
t
-space
G
in the
-£-space, considered in
Example 1 (part I).
eelS" X~) ~ CCO""I't)
Co X C't"
~ C(C<s ,C't" ') .
a: ~ { c"" \
Model
oET
j
We shall call the elements of space type
S
• All the spaces
Ccr-
Co-
-
the functionals of the
are effectivizable, complete
~-spaces. The natural effectivizations of bases allow to define subset
C!
of set
Co- of the computable functionals of type
fS".
Another way of defining the model
Ia
a: is
the use of the constructive
-spaces. And namely, let us define model cL"(A/)Of the func-
tiona1s of the finite type as the family of constructive ces ~ (A/) , complete over basis ,O-E' T :
c;,(A/) ~
f = ~(A/);
:i-spa-
MODEL C OF PARTIAL CONTINUOUS FUNCTIONALS
If we consider CIS (I?/) just as an
465
-t
-space, then we may con-
C:.
struct their completion, which is naturally identified with the space C() , and ~(/#)is identified with its subspace
of the
constructive elements. Thus, ~ (N) is the set of all constructive elements (computable functionals of the type 0
) of space
~()'
provided with "correct" (principal,computable, Godel, etc.) enumeration. Let us now construct two models e;j
and
(/)
of the everywhere
defined functionals in the following way: For any
0':;
7'
we shall define subsets
Go
<;;' Co-
and DIS
~ C(5"
in the following way:
4 -t<'C;;o = 00 ~ /V~ Co =: CD =;1/ )"
G<6"I'l:)
o
(Cfl"t: )
~
[
F\ F€'GO",'l:) V'3 E ~( F(d) E' G;"t")};
~ {F\ FE'G~,\':) VifE'(~.cF(l)E 4-)3"
DENSITY THEOREM [3, 4] . I f U and Un. Of> i-
*P E
open
in C(5""
then Un ~¢
¢ .
Let us define the relations of the equivalence and
~(5"" ""'"0==
on ~o
=.
0
t5"
on
in the following way:
is the relation of the equality on
Go
00
466 for
Yu. L. ERSHOV
/,I€- O~ )'J' 'J /£ F, F/E
for
for ~F't£
(-I.; F::ca",,) (-/:c; ') ~ -I~trl &.. d ~'t: (J /;
O-c
F~Ca''r;)F''!::;
GrCo-I'to)
\i'E
G~ (,cg))~?::F,(:-;));
F~ra\~)F"~ VifEOa-(F9:i)~c- Ff";1»).
OCCTI?:)
The following statement is the corollary of the density theorem: If
r: F/E'
F ~c(TI't-) F/; if> 'j "E' GIS")
GCtTn) ,
then F("q) '"'- -c F/(q") __ " (7
i_if
~)d/EO<S"d::::::""d/
F>
,...-/E; Q
-ale)
'J 7
~ IS ~
"
F=<'<S"I?::l
F
1 :
'
F(7)~~FY:J/).
,then
This statement allows us to construct the models of functionals ~ and
([)
in the following way: for
O.~ o;,/~\S'" THEOREM. Model
rc~{G<S'\ ~(;T:)
CZ;;
1111£0
let
Gcr- ~ ~ /,...... IS
i
(/)~{O()I6"E-T).
is naturally isomorphic to the model ~f the
Kleene-Kreisel functionals. Model
~l
e:e T
0
1.s naturally isomorphic -iQ..
of the hereditarily effective operations [71.
The fact that the model ([{of everywhere defined continuous functionals is the derivative of model ties of model
cJ;'
obtain many proper-
in a rather natural way from the easily verifi-
able properties of model del
er: , allows to
V
~ Thus, for instance, the fact that mo-
is the model of BR theory of bar-recursive functionals
of Spector, is proved (and more, by the same means, as Luckhara.ti [8J ) rather easily
[5] •
As has recently been shown by Vogel [9] , i f in the definitions of Kreisel and Scarpellini we reject the everywhere definiteness of the functionals, the obtained models of partial functionals will naturally be isomorphic to
([:'
M.HYland in his thesis considered the notion of countable reducibility ~ c for the functionals of the model ([;( ; it turns out that this notion has good interpretation in model ~ REDUCIBILITY THEOREM. Let .;:
E;'
c, ) -t. E
G-r
(~-I.: ~ Go-~G;
MODEL
~
OF PARTIAL CONTINUOUS FUNCTIONALS
467
); the following equivalence takes place:
REFERENCES 1. Ershov Y.L. (1973). Theory of Enumeration 2, Novosibirsk. 2. Ershov Y.L. (1973). Theory of
A-Spaces, Algebra and Logic,
12, W. 4, 369-416. 3. Ershov Y.L. (1974). Maximal and Everywhere Defined Functionals, Algebra and Logic, 13, W. 4, 374-397. 4. Ershov Y.L. (1976). Hereditarily Effective Operations, Algebra and Logic, 15, /12 6., 642-654.
5. Ershov Y.L. (1974). On the Model tJ7 Ac. N. USSR; 217,
m5,
of the Theory BR, Dokl.
1004-1006.
6. Hyland M. (1976). Countable Functionals, Thesis, Oxford.
7. Kreisel G. (1959). Interpretation of Analysis by Means of Constructive Functionals of Finite Types, in "Constr. in Mathematics", Amsterdam, 101-128. 8. Luckhardt H. (1973). Extensional Godel Functional Interpretation, Lect. Notes in Math.
~
306.
9. Vogel H. (1977). Partielle abzahlbare undendliche Funktionale, Algebra and Logic, 16,
m 1.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
BOUNDED CONCATENATION THEORY AS A UNIFORM METHOD FOR PROVING LOWER COMPLEXITY BOUNDS
Klaus Fleischmann, Bernd Mahr, Dirk Siefkes* Technische Universit~t Berlin CSD-TR 202 October 1976
Abstract: Recently G~del's method for undecidability proofs has been adapted to prove lower complexity bounds for logical theories by describing computations of time-or space-bounded Turing machines in the theories. We introduce two formalisms, bounded concatenation theory and bounded word theory, as a general framework to carry out such proofs. The method yields all the known lower bounds for logical theories, elucidating the structure of such proofs. Further the method strengthens the results to circuit complexity. Polynomial-bounded word theory is NP-complete, and is useful in proving NP-completeness. In a course at the Technische
Universit~t
Berlin in July 1975, A. R. Meyer
presented a proof for the Fischer-Rabin [1974] result that the elementary theory Th(R,+,l) of addition over the reals has exponential complexity.
With the helF
of ideas of the Fischer-Rabin paper he defined bounded concatenation of words in the theory, and used this notion to describe the computations of time-bounded Turing machines.
This y,ields a clear picture of the structure of the proof:
one
starts with a set of exponential complexity, describes it through formulas involving concatenation, and finally defines bounded concatenation in the theory. Actually the proof yields an exponential lower bound to the very exponentially bounded concatenation.
formalism of
Meyer used this observation to translate the
proof from Turing machines to circuits, thus showing that ThaR,+,l) is even of exponential circuit complexity.
These proofs are presented in Fleischmann-Mahr-
Siefkes [1976].
*Visting at Purdue University on a Fulbright travel grant. The preparation of the manuscript was supported by the Computer Science Department of the Purdue University. 471
472
FLEISCHMANN, MAHR, SIEFKES In the present paper we use a corrected version of the above idea, and
generalize it to arbitrary time bounds. t-bounded concatenation theory t-BWT.
For any function t: IN -+ IN
we introduce
t-BCT and a weaker variant, t-bounded word theory
In either theory we describe the computations of t-bounded Turing
machines, and thus derive a lower complexity bound of
t
for either theory.
By polylin-reduction of either theory to other problems we can get all the known lower complexity bounds for logical theories, be it Turing machine time or space, or circuit size. examples.
We indicate the reduction in this paper for a few prominent
We prove no new bounds, but we hope that the paper clarifies the structure
of lower bound proofs and thus helps to find new results.
and
We use the following terminology.
Let
t: IN -+IN
M be a Turing machine (TM) computing f.
Then
computes
M
length input
be total functions, let in time ----
f
the (resp. some, if
n
takes
x
< tn
steps.
Thus it is natural to say that no TM computing for every X £
t
>
tn
i f for almost all
n£ N and all
f:
xc
[*
-+
[*
of
[*
M is nondeterministic) computation of M on We also call t
t
an upper
TM-time bound
is a lower TM-time bound for
f
for
f.
if there is
t, that is if
n
the (resp. any) computation of
M on input
x
steps.
(The definition for space is analoguous.) seemingly strengthened to so-called all
be a finite set, let
M there are infinitely many n £ m such that for some
TM
of length
[*
takes
f in time
[
n £ IN exists an
x £
[*
ii. e.
of length
Such
i.o.-Iower bounds
-lower bounds ("for all n ... ").
are sometimes M and for almost
This is done by padding and
by an appropriate definition of the length of a formula.
It does not yield a
better result, however, since obviously there are no real
a.e.-Iower bounds
for theories ("for all
[*
M, almost all
n c N and all
x e
of length
n ... II) .
In the first section of the paper we adapt Gtldel's method to describe computations of
TMs
in the elementary theory
CT of concatenation.
In this way
any recursively enumerable set reduces to CT ; thus CT is undecidable and so is any theory to which CT is reducible.
In section 2 we describe t-bounded computations
BOUNDED CONCATENATION THEORY in the elementary theory computable bound for
in time
t
t-BCT
of t-bounded concatenation.
polylin-reduces to
t-BCT; thus
t-BCT, and for any theory to which
(Certain restrictions on
t
473
are necessary.)
t-BCT
t
In this way any set is a lower time
is polylin-reducible.
We exhibit the method on the
elementary theories of addition over the real numbers and the natural numbers for
tn:=2 n
and
tn:=22
n
resp. (Fischer-Rabin).
In the third section we do
the same with t-bounded quantifier-free word theory.
The examples fiere are the
cn/ l ogn) and the monadic second order
monadic'predicate calculus (lower bound
theory of successor on the natural numbers (non-elementary), both due to A. R. Meyer.
We remark that polynomially bounded quantifier-free word theory is
NP-complete.
Cook [1971] was the first one who used the method described in
this paper to p.vve the complexity of a theory, although he did not aim for an exact time bound.
In the fourth section we extend the method and the
results to circuit complexity.
1.
Describing computations in concatenation theory. If in a formal theory one can describe some computation formalism, e.g.
Turing machines or recursive functions, then the theory is shown to be
undecidabl~.
Namely one can then express the halting problem for that formalism within the theory.
This was the idea of
functions in
G~del
[G~del
1931]. who represented the recursive
the logical formalism of Whitehead-Russell.
In terms of Turing
machines the method is as follows. Let
B be a set of words. say
enumerating
B.
(i.d.) qoa
where
B
a
E
{O.l}*
M stops in a state different from
i.d.
M be a Turing machine
is the initial state of a
E
B.
M has a right-sided infinite tape.
is a word over the alphabet
QMvSM'
M. then
M
(Thus we leave it open whether
ql or runs forever, if
set of states and of symbols of M resp. ; let
An
Let
{O,~}*.
That is if' M is started with the instantaneous description
stops in the accepting state
we assume that
~
Let
a
E
B.)
QM and
For convenience SM
be the
$ be a new symbol; let
EM:=QMvS~{$
Thus a halting computation can be
474
FLEISCHMANN, MAHR, SIEFKES
described as a word y:=$da$d l$ ... $dm$ computation and the on the input (1)
y
a
are
{a,l}*
£
starts
di
on
y
where
Such a word
m is the length of the
y E EM'
is a halting computation
a:
y
for some y E EM'
computes correctly: for
if
some Yl' Y2 E E * , Y2 ., E, and some i.d. M
YZ =
y = Yl $ d $ Y2 ' then and some (the) i.d. d
a $ Y3
following
Y is an accepting computation (3)
EM
iff
y = $ qa a ~ ... ~ $ (2)
i.d. 's.
over
iff
d
for some Y3 E EM* according to M.
d
moreover
Y accepts: Y contains the letter
Thus
a E B iff
ql'
there is a word
y
satisfying conditions (1)-(3).
These three conditions are easily expressed with the help of concatenation of words, namely (1)
Y starts
on a:
3y E EM* , w (Z)
£
~.
START(y,a) :" [y
= $qa
Y computes correctly:
a w$
y]
COMPUTE (y): "
[y = Yl $ wlzlz2z3w2$ Y2 ~ Y2 ., E ~ ~
3Y3 E EM* ,
Here
M'
v
Z
E (SM
IJ
QM)*
[M'(z,zl ,z2,z3)
(3)
Y
Y2 =
WIZW
Z $ Y3]]
is some formula describIng the computation of M , e.g.
"
A(
V
qssqEM
v where
A
M is the table of quadruples of M.
accepts:
ACCEPT (y) :"
3YlYZ E EM' [ Y = YlqlY2]
v
BOUNDED CONCATENATION THEORY
475
Define the formula
Then
a
e;
B iff
COMPUTE(y)
"
3y [START (y, z) FM,a is true.
Thus we have reduced membership of
in a formal theory of concatenation of words.
by Quine in [Quine 1946].
ACCEPT (y)].
f\
B to truth
Concatenation theory was considered
He proved that elementary arithmetic and concatenation
theory are reducible to each other. 1.1
Definition:
Let
I
be a finite set.
The theory of concatenation over
CT(I), is the following first order theory without equality:
I,
There is one
ternary relation symbol whose arguments can be variables or constants from I*.
The atomic formulas are written as
x
=
yz.
Formulas are built up as
usual with the help of propositional connectives and quantifiers. of formulas is defined in the natural model We have defined
CT(I)
Satisfaction
I*.
as restricted as possible.
For example we write
concatenation as a relation, not as a function, in order to avoid complicated terms.
Therefore we have to understand most parts of the formula
shorthands for w
=
u =
1.2
CT(I M) - formulas, e.g. we have to replace
x y z
by
3v[v
w
by
3V[V = e
Definition:
Let
xy
C f E*
A
"
= vz ] ,
W
etc.
u = w v]
D S rr*
be sets of words.
C~ D , if there exists a total recursive function
a
£
FM,a as
C is reducible to
g: I* + rr* s.t. for all
I* a
£
C iff
ga
e;
D.
In the literature this relation is known as "many-one-reducibility". serves for transferring undecidability: if D.
C ~ D and
It
C is undecidable, so is
The above construction shows that any recursively enumerable set is reducible
to (the true sentences of)
CT(I)
for some I.
Actually we can do a little better.
Thus
CT(I)
is undecidable.
If we code the symbols from
I M into
CT({O,l})- formula.
{O,l}- words, we can transform the formula
FM,a into a
(Note that in the thus transformed formula
FM,a the input
a
appears also in
D,
476
FLEISCHMANN, MAHR, SIEFKES
coded form.) Now
Thus any recursively enumerable set is reducible to
CT({O,l})
in a theory
serves as a universal tool for proving undecidability.
T we can define concatenation of
reduces to
CT({O,l}).
T and
T is undecidable.
{O,l}- words, then
If
CT({O,l})
Normally elementary arithmetic is used
as such a tool; for our purposes concatenation theory will do better. We can picture the situation as follows: B ~
CT ({0, 1}) ~
T.
Instead of carrying out some diagonalization within the theory we produce an arbitrary set recursive.
We reduce
B to
T in question,
B which is recursively enumerable and not CT({O,l}), and then start off with
CT({O,l}).
We will see in the next section that it pays to structure the proof in such a way.
With different ingredients the same proof structure will serve in many
different settings. We remark that the construction works unchanged for nondeterministic Turing machines, too. all
a
input 2.
By definition a nondeterministic
{O,l}*:
€
a
€
TM
M enumerates
B iff there is an accepting computation of
B if. for M on
a.
Bounded concatenation theory. We will now sharpen the method of section 1.
bounds instead of just undecidability. by Cook [1971] to construct
We will transfer lower
TM-time
It seems that this idea was first used
NP-complete problems: every set decidable in
nondeterministic polynomial time can be reduced in a simple way to the problem in question. (where
t
CT({O,l})
Thus we start with a set
B of TM-time complexity about
is some rather arbitrary function).
t
We do the reduction to
carefully, and thus can do with words bounded in length by • t.
We reduce only this t-bounded concatenation theory to the theory and end up with a
TM-time bound
t
for
T in question,
T.
We start with the observation that the length of words which are referred to in the formula
FM,a' can be bounded if the computation time is.
BOUNDED CONCATENATION THEORY 2.1 t:
Remark:
Let
B
{D,O* be a set decided by a
~
is a total function.
~ ~rn
that for all
a e: {D,O* :
477
TM
M in time
in
where
t
Then there is a function
FM,a is true (sc.
t
iff
E*)
FM,a is true in
E
Let
< tn
steps.
in
n.
Since
assume that each i.d. 's
9- (a)
a e: B,
i.d.
Let
describe an accepting computation
ye: l:M*
M cannot visit more than has length
< tn+2.
tn+l
tape squares, we can
Since there are at most
tn+l
y, we can assume that
9-(y) < (tn+3)·(tn+l) +1.
Q.E.D.
Therefore we can introduce a bound for the length of words concatenated in
CT-formulas.
2.2
Definition:
The theory of
Let
m ~W
t:
be a total function,
t-bounded concatenation over
let
E be a finite set.
E, t - BCT(E), is like
CT(E)
except that in the atomic formulas the arguments are bounded in length by We write:
= YZ
x
A
thus the length of is defined in If
t
9-(x)
~
tn.
Here
n
t.
is an integer in unary notation;
such an atomic formula is about
n.
Satisfaction of formula
E* in the natural way.
is computable, then the theory
of the relations
x
= yz
A 9-(x) < tn
t-BCT
is decidable.
is finite; thus
In fact, any
t-BCT admits effective
quantifier elimination. We rephrase remark 2.1 as 2.3
Remark:
Let
B S {O,l}*
be a set decidable in time
total monotonic function, Le. tn < t(n+l).
t - BCT ({0, 0 ) Proof:
for some t
= O(t
Then
for
subs t i tuting
a
9-(a) + 9- (FM(x)) .
for
x
at
a
If we code
is sti 11 proportional to
9- (a) .
is a
B is reducible to
some k' .
Our next observation is that the reduction of Since
t
2).
k{(tn)2 + 4tn + 4) ~ (t(k'. n))2
theory is a very easy one.
t where
B into bounded concatenation
FM,a arises from a formula
single occurrence, E into
{D,O'
Q.E.D.
FM(x)
by
FM,a has the length
, the length of the coded
Also it is easy to imagine a
TM
FM,a
which, given
478
FLEISCHMANN, MAHR, SIEFKES
a, prints the coded formula 2.4
Definition:
Let
FM,a in time proportional to
C S 1:* , D
s
11*
(t(a))2.
be sets of words, let
s
.
and
be
t
total recursive functions. C is t,s-reducible to D C =:; D , i f C is t,s reducible to D through the function g : 1:* -.. 11* , and g is TM-computable iV time C ~
t
s.
Especially
for some polynomial
D
t,s
and space
t
C is polylin-reducible
and some linear function
to
D if
s.
The concept of polylin-reducibility plays an important role in
comple~ity
theory; see e.g. Stockmeyer [1974]. We strengthen remark 2.3 to 2.5 Remark:
B £ {O,l}*
Let
total monotonic function. some t =
°
be a set decidable in time
Then
t where t
B is polylin-reducible to
is
t-BCT({O,l})
a for
2
(t ) •
In order to get a lower time bound for bounded concatenation theory we need sets which are decidable in time exact
t.
known from abstract complexity theory.
We state the facts we will need following
the terminology of 2.6
Definition:
and
t
The existence of such sets is well-
A. R. Meyer. t: W -..~
A total recursive function
is computable in time
is a
clock
if
tn
~
log n
t'(log t)2.
Thus a clock is a function which is honest in a special way.
For details
and for a proof of the following facts see e.g. Fleischmann-Mahr-Siefkes [1976]. 2.7 time 2.8
Fact:
For any clock t there is a set
t· (log t)2 Fact:
but not in
time
B
~
{a, l}*
which is decidable in
t.
Most "sensible" functions are clocks, e.g.
for any rational c > 1. 2.9 rate.
Main theorem:
Let
t
be a monotonic clock with more-than-polynomial growth
Then for some rational
t-BCT({O,l}) .
c
>
° ,t(c.n)
is a lower
TM-time bound for
BOUNDED CONCATENATION THEORY Proof:
Use fact
2.7
tn < Time (B) By remark 2.5 Since
t
to get
is growing fast enough, there is a number
d·
< t(c·n).
~
~
d
and in space
M'
t~me ~ p(~(a))
TM M'
1M M deciding
decides
FM,a in time
B:
Given
2 p(~(a))
M in time
<
which is
a
< t(c·d·n)
t-BCT({O,l}) a
{O,l}*
~
and in space
contradiction for
Q.E.D. Let
t
be a function as in theorem 2.8.
words, for example a logical theory. T, then for some c >
° ,t(c·n)
If
t-BCT({O,l})
Let
T be a set of
is polylin-reducible
is a lower time bound for
Q.E.D.
If T is a logical theory, then normally we will reduce
T by defining the atomic formulas of
to
T.
Polylin-reducibility is transitive.
2.11 Remark: to
s.t.
•
2.10 Corollary:
Proof:
4
. (log t) ).
.
FM,a through
uses time
2
n.
Assume then is a
t-BCT({O,I})-formula
Thus
k
t-BCT({O,l}) , say in
~ d·~(a)
Then the following
then it decides
(a)
t(c·d·~(a)).
-1
p
° be a rational number.
c >
it computes the
c
for almost all
B is actually polylin-reducible to
in time
<
< t(kn)
° (t
t-BCT({O,I}) for some t
B is polylin-reducible to
for some polynomial Let
s.t.
< tn' (log tn)2.
(tn)2(log tn)4 Thus
B S {O,l}*
a set
479
t-BCT({O,ll)
in
T.
t-BCT({O,I})
That is we
have to find (i)
for any and
n
N and any a
~
csta(x)
(ii) for any model f: (O,l)* for all a,b,c
~
~
(iv)
fa
cst a
{O,l}*
formulas
conn(x,y,z)
in the language of T , and M of
T with domain
M a function
M S.t.
{O,l}*
(iii)
To get
~
satisfies
conn(x,y,z)
in
M iff
a
= be
/\
~(a) ~
cst (x) in M. a a polylin-reduction we require moreover that the formulas is the only element in
M
are proportional in length to
time polynomial in
n
and
~(a)
tn, and
satisfying
n
resp.
and
~
(a)
con
n
and
resp. , and computable in
FLEISCHMANN, MAHR, SIEFKES
480 2.12
Example (Fischer-Rabin [1974]):
Let
ThOR,+,l)
of addition over the real numbers, involving and
as the only constant.
f : {O,l}* +m+ c~ where
la
x = y + z tn:=2 n
One can use
be the elementary theory as
for
a
a
ternary relation
time bound, and
is the binary representation of fa, for the
translation function of
2.11.
Fischer-Rabin define in
ThOR,+,I) bounded multiplication n
x=yz
/\
I\z<2
ZE:"
by induction on n.
2
Bounded concatenation of
O,l-strings is then easily
defined through bounded multiplication of the corresponding binary numbers. For details see Fischer-Rabin [1974], or Fleischmann-Mahr-Siefkes [1976].
The
latter paper gives a variant of the Fischer-Rabin proof by A. R. Meyer which is of the type described here
and contains the definition of
Fischer-Rabin we have to supply the definition of be
a
of length
0, I-word
iff n + a 2n- l + x = 2 n_ l
n.
Let
cst a
conn'
From
a = an_I" .ala O
Then
x = fa
x = 2· (2. (2, .. , (2 3YI'·"Yn-l
+
..
+ a
iff O an_I) .•. + a 2) + a l) a O ,
[Yl = 2 + an_ l
Y2 = 2Yl + a n_ 2 "
/I
~ Yn-l = 2Yn_2 + "i
2 + a
3y[y
n_ l
3y[y
where
/\
2z + a /\ n_ 3 Z
Y
is
y
or
n 2
Z
Remark:
/\ x
x
=
•••
2'Yn_l + ao]
iff
1\
= 2~
+ a
o] ' .. ]]]]
depending on whether
The last formula is of length O(n) 2.13
/\
3z[z = 2y + a _
3Z[...
iff
n
is
even
, and can be used for
or
odd.
csta(x).
The last change in the above definition of the formula
necessary, since the formula above it is of length
n.logn.
cst a
is
It is possible if
we allow in our logical formalism formulas like
axl ... x...
3X[ ... x ... ]]
If we do not permit such we get is
cn/logn .
"collision of bounded variables", the best time bound
BOUNDED CONCATENATION THEORY
481
2.14
Example (Fischer-Rabin [1974]: A similar proof yields a lower TM-time cn 2 for Presburger arithmetic, the elementary theory of addition bound of 2 over the integers. - In both examples, the technically involved part in the polylin-reduction of
t-BCT({O,l}) to the theory in question, is the construction
of a so-called ruler
or
which states that
is of length
2.15
Remark:
x
yardstick.
Ln(x)
of length
D(n)
D(tn).
We conclude the section by reminding that the method works for
nondeterministic
TMs
as well.
extend to nondeterministic
3.
That is a formula
Thus the lower time bounds in examples
2.12+14
TM-time.
Bounded quantifier - free word theory. There are theories which allow to express words and bounded concatenation,
but which do not have quantifiers for words.
The only essential quantifier in the
formula
of the preceding section is the initial
computation".
The remaining quantifiers refer to subwords of
can change the formula
y.
Therefore we
FM,a in the following way: instead of using subwords we
specify the letters in the word quantifier
3y , saying "there is a
y
at various places ; and we drop the initial
3Y, and ask for satisfiability of the formula instead of truth.
Thus
our primitive notion will be Yi
=
s : "the word 'y
contains the letter
s
in the i-th position".
Since we have eliminated concatenation, we cannot use constants for words anymore.
Therefore we have to describe in
FN,a
the input
the formula
Taking into account the space for the indices 0(n 2)
or
2, .. ,n+l, this formula has length
O(n . logn) , depending on the coding used for the indices.
One way
out is to use a I-element input alphabet, and to replace the conjunction by an V-formula.
In exchange for this one has to find unary sets
wanted complexity.
B S 0*
of the
482
FLEISCHMANN, MAHR, SIEFKES Thus let
s
B
0* be a set of unary words.
in time, and thus in space t .
B
we define ( 1)
(2)
a
1\
vi [2
i
<
n + 2
/\
Vi [n + 2
<
i
~
EO
0*
,
n:=~(a)
,
TM
deciding
m:=t(t(a)) + 3
<
START(y,a) ::
Yi
0]
m ->- Yi
\1l]
->-
1\
[yi = s 1
s ls2 s3 s4
Y accepts:
1\
Yi+m+l = ACCEPT
1\
COMPUTE (y) .=
Y computes correctly
~
(3)
a
M be a
(cf. section 1) :
Y starts on input
Vi
For
Let
1\
Yi+1 = s2
Yi+2
s3
"
~
s4
Yi+3
~(slS2S3s4)]
(y) .=
a i[yi = ql] Define the formula START(y,a)
GM,a(Y):= We claim that a
EO
GM,a
/I
COMPUTE(y)
describes correctly the computation of
is satisfiable.
B iff
successive
That is, we claim:
M (which might or might not include a state
$), one can uniquely determine the second of the corresponding
four symbols in the next i.d. of
~(slS2s3S4)'
M.
For the proof one has to verify that given four
symbol~ of an i.d. of
symbol or the marker
ACCEPT(y) .
1\
The rest of
GM,a
M'S computation.
We denote this symbol by
is Gbviously correct.
non-deterministic we have to replace the part
(Note that in case
Yi+m+l = ~(sls2s3S4)
by
M is
a
disjunction to describe the choice of M.) 3.1
Definition:
Let
E be a finite set.
WT(E) , is the following elementary theory.
u,v, ... ,z i,j
for words over
Quantifier-free word theory over
E
There are two sorts of variables:
E, not quantifiable;
for positions (natural numbers), quantifiable.
Natural numbers in'unary notation are available as constants for positions, denoted
BOUNDED CONCATENATION THEORY by
k, ... ,n.
483
As a third type of term for positions we allow
i + n.
There are
three types of atomic formulas: u
i
=
s
for every
"the word i
i
u <
e E , with the intended meaning
5
contains the letter j
s
in the i-th position",
with their natural meaning.
Formulas are built up as usual; but remind that there are no word quantifiers. Interpretation is in 3.2
Definition:
E*.
Let
t : IN
-+-
IN
be a total function, let
t-bounded quantifier-free word theory over except that we allow
tm+n and
Here again
m,n
E, t-BWT(E) , is like
WT(E)
i + tm+n as additional terms for positions, and
require that any atomic formula conjunction.
E be a finite set.
=s
ui
has
i
<
n
or
i < tm+n added as a
are natural numbers in unary notation.
Note that, since the constants for positions are in unary notation, e.g. £(i + tm + n) = O(m+n).- The results of section 2 carry over to quantifier-free word theory with the same proofs: 3.3
Theorem:
rate.
Let
t
be a monotonic clock with more -than-polynomial growth
Then for some rational
c
>
0 , t(c·n)
is a lower
TM-time bound for
t-BCT( {O, l}). 3.4
Corollary:
Let
t
be function as in the theorem.
words, for example a logical theory. to 3.5
T,
then for some c > 0,
Example (A.R. Meyer):
calculus without identity.
Let
If
t(c.n) plk
t-BWT({O,I})
Let
T be a set of
is polylin-reducible
is a lower time bound for
T.
be the (first order) monadic predicate
This formalism contains quantifiable individual
variables, and non-quantifiable monadic predicate symbols, with atomic formulas Px.
The predicate symbols are interpreted as subsets of the domain.
subsets partition the domain into at most
2n
classes.
membership in these subsets provides binary labels elements of the classes.
In this way Meyer uses
Membership and non-
O,I, ... ,2 n
Any n
n-1
for the
predicate symbols to
484
FLEISCHMANN, MAHR, SIEFKES n 2
define an ordering of length
in any domain, and an additional
symbols to "print" letters from a l-element alphabet. formulas of
t-BWT(E)
where
tn:=2
n,
logn
space.
Thus
GM,a
From a binary set <
one gets a unary set
plk.
The reduction,
predicate symbols require
translates into a formula of length O(£(a) 'log £(a)).
A where for some
Time (A)
n
predicate
This defines the atomic
in any model of
however, is not polylin, since the indices of the
k
2
~
e
<
2
2n+ l n d < Time (B) ~
B s .t .
n 2
for
bad time bound is still enough to apply corollary 3.4.
d: =
Ie
<
12
This
Thus with these modi-
fications the above method yields a lower time bound of
2c.n/logn
for
plk.
For a detailed proof see Rackoff [1975] or Fleischmann-Mahr-Siefkes [1976] . 3.6
Example (Rackoff):
The same idea yields a lower time bound of
2c.n/logn 2
for the monadic predicate calculus plus an equivalence relation, e.g. equality. Here the ruler construction is somewhat more difficult.
See Rackoff [1975].
In our next example one can define arbitrary long words with short formulas.
Therefore in the formula
of the computation length of the
y.
GM,a
The function
t
one need not to bound the length is thus used only to measure the
i.d. 's , i.e. the space of the computation.
In this way the
general method is used to prove space bounds. 3.7
Definition:
Let
t:
~ ~W
be a total function, let
E be a finite set.
t-measured quantifier-free word theory over
E ,
except that we allow
as additional terms for positions
tm+n and
i + tm + n
t-~MT(E),
is like
WT(E),
(but do not require bounds for the position variables in the atomic formulas as we did in
t-BWT(E)).
For the next theorem we have to transfer the idea of a clock to space bounds. 3.8 space
We call such a function a "marker" for lack of a better term.
Definition: t
A function
but not in space
t:
m ~m
t(e'n)
is a marker if there is a set decidable in
for some e > O.
BOUNDED CONCATENATION THEORY 3.9
Theorem:
Let
t
be a monotonic marker.
a lower space bound for Proof:
Then for some c > 0,
t.
t(c'n)
Then
Start with a set decidable by a
B is polylin reducible to
t-~~({O,l})
through the
formulas 3.10
Q.E.D.
Corollary:
Let
If t-MWT({O,l})
t
be a monotonic marker.
is polylin-reducible to
a lower space bound for 3.11
is
t-MWT({O,l}).
Similar to the proof of theorem 2.9.
TM M in space
485
~
Let
T be a set of words.
T, then for some c > 0 ,t(c'n)
is
T.
(A.R. Meyer):
Let
W2A
be the weak monadic second order theory
of the successor function on the natural numbers.
That is the first order
theory enriched by quantifiable variables for finite monadic predicates (sets). W2A
was the first monadic theory to be shown to be decidable (BUchi [1960J), and
the first decidable theory for which an explicit lower complexity bound was proved (Meyer [1973] ; following Rabin he calls the theory WSIS). the natural numbers is definable in W2A , one can reduce tkn for large functions t. Let ton = n 2 tk+ln deciding a set of space complexity between
and
k
are used to construct a ruler of length
~,a
to W2A. 3.12
t
Therefore
Remark:
W2A
t k+l,
Since the order of t-MWT(E) Let
to
W2A
~ be a TM
The formulas
t k+l , and to reduce
tk-MWT(E)
is not elementary recursive in space.
As in section 2 , all results hold for non-deterministic
TM
complexity. 3.13
Example:
We combine the ideas of examples
NP-complete problems. p
and in space
Let
q, where
is polylin-reducible to
B be a set.
Let the
p , and thus also
r-BWT(E)
where
3.5+11
r
to deal with
TM M decide
B in time
q, is a polynomial.
= O(p'q)
t-BWT(E)
E
except that we allow arbitrary polynomials to enter
the position terms and the bounds for the word variables. NP-complete, and mignt be a better tool to prove propositional calculus
B
is again a polynomial.
Thus we define polynomial-bounded quantifier-free word theory' over P-BWT(E) , like
Then
Then
P-BWT(Z)
NP-completeness than the
(Cook [1971J, Karp [1972J),
is
FLEISCHMANN, MAHR, SIEFKES
486
3.14 and
Remark: GM,a
It has been observed by several people that the formulas
are of a fixed structure independent of
a fixed number of quantifiers. theorems
2.9, 3.3 and 3.9
is a lower
a.
Expecially they contain
We can therefore strengthen the result of
to:
for some c
>
° and some
k
E~
TM-time (resp. space) bound for truth of sentences of
(resp. t-BWT({D,l}) , or
FM,a
t-MWT({D,l})) with
result transfers to the theories in examples
+
t(c·n) t-BCT({D,I})
This stronger
6 (see Rackoff [1975]).
It does not transfer to the theories in section 2, since there we need C(n) quantifiers (though only 2 different ones) to describe an input of length
n.
Obviously it does not transfer to non-elementary theories as in example 3.11.
4.
Exponentially bounded word theory and circuit complexity. In this section we will use exponentially bounded word theory to prove
that even finite parts of the elementary theory of real numbers under addition (example 2.12) are of exponential complexity under a suitable measure for finite problems.
This is a stronger result than the exponential
lower bound of section 2.
A TM
TM-time
can always be sped up to real time and space
on a finite part of its input by building in a table-lookup for that part. Therefore
TM-time or -space bounds are valid in the limit only.
Any finite problem can be coded into an f: {D,l}n
+
{D,l}
for some n.
n-argument
Boolean function
We restrict ourselves to such functions.
Boolean functions can be computed through circuits (or logical networks) ; thus their complexity is customarily measured by the size of a minimal circuit. A circuit consists of two-argument Boolean functions wired together in a loop-free way.
More formally, a circuit is a finite acyclic connected directed
graph with the following properties: (i)
every node has either two or zero ingoing edges ;
(ii) nodes without in going edges are called input nodes , and are labelled with different natural numbers ; (iii)nodes with two ingoing edges are called gates, and are labelled with
BOUNDED CONCATENATION THEORY
487
two-argument Boolean functions (and a correspondence between arguments and edges). The size of a circuit is the number of its gates. gates computes an obvious way.
n-argument
f , C[f].
Boolean function
°
>
(1
n
+ E) 2
/n
(Fischer [1974]).
Schnorr [1975] {O,l}
7
is called the
< Zn (Shannon, see Fischer [1974]) and
that there are functions of about that complexity, namely E>
f
It is wellknown that the circuit complexity
n-argument Boolean function is
f: {O,l}n
input
Convers.ely any' Boolean function can be computed by a circuit.
,circuit complexity of
for any
n
Boolean function at any of its nodes in an
The size of a minimal circuit computing a
of an
Any circuit with
TC[f]: =min M
defines the
TM-complexity
of a Boolean function
as {~(M)
'TimeM(n) 'log SpaceMen) ; M computes f).
Here
M ranges over all
TMs
which on input
~(M)
is the number of instructions of M
SpaceM(n):= max {SpaceM(x) ;
~(x)
x
E
{O,l}n
compute
f(x) ,
and
= n}.
Fischer [1974] and Schnorr [1975] show that both measureS for finite functions are strongly related, namely C[f]
~
c
l'
TC[f] , TC[f]
<
C
z • (C[f])
2
These relations show that the notion of circuit complexity
is not as accidental
as it looks. Both notions extend naturally to infinite functions f: {O,U* of length
7
{o,n.
If
f n:= fl{O,nn
is the restriction of
f
to arguments
n, we define
C[f](n):= C[fn] , TC[f] (n):= TC[f n]. The above relations between
C and
however, should not be mixed up
TC
still hold for any
with the ordinary
n .
TC[f] ,
TM-complexity of infinite
FLEISCHMANN, MAHR, SIEFKES
488 functions.
For any
TM
M which computes all of the function f:{D,l}*
we have obviously for all TC[f] (n)
~
~(M).
n
TimeM(n).
log SpaceMen}
There is, however, no reverse inequality, since by Shannon's theorem even for a non-computable
{D,l}
+
f.
We will use the term finite
TC[f](n)
TM-complexity for
A. R. Meyer uses exponentially bounded concatenation theory to show that the elementary theory of addition over the reals has exponential circuit complexity, and thus also exponential finite improves the
TM-time lower bound of example 2.12.
Fleischman-Mahr-Siefkes [1976]. Let
EBCT:=
t-BCT({D,l})
concatenation theory. A ~ {D,l}n
TM-complexity.
Let
A
This result
A full proof is given in
We will give an outline here. n ten} = 2
for ~
{D,l}*
be the exponentially bounded
be such a set that for any
is of maximal circuit complexity.
n ,
By the results of Shannon and
Fischer, en < C[A] (n) < 2n for all
e
<
2
one proves that
and sufficiently large EBCT
polylin-reducible to
c >
By polylin-reducing
has exponential circuit complexity. ThOR,+,l)
A to
Since
EBCT
EBCT
is
by example 2.12 , one gets
en < C[Th(lR,+,l)] < for some
n.
z"
and infinitely many n.
The main step of the proof is the reduction of the circuit-hard set to
EBCT.
sequence.
For this purpose one codes the graph of a circuit into a finite Thus one can describe a computation of a circuit through words over
a finite alphabet. x
E
A
A ~=>
Now one defines
A by
there is a function for any function and
f:{D,l}~(x)
g: {D,l}i(x)
+
+
{D,l}
s.t.
{D,l} , C[g]
<
C[f],
fx = 1 .
With the techniques of section 2 one describes functions and circuits which are actually sequences of length
n, 2 by
EBCT_formulas of length
D(n).
<
n 2
TC.
BOUNDED CONCATENATION THEORY
I
{O,l}n
A to
EBCT.
In this way one defines thus polylin-reducing
A
by an
489
EBCT-formula of length
D(n) ,
Finally one shows that polylin-reducibility
preserves exponential circuit complexity.
This proves the theorem.
Obviously the proof works for any theory to which exponentially bounded concatenation theory or exponentially bounded word theory (with word quantifiers) is polylin-reducible.
Thus the elementary theory of addition over the natural
numbers and the weak monadic second order theory of successor are also both of exponential circuit complexity. The method does not work for the theories in examples 3.5+6 (first order monadic predicate calculus) , since
EBCT
cannot be reduced to these theories.
Instead we have to use the above mentioned connections between circuit complexity and finite set
A
~
EBWT:=
theory.
The,yimply that for the above defined
{O,I}'
n TC[A] (n) > d Let
TM-complexity.
for any
t-BWT({O,l})
d for
<
2
and sufficiently large
tn:= 2n
n.
be the exponentially bounded word
The method of section 3 yields
TC [EBWT] (n) > c
n
for some c
>
1
and infinitely many n.
This implies TC[plk] (n) > en/log n
for some c > 1
and infinitely many n.
Thus the same is true for the circuit complexity of left to the reader. used to show that
plk.
The details are
See also Stockmeyer [1974] where a similar proof is W2A
(example 3.11) and the (undecidable) first order
arithmetic both have exponential circuit complexity.
490
FLEISCHMANN, MAHR, SIEFKES
Bibliography BUchi, J. Richard [1960]: "Weak second-order arithmetic and finite automata". Zeitschrift Math. Logik 6, 66-92. Cook, Stephen A. [1971]: "The Complexity of Theorem Proving Procedures". Proc. 3rd Ann. ACM Symp. Theory of Computing, 151-158. Fischer, Michael J. [1974]: "Lectures on Network Complexity". Lect. Notes Univ. Frankfurt/Main, 25 pp. Fischer, Michael J., Rabin, Michael O. [1974]: "Super-exponential Complexity of Presburger Arithmetic". Proc. AMS Symp. Complexity Real Computational Processes; also M.LT. Project MAC TM 43, 24 pp. Fleischmann, Klaus, Mahr, Bernd, Siefkes, Dirk [1976]: "Complexity of Decision Problems. Notes on a Course by A. R. Meyer". InformatikBericht, Techn. Univ. Berlin. Gtlde1, Kurt [1931]: "Uber formal unentscheidbare Slitze der Principia Mathematica und verwandter Systeme, I". Monatshefte Math. Phys , 38, 173-198. Karp, Richard M. [1972]: "Reducibi Ii ty among Combinatorial Problems". In "Complexity of Computer Computations", IBM-Symp. Yorktown Heights, R. E. Miller and J. W. Thatcher ed., New York, 85-103. Meyer, Albert R. [1973]: "Weak Monadic Second Order Theory of Successor is not Elementary-Recursive". Boston Univ. 1972/73 Logic ColI. Proc., Lect. Notes Math. 453 (1975), Springer-Verlag, 132-154 ; also M.I.T. Proj ect MAC TM 38, 24 pp. Quine, Willand van Orman [1946]: "Concatenation as a Basis for Arithmetic". J. Symbolic Logic 11, 105-114. Rackoff, Charles Weill [1975]: "The Complexity of Theories of the Monadic Predicate Calculus". Rapport Laboria IRIA no. 136, Rocquencourt, France, 13 pp. Schnorr, Claus Peter [1975]: "The Network Complexity and the Turing Machine Complexi ty of Finite Functions". .Jahrest agung GAMM Gtlttingen ; also report Univ. Frankfurt/Main, 18 pp. Stockmeyer, Larry J. [1974]: "The Complexity of Decision Problems in Automata Theory and Logic". M. LT. Project MAC TR-133, 224 pp.
R. Gandy, M. Hyland (Eds.l. LOGIC COLLOQUIUM 76 © North-Holland Publishing Company (1977)
THE NETWORK COMPLEXITY AND THE BREADTH OF BOOLEAN FUNCTIONS C . P. Schnorr. Fachbereich Mathematik Universitat Frankfurt Frankfurt a.M. We give a short surview on the relations between n~work size anlthe Turing machine complexity of Boolean functions as well as on linear lower bounds on the network size. We consider the finitary costs of nondeterminism in terms of the network size. We introduce and analyse the breadth-measure for networks and Boolean functions. We establish relations between network size and breadth. INTRODUCTION Whereas most previous work in computational complexity theory focused on the asymptotical behaviour of the run times which are associated with infinite problems (i.e. problems with an infinite input set) we shall be concerned with the complexity of finite problems, i.e. with the complexity of Boolean functions. We shall relate two complexity measures for Boolean functions, the network size and network breadth. Network size seems to be the fundamental measure as can be seen from strong relations to the Turing machine complexity. However this measure is hard to analyse and this analysis was not yet very successful up to now. The network breadth of a Boolean function f is a somewhat simpler but nevertheless interesting measure which can be completely analysed by counting various sets of subfunctions of f . This analysis of the breadth also gives some insight into the network size measure. THE ASYMPTOTICAL VERSUS THE FINITE POINT OF VIEW AN EXAMPLE In order to demonstrate that the asymptotical and the finite point of view really makes some difference in theory of computational complexity, we consider an important example. The computational complexity of deciding whether a given conjunctive form is satisfiable or not, turned out to be a very fundamental problem in computer science. According to Cook (1971) and Karp (1972) this satisfiability problem underlies various combinatorial search-type problems. The functional problem which is associated with the satisfiability problem is as follows. Given a CF
0' = (where the Zj'V variables
1x i
variables x
1'
n
/\
i=1
(z. 1 v z. 2v ... v z. 1, 1, 1,r i )
either are Boolean variables Xi or negated Boolean
) , find Boolean values Y1 ""'Yn for the Boolean
... ,x n of
1"such
that
-0 is
491
satisfied under Xi : = y i
492 i
=
C.P.SCHNORR 1, ... ,n . Let Cod (1") E x..... with X
=
!.O,1~
be a standard binary
encoding of CP' s '0, then a partial function 'It: X'"-} a functional solution of the satisfiability problem, if (1)
dom ("If-)
=
te od
(0)
\
0'
r
is called
is satisfiable}
(2) for all satisfiable CP' s -0: -0 is satisfied under the sequence (Cod ("0» of Boolean values.
t
It is a surprising observation due to Levin (1972) that the asymptotically fastest functional solution is already known, see Schnorr (1976 c) for a proof. Let 7ri be the i-th partial function on a RAMmachine and let Ti(X) be its run time on input x . THEOREM
Levin (1972)
f o for the satisfiability problem such that for any other functional solution f j :
There is a functional solution
To(x):iO (Tj(X) + Ixh. Here Ixi solution
is the length of x . Nevertheless this optimal functional does not yield any specific hints for the.computation
fo
of a satisfying sequence of Boolean values for a given CP merely diagonalizes over all RAM-algorithms
fi
the asymptotical optimality does not mean that respect to CP's with reasonable size.
0-. f
o for i eN. Therefore
fo
is efficient with
The complexity of a finite problem such as a Boolean function n f : X -? X cannot be adequately measured by the minimal run time of all Turing programs for f since any n-ary f can be computed in n. time 0 (n) by some Turing program p with size HPII "" 2 Here the size IIp/l of p is the number of instructions. Instead of this we should ask for the possible trade-offs between program size, run time and space with respect to all Turing programs for f . A typical question is: how fast can we decide with a program of reasonable size whether Cp's with binary, length:li 100 are satisfiable? THE NETWORK SIZE OP BOOLEAN FUNCTIONS A logical network (3 such that
is a finite, acyclic graph with labelled nodes
(1) every node v either has indegree 0 or it has two ordered entering edges. The nodes with indegree 0 are called entries. (2)
the entries ~ are labelled in one-one manner with Boolean variables op(~) , the non-entries v are labelled with binary 2 ~ X . Boolean operations op (..,;» : x
Each node v computes some Boolean function the input variables of ~ :
.,)
res~
that depends on
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS for all entries
op ("" ) op (t c
)
[
493
~
V,
" .. ] for any non-entry y resp with ordered' edges ..;)1 --'t v , v ~ " • 2 res~,
It is understood that (3 computes the functions res~ with v£,(3 . Size (~) is the number of non-entries in ~ . With every Boolean function f we associate the network size C(f) : C (f)
=
def
min
t
size ({3)
I~ computes f} •
The network size of f is closely related to the Turing machine complexity of f . Let p be any Turing machine table which computes some Boolean function res With such a program p we associate: p' (1)
the run time T
i.e. T is the maximal run time of p p' p on the inputs x of res p'
(2)
the program size lip ", i.e. IIPI! is the number of instructions of p .
(3) the space bound Sp' i.e. Sp is the total number of tape squares (including those that store the input string) which are used during the computation of p on any input string x of res p' The following theorem has been developed in several stages by Fischer, Pippenger, Savage and Schnorr, see Fischer (1974), Savage (1972), Schnorr (1976 b) • THEOREM 2 For every Turing program p which computes some Boolean function f : C (f) ~ c Tp (llpl/ + Ig Sp ). Here c only depends on the number of tapes and the sQze of the alphabet of program p . Theorem 2 even holds for Turing machines that start with non-empty work tapes. The initial contents of the non-input tapes can be considered as an oracle. However this oracle is not evaluated by special oracle instructions but by the standard Turing machine read- and write-instrpctions. It has been observed by Schnorr (1976 b) that there is a converse to theorem 2 • THEOREM 3 (1) Let p range over all Turing programs with initially non-empty work tapes (oracles), then for all c. >0 :
C.P.SCHNORR
494
(2) Let p range over all Turing programs with initially empty work tapes, then for all E.> 0
Theorems 2 and 3 mean that there is a Turing program p for the Boolean function f with reasonable size lIPII and feasible run time T if and only if f has reasonable network size C(f). P In the following let Cod (0) be a binary encoding of CF r s n
b(i,1)
0=/\
(x
i=1
G"(i,1)
b(i,2) vX
u(i,2)
b(i,3) "X
b(i,3)
)
such that ICod ("6")\ !i: 0 (n 19 n). Here 6'(i,-") with 1 ~ 6'(i,~) ~ n is the index of a Boolean variable and b(i,~) & X is the negation exponent of the literal xb(i,~). In particular let Cod (iI) be a G"( i ,-,,) standard binary encoding of the list (b(i,~), S'(i,.,)) 1'.,)= 1,2,3, i = 1, ... ,n) of binary numbers. Let Sat : x n ~ X n
be the Boolean function with Sat (x ) = 1 n
iff x = Cod (-0)
for some satisfiable CF
"6".
Then theorems 2 and 3 yield the following. CONCLUSION:
=
There is a feasible Turing program for deciding whether CF's with binary length n are satisfiable if and only if C(Sat is of n) reasonable size. In particular.theorem 2 yields the following corollary concerning the classes P and NP of sets which can be decided within polynomial time by deterministic (non-deterministic,' resp.) Turing programs. COROLLARY Suppose C(Sat
n)
is not
polyn~mially
bounded in n, then NP
~
P .
From the asymptotical point of view, determining the computational costs of nondeterminism leads to the question whether or not P = NP However P = NP or not, does not say anything with resoect to the finitary costs of nondeterminism. Using the concept of network size we are now able to measure the finitary costs of nondeterminism. For instance this can be done by the network size of sat But it is even n. possible to express the finitary costs of nondeterminism without refering to a special binary encoding of CF's. With any Boolean function f(y,x) we associate the function f 3 y with 1
f'3"V (x) F
{ 0
if
3Y :
otherwise
f (y ,x)
= 1
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS We define NDc(n) = MAX
LC(f
3y
) ·1 C(f)
495
= n}
where f ranges over all Boolean functions f(x,y) with C(f) = n . The function ND measures the cost of nondeterminism (i.e. of one C 3-quantificatio~with respect to network size. A similar function can be associated with any other cost measure for logical networks, e.g. depth and breadth. The functions C(Sat ) and NDC(n) are closely related: n THEOREM 4
For some c e N:
(1) NDC(n)
=C(Satcnflg nl)
(2) C (Sat
n)
=ND
C
+ c n(lg n)3
(cnrlg n1
2)
PROOF (1) Let
f3
be a network for f(z1"",zk' zk+1"",zm) with size(~)
= n. For zk+1, ... ,zm 6 X
we reduce
3z 1,···,zk: f(z1,···,zk, •. ·,zm) = 1 to Sat c nrlgnl Suppose f3 has entries -o with op (,,) = z-y for non-entries" with ordered entering edges "'1 for v = m + 1, ... , n . We generate i'l CF '0 from r
1\
v=m+1
(z" = op (,,) [z
by dissolving each
"'1
~oolean
, z"
for some c.
,,= 1, ... ,m
and
v , "2
-"1 "
--'1
J )
2
expression
z'll= op (,,) [z
"1
, z"
J
2
into at most 4 clauses with 3 literals and Boolean variables z,,' z" ' z" . The CF 0- depends on the Boolean variables z1 , ... ,zn 1 2 and has at most 4 n clauses. Let '"b be obtained from zk+1,···,zm '0 by fixing zk+1"'" zm' Then the construction of "0 implies: for all zk+1"'" zm E X ~
zk+1 , ... , zm
is satisfiable iff
3z 1, ... ,zk: f (z1, ... ,zk, ... ,zm) = 1. We can assume that C (ii
zk+1' ... , zm
)
= c n rlgnl
techniques the function Cod
(-0
zk+1""
,zm
)
~:
for some c eN zk+1, .•• ,zm
Based on sorting
~
can be computed in 0 (n (lg n) 2) Turing
496
C.P.SCHNORR machine steps uniformly in k,m and n, see Schnorr (1976 d). Hence C (1t) ~ 0 (n (lg n ) 3) • Therefore
by theorem 2 C( 3z
... ,zk
1,
C (sat c n flgnl
ct u ,
+ C(Sat c n rlgnl)
[ Cod (11)1 = n . These CF's 2n X --) X be n the Boolean function with f n (z1"" ,zn'x) = 1 iff x = Cod (0) for some CF 0 and is satisfied under z1"" ,zn •
(2) Let-o range over all CF's with
depend on at most n Boolean variables. Let f
-r
Then Satn(x) sorting
=
1 ~
techniques f
3z 1, ... ,zn : f(z1"
.. ,zn'x)
=
1. Based on
can be computed within O(n 19 n) Turing
n steps uniformly in n. Therefore theorem 2 implies C (f ) n
~
0 (n (lg n ) 2). This yields C (Sat ) n
some c EN
~ ND C (cn rIg nl 2) for
BOUNDS ON THE NETWORK COMPLEXITY It is a well known result due to Shannon and Lupanov that max
tC(f)
I
all f
:
x" -)
and that the fraction of those f
X} 62 : x
n
n/
n (1+0(1))
- ) X such that C(f) S2
n/
n converges to 0 as n increases. Although almost all Boolean functions f : Xn ~ X have network size C (f) N 2 n/ n it is a challenging problem to prove high lower bounds on the network size of specific Boolean functions. Of course a function f with high network size can be n constructed by taking the first function f : X ~ X within an : x" ---7 X such that C(f) ~ 2 n/n i However we are not interested in such artificial functions but we like to know the network size of the naturally occuring functions. Only linear lower hounds have been proved up to now for those functions. enumeration of all functions f
Wi th any Boolean function f = f (x , ... ,x 1 n) set of subfunctions
that are obtained by fixing the variables x permutation on
t1,2, ••• ,n}
and k
~
n we associate the
• Let 1"",xk then we consider the function
~
be a
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS fr = f(x
497
r(1) , ..• ,x~(n)) which ~s obtained from f = f(x 1, ... ,x n) by
permuting the variables according to been proved in Schnorr (1974).
~
. The following theorem has
THEOREM 5 Let f
=
f(x
1,
.•• ,x
n)
V permutations 6': Then C(f)
~
and suppose that 3g
E
F(f6",n-lt): 1\F(g,2)1\ 2- 3 .
2n-3 .
The proof is based on an easy inductive argument. In any network~ for f there is a nod e w with two entering edges i --? 'V , j ~ v where i and j are entries. The property of f implies that a further edge either leaves i or j. In the case of
\'- I -, / Xi-
<J
,
x, j
'\)
I
~
we can eliminate at least 2 gates from ~ by fixing xi and the graph of the reduced network does not depend on whether we fix xi to 0 or to 1. This elimination process can be Ci:lLr~ed on inductively. Theorem 5 together with the evaluation of the network size of equivalence, see Schnorr (1976 a): n n C ( /\ x , v /\ lXi = 2n-3 i=1 L i=1 yields a 2n-3 linear lower bound on the network size of all except 8 symmetrical Boolean functions f~x1"",xn): COROLLARY (1)
C (f) f
~
2n-3
n : x -7
(2) C (f)
1
(3)
n-1
for all symmetrical Boolean functions X
except those under (2),
for the two constant functions n
n
C (f)
(3)
if f is either
A
i=1
xi or
/\ l x .
i=1
~
n
or 2:. x . mod 2 or if f is the negation of one of i=1 L those functions. Paul (1975) and later on Stockmeyer (1976) have extended the above methods. They succeeded to prove 2.5 n linear lower bounds. In particular Stockmeyer is able to evaluate the network size of the sum modulo 4:
498
C.P.SCHNORR n
L:
C(
i=1
xi = 0 mod 4) = 2. 5 n + 0 ( 1 )
Also Stockmeyers 2.5 n-lower bound applies to at least one half of all n-ary symmetrical Boolean functions. These lower bounds on the network size of symmetrical Boolean functions compare favorable with known upper bounds. Muller and Preparata (1975) observed that all symmetrical Boolean functions have linear network size. The binary representation of
n ~
x , can i=1 L be computed with network size 6 n + 0(1) . Any symmetrical f has a representation f (x
MAX
THE
£C(f)
B~EADTH
I
.. ,x
n
L.
xi) for some rIg (n+111 - ary i=1 g. This implies C(f) ~C(g) + 6 n + 0(1). Moreover the Shannon-Lupanov upper bound yields C (g) .: 0 (n/lg n) . Therefore 1"
n)
= g (
all n-ary symmetrical f}
~ 6 n + o(n).
OF NETWORKS AND BOOLEAN FUNCTIONS
Theorem 5 yields small linear lower bounds on the network size, which are based on the size of the sets of subfunctions. In order to elucidate the relations between the network structure and the size of the sets of subfunctions we introduce another complexity measure for logical networks. The breadth of a network formalizes the number of intermediate results which have to be stored during the evaluation of the network. The following pebble game has been used to prove a time storage trade-off, see Hopcroft et al. (1975). DEFINITION OF THE PEBBLE GAME Given a directed acyclic graph with goal node ~ and an infinite supply of pebbles. Move pebbles according to the following rules. (I)
if pebbles lie on all pred~cessors of node pebble may be put on node ~.
(II)
pebbles may always be removed.
(III)
pebbles cannot be moved twice on the same entry.
~
then a
The goal of the game is to move a pebble on the goal node using the minimal number of pebbles. Let B~(~) be the minimal number of pebbles which are needed to play node ~ in ~ . The breadth of a Boolean function f is defined as B(f) = mint B,,«(3) ) res; = f} . Here
n
p
ranges over all networks that compute f. For instance
B( i~1 Xi)
=3
as can be seen from the following network:
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS
499
Similarly the conjunctive normal form of any n- ary Boole/n f can be played w.i th not more than n + 3 pebbles. If n pebbles always lie on the entries then a conjunction can be played with 2 additional pebbles and one additional pebble suffices to store a disjunction of conjunctions. This proves Lemma 1
B(f)
6
n+ 3
for all n- ary f .
It has been observed by Mehlhorn that without the restriction (III) any function can be computed with at most 4 pebbles by playing the conjunctive normal form. Therefore this restriction is necessary in order to get a non-trivial measure. The breadth measure similarly to network size and depth depends on the choice of a finite, complete set of Boolean operations (we choose the set of all 16 binary logical operations) I however this dependence can almost be eliminated. Let R,S be two finite, complete sets of Boolean operations, then simulating the operations in S by networks based on operations in R yields: B
R
(f) £. B
s (f)
+ max
t
B (b) R
I b
&
S}.
The analysis of the breadth of f will be based on the sizes of the sets of subfunctions, the following concept of mode will be important~ mode (f) d~f
min G-
m~x
II F (f6"',
k)\I •
Here S' ranges over a,ll permutations on the variables of f.
THEOREM 6 19 mode (f)
(1)
B(f)
(2)
B (f) ~ 3 fIg mode (f)l
~
+ 5
PROOF
P
(1) Consider the minimum breadth network for f. Let Q be the permutation such that the entry with x o ( i + 1 ) is played after the entry with variable x~(.) within the minimal pebble game on ~ . Let 1 max k
1/ F
(f6'" , k)
II •
500
C.P.SCHNORR Consider the stage of the minimal pebble game on
~
where
x~(j)
has been played but x~(j+1) not. Let ~1""'~r be the nodes which currently have a pebble. Then there is a Boolean function g such that f
!)
(V,';>r
= g resf3 , .•. ,res13
r
x(5"(j+1),,,,,x6'(n)
)
.
Hence F(f6"',j) = F(g,r). This implies B(f)
~
r z, 19 II F (fo ,j)l/ z 19 m0de (f) •
and m = max II F (:f'" ,k) II .. ·,x n) 1, k B(f) !: 3 rlg ml + 5. Let m = rlg I/F(f,k)/ll k
(2) Let f = f(x
consider the following equivalence relation Y 1 Y2 ••• Yk
rv
y~ y; ... Y'k
We shall prove k = 1, ... ,n. We rv
on Xk:
:')
I.::.
def
f (xi= y i I i= 1 , ... , k ) The different equivalence classes can be represented by different Boolean vectors with length m Therefore there exist functions k. k gi(x i=1, ... ,m such that 1,,,,,xk) k for i
'=
1 , •.. ,m k
iff
f (xi = Y i I i = 1, ..• , k ) = f (xi = yi I i = 1 , ... , k) k+l k Obviously g. only depends on g. j = 1 , ••• ,m. and on x k+ 1 ' f 1 J K n-l n-l only depends on g1 ' g2 and x n' Therefore we can compute f in n stages where in stage k + 1, g~+l computed from g~ need:
i = 1, ..• ,mk
and x
k+
j = 1, ... ,m are k+ 1 t ' Within stage k + 1 we
pebbles in order to store g~
m + 4 k
i=1, ... ,m and x k k+ 1 pebbles for the computation of an arbitrary k+1 k k g. (g 1" .. ,gm J
k
'X
k + 1 ) · Here we use Lemma 1 .
pebbles in order to store g~+l
j
1, ... ,m + 1 • k
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS Therefore compute (2).
m~x
(2~
501
+ m + 5 pebbles are sufficient in order to k+ 1)
f. This proves B(f) ~3 rIg m]+ 5 and yields the assertion ~
k
The above computation of f by computing recursively the functions gj also bounds the network size of f in terms of the breadth of f . The following theorem shows that high breadth is a necessary condition for all functions with high network size. THEOREM 7 For all n-ary f: C(f) feO (2 B (f ) n ) . PROOF Consider the computation of f in the proof of theorem 6. Within stage k + 1, the functions
g~
i = 1, ...
k+1 gj
,~
g~+1
and x
j = 1, ...
k+ 1
.
'~+1 are computed from
Given the functions
is a function depending on m + 1 k
g~
i= 1 , ... ,m
k
'
arguments. Hence the
Lupanov-Shannon upper bound implies that 2 mk +
1
/ (m k+1)
arbitrary k+1 g,
(1+0(1)) gates are sufficient in order to compute an
g~+1 J
Hence 2
mk+ 1
(1+0(1))
gates are sufficient to compute
j=1, ... ,m + Since m :!f:.B(f) the entire computation of f k 1. k requires at most 2·2 B ( f ) n(1+o(1)) gates. ~ J
On the other hand we can explicitly construct functions that almost have maximal breadth and which have relatively small network size. With a binary sequence x '" x we associate the number n 1x 2 n
L.
i=1 We define
g n 2
g2 n (x 1 ' ... , x with
2n
2n + n 2 --'> X
X
as follows:
,y . ,y , ••• r y ) 1,1 1,2 n,n
502
C.P.SCHNORR
THEOREM 8 The functions g (1) B(g
2n
2n
) ~2n-n
PROOF in theorem 6, it suffices to prove
(1) Because of (1) mode (g
2n
= min
max
S"
)
~
2
2n - n
. By definition mode (g
UF (gG" , k )11. Let
) =
G" be the permutation that takes
2n
k
2n
the above minimum. We construct a subfunction y.
, with 1
i, j
s,
"" (yk , 1 ' . oo , Yk , n)
=
~,J
~
n
6" ( 2n - k)
for k =
The variables x., j = 1 , ••. , 2 n_n J occur by fixing the var iables x the 2 2 x
n 2 -n
2 n _n
g
of g2
such that 0, . . . ,
n - 1 .
are subfunctions of 2n - i
for i =
by fixing
n
0, ••• ,
different possibilities for fixing x
g
which
n - 1 .
Conversely
n yield 1,···,x 2 -n
different subfunctions of g which depend on the variables i=o, ••• ,n-1. This holds since each c ' determines the value
2n - i
J
of the subfunction
g (xi''= c
(a j 1 j=o,oo.,n-1), Le.
i
I '~ =1 , ... , 2n_ n ) at some place
ef(x,:=c, J
j=1, ... ,2n_n)(ajlj=o,oo.,n-1)=ci
J
i f the a . are chosen such that ef( x J
Thus we know
\I
. 2n_ j.=a j
n
F(g, 2 n_n)11 : 2 2 -n.
I 'J=o,.oo,n-
1) = x ~, •
Because of
this implies mode (g n) 2
z
(2) It follows by standard methods that C(x 0(. ( x.
~1
k
=
,oo .,x. )) fO(2
1, ••• , n
~k
k).
This impliesC(zk) ,r,.O(2
n)
for
n
2 2 -n
NETWORK COMPLEXITY AND BREADTH OF BOOLEAN FUNCTIONS
503
The upper bound in theorem 8 can be improved by a more detailed n analysis to C (g ) !:: O(2 Ig n ) , Moreover g can be computed in 0(2
n)
2n
2n
Turing machine steps uniformly in n: sort
o(.(Yk,l""'Yk,n) permutation
::r
on
k= 1, ..• ,n ll, ..• , n
1.
and compute the corresponding Then generate the sequence
(z'>-(k) I k=l, •.. ,n) withzk=x ) within one pass " oI.(Yk,l'···'Yk,n over xl ,x 2'··· ,x2n , compute (zk I k = 1, •.. ,n) by inverting the permutation
~
. Finally compute x
o«zl,···,zn)
CONCLUSION Theorems 7 and 8 show that the network size is a stronger measure than the breadth in the sense that high network size implies high breadth but not conversely. The example of the functions g demonstrates that any conditions on the sizes of the sets 2 n of subfunctions of f are not sufficient to imply a high network size of f and by the above remark such conditions are not even sufficient to imply nonlinear running times on Turing machines. ACKNOWLEDGEMENT: I thank H.Bremer for reading the manuscript. REFERENCES Cook, S.A. (1971). The complexity of theorem proving procedures, 3 rd ACM Symp. on Theory of Computing. Fischer, M.J. (1974). Lectures on network complexity Preprint Universitat Frankfurt. Harper, L.H., Hsieh, W.N. and Savage, J.E. (1976). A class of Boolean functions with linear combinational complexity. Theoretical Computer Science 1. Hopcroft, J., Paul, W. and Valiant, L. (1975). One time versus space and related problems. 16.th. IEEE Symp. on Foundations of Computer Science. Karp, R.M. (1972). Reducibility amony combinatorical problems. in: Complexity of Computer Computations, R.E.Miller and J.W.Thatcher, Eds., Plenum Press, New York. Levin, L.A. (1972). Universal enumeration problems (russian) Problemi Peredaci Informacii, Tom IX. Lupanov, O.B. (1958). A method of circuit synthesis. Izvestia V.u.Z. Radiafizike, No 1. Muller, D.E. and Preparata F.P. (1975). Bounds to the complexities of networks for sorting and for switching. J.ACM. Vol. 22, No.2. Paul, W.J. (1975). A 2.5 n lower bound on the combinational complexity of Boolean functions. 7.th ACM Symp. on Theory of Computing.
504
C.P.SCHNORR
Savage, J. (1972). Computational work and time on finite machines. J. ACM. Vol. 19. Schnorr, C.P. (1974). Zwei lineare untere Schranken fUr die Komplexitat Boolescher Funktionen. Computing 13. Schnorr, C.P. (1976 a). The combinational complexity of equivalence. Theoretical Computer Science 1 . Schnorr, C.P. (1976 b). The network complexity and the Turing machine complexity of finite functions. Acta Informatica, to appear. Schnorr, C.P. (1976 c) . Optimal algorithms for self-reducible problems. 3 rd. ColI. on Automata, Languages and Programming, S. Michaelson and R. Milner, Eds. University Press Edinburgh. Schnorr, C.P. (1976 d). Satisfiability is quasi-linear complete in NQL. Preprint Universitat Frankfurt. Stockmeyer, L.J. (1976). On the complexity of certain symmetric Boolean functions. Preprint IBM Yorktown Heights.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76 © North-Holland Publishing Company (1977)
COMPLEXITY OF DERIVATIONS FROM QUANTIFIER-FREE HORN FORMULAE, MECHANICAL INTRODUCTION OF EXPLICIT DEFINITIONS, AND REFINEMENT OF COMPLETENESS THEOREMS R. Statman Department of PhilosDphy The University of Michigan Ann Arbor, Michigan ACKNOWLEDGEMENT The author would like to thank Professor Kreisel for teaching him the theory of proofs (a continuing enterprise). He would also like to thank Professor S. Cook (Toronto) and G. E. Mints (Leningrad) for their useful comments and criticisms. INTRODUCTION In recent years a substantial amount of work has been done on the complexity of proofs by quantifier-free rules (e.g., Tseiten, 1970; Cook, 1971; Statman, 1974, Chapter 1; Kirkpatrick, 1974; Galil, 1975; Reckhow, 1976; and Statman, 1977). ~ruch
of this work has been done in connection with the question (*)
Is
v1V~
closed under complementation?
however, little progress has been made at providing an answer.
This paper
approaches the topic from a different point of view, namely, refinements of completeness theorems. Such refinements involve complexity measures ~ and provide bounds for ~-shortest proofs of formulae A as a function of ~A. Part of the job of analyzing a proposed refinement is determining how the given bounds depend on~. One principal result of this paper is that for some very general classes of rules, with minimal conditions on ~, bounds vary polynomially in
~
with respect to improvement on a fixed (universal) system and fixed measure.
The fixed measure (which for proofs is the number of inferences) has the property that, in familiar cases, a polynomial bound for completeness implies (*). The bulk of our results concern the complexity of derivations by rules associated with quantifier-free Horn formulae.
In Section 2 we apply some of our results
to answer a question of Kreisel's (in this special case).
In Section 3 we turn
to refinements of the completeness of the propositional calculus.
505
R.STATMAN
506
Section 1 1.1
We begin by considering a very general class of rules first studied in Statman Rules in this class have a schematic character which lends itself to
1977.
analysis (proposition 1) and this analysis suggests that the principal computationally significant feature of derivations by these rules is their number of inferences (remark 2). Within this class of rules one can identify certain (variants of) familiar systems of proof for the propositional calculus (example 2).
In Section 3 we shall
apply our analysis to the question of refinement of completeness theorems for the propositional calculus.
Proposed refinements are computationally significant
as many of them imply thatu1V~
is closed under complementation (remark S).
1.2
RULES ASSOCIATED WITH QUANTIFIER FREE-HORN
FO~IDLAE
Let 2be a first-order language with a countable number of individual constants
~,
a finite number of function constants
relation constants P. formula H =
M
The rule
~j
i,
and a finite number of
associated with the quantifier-free Horn
A+ A (A.prime) in 2 n+l J
l~i~ i
is
~l~6An + l
(for each substitution 6) (if n=O then ~ is the axiom schema A + If ~is a finite set of quantifiern l). free Horn formulae then the system ~ of rules associated with ~ is
1.3
Example Suppose = E2, then ~ = df{x=x, x=y + y-x , x=y l\ y=z + x=z, xn+ l = fXl",Xi,,,X n A xi=y + xn+ l = fxl···y· .. xn, PxI,,,xj,,,xm l\ X(Y + Pxl, .. y •. ,x m for each function symbol f and l~i~n = arity(f) and each relatio~ symbol P # = and
l~~m
= arity(p)}.
equality in 2.
iW.7(,
o
is the standard complete set of rules for
REFINEMENT OF COMPLETENESS THEOREMS
507
1.4 Example 2 Suppose
:J?
contains the binary function symbol::) (infixed), the (unary)
1'
then "f/J = {Tx ::) (y::>x) , ~ df T(x::)(y::)z) :> ((x::)y):> (x::)z)), Tcex::>j)::)D::)x, Tx::)y A Tx .... TyL .0/1~
predicate symbol T, and the individual constant
contains (a varlant of) the standard Frege system (in the sense of Cook) complete for the propositional language on :> (implication) and
1 (falsehood).
1.5 If iIlU{B) is a set of prime formulae in
Bl ... B
m
of B from
(a)
Bm
=
(b)
B.
E
( c)
there are R
1
B ij
iII
:J?
an ..o/lue-derivation D
is a sequence of prime formulae B. such that 1
B, and for each B either
iII
i
or
E.o/l.Jt,
H 8Aj and Bi
8 (as above) and il ... i
= 8An + l
If there is an .o/ldt-derivation of B from
iII
n
< i such that for Lcj cn
we write
$gz 1-
B.
~ 1.6 Example 3 Let.o/l
1.7
=.o/l.Y{, udt..' o
If
veYtt/X!iII . .
Let
.0/1 = .o/1.Jt'
then
1
B then
{xey}
I~ T((x::)y) :> ceY:Jx)::)D):>
PJl
1-
iII 1- B (completeness) .o/l.Jt
1.8 we define .0/1*, the extension of
&-
by explicit defini-
tions as follows: First, expand
:J?
by including
Second, further expand constants
~t
:J?
=-
by the iterated introduction of new
for closed terms t, and new variables
~t
for
~t,-free terms t, and extend.o/l U .Jt by the inclusion dt o of the defining equations ~t = t (for closed terms t) as axioms.
1.9 Example 4 .o/1~l contains (a variant of) the extended (in the sense of Cook)
508
R.STATMAN
Frege system for ~ and
1 putting
x :: Y df ((x::JY) ~ ((Y::JX )
::>1))
~
1
1.10
The notion of an~-derivation is defined like the notion of an
~-derivation. 1.11
Convention We write F(X) for the set of all F' s (variables, terms, etc.) occurring in (members of) X (for X an equation, derivation, set of equations, etc.). 1.12
We define px, the rank of the variable x, as follows:
if
xr.!l! then
ex = 0; if x = x t then
(a) (b)
ift = c ift )'
(c)
if t
then then
px = 1 px = l+py
and
= ft l· .. t n then px = 1 + max{pxt.
l~i~n}
1
1.13 A
substitution e is a map:
Term also denoted
I
e I.
dome =
V'ble --.-. Term.
{x :
e induces a map Term
ex;tx} and rnge = e" dome.
we write e = [exl/x l, •.. ,exn/xn]. If V ~ if XEV, (S/V)x = x if xiV. e+* is defined by (S+*)x = if xidom~. i f px = o.
e* is defined by e*y = Xey '
~x
if
8 is defined by 8X
1.14
If
0"c
~
dome
~
{xl.' .. xn} V'ble e/v is defined by (S/V)x = SX If
xEdom~.
~
(e+~)x
= t if x = xt'
8X
= ex
=x
= {x =c} &' = {xy =y} c 'y '
is a set of equations
1.15
e is a sOlution of ~ i f tl=t2E~~ et =et l 2. consequences of the completeness of 1ll.Ye,.:
o
1e*t f!Jl.Y( 1 o
The following are simple
REFINEMENT OF COMPLETENESS THEOREMS
509
1.16
$ (a) (b) (c)
is called simple if
each equation in $
has one of the forms x=c, x=y, and x=fx ... x n 1 ==} t 2 l=t 2 1, 2E:¥' if 1< is the least relation satisfying X=CE$ ~ C 1< X, X=YE$ and
x=t
X=t
and t / # t
x~Y ~Y 1< x closure of 1<
and x=fx ,XnE$ 1" < is well~founded,
then
==9
Xi 1< x , and < is the transitive
1.17 A simple $has a cannonica1 solution e = !!"'$ defined by recursion over <
X=CE [fli'
as follows: ex=fex
,ex
==> ex=c,
Y!7
and otherwise ex=x,
l" n, any solution of $
then lj! = lj!.
x~y ~ ex= By , x=fx l" ,XnE[fli' ==9 has the following properties: if lj! is
X=YE[fli' and
y'
and if t is asubterm of ~x for x£V'ble ($)
then t= e y for some YEV'ble($),
[fli'
1.18
If
is finite and ~ = [!ji"" we define sequences <
$
l$i$k$!V'ble(~)1 as follows: p(v)}
Siti>
>
<e for i ifx=y£f!j;;, X1y and zdudx,y}: vdx,y}=}p(u)
then e, = [z/x,z/y] and $'= e'.' 1 i+l 1
i!I"; 1
i f x=t
l,
x=t2~ f¥1 and tl#~t2
then
and t begin with different symbols stop, and if t ,x t f Yl" 'Yn l=fx 1" n' 2= 1 2 and zidudxi'Yi} :vdxi'Yi}~PU ';'pv} then e i = [z/xl,z/Yl""z/xn,zn/Yn1
if t and
,$1 =
1+
of $
ev 1
f!l'. 1.
Suppose max I ox : xEV'ble(Bl')} = 0 then i f S is a solution
*'
then S· A is a solution of $*and i f fJ. is a solution of;;: then S is a solu-
tion of
$.
If e is a solution of
has a solution ~~ is simple.
~
then e· e,=e and e is a solution of
If&{ is
1
31. 8\
simple we write ~* = e$k·ek_l· ...
• e and have : i f lj! is a solution of $* then 1jJ = lj!. e[!ji"'" and i f t is a sub term l of e~x for xEV'ble(~) then t = S[!ji'*Y for some YEV'ble($*), In addition, if max Iox : x£V'ble($)} = 0 then max l ox : X£V'ble(s,:y*&ifl')} = 0; under this condi tion we write ~
~*.
1. 19
A derivation schema (a)
L
(b)
L
i i
= px .• ,x l
n
= Px l,. ,xn
~
= L " .L is a sequence of labels l m
~i
where
BY ASSUMPTION, or
FROM iI" .i e BY 1)1
such that: (1)
throughout S the variables Xi are distinct;
(2)
in case (b)
il ... ie
begins with the same relation symbol as L ' and i j
510
R.STATMAN
(3)
the conclusion of 1)1 begins with P
Each ~-derivation D corresponds to at least one schema ~ (ambiguously) in the obvious way.
The converse is false.
1. 20
Example 5 S
TX l BY R.rx::J(Y::Jx) TX
2
BY ASSUMPTION
FROM 1.2 BY RTx:>y
1\
Tx
-*
Ty
TX 4 FROM 3.1 BY ~x:>y
1\
Tx
-*
Ty
Tx 3
is not SD for any f!ll;Y( -derivation D. 1
Given S as above we define a finite set ~
~L. of equations as follows: ]. For each Li as in case (b) select a permutation substitution ~i such that dom8 i V'ble(H) •. V'bl~(S) n V'ble(rng8 i) = ¢. and j;ii rngB j n rng8 i = ¢. 1 Let H = /)(\ P. t~ .•. t -* Pt ... t , then ~ is the smallest set satisfying l i",v:$.e J ni n i xl = B.tl ... ·,x = B.t E8L • and i f L. P'Yl·"y L then Yl 1 n 1 n i lj J nj =
~
l~]..5.m
==>
8. t i E 1 n
~L.• 1
1. 21
Given Li as in case (b) we write
~i
Proposition 1 S = SD ~
iJ
BS- for some solution 8 of ~S
1.22
Remark For a similar result for systems including logical operations the reader should consult Statman forthcoming. 1. 23
Remark 2 It is easily seen that the proposition is effective in the following sense: (a)
there is a deterministic polynomial-time algorithm ~l such that ~S
(b)
there is a deterministic polynomial-time algorithm sets of equations
§', ~2 8Ji' =
~2
=
~lS
such that, for finite
0 <==~ 8Ji'has a solution.
In addition, the proposition is easily generalized to ~proofs of B (f!ll~
REFINEMENT OF COMPLETENESS THEOREMS derivations of B from
~~~).
Let S be as above, and L;
require that (4) (5)
511
= Px l .. ,x n'
then we
no L of the form (a) occur, and i B begins with P. ..(jlB
~ U {x.=t. : l~i~n}, then S = SD for
Let B = Ptl, .. t n be closed; define (PS
~A'- -proof
D an
of B -F==9' D
~ es'
1
1
B
for e a solution of ~S'
We conclude the
Corollary 1 {(S,B) : S is the schema of an ~-proof of B} E ~ {(n,B) : there is an ~-proof of B with $ n inferences} ~u1V~ (Here, terms may be represented in tree or linear form, and natural numbers n should be represented in unary notation.)
Section 2 2.1
MECHANICAL INTRODUCTION OF EXPLICIT DEFINITIONS The results of Section 1 suggest that we could dispense with derivations in favor of schemata with little computational loss.
However, schemata are
hardly suitable for human understanding in contrast to explicit definitions. We shall provide an alternative by formalizing our analysis in&r.;re' 2.2 At df number of oj:currences of symbols in t . .£.!. If !Term(t) I· AD If number of occurrences of formulae in D. We write ~ t ~ for there is
all ~-deri vation D of B from ~ wi th AD
$
nand max{At : tETerm(D)}
$
m.
Observe that Fact 1 Fact 2 1$*1 s (2·max{at: tdermC$)}
+
1)·1$1
Fact 3
- - I t Ernge U ~t I
$
2· C2'max{at : tETermC$n +
(for $s.t. max{px : xEV'bleC$)}
1}·1 $1
= O}
Fact 4 ---There. is a constant Pl depending only on.;re such that for
.o/l.;re -derivat i ons
512
R.STATMAN t~Term(~SD)}: PI and
D we have max{at
I ~sDI
~ PI'AU
Fact 5 There is a constant P2 depending only on ~ such that
~ ~ P2 at ,At x
= t
0
t f!ll~O
t
2.3 Lemma
There is a quadratic polynomial q (x) such that i f j1ji'* F x=y then
rJ7:'* q( 1$* I) ,P3 ~ ~ x = y
OJ
for P3 a constant depending only on ~o
(This is
o
essentially proved in Statman 1977.) Corollary 2 There is a constant P depending only on ~ such that i f (et l) 4 then ~e*t1 U ~e*t2 U t~~ge ~t ~(e*tl) = (e*t 2) where n "
(I e"e*t
l
U
~e*t2
U
t~gge gt i2
(et
2)
o + ae*t l + ae\2) 'P4
We obtain the following: Proposition 2 There is a constant Ps depending only on ~ and ~ such that i f D is an ~-derivation, S=SD and e=eg
s
for each E~#'S'
U
t~Term«fS)
then
~e*t U
U
t crngs
2.S Theorem There is a constant p depending only on ~and ~such that nm p·n 3p ok • $1- ; B=>$loR!~n' B where k" maxI xt : tETerm($U{B})}($U{B} closed)
WWe
~:7f:
Proof Suppose D is an ~derivation of B from DE by an
f!ll~-derivation
o
of e*E from
U
tETerm(~s)
$, S=SD' and
~6*t
U
U
t crnge
e=e8 s' ~t with
Let
REFINEMENT OF COMPLETENESS THEOREMS
513
\OE ~ ps'\02 and max{\t : tETerm(OE)} ~ Ps Construct S+ from S- as follows: Li.
J
For each Li in S, i f Li " Pxl,· 'Xn FROM i l ... i BY RH, e AA. j .. t j -;. Ptl .. ,t j j L+. P ·yl,,·y for 15.j5.e , and 11 "/,,,P.tl, n J n j lj l:i.j5.e J nj
replace Li in S- by P18*yI1". 8*8. t l 1.
n
l
PI8*8.t l ... e*e.t l 1 l 1 "i
Pe8*eit~".8*8it~e P8*8 it l·· .e*8 itn
P8*X l·· .e*x n and if Li = Pxk .. ,x n BY ASSUMPTION then replace Li in S- by pe*x l.· .8*xn . Let 0#
df
&ldt'u~-derivation with
dt'
\D# ~ p'\D 3 and max{\t : tETerm(O#)} 5. p for p a constant depending only on and
g;.
Now suppose that 0 is closed and tjJ is a solution of If A is an assumption of '0# then either A€
U
8s
such that 0
&'e*t U
U 8t
or
t€Term( 8 ) t crngs S If A is the conclusion of 0# then tjJ6A = B. Let V = (x : xEV'ble(A)
~6AE~.
for A an assumption of 0# with ~6AE~ or A the conclusion of D#}. substitution r by recursion Basis; px=O.
~ver
If x€V'ble(t)
Induction step; px
0
.0
We have max{\t
U
>
O.
Oefine a
p as follows: for Xt€V then rx=
~x.
Otherwise rx=c.
and t l is a subterm of t 2 for XtzEV tl then rx=~tlo Otherwise, if X=X then rx=d , if X=X then fx=d ry , and if c c y x=x then fx=d ft1o. t ffx t .x n 1 t EIE:
tjJS- .
JjJ
t €:Term ( (!IS)
a:'e*t
tErngf}
If X=X
n ~
max{\t
t€:Term(~ lJ B) ,
then fE is an 3l~o axiom and i f EE
Moreover, i f
U
tErnge
&'t then
R.STATMAN
514 fE is either an
.qRdt or o
an
.qR*.Yf-derivation of B from p-raax i xt
/!ll". Yfaxiom .
Put D* = rD#, then D* is an 0 ~ with AD* S P'AD3 and max{At : tETerm(D*)}
5
: tETerm($ U {B})}.
To complete the proof it suffices to observe that ~I--: n,m B ==?
&Gt
B for some e. 2.6
Remark 3 It is clear that there is a deterministic polynomial-time algorithm such that D*
~3
= ~3D.
Section 3 3.1
Below we shall analyze the notion of propositional rule occurring in the logical (proof-theoretic) Ii terature.
If.qR is a system of such rules, the
completeness of .qR says 1= A ="i> ~ A.
Inspection of the usual completeness
f!Jl
proofs shows that for a number of measures c~A
deri vations 1= A ==1' f.--=an
~
of the complexity of formulae and
m'\l .qR-proof D of A with ~D
5 n,
tn
A df there is .qR,)J This naturally leads to the question:
A, for a suitable constant c , where
Is there a polynomial p(x) such that FA =9 i-p(\lA)A? &l,~ That is, can we do substantially better in ~ than formalizing the truth table evaluation of A? We shall reduce a positive answer to this question for certain .qR (truthpreserving) and ~ (admissible) to a positive answer for f!Jl~ and A (0 in certain cases, e.g., when ::J,
1 belong
1
to the language of~ (proposition 3).
3.2 SYSTEMS OF PROOF FOR THE PROPOSITIONAL CALCULUS
Let ~be a propositional language with a finite number of constants and connective symbols.
A sequent is an object ~ where XU Y is a finite set
of formulae and Y ~ ¢.
The interpretation of X j-. Y isPlx::JftIr.
configuration -.!1-1:....!1 ..... Xn~
X 1+ Y
A rule is a
REFINEMENT OF COMPLETENESS THEOREMS (if n=O then the rule is the axiom schema X
r Y).
An
515
instance of such a rule
is a configuration
~l U B"XI~1 U VI ..... ~ ~ ~ ~ _ U U
B"X
r
B"Y U V
for U., V. finite sets of formulae with U = I
1
substitution.
-l-n
U. and V 1
U
l~i~n Vi' and
e a
A system of rules is a finite set of rules.
3.3
Example 6 Let ~ consist of ~ and <j>
1,
then ~ =
r {p ~ q} r r {q} <j>
{p}
{p}
r {p},
.iE.;Jl}
r
<j>
N
is the standard natural deduction system for ~ and
3.4 If
f!ll
is a system of rules an
is a sequence of sequents Xi Cal Cbl
Xn
r Yi
f!ll-proof D
Xl
r W} . {p}
1-
r
Y I' . ,X
n
r Yn of ~
such that
~ Yn = X ~ Y, and for each i
there are REf!ll and i ... i < i such that l m
is an instance of R. An f!ll-deri vation D Cas above) of RI =
.....
~i~i
~~-
X ~ Y
satisfies Cal and for each i (b) (cl
or
~ Y! J
3.5 D is tree-like if each Xi exactly one inference in D.
~ Y
' for itn, is exactly one premiss of i D is faithful if each X! ~ Y! occurs in it. R' J
J
is truth-preserving if whenever a truth value assignment satisfies the interpre-
516
R.STATMAN
tations of its premisses it also satisfies the interpretation of its conclusion. 3.6
Example 7 If
R
is a truth-preserving rule for ~,
1 then
there is a faithful, tree-
like N-derivation of R (completeness of N) . 3.7
df
AD df number of occurrences of symbols in A, and aA IF'm'la(A) I. AD df number of occurrences of sequents in 0, and aD = ISequent (D) I. Let ~ be a natural number valued measure of the complexity of 2-formulae and .o/l-derivations, u is called admissible if there is a polynomial p(x) such that aA ~A
:s p(AA) , and aD
IE - .o/l,~
\'Ie write X
:s
Y if
:s
p(~A),
(the polynomial permits machine coding of A's and D's).
p(~D)
there is an
r
.o/l-proof 0 of X
Y
with )lD
S
n.
3.8
\'Ie say max
A
.o/l
is complete of ~ A =:>In ~ ~ n
{A}.
'f!/l, ~
min r~ ~ m {A} and )lA :s m
L 'f!/l,~
3.9
Let ~ be a system of truth preserving rules for ~ and let admissible measure for 2
and $.
For A in 2
let ,A
placing the constants and connective symbols of 2 some fixed choice) in o,
II
be an
result from A by r e-
b;-their definitions (for
1
Fact 6 There is a linear t(x) such that a,A :s t(aA) 3.10
Lemma 2 ---There is a polynomial p (x) such that X I n
b
y ==? ,"X Lp(n) ,"Y (this follows rN,A
i7f"ll
easily from the existence of faithful tree-like N-derivations of truth-preserving R). 3.11
Lemma 3 There is a polynomial p(x) such that
~ I-~,A {A}
-=?
~~~TA.
REFINEMENT OF COMPLETENESS THEOREMS
517
3.12
Let
~ be obtained from ~ by replacing = by
= of
the 4 t h example.
f¥'= {t l ... t we write (ambiguously)t1c'\f¥':J t for tl:J ( ... (tn:Jt) ... ). n} t* = M/l7 :J x - df'~k7t t so At* = l2'ot .
If Let
Lemma 4 There is a linear .L(x) such that
~~_n, Tt* _~ l_n+£.(ot)Tt.
ko/l~ 1
"rK 1 3.13
We obtain the Proposition 3 If
rL A
{§l is complete and
IP(~(P(OA)))
=9 -
<7V ~
{§l~l
T
is onto then there is a polynomial p(x) such that
TTA
3.14
Remark 4 Lemma 2 holds for the familiar resolution based systems (including Tseitin's extended resolution) of automatic theorem-proving (under the usual coding).
The
proposition is not known to hold for Frege systems with substitution (s-Frege systems in Reckhow 1976) (although a substitution rule could be added to
~,
in the presence of such a rule the remarks at the end of Section 1 are not known to be true).
Not all reasonable measures
p
are admissible.
an interesting non-admissible measure was studied in Statman 1974. 3.15
Remark 5 It follows easily from the results of Section 1 that {(S ,A)
S is the schema of an ,cn~ -proof of TTA} c fffJ
{ (n,A)
1
for ~-formulae A represented in tree or linear form, and natural numbers n represented in unary notation.
1
For example,
518
R.STATMAN
REFERENCES Aho, Hopcroft, and Ullman. Cook.
The Design and Analysis of Computer Algorithms.
"The Complexity of Theorem-proving Procedures" 3r d A.C.M.S. T. O.C.
1974.
1971.
"Feasibly Constructive Proofs and the Propositional Calculus," 7t h A.C.M.S.T.O.C. 1975. Galil.
Dissertation.
Kirkpatrick.
Cornell Univ. Dept. of Compo Sci.
Dissertation.
Kreisel, Mi nts and Simpson. mathematics: Prawitz.
1975.
Univ. of Toronto, Dept. of Compo Sci.
1974.
"The Use of Abstract Languages in Elementary Meta-
Some Pedagogic Examples," Springer Lecture Notes 453.
Natural Deduction.
1975.
1965.
Reckhow.
Dissertation.
Univ. of Toronto, Dept. of Compo Sci.
Statman.
Dissertation.
Stanford Univ., Dept. of Philo.
1976.
1974.
"Herb rand 's theorem and Gentzen' s notion of direct proof," Handbook of Mathematical Logic.
1977.
"Bounds for Proof-search and Speed-up in the Predicate Calculus," forthcoming. "Speed-up by Theories with Infinite Models," in preparation. Tseitin.
"On the Complexity of Derivations in the Propositional Calculus,"
Seminars in Math. V. A. Steklov Bath. Institute 8.
1970.
R. Gandy, M. Hyland (Eds.l, LOGIC COLLOQUIUM 76
© North-Holland Publishing' Company (1977)
PRECIPITOUS IDEALS Thomas Jech* Department of Mathematics Pennsylvania State University University Park, Pennsylvania
This is an edited version of the lecture delivered at the European meeting of the Association for Symbolic Logic in Oxford in July 1976.
The paper gives a
definition of precipitous ideals and presents an example of application of generic ultrapowers. The notion of precipitous ideal is due to the author and Karel Prikry.
A
forthcoming joint paper [3] will deal with precipitous ideals and their applications in detail (see also the announcement [2]). Theorem 1 gives a typical application of precipitous ideals and generic ultrapowers.
The theorem is a special case of a theorem of Prikry and the present
author.
Theorem 2 will also appear in [3]; the proof is a modification of a method
of Kunen.
Theorem 3 is an unpublished theorem of W. Mitchell and Theorem 3* is
an observation based on Mitchell's proof and a result of Magidor in [1]. 1.
Precipitous ideals as a generalization of countably complete ultrafilters. Let
K be a regular uncountable cardinal.
collection (i) (H) (Hi)
I
of subsets of
X~ Y I
{a}
if
I
X a
implies for all I
K
A K-complete ideal over
K
is a
such that
X
I
a < K
for all
and
a < Y
y < K,
then
U o
X
a
E 1.
Set theorists have been interested for years in K-complete ideals that are prime, i.e. for every
X ~ K,
either
X E I
or
K-X E I.
Investigation of these
ideals gave rise to the thriving theory of large cardinals. The most important feature of prime K-complete ideals is that they can be used to form a well founded ultrapower.
For instance, the very first application of
measurable cardinals was the well known ultrapower argument of Scott in 1961 showing that if measurable cardinals exist then
*Supported
by NSF.
521
V I L.
522
THOMAS JECH Ultrapowers have been used, among others, to obtain results on cardinal
exponentiation.
A typical application is the following theorem:
measurable (i.e. if K carries a K-complete prime ideal) and if for all a < K, then 2 K = K+. Since measurable cardinals are inaccessible (and very large), this method does not seem to be of much use for question about "small" cardinals like
~,
~
etc.
While small cardinals do not carry K-complete prime ideals, they do carry K-complete ideals that are not prime. (1)
I
{X ~ K
[x]
(2)
I
{X ~ K
X
<
Let me mention two canonical examples:
K
o
nc
for some closed unbounded
Let us ask the following question: Can the property "I
C
C
K}.
is prime" be weak-
ened in some way so that a modification of the method of ultrapowers would still work and that the weaker property would not necessarily imply that cardinal?
Given a K-complete ideal ~
S
zero, and sets
lover
I
X
K,
let us call the sets in
sets of positive measure.
of positive measure is a maximal collection
each
K is a large
The notion of precipitous ideal seems to give a satisfactory answer.
X E W has positive measure and that
X
S
~
W,
if for every
W. An I-partition W' is a refinement of X E W' there is YEW such that X C Y.
Definition (Jech-Prikry). every set
S
A K-complete ideal
I
n
of I-partitions of
S
:>
n
K is precipitous if for
E W
n
;-..
there exists a sequence
-
X
over
X,
W,
of positive measure and every sequence ::: W
such that
such that
has measure zero whenever
Yare distinct elements of
W'
sets of
An I-partition of a set
W of subsets of
nY
I
for each
I
:>
n-
and such that
n,
It should be clear that if
X
n Xn
n=O
is nonempty.
is prime then it is precipitous.
In §4
I
will discuss consistency of existence of precipitous ideals (over cardinals that are not measurable).
As for the two canonical examples of K-complete ideals given
above, the Lemma 1.2 below shows that the ideal (1) is not precipitous, while a recent consistency result of Magidor shows that the closed unbounded ideal (2) can be precipitous.
PRECIPTIOUS IDEALS
523
The motivation for the definition of precipitousness will become clear in §2. For the moment, let me give an equivalent description.
s
of a set
X E W}
F = {f x
:
X EW F} W F < G meanS that a)
numbers. then and
of positive measure.
A
of functions such that each If
X:::. x, I
X
has domain
G = {gx : X E
is a refinement of
F fx(a) < gy(a)
then
Lemma 1.1.
and
s
functional on f
a E X.
for all
Let
W be an I-partition
is a collection F = {f X: X and its values are ordinal
s,
MG} are functionals on W and b) G,
is precipitous if and only if for every set
S
of positive measure
there is no descending sequence
of functionals on
S.
Proof.
(Outline) If F n < w, is a descending sequence of functionals on S n, then the underlying I-partitions of S provide a counterexample to precipitousness
of
1.
Let
W
o~
WI
~
...
be I-partitions of
which witness that
S
precipitous; without loss of generality we may assume that if and
X, Y then
founded 2-tree. Pz
X ~ Y.
For every
For each
z E S
is the rank function for
a descending sequence. Lemma 1.2. Proof.
The ideal
For each
function from
z E S,
T z'
X:::'K
Ixi
{X
I
X onto
< K}
X EW n, Y E Wn +l : z E X} is a wellpz(X) X EW n}
where form
is not precipitous.
of size
K, let f be the unique order-preserving X Note that for every X:::'K of size K there exists
K.
is a successor ordinal};
fy(a) < fX(a)
We construct I-partitions
W n' Y
a maximal collection of
.for all
then
n < w,
Namely, let
fy(a) + 1
fX(a)
for all
a E Y.
as follows:
K of size
C
a EY :
K
/Yl n Y2/
such that
are distinct, and that for each
Yl'Y2 E Wn+1 such that Y
·Now the functionals A K-complete ideal
UW n= o n
is not
0
K such that
X EW n
{X E
X ET let fx(z) z' Now the functionals F = {f n X
y c X of size
ever
z
=
and each
y = {a : fX(a)
Wn+l
T
I
F n I
C
X and
= {f X over
fy(a) < fx(n)
: X EW n}
form a descending sequence.
K is called K+-saturated if
Iwi
~ K for every
524
THOMAS JECH
I-partition Lemma 1.3.
Clearly, if
W. If
I
is prime then it is K+-saturated.
I
is K+-saturated then
W
WI
pairwise disjoint refinement
W~.
Proof.
Given partitions
is nonempty. 2.
o~
is precipitous. replace first each
~
Then use the fact that
W
by a suitable n nOO U {X : X E W } n
0
Generic ultrapowers. Let
M be a transitive model of ZFC, and let
collection (i)
U of subsets of every
X E U
is in
M (but
U itself need not be in
if
X
U and
y, X
is such that
(iii)
if
X
U and
Y E U
then
for every
X C K such that
*
=*
X
nY
Y C K
M.
A
and
M)
Y E M,
then
Y E U
E U
X E M,
either
f E M with domain
Let us consider the functions
The relation
K be a cardinal in
K is an M-ultrafilter if
(ii)
(iv)
to
I
K,
X E U or
K-X E U.
and let
f
=*
g
iff
{a
f(a)
g(a)}
f
E* g
iff
{a
f(a)
g(a)} E U.
U
is an equivalence relation, and moreover a congruence with respect
Thus the equ!vlaence classes modulo
=*
form a model, the ultrapower of
Mmod U:
(The ultrapower is not necessarily well founded.)
It is easy to verify that the
Fundamental Theorem on Ultraproducts holds in this context too:
iff Consequently, the mapping
jU: M -+Ult,
where
the constant function with value is an elementary embedding of
M in UltU(M).
x
PRECIPTIOUS IDEALS Now let over
K.
I E M be (in M) an ideal over
We say that
a)
XEI
b)
if
525
K and let
G be an M-ultrafilter
G is I-generic over M i f
implies
W E M is
X
t
G
(in M)
an I-partition of
A routine verification shows that
Wn G
K then
G is I-generic just in case
generic set of conditions in the following notion of forcing
X is stronger than
tions are sets of positive measure and If
I 0.
G is an I-generic ultrafilter, we call
UltG(M)
G is a
the forcing condi-
Y if
XC Y.
a generic ultrapower.
We shall now piece all the preliminaries together and obtain the following characterization of precipitous ideals:
Let us regard the universe as th0 ground
model and assume that generic ultrafilters exist (this assumption is harmless since the statements on generic extensions can be reformulated in terms of Booleanvalued models) . Lemma Z.l.
Let
I
be a K-complete ideal over
precipitous if and only if for any I-generic
K G,
(in the universe
M).
the generic ultrapower
is well founded. yroof.
The reader familiar with the method of forcing should immediately see that
the lemma follows from Lemma 1.1 : Functionals on valued names which the condition
S
S
correspond to the Boolean
forces to be ordinal numbers.
Thus a
descending sequence of functionals corresponds to a descending sequence of ordinals in the Boolean valued model.
3.
0
An application of generic ultrapowers.
Theorem 1 (Jech-Prikry). for all
IT < K,
then
If
I
is a K+-saturated ideal over
Thus the same result that, as I mentioned in
Proof.
Let
K
1, holds for measurable cardi-
that carry a K+-saturate~ideal.
~2-saturated
ideal over
M be the universe, and let
G be an I-generic M-ultrafilter over
~l'
I
then
2
a
= ~l
UltG(M)
In
partic~ar,
implies
be a K+-saturated ideal over
2
1
K.
= ~2'
Let
K.
Until further notice, we work in the generic extension ~
IT +
ZIT
ZK = K+.
nals, is also true for all if there exists an
K and if
be the corresponding generic ultrapower.
Since
M[G]. I
Let
is precipitous,
526
THOMAS JECH
N is
a well-founded model; let us identify
class.
Let
j = jG
Since
I
XCI E G for all
CI < y,
j(CI) = a
It follows that a E j(X)
for all
j ; M~
N.
a < K.
j
~ pN(K)
rM(K)
M,
E G.
za
E M is such that
G shows that
y < K
And M-K-completeness of
and
G implies
Also, a standard argument (using the
j(K) > K.
and hence
Assuming that in embedding
{Xa : a < y}
aQ y Xa
then
diagonal function) shows that
a E X if
with the isomorphic transitive
is K-complete, an easy argument using genericity of
G is M-K-complete; that is, i f easily that
N
be the elementary embedding
if
X = j(X)
= a+
X~ K n KEN.
for all
is in
a <
K,
M then for every
a < K
we apply the elementary
and obtain: + M 1= Va < K Z(l = a + Nr= Va < j(K) Z(l = a + N r= ZK = K
+
M[G]
K •
In otherwords, there is in M[G] a one-to-one mapping of Until this moment, we argued in model
M[G]
the K+-chain condition (because (K+)M
M[G].
is a generic extension of
is a cardinal in
cardinality of
P(K)
in
M[G] M,
I
pM(K)
into
(K+)M[GJ.
Let us now step back into
M.
The
M via a notion of forcing which satisfie
is K+-saturated).
and so
Hence, as is well known,
(K+)M[G] = (K+)M.
If we denote
we have proved so far that in
M[G],
~
the
there is a
one-to-one mapping of A into (K+)M. Now we invoke the K+-chain condition once more: I~IM[G] S (K+)M is only possible if ~ (K+)M. Thus we have proved that in 4.
M.
ZK
= K+.
o
Equiconsistency of precipitous ideals with measurable cardinals:
Theorem Z (Jeck-Prikry).
If there is a precipitous ideal then there exists a
transitive model with a measurable cardinal. I will give a sketch of the proof. Kunen. and
cf
Let
The proof follows a similar theorem of
K be the class of all strong limit cardinals
v > K.
v
such that
be some elements of
v > ZK
K such that
PRECIPTIOUS IDEALS IYn
n KI = Yn for all n.
Lemma 4.1.
Let
=
A
{Y : n n
527
= a,l, ... }
Assume that there is a precipitous ideal
and let
lover
\
K.
sup A. Then there is
an L[A]-ultrafilter
W which is L[A]-K-complete, normal, iterahle, and the
iterated ultrapower
Ult~a)(L[A]) is well founded for all
[Iterable means Proof.
Let
if
(X a
: a < K) E L[A],
S of positive measure and an ordinal Y such that
s ~ ~ d
Xa E W} E L[A].]
{a
G be the canonical Boolean valued name for the I-generic ultrafilter.
There is a set
(where
then
a.
represents
is the diagonal function).
in
UltG(V)
Let
x nSf I}
{x E L[A]
U
Y
U has all the properties claimed in the Lemma but possibly normality. the normalization of Using Lemma 4.1.
then
\
U.
We apply Kunen's technique and show that if
Theorem 3 (Mitchell).
If
M[G]
K
L[F].
0
is a measurable cardinal in
in which
K
= ~l
and
I will not give a proof of the theorem. is a normal measure on
K
(in
D is precipitous in
M,
then there is a
carries a precipitous ideal.
K
Let me only say that Mitchell's
proof uses the standard Levy collapse that makes dual of
W be
0
is a measurable cardinal in
generic extension
Let
K = ~l
M), then the ideal M[G].
Thus existence of a precipitous ideal over
wI
I
and shows that if
generated in
M[G]
D
by the
is equiconsistent with
existence of measurable cardinals. A recent result of Magidor gives a model in which the closed unbounded ideal (see (2) in §l) is precipitous.
Magidor's model is a generic extension of a model
which has a supercompact cardinal.
It is reasonable to conjecture that the
assumption that the closed unbounded ideal is precipitous is stronger than measurability (consistency-wise); e.g. one might expect that the assumption implies
528
JECH
THO~1AS
existence of inner models with many measurable cardinals.
There is however nothing
known in this direction beyond Theorem 2. Appendix It turns out that if we want a model with a precipitous ideal, but not necessarily over
then Mitchell's proof can be somewhat simplified.
~l'
More-
over, we can obtain an ideal with a property stronger than precipitousness. Let
I
be a K-complete ideal over
of positive measure. measure there is
Y E D
K,
and let
D be a collection of sets
D is dense if for every set
We say that such that
If
Y c X.
then
A. < K
D
X of positive is
~O-closed
if
whenever ::l X ::l n
is a descending sequence of elements of Xc
-
11 n=O
then there is
D
such that
XED
x.
n
Theorem 3*.
If
M,
is a measurable cardinal in
K
extension in which
K = ~
and
K
then there is a generic
carries a K-complete ideal which has an
~
closed dense set. Note that if
I
has an
~-closed
dense set, then not only
I
is precipitous
but for every sequence of I-partitions
there exists a sequence ::l X ::l
n-
such that Proof.
X
for all
~-closed
I.
and such that
nOO X n
has positive measure.
set.]
wI
and
In fact, that property follows from the existence of a dense Let K,
P
be the standard notion of forcing that collapses cardinals
makes
ordinals (the Levy collapse) is a countable subset of dom(p).
n,
[A similar argument is used in [1] to derive a game theoretical property
of the ideal between
W n
n
A condition
p
K
~
and does not add new countable sets of
a forcing condition is a function
wI x K and such that is stronger than
q
if
p(a,~)
p::l q.
< ~
p
for all
whose domain (a,~)
E
PRECIPTIOUS IDEALS The notion of forcing dition.
WI
P
P
G be a filter on
P,
a
is countably closed and satisfies the K-chain cona {p : dom(p) ~ wI x a} and p = {p : dom(p) ~
x (K-a)}.
Let measure on X
a,
let
For every
529
is in
In M[G),
K. I
just in case
M.
generic over
let
I
for some
X:J Y
Y E
M
of measure
A simple argument using the K-chain condition of K-complete ideal over
K.
For each
f = <Pa : a < K> E F
Let, for each
[[a E If]
U: a set
P
O.
shows that
I
is a
a < K}
let
X
n Tf,
where
f EF
X E U.
and
D is a dense NO-closed collection of sets of positive
We will show that measure.
a normal
has a dense N a-closed set, let F be a such that P E p for all a. a
I
{Pa
D be the collection of all sets
Let
a,
To show that
M) the family of all sequences
(in
M,
U be, in
Let
be the ideal generated by the dual of
f E F,
T
be the canonical name for
f
T
i.e. for each
f;
= P
First I claim that each set in
D
has positive measure.
It suffices to show
that each T intersects each X E U. Let f = <Pa : a < K> and let X E U. f us show that for each condition P there is a stronger condition q and some a E X then
such that pUPa
q
I~ a E If'
Next we show that to find let
X E U
PO
X {a p qa U qa
and
D
is dense.
f E F
G be such that p
does not force
a p
By normality of
U
f
!'=.
was arbitrary
X E U
and a sequence
Let
A
a E X
such that
p EPa'
be a set of positive measure; we want
T n X C A. Let A be a name for A and f has positive measure. Let P'::: PO' The set
A}
.::: Po
and
f =
But this means that
It remains to show that
has measure 1.
such that
there is a set
P
is any
a E If'
such that PO I~
be an extension of
Since a E A.
a E X
But if
is a condition and forces
Let
Yp Po
qa EPa' U G,
a < K> E F
X
n Tf
D
~
such that
For each a,
a E X let p' qa U qa I~ a EA.
qa E p qa
=
and
q~
for all
it follows that there is such that for each
a,~ E Yp'
G a set
q
q
a E X,
U qO I~
A.
is NO-closed.
Let
X ED n
and
f
n
E F,
n <
00,
be such that
Since Let
P X
is NO-closed, the sequences nQo X n;
thus
XED
and
<Xn
n <
w)
and
(fn
n <
00)
are in
M.
THOMAS JECH
530
For each
n,
let
There is a condition
l' E G
such that for each
n X.2.!r
nX
n+l
n, .
we have l' If- Pn+l E G ~ pn E Q. Thus if a E a a n+l is a condition and we have l' U Pan+l .2 l' EPa' then l' U Pa y be such that P E P and let Z = x-v , For each a E Z we Hence if
a E X,
y
P
for each
U Pa0 -c P U Pa1 -c ..• -c a E Z,
It is clear that
p
U Pan c
and arbitrary T f
nZ~
T f
n
X is such that H Pna enceI et have
otherwise.
n
X
n
for each
Let n.
0
References [1]
F. Galvin, T. Jech, M. Magidor - An ideal game, to appear.
[2]
T. Jech, K. Prikry - On ideals of sets and the power set operation, M1S Bulletin 82(1976),593-596.
[3]
T. Jech, K. Prikry - in preparation.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
A .R.D .MATHIAS
PETERHOUSE. CAMBRIDGE
Part One reviews the development of ZermeloFraenkel set theory and is written for the general reader; Part Two continues with a list, largely collected at the meeting, of open problems in set theory, and is for the specialist; Part Three summarises the philosophical speculations with which the author began his Oxford address. PART ONE
The universe, V, is the collection {xlx=x} of all sets. To give some structure to it, we arrange it in a hierarchy by defining, by induction through ths class On of all ordinals,
V
o = 0, the empty set
V~+l = P(V~) the power set {xl x c V~} of V~
for limit A,
VA =, lJ{v~1 ~
<
A};
and as a consequence of the axiom of foundation, we have V
= U{V~1
~
€
On}.
A natural question to ask, once its use has been observed in mathematics, is this: Is AC, the axiom of choice, true?
531
A.R.D. MATHIAS
532
G~del in 1938 gave a partial answer: he defined a collection L of sets by another recursion on the ordinals:
and
L o
o
Lr,:+l
Def (Lr,:)
r·A
l){Lr,:!r,: < A}
L
'" U {Lr,: Ir,:
for limit A,
" On},
where Def(Lr,:) is the set not of all subsets of Lr,:' as in the recursive definition of the Vr,:'s, but of all definable subsets of Lr,:' definable meaning of the form {x E Lr,:
I
q, (x,y)
y
is true in Lr,:}
for some finite sequence of elements of Lr,: and some formula q, of the language of set theory. We write 'Lr,: q,' to mean 'q, is true in
F
Lr,:' • The class L contains all ordinals, and is such that XEyEL ~ xEL; further all axioms of Zermelo-Fraenkel set theory are true when interpreted in L. Such classes are called inner models and L is the smallest such. The hypothesis that V '" L is known as the axiom of constructibility; G~del showed that if ZF is consistent, that is, is free of contradiction, so is ZF+V = L, and that both the axiom of choice and the generalised continuum hypothesis (GCH) follow from it. Subsequent investigations, summarised in Mathias [1], have shown the axiom of constructibility decides many set-theoretical hypotheses known to be undecided by ZF+AC alone, and applications of it have been made in other branches of mathematics. For example, each of the following statements is a consequence of V = L: every Whitehead group is free (Shelah [2J,[3]) every locally compact normal Moore space is metrisable (F leiSsn.er [4]) there is a counter-example to Souslin's hypothesis (Jensen [5J) for each n there is a space of ( covering ) dimension n the one-point compactification of which has dimension 0 (Ostaszewski [6J) the gap-n conjecture in model theory (Jensen [7]). We may therefore ask instead: Is V '" L?
Sad to say, Cohen in 1963 showed that if ZF is consistent so is ZF + GCH + V "I L; moreover so are ZF + not AC and ZF + AC + not GeH. His method as later reformulated by set theorists, consists
THE REAL LINE AND THE UNIVERSE of an expansion of the universe. algebraE
=
533
Start with a complete Boolean
JB
by
induction on the ordinals:
~
0
13 V1;+l VB
{f1::3u(u
\.J {V~ I U{{I
A
T3
v
cJB1;
and f: u
-+
B)}
1; < A} 1;
On}.
€
For u in the domain of f, f(u) is, approximately, the Boolean truth-value of the statement "u values
[g
pairs g,f
€
€
f]lB and
VB.
€
[g = f]lB
f".
Definitions of the truth
may be given by recursion, for all
Consider the case of the two-element algebra
~=
{ID,1};
v1 is,
once factored by the equivalence relation induced by identifying f and g whenever V. thus
[f
= g]2 = 1,
isomorphic to the original universe
As! is a complete subalgebra of any completem, v
VB
1
may be regarded as an extension of the original
Such extensions are called Boolean extensions.
c
~,
and
~niverse
V.
If we were dis-
cussing instead of V a countable transitive model M, with lB a member of M and a complete Boolean algebra in the sense of M, then we would be able, by factoring lB and hence convert
MB
by a suitable ultrafilter, to
to a countable transitive model N
ordinals as M. is a
ME
~ M having the same
We could then express within N the statement that N
(factored) Boolean extension of M. Being unable to decide whether V
= L,
let us reformulate the
question thus: Is V a Boolean extension of L? One case where the answer is known to be negative is when there exist measurable cardinals.
Originally, a cardinal K was defined to
be measurable if there was a measure taking only the values 0 and 1, defined on all subsets. of K, assigning measure 1 to K, measure 0 to each singleton {1;} (1;
O
each
It was noticed
had the further property that for
A less than KO ' the union of A sets of measure 0 was again of
measure 0; so with a view to generalisation that property of
K-additivity:the measure was added to the definition of the measureability of K. In fact if there is a measurable cardinal then the constructible universe, L, becomes very small in relation to
V: let us denote by
534
A.R.D. MATHIAS
the following hypothesis for every uncountable cardinal A there is a closed unbounded subset XA of A such that ~
(1) for any positive integer n any formula ep (xl,..~xn) of the < < language of set theory and any two ascending n-tuples E;l.,.E;n'
< <
1;l .. ·1;n' from XA, LA
f
$(E;l,..,E;n) -
LA
F
ep(1;l,···,1;n)·
(2) for each member x of LA there is a ep and 1;l, ••• ,1;n in XA such that for all y in LA' LA
~
$(y,1;I, ••• ,1;n) iff y
= x.
Moreover, if A < K, and A,K are uncountable cardinals, X = AOX K, A and LA is an elementary submodel of L in the sense that for any K ep ( ) and xl"'. ,x n " LA. LA
F
ep (xl'··· ,xn)
iff LK
F
ep (Xl'··· ,x n)·
There is a property o/(a) of real numbers a, such that it is provable in ZF that at most one a has the property 0/: if V = L, no a has the property 0/: and that 3 ao/ (a) iff the hypothesis ~ above holds. Moreover, 0/ is expressible purely in terms of integers, real numbers and recursive functions thereof. The unique a of o/(a) is known as 0* : it may be proved that 0# is in no Boolean extension of L. The assertion that of does not exist is denoted by ~ of A measure of the significance of o~ is given by a theorem of Jensen: before it can be stated,some comments on constructibility and Boolean algebras are necessary. First the concept of constructibility may be relativised: if A is a set of ordinals, we define Lo[AJ L1;+l[AJ LA [AJ L[A)
{A} uU{1;+l!1;"A} Def(L1;[AJ)
U{L1;[AJ 11;
Then L[A) is the smallest inner model of which A is a member. A is a class of ordinals, we define
If
THE REAL LINE AND THE UNIVERSE
535
L is thus L[O], and we may generalise the concept of O~ defining
A~
, as is done in
by
Boos [8].
Second, the Boolean algebras occurring in applications of Cohen's method are usually given in terms of a partial ordering, called the set of conditions, which generates the algebra.
A natu-
ral generalisation of that is to consider partial orderings which are a proper class, though caution must be exercised as the Boolean extension corresponding to such a class need not be a model of ZF. With those remarks in mind, let us enunciate Jensen's theorem: Let M be a countable transitive model of ZF + GCH + -, let a be the least ordinal not in M.
Then there is an a
c
o-.lf ;
and
w =
{O,1,2, ••• } such that, writing N for La[a], M.=. N; ZF + GCH + -, of: N
f
V
= L[a];
the cardinals of N are exactly those of L, and in the passage from M to N cofinalities are preserved,
(as are the large cardinal prop-
erties of being Mahlo, weakly compact and ineffable).
Moreover N
may also be construed as a Boolean extension of M with respect to a collection of conditions that is a class of M. Since any model of ZF + AC has a Boolean extension, via a class of conditions, which satisfies GCH, the theorem shows, loosely, that if 0*
does not exist, the universe may be coded by a single real,
in the sense of being contained in some class-generic extension of the form L[a] with a .=. w. To return to
t~e
question "Is V a Boolean extension of L?",
Jensen's coding theorem shows a connection between that question and the existence of
at
In fact Jensen's paper [
] of which the
above is Theorem 1, contains further information.
Call a set X .=. On
set generic over L if L[X) is a Boolean extension of L with respect to a Boolean algebra which is a set in L.
To quote from Jensen [
F-. O~
"A well-known conjecture of Solovay is this: (SC)
If a .=. On is a set such that L[a]
, then a is set-
generic. "Using Theorem 1 we can construct a model in which SC fails ••• There is a weaker form of SC which reads: (WSC) If a
£
L[Ot ] and O~
"Theorem 2
~ L[a]
Assume that A#'
then a is set generic. exists.
Then there is a .=. w such
that (a) a is not set generic (b) L[a] has the preservation properties of Theorem 1
]:
A.R.D. MATHIAS
536
a#- ~
with respect to L (hence (clot
and at
L[a])
are recursive in each other.
"We have thus shown that SC is not provable in ZFC, even from
.., at ,
and that WSC + "0#
Thus severely.
,at
exists" is provably false.
II
emerges as an axiom that constrains the universe
Here is one consequence; others will be mentioned in
Part II. Jensen's Covering Lemma. If countable set X
at
does not exist, then given any un-
~ Y ~ On and X and Y have the same cardinal. If ~ were provable, the effect on set theory would be very marked: in the light of Jensen's theorem the notion of relative constructibility would emerge as the central concept of set theory, and the union of set theory and abstract recursion theory would be almost complete. For a treatment of GBdel's work see Devlin [10]; for Cohen's method see Shoenfield [11] and Jech [12]; for sharps and large cardinals see Drake [13]; and for an exposition of Shelah's work on Abelian groups, see Eklof [14],[15]. ~
On, there is a Y in L, such that X
ot
PART TWO
Blass [16] has shown that the eXistence of measurable cardinals is equivalent to the existence of a non-trivial exactt~~tor from ~
to
Problem I
'If
~;
1
a ~ w
Problem 2
tWD problems suggest themselves:
Find a category-theoretic formulation of a:fl'
-,0*
, or of
exists' •
Find a proof of the equivalence of the two following
statements that makes no mention of ultrafilters:
3 non-trivial elementary embedding of V into some inner model
3
non-trivial exact functor from Sets to
~.
A third version of Solovay's is this: call a n~ predicate ~(a) singular if ' ZF.
3
~1. a~ (a)
The predicate 'a
Problem 3
I
and 'V-=L
= o~
Let ~(al be singular.
theorem of ZF?
"'-,"3
a~ (a)' are both theorems of
' i s singular. Is'
3 a~(al
... of
eXists' a
THE REAL LINE AND THE UNIVERSE
537
Some progress on this has been made by Jensen using the ~,
~
K, discussed in Jensen and Dodd [17J.
Jensen's coding theorem suggests that an attempt might now be made to classify inner models.
There is work by Yop~nka and Hajek,
Balcar [18J and Grigorieff [19J in this direction. Boolean algebras supply extensions of the universe; the ultrapower method of Scott (see Kunen [20] and Gaifman [21]) provides a means, given measurable cardinals, of shrinking it.
The recent
theorem of Dehornoy [22] that the intersection of the first w iterated ultrapowers of Y is a Prikry generic extension of the w-th, supplies ene link between the two.
A second means of combining
Boolean-valued models and ultrapowers is known: the following definition is due to Prikry and Jech. Let I be a K-complete ideal on K.
Let B be the regula~ minimal B In y there is an ultra-
completion of the Boolean algebra P(K)jI. filter U in the algebra P{K)
of standard subsets of K that extends
{x~KI K\XeI}, and with which the ultrapower yK j U of equivalence
classes mod U of standard functions f:K + Y may be formed.
I is
termed precipitous if, with truth value 1, that ultrapower is wellfounded.
For an elementary alternative formulation and for appli-
cations, see the text of Jech's talk at this meeting. Problem 4 (Jech)
If there is a precipitous ideal, does there
necessarily exist a normal one? Theorem (Magidor)
If Con {:3 K (K is K+-supercompact»
ideal of nonstationary subsets of
then Con (the
CUI is precipitous).
Precipitous ideals have been used to obtain cardinality bounds, in the spirit of the proof and statement of the following Theorem (Silver [23])
If 2
~
countable ordinals v, then 2""
=
h" v+l
for a stationary set of
= H'wl+l.
Thus Jech and Prikry have proved from the assumption that there is a precipitous ideal on w the following l If
(*)
ff w
1
1'?' wI
is a strong limit cardinal, then 2
<
tr w 2 .
The same conclusion has been derived by Magidor from the consistency-wise weaker assumption known as Chang's conjecture:
as
Any structure M = <w ••• ) (where lew l' is here regarded 2,wl,R ... !r. predicate) has an elementary~structure
a~ ~~~j
with
A = .1\
and Afu;;l = }foe
A.R.D. MATHIAS
538
But Magidor's work has itself been superseded by results of Shelah [24 r.
5
Problem
Is (*) provable in ZFC?
Problem 6 (Jech): Let $(~) be the ~th fixed point of the ShoW in ZFC that if $( $ ( H' ) ~ 2 1 <$((2 1)+).
ifl )
Bfunction.
is a strong limit cardinal, then
Chang's conjecture is equivalent to (CC given f: l) closed under f).
[w
2
] < W+ w ' 2
:3 A(A:
= HI' Anw
countable, and A
l
An ostensibly weaker form is (CC 2)
for each f: [W
WI there is an A with
+
2]<W
A
f"[A]<w countable. (CC
2)
(CC for each f: [w ] 2 3) f"[A]2 countable. (CC
3)
Vex exf
implies that 2
+
exists.
A still weaker form is
WI there is an uncountable A with
implies that there is no Kurepa
Problem 7
Is CC equivalent to eCl? 3
Problem 8
I f not, is (CC
3)
HI-tree.
true in some Boolean extension of L?
Silver's consistency proof for CC expounded in Devlin [2sJ, l' proceeds via an intermediate extension in which Martin's axiom
+, CH holds.
If we write CC
then Problem 9
I
in the notation (w
2,w l)
==>e(wl'w)
Is (w3 ' W2 ) ~>e(w2'Wl) consistent?
The natural generalisation of Silver's proof leads to a question concerning a version of
MA
for higher cardinals, for which see
Devlin [26]. Problem 10
(Magidor)
Refute the following version of MA:
if B does not collapse cardinals, any diagonalised. Problem 11 (Erd8s, Hajnal) Problem 12
Ifl
dense sets can be
Establish the consistency of w2
Establish the consistency of 2 N' 0
=
that that implies the existence of
~
N1
./I:
+
(Wl+W)
+ SH H ' or show 2
Magidor shows in his paper [27J that if CC is true, then there
2
THE REAL LINE AND THE UNIVERSE
539
is, in a certain Boolean extension, an ultrapower vK/u of the standard universe such that if j: V
VK/U is the canonical elementary embedding, j (wI) is well founded and isomorphic to w • 2 Problem 13
+
Does that statement imply CC?
The problem seems similar to Kunen's question whether if there is an
.H';t-saturated ideal on
Sf l' it 1
is huge in some inner model.
Another property of ultrafilters which turns out to be connected with sharps is that of regularity: an ultrafilter U is (K,A) regular if there is a family of any K of which is empty.
VX:€u
X= I,
A elements of U the intersection of
An ultrafilter U on I is uniform if
and is regular if it is (w,l) regular.
For recent work
on this topic see Ketonen [28], and Jensen and Koppelberg [29]. Problem 14
If V = L, or, more generally, if .,
0:1= ,
is every uniform
ultrafilter on K regular? The case
K
=
}f n is known if V
L or if
K
is not ineffable in
L. Problem 15
That every uniform ultrafilter on wI is regular is a consequence of V
= L.
Is it a theorem of ZFC?
An easier form of Problem 15 is Problem 16
If there is an irregular ultrafilter over wI' is there
an inner model with a measurable cardinal?
Is Chang's conjecture
true? Recall that a cardinal K is huge if there is an elementary embedding j of the universe into some inner model M such that K is the first ordinal moved by j, and every sequence of length j (K) of elements of M lies in M.
For information on these and other large car-
dinals see [30]. Lheorem (Magidor)
If the existence of a huge cardinal is compatible
with ZFC, so is the existence of an irregular uniform ultrafilter on
w2 • Problem 17
Is it consistent to have a uniform ultrafilter U on wI wI such that the ultrapower w /U is of cardinality ~l? Kunen proved that the consistency of the existence of a huge
cardinal implies that of the existence of an on
HI.
Problem 18
The following is still open.
H2- s a t ur a t e d
Can the ideal of non-s-l:a-t:ionary subsets of
~ 2-saturated?
ideal
540
A.R.D. MATHIAS
Problem 19
(Kanamori)
Does the existence of
at
follow from the
existance of an indecomposable ultrafilter on some singular cardinal? For a discussion of the following problem see Shelah [31]. Problem 20
Is there a Jonsson algebra in every successor cardinal?
Problem 21
Is there a Jonsson algebra of cardinality
Silver proved some years ago that if not, in an inner model.
HW
1f\ W?
is measurable
Some further problems on ultrafilters. Let V be an ultrafilter over I. coinitialityof {a€KI/vl
Define icf(K,V) to be the
~
Problem 22
(Shelah)
Can icf(K,V)
Problem 23
(Shelah)
Can icf(wl,V)
~
K for regular V?
=w
for I
= WI?
Call a free filter F on W feeble if for some strictly monoto-1 -nic f:w ... W {X,5,w 1 f "X€F} is the Fr~chet filter. Jalali-Naini and, independently, Talagrand have proved that a filter is feeble if and only if, considered as a subset of 2w , it has the property
of Baire. Call a filter F a p-filter if given Xi € F (i<w)
V i(Y\X i
is finite).
3 Y€F
A p-point is a p-filter which is also an
ultrafilter. Problem 24
Is there a p-point?
Such may readily be constructed using Martin's axiom1 for other constructions see Ketonen [3~.
As no ultrafilter is feeble, a
weaker question is Problem 25
(Kanamori)
Is there a p-filter which is not feeble?
Equivalently, is there a filter G on w that is coherent in Kanamori's sense that whenever X € G and A ,5, G are such that
Vn
< W {A€AI
with
€ G?
nB
Ann = Xnn} is infinite, there is an infinite
The author has recently proved that if
~O#
B
c
A
, a coherent fil-
ter on W exists. Problem 26
Is it provable in arithmetic that the consistency of
"There is a strongly compact cardinal" imples that of "There is a supercompact cardinal".? The next four problems are connected with strong assumptions such as the axiom of determinacy, AD, or huge cardinals.
THE REAL LINE AND THE UNIVERSE
541
Magidor [33] has proved that if "There is a huge cardinal" is consistent, so is
"Vn
(1')
< W 2
Hn
u.
&
IT n + l
=
2
if W > n\x w+l •
How strong is (t)?
Problem 27
Mitchell has shown that (t) implies that
~w is measurable in
some inner model.
V A2 A
> A+"
Problem 28
Prove"
Problem 29
Prove from a large cardinal axiom that all E
consistent.
1
-5
Lebesgue measurable. Problem 30
sets are
Show that AD is inconsistent but that 1 n Determinacy 0:: ) " is not.
"V
Problem 31
-
-n
Find a consistency proof for ".., AC + all limit ordinals
have cofinality w". It follows from Jensen's covering lemma that if cf(w
o~
exists.
l)=cf(w 2)=w, Magidor can get the first a cardinals to be singular,
starting from a+l supercompacts. Problem 32
Obtain consistency proofs for the following consequences
of AD: the closed unbounded subsets of w
l
for some uncountable K, K ~ (K)w;
generate an ultrafilter;
for some uncountable K, K ~ (K)w+W. Some more problems concerned with the negation of the axiom of choice: Problem 33 (Blass)
Does AC follow in ~F from
"V x 3 y
(y can be
mapped onto x and y-indexed choice holds)"? Problem 34
(Pincus)
Find a model of
AC the ordering theorem n, Problem 35
(Pincus)
~the
,
a AC +Va AC which satisfies
prime ideal theorem.
Find ZF models for the following, which are all
known to have Fraenkel-Mostowski models: The Hahn-Banach theorem + The Krein-Mil'man theorem +,AC; Urysohn I s Lemma + -, ACw; The uniqueness of algebraic closures, where they exist,
bwt
not
the existence of algebraic closures. w Here AC means AC for families of sets of power n; AC means AC for n countable families.
542
A.R.D. MATHIAS
Problem 36 (Truss)
Consider the three statements
(P) every uncountable set of reals has a perfect subset (L) every set of reals is Lebesgue measurable (B) every set of reals has the property of Bqjre. Prove in ZF + DC that (P) implies (L) and that (L) is equivalent to (B) •
Two odd problems: Problem 37 (Yates) of order type W 1 ?
Is there an initial segment of the Tuying degrees
Problem 38 (Blass) Let IT = {flf:w + wand f strictly increasing}. Is there a family {Aflf E IT} of subsets of w such that given distinct m~bers
fl ••• fn,gl ••• gm' of IT,
Af n••• n Af n(w\A g )n ••• n (w\A ) 1 n 1 gm
~
0,
and such that each interval {x If (n) s x < f (n+l) } is included in or disjoint from A ? f
(fEIT, nee)
By considering Cohen reals it may be seen that the answer to that is yes if m is not the union of fewer than 2H• nowhere dense sets. Finally, some problems connected with L. Problem 39 (Jech) If V = L, there is a complete Boolean algebra regular with no proper infinite complete subalgebra. Is the existence of such provable in ZFC?
0
Call <S~I~ < wI> a A-witness if V ~ S~ ~ P(~), S~ < A, and Vx c wl{~1 xn~ES~} is stationary. It was shown by Kunen that there is an }f 1- <) - witness iff there is an Z- (; -witness. Let 0 (K ,A) be the statement that whenever <S~> is a K-witness, there is a Awitness
0
0-
o
«2 w)+,2).
2.
0+
+
0«2
3. V=L + 0 (
Problem 40
Is 6(
w
)+,
K) I' 2)
H' 1'2)
h'l)'
•
a theorem of ZFC, or of ZFC + o+?
<>
Problem 41 (Jech) Find a class of statements including and Silver's W such that if such a statement can be shown consistent via forcing countable conditions then it is true in L.
THE REAL LINE AND THE UNIVERSE Problem 42
(Juhasz)
543
Find a set-theoretical assertion which follows
in ZFC from each of the continuum hypothesis, the negation of Souslin's hypothesis, and the assertion that the universe is the result of adding a Cohen real to some inner model, and which implies some of their common consequences. Perhaps a solution to Problem 42 lies
i~
the forthcoming papers
[34J and [35J, which offer a solution to the problem of axiomat ising Jensen's consistency proof for GCH + Souslin's hypothesis in the sense that MA + 2
R.
>
~
1 axiomatises the original Solovay-
Tennenbaum proof consistency proof for SH. Problem 43
Does the existence of a morass imply Jensen's D?
PART THREE The author began his Oxonian oration with an outline of the steps by which he has come to believe that the debate between the various philosophies of mathematics is a particularisation of the debate between various accounts of the world.
Put succinctly, his
thesis is that one's view of life determines one's view of mathematics; though in that form it is often found to be false, as people happily believe inconsistent things.
The author considers that
parallels may be drwan between Platonism and Catholicism, which are both concerned with what is true; between intuitionism and Protestant presentations of Christianity, which are concerned with the behavious of mathematicians and the morality of individuals; between formalism and atheism, which deny any need for postulating external entities; and between category theory and dialectical materialism. The author reached this last parallel through contemplating the Hegelian overtones of category theory, and he was gratified to find i t supported in lectures of Lawvere. The author is most grateful for the stimulating criticism he received of his talk from many mathematicians and philosophers, which he hopes to put to good use in due course.
544
A.R.D. MATHIAS Bibliography
1.
A.R.D. Mathias, Surrealist Landscape with Figures, Periodica Hungarica, to appear.
2.
S. Shelah, Infinite abelian groups - Whitehead problem and some constructions, Israel Journal of Mathematics 18 (1974) 243-256.
3.
S. Shelah, A compactness theorem for singular cardinals, free algebras, Whitehead problem and transversals, Israel Journal of Mathematics 21 (1975) 319-349.
4.
W. Fleissner, Normal Moore spaces in the constructible universe, Proceedings of the American Mathematical Society
5.
~
(1974) 294-298.
R.B. Jensen, The fine structure of the constructible hierarchy, Annals of Mathematical Logic, 4 (1972) 229-308.
6.
A.J. Ostaszewski, A perfectly normal countably compact scattered space which is not strongly zero-dimensional, Journal of the London Mathematical Society, (2),
li
(1976), 167-177.
7.
R.B. Jensen, The morass, to appear.
8.
W. Boos, Lectures on large cardinal axioms, Springer Lecture Notes Volume 499, Logic Conference, Kiel 1974, ed. G.H. Mllller, A. Oberschelp and K. Potthoff, pp 25-88.
9.
R.B. Jensen, Coding the universe by a real, 348 pp manuscript with appendices.
10. K.J. DevIin, Aspects of Constructibility (Springer-Verlag,1973, Lecture Notes in Mathematics No.354). 11. J. Shoenfield, Unramified forcing, AMS Proceedings of Symposia, Volume XIII Part 1, ed.,D.S. Scott, pp357-381. 12. T.J. Jech, Lectures in Set Theory (Springer-Verlag, 1971, Lecture Notes in Mathematics No.217). 13. F.R. Drake, Set Theory, An Introduction to Large Cardinals (North-Holland, 1974, Studies in Logic). 14. P.C. Eklof, Whitehead's problem is undecidable, American Mathematical Monthly. 15. P.C. Eklof, Independence results in algebra, Lecture notes, 107pp typescript. 16. A. Blass, Exact functors and measurable cardinals, Pacific Journal of Mathematics 1976.
THE REAL LINE AND THE UNIVERSE 17. 18.
545
T. Dodd and R.B. Jensen, The Core Model, 207 pp manuscript. B. Balcar, A theorem on supports in the theory of semisets, Commentiones Mathematicae Universitatis Carolinae 14,1 (1973).
19.
S. Grigorieff, Intermediate submodels and generic extensions in set theory, Annals of Mathematics 101 (1975) 447-490.
20.
K. Kunen, Some applications of iterated ultrapowers in set theory, Annals of Mathematical Logic, 1 (1970) 179-227.
21.
H. Gaifman, Elementary embeddings of models of set theory and certain sy subtheories, AMS Proceedings of Symposia, Volume XIII Part 22, ed. T.J. Jech, pp331l0l.
22.
P. Dehornoy, Solution d'une conjecture de Bukovsky, Comptes Rendues Acad.Sci. Paris t.28l (17 Novembre 1975).
23.
J.H. Silver, On the singular cardinals problem, Proceedings of the International Congress of Mathematicians, Vancouver 1974, 265-268.
24.
S. Shelah, A note on cardinal exponentiation, l5pp typescript.
25.
K.J. Devlin, A note on a problem of
Erd~s
and Hajnal, Discrete
Mathematics!! (1975) 9-22. 26.
K.J. Devlin, An alternative to Martin's axiom, Springer Lecture Notes Volume 537, A Memorial Tribute to Andrzej Mostowski, ed. W. Marek, M. Srebrny and A. Zarach, pp65-76.
27.
M. Magidor, Chang's conjecture and powers of singUlar cardinals, Journal of Symbolic Logic, to appear.
28.
J. Ketonen,
~onregular
ultrafilters and large cardinals, Trans-
actions of the AMS, 224 (1976) 61-73. 29.
R.B. Jensen and B. Koppelberg, A note on ultrafilters; and an Addendum. 70pp manuscript.
30.
A. Kanamori, W.N. Reinhardt and R. Solovay, Strong axioms of infinity and elementary embeddings, Annals of Mathematical Logic, to appear.
31.
S. Shelah, J6nsson algebras in successor cardinals, 10pp typescript.
32.
J. Ketonen, On the existence of p-points in the Stone-Cech compactification of integers. Fundamenta Mathematicae, 92 (1976) 91-94.
546 33.
A.R.D. MATHIAS M. Magidor, On the singular cardinals problem, II, 59pp typescript.
34.
K.J. Devlin and S. Shelah, A weak form of the diamond, in preparation.
35.
S. Shelah, Whitehead groups may not be free, even assuming CH,I. l5pp typescript.
R. Gandy, M. Hyland (Eds.). LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM David Pincus*
This paper presents a model of ZF set theory in which the principle of dependent choices and the prime ideal theorem for Boolean algebras are both true while the axiom of choice is false. The problem of finding such a model was posed in (2) and remained open, despite considerable effort, in (5).
(5) §I contains a model which I considered to
be the best candidate for the theorem of this paper, together with an incomplete proof sketch along the lines of the original independence proof of the axiom of choice from the prime ideal theorem (1).
No progress has been made on this model.
The model used here is that of (5) §III. modification of) that of (1).
It is obtained by iterating (a
The important difference between this model and the
one of (5) §I lies in the use of hereditarily almost disjoint functions.
These
permit the proof of the prime ideal theorem through iterative ultrafilter averaging.
The combinatorial principles basiC to (1) play no part here.
theorem of (1) is necessary to start the induction.
However, the
It would be interesting to
completely eliminate combinatorics from the theorem. I state without proof some further applications of the method.
The prime
ideal theorem, Hahn Banach theorem and canonical uniform ultrafilter principle (A uniform ultrafilter includes all sets whose complements are well orderable and have smaller cardinality.
The principle states that there is a function assigning
a uniform ultrafilter to the power set of each infinite set.) can be added to the class of automatic ZF transferable Fraenkel Mostowski independences given in (5) §IV.
The arguments resemble those in the appendix to (3).
One can then add
dependent choice to the independence of the prime ideal theorem from the Hahn Banach theorem through Fraenkel is given in (6).
~ostowski
transfer.
Another method for doing it
Fraenkel Mostowski transfer also permits one to add dependent
choice to the independence of the ordering principle from the canonical uniform ultrafilter principle (4).
Hereditarily almost disjoint functions also come into
the Fraenkel Mostowski model. I.
Hereditarily Almost Disjoint Functions I modify slightly the definitions of (5) §III.
fixed regular cardinal.
A function of rank
0
Let H be a set and let K be a
over H is an element of H.
A func-
tion of rank a+l over H is a 1-1 function a with Domain a=K where each a6, 6
(By convention I omit the usual parentheses denoting
*Supported in part by NSF Grant MES 61-6778 547
548
DAVID PINCUS
function values.)
A function of limit rank a over H is a 1-1 function a with
domain aXK where for each S
Henceforth a,b, etc. will
denote ranked functions while A,B, etc. will denote sets of ranked functions. Va, the hereditary value set of a is the set containing a, all possible as's (these are the direct values of a), all possible aSA's etc. H, which may also be functions, are excluded.
Values of members of
In particular it is assumed that no
members of H coincide with functions of nonzero rank over H. partially ordered under the relation a~b++aeVb. defined for sets of ranked functions over H via joint,
! Q.,
disjoint function if
i'
i
a and bare almost dis-
VA=a~AVa.
if they have a finite set of maximal
nically this means VanVb=<jl or for some c
Ranked functions are
Hereditary value sets are also ~
values, MCV
VanVb=i~nV\,)
~'
£..
Tech-
a is an almost
1.
Range a is pairwise AD
Z.
If a has limit rank then every value of a is a direct value and values are interpolable at each rank, i.e. (VbeVa) (VS:rank b<S
a is a hereditarily almost disjoint, HAD, function if it's almost disjoint and its range consists of HAD functions of lower rank.
(Elements of H are HAD.)
is an almost disjoint set of HAD functions if it's pairwise AD.
A
Lemma III 6 of
(5) says that if A is an AD set then so is VA.
If A and B are AD over HI and HZ respectively an embedding of A into B is a 1-1 rank preserving function
~:VA~B
such that
(~a)S=w(aS)
for every asVA. S
The differences between the above definitions and those of (5) are stipulaion Z about functions of limit rank and the fact that all domains are K or aXK for some limit a
These modifications facilitate the coding of HAD functions of
high rank by HAD functions of low rank.
The ctlanges have only minor effects on
the theory of HAD functions developed in (5) 1113-11110. of 11110 a must be done slightly differently.
The easy construction
Also in 11110 a, if as' S
are increasing the function as=a will not itself be an HAD function but S there is an HAD function whose direct values are exactly the values of the as's.
limi~
II.
The Model The model, N, is that of (5) Theorem 3.1.
It is parameterized by the regular
cardinal K satisfying ZK=K+ in the ground model U of ZF + class choice.
It will
be seen to satify the prime ideal theorem, the negation of choice, and the K version of dependent choice.
A related construction satisfies a dependent choice
for all a
(Carry the iteration below through K
steps instead of K+ steps.) N
is built by the following /
N_l=U
iteration.
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM
549
N =U(A ) where A is a set of independent (K-closed product) K-closed Cohen o
0
0
generic (over U) elements of 2~
+
Na+l=Na(Aa+f where A l is a set of independent generic (over N 1-1 maps of a) a+ K into Aa• Na_, for limit a, is the direct limit iteration of the N S' S. where A is a set of independent Na-generic Na, for limit a, is Na_(A a) a functions of rank a over A taking values in S~aAe and satisfying stipo ulation 2 of I. N=N +. K-
Independence means genericity in the usual product partial ordering.
The notion
of forcing used in producing No is standard.
The one used in producing N from a (Note: a-=a-l when the latter exists.)
N is more interesting. a_ A condition in N describing fsA consists of: a a_ A K many equations among the hereditary values of f, (f is a function of rank a over A o') B K many stipulations of the form
g(S)=O or g(S)=l for the hereditary values
g of f with gsA o' C finitely many stipulations of the form fS=a where a is in the appropriate A"A' "A
(i.e. if fS=f"A then
The stipulations of C must square with those of A and B.
fe~·f"Ao.)
Finally, the stipulations
of A must specify only equation systems which are possible in terms of relations which hold among the elements of AS' S
One condition extends another if all
three kinds of equations are extended. Observe first that all equalities and inequali ties between hereditary values of f can be decided by parts A and B of a single condition.
Since only K many
equalities and inequalities are involved it suffices to show that if an equality is not forced by P then some extension preserving part C forces the inequality. As an example, suppose f"Arfo is to be forced where both f"A and rank.
fo have the same
For some sequencetof values it must be the case that fAa and fot have rank
a and f"A;=fo; is not forced by part A of P.
One can then introduce an ordinal
V
values of f and g.
which
Using the same sort of argument as in the above paragraph, an
extension can force these values to be maximal common values. By induction
gSS~aAS
The condition can explicitly specify only finitely many common
a~K+Aa
Thus f and g are AD.
is an AD set of HAD functions.
Observe next that the extension adds no new sets of ordinals with cardinal SK.
Parts A and B of conditions are closed under K unions.
As far as part C is
concerned, it should be recalled that one is not only doing forcing, one is also
DAVID PINCUS
550
passing to the inner model generated by A over N _ From the symmetry lemma of a a (5) III 18, it follows that a statement involving fl, ... ,fksNa_ will be decided by a condition whose C-part involves at most fl, .•.• f will decide
K
many values of any function into U.
Therefore, a single condition k. A similar argument shows that
the formation of N _ at limit a adds no new sets of ordinals of size a
~K
S
to any N S'
The forcing has now been reduced to the following form, essentially that of
(5) III 13. ~ a set
A condition PsN describing f .•. ,fksA consists of the following: a a_ l, A(P).{fl, ... ,fk}sU of HAD functions of rank a over a set H(P)=
H(p)nVA(p), B for each xsH(P) a K-closed Cohen condition B(P.x) from U where B(P,x)cK X2. IB(P,x) I=K. and B(P.x) is a function,
f a function C(P) defined on a finite subset of VA(P) satisfying the compatibility conditions:
!
every cs Domain C(P) has rank
Z if cs Domain C(P) has r.ank S then C(P.C)sA
S' 3 if cl'cZs Domain C(P) and clSl'· ..• Sk=cZA l •. · .,A£ then C(P.cl)Sl'· ",Sk= C(P,cZ)A •...• A£. l 4 if cs Domain C(p.cS •.. "SksH(P) and AS Domain B(P,cSl •...• S then l k) B(P.cS l•·· "Sk)(A)=C(P.c)Sl'· ",Sk A.
The partial ordering P$Q holds when there is an embedding W:A(P)7A(Q) satisfying the natural stipulations: C(p.c)=C(Q.Wc). P=Q+7P5Q'Q$P.
Actually.
~
~
if xSH(P) then
B(P.x~B(Q.wx). ~
if cs Domain C(P)
is not quite a partial ordering but reduces to one via
P is unique up to a 1-1 correspondence between H(P) and H(Q).
Even
this correspondence is unique when B(P.x is incompatible with B(P.x whenever Z) l) xl#x and this happens for a dense set of conditions. Z fl.···,f satisfy P if C(P)U{(fi.fi):i=l •...• k} satisfies clauses~. 1 and i k in the definition of C(p). This is done relative to an assignment of f to f but i i when xl#x B(P,x is incompatible with B(P.x such an assignment is unique. Thus, Z l) Z) it will not be mentioned. Density says that every P is satisfied by some k tuple from Aa. ~-continuity says that if ~(xl"" .xk'Yl" "'Ym) has parameters in UU{
there is a condition P with Range C(P)c{gl •• ··.gm} such that fl ••..• f
fi... .. f k
satisfy P and ~(fi"" ,fk.g
,gm) holds whenever l.,· similarly formulate Na_-continuity and N-continuity.
The iterated forcing can also be written down in U.
satisfy P.
k One can
A condition there consists
of an A(P) and B(P) as above but C(P), instead of taking values in an AS' has values in constants for members of AS in the appropriate forcing language. defined in terms of an assignment of actual members of The functions fl" ..• f
k model conditions involving f
S~K~S
Satisfaction is
for those constants.
form a generic set when they are generic for the ground l,
..• ,f
k
of the same respective ranks.
fl" .. ,f
k
are
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM
551
seen to be generic exactly when they are HAD functions over a set H whose elements are K-closed product generic for the K-closed Cohen partial order in U. finite subset Df
a~K+Aa
is generic.
CDnversely, every generic set is a subset of
are analogous tD the A in a model N' which has the same a forcing and inner modeling Dver U as N does.
a~K+A~
where the
Every
A~
It will be helpful tD recall the support theory of the Na's and Na_'s. Na=N(a+l)_ it suffices to discuss the Na_'s. supports
Since
A support is a finite GcS~aAS'
Q
when x is definable in N from parameters in UU«AS:B}UG. VG is a_ the class Df sets suppDrted by G. The symmetry lemma (5) III 18 implies that ~
VG is absolute in the sense that VNSG=IINSnvNaG for S
(Here IIN
pDwer class of N S') Using symmetry facts VG
S
denotes the
For every suppDrt G there is a unique l=7G 2++VG l=VG2. l G such that VG element of G is a value of any other. Such a G is and no 2 2 2 l=vG 2 called efficent. Supp a is the set of efficent supports from N Since a support a_ has a natural well Drdering each VG has a canonical well Drdering. The At h element Df this well Drdering is denDted T(G,A).
The support lemma implies that every x
is suppDrted by a canonical efficent support.
Using this in conjunction with the
function T(G,A) shows that a bijection of N with Supp aXOn is definable in Na_ a_ Ill. Regular Ultrafilters If X is a set II«X) denotes the set Df finite subsets of X.
A regular ultra-
filter Dn ! is (by a slight misnomer) an ultrafilter on PII«X) which includes the filter base consisting of all sets of the fDrm
{F£II«X):F~G}
where G£P«X).
The following reduction shows that the real task of this paper is to produce regular ultrafilters on the A a• Reduction: If each A a
~a
relative to N a
K-dependent choice and the negatiDn of full chDice were proved in N by (5) §III.
(K+ choice fails fDr the sequence .) TherefDre, it Dnly remains a to prove the Reduction and to produce the ultrafilters ~a in order for the result to be complete.
Notice that the terminology "relative to" in the Reduction is
important.
PrDducing an ultrafilter on A relative to N would be harder than a producing one relative to N because N contains more subsets Df P
V~
has a prime ideal in
V~.
This will be done by
inductively proving for o
Let B be the given BDolean
For S S_' such that the IS are increasing and each IS is a prime ideal in B If S is a S' limit ordinal one can let IS=A~SIA' If S=A+l I generates a V~ ideal IS in B A S'
552
DAVID PINCUS
Using the canonical bijections of N with Supp SXOn and N with Supp aXOn a S together with the absoluteness of 7G one can identify B and J with a7~ Boolean S S algebra and ideal in N ' The inductive hypothesis extends J to a prime IS and S S the sequence remains in 7~ if one uses the least possible IS in the well ordering of
7~.
If a=S+1 is a successor ordinal one must show that every
7~
Boolean algebra
B of N has a 7~ prime ideaL Every element of N is supported by a GsP
The parameters of this definition are AS' are in
7~.
~S
and the function
G~JG
all of which
It is easy to see that I is a prime ideal once it is observed that
for any x: {GsM«AS):xSJGv-xSJG}s~S'
This is true by regularity of ~S.
The above set includes {G€P«AS):G~Go} where
Go is any support of x in P«A
S)' It remains to deduce the prime ideal theorem for arbitrary Boolean algebras
from the 7~ case. If G and'G 2 are supports say Gl~G2++GlcVG2' This makes the l supports into a predirected set (i.e. a pre-partially ordered set whose reduced partial ordering is a directed set). a filter base in
7~.
Let
vs7~
Thus, the sets of the form {G:G~Go} are
be an ultrafilter including all such sets.
The rest of the argument follows the one above. algebra.
Let Go support B.
canonical prime ideal J
G
If
G~Go
then Bn'1G is a subalgebra of Band it has a
Say xsI++{G:xsJG}€v.
because the above set includes
Let B be a given Boolean
Again for any x, {G:xsJGv-xsJG}SV
where G is a support of x. I The above two arguments are examples of ultrafilter averaging. The prime {G:G~GoUGl}
ideal I is the average of the partial prime ideals J filters
~S
or v.
via the averaging ultraG One can equally well form an ultrafilter by averaging partial
ultrafilters over other ultrafilters. ducing the ultrafilters IV.
Outline of the Construction of the ~a
This will be an important point in pro-
~a' ~et
is constructed simultaneously for all models with the same forcing and
inner modeling as N over a ground model U' which has the same ordinals and a Only the case U=U' will be explicitly discussed but the
K-sized sets as U.
argument would apply equally to another U'. The inductive hypothesis is that the corresponding models where S
7~
~S
exist for all such
The ultrafilter averaging arguments above imply that in all Boolean algebra has a
7~
prime ideal.
The case a=o is the theorem of Halpern and L~vy (1).
They treat Cohen
forcing rather than K-closed Cohen forcing but this makes no difference, as has
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM been noted in (3) and (4).
553
As has been indicated the general idea fora>o is to
code elements of A by elements of AS' S
The averaging ultrafilter is usually a
ultrafilter on K or a V~ unbounded ultrafilter on R(a), a
V~
regular
Both of these exist
These ultrafilters are used to average the
partial ultrafilters which come from ultrafilters on sets of codes. One other problem arises.
Suppose two functions are coded by elements of an
A'. There are no common values of the codes available to code common values of o the functions. One can arrange that the functions have common values only of rank o.
The idea then is to absorb these finitely many common values into the
ground model U, forming a new ground model U'.
The remainder functions have no
common values so they can be coded. The finitely many common values absorbed into U would enter as a parameter in the definition of the resulting ultrafilter unless the absorption process is averaged over a symmeteric ultrafilter on VV«A
Thus, the first step in the o)' production of \la is to produce a regular ultrafilter \l on A The existence of \l o' does not follow immediately form the inductive hypotheses because RR«A will o) have members in N which do not exist in any N S
Natural Codes Natural codes are important constituents of the codes used in producing both
\l and \la'
The code a will be separated into an HAD function
~
of the same rank
as a and an information key which tells how a* is to be modified to form the function a" of higher rank coded by a.
Usually the values of a* with rank less
than rank a will also be values of a".
+
Let hsZ~ member of ZK the
~
+
hiK will be called the ~ of hand h* will be the renormalized
given by h*(A)=h(K+A).
.
If a is a generic HAD function of rank S
of a is the sequence of keys of the rank
inductively defined via a*A=(aA)*.
0
values of a and a* is
Notice that MCVa*,b*={c*:csMCVa,b}.
Also, a*
is a generic HAD function of rank equal to rank a. The key of a will tell exactly what to do with a* in order to arrive at a". For each rank
0
value h of a the key of h is an element of ZK.
Using a canonical
partition of K into K many disjoint K-sized subsets one can think of the key as having K many independent bits of information. possible entries for each bit.
Since ZK=K+ there are up to K+
K many of these bits will be reserved for
recombinant information, to be discussed in the next section.
Another K will
554
DAVID PINCUS
dictate the formation of a". Each rank
0
These are discussed below.
value h has a bit, called the placeholder
it from every other rank
value of a.
0
carries the same information. introduced later.
which distinguishes
~,
Except for the placeholder key every heVa
This will be varied slightly in the stacked coding
Notice that a has at most
K
many rank
0
values and there are
K+ possible entries in the placeholder key so there is no problem arranging distinct placeholders.
Of course, if a does not have them it is disallowed as a code.
An exception arises when rank a" is to be equal to rank a. is convenient to stipulate that the key of every rank function and that a"=a*.
0
In this case it
value of a is the zero
The virtue of this is that a generic set
from a generic set of codes al, ... ,a
K
al'N.,a~
comes
This will not necessarily be true when rank
•
a" > rank a.
The next two bits identify rank a and rank a".
There are
K
+ possibili ties
for each so one can do this via a fixed correspondence of ZK and K+.
a is dis-
allowed as a code if not every hcVa carries the same code .for rank a or rank a", if the rank a bit does not correspond to the actual rank of a, or if the rank a" bit does not correspond to an ordinal> rank a. The remaining
K
of a" as follows.
bits are divided up in a canonical way according to the rank
Some correspond to pairs of finite sequences of ordinals and
pairs of ordinals such that each sequence of the pair is a possible value sequence for a". and
fql-.,~EVf
(1. e., if a bit corresponds to (PI' ••Pn,qr'" qm) then fpr"'P neVf
for any f of rank equal to rank a".)
such a pair is always the "unequal."
0
The bit corresponding to
function or the 1 function and is read "equal" or
The other bi ts correspond to value sequences of rank
rank equal to rank a".
0
for an f of
These bi ts are read as placeholder bits.
The first group of bits permits one to read off the diagram of a", Le., the entire set of equalities and inequalities between hereditary values. group permits the correlation of the rank of a*. rank
0
The given rank
0
0
The second
values of a" with actual rank
0
values
value of a" is set equal to h* where h is the unique
value of a having the placeholder key specified in the bit.
Of course, it
is required that the placeholder keys arising in such bits be ones which actually occur for some rank
0
value of a.
The diagram of a" and its rank
0
values determine a" completely.
a is dis-
allowed as a code if the function so determined is not an HAD function of rank equal to the coded rank a".
It is also demanded that the set of values of a"
with rank less than rank a be exactly the set of values of a* with the same ranks. Finally, if rank a= rank a" then a"=a*. A variation of this process is necessary if rank a=o. quired
K
function.
sequence of rank
0
One creates the re-
values by decomposing a* via a canonical pairing
Placeholder keys are unnecessary since the pairing function auto-
matically distinguishes between the various members of ZK+ produced.
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM There is one last important restriction to be made.
It will be necessary
that a function of rank K have a unique code of each rank 8
555
As things stand,
In order to provide uniqueness associate to each diagram
a canonical sequence of placeholder keys for its rank
0
values and a canonical
way of rearranging its values of rank less than 8 into an HAD function of rank 8. (If 8=0 it has to arrange its rank
0
values into a K-sequence to be condensed
by the pairing function.)
The rearrangement exists since the value set of an HAD
function is pairwise AD.
Both the placeholder key sequence and the rearrangement
correspond to sets of hereditary cardinal
~K
so they exist in U.
Thus, they can
be made canonical by the well ordering of U (or any U'). Now observe that generic functions have generic codes and visa-versa.
It
has already been noted that HAD functions result from both coding and uncoding. (For uncoding this was required.) The rank 0 values of a" are ob tained from those th of a by truncating at the K place. This preserves product genericity. Similarly adding K many new values to the beginnings of the K-closed product of K-closed K generic elements of 2 + preserves genericity so the code of a generic function is generic.
In the rank
0
case one must also remark the isomorphism of the K-closed
product of K many copies of K-closed Cohen forcing with a single copy of K-closed Cohen forcing. Observe, moreover, that a generic set of codes gives rise to a generic set of functions.
The genericity of the rank
show that the functions are mutually AD.
0
values has been noted so one only need
Let a and b be mutually generic codes.
Because rank a, rank a", rank b and rank b" have bits coding them in every rank
0
value there can be no connnon values between a" and b" unless rank a= rank band rank a"= rank b", a= rank b.
If alb then MCVa,b consists of elements of rank less than rank
Every value of a" wi th rank equal to rank a can thus have only fini tely
many values of sufficiently high rank in common with any value of b" with rank equal to rank b.
There'fore, a" and b" have no values in connnon with r ankg rank a.
Every connnon value of a" and b" is an actual connnon value of a* and b* which are AD as previously noted. It is not true that a generic set of functions has a generic set of codes at lower rank.
This will be the reason for the eventual introduction of stacked
coding. VI.
Recombinant Codes
Let a.bl,... ,b be AD HAD functions aiVb, ...,bk. There is a natural notion of k dividing a by bl'... ,b written albl' ...,b The definition proceeds by induction k• k, when on rank a. Form the function whose values are aAlbl... b or a(8,A)lb l,,,Pk k these are not in Vbl' .. bk and ~ otherwise. Then throwaway the ~ values and reindex the A'S to get albl... b
Since a is AD from each of the bl, ... ,b it is easy k k• to see that K many values aA!bl... b or a(8,A)lb .... ,b are retained (for each 8< l k k rank a in the latter case). Likewise, one can see that a Ib ... , b is a HAD l, k
556
DAVID PINCUS
function and vanVbl, ••. ,bk=<j>. Recombinant codes tell how to reverse this process. a,b bl,
l
b ,b
k
i.e., given codes
in sequence with VanVbl... bk=<j> where a is a recombinant code receptive to
"a,b ,b will be an HAD function for which "a,b ... ,bk"=a" l, k, l, k" The first bit of a recombinant code identifies the diagram of a sequence
bj> ... ,b
with which it can recombine. If this bit is blank the code is nonk" recombinant and all its recombinant bits are blank. Two other recombinant bits in
general are identified with each value of a".
The first one describes a function
from a subset of K or f3xK (S= rank of term when limit) into terms of the diagram of bl> ... ,b~.
It shows what terms from bl... b are i;'-jected into the term. The k second bit associated with the term describes a function from K or crXK into itself.
of "ab
It tells where the values of the term of a" go in the corresponding term
r .. b k". This completes the description of "ab l ... 1J k " . As with natural codes it will be desirable that, given the bl, ... ,b~, each
function have a unique recombinant code.
Again this is easily arranged because
the entire key sequence is an element of U. With recombinant coding a code is then a finite sequence of functions rather than a single function.
One can see that the rank a values of "a",b1,... ,b
k.
Therefore, a generic set of codes (i.e., a set of recombinant code sequences whose union is a generic set of codes) satisfies at least the first criterion to code a generic set of functions. of codes are often not AD.
Unfortunately, the functions coded by a generic set
k"
"abl' .. b
will seldom be AD from "abi,'" b when any k" The best one can say is that "abl, ... ,b is AD k" from "a'bl,... ,b In this case the maximal common values of a" and a '" give rise k". to maximal common values of "ab ... b and" a'b , •• b because, since they have l l k" k" common values, t~eir recombining instructions are the same and, since they have
nontrivial recombining is done.
placeholder keys, a common value of a" and a '" plays the same role in each.
Vp Regular Ultrafilter on Ao Work in N where a is a successor ordinal. If a is a limit ordinal one must a work with finite sC:C;X.K instead of .K but there is no new idea involved.
VII.
The
If sElI«<) say that f l:;i<j:;n, MCVf
l,
... ,f
n
are s independent functions of rank a when for
Notice that i f fl, ... ,f are mutually AD ifjcV{fiA:AE3}nV{f jA:AE:s}. n they are s independent for all sufficiently large s. For each s a set of ~ codes is defined in N _ a
recombinant code of the form abr .. b
Let K=Jsl.
An s code is a
is the natural code of b i* with i k rank a-I and a is a rank a code with a" of rank a. "abl, ..·b will have b i* k" occupying the i th posi tion in s . Notice that Va"nVb1*'. b *=<1> because the rank a" k key of a will not be trivial and all the keys of rank a values of b I"" ,b k will be trivial.
in which b
The set of s codes is a V<j> set in N _ . a
It follows that the set of
rank a values of functions coded by s codes, denoted Ao(s), is also a V set in N _ and thus has a V<j> regular ultrafilter, ~(s).
a
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM The general idea is that WN~-M (A (s» <
0
557
is a good approximation to MN~ (A ), <
at least as far as sets with s independent supports are concerned.
Thus,
~(s)
0
will
approximate the eventual ultrafilter with regard to such sets. The first point to remark is that i f fl, ... f the set of their code sequences is also generic. fiA,AEs, by prefixing a key of o's to each rank
n
0
are an s independent generic set One first obtains codes for the value.
One then divides each f
by f i AI"'" f i Ak. The value set of each of these quotients, because of the s independence, is disjoint from, hence generically independent of, each other quotient and each of the fiA,AEs.
Passing to rank
0
i
recombinant codes of these
quotients does not alter these properties. N~_
is said to be an analogue of
and inner modeling over
u.
V~
have the same properties there. ~'(s)
etc.
If fl, ... ,f
N~_
concepts of
N~_
can be relativized to
contain codes for f
A~
and
N~_
These will be denoted by primes, e.g.
are s independent members of
n sequences is generic as remarked above.
analogues of
if it is obtained by the same forcing N~_
A~(s),
their set of s code
Thus, as has already .been noted, many .. ,f
n. l" N~ N~ ) The process of modeling W E«A in an N~_ can now begin. Let XEW W«Ao ' o) let fl, .•• ,f be an s independent support of X, and let N~_ be an analogue 9f N~_ n containing the s codes of f ..., f . An approximation X(N' ,s,fl,... ,f )EWN~-w< n an l, (A~(s» of X will be defined. When N~_, s, and fl, ... ,f are unambiguous, this set n will be denoted as X~. Let d l, ... ,dmEA~ (s ) be given. It must be decided whether {d l, ... ,dm}E;X'. Each d. is either a rank o value of an a" or of a b*. In any case, it is generic 1
independent from all other such rank
values.
0
It follows that do,... ,dm,fl,... ,f
is a generic set. for
N~
There is thus a condition P(d ... ,dm,fl,... ,f in the forcing l, n} which is satisfied by dl, ... ,dm,fl, ... ,f and decides the statement n {a l,· .. ,clmhT(fl,.. ·,fn,A)
where X=T(fl,oo.,fn,A)',
n
Decide {dl, ... ,dm}EX' the same way.
The above decision is independent of the choice of P since the stipulation that P be satisfied by dl,... ,dm,fl,.H,f compatible.
guarantees that any two such P's are n Of more interest is the fact .th a t i f N~_ contains s codes for a
larger s independent support fl, ...,fo.g
l,
.. ,.gr for X then
X(N~_,s,fl,.... fn)=
X~_,s, f l,· ... fn,gl"" .gr)'
For some C X=T(fl,.. ·.fn,gl'... 'gr'c). Some condition R(f l, ... ,fn,gl, .. "gr) satisfied by fl'" .fkgl,· ••gr forces .T(f •...• fk.A)=T(f l, ... , fn,gl" .. Suppose l Q(d .. ,d ,fl,... ,f ,gl,... ,g h:R is satisfied by dl, ... d ,fl, ... ,f ,gl, ... ,g and del,· .m n.r Ijl.n r cides {dl,... ,dm}E;T(fl,... ,fn,gl'''' .8r , C) . P=Q I 01'''' ,am,f l, ...,f n is satisfied by
.s».
d
.••.dm,fl,.... f n and decides {dl ..... dm}ET(f l •••. .in,A) the same way. l" shows the equality of the two versions of X~,
This
The association X+X' satisfies homomorphism laws when defined, i.e. -(X') when both are computed relative to the same support.
-~')=
(xnY)'=X'nY' when
DAVID PINCUS
558
computed with respect to a common support for X and Y.
And if GEH«A
o)
with
X={FEP«Ao):F~G} then X'={FEP«A~(s)):F~G} when X' is computed relative to any
s independent support. I f N*
a-
is another analogue to N _ , X* will be defined the same way in a
and from the same code (assuming the code is present). X'E~'(S)
and
X*8~*(S)
will have the same answer because they will be decided by
the same conditions about the code. The ultrafilter ~ on pNap (A ) can now be defined. filter
VEV~
on K.
N~_
Therefore, the questions
<
a
This is possible since P(K)CU.
support {fl, __ f
Fix a regular ultra-
take any Given XEHNap«A o)
for X. Put XE~ exactly when n} {SEP«K): (VN~_ coding fl, .. ·f (X(N~_,s,fl, ... fn)E~'(s)hv n) The !,bove seems to depend on the {fl,... E } used but it does not. Let {fl,... f n, n gl,... gr} be a larger support for X. fl, ... fn,gl''' gr is s independent for all sufficiently
large s (hence for almost every s with respect to v).
of s X(N~_, s, f I" .. f n) =X(N~_, s, f I ,.... f n' gl ,... ·gr)' same way with respect to fl, ... f to any gl.. .gr' The fact that
~
n
; f
l,
Therefore, XE~
... ,fn,gl'''' gr' and consequently with respect
is a regular ultrafilter now follows.
contains exactly one of X or -X.
For this set
is answered the
Since
Since (-X)'=-(X')
~
{FEP«Ao):F~G}'={FEP«A~(s)):F~G},
{FEH«A :F~Gh~. If X and Yare in u then XnYE~ because the invariance of o) support established above permits the computation of X', Y', and (xnv) , with respect to a common support of X and Y. VIII.
Codability and Independence Ultrafilters on A , a>o apparently cannot be obtained using coding sequences. a
The argument of VII which shows that dl, ... dmfl, .•. E is a generic set breaks down n unless dl, ... d have rank o , The problem Comes from the remark at the end of VI m to the effect that "abl... b and "abi'" b are not necessarily AD. The only k" recombinant coding which will be used in the construction of the ultrafilter on Aa
k"
will have every a being combined
~ith
the
~
"r .. bkEU'.
The stack coding, introduced in the next section will be a variation of natural coding in which a code is a single function.
Now a different problem arises.
If one examines the proof in V showing that a generic set of codes gives rise to a generic set of functions, one finds that two natural codes are severely restricted in the kinds of common values that can exist between the functiorls they code.
s
independent functions will not necessarily have a generic set of natural codes. Stack coding will permit common values to arise between the coded functions. However, the notion of s independence is too coarse for the purpose. common values must be carefully traced down through the ranks. notions help to keep track of these values.
The maximal
The following
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM
559
1I*(a) ={ te:ll< (a) : (VSf3tt)} IIg(a)={te:II*(a):Se:t~tnS=¢}
S={s:Domain te:II*(a)'(VSe:Domain S)(SScll«K»} SS={te:S:Domain se:IIB(a)} S*={(s,G):te:S 'Ge:V (A )} o
-+-
<
0
Domain s will ,be denoted by t when this is unambiguous.
~
and
e
will denote the
successor and predecessor operations in tu{a} i.e., aQ is the greatest element of tete. Let fl ..... f be an AD set of HAD functions of rank a. The sets FS(fl... fn,S) n (iI' ...• fn,S) will be defined by downward induction on tu l c}. Fa (f 1" ..fn,'S)={f1 " .. f n}· Fae(fI, .. ,fn,S)={hA:hsFS(fl,... fn,'S)'AssSe} i f S is successor. FSe(fl ....fn.S)={h(Se,A):hSFS(fl' ... fn'S)'AssSe} i f S is limit. In many instances consideration of the case of limit S and successor S is identical except that each use of hA must be replaced by a use of h(Se,A). Under these circumstances, and only these circumstances, consideration of the case of limit S will be omitted. are S independent for tsss when for any ho' hle:&~lo (fl ..... fn.'S), n MCVho,h{o~tFo(f l , ... ,f n'~)u i~2V(VhinFS(f l,· .. ,fn,s»
fl'" .• f
A more conceptual definition proceeds by induction on a and Itl.
¢ independent.
Any fl"'-=n are
Inductively, fl"'£n are'S independent if they are sae (or
{ae}xsae in the limit case) independent according to VII and Fae(fl" •• fn,"t) is an
tl tnae
independent set.
There are several elementary remarks to make.
When se:S
o
the condition
reduces to MCVh o .h{ o~lo (fl'" fn'~)' If fl"'&n are S independent for SSSS' S=A+I, then for all sufficiently large SSV«K). fl .... f
are S'(A,S) independent. In the same situation where S is a limit n ordinal one can say that for all sufficiently large A<S for all sufficiently large
SSIl«K), fl" .. t
are 'S'(A,S) independent. n The singleton {f} is not necessarily
S
independent.
If it is then f is said
to be! codable. If (s,G)sS* fl,···f are (S.G) independent if they are t independent and n F (f ,.•• f ,t)cG. If fl, ••• f are t independent then they are (t,G) independent n o 1 n for all sufficiently large G. f is (s,G) codable if {f} is (t,G) independent. IX.
Stack Coding Given sand G the (s,G) stack code of an (t,G) codable function is an
element of N _ {jJ (G», the analogue of N _ over the ground model U'=U (G). Proceed a a by induction on S
"a" will
The elements of G are assumed to be
DAVID PINCUS
560
arranged in their canonical sequence.
It will not matter that in order to read
"a" from a one must have the elements of G at hand. An (t,G) stack code for a function of rank a will be a function, a, of rank ago
Its values at A for A£ saG (or at (aGg, A) if ag is limit) will be (slag,G)
stack codes of rank aGG for functions of rank ago
Every stack value of a (i.e.,
value of a which is a stack code) will be either a itself or a stack value of an (Here is the first example of sliding over the aG is limit case.
aA,AEsaG.
for others.) a rank
0
By induction finitely many values of a are stack values.
Watch
The key of
value of a stack value of a will have full information only about the
minimal stack code of which it is a value.
This minimal code will be unambiguous
because a will be disallowed as a code unless it codes an (s,G) codable function. In this case, by an easy induction, the elements of 6~tF6("a",~)-G will be exactly the values of "a" coded by stack codes.
Therefore, if b is a rank
0
stack values c and d of a there is a stack value e£VcnVd with b£Ve.
value of the e is the
stack code of the appropriate element of MCV"c","d". The values of a which are not values of any stack code of rank less than ag will be
called~.
The keys of the new rank
about the formation of a" and "a".
0
terms will contain full information
Thus, the keys of the new rank
will code a9 as rank a and a as rank a".
0
values of a
The keys of an old (not new) rank
0
value b of a will code the corresponding information on the minimal stack code containing b.
Since there are finitely many stack values of a and every value
of a is AD from them it follows that every new value of a includes many new rank values.
0
For technical reasons it is also required that every new value of c£Va
includes in Vc all stack codes of a with lower rank.
In order to avoid the
possibility that c becomes an (siS ,G) stack code for some S one can stipulate that these values avoid the positions that are reserved for stack values of stack codes. (Inductively, there are finitely many of these.) A variation on the definition of b* in V will be given for non-stack values b of a.
~
is defined as b* on values of rank O.
Assume that it is defined on all
values of rank lower than b and within all stack codes of rank less than a. b may as well be assumed to be new. if d=eofor some e£Vc.
Thus,
A value d of "c" is said to be representable
It will be inductively seen that every value of
c" with
rank less than rank c will be either representable or a value of fO where f is a stack value of c. O b is now defined to be a function whose values are eO, e£Vb except possibly when ~ is ~ stack code. One is also free to insert or not as values of b O some non-representable values of e" where e is a stack code with lower rank than b O (hence, a value of b). The formation of b from these values is dictated by the keys of the new rank
0
values of b, which have been remarked to exist.
b can be
distinguished from the other values of a by the placeholder keys of its new values. Knowing b, the diagram of a, and the diagram of a", leaves no ambiguity in the
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM construction. function.
Of course a will be rejected as a code unless every b O is an HAD
Notice that rank bO=rank b.
a" is now built from a as follows: are the b
561
The values of a" of rank less than ,,9
O
where b is a nons tack value of a and the values of the b" where b is an aA, A£s"9. anA or a"(a9,A) is the appr opria te aA" or a(a99,A)". The other values of a" are formed from the b O with possible insertions of nonrepresentable values of b" where b is a stack code. tated by the key of every new rank
The diagram of this construction is dicvalue of a.
0
a" is built from a" and G (in its canonical ordering) in a manner dictated by the keys of every new rank
0
value of a.
This must of course be consistent
with the previously defined recombining for the stack values of a. If a
l,
••• ,an are a generic set of stack codes then a
"a11l, ... ,"n") are a generic set of functions.
obtained from those of a
The rank
0
l",
••• ,an" (hence also
values of a
1
1l
, •••
,a
n"
are
... ,an by lopping off the first K terms, renormalizing
l, the domains from K+-K to K+, and decomposing, via a pairing function, the finitely K+ many which came from rank 0 stack codes in to K-sized sets of elements of 2. This process will not alter K-closed product genericity. It remains to see that ai" and a
are pairwise AD. Assume the corresponding j" Take a maximal common value set for the O and the a/,",A£s"9 (see (5) ia.) together with the set of those b
fact for stack codes of lower rank. a
iA",A£s"9 where b £ MCVaia I claim that this is a finite set of common values to a and j• i" a " which includes all common values in its value set. j
The arguments of V show that a." and a." have no common values of rank >,,9 J
l
and no common values of rank a9 except possibly from among the a a
jA,A£sa9.
and the iA",A£sa9, Filtering this through an induction, if b " and c", b,c stack codes,
have a common nonrepresentable value, this value is the value of a d" where d is a stack code common to band c.
Notice also that if b is a new value of one stack
code it is not an old value of another. rank
0
This is because a new value has some new
value and the key of such a value has rank codes which distinguish it from
those of old values. Now consider an arbitrary common value e of a and a It is either i" j". representable or a common value of the a.A",A£sae and the a.A,A£s~9 and, hence, a J
l
.0
value of our common value set. If it is representable, say e=d relative to ai" o and e=c relative to a" j , d=c is necessary because all rank 0 values of e must have common prefixes in a
and a since they are a generic set and this implies j i that d and c are built in the same way from the same terms. Therefore, e is a
value of the common value dO of a common value set.
i"
and a
j"
so it is a value of the designated
The other important point is that a generic (~,G) codable function has a unique generic (t,G) code and that a generic set of (t,G) independent functions has a generic set of codes.
The uniqueness of the code is achieved using the
562
DAVID PINCUS
well ordering of U once the existence of such a code is established. Ifa9=o the stack code is a natural code and has already been produced. this case if fl, __ f
n empty common values.
In
are (s,G) independent then flIG, __ ,fnIG are generic with Therefore, their stack codes are generic.
If a9>0 it can be assumed by induction that if f is a generic (t,G) codable function the values fA,A£sa9 have a generic set of stack codes of rank age. Assign these as the aA,A£sa9.
The first step in bUilding the rest of the code a
for f is to divide by G and record the recombinant codes for use in the keys of the new rank a values of a.
The next step is to make a"=f!G.
simplicity it will be assumed that flG=f.
For notational
Carrying G along in the following
arguments would involve no new indeas. Vf is a pairwise AD set, hence, so is {b£Vf: Rank b
of a
O
O
,
It is thus not
of rank a9 whose
The representable values of Vf can be read from what
By induction one already knows what the representable values of the
fA,A£sa9 are. a
O
Others are representable if their rank is
is changed into a* by the following process:
are discarded from a
O
All nonrepresentable values
Then the *'s of all stack codes of rank
•
reinserted as values of the representable value b of a O • some manner prescribed by the diagram of f.
The insertion is done in
In particular, the * of the stack
code of fA is inserted as a*A (or possibly a* (age, A)).
By induction one can be
certain that if b is a value of c and d then the removals and insertions in bare consistent with those in both c and d. It must be shown that the resulting a* is an HAD function.
For the represent-
able b let be denote the result of the indicated deletions and insertions.
For
the stack value b of the fA,A£Sae, b 8 denotes the * of its stack code. Every value of a* is a b 8 • Each b W has K values with rank equal to rank b-l (or if rank b is limit it has
K
values of each smaller rank) because if b is not a value of an f
then b is AD from each fA and has the correct domain. since b does.
many values of b are not discarded.
K
It remains to show that b
W
and c
8
that b is representable and c is a stack value. include every common value of b
8
U
Let xf/ be a common value of bit
If x is a stack value then x 8 is clearly in C.
x is a value of some dEMCVc,d.
Assume first
A finite set whose value set will
and c· is C={de:dEMCVb,c'd is representable}
{el'l: e is a stack value of c with rank
of I
are AD.
If band c are both stack values this follows by induction.
and co.
Thus, each be
It is also easy to see that b W satisfies property 2
If x is representable then
If d is representable then xf/£VC.
If d is not
representable let e be the s tack value of c wi th minimum rank including d in its value set. rank.
rank eW~ rank d or d would be a value of a stack value of still lower
But rank d< rank b because d is a value of b.
hence, x 8 is a value of eO.
Therefore, e
8£C
x is certainly a value of e,
and xf/£VC.
ADDING DEPENDENT CHOICE TO THE PRIME IDEAL THEOREM
563
If both band c are representable then b&and c· are AD for a similar reason. If a maximal common value of band c is discarded a value of b W and c~ is inserted which has an even larger value set, namely e& where e is the minimal stack value having the discarded function as a value.
There always is a minimal such e since ({f} is (t,G) independent.)
the MCV's of stack values are stack values.
Having produced an a* one can now add key prefixes to build a. 2 of I guarantees that every value of f of rank
~ag
Stipulation
is obtained by successive
partitions of the values of rank
0
values of the code of f
product generic taken together with all rank X.
The characteristic function of
... ,f
~(X,t)
~a
i
are K-closed
j
will be formed by successive approximations.
will be thought of as the s approximation where
As in VII it will be necessary to carry the t independent support
sSSS' S~a. l,
values of f .
The (t,G) Approximation to ~a
The 2 valved function f
0
along as an argument of the function until the last step.
n In the first step of this approximation one must consider (t,G) independent
sets and carry G as an argument also.
Thus, ~(X,t,G,fl" .. ,f
will be a 2 valved
n) .... function defined when XsPP«A has the (s,G) independent support fl, ... ,f a) n.
~
will satisfy the following specific properties: 1. If f
l,
s
... ,fn,gl'· .. ,gm is (t,G) independent then
•• ,f gl' ..• ,gm)=~(X,t ,G, f .• , f n) l,· n, l,· 2. ~(-x,t,G,Fl, .•. ,fn)=l-~(X,t,G,fl,••• ,fn) ~ (X, ,G,f
3. I f ~(X,t,G,fl, ... ,fn)=Hy,t,G,fl, ••• ,fn)=l then
....
~(xny,s,G,fl""
,fn)=l
4. I f X={{hl, ... ,~}:{hl, ... ,hk}:,{fl, ... ,fn}} then
....
~(X,s,G,fl'"
.,f~)=l
(t,G) codes exist in models with ground model U(G). Na_(U(G»
can be build as follows: Let
of Na_(U(G»
A standard one,
A~=Ao-G, A~={fIG:fSAS}' N'
is a Q-analogue
if it has the same forcing and i~ner modeling over U(G) as Na_(U(G».
The forcing conditions and generating constants are the same over U(G) as over U. If f is an (t,G) codable function of rank a Cf denotes its (t,G) stack code. C denotes the set of (t,G) stack codes in AGg• The induction hypothesis applies to G a Na_(U(G» so there is a V ~ regular ultrafilter v on C. , Let N' be a G-analogue of N
a-
(U(G».
The set X'=X(N',t,G,fl, ..• ,f )spNp«C')
is defined as follows: A member of P«C') has the form {Chl, •.• ,Ch
n
To decide k}. whether this is in X' notice from IX that, since {Chl' •.• ,C~Cfl' •.• ,Cfn} is generic,
{h , ..• ,h ,fl, ... ,f } is generic (over U). Find a P(h .•• ,hk,fl, ..• ,f ) 1 k n n l, in the forcing for N which is satisfied by hl, ••• ,hk,fl, •.• ,f and which decides a_ n {hl,···,hk}sT(fl,···,fn,A)
564
DAVID PINCUS
where X=T(fl,· .. ,fn,A) in Na. Decide {hI" .. ,hk}£X' the same way. This decision is independent of the P used since any other Q used would also have to be satisfied by hI" •. ,hk,fl,··.,f n and would then be compatible with P. The arguments of VII now give:
I'. If fl,··,fn,gl'·· .,gm is ("t,G) independent and N' contains Cf l,'" ,Cfn,Cg l,··· 'C&n then X(N',"t,G,fl,···,fn,gl'· ··,gm)=X(N',t,G,f l,· .. ,f n)· 2'. (-x) '=-(X'). 3'. (XnY)'=X'nY'. 4'. When X={{hl, ... ,hk}:{hl, .•. ,hk}::>{fl, ... ,fn}} then X'={{ ChI" .. , Chk}: {ChI" • ,Chk}::>{Cf l,· .. ,Cf n}}· If v' is the G-analogue to V in N' the answer to X'
£v'
is independent of N'.
I. e., if N" is another G-analogue of N (U(G) containing Cfl' ... , Cf then a_ n X'£v'+-+X"£v". Therefore, the function ¢(X,"t,G,f ... ,f can be unambiguously l, n) defined to be 1 if X'£V' and 0 otherwise. Properties 1, ..• ,4 of ¢ now follow from properties
l~
•.. ,4'of X'.
¢£V~
since its definition is canonical.
(Make v
canonical by using the least possible one in the well ordering induced by G.) XI.
Construction of
~a
Fix a regular ultrafilter p on each limit Ssa.
K
and unbounded ultrafilters vS on peS) for
(An unbounded ultrafilter includes all cobounded sets.)
The ~ is
selection of p and the vS can be done canonically in U because each !SISK.
the regular ultrafilter on A defined by VII in N . o Cl N A 2 valved function ¢(X,t,f ..• ,f will be defined when X£P a(p«A has a)) l, n) the s independent support f ... ,f It will have the following specific propn. l, erties:
...
1. If f!-, •.. ,f ,gl, .•. ,g -
n
m
is ~ independent then
...
¢(X,s,f l,·· .,fn,gl'··· ,gm)=¢(X,s,f l,··· ,f n)· 2. tjJ(-X,s,f l,··· ,fn)=l-¢(X,s,f l,··· ,f n).
...
...
......
3. If tjJ(X,s,fl, ... ,f )=tjJ(Y ,s,f
. . ... . ,f ) =1. ... ,f )=1 then ¢(XnY,s,fl,
l, n n ... ,~}:{hl'" .,hkb{fl'''· ,f
""*
n
4. I f X={{hl, then tjJ(X,s,f l, .. · ,f n)=1. n}} The definition proceeds by induction on S such that t£SS' If S=o let
...
..,.
¢(X,s,fl,···,fn)=l+-+{G:tjJ(X,s,G,fl,···,fn)=l}£~.
tjJ is defined when fl, ... ,f are s independent because then f •.. ,f are (t,G) n l, n independent for almost all (in fact, for all sufficiently large) G~P«Ao)' This fact together with 1, •.. ,4 of X implies 1, ••• ,4 for tjJls. since its parameters are tjJls* and ~.
o
Notice that tjJjs £V~ 0
I f S=\+l, let
tjJ(X,t,fl, ... ,fn )=l+-+{s£P < (K):tjJ(X,S·(A,s),fl, ... ,fn )=l}£p for l£SS' fl, .•• ,f
n
Here tjJ is defined when f
... ,f are "t independent because then l, n are t'(A,s) independent for almost all (all sufficiently large) seP«K).
ADDING DEPENDENT CHOICE TO lHE PRIME IDEAL THEOREM If
~
565
is a limit ordinal say lj!(X,-;,fl, .. ·,f )=l~{A<~:{sdr (K):lj!(X,S'(A,s),fl, ••• ,f )=l}ep}cvl3 <
n
n
Here lj! is defined when fl, ••• ,f are t independent because then for S' n almost every (all sufficiently large) A<~ fl, ••. ,f are S'(A,S) independent for n almost every (all sufficiently large) seF«K). Properties 1, ... ,4 follow in both for -;es
the successor and limit case from the definedness and induction. Finally, say
Xe~cr
exactly when
,fn)=l .•. ,f of X. This definition is independent of the support n l, If'gl" •. ,gm is another support then since every set is ~ independent fact lj!(X,~,fl'"
for some support f used.
1 implies that:
lj!(X,~,fl'" .,fn)=lj!(X,~,fl'···,fn,gl"" ,gm)=lj!(X,~,gl'" ·,gm)· is an ultrafilter follows from facts 2 and 3. Fact 1 also comes into the cr proof of the intersection property since it allows the computation of lj! to be
That
~
carried out with respect to a common support of X and Y.
That
~a
is regular
follows from fact 4.
BIBLIOGRAPHY 1.
J.D. Halpern and A. Levy, The Boolean prime ideal theorem does not imply the axiom of choice, Proc. Symp. Pure Math XIII Part 1, pp 83-134.
2.
A. Mathias, A survery 'of results in set theory, preprint prepared in connection with the 1967 UCLA conference on set theory.
3.
D. Pincus, On the independent of the Kinna Wagner principle, 2. Math Logik Grund. Math 20 (1974), pp 503-516.
4.
, Two model theoretie ideas in independence proofs, Fund. Math XCII (1976) pp 117-134.
5. 6.
' Adding dependent choice, Ann. Math Logik (to appear). D. Pincus and R. Solovay, Definability of measures and ultrafilters, J.S.L. (to appear).
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing Company (1977)
THE ONE/MANY PROBLEM IN THE FOUNDATIONS OF SET THEORY
Rudolf v.B. Rucker Department of Mathematics State University College at Geneseo Geneseo, New York
The question of whether the universe of set theory is a One or a Many is discussed.
It is argued that although
there may be
a single concept of "set", the universe of set theory is not any one fixed extensional object.
A streamlined form of Takeuti's
Nodal Transfinite Type Theory (NTT) is presented as providing a means for making statements about the various higher-order concepts.
Several new relative consistency results involving
NTT are obtained.
It is also shown that if NTT has a model of
a certain natural kind, then 0# exists.
A new theory called
Embedding Theory (ET) is introduced as a natural strengthening of NTT.
NTT is finally presented as the first theory which is
about concepts as well as about sets. INTRODUCTION There is no fixed collection which is the class of all sets. set theory is a Many whi:h cannot be a One.
The universe of
It is, however, undeniable that the
single concept of "set" exists.•• as a sort of name with no referent.
The problem
I have treated is how to understand sentences in which such names appear. It is crucial that the concept of set be treated as something quite essentially different from any set-like extensional object.
For otherwise, what one thought was
the universe of set theory turns out to be another set, and one's hopes of transcending set theory to contemplate the Absolute Infinite are dashed again. Insofar as the concept of set is not represented by any fixed extensional object, it seems possible that ordinary set-theoretic statments about this and other such concepts do not have a truth value in any absolute sense.
However, as Rein-
hardt and others have pointed out, much of our set theoretic intuition seems to arise from talking about the class of all sets. I have endeavored to show that the question of how to talk about objects which do not strictly speaking exist is satisfactorily resolved by Takeuti's NTT. 567
568
R.v.B. RUCKER
The classical One/Many problem of metaphysics is described in §l.
In §2 I
relate the concept of "set" to the concept of "form of a possible thought" in order to bring out the affinity between the universe of set theory and the metaphysicians' Absolute.
§3 contains some heretofore untranslated remarks by Cantor on the
infinite, the Absolute Infinite, and the difficulty in discussing the latter.
In
§4 the exact way in which the Absolutely Infinite universe of set theory fails to be a One is brought out by means of a dialogue between a Monist and a Pluralist. §5 is the mathematical core of this paper.
Here it is shown that Takeuti's
NTT is equiconsistent with a simple theory of indiscernibles; and it is shown that if NTT is slightly strengthened, then we can get 0# out of it.
I introduce a new
theory ET, which says simply that there is a non-trivial external elementary embedding of the universe into itself.
ET, which is proved to be equiconsistent with
NTT, is an interesting theory since, in the light of Kunen's Theorem, it sets the stage for a new set theory without the power-set axiom. In §6 there is a discussion of the way in which NTT resolves some of the issues raised in the first four sections. I recognize that some of my fellow mathematicians are impatient with the sort of considerations in those four sections.
But it is important that set-theorists
should attack the deep foundational questions --- for if all such investigation is left to mathematical amateurs, we can hope for little more than yet another discussion of what "1 + 1 = 2" really means. I am grateful to C. Takeuti
and to K. Codel for early conversations on the
topics of this paper; and to W. Reinhardt for his valuable questions about my ideas as presented in [Ru 4].
I would also like to take this opportunity to thank the
faculty of the Mathematical Institute for organizing a most enjoyable conference. §l. THE ONE/MANY PROBLEM There are two forms of the One/Many problem: i) How many kinds of things are there?
ii) How many things are
the~e?
The natural first answer is that there are
many different kinds of things and many different things. There is, however, a perennial desire to reduce the world's diverse phenomena to a single basic kind, to believe that ultimately all things are built of the same kind of stuff. Urstoff.
Matter, sensation, thought and form have all been ~andidates for
The belief
that there is ultimately only one kind of thing in the world
is called monism of kinds.
Materialism and idealism are both monisms of kinds;
the monism of kinds which asserts that everything is a set will be considered in the next section. Instead of uniting things from the bottom up, one can work from the top down, starting with the assertion that "All is One".
Monism of substance asserts that
everything is a part or manifestation of a higher unity which is usually called
THE ONE/MANY PROBLEM IN SET THEORY
569
the Absolute. It is, of course, obvious that the word or concept "Everything" serves to at least superficially form the world into a One.
In the same way, the bare concept
"set" makes a One of the universe of set-theory, but without answering the real question of whether this universe is in any sense a definite completed object.
The
heart of question ii) is whether or not the world is One in some organic sense, rather than in the merely syntactic and syncategorematic sense just mentioned. Perhaps the principal reason for believing that the world
~
an organic One is
the sort of mystical insight which Lovejoy somewhat slightingly refers to as "monistic or pantheistic pathos", ([L), p , 12).
The fact that i t is occasionally
possible to feel an all-encompassing unity in the world is, however, not conclusive --- as it is equally possible to feel a diversity in the world which defies unification. It is possible to argue for monism of substance in various ways.
One idea is
that ultimately everything in the world is related to everything else, and that the Absolute is the means or essence of this interrelatedness.
Here the Absolute
serves as a sort of connective tissue which fixes the individuals of the world into their perceived relational structure. Another approach is to argue that any two things are, in a sense, the same; and that the Absolute is the one endlessly diversifying thing that exists.
The essen-
tial move involved here in proving, say, that you and I are the same person is to point out that in order to express our individuality we both say the same thing: "I am Ill,
A problem with the more extensional monisms of substance is that if every thing is united in the Absolute, which is itself a fixed and definite thing, then the Absolute must be a component of itself --- which seems paradoxical. This paradox is, indeed, quite real in set theory, where one is forced to sacrifice one of the thre~ propositions: a) no set is an element of itself (genetic formation of sets); b) every mathematical object is a set (monism of kinds); or c) there is a fixed universe of all sets (monism of substance).
In the usual GB or
MK approach, one tries to abandon b), since there seems to be a sense in which the universe of set theory is bound together by"the concept of "set".
Unfortunately,
it is in practice quite difficult to avoid treating the purported class of all sets like a large set.
And i f one' s"class of all sets" is always just a large set in
some better "class of all sets", then it is evident that there really is no fixed and completed universe of set theory.
So unless one is very careful to avoid a
assuming some form of b), then one always ends by dropping c).
(On p. 9 of [Rl]
there is an example of a particular theory about the class of all sets being discussed first in terms of dropping b), and then in terms of dropping c).) The option of dropping a) is not really viable for the set-theorist; not only because a) is essential to the classical conception of set, but also because
570
R.v.B. RUCKER
paradox can be avoided by dropping a) only if certain restrictions on the comprehension principle or on the underlying logic are adopted as well. The philosopher Josiah Royce does take this approach in his essay, "The One, the Many and the Infinite", ([Ro], pp. 473-588), by asserting that the Absolute is what he terms a self-representative system.
He uses the interesting analogy of a
perfect map of England which is to be drawn on a field in England.
Being perfect,
this map includes an exact replica of itself, which includes an exact replica of itself, ad
info
Ideally my consciousness also constitutes a self-representative
system --- assuming, that is, that one of the things in my consciousness is an exact image of my consciousness.
Whether or not this is the case is, of course,
debatable. The difficulty with self-representative systems is that they cannot be directly described or built up by any step-by-step process. once, or not at all.
They must be grasped all at
It is at least questionable whether such an Absolute actually
exists, and pluralism of substance remains a reasonable position. This position is forcefully presented by William James in A Pluralistic Uni~:
" ••• the pluralistic view which I prefer to adopt is willing to believe
that there may ultimately never be an all-form at all, that the substance of reality may never get totally collected, that some of it may remain outside of the largest combination of it ever made, and that a distributive form of reality, the each-form, is logically as acceptable and empirically as probable as the all-form commonly acquiesced in as so obviously the self-evident thing." ([J], p. 34). In terms of set theory this is the view that there really is no class of all sets.
There are only variously large sets --- anyone of which may be falsely
assumed to be the whole universe.
When people speak of the class of all sets they
can never be correctly referring to any specific collection; instead they are referring to the syntactic concept of set.
My sympathies lie with this pluralistic
position, and I will discuss in §6 how the theory NTT can account for the fact that although there really is no fixed extensional class of all sets, it is possible for statements about the class of all sets (such as "..C\...is a measurable cardinal") to be meaningful for us. §2. WHAT IS A SET? Most simply, a set is a ''Many which allows itself to be thought of as a One,"
([e], p. 204).
A set in this classical sense is, as Cantor remarks (ibid), a sort
of little Absolute, a partial world of individuals which are combined into a One by a creative act of thought. It may seem strange that the definition of set has a psychological component, but the concept of "set" has been related to the concept of "possible thought" from the very beginning.
THE ONE/MANY PROBLEM IN SET THEORY
571
For instance, in his 1887 essay, "Was sind und was sollen die Zahlen?", Dedekind proved the existence of infinite "systems" by pointing out that his thoughtworld (Gedankenwelt) was infinite since there is a map which injects the Gedankenwelt into a proper part of itself.
This is the map which takes a thought s into
the thought "s is a possible thought", (see ID], p. 64). This proof is not conclusive since, as Cantor mentions in passing in an 1899 letter to Dedekind, the "totality of everything thinkable" is an absolutely infinite or inconsistent multiplicity such that "the assumption that all its elements 'are together' leads to a contradiction, so that it is impossible to conceive of the multiplicity as a unity, as 'one finished thing'." (see IC], p. 443, or the translation [CD], p. 114). But
~
is the Gedankenwelt an inconsistent multiplicity?
The essential
assumption is that our thoughts are not self-representative systems; that is, that a thought does not include itself as an object.
Therefore when I try to form my
Gedankenwelt into a One, I am producing a new thought which has not been accounted for.
Including this new thought produces a synthesis which constitutes a further
new thought, and so on.
(See §l of the paper abstracted in [Ru3] for a more detailed
discussion of this process.) Incidentally, Dedekind's proof was derived from Bolzano's proof that there are infinite sets: "The set of all true propositions is easily seen to be infinite.
For
if we fix our attention upon any truth taken at random .•• , and label it A, we find that the proposition conveyed by the words 'A is true' is distinct from the proposition A itself ... " ([B), pp. 84-85).
This proof is flawed in the same way that
Dedekind's is, since the totality of all true propositions is in fact an inconsistent multiplicity.
For if the notion of truth were definite,. graspable and named
by the word 'truth', then the sentence "This sentence is not true" would be meaningful --- which is manifestly impossible. So we see that when Cantor speaks of a set as a Many which allows itself to be thought of as a One, he is using "thought" in the sense of "non-self-representative,
rationally communicable thought".
In other words, a set is to be a (ra-
tionally) conceivable form. In view of the fact that, by the Completeness Theorem, any conceivable structure can be modelled set-theoretically; it seems permissible to assert, in the spirit of §2.l of [W], that any fact, thought, or object has a "picture" which is a set.
Once we have gone this far it is tempting to jump to a full monism of kinds,
saying that every conceivable form is a set. This jump is to be accomplished by identifying any given form with one if its set models.
This process is not sO arbitrary as it initially appears, for once we
have modelled some system by assigning sets as labels which stand for the various individuals and relations involved, and by setting up the appropriate artificial membership relation ..• once we have such a set model it can be transformed by the
572
R.v.B. RUCKER
Mostowski collapsing technique into a set model with the standard membership relation.
The only element of arbitrariness in this final model is the choice of
labels for the atoms and non-well-founded portions of the original system.
(The
approach just outlined could possibly be used to refute the claim by Benacerraf and others that there is no distinguished set-theoretic model of the natural numbers.) Supposing that one accepts the identity between the concepts "set" and "conceivable form", can one then say that every thing' is a set? The primary objection to this is that there seem to be things which are not conceivable forms, e.g., the class of all sets or the totality of all conceivable forms.
One might simply insist that these inconsistent multiplicities do not exist
in any sense --- and it is true that they do not exist as completed extensional objects --- but it still seems that they do have some sort of existence, since it is possible to talk about them. Is it at least legitimate to regard the various individual objects in the world as sets?
Certain monists of substance would say not, arguing that in order to fully
express all the aspects of any given thing it is necessary to bring in everything, so that no individual thing would embody a conceivable form after all. There are two other weak points in the view that everything is a set.
First of
all. the experienced fact that things are one way and not another, that I am myself and the world is this world ••• this sort of particularity does not seem to be provided for by saying that I am a certain point in a certain complex relational system.
That is, there does not seem to be any
wa~
fact that it is this world which really exists.
to represent set-theoretically the This objection could be countered
with the claim that every possible world really exists. I will insert here a brief digression on physical monism of substance.
As
Godel pointed out in [Gl], the spacetime viewpoint of relativity theory seens to prOVide an excellent vindication of the idealistic view that the passage of time is illusory, that the universe does not fall into many distinct "nows", and that spacetime is one in substance.
This physical monism of substance has been shaken by
recent interpretations of quantum mechanics which, in order to account for indeterminism, contemplate the existence of every possible physical universe, (see [DG] and Chapter 43 of [MTW]).
In these models the unified aether-like spacetime of relativ-
ity is split into many distinct spacetimes.
Whether one can move through such a
viewpoint to a higher monism of substance which asserts that all the possible universes are points in a static "superspace" is problematic because a) the existence of such a superspace as a One is exactly as debatable as the existence of the collection of all possible thoughts, and b) it seems possible that there could be various sorts of superspaces existing as points in a supersuperspace, etc. To return to the main line of argument, a second, related, objection to the view
that everything is a set is that the set-theoretic model does not seem to
account for the fact that the world is going on.
John Wheeler speaks of this
THE ONE/MANY PROBLEM IN SET THEORY
573
difficulty as it relates to an imagined room full of equations intended to represent the physics of the universe, "Stand up, look back on all those equations, some perhaps more hopeful than others, raise one's finger commandingly, and give the order 'Fly!'
Not one of those equations will put on wings, take off, or fly.
Yet the universe 'flies'." ([MTW] , p. 1208).
This objection could perhaps be met
by the assertion that there is nothing more to the "life" of the world than the various forms and formations which occur. But the question of whether or not the objects of the everyday world are in some sense really sets is not central to this paper.
I have raised it here only
in the hope of stimulating further discussion; and to remind the reader of the close similarities between the universe of set theory, the universe of thought, and the universe of physics. §3.
THE ABSOLUTE INFINITE The viewpoint that every conceivable form is a set is a modernization of the
Pythagorean doctrine that "all things
~
numbers," (see [H], p. 67).
The Pythago-
rean doctrine in its original form was invalidated by the discovery of the existence of continuous magnitudes whose ratios were not equal to that of any two natural numbers. unavoidable.
A pluralism of kinds as regarding mathematical entities seemed
Indeed, so far were such medieval mathematicians as Vieta from the
present mathematicians' set-theoretic monism of kinds, that they viewed it as impossible to add magnitudes which represented lengths to magnitudes which represented areas. Dedekind and Cantor showed that irrational magnitudes could be precisely represented by infinite sets based on natural numbers.
As far as Cantor was concerned
there was no question that the only way to mathematize the continuous was by means of infinite sets.
"One can without qualification say that the transfinite numbers
stand or fall with the finite irrationals; their inmost essence is the same, for these are definitely laid out instances or modifications of the actual infinite."
([C], pp. 395-396.)* There are of course those who would deny that irrationals exist as actual infinities; those who would rather suffer an avoidable pluralism of kinds and view irrational numbers as being best represented by idealized computing devices which
*1 have provided the translations from the German which appear in this section.
I
recently learned that W. Reinhardt has been independently preparing a paper which includes his own translations and comments upon some of the same passages from Cantor as those which appear here.
R.v.B. RUCKER
574
n!h
place of some decimal expansion. There is a die verschiedenen Standpunkte in bezug auf das aktuelle spirited passage in "Uber Unendliche" where Cantor attacks such "Horror Infiniti" as " ••• a form of myopia produce, given any number n, the
which destroys the possibility of seeing the Actual-Infinite, even though it in its highest form has created and sustains us, and in its secondary transfinite forms occurs all around us and even inhabits our minds." ([C), p. 374). Going out from a neo-Pythagorean belief that everything must be mathematically representable by some static form, Cantor actually takes the existence of continuity as evidence for the existence of acutal or completed infinities.
But what is the
"highest form" of the infinite which he alludes to? He distinguishes three kinds of actual infinities, "The actual infinite arises in three contexts:
first when it is realized in the most complete form, in a fully
independent other-worldly being, in Deo, where I call it the Absolute Infinite or simply Absolute; second when it occurs in the contingent, created world; third when then mind grasps it in abstracto as a mathematical magnitude, number, or order type. I wish to make a sharp contrast between the Absolute and what I call the Transfinite, that is, the actual infinities of the last two sorts, which are clearly limited, subject to further increase, and thus related to the finite." ([Cl, p. 378). Later, Cantor amplifies, "The Transfinite with its richness of forms and formations points necessarily to an Absolute, to that 'true infinitude' whose magnitude admits of no increase or decrease, and which is therefore to be quantitatively viewed as an absolute maximum.
This Absolute is in certain ways beyond the grasp
pf the human mind, and does not admit of a purely mathematical representation ••• " ([C), p , 405)
This last sentence seems to echo what a Pythagorean might have said about the incommensurable.
Is there any hope of carrying out a mathematical treatment of the
Absolute, or must such investigations remain, as Cantor suggested, in the province of speculative theology?
Perhaps not --- recall Russell's response to §7 of the
Tractatus, " .•• after all, Mr. Wittgenstein manages to say a good deal about what cannot be said ... " ([W), p , xxi). It seems clear that, just as mathematics had to move beyond number theory in order to treat the incommensurable, mathematics would have to move past set theory in order to treat the Absolute.
The analogy is, however, slightly misleading.
The Pythagoreans could not really question the existence of incommensurable magnitudes.
But it is perhaps possible for a set-theorist to assert that there is
really no Absolute universe of set theory.
I agree with this up to a point
but
it does seem to be undeniable that i) we do have a definite concept of set, and that ii) it would be nice to have a framework in which such concepts could be treated like real objects.
THE ONE/MANY PROBLEM IN SET THEORY
575
§4. A DIALOGUE ON SETS AND CONCEPTS The class of all sets, C, is a perfect example of Cantor's Absolute Infinite, as it is clearly unvermehrbar (not subject to augmentation).
Moreover, if every
conceivable form is, or codes up, a set (as was suggested in §2), then C is very close indeed to the Absolute of the metaphysicians. The question I have been leading up to is this:
Is C a One or a Many?
I will
approach this question and some of its ramifications by way of a dialogue between a Monist and a Pluralist. P:
For all the familiar reasons it is evident that C is not a set. is not a Many which allows itself to be thought of as a One.
Therefore C
So the universe
of set theory is, and must remain, a Many. M:
But you just referred to the universe of set theory, did you not?
Is it not
evident that the concept of "set" serves to bind all the sets into a single vast entity which we call C? P:
When you speak of the concept of "set" you are in fact unconsciously harboring a model of this concept.
Your model, being a thinkable form, is a set.
individual's concept of the class of all sets is in fact just a set.
Any
There are
only the many illusory universes and no correct one. M:
I was not referring to some limited individual's concept of "set".
I wish ra-
ther to direct your attention to the absolute and independently existing concept of "set" which lies beyond our imperfect realizations thereof.
Surely you
can decide for any given entity whether or not it is a set? P:
I can also decide for any given entity if it is an entity or not. But this says nothing! Who is it that gives me entities? "a carefully folded ham-sandwich".
A set is not always presented like
A set is essentially the inner form of a
mathematical structure --- and it has happened to me more than once that I could not immediately discern the form which some given mathematical embodied.
description
Someone who has never heard of ultrafilters cannot pretend to know,
in any real sense of the word, whether or not the concept of "set" includes them. M: You are again basing your argument on individual inadequacies. "set" is objective and external to us.
The concept of
The fact that sets are perceived through
the mind does not mean that they depend on minds for their existence.
In the
same way, objects exist without hands to feel them, sights exist without eyes to see them, and all the possible thoughts exist independently of any thinkers. P:
How can you say that a thought can exist independently of any thinker?
M: Any thought which an individual perceives can, in principle, be perceived by someone else.
And, as Einstein puts it, "We are accustomed to regard as real
those sense perceptions which are common to different individuals, and which
576
R.Y.B. RUCKER
therefore are, in a measure, impersonal." ([E], p. 2).
Given that the same
thought can be perceived by different people at different places and times, it is a natural simplification to assume that thoughts have an objective existence external to us.
The universe of thoughts is a One, and all the sets ---
which are simply the forms of possible thoughts --- constitute a One as well. P:
I
can agree that the individual sets exist objectively and independently of us.
But I still maintain that there is nothing to simultaneously grasp them all and pull them together into a One.
You insist that the concept of "set" does
this, but I still argue that there are so many different kinds of sets that there is, in fact, no such graspable concept. M:
I never said that you could grasp the class of all sets.
I only say that the
bare notion of set as something built up from the empty set by iterated applications of the operation 'set of' is perfectly clear.
e
Since we have the
~
under which all the sets fall, it seems undeniable that the class of all sets
is a One. P:
Suppose I grant you this point for the sake of further argument.
We must agree,
however, that your concepts are essentially syntactic rather than semantic objects.
"e"
ject.
For if
is a name, a defining property, not some completed extensional ob-
e were
a static collection, then it would be essentially a set ...
which is impossible. M:
That seems fair. sets and concepts.
P:
But I don't like having two kinds of mathematical objects: Perhaps we should view sets as special sorts of concepts.
You may, but I prefer not to.
Once you start lumping sets and concepts togeth-
er it is easy to start thinking of concepts as fixed extensions, which they are not. M:
Nevertheless, I think that the (-predicate can be meaningfully extended to the
P:
This worries me.
concepts. exist as concepts.
There are many inconsistent multiplicities of sets which can For instance, the class of all ordinals
and the class of
all models of ZF seem to be represented by the concepts "ordinal" and "set model of ZF".
Now why not view (»c as existing as the concept, "concept which
applies only to sets".
And then why not continue to get a new enlarged set-
theoretic universe on top of what we thought was the unvermehrbar universe of set theory? M:
You yourself said that the concepts are not fixed extensional objects, not sets. A concept is not a collection, but rather a name for a sort of contemplated activity.
We can have all of these objects above the universe of set theory,
but they are not sets. P:
And what comes after all the concepts?
M:
The concept of "concept".
This self-representative system does not lead to
a Russell paradox because it is not assumed that for any two concepts A and B,
THE ONE/MANY PROBLEM IN SET THEORY
577
the question "Does concept A fall under concept B?" has an answer; any more than does the question "Is triangularity virtuous?".
Godel supplies a discuss-
ion of this move in [G2], pp. 228-229. P:
Nevertheless, I think that your belief in the reality of a concept of all concepts is misplaced.
For one thing, you have not given any explanation of what
is meant by the concept of "concept" .•• perhaps because any such explanation could be automatically transcended?
Primarily it seems to me that to assert
that there is a concept of all concepts is false in the same way that it is false to assert that there is a nameable function which transforms any string of symbols into the set, if any, named by the string in question.
If a concept
is a syntactic object, then you cannot expect there to be a concept of all concepts, for this would be a syntactic object which encompasses all syntax.
Just
as Russell's paradox shows that there is no set of all sets, Richard's paradox indicates that there is no concept of all concepts. M:
That's a good argument if you believe that concepts are basically just names. It interesting to note that even if every concept were nameable, you could never really know this •.• since your notion of
the naming process will always be
inadequate. P:
There seems to be a little fuzziness nameable.
in the statement that every concept is
For, the way I see it, we start with a lot of names with set para-
meters, then call them concepts and treat them like objects, then figure out an (-theory for these objects, and finally check that the theory is such that each of the names behaves as an object having the property specified by its name.
How do we decide what the (-theory is?
And why should the process I
just described come out right? M: First, in response to your second question, I'm not sure that the process you describe should work out at all.
Even if every concept is nameable, it will
not necessarily be by one of the names in the language you start out with. Indeed, relative to any of the nameable languages which you use, there always will be unnameable concepts.
Second, in response to your question of how one
discovers the (-theory of the concepts, I would say that this is to be found by a process of thinking about the nature of the concepts in question. P:
I can see that such an approach might enable you to determine the truth value of certain very simple statements about concepts. going to decide i f G'c is well-ordered?
But on what basis are you.
Your approach of "thinking about the
concepts" is just an incomplete proof-th.eoretic procedure. M:
There is no guarantee that my thought processes would not, in the ideal limit, generate a complete theory.
A person's thoughts are not an r.e. set, but rather
what Myhill calls in [M] a "prospective character".
I would say that thinking
about the concppts leads towards a complete theory, and that this theory determines the true set theory.
578 P: M:
R.v.B. RUCKER Why would the theory or concepts have anything to do with the theory of sets? The reflection principle ties the two together. property, then I know that many
R~
If I discover that C has some
have the same property.
If I learn that
R.n.+Sl.behaves a certain way, then I know that many R",,+,,- behave the same way ... ~being the ordinal of C.
It is the behaviour of the higher-order concepts
which produces, via the reflection principle, the harmonies which obtain within C.
I might remark here that it may be that true axioms of infinity are just
those insights into set theory which are gained by meditation upon the nature of concepts.
If this is the case, and if it is the case that in the ideal
limit one finds a complete theory of concepts, then Godel's 1946 conjectures are true. P:
(See [G3] and [Ru2]).
Let's pause here.
We both agree that C is not a fixed extensional object, but
is rather a concept ••• although we aren't too sure what a concept is.
But where
we differ is that you seem to believe that there are certain considerations which will make it evident what the ~ -theory of the well-founded concepts over the sets is. M:
And it is this ultimate E.-theory which regulates the theory of sets.
The
individual sets can be viewed as emanating from, or arising in imitation of, the realm of Absolutely Infinite concepts. P:
I continue to doubt that there is any definite
~
-theory on the concepts.
I
think that when you act as if there is, you are simply falling back into the lazy habit of treating concepts as fixed extensional objects.
Rather than
make this mistake, I would prefer to believe that concepts are purely syntactic entities, names with no referent.
Any
~
-theory which we place on them arises
only by way of some convention which we adopt.
I would like to direct your
attention to a pluralistic theory which determines the behaviour of the essentially fictional concepts from the behaviour of sets, rather than vice-versa. This theory, called Nodal Transfinite Type Theory, or NTT, was invented about ten years ago by Gaisi Takeuti.
In [Da] and [T], NTT is formulated over the language TT of "transfinite type theory".
TT differs from the language of ZF in that it has infinitely many diff-
erent types of variables. In general, if A(vO'v is a TT-formula, then there are, l) A in TT, variables x of type A which are intended to range over R(pa) [A(lL,a)]' It is simpler to formulate NTT over the language TN of what we might call "trans-set name theory".
TN is the smallest language extending the language of ZF
and having the property that if A(vO'v ual constant @A in TN.
is a TN formula, then there is an individl) Let me remark here that "language of ZF" is used in the
broad sense under which all the usual defined symbols are allowed.
THE ONE/MANY PROBLEM IN SET THEORY
579
It. is evident that a TN formula will in p;eneral have the form
The meaning of the TN-formulae is determined in part by the following conven-
tion: 1/1 [@A
Convention R) ~)
For ¢ a ZF-formula, and A ,A TN-formulae, O"" n -<->-¢[(lly)AO[a,y], ••• , (lly)An [a, yJJ.
.. ,@AnJ O" It should not be surprising that the quantifiers of the Ai are not restricted
to R on the right-hand side. Convention R is, however, non-standard in that the a quantifiers of ¢ are not restricted to R on the right-hand side. The reason for a using the present form of Convention R is that it smooths the development of NTT. If one wishes to have a (Ra) have the customary meaning of achieved by replacing a by
a, where O(vO'v
Ra Fa, then this can be
This works sJnce l) O• (R@oFo a) (Ra)-<--r (R(lly)[O(a,y)]P=- a) -<->- RetF a, by Convention R and the choice of 0, respectively. ~he
R@O~
++
vl=v
language of NTT is the language obtained by adding a unary predicate sym-
bol "YL( ) to TN.
That is,.for any TT-term T, erL(T) is an NTT-sentence.
The theory NTT consists of 3 axiom groups. A) The axioms of ZF. Note that, since the @A are individual constants denoting objects in the universe, the assumption of seperation and replacement for ZF-formulae with parameters entails the truth of seperation and replacement for all TT-formulae. ~ , on the other hand, is a relation whose extension need not be an object in the universe --so seperation and replacement need not· hold for formulae in which rloccurs. (This essential point is overlooked in [DaJ, e.g., on p. 34.) B) 1) (\I x)[ 'I\. (x) ... (x So @O 1\ x
t
R@O)J.
2) ('I1x)(1;;!y)[ ('n(x) 1\ x So y S. @O) ... 'l1(y)]. 3) (Vf)[('<j~ € R@O)['l1(f'x)J ... 'fL({0: ('<Jx E. Re)[e E. f'x]}). @O, as above, is the constant determined by the formula O(vO,vl)-<--r vl=v O. Axiom group B says, in short that @O is a strong limit cardinal in the NTT universe; and that if we could form the collection {x ~ ~ @O: 'l1(x)} then we would have a non-trivial filter on @O which is normal for functions from the NTT universe. Keep in mind that since we have only assumed seperation for ZF-formulae, the collection in question cannot be shown in NTT to exist. C)
For any TN-formula 1/I[x}
++
1/1
I\({a e @O:
and any x Q R@o , 1/1 [x
~ Ra J(Ra)}).
If it is true that the only way we ever get our hands on objects of rank @O is by naming them, it might perhaps be more natural to use a weaker schema. C)' For any TN-formula ¢ and any x ¢[xJ
++
~({a
t @O: ¢[xJ(Ra)}).
~
R@O,
~
580
R.v.B. RUCKER NTT is the theory consisting of axiom groups A),B),and C); and NTT' is the
theory consisting of axiom groups A), B), and C)' Although in each case ZF has been assumed for the full universe, it is not immediately clear if ZF holds in R@O' PROPOSITION I Pf:
(NTT) i) R@OF-ZF.
ii) '1l({a: Ra
-<
R@O}).
If we prove ii), then i) will follow a fortiori.
the set x ~ R@O such that x is the diagram of -(R@o,c). diagram of
PROPOSITION 2 Pf:
which implies that 'f\. (Io : Ra
(NTT')i) R@O F- ZF.
ii)
To get ii), simply find
By C),
-< ~O}) ...
'Y\ ({a: % -<
R@O~).
i) It will suffice to prove the reflection schema for
ZF-formula.
For each x c
'll ({et: x (\ Ret is the
~O.
Let ljJ be some
let E = {c< E.@O: (Rol.l= ljJ[x)) ++ (R@OI= iJj[x])}. ljJ,x Now it is evident (by Schema C), Convention R, and our choice of 0) that ~O'
++'ll.({a E @O: ~ \=- ljJ [x ] }) and R@o \:f-ljJ [x]
R@O \= ljJ [x]
++'n( {cl E. @O:R F/=- ljJ [x]}) • a must contain one of these two nodal classes, so by B2), 1lL(E", ).
Clearly E",
o/,X
o/,X
If we view EljJ,x as functionally dependent on x, then since we have (~x ~ R@O) [')(EljJ,x)]' we can use B3) to conclude that i f EljJ = then
n (EljJ) .
But this implies that
ii)
-< a
(\:jx E: Re) [e E:. EljJ,x]}'
({a: Ra reflects ljJ in R@o})' so we are done. It is not hard to see that B3) implies that the intersection of less
than @O nodal classes is nodal. ~({a: R
{e Eo @O:
'fI
R@O})'
Now if we let E =~ EljJ' then t\(E) implies that
Incidentally, the intersection in question can be formed since
i t is definable from the diagram of
These two propositions are essentially proved in IT]. proved that NTT is consistent with V=L.
Here Takeuti also
In [Da] is was shown that NTT is consis-
tent relative to ZF + "There is a measurable cardinal". I now give the principal new trick in my treatment of NTT. Define a sequence of TN-formulae 0n(vO'v l) by induction:OO(vO'v l) ++ vl=v O' and °n+l (vO,v I) ++ vl=@On' Consider now the formula R@oo< R@0l' R@OO
-<
R@OI ++
-n ({ 0'0:
(R@OO
-<
R@0l)(RaO)}), by Schema C,
++ ~({aO: RO'O -< R@OO})' by Convention R, ++ "(\, ({aD: "71. ({a : (Ra -<. R@OO) (Ral)})}), by Schema C, l O ++ ~({aO: 1\({a RaO -< Ra by Convention R. l: l})}), Note that by Proposition 2.ii, the second of the four formulae on the right is true in NTT', so we know that NTT' \-- R@oo < R@OI' In general, the sentence R@On
-<
<
R@On+1 will be rE\duced in NTT' to the sentence 'n,({aO:.. ·
THE ONE/MANY PROBLEM IN SET THEORY
581
in the central formula, this last sentence is equivalent to ll({a
Ra n+ l }) }) ,
which we just proved to be true.
n:
fl({a
n+l:
Ran~
The types of argument just illustrated suffice to prove that the @On are indiscernible for ZF-formulae with parameters from R@O' PROPOSITION 3
If
~
is a ZF-formula and iO, •.• ,i n and jO, .•• jn are two increasing ("d x E:. ~O) [~[x, @OiO' ... ,@Oin] -<-+
sequences of natural numbers, then NTT' \~[x,@Ojo,···,@Ojn]]·
Pf:
The idea is that each side can be shown to be equivalent to a sentence of
the form 1\({a O: .•. Il(ta ~[x,aO, ... ,an]})' •. }). This is accomplished by using n: Schema C, Convention R, and the fact that for any NTT formula ¢ in which a does not appear, 1\({a:¢}) -<-+ ¢.
•
Since NTT' is a subtheory of NTT, Proposition 3 also holds for NTT. Note, however that we cannot draw the x parameters from R@O +1 in this result, for if x were, say, the extension of @O, we would have x=@OO,but xj@Ol' It turns out that no @A other than the @On are ever needed. PROPOSITION 4 ~[@OO"
Every TN-formula is equivalent in NTT' to a formula of the form
.. ,@On]' with
Pf:
~
a ZF-formula.
It suffices to show that for each TN-formula A(vO'v
mula X such that @A =
(~Y)[X[y,@OO,
there is a ZF-for-
l),
.•. ,@On]]'
This is proved by induction on the "degree" of A.
That is, i t will suffice to
prove the statement in question under the assumption that each @B provably equivalent to some
occuring in A is i .;,@Oni]]' with rr a ZF-formula. i must have the form
(~Y)[~i[Y'@OO"
But under this assumption, A(vo'v
l)
e[vO,vl,(~y)[rrO[Y'@OO,···,@Ono]' .•. ,(~y)[rrm[ '@OO,···,@Onm]]]' for some ZF-formula
e.
It is not difficult to eliminate the
~-operators
that the formula just displayed is equivalent to Now we note that @A =
(~Y)[X[y,@OO,
... ,@Op+l]]'
and obtain a ZF-formula X such
x[vO,vl,@Oo, .•• ,@Op] for some p. j
This last result suggests a way of formulating a theory NTT' which is equivalent to NTT'.
Let the language of NTT' be the language of ZF augmented by a con-
stant symbol An for each n £.
ll),
and by a unary relation
'V'\. ( ).
NTT' has three axiom
groups. A) The axioms of ZF. -> (x S A 1\ x ¢; R AO)]' O 2) (\!x)("fy)[('l\(x) 1\ xS: yS: A -> '/ley)]. O) 3) (\;ff)[(Vx E: R\O)["Yl(f'x)] -> "f\.({e E. \0: (\;fx E-Re)[e E:. fIx]})].
B) 1) (\{ x)["1\.(x)
582
R.v.B. RUCKER
C) 'For any ZF-formula ~, and any x t RAO' 1)J[x,A O""'\] ++ 1l({a: 1)J[x,a,A ••. ,An_ O, I]}). A transitive model of NTT' will have the form "U..=(u,t::.,"n,A 1
n ne e
(U)
; where A) U
is a transitive model of ZF, B) RAO is a model of ZF and ~ is a filter on A O which is normal for functions from U, and C) For any ZF-formula 1)J, any x E:. RAO(U), \tF 1)J[x,A
++ 4l({a: 1)J[x,a,A .. ,A O,···,An] n_l]}). O" One can readily see from Proposition 4 that NTT' is an inessential extension of
NTT'; sO that any given model of NTT' can be extended to a model of NTT' (by providing the obvious interpretations of the @A constants); and any model of NTT'can be restricted to a model of NTT' (by ignoring the @A's other than the @O 's). n
It is useful to note that by [T], we can be sure that i f / U,'E. :1'\, A) \= NTT' (U) " n nezo
then we can be sure that
, E:. , 'rL, A ( n
P. NTT' as well.
ne.w
For reasons which will be elaborated upon in the next section, I am more interested in NTT' than in NTT. Note however that one could formulate a theory NTT
in
the obvious way, and that the preceding remarks on models of NTT' and NTT' would apply to models of NTT and NTT, mutatis mutandis. One should keep in mind that if necessarily true that
en E.
11,
we consider a subtheory IT of NTT'. every jO<" ·<jn'
is a model of NTT' as above, then it is not
U, or that {An: n
In order to gauge the role of discernibility schema:
tl
For every
E;
w} EO U.
as opposed the the role of the An's, in NTT'
IT is the theory ZF plus the following
R~O-in
Xc
RAO' every ZF-formula 1)J, every iO<..•
THEOREM 5 (ZF) i) If there is a transitive model of IT, then there is a transitive model of NTT'. ii) If there is a transitive model of NTT',then there is a transitive model of NTT. Pf:
i) Let <,U,E:.,A
n
1 nt.w
F IT, with U a transitive set.
Note, first of all,
that the RAO-indiscernibility schema implies that for any ZF-formula 1)J, any x e RAO' (RAO \=:1)J[x]) ++ (RAlF 1)J[x]). Theref?re RAO-<' RAl' and we can conclude that the RA form an elementary chain of models of ZF. n also a model of IT, in which LAO= RAO' Now form U* = L(U) IIA LJ {A : n ~ w}.
O
the transitive collapse of L(U) I~o U {A : n which can be defined in the structure
(U)
Note also that / L(U) , E:. ,\ "
n
'7 n~w
is
This notation, taken from [Rul] , denotes
~
w}, the set of all the sets in L(U)
•
•
, E:.) by a ZF-formula with parameters
from A U {An: n 6. w}. Let t be the Mostowski collapsing isomorphism in question, O and set A = t'A Note, incidentally, that A = A n* O* O' n. ZU*, E:.,An*> n l':. w is a model of IT such that every member of U* is definable
en.
by some ZF-term with parameters from A U {An*: n t. wL Let ~ (PA be the collecO O tion of all sets of the form {a tAO: U*\=1)J[u,a,A ... ,A such that 1)J is a n_ I*]) O*, ZF-formula, u E: L1 and U*F 1)J[u,A ... ,A O*, n*]. 0'
THE ONE/MANY PROBLEM IN SET THEORY I claim that
nE.W
is a model of NTT I .
follows immediately from the definition of 1\.
583
A) is
evident. and C)
It remains to prove the axioms of
group B). Bl). Here we ~ust show that if ¢ is a ZF-formula, u e RAO' and ¢[U.A O*'" .. ,An*l holds in U*, then {aEAO*:~[u,a,AO*"""" ,An_l*l} is cofinal in AO* ' Now
for any e A
O*'
~ A U* ~ (~a > e)[¢[u,a,AI* •...• A *11. since we can take a to be n O*' Now by the RAO-indiscernibility schema. we know that U* \== (3 a > e) [¢[u.a.
AO*, ... ,An_l*J1 as well.
So the set in question is cofinal in A O*' Say that x'= y,= A and x E. 'l\.. Then by the definition of 'Y1, x = O U* ~ ¢[u.a.AO* ••.. ,A for some ZF-formula ¢ and some u ~ RAO* such {a e A O*: n_ I*]] that U*Fo ¢[U,A ,An*l. Now the fact that U* is the Skolem hull of AO U {An: O*"" n a.to} means that y = {a E. A U* ~ 'IT[v,a,AO*, ... ,Am_l*l for some ZF-formula 'IT, O*: and some v e, RAO*' Now. since x~y, we know that U*~ ('va€. Ao*)[¢[u,a,A o*'" .. ,An_l*l + 'IT[v,a,Ao*, ... ,Am_l*lJ. Using the reflection argument of the proof of B2).
BI), we can thus see that U* F I (¢[U,A •... ,An*l + 'IT[V.A ,Am*J) is impossible. O* O*"" Therefore U*\= 'IT[v,Ao* ..... Am_l*J, which implies that y E.'1'\. • B3).
Say that f is such that ('if x€. R:.cO*) [f 'x
E.. 91J.
Now since u* is the Sko-
lem hull of A U {An: n w}, we can find a u ~ RAO and a formula ¢ such that for O* each x E:. RAO* . f'x = {a E: A U*'l= ¢[x.u.a,AO* ..... A Since each fIXE-Of\., n_ I*]}. O*: we know that U*l=- ¢[x,u.AO*, ... ,An*l for each x E:.RAO*' But this means that U* F ('I;(xE. RAO*)¢[x,u.AO* •... ,An*l, so by the definition of '1'\., it follows that {eE:A
O*: u* 1= x E:. Re)¢[x.u.B.A O*" ... An_l*]}E.~ But U* 'F1l'[X,U,e,A O*'''' '\-1*1 means that BE. f'x. so we finally have {BE:A C'1xE..R Eo fIX)} E:.'I\.. This comO*: e)[e pletes the proof of i).
Cv
.Anr
it) Let'U.. =
and let x be a. subset of'R AO in U*, and say that U*'l= ¢[x, AO'~"" , An*1. It must be shown thatcu..* I=-"f\.({ae AO: ¢[x(\Ra, a,Ao*, .... An_l*l). U* is a Skolem hull, so there is a ZF-term T and aUt RAO such that x = T[U,AO*, ... ,Am* ] in U*'" Now, by the last paragraph, we have assumed that U* \=- 1jJ[T[U,A O*'" .Am*J,AO*' ..... An *]. By Schema C) I this implies that '\.1..* F 91({a: ¢[T[u,a,A O*"" 'Am_I*]'Cl'AO* •..• 'An_I*J}). To complete the proof we will show that'U..* l='Y\({a: T[u.a.Ao* •.•.• A I*] = R/\T[U,AO*, ... ,A )}); for i f these m m_ last two classes are nodal, then so is their intersection. In turn. if the inter~ section is nodal, then so is the class mentioned in the last paragraph as requiring a proof of nodality. Using C) "
we can see that the nodality of the class in question is equivalent
to the statement T[U,AO*'" .,A rn*] = RAO* ~ T[u,A I* •••• '1n+I*l. But this is just a consequence of the RAO-indescribability Schema, which is provable in NTT', as was shown in Prooosition 3.
A
584
R.v.B. RUCKER
Theorem 5 represents an improvement over the result of Davis mentioned above; for if there is a measurable cardinal, then one can iterate the ultrapower and use the successive images of the measurable cardinal to get a model of IT. that insofar as it is obvious that IT is consistent with
V~L,
Note also
Theorem 5 could be
adapted to provide a proof of Takeuti's result that NTT is consistent with
V~L.
Theorem 5 indicates that if every object is nameable in the senSe of the proofs, then 1l does not contribute a great deal.
The next result shows that there are
models of NTT'in which 'll does play a powerful role. Recall, for the statement of the theorem,that x for
~
a is a set of indiscernib1es
Ra means that for any ZF-formu1a W, any a .
x, Ra P=
¢[a1,···,an]~
THEOREM 6
then
(ZF) Let
cu.
~
tl F (3 I ScAO) ['fL(I) Pf:
<. U,
E:, 'Y\,A )
n
n~w
be a model of NTT'.
If {An: n .. w} E:. U,
1\ I is a set of indiscernib1es for RAO]'
n We will write ZF to mean the set of Gode1 numbers of ZF-formu1ae with
free variables vO, •• ,v
n
For each k, let Jk be a function, defined with parameter
, which maps AO onto the set of increasing k-sequences 2 J (a) will be written as . As was noted at A
from A For simplicity, O' the end of the proof of
l, k Theorem 5.ii, there is a nodal class of 6 such that replacing the parameter A by O 6 in the definition of the Jk yields a function, mapping 6 onto the set of increa-
sing k-sequences from 6 which is the restriction of Jk to 6. For each n and each k < n, define a function Ng( , ) of two variables with domain RAO x A as follows. (If k~ 0, then the second argument of Ng is inessential). O N~(x,a) ~ {S Eo AO: (VrW'E.ZFn)[(RAnFW[x,al, ... ,ak,S,AO, ... ,An_k_Z]) -e-e(RAn F=- i[x,a l, ... ,a k, AO" .. , An-k-1])}. It is clear that for each n, each k < n, the functional N~ can be defined in NTT' from the set {AO, ••• ,A It is not hard to see that for ~very n, for every n}. n k < n, every x cR AO' every a E: A Nk(x,a) is nodal. It is simply a matter of O' applying Schema C)' to a formula involving x,a ..• ,a ... ,A and a subset of n, 1, k,A O, ZFn• If the set {Au: n eo o} E. U, then a functional N with domain RAO " AO can be de-
fined by N(x,a)
~ ~ ~
Nk(x,a).
It follows from the AO-completeness of
1\ for
intersections from U that ('
ejn_1 be increasing sequences from I ~60'
W[x,6jo,···,6jn_1] holds in RAO'
I claim that
W[x,eiO, ••• ,6in_1]+-+
THE ONE/MANY PROBLEM IN SET THEORY
585
Rather than proving this in full generality it will be more informative to prove it for the paradigm case n=3. I will write (aO, •.• ,a in place of k_ l) k -1 (J) (( a , .. • ,a _ k O 3 eiO ~ NO(x,O) so RA3 F ~[x,8iO,AO,AI] ~ RA3 ~ ~[x,AO,Al,AZ]'
I» .
3
e. Nl(x,(8 i O» so RA3 \=
RA3 F ~[x,8iO,AO,Al]' 8iZ £. NZ(x, (8io,8il» so RA3 F ~[X,eiO,eil'eiZ] ...... RA3 \= ~[x,8iO,8il'AO]' If we link up the three biconditional. in reverse order, and use the fact eil
~[X,eio,8il,AO] -<->-
3
that RAO -< RA3, then we can conclude that RAO \= [X,eiO,8il,8iZ] -<->- RA3 F Since this result could have equally well been obtained for 8jO'
~[x,AO,AI,A2]'
ejl' and 8j2' the indiscernibility result follows.
A
Note that i f 'Ll is as in Theorem 6, then "U... F 0/1 exists. This does not imply that "0/1 exists" holds in the model of ZF in which the proof of Theorem 6 takes place
unless we assume an additional condition.
This fact, and the fact that the
proof Theorem 6 provides something more than the statement of the Theorem indicates, are expressed in this Corollary. COROLLARY 7 (ZF) Let U =
that the I constructed in the proof of
Theorem 6 is a subset of the set in question.
It is also possible to prove this
result by a single application of Schema C)' to the fact that it is true here that
"lA.1=- AO is the "A O" of a model of IT.
ii) This follows from the fact that the 0# of an uncountable transitive
model of ZF is the real 0/1.
(See, e. g , , Chapter 8 of [Dr]).
"
In line with the modern tendency to replace filters by embeddings, it seems appropriate to introduce a new theory called Embedding Theory, or ET. of ET is the language of ZF plus a binary relation j( ,).
The language
The axioms of ET form
three groups. A) The axioms of ZF.
B) 1) (\lx)(3 :y)[j(x,y)]. 2)
(:l x) [j 'x
l'
Write j'x to denote this y.
x].
3) (\Ix) (\ly)[x E.. y C) For any ZF-formula¥"
H
j 'x E. j 'y]. (3y)¥,[j'x,y]~
(:!y)¥'[j'x,j'y].
It is evident that a model of ET will have the form 'U. =
a model of ZF, j is a non-trivial function from UU, and U ~ j "U clear that i f
-< u.
,f>.
U is
It is quite On the other
hand, one can show in MK that if there is a model tL of ET such that U is a
586
R.v.B. RUCKER
transitive proper class, then V!L.
This is a result of a characteristic feature of
ET, viz. the fact that no ET universe can be super-complete. THEOREM 8
11K \-
Pf: [KRS]).
(<..u,
E.
.f> \=.
ET) ... U
1- V.
This is just a restatement of Kunen's Theorem (see [K ] or 1.12 of
The basic idea is this. Let A O ~ An' then the set j"A i U. ,
if A =
(~y)[j
'y ! y], and let A = j'A Now n+l n.
THEOREM 9 (ZF) i) If there is a transitive model of NTT', then there is a transitive model of ET.
ii) If U is the universe of a model of ET, then U is also the universe
of a model of NTT.
<
n,
Pf: i) Let 'LL = U, E. , A) be a transitive model of NTT'. As in the n ne cr U proof of Theorem 5. i we can form the set T = L ( ) / /A U {An: n e w} and we can let O K = £'A where £ is the Mostowski collapsing isomorphism in question. Now T is n n, (T) the Skolem hull of K U {K n..w} , so if x E. T, then x has the form T[U,KO, .. ,K O n] n: for some ZF-term T, and some u E... LKO' Define a map j Eo TT by setting j 'x = T[u,Kl, .. ·,K n+l]
(T)
I claim that
.
is a model of ET.
To simplify the writing of the
proof, I will omit relativizations to T and the statement of the fact that u or v is a member of LKO' A) holds automatically, and B2) is clear. Bl).
The j defined is indeed a function since if T[U, KO, .. ,K
n]
=
a[v,K
O'"
.. ,K ] , then, by Proposition 3, T[u,K1, ... ,Kn+l] = T[v,K , ... ,Km+l]. Replacing "=" m 1
by "E" in
th~s
last implication, we see that B3) holds as well.
Now I prove C).
Assume that (3
have (3 Y)~[T[u,Kl, ... ,Kn+l]'Y]'
y)~[j 'x,y]. If x T[U,KO, ... ,K then we n], By Proposition 3, this is equivalent to
(3 yH[T[U,K
,Kn],y], Le., to (3 y)~[x,y]. Now choose such a y and say that O'''' a[V,KO, ... ,Km]. We have ~[T[U,KO, ... ,Kn],a[v,KO, ... ,Km]], so by Proposition 3, ~[T[u,Kl" .. ,Kn+l],a[v,Kl, ... ,Km+l]]' which is to say ~[j'x,j 'y]. So we know that
y
=
(.3
y)~[j
'x,j 'y]. ii)
Let
Let A = (uv) [j 'y 1- y], and let O I claim that
be a model of ET.
An+l = j 'An' Let'l\. = {x'::. AO: AO E. j "x}, model of NTT.
A) is evident and Bl) follows from the fact that if x E. RAO' then j'x = x, B2) is clear since if x ~ y, then j 'x ~ j 'Yo B3) is proved
so A ~ j'x and x ~ 1l. O as follows.
(\::j x E. RAO) [f 'x E. 'lI.] -<-+ (''<1 x E. RAO) [A E.. r ' (f 'x)] O ~ ('ix E..RAO)[AO E.. (j'f)'x]
THE ONE/MANY PROBLEM IN SET THEORY +-+ AD +-+
o:
{BE. A
C) is proved as follows. Hx,A O""';\]
E. {e c j 'A
Let
~
o:
587
(\I x E ReHe ~ (j 'f) 'x]}
('If x to ReHEl fo f 'x]) Eo
'Y\..
be a ZF-forrnula, and let x
~
RAO'
AO E. {afoj'A 1jJ[j'x(\ Ra,a,Al, ... An]} O: +-+ A E. j' {a fo A ~[x II Ra,a,A O O"" ,An_I]} O: +-+ {aE.A 1jJ[x (\ Ra,a,AO, ••• ,An_lJ) E.'f\ ... O: +-+
What sort of large cardinal consistency results of the form Con(ZF +
C3
a)~[a])
can we extract from Con(NTT') ?
In [AHKZ] a family of combinatorial large cardinal properties called "flipping properties" is introduced.
In [Z] the particular flipping property of being a
"A-piecewise strongly compact cardinal" is defined.
I will not give the rather in-
volved definition here. The existence of these particular large cardinals is an internal property of the universe, but Zwicker shows that their existence also follows from the type of external properties of the universe considered by NTT and ET.
In particular, he
says that for a model M of ZF with K,A E. M, j ~ ~ is a K-A-outside embedding if M ~ j "M -< M, if K is the first ordinal moved by j, and i f j' K > A; and he shows that if there is a K-A-outside embedding, then K is A-piecewise strongly compact.
From
these considerations we obtain a proposition. PROPOSITION 10
(ZF)
If there is a transitive model of NTT', then there is a tran-
sitive modeI of ZF + (3 K) ("i A) [K is A-piecewise strongly compact]. Pf: The proof is much the same as that of Theorem 9.i.
If we first cut U
down to U {RAn (U): n E. to} there ,and then continue as usual, the An's will be co final in T.
By extending the map j'A
n the external embeddings we need.
§6.
=
•
A + to all of T, for various k, we can get all n k
THE MEANING OF NTT In
the first four sections of this paper I developed the point of view that
i) The class of all sets C is a concept which can and should not be viewed as a definite collection; ii) It is necessary for the advance of mathematics to find some way to interpret sentences of the form 1jJ[C] whose quantifiers range over some concept, or non-extensional entity, which lies beyond C; and iii) It may be reasonable to assume that every concept has a name which it can be identified with. Now, the reflection principle NC] + 1\( {a €. Q: ~ [R expresses the feeling a]}) that the behaviour of the concepts produces the observed harmonies in the universe of sets.
One of the ideas underlying NTT is to reverse this implication and also
say that 'f\.({a
EO. Q:
1jJ[R + Hc]. a]})
588
R.v.B. RUCKER The attempt to pass from knowledge of the truth values of all the
the truth value of
~[C]
~[Ra]
's to
is somewhat analagous to the way in which one attempts to
pass from knowledge of all the partial sums of an infinite series to the sum of the whole series.
According to which convention one adopts, the Grandi series 1-1+1-1+
1-1+ ... can add up to 0, 1/2, 1, or something else.
The fact that this series has
a sum only by convention is overlooked sometimes in order to formulate such puzzles as the Thompson Lamp. One is tempted to say that, insofar as there is no fixed extensional entity which is the class of all sets, the way in which one passes from knowledge of the a's for which conventional.
~[Ra]
holds to a decision on the truth value of
NTT explicitly
~[C]
is also purely
introduces one such a convention in the form of the
primitive concept of nodality. To take the analogy,between talking about concepts and finding the sum of an infinite series,a bit further, note that for a convergent series there is only one summation convention which seems acceptable.
It is worth recalling, however, that
there have been other conventions for determining the sum of even these infinite series --- I am thinking here of the Zenonian convention that any infinite series of positive terms has infinite sum. In the same way, NTT may very well embody the most acceptable convention for interpreting talk about the class of all sets, but this does not mean that other conventions are impossible. For instance, there are some set-theorists who always work in the theory MK. They are always interested in natural models of the form R + for e a strong inacS l' cessible. Is it not fair to say that for these set-theorists, the class {S+l: S is strongly inaccessible} is "nodal", that is, comprised of "typical" ordinals? And can one not then go on to say that for these set-theorists, C F "There is a greatest ordinal"?
You may object that there can be no greatest ordinal, but I submit that
for someone who always thinks in terms of MK, there
~
a greatest ordinal, Le. "On".
The axioms in group B) of NTT can be viewed as formalizing the intent that our truth convention should be so as to make statements about C behave like statements which are actually about some specific model of ZF. C (Le. R@O) to obey modus ponens. ~[x,C]
holds" to "(\I x
e..
B2) forces statements about
B3) allows us to pass from "For every set x,
C) [~[x,C]] holds".
If we are working with NTT', then it
is B3) which ensures that C is a model of ZF. Suppose that we adopt the view that every concept has a name.
Of course, no
single theory which I can actually refer to will name all of the concepts --- for to refer to a theory is to provide a name for it, and this name can be used to lump together all the concepts named by the theory, yielding a concept not named by the theory. name.
I feel that a theory is only able to say things about the concepts it can Looking at Theorem 5.ii with this in mind, one can see that NTT' is really
just as good a theory as NTT.
THE ONE/MANY PROBLEM IN SET THEORY
589
The A constants are used in the language of NTT and NTT' in order to name some n
concepts which are not named by the language of ZF.
For instance, Al names a con-
cept which lies beyond all concepts of the form T[U,n] for T a ZF-term and u a set. To accept the name or concept "AI" requires a leap of faith not much greater than that required to accept the concept variously referred to as "On", "@O", "1. and 0",
"nil.
The only author I know of who has yet had occasion to discuss the concept Al is W. Reinhardt, who refers to it as "A" on p. 200 of [R2].
Reinhardt here describes
how the inability to see that there are concepts beyond Al can lead one to believe that there is an internal elementary embedding from RAI into RAI' I might mention now, that once we are familiar with NTT, we are in a position to move beyond it to include yet more named concepts in our domain of discourse --for instance we can try to get a theory which includes {x:~ (x)} and {A : now} in n
its world.
It is interesting to note that by Corollary 7.ii, conceiving of the
latter concept enables one to conceive of a non-constructible set of integers.
In
my opinion this is the best justification for V/L. To get back to NTT, viewed as an actual theory about all the sets and some of the concepts, why exactly should Schema C) hold? What does it mean to say the concepts ~( ), n, and
{~~
n:
~[~]})?
This is a statement about
On the left-hand side we have a statement that the
concept n falls under the concept ment that the concept
c n:
~[n] ++~({~
fl.
~,
~[a]}
and on the right-hand side we have the stateis nodal.
Since $ is not relativized on the
right, and since concepts occur on both sides of the biconditional,it would be misleading to say that Schema C) reduces talk about concepts to talk about sets and the single concept of nodality. when applied to a ZF-sentence
0
Indeed, Schema C) accomplishes no reduction at all (for
'1\. ({~
e:n: o l) is already equivalent to
0
with-
out any use of Schema C)) --- this is not surprising since it is only in axiom group A) that NTT makes any statements about all the concepts at once. (Note, in particu-
lar, that the schema
R F 0 is nowhere assumed , ) n Schema C) does not really provide an interpretation for statements about all the 0 ++
concepts at once, but it does provide an interpretation for formulae whose quanti-
A typical kind of formula to which Schema
fiers are bound to some specific concept. C) is usefully applied has the+form RT[n] term (i.e. T[n] is n+l, n+n, n ,
F=
~[n],
where T is a simple functional
R or something like that). Then we a], \= ~[aJ}). This is definitely a reduction
(~a)[Rn-(
have RT[n] \= Hn] iff'Y\.({aE.[1: R Tfa] of a sentence about concepts of rank greater than n to a sentence about 1\ and a concept of rank n.
Insofar as we feel that our idea of the behaviour of n is arrived
at by abstracting from the behaviour of all the
~IS.
the form of this reduction
certainly seems reasonable. The way in which Schema C) applies to formulae involving the attractive.
~nts
is particularly
The plausibility of $[[1.1. 1 ••..• A ++ ~({aO: 1\({a .•. ~({an: n] l:
R.v.B. RUCKER
590
.. , })}) comes from the feeling that the behaviour of, say, ~ ren)}) lative to Al is abstracted from the behaviour of a "typical" ordinal a relative to ~[aO,al'"
.,a
O
a larger "typical" ordinal a
l. To sum up, the situation leading to NTT seems to be as follows: 1) Each set ex-
ists as a definite extensional object. C (the concept of "set"),
~
applying only to sets), and so on. a name.
Beyond the sets there are concepts such as
(the concept of ordinal), I? C (the concept of concept 2) We think of each concept as eventually having
To make our naming ability somewhat richer than that of the language of ZF,
we add for each n a name An' with the understanding that each An represents a new degree of unnameability. syntactic considerations.
3) A
partial~
-theory on the concepts can be derived from
E.g., it is evident that if yEo lPC and xa.y , then x€.C.
However, since the concepts are not fixed extensional objects, they do not seem, priori, to possess a eomplete concepts,like ~[a,a].
If
~[~'Al)'
~
-theory as the sets do.
~
4) Given a statement about
we test its truth by looking at all sentences of the form
there is an "overwhelmingly dense" or nodal class of a's, for each of
which the class of a's such that
~[a.a)
is nodal, then we say that
~[~,Al)
holds.
5) The assumptions in axiom group B) formalize certain reasonable intuitions about the concept of nodality just used.
Nodality is a primitive conc·ept arising from
the mind's ability to look at the given universe of sets and concepts from the outside. There is one significant objection to this procedure, and this is that it does not seem to be automatically true that the Eo -theory on the concepts derived from Schema C) will mesh with the partial
~-theory
on the concepts which can be derived
from proof-theoretic considerations of the names involved. prove that 1\({S: S is strongly inaccessible)).
In particular, we can
Since AC is assumed to hold in C,
this implies that 'fL({S: (.3w€.R + e l)[w well-orders Re with length Sn). So by Schema C), (.3 w ~ R~l) [w well-orders C with length ~]. But can we exhibit a name of any such well-ordering of C? A solution to this difficulty may be to use the theory NTT + V=OD which is obtained by replacing axiom group A) by group A)*: The axioms of ZF plus the axiom V=OD, (where V=OD is an abbreviation for ('<;j x) (.3 a) [x is definable in RaJ).
This
would not lead to C F V=OD; for the "ordinals" from which the members of C are defined might even lie beyond all the A's. n
(See Illustration XI of [Ru 1) for an
example of a VfOD universe which has a well-ordering nameable from higher-order concepts.) The assumption that every set or concept can eventually be defined from some ordinal, where the ordinals are allowed to go past
~,
would seem to be in the
spirit of Godel's 1946 conjecture that every ZF-sentence is decidable from some true axiom of infinity (see [G3] and [Ru 2]).
In NTT + V=OD the question at the
end of the last paragraph can be answered by exhibiting the name "~~ODw)[w wellorders R ~ with length c
I".
In conclusion, NTT seems to be a reasonable theory about sets and concepts.
The
THE ONE/MANY PROBLEM IN SET THEORY
591
great virtue of NTT is that it treats concepts such as C as different from sets --that is, NTT does not make the mistake of treating C like some large but definite collection.
So long as one makes this mistake one is still doing set theory, as
opposed to concept theory.
The importance of seeing beyond set theory has been
stressed, in a somewhat different context, by Hajek and Vop~nka: " ••• the authors hope to make some contribution to the task of breaking through the bars of the prison in which mathematicians find themselves.
This prison is set theory and the
authors believe that mathematicians will escape from it just as they escaped from the prison of three-dimensional space." ([HV], p. 12). It. remains to draw a metaphysical moral from the preceding mathematical discussion. On the one hand the Absolute is not merely a Many, a random hodge-podge, a relentless accumulation of particulars --- for in the very act of saying "the Absolute", we acknowledge our conviciton that the world is One.
On the other hand, the Abso-
lute is not a fixed and immutable One, for to conceive of any (non-self-representative) One is to see how to transcend it. This dilemma can perhaps be resolved by viewing the Oneness of the Absolute qua subjective concept, and the Manyness of the Absolute qua objective non-set, as "complementary but exclusive features of the description"; and to say that, "In fact, here again we are not dealing with contradictory but with complementary pictures of the phenomena, which only together offer a natural generalization of the classical mode of description." ([Br], pp. 54-56).
The analogy to quantum mechanics
is not totally unnatural if we agree with Bohr that the complementarity of causal laws and space-time descriptions "bears a deep-going analogy to the general difficulty in the formation of human ideas, inherent in the distinction between subject and object." ([Br], p . 91.
Cf. also the last section of [Be]).
The significance of NTT for the metaphysical One/Many problem is that it shows how it is possible to attain a rigorous logical convention connecting the two complementary views of 'the Absolute. REFERENCES [AHKZ] Abramson, Harrington, Kleinberg & Zwicker, Flipping properties for large cardinals, Annals of Math. Logic, to appear. [B]
Bernard Bolzano, Paradoxes of the Infinite, (Routledge and Kegan Paul, Lon-
[Be]
T. Bergstein, Complementarity and philosophy, Nature 222 (1969), pp. 1033-
[Br]
Niels Bohr, Atomic Theory and the Description of Nature, (Cambridge U. Press,
don, 1950). 1035. Cambridge, 1934). [C]
Georg Cantor, Gesamme1te Abhandlungen, (Georg DIms Verlag, Hildesheim, 1962).
[CD]
Georg Cantor, Letter to Dedekind, 1899, in: J. van Heijenoort, ed., From
R.v.B. RUCKER
~92
Frege to Godel, (Harvard U. Press, Cambridge, Mass., 1967). [D]
Richard Dedekind, Essays on the Theory of Numbers, (Dover Publications, New York, 1963).
[Da]
James Davis, Measurable Cardinals and NTT, Ph.D. Thesis, U. of Illinois,
[Dr]
Frank Drake, Set Theory, (North-Holland, Amsterdam, 1974).
Urbana, Illinois, 1968. IDG]
Bryce DeWitt & Neill Graham, eds., The Many-Worlds Interpretation of Quantum Mechanics, (Princeton U. Press, Princeton, 1973).
[E]
Albert Einstein, The Meaning of Relativity, (Princeton U. Press, Princeton,
[Gl]
Kurt Godel, A remark about the relationship between relativity theory and
1972). idealistic philosophy, in: Paul Schilpp, ed., Albert Einstein: Philosopher Scientist, Vol. II, (Harper & Row, New York, 1959), pp. 557-562. [G2]
Kurt Godel, Russell's mathematical logic, in: Paul Benacerraf and Hilary Putnam, eds., Philosophy of Mathematics, (Prentice-Hall, Englewood Cliffs, N.J., 1964), pp. 211-232.
[C3]
Kurt COdel, Remarks before the Princeton Bicentennial Conference on Problems in Mathematics, 1946, in: Martin Davis, ed., The Undecidable, (Raven Press, Hewlett, N.Y., 1965), pp. 84-88.
[H]
Thomas Heath, A History of. Greek Mathematics, Vol. I, (Oxford U. Press, London,
[HV]
P. Hajek & P. Vopenka, The Theory of Semisets, (North-Holland, Amsterdam,
1921) . 1972). [J]
William James, A Pluralistic Universe, (Longmans, Green & Co., New York, 1909).
[K]
Kenneth Kunen, Elementary embeddings and infinitary combinatorics, Journal of Symbolic Logic 36 (1971), pp. 407-413.
[KRS] Kanamori, Reinhardt & Solovay, Strong axioms of infinity and elementary embeddings, Annals of Math. Logic, to appear. [L]
Arthur Lovejoy, The Great Chain. of Being, (Harvard U. Press, Cambridge, 1953).
[M]
John Myhill, Some philosophical implications of mathematical logic, I: Three
[MTW]
Misner, Thorne & Wheeler, Gravitation, (W.H.Freeman & Co., San Francisco,
[Rl]
William Reinhardt, Set existence principles of Shoenfield, Ackermann, and
[R2]
William Reinhardt, Remarks on reflection principles, large cardinals, and
classes of ideas, Review of Metaphysics VI (1952). 1973). Powell, Fundamenta Math. 84 (1974), pp. 5-34. elementary embeddings, in: T. Jech, ed., Proceedings of Symposia in Pure Mathematics XIII, Part 2, (AMS, Providence, 1974), pp. 207-214. [Ro]
Josiah Royce, The World and the Individual, First Series, (Dover Publications, New York, 1959).
THE ONE/MANY PROBLEM IN SET THEORY
593
[Rul]
Rudolf Rucker, Undefinable sets,Annals of Math. Logic 6 (1974), pp. 395-419.
[Ru2]
Rudolf Rucker, Truth and infinity, Proceedings of the AMS 59 (August 1976), pp. 138-143.
[Ru3]
Rudolf Rucker, On Cantor's continuum problem (Abstract), Journal of Symbolic Logic 41 (1976), p. 551.
{Rulj]
Rudolf Rucker, Talking about the class of all sets (Abstract) ,Journal of
IT]
Gaisi Takeuti, The universe of set theory, in: Bulloff, Holyoke & Hahn, eds.,
Symbolic Logic, to appear. Foundations of Mathematics, (Springer-Verlag, New York, 1969), pp. 74-128.
IW]
Ludwig Wittgenstein, Tractatus Logico-Philosophicus, (Humanities Press, New
[Z]
William Zwicker, Coherent Ultrafilters and a Big Small Large Cardinal, Ph.D.
York, 1961). Thesis, M.l.T., Cambridge, Mass., 1976.
R. Gandy, M. Hyland (Eds.), LOGIC COLLOQUIUM 76
© North-Holland Publishing" Company (1977)
SETS HAVING CALIBRE
~l
John Truss
§l.
It is well known that if the complete Boolean algebra ill satisfies the count-
able chain condition (c.c.c.) then cardinals are preserved in the Boolean ex-
JB.
tension element.
Similarly for forcing using a partially ordered set P with greatest
However the c.c.c. has its
disadvanta~es,
the main one from our point
of view being that ill may satisfy the c.c.c. in V without satisfying it in This occurs for example if ill is a Souslin algebra.
JB.
Thus we are led to consider
stronger notions, which have more satisfactory closure and extension properties. We first became interested in these properties on reading the paper by Kunen and Tall [4], and in particular in the case of "calibre" on learning of Solovay's result that, assuming
A~
, (Martin's Axiom), the measure algebra has calibre Xl'
This paper falls tnto two main sections.
The first, §§2-4, is devoted to
the study of general properties of sets having calibre a
~
1.)
~l
(and calibre
~a
for
A typical such property is that if ill has calibre Xl then it still has
calibre
~l
in any c.c.c. extension.
The second, §§5-6, is concerned with certain
specific Boolean algebras, notably the "amoeba" and "dominating" algebras of [5] and [3], which (assuming AX ) are good examples of algebras having calibre Xl' and which have considerable imp6rtance for the study of Lebesgue measure and Baire category on the real line. shoUld like to acknowledge many helpful comments about the matters discussed here from L. Harrington, M. Magidor, A. Mathias, and R. Solovay.
§2.l.
We summarize briefly our notation for forcing.
A set of conditions (or
"notion of forcing") is a partially ordered set (P, s) with greatest element 1. For p,q
E
P, P is an extension of q if P ,; q.
have a common extension.
p and q are compatible if they
The subset A of P is an antichain if any two members of
A are incompatible, and P satisfies the countable chain condition (c.c.c.) if any antichain of P is countable. The subset D of P is dense if any mewber of P has an extension in D. "dense"
=
"cofinal").
D is dense open.
Let~
(Thus
If in addition any extension of a member of D is in D, then be a family of dense open subsets of P.
Then the subset
g of P is~-generic if (i) any two members of g have a common extension in g, (ii) (p, q) (p ,; q & P IE. g ~ q IE g), and (iii) g intersects every member of~. cAl is a model of ZF, g is Ji-generic if it is &9/1-generic, where ~il is the set of 595
If
596
JOHN TRUSS
dense open subsets of Plying invU'. vI£[g] is then the extension of "11 by g. We are now in a position to state Martin's Axiom. A~
Ct
of
Let (P, 5) be a partially ordered set satisfying the c.c.c., and~ a family
5 ~a
§2.2.
dense open subsets of p.
Then P has a
-generic subset.
Let IB be a complete Boolean algebra inuli, a model of ZFC. "ti IB is the corresponding Boolean-valued universe.
"real world").
reference on Boolean-valued models). zero members of lB.
Let P be the partially ordered set of non-
Thenci8 can be regarded as a "label space" for the extension
of J,/'by anvlf-generic subset 10f P.
1is
in fact anvil-complete ultrafilter on IB,
and is labelled by the "canonical generic ultrafilter" that for each b extension.
E
(utlmay be V, the ([1] is a good
IB, [b
E
l]IB ; b.)
1 invitB.
(1 is
chosen so
In general ~ is a label for x in a generic
When the extension is by an
Boolean algebra IB, we let x lie in,~.
-complete ultrafilter on a complete Members of dlare denoted by themselves,
where no confusion can arise. Any generic extension can be regarded as a Boolean extension. [8 pp.235-7] for example.
See
More precisely, let (P, 5) be a reflexive partially
ordered set in,1{, and IB the complete Boolean alpebra of regular open subsets of P. Then for any,li-generic subset g of P there is an,'U'-complete ultrafilter Jon IB such thatv~[g] ;~[1]' IB is called the complete Boolean alpebra associated with P, and the partial ordering obtained from IB by removing ID) is called the partial orderinp associated with IB.
If IB is a Boolean algebra, not necessarily complete,
then iB, the complete Boolean algebra associated with QB - {G}, 5) has a dense subset isomorphic to lB.
Identifying IB with this dense subset, IB is called the
canonical completion of lB.
If IB is already complete, IB ; iB.
References to IB
satisfying the c.c.c., dense open subsets oflB, etc. will be taken to apply to the associated partial ordering.
In particular, if IB is complete, "ull-generic
subset", ",~-generic ul trafil ter", and 'Clt-complete ul trafi Iter" all mean the same thing.
§2.3.
In this section we recall the material from [8] we need about two-stage Let IB, C be complete Boolean algebras, IB in ,1£ and.
Cohen extensions.
a:
in oJ!".
Then there is a complete Boolean algebra D invU, denoted by IB 0 C, such that
(~)[ ;~ID. The algebra IB 0 ~ corresponds to the two-stage extension ofJ{, first by IB, and then by C. tween pairs (gl,g2
)
In other words there is a canonical 1-1 correspondence be• such that gl IS
,,7
anc~-complete
ultrafilter on IB and g2 is an
Jt[gl]-complete ultrafilter on the denotation of Ii: in Jt'[gl] , and vttcomplete ul t rafilters g on IB 0 /C, satisfying,i£[gl] [g2]
= o!,([g].
Conversely, let IB be a complete subalgebra of D, both in,U.
Then there is
SETS HAVING CALIBRE inJiB a complete Boolean algebra II: such that 18 Finally, suppose that 18 0 II:
=
U.
0
597
~1
(j; 5!! 10.
Then 10 satisfies the C.C.c. in Jiif and
= 1.
only if18 satisfies the C.C.c. in dtand [II: satisfies the c.c.c.]18 §2.4.
Now we define precalibre and calibre and give some elementary properties.
We consider just the case a P has precalibre of cardinality ( b
there is B
~a
B) (b finite
c
( p
~
P has calibre such that {a
A:p
€
b)p
€
co~plications.
~a
such that
q).
$
if for any subset A of P of cardinality
a} has cardinality ~-chain
there is p
~a
€
P
~a'
condition
= ~l
(Thus C.C.c.
< ~a'
regular, to avoid
~a
A also of cardinality
c
P) ( q
€
~a
$
P satisfies the cardinality
and
~
if for any subset A of P (possibly with renetitions)
~a
(~a
- c.c.) if any antichain of P has
- c.c.)
By 2.2 these notions can of course be applied to a complete Boolean algebra E, and we shall not give separate definitions.
has precalibre
~a ~
P satisfies the
- c.c.
~a
Obviously P has calibre The first
arro~'
~
~
a
P
cannot te reversed,
as we shall rewark below, tut the second one sometimes can. If (P :y < S> is a family of partially ordered sets with greatest elements, we denote by
y
II P the "restricted direct product" of the P i.e. the family of y' y<S y functions p with dowain S such that (Vy)p(y) € Py and {y:p(y) f I) is finite,
partially ordered co-ordinatewise. Lemma 2.4.1: (i) P has calibre
~a ~
every dense subset of P has calibre
P has a dense subset having calibre (Similarly for precalibre
~a'
~a
~a ~
and
~a
c .c . )
(ii) Calibre dinality less than
is closed under restricted direct products of car-
~a
Precalibre
~a'
~a
is closed under arbitrary restricted direct
products. (iii) Let P be the partially ordered set associated with the 4-elemept Boolean algebra. precalibre
~a
but not calibre
Corollary 2.4.2: bre ~a+l
~
a
~
Then P a, the restricted direct product of
~a
copies of P, has
~(,.
(i) For each a there is a partially ordered set having precali-
but not calibre
~ . a (ii) For each a there is a partially ordered set havinp calibre
but not calibre
Lemma 2.4.3:
Let (18
~a'
limit number A
<
S,
Y
Y
algebras such that Y l
$
y~S
Y2
S> be an increasinp sequence of complete Boolean
< <
S
~
18
y1
is a complete subalgebra of 18'V ' and for any
18 y is dense in 18
'2
y'
Then i f every 18 y has precalibre
~a'
JOHN TRUSS
598
so does ~ and if ~ < ~ and every lliy has calibre ~~, so does ~ y<~ y' a ~ y<S y Lemma 2.4.4. (Kunen):
A~
implies that any complete Boolean algebra lli satisfying
the c.c.c. has precalibr~ ~l' Proof: Let A = {ba : a < WI} .=.lli - {GIl. Let qa = l:{b : a ,; Sl. Then S (qa : a < WI) is a descending sequence of members of lli. As lli satisfies the c. c. c. there is a O < WI and q such th2t (Va ~ aO)(q = qa)' Let D {b E lli : b A q = 0 a (]~ ~ a)b ,; b~}. From the fact that (~a ~ aO)q = l:{b : a ,; S} it follows
or
that each D
is dense open.
a ~.
ing q, where
= {D a
By A~
a < WI}'
S
1
there is a~ -generic subset g of lli, contain-
We let Al
desired uncountable subset of A. Corollary 2.4.5: direct products.
A~
= {ba
: a < WI} n g, and Al is the
implies that c.c.c. is closed under arbitrary restricted 1
This corollary is also due to Kunen, and follows from 2.4.1 (ii) and 2.4.4.
§3. ~a
We begin this section by obtaining bounds on the cardinalities of calibre complete Boolean algebras. This contrasts with the precalibre
~a
by 2.4.1 (ii) there are examples of arbitrarily large cardinality.
case, where Obviously if
E has a dense subset of cardinality < ~a' lli has calibre ~a' We shov here that GCH implies that the only calibre ~a algebras are of this form. Lemma 3.1:
Suppose P has calibre
least possible cardinality. Proof: there is
~
As ~
< a
~a
to
q~ E
E A.
and let Q be a dense subset of P of the
IQI
F ~a'
IQI = ~a'
and let
Q= {qs
Observe that A is dense in P.
such that qs ,; q.
: S < ~a}'
For let q
Taking the least such S we find that
is the smallest cardinality of a dense subset of P, [AI
there is q such that !{a E A
= qy
~a
Suppose on the contrary that
A = {q~ : (Vy < S) qy c{; qS}'
a
Then
q,; a}[
~.
= ~. a
Then A. Hence
a Let S > y be such that qs E A and q ,; qS'
Then qy ,; qs' contrary
A.
Lemma 3.2:
Let lli be a complete Boolean algebra having calibre
Suppose on the contrary that (lli
sequence of complete subalgebras of lli. there is b ~a
P. q~ E
Since A is dense, a'; q for some
~a'
Then there is
no strictly increasing seouence of complete subalgebras oflli of lenrth Proof:
E
Let
~a'
: S < ~a) is a strictly increasing S For each ~, lliS is not dense in lli~+l' so
E lli - {GI} such that ('Ib ElliS) (b ,; b ~ b = lD). As lli has calibre s S S+1 there is b F (j) in lli such that A = {b~ : b ,; b has cardinality ~a Thus if S}
SETS HAVING CALIBRE
599
~1
b' =AA, b' I O. As III satisfies the K - c.c. and K is regul ar , a a C = UUB : a < K } is a complete subalgebra of lIl. Hence b' , e, so for some a a a < Ka , b ' £ lila' As IAI = K we may take b to lie in A. But then b' ,; b and et a a so by choice of ba' b' = W, a contradiction. Theorem 3.3:
lflll has calibre K then III has a generating set (as a complete (X K < K , and hence itself has cardinality ,; a~et 2 a.
algebra) of cardinality
a
d be an enumeration of lIl. Let lila be the complete subalgebra of III generated by {by y < a}. By Lemma 3.2, A = {ba : lIlal lIl + has a l} cardinality < K • But clearlylll is generated by A. et Let III = {ba : a
Proof:
<
Remark: The significance of this result is that ill is generated by a set of cardinality K if and only if a subset of K •
VB
is obtained from V by adjoining a single (Boolean)
et
N 2 S ,; N • Then any complete Boolean algebra a of calibre N has a dense subset of cardinality < K . In particular, CH implies a a that there is, up to isomorphism, only one complete atomless Boolean algebra Theorem 3.4:
Assume that S < a
~
having calibre K l . (We call this algebra the category algebra. of Borel sets factored by the ideal of meagre sets.) Proof: ity
<
By Theorem 3.3, IlIll ,; Ket •
It is the algebra
By Lemma 3.1 III has a dense subset of cardinal-
Ne< We conclude this section by proving absoluteness results for calibre K
et
and precalibre Ne<' Theorem 3.5:
Let ill satisfy the N - c.c. and
[f has calibre ~a ]lIl = 1, Le.
"ca~ibre
Similarly for precalibre N(X. Proof:
Firstly we remark that
standard embedding of V into example, if
et
~ have calibre K. Then e< N " is preserved under K - c.c. extensions. e< a
~a is the image in VB of the ordinal Ne< under the and that [K = ~ ]lIl need not equal 1, For a et •
VB,
that [(3c
€
~
VB.
-
{~}) I{~ :
c ,; £ell = Ne
Let q be an arbitrary member of III - {O}, and for each a that b
=
VB.
= 2 and III collapses N but not K then N = K is true in l 2, 2 l < ~ } be a family of ~ members of ( - {()} in We show
Let {~ : a
q A [£e = ca]lIl I @, (i.e. c
is one of the values of
choose c such a allowed by q.)
< Net
£e
a a As [ has calibre Ne<' and these choices are made in V, there is B ~ Ne< of cardinality N such that c = A{c : S € Bl F G. Let qa = L{b a,; y £ B}. Then a y S (qs : a < Na ) is a decreasing sequence of members of ill. As ill satisfies the K a c.c. there is S < Ke< and q' such that (Vy ~ a)(qy = q'). We have @ < Q' ,; q, and for each y, if D = {b E III : bAq' = G or (30 ~ y) (0 E B & b ,; boll then D y y is dense open. We let A = {y £ B : by Then as J is generic, it intersects
J}.
JOHN TRUSS
600 each Dv with probability [~
'm
=. B]
= li, so q ' s
q'.
~
Ny
Hence q'
m
!9
~
[A
c ~.sy] We have shown that {q Em: q ~ [(3c E
.
But also
c ~
-- a
0
is dense open, establishing the result.
E [
{@})I{y : c ~ c y }I
-
Ka ~}
=
Observe that we have used here a modi-
fication of Kunen's argument of Lemma 2.4.4, but for an arbitrary (regular) The reason we could only prove 2.4.4 for a
=1
~a'
was that Martin's Axiom only
applies to c.c.c. algebras. The proof for the case of precalibre Lemma 3.6:
vtt and c!f, ]l
Suppose Jll~v¥are models of ZFC,
Proof:
To show that
mdoes
Remark:
not have calibre
in jd,
K
it is enough to show that m does
K
-
c , c ;" is Ill'
But it is not clear whether
We now give various equivalent conditions for m to have calibre K.
JB
Then
is Ill'
K"
useful of these is that m has calibre and
K
is a III property.
K"
Similarly ' 113 satisfies the
or not ''B has precal ibre
§4.
an ordinal which is a cardinal in
in Jr.
K
But ' 113 has cal ibre
0
K
is similar.
and m a complete Boolean algebra in vilnot having calibre
does not have cal ibre
not
~a
a
The most
if and only if it satisfies the K - c.c. a contains no "essentially new" subsets of K a• ~a
Let m be a complete Boolean algebra satisfying the K - c .c., a regular and uncountable. Then the following are eouivalent.
Theorem 4.1 :
m has calibre
(i}
~
(iii)
[(IIA -c Ka HIAI
(iv)
a
a
[eVA ~ On)( IAI = ~a .... (3B E V) (B =- A & IBI = ~a mm
(ii)
~
= Ka .... (3B
For any family (X
s:
V)(B
=- A &
IBI
= K)) m a
1.
K of non-empty open subsets of m there is a) a subfamily of cardinality K with non-empty intersection. a
(v)
S
E
1.
<
The intersection of any decreasing Ka-sequence of dense open subsets of m is dense open.
JB
(ii). Let A E be a Boolean term such that Proof: (i) • mb = [A c On & IAI = K] >~. Let D be the set of all p E m such that p A b = 0 a or [p ~ b & ( B E V)(IBI = K & P $ [B c A]m)]. We show that D is dense open. For let p
~
a -0 be arbitrary, and suppose that p
$
b.
Since p ~ [~=- On & IAI
.
= Ka],
if we let Ps = p A [S E ~], then Ka $ I{S : Ps ~ O}I. As m has calibre Ka there is q ~ 0 such that K $ I{S q $ PS}j Let B = {S q $ Pel. Then B E V, a Ka $ IBI, so there is Bl =- B, Bl E V such that IBII Ka. Thus q $ [B l =- A] and 0
601
SETS HAVING CALIBRE N 1 ~a' giving q E 0.
IBll
(ii)
(iii).
~
(iii)
~
Obvious.
(i).
Suppose {PS : S <
- {oJ.
~a} ~m
We must find a non-zero q
such that I{S : q ~ PS}I = ~a' Let qs = ~{Py : S ~ y}, Then as m satisfies the ~ - c.c. there is S < ~ and q' such that (Vy ~ S)(q = q'). As in the proof of a (l, • y v m Theorem 3.5, if we let -A = {y < H0: : pY E -j} then q' ~ [A is cofinal in ~] , and 0: so as ~
is regular in
JB,
q' ~ [I~I
B E V such that q' ~ [B c A & IBI 1, q ' ~ [q {Py: y E B} E ]Jm.
=
=
~a]m.
By our assumption (iii), there is
Since -J is V-complete with probability It follows that q ~ e, and) {y : q E py}1 = ~a'
~a ]m.
as desired. (i)
~
(iv).
To say that
X ~
m -
{Ol}
is "open" just means that
X ~ P E X). Let X E X ' each S. As m has calibre ~o: there s S in m such that [t s : x s xs}1 = ~C1' Let Y = {X : x ~ xSL Then Y is S a subfamily of < X : S < ~a} with non-empty intersection. s (~p,q) (0 < p ~ q E
is x
~ (j)
Let X = {q E Os : q ~ p} where Os s Then X is a non-empty open set. s By (iv) there is q such thatl{S : q E xS}1 = ~a' Thus q ~ p and (iv)
~
(v).
Let p
~
0 be arbitrary.
is the S'th member of the decreasing sequence. q < n{oS
S < ~a} since the sequence is
n{oS : S
< ~a}
decreasing. It follows that
is dense open.
(v) ~ (i). Let {PS : S < ~C1} ~m - {GIL before there are Sand q' such that (vy ~ S)(qy
=
Dy = {q Em - {oJ: q A q' = 0 or (30 ~ y)(q ~ po)}' Then.
Corollary 4.2:
lfm has,calibre HCI+ l and is
(~a'
As
Let qs = i:{Py : S ~ V}. q'). Let
y < ~C1} is a de-
y
.
< ~C1}
is dense
oo)-distributive, then it is
completely atomic. Proof:
By Theorem 3.3, m is generated by a set of cardinality ~C1'
obtained from V by adjoining a subset of set already lies in V. Theorem 4.3:
Hence V
=
JB.
is regular and uncountable, m
• m
&[[ has calibre Ha]
Proof:
As m is
=
0
= 1.
0
fies the
.,.
~a
- c.c.]
lB a
oo)-distributive, this
JB
be such that
1.
0
~ satisfies the c.c.c. if apd only if m
A similar proof shows that
- c.c. if and only if m satisfies the ~
= n.
is
~ has calibre ~o: if and only if m has calibre
By the last sentence of 2.3, ill
~ satisfies the ~
JB
Then, in the notation of 2.3, if ~C1
satisfies the c.c.c. &[( satisfies the c.c.c.]m =~. m
(~a'
Let m be a complete Boolean algebra, and ~ E
[[ is a complete Boolean algebra] m ~C1
~C1'
Thus
ct
- c.c.
The result now follows easily from Theorem 4.1
[[ satis-
JOHN TRUSS
602 (i)
<>
(iii).
Remark:
Solovay has remarked that (for the case a
calibre ~a without [( has precalibre ~ ]ID the case that iflli has precalibre has precalibre
=1
=
1) lli
0
~ may have pre-
However it is still a IDand [[ has precalibre ~] = 1 then lli ~ ~
~
being true.
~a'
We now prove two theorems about apparent strengthenings of "calibre In fact we do not actually know that these are all strengthenings. it is not clear whether 4.4 (v) is really stronger than "calibre
~a".
For example, On the other
~a'!.
hand 4.S(v) definitely is stronger, at least in the case where a > 1, since it implies calibre Theorem 4.4: ~a
is regular and uncountable. (i)
6
< ~a}
of dense open subsets ofID there is a
: 6 E Xy } is dense open.
For any subset A of ID of cardinality
(iii)
such that ( p E A)(p
< ~a
[(VA
A is the union of <
m has
(vi) ~
(ii)
<>
calibre (iii)
- c.c. for some S
Remark: If a
=
space of ill is
<
such that (Vp
< ~a
<>
E
p}).
~
<
~a sets lying in V)]lli
f.
~a sets lying in V)]ID = ll..
FaT any subset A ofm - {W} of cardinality
(v)
~ ~a
A)(3q E B)(q
there is a subset B of p).
~
~a'
(iv)
~
(v)
a, then (v)
~
~
(vi).
If in addition ill satisfies the
(ii).
1, (i) implies that the filter of comeagre subsets of the Stone ~l-complete.
Proof: (i) ~ (ii). PS}'
-+
there is a subset B oflli
~ ~a
r{q E B : q
=
~ a ) (A is the union of
c
-
ID - {G} of cardinality
Then (i)
~a
[(VA.=. OnHIAI ,;
(iv)
~
c.c. where
-
U{X y : y < S} of ~a' some 13 < ~a' such that for each y < 13,
=
of cardinality
p
~a
Consider the following.
For any family {OS: 13
(ii)
~a
a.
~
Let lli be a complete Boolean algebra satisfying the
partition ~a U{D
for all appropriate 13
~a
Let {PS
13 < ~a}.=.lli - {(j)}.
Let Os
= Ip
Clearly Os is dense open, so by (i) there is So
E lli : pAPS
< ~a an~
a
= ()
or
f~mily
{E : y < SO} of dense open subsets of lli such that (\Is < ~ ) (3y < SO) E c OS' For y a y each y < So pick a maximal antichain of Ey' and let B re the u~ion of 811 these
antichains.
As ~a is regular, and lli satisfies the ~a - c.c., IBI
show that for each 13
< ~a'
Ps
=
L{b E B : b ~ PS}'
Let q
=
We must
< ~a'
Ps - L{b
E
B : b ~ PS}'
Let y be such that Ey .:. OS' As B contains a maximal antichain of Ey ' q F ~ implies that q A b l F 0 for some b l E B n Ey ' As b l E OS' b l A Ps = () or b ~ PS' l Since q ,; Ps and q A b l F 0, we must have b l ,; PS' Hence b l is a summand of L{b
EBb
desired.
~
pJ' giving q A b l
= 0,
a contradiction.
It follows th8t q
= 0,
as
SETS HAVING CALIBRE (ii) ~ (iii). let Ps
p
Suppose p
H E ~]ffi
A
=
ed by p, i.e. 13 such that Ps I O. For each 13 let D S dense open. For suppose x
1:{x $
x E X &x
A q
PS'
Thus x
A
then p
$
.[~
= U{Ax
< q
(J
$
P
qED
Xn
Let!l
= {q
: x
$
S}'
PS'
1 n DS'
$
pS in
X
E !l' giving 13 (iii)
~
E
For each ordinal
c c , , ~has';; v
~
Cl
members allow-
x,;; PS}. or (]x E X)(q
Then as P
x
$
1:{x E X
$
PS)}'
x
$
s Hence there is x E X such that x
P A
Then D is S
S}, q I
(J
Then P
s
Eland as D is dense open there is S $ x ,;; PS' Thus x E so
1,
As q E D there is x E X such that q S Ax, some x ( !l'
(iv).
and
Then [I!ll < KCl~ = 1. We claim that if Ax = {S : x $ PS} x E !l} ~ ~ is clear. !l}]ffi, as desired. That U{A x
On the other hand, suppose 13 E A. q
Cl
-
Hence, by (ii) there is Xc ffi - {OJ of car-
= 1:{x EX: : q A ps = (J $
K]ffi I a.
$
Sinceffi satisfies the ~Cl
dinality < ~Cl such that (\'S)pS
q
[A c On & IAI --
603
~1
Obvious.
!}.
(iv) ~ (ii). Let A ~ffi - Iul , A {pS: 13 < ~Cl}. Let X = {S : Ps E By (iv), X = U{X : Y < 13 } for some Sn < ~ with X E V for each y. As 18 satisfies the
-
-'Y
-r-O
~
- c.c. the possible values of
Cl
-'Y
~ , by So say. For Cl '1 Cl ffi each y < SO' let ~y = A{PS : 13 E ~y}' Since d is V-complete, [~ E J] = n. Sinceffi satisfies the ~Cl - c.c. each ~y has < ~Cl possible values. Hence there is ~
~
are bounded below
BE V, B ~ffi - {(J}, IBI < ~Cl' such that [( y)~ E B]ffi We show that for each 13, pS = 1:{b E B : b that q
=
= 1.
PS}' Suppose on the contrary ffi PS - 1:{b E B : b ,;; PS} I
A [~ $ PS]ffi I O. Let b be the value of q allowed by Y,JB ffi Y , i.e. such that q A [~Y $ PS] A [~Y = b] I a. Then [b s: PS]ffi I ClI, so b,;; PS' This implies that b A q =
q A [13 E X ]ffi
ffi
-'Y
q
A
follows that q
(v)
I
[~ s PS]
~
[~ = b]1B
A
$
q
A
[b E _]18 = q
A
b
= (),
(ii) ~ (v).
Obvious.
(v)
Immediate from the regularity of
~
(vi).
Now we assume that 18 satisfies the (ii). Let A ~ffi - {(H have cardinality
~S
~ Cl
y < ~S by induction on y.
a contradiction.
~Cl'
- c.c. for some 13 <
, A
=
{p, : " < "
l.
~
u
Cl
Cl,
and show that
Define X for y
We shall have Xy = {xy : 0 < ~Cl}' IXyl < ~Cl' and for each 0, {x~ : y < ~S} are pairwise disjoint. Suppose Xy defined for y < YO' is a subset of 18 of cardinality < ~ as given by (v) such that I
0
\0
Cl
0
1: x ;! a + (h E X )(x I (t & x s p, - 1: x)). Adding 0 to X y
JOHN TRUSS
604
is as desired. Theorem 4.5:
Let ffi be a complete Boolean algebra satisfying the c.c.c.
Consider
the following. (i)
For any family {D
partition
~o:
; U{X
open.
S : nEw} of
n
: S <
For any subset A of ffi - {@} of cardinality
(ii)
subset B of ffi - {
of dense open sub s et s of ffi there is a
~o:}
such that for each n, n{D
~o:
[cdA::.
On) (IAI
A)p
E
E{b
Then (i)
~
$
p}. li.
ffi [(VA::. Xo:) (A is the union of ~O sets Lyi np in vn ; 11.
{G} of cardin81ity
For any subset A offfi
~
Xn } is dense
E
there is a countable
$
0:
: S
~o: ~ A is He union of ~O sets lying in V)]ffi
$
subset B of ffi - {@} such that (tip E A) (Jq (vi)
B : b
E
S
ffi has calibre (ii)
~
(iii)
~
~
a
B)q
E
$
~a
there is a countable
p,
$
.
(iv)
~
(v)
~
(vi).
Each of (i) - (vi) implies the
corresponding clause of Theorem 4.4.
§5.
For the rest of the paper we shall be concerned with five particular Boolean
algebras, which we now define. The measure algebra is the algebra of Borel sets of reals modulo the ideal of null (measure 0) sets.
The category algebra is the algebra of Borel sets of
Teals modulo the ideal of meagre (1st category) sets.
The amoeba algebra, the
dominating alfebra, and the amoeba algebra for category are the complete Boolean algebras associated with the following sets of conditions.
The amoeba list of
conditions is the set of pairs (p,x) such that p is an open set of reals, x is a real number (or +00), and
<
~(p)
x.
We let (p,x)
~
(q,y)
~
p::. q & x
dominating set of conditions is the set of pairs (n,f>, nEW, f let (m,f)
~
(n,g)
<>
m $ n & ( i)f(i)
$
g(i) & ( i<m)f(i) ; g(i).
E
~
W
W ,
y.
The
an,' we
Finally the
set of conditions for the amoeba algebra for catefory is the set of pairs (a,D) such that a is a finite sequence of members of 2<w and D is a dense oven subset of 2<w. ( i
E
In this case, c e,r» ~ (T,E)
dom T - dam a) T(i)
E
D.
T extends a (as a sequence) & D::> E &
<>
The two amoeba algebras are described essentially
in [5], and the dominating algebra in [3]. Lemma 5.1:
Let ffi be any of the five algebras just mentioned, and let A ::.ffi - {OJ
have cardinality
~a' ~a
countable Be ffi - {O},
regUlar and uncountable. (~a E
A) (3b
E
B) (b
$
a).
Then Hence
A~o:
implies that for some
A~
~ffi 0:
has calibre
~o:'
SETS HAVING CALIBRE Proof:
605
~1
The result for the category algebra is obvious, since it has a countable
dense subset (and to Solovay.
A~
a
is not needed).
For the measure algebra the result is due
We just prove the result for the amoeba algebra, the other cases
being similar.
The essential points for the dominating algebra and the amoeba
algebra for category are that
A~
a
~
any
~
function, and that 4.5 (i) holds for w<w.
a
functions are dominated by a single
Let P be the amoeba set of conditions. pose A c P.
Since P is dense in ill we may sup-
Moreover {(p,x) E P : x rational} is clearly dense in P, so again we
may suppose A is contained in this set.
For each n and rational q, let P{n,q) be 1 the set of open sets of reals of measure < q - n ' and let A(n,q) ~ {p E P(n,q)
(p,q) E A}.
Then A
~
{( a,q) : a E A(n,q), nEw, q rational}.
It is shown in
[5] that P(n,q), partially ordered by reverse inclusion, satisfies the c.c.c. corollary 2.4.5, (P(n,q))w satisfies the C.c.c. (Actually, ~ establish this).
1
By
is not needed to W
For each a E A(n,q) let Da ~ {( PO,Pl ,PZ"" ) E (P(n,q)) : (3m)a.=. Pm}. Then Da is denselopen, so by A~a there is a sequence (PO,Pl'PZ"") 0f open sets of measure ~ q - n such that (Va E A(n,o)) (3m) (p .=. Pm)' Let B(n,q)
~
{(Pm,q) : mEW}.
Then B
~
U{B(n,q)
n
€
W
& q rational} is as re-
quired. We now observe that ifv~.=. JVare transitive models of ZFC, the amoeba set of conditions P in Jtis not in general equal to the amoeba set of conditions Q in Jr, since ~f"may contain real numbers not invU.
Moreover, even if P is regarded as
a subset of Q in the natural way, P need not be a dense subset of Q, and indeed the complete Boolean algebra (invn associated with P need not be a complete subalgebra of that associated with Q. For the measure algebra however there is a redeeming feature.
Namely, if
x is "random" over ~V (I , e. it determines in a natural wayan w'-generic ul trafi Iter on the measure algebra in,~) then it is also random over,U; and similarly for the category algebra. Lemma 5.Z:
We now- give an analogous result for the other three algebras.
Adopting standard methods of coding countarle sequences of open sets
and functions etc. by real numbers, the predicate "x codes a maximal antichain of the amoeba algebra"
(dominat~ng algebra, amoeb~
algebra for category) is
rr~(x).
Hence, i f uti.=. ,V are transitive models of ZFC, the natural map e from the amoeba algebra ill in vll into the amoeba algebra
a:
in
01' preserves maximal antichains lying
indl, and so for anyvV'"-generic ultrafilter g on It, e- l (g " ell ill) is anult'-generic
ultrafilter on ill. Proof: coding.
(Similarly for the other two cases.)
Let (Pn'xn) be the nth amoeba condition coded by x under the standard Then "x codes a maximal anti chain of the amoeba algebra" ... (\I n) 1J (Pn) < xn & (Vm,n) (m
(Vy) (y codes (p,y') E P
-+
f n -+ 1J(Pm U Pn) ;" min(xm,x n)) (In)1J(Pn up) < min(xn~Y')).
&
606
JOHN TRUSS 1
This is IT (x). I category. Remark:
Similarly for the dominating algebra and the amoeba algebra for
This lemma tells us only that if A is a subset oflli lying in
a maximal antichain of lli
<>
8"A is a maximal antichain of Ii:.
this holds for arbitrary A in
and in fact this is false.
then A is
It does not say that
To illustrate this we
consider the case where lli, ([ are the measure algebras indland ,Yrespectively, and ,V is obtained fromJf by adjoining a single Cohen-generic real.
It is easy to see that in this caseuVcontains an open set a of measure 1 such that for any closed set x coded in ulf, !lex) >
°..,.
)l
(a n x) > 0.
Let A be a
family of pairwise disjoint open intervals with rational endpoints contained in a such that a - UA is countable, and for each pEA, let [p]<Jl, [prj!' be the members o f B , lC containing p . Then Al
=
{[p]Jt: pEA} is a maximal antichain of' B,
For
if x is a closed set of positive measure in,~ such that [x] is incompatible with every member of AI' then !l(x n p)
=
0, all pEA, giving !l(x n a)
=
° and hence
However A = 8"A = {[p] = pEA} is not a maximal Z l antichain of lC, since the complement of a is a closed set of positive measure in-
)lex)
=
0, a contradiction.
compatible with every member of A Z' It follows that 8" lli, the canonical completion of 8" lli in
Similar remarks apply to the other algebras (except the
Another (less direct) way of seeing this is to use Lemma 5.1,
the result of [8] that there is a Boolean extension of the universe in which
A~
is valid, and Lemma 3.6.
I
The next result justifies the use of the amoeba algebras and the dominating algebra in the context of measure and category. Theorem 5.3:
(i)
[)l{x (ii)
Let lli be the amoeba algebra,
x is not random over V}
= O]lli = ~.
Let lli be the amoeba algebra for category,
tt{x : x is not generic over V} is meagre]lli (iii)
=
Then
~.
Let lli be the category algebra and~1i: the dominating algebra in
Then [{x: x is not generic over V} is meagre]lli0[ Proof:
Then
= n.
JB.
We recall from [7] that x is random (generic) over V <> x,does not lie in
any Borel set of measure 0 (which is meagre) coded in V.
It is proved in [7] that
there is a canonical 1-1 correspondence between reals random (generic) over V and V-generic ultrafilters on the measure (category) algebra. The proofs of (i) and (ii) are given in [5].
Our proof of (iii) is similar
to that of (ii). Let x be Cohen V-generic and f dominate every member of (ww) V[x]. exist in JB0[.
We work in Cantor space, ZW.
x and f
Let (qn : nEw> be an enumeration
SETS HAVING CALIBRE in V of the points of Z<"', and for any g n
E
","',
~1
607
let X
{U{[q . ..., og(i)] : n'; U:
g
a sequence of nO's, and ~ denotes concatenation. a(n)
ben) mod 2.
+
abX
=
g
g
{a b b : b EX} of X are comeagre. '"
g
n
Xf
E
g
0 b x.
U{[q .~ og(n)] : n
E ",}
n
~
Z"', let (abb)(n)
=
We claim that for any dense open
It follows that 0 b x contains a set of the form for some function g
E
V[x].
As f dominates g,
E, so x b Xf ~ x b E ~ (0 b x) b x = D, as desired. It will follow from the results of §6 that ifffi is the category algebra,
and [ is the amoeba algebra in . meagre ]m0
§6.
E
On is
coded in V, x b Xf ~ 0, which is enough. Now for every n, x b (qn ~ 0"') is Cohen V-generic, and hence lies in O.
Therefore q .~ 0'" =
For a,b
0,
Then it is clear that X and any translation
subset 0 of 2
E
1
Here, [0] is the set of all (infinite) paths passing through
E ",}.
=
~~.
JB,
then [{x: x is not generic over V}
This section gives partial results about embeddings between the five alge-
bras introduced at the beginning of §S.
Roughly speaking, when attempting to
establish the existence of embeddings, we think of the measure and category algebras as small, and the others as large.
Firstly we recall well-known results
about the measure and category algebras, ffi and C. (1)
Neither ffi nor [ can be embedded as a complete subalgebra of the other.
(See [2] for example). (2)
Let ill 0 ffi be the complete Boolean algebra associated with the product
notion of forcing of two copies offfi; ffi
®ffi
to [; hence ffi 0ffi and ffi are not isomorphic. (3)
Ifffi',
thenffi 0ffi' ~ffi, (\; Tfieorem 6.1:
[I
PI
a:
0
[~a:.
are the measure and category algebras in
0 (i:'
VB,
Va: respectively,
~
Let ffi be the amoeba algebra, the dominating algebra, or the amoeba
algebra for category. Proof:
has a complete subalgebra isomorphic However
Then ffi ®ffi ~ ffi.
Let P be the amoeba set of conditions, and let
= {(p,x)
E
P : p
~
z = {(p,x)
(_oo,OJ) and P
E
P : p
~
(O,oo)},
Obviously P, PI'
and P are equivalent notions 0f forcing, and so we just have to check that P and
z
PI x Pz are also equivalent.
P clearly adjoins an open set A, and PI x P ad-
z
joins a pair of open sets (Al,A Z)' We let (A1,A ) = e(A) <> A = Al U Az U {O}, z just have to check that A is V-generic on P <> ( Al ,A is V-generic on PIx PZ. Z) This is routine.
We
For the dominating algebra we use a pa1r1ng map e to pass from f to (fl,fZ),f l (n)
= f(2n),f Z(n) = f(Zn+l),
and the same technique works for the amoeba
algebra for category.
Now we give the main technical lemma for the dominating algebra, with simi-
608
JOHN TRUSS
lar results for the amoeba algebra and the amoeba algebra for category.
In-
tuitively, the force of the result for the dominating algebra is this.
It is
clear that not every function f which dominates every member of (ww)V need arise from a generic ultrafilter on the dominating algebra. only even values it will certainly not be generic. ultrafilter, (i.e. {(n,g)
E
P : fin
= gin & (
For example, if f takes
If f does arise from a generic i ~ n)g(i) ~ f(i)} is a V-
generic subset of P) then f is called V-generic dominating.
Even if f is not V-
generic dominating, however, it can be turned into a V-generic dominating function by adding a Cohen V[f]-generic function. condition (n,f)
E P
This corresponds to the idea that each
consists of two parts, f ~n being the finite, Cohen, part,
and f the "dominating" part. Lemma 6.2:
Suppose f,x E
is Cohen V[f]-generic.
WW
are such that f dominates every member of (ww)V and x
Then g defined by g(n) = fen) + x(n) is V-generic domina-
ting. Proof:
Let P be the dominating set of conditions, and D
of P. (~i
Let D
I
E:
= {a
dom o)f(i)
Since DI
E
+
<w : (;Jf' E V) ( dom a,f') E D &
o(i)
= f'(i)}.
V a dense open subset
E
(Vi
E: W
f
dom a)f' (i) s f(i) &
We shall show that D is dense open in w<w
V[f] and x is V[f] -generic, x ~ n
I
E:
Dl' some n ,
It follows that the
subset of P determined by g intersects D, establishing its V- genericity.. <w Let 0 E w be arbitrary, n dom o. Let p = (n,f l) where
f (i) = f(i) + o(i) (i < n), f l (i) a (i l antichain of extensions of p in D (in V).
~
n).
Thus p
E:
P.
Let A be a maximal
By Lemma 5.2, A is also a maximal anti-
= f l (i) (i < n) , f 2(i) = f(i) is an extension of p in pV[f], so is compatible with (m,f E: A, 2) 3) say, in pV[f] As f dominates f there is k ~ m such that (Vi ~ k)f ~ f(i). 3 3(i) Define 1 as follows. Dom 1 k. T(i) o(i) (i < n), chain of extensions of p in pV[f]. (i ~ n).
(n,f
r Li ) = max(f(i) ,f3(i)) - f(i) (n f(i)
Now let f2(i)
= f(i)
~
i < k), and let f ' be given by
f 3 (1 ) (i >. k) . We show that f' is as reD Since (n,f are compatbible, and n ~ m, l, 2),(m,f3) f(i) s f 3(i) for n :0; i' < m, so r I i ) = f f(i) and f ' (i) f Hence 3(i). 3(i) (k,f')~ (m,f E D, so Cdon, 1,£') = (k,f') ED. Finally i f f dom 1, i ~ k , 3) so f' (1) f 3(i) s f(i). Hence 1 E D l. +
r I i ) (i < k), f" (i)
quired to establish
Lemma 6.3:
1
E
Suppose A is an open set of finite measure such that for every open
set B of finite measure coded in V, there is a finite union C of open intervals with rational endpoints satisfying A u B
=
A u C.
Then for any Cohen V[a]-generic
real x, V[A] [x] contains a V-generic ultrafilter on the amoeba algebra. Remark:
The point of the condition placed on A is that it is clearly satisfied by
any amoeba V-generic open set, while not in itself being sufficient for genericity.
SETS HAVING CALIBRE
609
~1
Let P be the amoeba set of conditions, and let Q be the set of pairs
Proof:
(q,y> such that q is a finite union of open intervals with rational endpoints and
Q is partially ordered by
u q) < y (y a positive real number, or +00).
~(A
(ql'Yl) ~ (qz,YZ> ~ ql ~ qz & Yl ~ yZ' Q is a notion of forcing in V[A], and has a countable dense subset, namely {( q,y) E Q : y rational} Hence V[A] [x] contains a V[A]-generic subset of Q, giving rise to a certain open set B of finite measure.
We claim that {( p,y) E P : p
~
A u B & ~(A u B) < y} is a V-
generic subset of P. Let 0 be a dense open subset of P lying in V. 0
= {( q,y) E Q
(3( p,y'>
1 open subset of Q. q
that p
~A
u B&
Let (q,y) ~(A
O)(p
~
~(A E
u q) < Z :; y.
Let
Au q & y :; y')}.
It will follow that (q,y>
B & ~(A u B) < y.
c
E
E
We show that 0 1 is a dense 0 1 for some (q,y) such that
Taking (p,y') • 0 such that peA
U
q & y s y', we find
uB) < y', as desired.
Q be arbitrary.
Let z be a rational number such that
Then (q,z> E P.
Let C be a m2ximal antichain of extensions
of (q,z> in 0 (ia V). By Lemma 5.Z, C is also a maximal antichain of extensions of q,z in pYlA]. But (A u q,z) is an extension of (q,z) in pV[A] so it is compatible with some member (A' ,Zl> of C in pV[A].
As A' is codec in V and has
finite measure, there is a finite union a' of open intervals with rational endpoints such that A u A'
Au q'.
We claim that (q
uq', min(z,z') > :; (q,y) in
Q and (q uq',min(z,z')
E 0 1, giving the desired conclusion. Firstly, as (A u q,z > anc' (A',Z ,> are compatible in pV[A],
~(A
u At u q) < min(z,z'), so \l(A u (q
(q u q !
min(z,z'». Q.
,
( A' , ZI)
E
1Jl
q')) < min(z,z') giv i.ng
To see that(q u q',min(z,z')
0 by choice of C and
that A'
c:
E 0 1, observe that A u q u q I & min (z , z ') :; ZI •
Lemma 6.4: Suppose a E (Z<w) W and x are such that (a) ( ~ E Z<w){n : a(n) :; infinite (remember, o(n) :; .p means o(n) extends .p), (b) (VO E V) (3 n) (0 a dense <w open subset of Z ->- ( i ~ n)a(i) E OJ, (c) x is Cohen V[s]-generic. Then V[o] [x] contains a V-generic ultrafilter on the Proof:
a~oeba
~}
algebra for category.
Let P be the set of conditions for the amoeba algebra for category, and n, E (Z<w)n, 8 E w
let Q be the set of all triples (T,8,n> such that nEW, T
~ (T ~ n :; n & T :; T &.8Z :; 8 l,nl) Z l Z,8 Z,n z> l Z 1 ( i)(n l :; i < n Z ->- 8 Z(i) ~ nl)' Q is countable, and hence in V[a] [x] there is a V[a]-generic subset of Q.. This provides a pair (T* ,8* > such that 1:* E (z<w)w and
partially ordered by ('1,8
8*
E
wW.
Define
('<ji ~ n)ljJ (i)
E
ljJ
by ljJ(n)
= a(8*(n))
~ T*(n).
O} is a V-generic subset of P.
We claim that 'j
= {( ljJ
~ n,O>
E
P :
The idea is that 8* "shuffles" the
order in which the a(i) appear, and T* adds small pieces at random.
The purpose
of the third co-ordinate n is tc ensure that each a(i) appears only finitely often, so that property (b) is inherited by To see that
1 is
ljJ.
V-generic, let 0 be an arbitrary dense open subset of P
610
JOHN TRUSS
lying in V, and let (T,8,n}EQ
(3D' E V)((Vi ;, n)a(i) E 0' & « 0'(8(0))" T(O), ... ,a(8(n-l)) '""' r Iri-L) >,0' > ED)},
We shall show that 0
1
is dense open.
It will then f'ol l ov' that for some nand
0' E V, (T * In, 8* ~ n , n > E Q & ( i :> n) a (i) ED' & i;' n , ljJ(i)
=
~ n,D' > E
(ljJ
a(8*(i))
3 n 0,
~
r I i ) ,; 0(8*(i))
(ljJ
~ n , 0 '}
£
D.
Then if
a(j), some j ;, n , so ljJ(i) ED'.
Hence
required.
8S
Let (T,8,n} E Q be arbitrary. aleen-I)) ~ T(n-l) >,2<w}.
Let p =
Thus pEP.
«a(8(O))~
T(O), ... ,
Let A be a maximal antichain of extensions
of p in
u.
pV[aJ.
Since «0'(8)(0)) ~ T(0), ... ,a(8(n-l)) ~ r Ir.-L) }, {
By Lemma 5.2, A is also a maximal antichain of extensions of p in
is, in pV[a], an extension of p, it is compatible with some memcer (X,D'> of A. Let m
= dom
X.
{
k ;, m such that (Vi;' k)a(i) E D'. T '( i)
T(i), 8' (i)
and for m';
= a(8(i))
Thus m;' n , and (Vi < nJx(i)
patibility, n'; i < m'" Xli) E
e (i);
< k , o(i)
roo
:
(3j ;, n)
'"'T(i).
Also, by com-
By condition (b) there is
Choose T' ,8' h2ving domain k so that for i < n
for n ,; i < m, 8' (i) ;, n and X(i) = a (e ' (i)) . .."T' (i) ;
T' (i) E D' and 8' (i) ;, n ,
Then (T' ,8' ,k> ,; (T,8,n> and
(T',8',U E D since «a(8'(0))'---"T'(0), ... ,a(8'(J-l)) ~T'(k-l)},D'} ,; (X,D'> E D. l
We now give results about embeddings. Let ill, [, D, E be respectively the category algebra, the amoeba
Theorem 6.5:
algebra, the dominatinr algebra, and the amoeba algebra for category, and let 11:' ,
D' be the amoeba and dominating algebras in (i) (ii)
ill
eD'.
Remark:
VB.
Then
D can be embedded as a complete suhalgebra of each of [ and E, and
E can be embedded as a complete subalgebra of each offfi
® ['
and
In view of the fact that we are conceiving of ill as being "small" in re-
lation to C, D, JE, this tells us that ill and JE are "almost" the same, and probably that
Q;
Proof:
is too. (i) To embed ill in Il: we show that for any V-generic ultrafilter Jon Il:
there is a V-generic ultrafilter 8(1) E V[J! on D. filter on ill is in the range of 8.
= [d
E 8(lJ][.
The embedding
Moreover any V-generic ultrais then given by
Similarly in the other cases.
Let A Ie an open set of reals arisinr in the natural way from a V-generic ultrafilter on the amoeba algelra. fen)
=
Let x
= ~(A
the greatest i :> 0 such that (~ , ~)
z"
2n
finite measure, so x and fare well-definec.
n (-00,0)), and define f E c
-
A, if any,
=
WW
0 otherwise.
by A has
It is easily checked that f domin-
SETS HAVING CALIBRE
611
~1
ates every member of (ww)V and d is a Cohen V[fJ-generic real.
So by Lemma 6.2,
V[A] contains a V-generic dominating function g, as required. Secondly, let (on: nEw> be a sequence of members of 2<w arising from a V-generic ultrafilter on the amoeba algebra for category. fen) = mi n i dom o(i)
o(i)'; r t n) }, where r In)
=
Define f by
On,.... (I >.
It is easily checked
that f is V-generic dominating. (ii) Since D' can be embedded as a complete subalgebra of [' by (i), it is enough to show how E can be embedded as a compl ete subalgebra of lB
0' D'. Let
(x,f> be a pair such that x is Cohen V-generic and f is V[xJ-generic dominating. Let
£' and x' be given by f'(n)
x'(n)
= fen)
- 2f'(n).
=
the integer part of} fen), and
Then f' is V[x]-generic dominating, and x' is a Cohen
V[x] [f'J-generic member of 2w. over V is meagre in V[x] [f'J.
By Theorem 5.3 (iii) the set of reals not generic In fact the proof there tells us more, namely that
there is a sequence (o(n) : nEw> satisfying (a) and (b) of 6.4. o(n) is just taken to be q ~ Of' (n)) ~ x. By Lemma 6.4 there is in V[xJ[f'] [x'J a V-generic ultrafilter
n
on~.
Corollary 6.6: Proof:
This is what was required.
[{x: x is not generic over V} is meagre]lB
t.
0 ['
This follows from Theorem 5.3 (ii) and Theorem 6.5 (ii).
7. Conclusion. The results of this paper only represent a small beginning of the study of calibre on the real line, and the connections between Lebesque measure and Baire category. We mention here some of the open questions which interest us. 1)
2)
Show that (v) of Theorem 4.4 is equivalent to ''IE has cal ibre Jl
Show that ifJlis an inner model of ZFC, R has measure (Similarly for 1fmeagreH and "random") .
a
<>
~
a
"
there is a
Cohen JJ -generic real. 3)
Show that {x : x not random overJi} has measure
a
<>
there is ano~
generic ultrafilter on the amoeba algebra. There are several questions about isomorphisms left unsettled by Theorem 6.5.
The main one is 4)
Show that [
~
E.
A few results not mentioned there are quite easy to establish. For example, if lB' is the category algebra in VO, D 0' lB' ~D. On the other hand, B 0' ~, ~ [, assuming A~ (an assumption which is presumably unnecessary). For [ has calibre ~l' by Lemml 5.1, and so iflB 0' [' ~ [, [ [ I has calibre ~1]lB = n, by Theorem 4.3. However, Harrington has shown that [[' has calibre ~l]lB = ~.
612
JOHN TRUSS References
[1]
J. Bell. Boolean-Valued Models and Independence Proofs in Set Theory,
[2]
P. R. Halmos.
[3]
S. H. Hechler. On the Existence of Certain Cofinal Subsets of American Mathematical Society, Proceedings of Symposia in Pure Mathematics, Vol.13, Part II, 1974, 155-173.
[4]
K. Kunen and F. Tall. Between Martin's Axiom and Souslin's Hypothesis, to appear.
[5]
D. A. Martin and R. M. Solovay. Internal Cohen Extensions, Annals of Mathematical Logic 2 (1970), 143-178.
[6]
J. Roitman.
[7]
R. M. Solovay. A Model of Set Theory in which Every Set of Reals is Lebesgue Measurable, Annals. Math. 92, (1970), 1-56.
[8]
R. M. Solovay and S. Tennenbaum. Iterated Cohen Extensions and Souslin's Problem, Annals. Math. 94 (1971), 201-245.
Oxford University Press, to appear.
Lectures on Boolean Algebras, Van Nostrand, 1963.
Adding a Random or Cohen Real; Topological Consequences and the Effect on Martin's Axiom, to appear.