Lecture Notes in
Mathematics Edited by A. Dold and B Eckmann Subsenes Adviser
Institut de Mathematique, Universite de Strasbourg P A Meyer
1193
Geometrical and Statistical Aspects of Probability in Banach Spaces Actes des Journées SMF de Calcul des Probabilités dans les Espaces de Banach, organisées à Strasbourg les 19 et 20 juin 1985
Edited by X Fernique, B Heinkel, M. B. Marcus and P.A. Meyer
Springer-Verlag Berlin Heidelberg New York Tokyo
Editors
Xavier Fernique Bernard Heinkel Paul-André Meyer Institut de Recherche Mathématique Avancée 7 rue René Descartes 67084 Strasbourg Cedex, France
Michael B. Marcus Department of Mathematics Texas A & M University College Station, Texas 77843, USA
Mathematics Subject Classification (1980): 46620, 60B05, 60B 10, 60B 12, 60F05, 60F 15 1 60F17, 60G 15, 62D05, 62E20 1SBN 3-540-16487-1 Springer-Verlag Berlin Heidelberg New York Tokyo ISBN 0-387-16487-1 Springer-Verlag New York Heidelberg Berlin Tokyo
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically those of translation, reprinting, re-use of illustrations, broadcasting, reproduction by photocopying machine or similar means, and storage in data banks. Under § 54 of the German Copyright Law where copies are made for other than private use, a fee is payable to "Verwertungsgesellschaft Wort", Munich by Springer-Verlag Berlin Heidelberg 1986 Printed in Germany Printing and binding: Beltz Offsetdruck, Hemsbach/Bergstr. 2146/3140-543210
Préface Le calcul des probabilités dans les espaces de Banach est actuellement un sujet en plein essor auquel des rencontres internationales sont consacrées régulièrement depuis une dizaine d'années. Les 19 et 20 juin 1985, une trentaine de spécialistes de ce sujet se sont réunis a Strasbourg sous
le
patronage de la Société Mathématique
de France, pour faire le point des développements les plus récents, notailment en matière de fonctions aléatoires gaussiennes, de processus empiriques et de théorèmes limites pour des variables aléatoires A valeurs dans un espace de Banach. Les principaux exposés de ces deux journées ont été rédigés par leurs auteurs, ce qui a permis de composer ces Actes que la Société Springer a eu l'amabilité d'accueillir dans sa collection Lecture Notes in Mathematics. Ces deux journées ont été assombries par la disparition, le 7 juin 1985, d'Antoine Ehrhard qui était l'un des plus brillants représentants de la jeune génération de probabilistes. Nous avons ressenti cruellement son absence, celle du mathématicien bien sûr, mais surtout celle de l'hoilme de coeur sensible et attachant qu'il était.
Les éditeurs
Table of Contents
BORELL, C., A brief survey of Antoine Ehrhard's scientific work.
1
DOUKHAN, P. and LEON, J.R., Invariance principles for the empirical measure of a mixing sequence and for the local time of Markov processes. GUERRE, S., Almost exchangeable sequences in
Lq , 1 q< 2 •
4 22
HEINKEL, B., An application of a martingale inequality of Dubins and Freedman to the law of large numbers in Banach spaces.
29
LEDOUX, M., On the small balls condition in the central Limit theorem in uniformly convex spaces.
44
LEDOUX, M. and MARCUS, M.B., Some remarks on the uniform convergence of gaussian and Rademacher Fourier quadratic forms.
53
MASSART, P., Rates of convergence in the central limit theorem for empirical processes. SCHWARZ, M.B., Mean square convergence of weak martingales.
73 110
YUKICH, J.E., Metric entropy and the central limit theorem in Banach spaces.
113
A BRIEF SURVEY OF ANTOINE EHRHARD'S SCIENTIFIC WORK Christer BORELL Dept. of Mathematics Chalmers University of Technology
Göteborg, Sweden
For a couple of years Antoine Ehrhard gave us pleasure with a series of brillant ideas on Gaussian measures and convexity. The loss of him is the loss of a very seriously working young mathematician as well as the loss of a very good friend. For a complete list of Antoine Ehrhard's publications, see at the end of this survey. As a background to Ehrhard's scientific work it is appropriate to recall the Laplace-Beltrami operator and its relations to isoperimetry. Needless to say, this fascinating area is far from completed and, indeed, it seems very hard to unify since it is rooted in so many branches of pure and applied mathematics. In particular,
P = 2
like the Laplace operator L=
-
A + x.7
R
in
n
, the so-called number operator
Rn merits its own study. This central point underlines most of
in
Ehrhard's papers. To master the number operator and isoperimetry Ehrhard first introduced the
k-dimensional Gaussian symmetrizations and he developed a streamlined
so-called
machinery of general interest
FM].
For brevity, we only recall the definition of
n-dimensional Gaussian symmetrizations. Suppose
p
is the canonical Gaussian measure in
R
n
i.e.
p(dx) = e - I x 1 2 / 2 dx4(-27F n and let
1-;%=<.,h>
h E Rn be a fixed unit vector. Set
• Then to any
fE Lo (p)
n
there exists a unique non-decreasing function
f = goh
such that
f(p)==g(h-' (p)). The
n-dimensional Gaussian symmetrization of
is called the
in the direction of h . Here, if
gE Lo (li'(p))
f
is the indicator function of a set,
f
f becomes
the indicator function of an affine half-space. The number operator
L
is related to Gaussian Dirichlet integrals as follows :
.rf(Lf)dp = n7f1 2 dp , f E COR n ) • Ehrhard's perhaps most central result states that the integral SF ( 1 7 f1 ) 4 decreases, in the weak sense, under Gaussian symmetrizations of decreasing convex function
F :
R
f
for every non-
EASE]. The, familiar isoperimetric
inequalities for torsional rigidity, principal frequency, and Newtonian capacity thereby get their Gaussian counterparts now with affine half-spaces as extremals EASE].
2
The same source of ideas also led Ehrhard to a very neat proof of the Gross logarithmic
Sobolev inequality [LN] and to an inequality of the Poincaré type FASE]. Under the leadership of Professor Xavier Fernique, Antoine Ehrhard very early became familiar with the Banach space aspect of stochastic processes, which has been of greatest significance to Ehrhard's maturity as a mathematician. For an Emportant joint publication, see 1- GR 1I . Another result stemming from this background is the following remarkable inequality :
-1
(1)
(p,(etA+ (1-013)) 0 ,1
-1
( p,(A))-1-(1-0
-1
(11(B)) , o< e< 1 , A,B convex,
where
2 ,,
a
r
(a)
TMS].
e
In particular, if A
the heat equation in A
x (01
(int A)
, then the function
u
is a convex body in Rn and if
x -1
R
satisfying
+ (u(.,t))
u 0
is the solution of
on (BA) X R
and
+ is concave for every t>0 [MS].
u..1
on
Finally, in his last paper FAIP] Ehrhard investigated the case of equality in
as well as in several other inequalities for Gaussian measures. Thus, for
(1)
example, if occurs in
A,B (1)
4 Rn
are non-empty convex domains, Ehrhard proved that equality
if and only if either
A
B
or A and
B
are parallel affine
half-spaces. The arguments leading to this very definite result are extremely pene-
trating and mixed with youthful enthusiasm and conviction. The loss of Antoine Ehrhard is an irreplaceable loss to the area he so succes-
sfully invented. The scientific progress will now proceed much slower and with much less substance, too. But for ever, we will remember an artist ; an artist painting with convex bodies and the Gaussian law.
PUBLICATIONS [CR I : (en collaboration avec X. Fernique). Fonctions aléatoires stables irrégulières.
C.R. Acad. Sc. Paris, t.292, Série I (1981), 999-1001. Régularité des fonctions aléatoires stables. Thèse de 3ème cycle (1982), 9-43. Lois stables et propriété de Slépian.
Ann. Sc. de l'Université de Clermont 71 (1982), 81-94. Sur la densité du maximum d'une fonction aléatoire gaussienne. Séminaire de Probabilités XVI, 1980/81, Lecture Notes in Math. 920, 581-601. Une démonstration de l'inégalité de Borell. Ann. Sc. de l'Université de Clermont 69 (1981), 165-184.
3
FM]
: Symétrisation dans l'espace de Gauss. Math. Scand. 53 (1983), 281-301. Un principe de symétrisation dans les espaces de Gauss. Probability in Banach spaces IV -Oberwolfach 1982- . Lecture Notes in Math. 990, 92-101.
EASE: : Inégalités isopérimétriques et intégrales de Dirichlet gaussiennes. Ann. Scient. Ec. Norm. Sup., 4è série, t. 17 (1984), 317-332.
: Sur l'inégalité de Sobolev logarithmique de Gross. Séminaire de Probabilités XVIII, 1982/83, Lecture Notes in Math. 1059, 194-196.
FAIP] : Eléments extrémaux pour les inégalités de Brunn-Minkowski gaussiennes. Annales de l'Institut Henri Poincaré. Vol.
22 n° 1 (1986) , 149-168.
Sur la densité du maximum d'un processus gaussien. Thèse d'Etat (1985), 87-104.
Thèse de 3ème cycle (soutenue à Strasbourg le 12.02.1982) : Fonctions aléatoires stables. Densité du maximum d'une fonction aléatoire
gaus sienne. Publication de l'IRMA de Strasbourg, no 156. Thèse d'Etat (soutenue à Strasbourg le 24.05.1985) : Convexité des mesures gaussiennes. Publication de l'IRMA de Strasbourg, n° 273.
INVARIANCE PRINCIPLES FOR THE EMPIRICAL MEASURE OF A MIXING
SEQUENCE AND FOR THE LOCAL TIME OF MARKOV PROCESSES. P. DOUKHAN , J.R. LEON
**
ABSTRACT. We show an invariance principle for the empirical measure of a stationary strongly mixing sequence indexed by the unit ball of some
Sobolev space H s .
We
also obtain invariance principle and law of iterated logarithm for the local time of
Markov processes indexed by H s . We note that the regularity condition random variables with values in a compact in the continuous case of the
brownian
s >
d/2 in the first framework for
riemannian
motion on
manifold
E
becomes
s >
d/2-1
E.
** Université Paris-Sud U.A. CNRS 743 "Statistique Appliquée" Mathématique, Bit. 425 91405 ORSAY (France)
Universidad Central de Venezuela Facultad de Ciencias Departamentado de Matematicas Apartado Postal n° 21201 CARACAS (Vénézue1a).
5 1 , INTRODUCTION
This work is divided in two parts. The first one is devoted to investigate a rate of convergence in the weak invariance principle for the empirical process X n of a strictly stationary strongly mixing sequence IC I( ; k=0 1 1,...1 valued in a
metric space E indexed by a compact class F
of
functions satisfying an entropy condition : X (f) = n
2 L (E,p) 1
n
E
VT1 k=1
of uniformly bounded
[f(
k
)-Ef(
k
)1, fE F .
The typical case is obtained for a d-dimensional riemannian compact manifold
E
with F unit ball of the Sobolev space H s of the manifold (see Giné [14]) ; a
result can be shown only if
s > d/2.
In this discrete case we expose some of the
results of [10] made in collaboration with Frederic Portal. Rates of convergence essentially depend on the entropy condition for F . The second part of this paper studies the asymptotic behaviour of
2 1 f(X ) du, fEL (p). Here fX t > 01 is a continuous parameter u t /6 J o recurrent ergodic stationary Markov process with values in a compact riemannian Z (f) = n
manifold
E
or in 0 ; p
denotes the invariant measure of the process.
We first
give an invariance principle in a general framework. We also study the case of the brownian motion on
E ;
classe F , unit ball of on R
d
we give an invariance principle and a L.I.L. uniform on the H s for
s > d/2-1.
Finally we discretize the Z n,A (f) =
We also study the case of diffusions
Zn process
E f(x. KA ) in 0
by
'
: fE F.
We give condition of uniform convergence of this discretization. Our result can be compared to those of [7], which works with the non-uniform case of one dimensional
diffusions. Our speed of discretization is quite lower but
our result is uniform on F. The process Z n (f) has been studied for the case of E
by Baxter and Brosamler [4];
L.I.L.
they prove non uniform central limit theorem and
They use, as we do, mixing techniques.
Battacharya ([3]) extends their
results to a non compact case with martingale techniques. A first uniform result, based on a martingale approach, is given by Bolthausen [5] motion on the
for the case of brownian
d-dimensional torus ; his results are extended here for a general
6 riemannian D.
compact manifold. The non uniform problem is also considered by
Florens [13]
for the case of one dimensional diffusions.
We use, in this paper, mixing and Hilbert space techniques which are by-pro-
[10]
ducts of
for example.
[21] .
shown in Rosenblatt
6(
follows
; v > 01 —
v
ay if a y --->0
a and
F
The
Markov
case satisfies a mixing assumption as it is
The mixing notion used here is strong mixing defined as
is said to be strongly mixing, with mixing coefficient
where,
e > 0 = Sup {1P (An B) - F (A) IP (B) r ; A E F 8 , BE Fc°e+v' — 0
v
t
is the
s
a-field generated by
{x
v
; s < v < -0- • ——
We expose here invariance principles results for discrete and continuous parameter processes to compare them. In the compact seen that regularity conditions
s>
riemannian
manifold case we have
d/2 for discrete parameter becomes
s>
d/2-1
for continuous parameter.
2.
INVARIANCE PRINCIPLE
FOR THE EMPIRICAL MEASURE OF A MIXING SEQUENCE OF RANDOM VARIABLES.
In this section we expose some results of
Frédéric
Portal
;
made in collaboration with
a complete version of this work will appear elsewhere with proofs.
(0 0, 0
Let
[10]
a strictly stationary sequence of strongly mixing random
varia-
-
bles
with values in a polish measured space
c-field
E
of
and
compact subset of
F
is
fak l
C(F)
and there is some
aE]0,1/3]
denote the mixing coefficients of the sequence
valued, where
compact set
p ,
F
is the Borel
a finite entropy
k=1
is supposed uniformly bounded, the law of
density with respect to (here
B(E)
where
p a non negative a-finite measure. Let F L 2 (E,p), we define : n f€F X n (f) = 1 T [f( k )-Ef( k )] , f
The class
(E,B(E)41)
(C(F), 11. 11 0)
is the space of
equipped with uniform norm.
%
has a bounded
such that
fE k }).
contitluous
' Z arK < 00 k=0 The process
Xn
functions on the
7
X n to the centered gaussian
We give an estimate of Prohorov distance of process
with covariance defined for fg E F
Y
by :
CO
Ef(%)g(%)
EY(f) Y(g)
here
T=
WC() )
g
and
E f(f(;) )- T) (g(E,, k )-g) +
Eg(%)
A reconstruction of the process
gives a weak invariance principle with rate
Y
of convergence. The method is based on estimations of central limit theorem rates in Prohorov Xn
metric convergence given In [11] for the finite repartitions of the process
depending on the dimension of repartitions and, from another hand, on estimations of the oscillations of the
X
n
process based on,
F = ff E F-F ;
THEOREM 1. Let
f
2
2
dp <
,
(7E10,1/3]
suppose that there is a
Co
X
satisfying
aGIK
k=0
Let
,
E sup {X 2 (f)
such that :
ff k ; kE Y1
U,V,B 6 are defined for =
p.
2
; fEF } : C {6 U + BV}. (ti —
an orthonormal denumerable basis of
N
LJE4EN
k)
y . 4 ( z e -1 k LJE\kitN
B 6 = Sup
L 2 (E,p) , X
sufficiently big finite subset of (resp.
U=
(res. y =
< a <
5
is positive and satisfies
is the uniform measure on .
'
I\
SuplZ ka Y
) ; TEF e k (f,T k %2 (see [10]) .
2.1. Uniformity test on a d -dimensional com act riemannian manifold p
5
z e -1 Hf u 2 ko k k co)
Remark : The same estimate is valid for the gaussian process
p
values of
the
by :
ffkll 2) ,,,,/
kEN
1-c f 2 (x)) 1/(1 G) p(dx)] k
fe k ; kE Yl
the sequence
p(dx) 11-G J
2,- e,(f,f,) 2 ; fE F kON
dv
sequence
( z f 2 (x 0I/(1-(7)
U
Here
fE k l
then there is a constant only depending on the
E
and the law
y
of
E.
C o satisfies :
We extend results of Gin6 [14] in a mixing framework .
The test of Gin6 rejects hypothesis
uy
pi'
for big values of
:
8 T (s) (w) = n
Laplace Beltrami operator A
H -s
Ek '
H
=( k=0
: v n = -n1
ak
. co
Tr k
is the orthogonal pro-
s Sobolev space of the manifold E(s E ) (f,f.) 2 ) 72 . We write v the empirical n ; k = 0,1,...1
The sequence
satisfies
. Xn
For this we consider the
eigenvalues of the
is the ordered sequence of
f j EE k
n7 k=1
, k=0,1,...1 <
Sup fla k ask /2 1
k=0
eigenspace E k and
with
2 v)11 -s '
(a k (Jks/2 rk) (vn (w)
being the index
s
whose norm is If H
random measure
co
Tak ; k=0,1,. ..J
The sequence
jection
H
:
process which can be written
x(f) = /n f f d(vn -v) . THEOREM 2. If there exist O <
a
a
= 0
(
n
for
sequence of gaussian processes
for
s > d/2 ,
(Sup
{Ix(f)
TY n ; n=1,2,...1
Yn (f)I ; fE
Bs }
with the law of
> pn ) < p n
"v = p"
We use this result to show that, if
X2
fying Trk (v) / 0 for n-1/2 of (T (s) -nC)
random variables
k
some
such that
gaussian for
is
such that :
Y
n
for
and
Bs
T (s)
is
Hs '
denotes the closed unit ball of
a sum of dependent
and
co, we can construct an identically distributed
pn = 0 (0,n(Zn n ))s/d (zn n)(d-2s)/(3d) )
here
n 2 ana <
satisfying
n=0
2/(3b)+8/15 )
F
b < 1/4, 0
;
then the limit law of
from another hand, for
ak / 0
"v p" satis-
then we show that the limit law
some constant
C .
So that the test is consistent.
In the case when
E = S1
continuous functions with index
2.2. Invariance_principle Here
E = 11,2 1
,p
for the is
equiped
is the unit circle
s > 1/2
61der
we obtain a more precise statement.
empirical measure
Lebesgue
with a set of
measure on
on the
R1
real
line.
and
OD
F = {f €L 2 ( 1 ) ;
(k+1) s (f,h k ) 2 (c.f. [23 ] ) . k=0
Hermite functions
K
11
where
fh k 1
the sequence of normalised
9
Under the same mixing assumptions than in theorem 2, we obtain
THEOREM 3. [9] .
pn
n) (5-65)/18 )
0 (An (In n)
for
n
s > 5/6 .
cc if
:
Remark
• The main interest of this result is to escape from a compact context. • In
[10] and [9] invariance principles for non parametric estimators are obtained
with the same methods. Kernel and projection estimators of density and regression
F
function are considered. In this case the class
n.
varies with the index
3, INVARIANCE PRINCIPLE AND L.I.L. FOR THE LOCAL TIME OF CONTINUOUS PARAMETER MARKOV PROCESSES, {X t ; t > 0}
We write
complete separable metric space stationary with marginal law
Markov process with values in a
a continuous parameter
E.
p .
This process is supposed to be homogeneous and
unbounded non-negative linear operator, whose domain, satisfies
L
L
Moreover, its infinitesimal generator
D(L)
is dense in
is an
L 2 (p), and
:
is self-adjoint and onto.
L
(ii) the spectrum of
(iii)0
a single
is
is discrete.
eigenvalue of L associated to the constant eigenfunction 1 . L 2 (p)
Under those hypotheses, the Hilbert space
11.11 (resp. (.,.)) its norm (resp. scalar product) 2 an orthonormal basis of L (p) such that : Lf =
A
m=0
m
(f,ejo m m L
Thus the spectrum of
for
is
fED(L), here
fAm ; m 7 01 •
and
A0 = 0 <
is separable
; we write
{em ; m = 0,1,2,...1
AI
< A 2 <... and
is
eo =1 .
The semi-group P t associated to the
Markov process, defined by co
P f = t
Condition
m=0
(P t f,g) =Ef(X t ) g(X 0 ), satisfies : -A t e m (f,em) e m , fE L 2 (p) , t > 0 .
(iii) implies ergodicity of the process
proposition
2.2) .
{X t ; t > 0}
The operator P t is a contraction verifying
(cf.
Battacharya [3],
10
-
< e
Alt
fE L 2 (
for
Mf11
is strongly mixing with a t
< c e -A l t
for a
0; thus the process
c>0 L 2 (u)
is defined on
G
The Green operator
, (f,1) -
.')
(Rosenblatt
Gf =
by
t > 0}
{X t
[21 1 ) .
P,f dt
if
fE 1
and
Jo
G1 - 0
:
thus
CO
Gf =X À -1 (f,em) em , fE 1 1 m=1 / This operator is continuous on L 2 (p) with range D(L)nl LG=I-S where I is the identity operator on L 2 (u) and S is jection
on
eo
(St
H. II s (resp. (.,.) s )
Hs
The operators Note that
Hs
Lr
Gr
is the range of
the dual space of
HT11 _s=
Sup
We write
s
{!T(f)I ;
(resp.
G s 12
211/2
L s/2 .
and the domain of
[20]
e VII S = 1}
f EHS
an
H -s
f(X ) o u
(*
= t illf, 1
A
117!
Indeed,
E 11 7 nII -s 2
2 C [
.)) 1/2 (T(e m )) f
A-(1+s)
m
<
E 11Zn (f)11• 2 <— n 11f11 2 • s > 0 verifying :
ce
2 AS
E m=1
m
in
r n e (X ) Jo m u
functionnal :
(p
because
)
m=1
f E1
du
valued random variable for cc
H -s
Hs .
n L 2 (p)
.
re
We also write
The aim of this work is the study of asymptotic behaviour of the
It is defined on
for
is an Hilbert space with the norm
the unit closed ball of
Zn (f) = ,
scalar product)
E ),sm (f ,e ' m)
are formally described by Seeley
) , it
1
(H s ,
with norm
; HfIr s
As
and
s>0
r
2 m (f 'e m) <
E m-1
2 ()
for
:
defined by ce
{f€
the orthogonal pro-
= (f,e o )e o ) .
We consider the Hilbert space
Hs =
and verifies
du)
We consider
7n
as
11
z AS m=1 m . in E 1 Z n il2-s - 2 m E_ l A-s m j o (1 . Il 2 _ 2 z A -(1+s) _ E HZ n" -s m=1 m
E 11Z n Hu 2-s
2
- 17-,4 ) (P u em , e m ) u
-A
)e m
-
du
2 mx A -(2+s) n m-1 m
(1-e
m)
Brosamler ([4] ,
is analogous to those of Baxter and
This calculus
du
theorem
(4.3)). gaussian
We now define a
random vector
Z
on
H_ s .
For this we assume the
following technical hypothesis:
[0,1]
There is a uniform random variable on the interval
defined on the same
(H)
(Q,A,
probability space
i.i.d.
We consider an with variance
1
)
I'
that Ol t
s
CO
-(1 +s) < m .
2 i m-1
A, satisfying
(*),
L
gaussian
centered random variables
(*) :
1 / 0
,
Z
An abstract construction of
whose trace is
and independent of it.
A -I/L (f, e) m m=1 m
E liZt1 2_ s = 2 E X rn m=1
Note that
of
satisfying
œ
Z(f) = i2
hypothesis
( m ) 111., 0
sequence
and we define, for
; t > 0}
thus
f€ B s
Z c H -s
a. s .
can be made using the operator We define a
gaussian
measure
T = 2 G l+s
M
H -s ,
on
under
:
(z,T 1 )_ 5 (z,T 2 )_ 5 M(dz) - (r T 1 ,T2 )_ 5 , T1,T 2 H_ s
.
H -s
The law of
Z
is
M.
We also note that measurability condition of the separability of
is satisfied here because of
E.
The random variables
Zn
Bs
THEOREM 4. The sequence
of
Z,
and
valued processes. Here we note the compact subset
[3]
L 2 (p)
C(B s )
H
valued, can be considered as
the space of continuous functions defined on
and real valued
{Zn ; n 7 1
C(B s )
equiped
with uniform norm.
} converges in distribution to
under the hypotheses (1), (ii), (iii) and (*) .
Z
in
C(B s ) ,
12
Proof. Under the former hypotheses, Battacharya shows convergence of finite repar-
titions ([3], result from Let
remark
flattly Fin cH -s F
We note
m
2.1.1.).
concentrated property using De Acosta's method be the
= IT E H
m-dimensional T
-s
11 = {T EH_ s
implies
e
k
=0
c-vicinity
F E the
Note that
{Z n ; n > 1}
The tightness of the sequence
Mm E c:FmE
Fm
T ee = 0 ,
and
H -s
in
[1] .
:
space defined by
.
for k = 0 and k >
of
will
:
and
E X: s (T e k ) 2 < c2 1 k=m+1 "
E Z 2n (e k ) < 2/A k '
Bienaymé-Tchebicev
so
inequality
:
P (Z n E F1E11 ) > p (Z n
MrEr )
P (Z n E E mE ) > 1 - P
1( 4'114.1
P (Z n E F m E) — > 1 - E -2 P
Zn
Thus the sequence
is
E
)
A-s E Z 2 (e )
flattly
n
k
k=m+1
> 1 - 2,-2
(Z n
2
,-s z 2 te ) n ' k' k
A -(1+s) k k=m+1
7
concentrated
COROLLARY 5. The sequence of real random variables
x2
tion to VI! , infinite sum of weighted -5
Remark : Using the direct construction of
Z,
;
Hz n
theorem
m -s2
4
follows.
converges in distribu-
random variables.
note that
-(1+s) 2 : MZH 2 s - 2 E A m m=1 -
(;11 ) m>0
being an
i.i.d.
sequence of normal random variables.
In view to investigate iterated logarithm behaviour of Z n , we now make a direct construction of the
brownian
Levy's construction
([16], 1.5,
14,k ; n . 0,1,..., k
p.
19).
law of
Z.
We use the
Let
k - 1,...,2 n -11
k < 2 n t < k+1, = 0
M,
the Haar basis of
L 2 [0,1]
defined by
Z n k an i.i.d. array of reali' N rt defined on (O,A, P ), we write Z (t) = E E Z x (u) du . N n=0 k n,k j e n,k serie converges normally a.s. Note, for this, that :
xnk(t) , - 2 zations of Z This
odd,
process Z . with base
else, and
:
13 EN =
Sup
f l!ZN (t)-Z N-1 (t)11 -s ; 0 < t < 11 < 2 -(N4-1)/2
From another hand
:
such that
= E exp (-
A
Borel-Cantelli quence
Fernique ([12],
theorem
.
Hz11 2 5 /a 2 ) <
Thus
Max
1.3.2.)
H Z N,k -s ;
P(E N >
covariance is computed as
< 2N ) > 0
a
2 -(N4-1)/2 4n(2 2N )) < Z(t)
A
2 -N .
of the se-
:
funv E(Z,U) - s (Z,V) _ s j Xn,(w) d w
E
=
odd
shows that there is an a
lemma implies then the continuity of the limit
Z N (t) . The limiting r ■, E (Z(u),U) - s = (Z(V),V)
k
n=0 k odd 0
E (Z(u),U)_ s (7 (v),V)_ s = unv E(Z,U)_ s (Z,V)_ s Let
Z(t),
we
part of
fZ(n,t) ; 0 < t < 11 an i.i.d. sequence of continuous realizations of [ti — set Z(t) = Z(k,l) + Z ([t], t-[t]), where [t] denotes the integer k=1 t . The process {Z(t) ; t > 01 is the brownian process with basis M ;
it belongs to
C
(*).
(]R ,H 5 ) under condition
Z(t) (2 t Zn[Zn t]) -1 12 for t 4- co iS the H ..s (B 1 = {fEL 2 (P) ; I fil 1 5- 1 } ) under condition (*)•
PROPOSITION 6. The a.s. cluster set of
4 B 1 of
compact subset
Proof. We see, like results of
3.1.
Kuelbs
Bolthausen ([5]) ,
and Lepage
that this set is
vf (B s ) = v2 B 1
using the
([18]) .
Brownian motion on a compact riemannian manifold.
E
The space
{X t ; t > 0 } LEMMA
is the
(i).
7.
is here a d-dimensional compact
For
brownian
motion on
$ > d/2,
1 1 Z111 2
E
manifold and
is the uniform measure on
and p
s is a
riemannian
E.
bounded r.v.
(ii). v6, s > 0 , s > d/2 - 2/(2+6)0 E Remark
: .
For
s =
d/2 the
.
For
s>
d/2-1
Proof. Note first that manifold
where
E.
c > 0
The
r.v.
moments of every order.
Z E H
0
there is a 6 >
L = -
eigenvalues
A, where
of
is some constant.
A
verifying
(ii).
is the Laplace operator of the Riemann
L , (A m ) m>0
satisfy
From another hand,
ArnA,c
m
Giné ;[14])
2/d
([19])
shows
:
for
ni,-›
14 > d/2
VS
, ]C > 0 , , 1 2-5 - inl E A-ill ' o
[f
Hz 1 11
0)
m7 1 11
E
E E ,
em
(X uU )
Nx il 2 s < C 2 du]
rl , r1 2 em (X u) du . j1 V X 11 2_ s du < C A-m j1 o m=I u o "
11Z 1 11 2...s .'5_ 1 +
(ii)
ex
(1 uo1l em (x ii ) du)2 j 1+6/2
A m-s
= ELf mi
h,k > 0
Let
E dy 2-Es 6 < E A B where \216/2 (r1 ( X ) du) < C 6/2 em " u )
inequality implies
- hp
F
A=
m
m=1
,
because hp
Holder
> d/2, and 2
f i em (X u ) du \) 2 Xm-k p ( jo
B=
(2+6)/6, q = (2+6)/2 ,
h+k = s and p
satisfying
,
e (X ) du )
E (i
and
mn
Jo
m
2
A
-1
thus
u
EB < 2 7
m=1 Setting hp
= kp + 1,
Z n can be rk 7 e (X ) du' 7 [1 1(k) = m=1 J k1 = Note that co
with Z
(1)
1 çb-mixing lemma
. Z1 .
(With O n
c° A -s-2/(2+6) 0 1 2 . Il 2+6 , 2 7 E 1Z 1 ' -s 2-'m=1 m rewrite Z = n -1/2 ,(7(q 1 .• . 4, z n» ) , where n e EH is a stationary sequence of random variables
we have
Brosamler ([4])
Moreover, Baxter and
< c
an
, 0 < a < 1).
show that this sequence is
Thus the following theorem
8
will result from
7 and :
THEOREM A.
(Dehling, Philipp [8]).
Let
{X , V> 1
}
a strictly stationary sequence
of random variables with values in a separable Hilbert space
tation and having a strongly mixing with
(0 <6 < 1).
(2+6) -order finite moment an =
0 0 -(1 +E)( 1+2/6 ) )
together with a brownian motion
E x - x(t)11 =
{ZW ; t > 01
o
(vt
n
at expec-
If the sequence is
for flœ it can be reconstructed
with covariance
probability space such that :
V
H , centered
(Rn t) )
a.s.
F
on another
r
Here
is defined by : CO
E f(x,x 1 ) (y,xv ) + (x,x v ) (y,x 1 )}
(Fx,y) = E(x,x 1 )(y,x 1 ) +
.
v=2 THEOREM 8.
Let
S
>
Z n (2 Zn(2,n n)) -1/2 is conditionally
d/2-1 , the sequence
compact in It s and its a.s. cluster set for
closed unit ball of H1 There is some
52
n-).-x)
B i is the
1/2 B i where
is
'
with
o
11) (S-20 )
1
-
and :
Vw€S20 , VfE H s , 1im [Z(f) (w) (2 Zn(a n)) -1/2 ]
Proof : Using lemma 7 we see that theorem us to make a strong approximation. of theorem.
The second part
A
works for the process
it allows
With help of proposition we get the first part
follows from conditional compactness of the sequence fE H s which is another conse-
and from a.s. convergence of the upper limit for any
quence of theorem applied with have
Zn ;
H =P
xv = Z (I v) (f).
and
(2+6)-order moments because E 17(k) 1
Those random variable
(f) 2+6 < rf ! 24,5 E I117(k)11 24-6 _ il 1 s I 1 II '
we
so that a brownian (UL with same law that (7 (1 1() '')l'../_' motion W(t) such that U, - W(t) a = o (/t .0'1(.0 t) ) a.s., a - /2 IfIl _1 • k
Remarks. . Theorem 8 closes the conjecture (8.11) of [4] for the case of the brownian motion on a compact riemannian manifold. The case of torus T
rity condition
d
, studied by Bolthausen ([5]) , leads to the same regula-
s > d/2-1. The method used is there martingale theory.
. Conservative diffusions. If
t
; t > 01
for some vector field
is a diffusion on V . Let
E
with infinitesimal generator L = -A +V,
v(dx) = 0(x) p(dx)
the invariant measure of this
process (c.f. Ikeda, Watanabe [15]),
the function 0 is solution of
L
If the field
is the adjoint of
L
in
1_ 2 (p).
V
is C
and strictly positive. Operator L is self-adjointed in
L 2 (v)
(c.f. [15]) . Giné [14] shows that the Sobolev norm with index 2 L (v)
is equivalent to 114
valent here.
If the eigenvalues of
L
in
because norms on L
2 L (v)
L 0 - 0, where
0 can be choosen C c° if
s and
V = 2v zn(0) associated to 2 L (p)
are equi-
have the same asymptotic behaviour than those
16
of -A
,
then an analogous of theorem
proof of lemma
3.2.
7
is true for such diffusions
;
indeed the
is still valid here.
Diffusions on Results of
8
Pd
§ 3
Rd solutions of a stochastic
are applied to diffusions on
(S.D.E.) :
differential equation
dX t = 0(X t ) dt + dW t {W t ' • t > W
Here
is a
brownian
([6]).
We write C
real functions defined on
Ed
the space of
E (f,g) = D(7)
of
E
1f11 If the form
E
6
is a function. Such
k
[2] ,
and Streit
([2 ])
time differentiable
2 (x) dx
p(dx) =
and with a compact support. Let
a absolutely continuous law, we define, like
The domain
and
Albevério, Hoegh-Krohn
diffusions have been studied, for example by and Carmona
Rd
motion on
the bilinear form
E
Cl :
on
Vf . Vg dp is an Hilbert space with the norm
(E (f,f)
:
f f2 dp) 1/2
is closed, there is a
self-adjointed
linear operator
L
on
D(v) such that E(f.g) = (Lf,g). It is the case for V0EL (Rd ) ; then 2 C cD(L) , Lf = - Af -6. Vf for fE 2C where 6 = 2 I -1 TID and LI = 0 . If o (n AE L oc then 0 -1 C°03 c D(L ) and L(0 -i f 0-1 Hf for f E d) -1 where H = A + V and V 2 -10c
-
THEOREM B ([2]).
fying
=
2 0 -1
Let
p(dx) =0 2 (x) dx L 2 ,al) , there is a
a law equivalent to Lebesgue measure satisunique solution
dX t = 6(X t ) dt + dW t
{X t ; t > 0 } to the S.D.E.:
.
div 13EL 2loc (Rd ) satisfies div 6(x) > -c i x t 2 c 1 ,c 2 > 0 and (3 E L 4i oc ( d) (D -1 Aci). c Li2 0c (R d ) then the If
is markovian with invariant law The operators H and
L
for some constants
process
IX t ; t > 01
D(L)c:L 2 (p)
have the
p .
with domains
same spectrum. It is discrete if
D(H)c:L 2 ( d )
lim V(x) = +
CO
17 (*) is satisfied with the help of Tamura's result ([24]) dk/2+d/m -I k c > 0 under assumption A m : asserts that lim Condition
i.
ER
> 0
,
ii.
vet
wd 5 3C > 0 ■ = (01, l''"' a d E a
0 < Inf 0.1(x) ; Ix! > R1 < Sup {V(x) Ixl -m ; ix! > R } <
DOE
iii.EC If
COROLLARY 9. assumptions
A
m
> d/2 + dim - 1 H
> 0 p
which
,
Ix!
5 Vx E
„_,c1 itt
,
V(x)1
> R => x.V(x) > C Ixl m
is equivalent to Lebesgue measure,
M > 0
for some
a
and 3 =
V = T -1 AO
satisfies
2 (1) 1 7(D EL 2 (p)nL 410C (1,4 ), -
implies that the sequence
then
Z n converges in distribution to
Z
in
-s . :
Remark
Conditions
A
are satisfied by homogeneous polynomials
m
V with degree
m. A multidimensional
c > 0)
V(x) = c Ix' 2
Ornstein-Uhlenbeck (i.e.
satisfies hypotheses of Corollary
9
if
for some constant
s > d-1 .
We do not get here an iterated logarithm law
(L.I.L.) because the lemma of the
former section is no longer valid. To avoid this problem and obtain a uniform we now reduce the class of functions used. From here we suppose
L 2 (R d )c:L 2 (p) ;
the
Sobolev space of
of Hermite functions is denoted by
Hs
Here
hm
bounded, thus
L 2 (R d ) constructed with tensor products and its norm by
is a bounded random variable with values in
*2 dZi _ s
(/)
L.I.L.
H -s
for
.
The process 7
1
s > d-I/6 : 2
e v- ' ( J1 H v (X u ) du)) d o v N d H ) = h (x ). .h (x = v (xl''"' x d v v 1 1 ' vd d ) ' j=1
for
m-th normalized Hermite function. With the help of the estimate (m -I/12 ,) of Szdgo- ([23]) we see, summing by parts on spheres of N d , that : HZ 1 11 < C d p d -s-7/6 p=1 The strongly mixing property of the X ; t > 01 process implies, by an ana-
and
is the
CO
logous method than for theorem 8,
18 THEOREM 10. For
- 1/6 ,
S > d
there is some
with
Slo c:S2
vwE s•2 o , efE FI s , lim [(Z n (f) (w) -
such that
1P(Q0 ) - 1
f f dp) (2 .0(.0 n))
-1/2 ]
1 Ilff -
/2 *
H
114_1 is valid here because of inclusion
The use of the norm
s
c7H
-1 •
* * Proof : Note thats HL() thus f EH s satisfies integrability condition d < c.c of ([3], theorem 2.7) and individual L.I.L. is satisfied. f "
From the other hand, theorem A still applies. strong invariance principle with speed
Z
process for a
yE
E(Z,H ) *2 -s
satisfies
d ;
Hs
of
,
H
- Fl y )
-s (G(H
-
-
a.s.
Co .
The limit
(x) p(dx)
where Ili
s/2 H y y
Hy =
is
,
H
K
y
- H y ) < 2 A -1 l j
H_ s associated to
of
cluster set of
I
the
6)(
*2 p(dx) < c,c . -s
1
gaussian law
Z.
of
(Z n - EZ n )/ /2 zn(zn n) .
.
The theorem follows
* Cc=11 o s
Note that
) v
Thus we get a compact subset This compact set is the
a
:
and
E(Z,H ) *2s = yEkl d yEE d
the class
)
H -s .
in
its covariance is a trace class operator because
orthonormal basis
Remark.
o ((in (in n)) -1 / 2 )
e -2s (G(H - R
Z n satisfies
The process
, so that this theorem establishes
This result is connected with the conjecture
a uniform
(8.11)
of
L.I.L.
on
[4] .
3.3. Discretization.
6x
Suppose here that
s >
is
{X t ; t > 0}
d/2 and if
a bounded random variable of
is a diffusion on a compact
H_ s ;
for example if
riemannian d-dimensional
manifold this condition is realized. The
discretization
Z n,A For
A > 0
=
of the process
f (X kA ) Z /n 1
fixed, we see that
gaussian process
ZA
Zn
such that
is,
'
[8] implies
for A > 0 : f c Hs Z n,A
converge in distribution to a
: 1/2
vfc H
E(Z (f)) 2 - A T k 2 (AA) (f,e ) 2 , A m=1
if
k(v) = (it24 1-e-v
19
i.i.d.
With the help of an
Z
explicit constructions of
. Z = E m=1 -
,
m=1
E HZA-Z m
riemannian
2s
m
manifold
c As-d/2+1
Prohorov and Levy's distances 0(A D/3 ) for A ,-)-ID , D = s - d/2+1
Then the are
AA
2 m
For a diffusion on a compact
H -s
. E k(À m A) Emm e m=1
Z A - ,/,,
E 1 k(?, mA) - 12771. ?:.'
A
{2 ^ (-21-)} 3
m=1
s > d/2
for A -+ 0, if
of those
d3
,
Thus the A (n) = o (n
n,A discretized
-1/11
gaussian
random variables on
[11]
shows that Du-
for
n÷co ,
AO
if
s>
d/2.
Z
for
A
Z nAk) , n ,,
process
converge in distribution to
If, moreover A(n) = 0 (n-1
,
) r
(n-D/(4D+11)
3 (P Z n,A(n)
For great values of
.
.
) r 0 (A -11/4 n -1/4,)
) when n
d
m
:
can be estimate
d3 (17
AS
:
From the other hand, a precise analysis of the results of dley distance
we give
ZA :
and
v/7-T- Em em m
E IIZA ZIIII _2 s =
Thus
(E ) m>1
sequence of normal realizations
s
this speed is approximately
)
/(4D+11))
•
n -114
then :
20
BIBLIOGRAPHY :
El]
A. De Acosta.
Existence and convergence of probability measures on Banach
spaces. Trans. Amer. Math. Soc. 152, pp. 273-298 (1970).
[2]
S. Albeverio, R., Hoegh-Krohn, L. Streit. Energy forms, Hamiltonian and distorted Brownian paths. J. of Math. Phys. 18, n° 5, pp. 907-917 (1977).
[3]
R.M. Battacharya.
On the functionnal central limit theorem and the law of
the iterated logarithm for Markov processes. Zeit. far Wahr. und Verw. Gebiete 60, pp. 185-201 (1982).
[4]
J.R. Baxter, G.A. Brosamler. Energy and the law of the iterated logarithm. Math. Scand. 38, pp. 115-136 (1976).
[5]
E. Bolthausen.
On the asymptotic behaviour of the empirical random field of
the Brownian motion. Stoch. Pr. and their Appl. 16, pp. 199-204,(1983).
[6]
R. Carmona. Processus de diffusion gouverné par la forme de Dirichlet de l'opérateur de Schreidinger. Séminaire de probabilité XIII, Strasbourg 1977-1978, L.N.M. 721, pp. 557-569 (1979).
[7]
D. Dacunha-Castelle, D. Florens. Choix du paramètre de discrétisation pour estimer le paramètre d'une diffusion. C.R.A.S. Série I, Paris, t.299, pp. 65-69 (1984).
[8]
H. Dehling, W. Philipp. Almost sure invariance principles for weakly dependent vector-valued random variables. Ann. of Prob. 10, pp. 689-701 (1982).
[9]
P. Doukhan. Fonctions d'Hermite et statistiques des processus mélangeants (Submitted to publication, 1985).
[10]
P. Doukhan, J.R. Leon, F. Portal . Principe d'invariance faible pour la mesure empirique d'une suite de variables aléatoires dépendantes.
(Submitted
to publication, 1985).
P. Doukhan, J.R. Leon, F. Portal. Calcul de la vitesse de convergence dans le théorème central limite vis ,sa vis des distances de Prohorov, Dudley et Levy dans le cas de variables aléatoires dépendantes. Prob. and Math. Stat. VI.2 , 1985.
21
[12]
X. Fernique. Régularité des trajectoires des fonctions aléatoires gaussiennes. L.N.M. 480, Springer (1975).
[13]
D. Florens. Théorème de limite centrale des fonctionnelles de diffusions. C.R.A.S. Série I, Paris, t. 299, pp. 995-998 (1984).
[14]
E. Giné. Invariant test for uniformity on compact riemannian manifolds based on Sobolev norms. Ann. of Stat. 3,
[15]
pp. 1243-1266 (1975).
N. Ikeda, S. Watanabe. Stochastic differential equations and diffusion processes. North-Holland, Tokyo (1981).
[16]
K. Ito, H.P. Mc Kean. Diffusion processes and their sample paths. Springer Verlag, Berlin (1974).
[17]
J. Kuelbs. Kolmogorov law of the iterated logarithm for Ranach space valued random variables. Illinois J. of Math. 21, pp. 784-800 (1977).
[181
J. Kuelbs, R. Lepage. The law of the iterated logarithm for Brownian motion in a Banach space. Trans. of the Amer. Math. Soc'. 185, pp.253-264 (1973).
[19]
S. Minakshisundaram, A. Pleijel. Some properties of the eigenfunctions of the Laplace-operator on riemannian manifolds. Can. J. of Math. 1, pp. 242256 (1943).
[20]
L. Nirenberg. Pseudodifferential operators. Proc. of Symp. in pure Math. XVI. Global Analysis, A.M.S. Providence, pp. 149-167 (1970).
[21]
M. Rosenblatt. Markov processes. Springer Verlag, New York (1971).
[22]
R.T. Seeley. Complex powers of an elliptic operator. Proc. Symp. in pure Math. X, A.M.S., Providence, pp. 288-307 (1968).
[23]
G. Szege% Orthogonal polynomials. A.M,S. Providence (1939).
[24]
H. Tamura. Asymptotic formulas with sharp remainder estimates for eigenvalues of elliptic operators of second order. Duke Math. J. 49, pp. 87-119 (1982).
ALMOST EXCHANGEABLE SEQUENCES IN
Lq , 1
q < 2.
Sylvie GUERRE Equipe d'Analyse
U.A. N° 754 Université Paris VI 4 place Jussieu 75230 PARTS CEDEX 19 Tour 46/0 - 4éme étage
Let
(0,03,P)
Lq = Lq (Q,B,P) , 1 s q < +
be a probability space and
• We
consider the two following problems : Problem 1 : Let
(X ) n nE Ti there exist a subsequence
[(X ) nk kE W
(X ) nk kE u
of
which is almost symmetric ?
(X)
n nE N
is almost symmetric if :
(N) such that : V (u.) E R
Ve>0,AkEll
(1-c) Hz
Lq , 1 5 q < + 00 . Does
be a weakly null sequence in
a . xn
P s Pz a, k-hi
7
(1+6) Hz
P
x
, V
q> 2
P
x
N ,
.
nkfi
nk-hrr(i)
Problem 1 was stated for
permutation of
and solved for
q
E
2N
.
in
Problem 2 : Under the same hypothesis, does there exist a positive density
(X ) of (X ) n nE nk kE W Lq (ydP) such that : X +c' 12h <+ — 1/q q 1Zk k=C ' L (ydP) P subsequence
[We will say that
(Xn)
n
E
and a exchangeable sequence
(Z ) k kE N
(Z ) is i.i.d., we will say that k kE V after the change of density y]
(X ) n nE N
is almost
This definition comes from [2] . The following implications are well known :
(X ) nk kE
is almost
(X
is almost exchangeable after the change of density
) k kE N
(X) n k kE N
a in
is almost exchangeable after the change of density
If in addition
n
y ,
after the change of density H
is almost symmetric .
By the Finetti's theorem (cf. [2]), we also have that :
y y
y .
i.i.d.
23
(X )
nk
is almost exchangeable
kE TI
(X ) nk kE F
is almost i.i.d., conditionnally to its tail field.
On the contrary, a modification of Example 2 in [2] by Y. Raynaud shows that there exists an almost symmetric sequence in Lq
with no almost exchangeable subsequence
after any change of density.
A lot of results are known about symmetric subspaces of Lq , 15 q<+ co , in particular about those that are isomorphic to L P for some
p • In almost all these
papers, symmetric sequences are in fact i.i.d. or at least exchangeable : this is
the most natural way to find symmetric sequences in L'1 -spaces. I recall here the results that are closely related to the problems 1 and 2 and that motivated these
questions. Kadec-Pelczynski [9]. For q > 2 , every weakly null sequence in Lq : - either is isomorphic to the unit vector basis of L
2
- or has a subsequence which is almost equivalent to the unit vector basis of Lq . Dacunha-Castelle [4]. For 1 5 q < 2 ,
1-symmetric subspaces of Lq are means of Orlicz spaces.
1 7 • (Case
D. Aldous
q =
J.L. Krivine and B. Maurey [10]. Every infinite dimensional subspace of Lcl , 1 5 q < + co , has a subspace which is isomorphic to L P for some
More precisely : If p E [1,q]
there exists
n ) nE N
on (X )
vector basis of
p
is a weakly null sequence in Lt , 15 q<+ co , (X ) n nE N such that for all e > 0 there exists a sequence of blocks is (140-equivalent to the unit (Y ) n nE N i.i.d. after a change of density.
such that
n nE TI L P and almost
This last result answers positively to problem 1 and 2 if we are allowed to take blocks on (X ) and not only subsequences. n nE Recall now partial known positive answer to problem 1 : THEOREM 1. If q> 2 , every weakly null sequence of Lq has an almost symmetric
subeqnc. If 15 q< 2 , every sequence in L , which is equivalent to the unit vector basis of
£2
has an almost symmetric subsequence.
The case q = 1 The case q
E
2N
is due to H.P. Rosenthal. is proved in [8].
The proof of the general case [6] uses the theory of stability
[10] : in stable
spaces, there is a natural way to find almost symmetric sequences. First, recall a
24
:
few definitions A Banach (
space
X
in
y ) m mE El
lim n,2-{
X
and two ultrafilters ""
lim 11x + y 11 = lim Um m m,2/n m,2/ n,2 /
The type a
(x ) n nE F N , we have :
is stable if given two bounded sequences
(xn)
defined by
nE B
7
and
on
and
hx + y 11 . n m and
2i
is a function from
X to
+ R
such that
:
VxEX, a(x) = lim 11x + x il . n n,2,( For (ci) F R
2
, we define the type aa* re.T
by :
aa * DT(x) = 11m lim ix + ax n + ayml 1
VxEX
n,2t m,lr
a
where
(x n ) nE F and
is defined by
7
2l and
(ym ) yriE ii and
by
sit .
Il al l= a (0) = lim Ilx 11 . n,?,/ n
J.,et
The spreading model F3]
defined by
(IN) R
(x ) ...., n n E Iv
under the norm : k + ... + lim, cr x 1 1 Z Œ. LII = Um ... limp 1 1 1 n. i=1 n. n 1 1 k
and
2'
is the completion of
a k x rik H .
((x ) is supposed to have no convergent subsequences). n nE II In a stable space, every spreading model is 1-symmetric. The proof of theorem 1 (x ) n nE 11 the fundamental sequence of its
in stable spaces for a sequence
uses a sufficient condition (S.C.)
have a subsequence which is almost equivalent to
to
spreading model : (S.C.)
:
a
Let
be the type defined by
K l (C)= (7/a kEN 9 aa
1,
... a E R ' k
k
on a stable space X . If (x ) n nE ET such that Ta a * ... * cr 0- and 1 ,-d1 5 11
l
k
is relatively compact for the uniform convergence on bounded sets of
X , then
has a subsequence which is almost equivalent to the spreading model defined (x ) n nE V by (x n ) nE IN and thus almost symmetric. This condition was used by J.L. Krivine and B. Maurey in the case of written in
L P -types and is
[6]. Its proof uses Ascoli's theorem.
be a weakly null sequence in Lq 1 5 q < + co which is (X ) n nE V equivalent to the unit vector basis of £ 2 (even in the case q > 2 , this is the Let now
only case to consider because of Kadec-Pelczynski's that type
result
f 9]).
It is shown in
[6]
15 q< 2 and that the (X ) verifies (S.C.) : suppose for simplicity that n nE r is symmetric r i.e. : V X E Lq , 0- (X)=0-(-X)] . One a defined by (X n ) nE N
can show that
a
is entirely determined by Poll
L 2 (Û x [0,+ co[ , dP 0
t
c -) t c1+ 1
and a function
uœ belonging to
which is the weak limit in that space of
(1- cos t Xn ) nE il. , by the formula : V X E Lq , K (o- (X) q - lio- H q - 11X11 q ) = ci
<1p 11X
+co
>
where
K=1 q
-, (1- cos t) dt
25
U
X
1- cos t X
<,>
and
2 L (dP 0
is the inner product in
dt
0-1 ) •
Moreover, this representation has the following property : a
u
Lq
uniformly on bounded sets of
n
n-4+co
un
u
a
at
2 L (dP
in
t
,
q+1)
.
Oa
u n -*+= To prove theorem 1, it is thus sufficient to show that if
IC
belongs to dt
Lq ( dP
1
then (u
(g)
such that :
where
= 1- e
-t
2
V, c5"
)has a subsequence which converges in
nE N
(X ) n nE
As
Uœ (w,
n
7 n =V niCS *
19,
is equivalent to
A(w)
t
2
2
, one can show that :
a
AE L
1
c(w,t)
lim e(w,t) = 0 a.e. 0
On the other hand, we know that :
k
n a .a U n = E (1- U 1 7
1
-
i=1
We deduce from tlese two facts that :
, n 2 T
U
i=1 1 - e
n
1
k
A(w)
(cx n, 2 ) i=1 1
(w,
n
t)
where S ri = Sup (!cr 1.1 !, 1 s i 5 k .] n Taking a subsequence of (X ) , we can suppose : n nE N
n„ 2 E ka .) i=1
T Then (II
n
)
>0
converges a.e. to
nE
-Q't2A(w)
and by Lebesgue' s dominated conver-
2 L (cap ® — /L—) • This implies that q+1 t is relatively compact for the uniform convergence on bounded sets of Lq and
gence theorem
(U
n
1- e
)
fl
E
converges also in
IC (a) 1 proves theorem 1 by (S.C.) in that case.
or 2 for weakly null sequenwhich are not equivalent to the unit vector basis of £2 .
This theorem does not give any answer to problems 1
ces of
Lq
1 5 q < 2
The following result gives a negative answer to problem 2 in that case :
THEOREM 2. Let
15 q< p< 2 • There exists a bounded sequence in
(i) (X ) n nE IN
is equivalent to the unit vector basis of
Lq such that : ZP .
26
(ii) (X ) n nE U
has no almost exchangeable subsequence after any change of
density.
(iii)
If
is the type defined by
gives a hope that
is not relatively compact
Lq .
for the uniform convergence on bounded sets of Remark : Property (iii)
K l (c)
(X ) n nE N
(X ) n nE
has no almost symmetric subse-
quence but this question is still open. In fact, the two natural ways to find almost LP -spaces (namely the theory of probability with almost
symmetric subsequences in
exchangeable subsequences and the stability of those spaces) do not work for this (X ) n nE W
sequence
Sketch of proof
[7]. +co
, du p+1 u
(1- cos t u) x(u,w)
Let
U(w,t) = 1-e
= 1
where
u E [0,1/N]
if
FN
u E FN 2k-1 ,N 2k+1 1
1f
= -k
is a fixed positive constant and
bles such that
) -k k E P(.71( = 1) = P(.1( = 2) = 1/2 and
It is possible to verify that
a sequence of i.i.d. random varia-
0. (F k , k 5 IN) = 03] .
defines a symmetric type
U
a
on
Lq (cf. 1- 11])
by the way defined in theorem 1 and such that
U K Hah q =
dt
Li (dP t
---) q+1 .
By construction, the function
U oscilates a lot
no a.e. convergent subsequence] and this prevents
F(F k ) kE K l (g)
is i.i.d. and so has to be uniformly relatively
compact and proves property (iii). On the other hand, it is easy to see that
-K tP
(where
satisfies the inequalities :
-2K tP Uœ (w,t) s 1-e
1- e
U
K =
r
+ m cosu)0
P
du )
17.
.
Using techniques of stability, it is possible to show that these inequalities imply that
g
is "equivalent" to an
Q?-type
be defined by a sequence
of
(Xn)nEIN L P • This proves property (i).
Lq (ydP)
and a subsequence
(Xnk)
such that UT = 1-e
-K t P P )
and thus can
which is equivalent to the unit vector basis
Proof of (ii) : We follow an idea of Suppose that there exists a density
T
F2]. p , an exchangeable sequence
kE N
of
(X ) n nE E
such that :
(Zk) kE N
in
27 X
+co
nk
E 11Z k k=1
m
q L (cpdP)
Then, it is shown in
[2]
that : nk
it
H(w,
1/q
1
)
m
(1 -.e 1/q ) - w - lim k -, + on w(w)
it Z = w - lim (1 - e k) k-4+ on it Zk G ] for all k E E [1 - e
G
Where
is the
g
-
field generated in
2
. t
w U(w,
by the functions
„ )
for
cp(w) liq
t E R
+
(G
is smaller than the tailfield of
Let us show that G, =
w
(w) = r
:
ri _
=
0
X
1/q
du
p+1
" (1-)
2k+1
+m u)] + E cos( 1 /q p+1 k=c u (w) du
k
(w)r
1
(1- cos N
2k -1
m(w)
1/q
u)
du u1
t.--, + m, it is easy to see that :
When
1/N
(1- cos
N
t
p
u p+1
2k+1
T 2k-1
for
(1- cos 0(wwq u)
r
du
u p+1
N
2k+1 du
I N 2k-1 u p+1
k E N.
This proves that
N
c ,....
u) du
t
0 6)) 1/q
0
2k+3 du
[' 2k+1 u p+1 N
As
)
u) ,
t
cos(
m(w)
ri—
[
) •
is also generated by :
G
0 1/N
(Z ) k kE
m
G
belongs to
N
2k+3
2 S 2k+1 N
( k ) kE 11 generates
N
du
u p+1 , G
k
also for all
and
because we can write :
2k+1
YN 2k-1
du
u p+1
is equal to
B .
This situation is impossible because that would mean that : it Z k for all k E N , / )== 1- e
w(w) Pici and this is obviouly false (for example,
U(w,
t
(w) 17c1 c This proves property (ii)
and theorem 2.
,...,
tP co (w) p /c1
28
BIBLIOGRAPHIE
[1] ALDOUS D.J. : Subspaces of
L 1 via random measures. Trans. Amer. Math. Soc., 267, (1981), 445-463.
[2] BERKES I. and ROSENTHAL H.P. : Almost exchangeable sequences of random variables. To appear in Zeitschrift für Wahrscheinlichkeitstheorie verw. Gebiete. [3] BRUNEL A. and SUCHESTON L. : On B-convex Banach spaces. Math. Systems theory, t. 7 n°4, 1973. [4] DACUNHA-CASTELLE D. : Variables aléatoires échangeables et espaces d'Orlicz. Séminaire Maurey-Schwartz, Ecole Polytechnique, 1974/75, exposés 10 et 11 . [5] DACUNHA-CASTELLE D. et
KRIVINE J.L. : Sous-espaces de L1 . 26 (1977), 320-351.
Israël Journal of Math., [6]
GUERRE S. :
[7]
GUERRE S. :
Types et suites symétriques dans LP , 1 A paraître dans Israël Journal of Math. Sur les suites presque échangeables dans
p<+ 00 .
, 1< q< 2 .
Preprint. [8]
JOHNSON W.B., MAUREY B., SCHECHTMAN G., TZAFRIRI L. : Symmetric structures in Banach spaces. Memoirs of the American Math. Soc., May 1979, vol. 19, n° 217.
[9] KADEC H.I. and PELCZYNSKT A. : Bases, lacunary sequences and complemented subspaces of L . Studia Math., TPXXI, 1962. [10] KRIVINE J.L. et
MAUREY B. : Espaces de Banach stables. Israël Journal of Math., vol. 39, no 4, (1981).
[11]
LEVY P. :
Théorie de l'addition des variables aléatoires. Gauthier-Villars.
AN APPLICATION OF A MARTINGALE INEQUALITY OF DUBINS AND FREEDMAN TO THE LAW OF LARGE NUMBERS IN BANACH SPACES
Bernard HFINKEL Département de Mathématique 7, rue René Descartes 67084 STRASBOURG Cédex (France)
ABSTRACT : In a real, separable, p-uniformly smooth Banach space the law of large numbers in the Prohorov setting is studied by a method depending on a result of Dubins and Freedman which compares the distribution of a real valued martingale with the one of the associated conditional variances. Some laws of large numbers of Kolmogorov-Brunk type are also given.
Several recent papers have improved very much the knowledge on the strong law of large numbers (SLLN) and on the law of the iterated logarithm (LIL) for random variables (r. v. ) taking their values in a Banach spaceequipped with a regular norm. The key idea in these papers is to use the regularity of the norm for reducing the infinite dimensional SLLN or LIL to a scalar SLLN or LIL. But then for solving this finite dimensional SLLN or LIL problem, sophisticated techniques - multiple truncations, iteration of martingale exponential inequalities - have to be used. Here we will introduce a new approach of the SLLN in a Banach space with a regular norm, approach which allows both to obtain new results in the non i. i. d. setting and also to show in a simpler way statements which were known previously. The cornerstone of this method is a result of Dubins and Freedman [3] which compares the distribution of a real valued martingale and the ones of the associated conditional variances.
Before to state and to prove the results we recall some de-
finitions.
§ 1. SOME DEFINITIONS.
In all the sequel we will denote by (B, II) a real separable Banach space which is p-uniformly smooth (1 Ù, sup( 1/2 ( 1,
p (t) =
Ilx-ty!!) -
i!xl! = IIYII = 1 ) '
30
satisfies :
P(t)
C tP
C being a positive constant. It is well known that the norm II II is differentiable away from the origin ; let's denote by D the derivative of
H.
If now one associates to D the following fonction F : B
x 0, and :
F(x) = xII F(0) = 0 ,
D(x/
B' :
)
one can check that F has the following two properties [19] :
(IIF(x)II Bt (ii)
C > 0
=
IxI 1333-1
Ilx-F y liP - Px
V (x, y) E 13 2 ,
P I F(x)(Y)
These two properties are crucial for reducing the infinite dimensional SLLN to a scalar one. Other geometrical properties of a p-uniformly smooth space that we will use are its reflexivity [5 ]and the fact that it is of type p [ 15 ] . Usually the SLLN problem in (B.
H) is stated in the following way :
"Let (X ) be a sequence of independent, centered,
B-valued r. v. and denote by k Sn = X 1 +...+X n the associated partial sums ; under what hypotheses does
(S
n
/n ) converge a. s. to 0 ?
(1) "
This point of view is very restrictive because two other asymptotic behaviours of the sequence ( S
/n ) are worth of interest : n P( sup IISn /n1I -<+ ) 1, which behaviour can be called a bounded law of large numbers (BLLN)
(2) ,
and :
P( w : S n (u.1) /n
0 weakly ) =-• 1, which is a law of large numbers in the weak topo-
logy (WTLLN)
(3) . The main goal of this paper is to study the three forms (1), (2) and (3) of the
LLN, under Prohorov boundedness conditions of the r. v. , in p-uniformly smooth spaces ; the good geometrical properties of these spaces allow to see that even if the asymptotic behaviours (1), (2) and (3) seem to be close , they happen under hypotheses which are very different. In an appendix we will also state without proof some Kolmogorov-Brunk type SLLN which can also be obtained by applying the same Dubins-Freedman comparison result.
§ 2. PROHOROV'S BOUNDED LAW OF LARGE NUMBERS. A sufficient condition for the BLLN in the classical Prohorov setting is as follows :
31 THEOREM 1 : Let (X ) be a sequence of independent, centered,
k
r. v. with values
in a real separable p-uniformly smooth ( 1
such
that :
>Q
YkE
1X k
(k / L 2 k)
a. s,
where : L x = Log(Log sup (x, e)).
2
Let's define for every integer n :
A (n) = 2 -2n
E
sup ( E f 2 (X k) :
where : 1(n) = (2 n + 1,
1)
B'
k E I(n)
,
, 2 11+1 ) ,
and , suppose that the following hold : a) The sequence ( S
) is stochastically bounded . n In
b) The sequence ( 2 -211P Log n 2. C > O :
c)
Then :
Z n 1
P( sup
Z k E I(n)
Xk
) is stochastically bounded.
exp ( e / A(n) ) < + -
<+)=1
Sn /n
REMARK : One checks easily that condition b) above holds for instance if the sequence ( n -2 P L
2 P ) is stochastically bounded.
2n 1 k n Xk
Now, let's give the proof of Theorem 1. PROOF : An easy symmetrization argument, similar to the one used in the proof
of [171 shows that it suffices to prove Theorem 1 for symmetrically distributed r, v. Xk So we only consider that case. Another symmetrization Zp argument shows that h) implies that the sequence ( 2-2np Log n E X ) k k E I(n) is L 1 -bounded. of Lemma 2. 1
For technical simplicity we make the following three assumptions - which aren't a loss of generality - : i ) K' > 0 : 486 K' (2p + 2C ) n
2, ,
Vi E I (n) ,
where of course C
)
X.
1/8 and : K' 2 n / Log n a. s.
denotes the constant involved in the fundamental inequality
recalled in Section 1. E exp ( -4K' 2 / A(n) ) <+
a)
.
n 1 -2nn
sup 2 -
Log n E k E I(n)
2P
K' 2 .
Now, let's start the proof itself. -n E If one denotes by T the r. v. 2 X k ' an easy application of the Boreln k E 1(n)
32
Cantelli lemma and of the symmetry of the X k shows that the conclusion of Theorem 1 holds if the following is true :
at>0:
Z n 1
P( T
(4)
>t) <+83.
n
By symmetry [1], hypothesis a) implies :
E S
sup
IIP
/n
n
< + CI) ,
and therefore also :
c=
<
sup E T n
.
So (4) will hold if we find V > 0 such that :
IIP
- E Tn
E P( T n 1
> t' ) <+ .
In order to find such a t', we begin by looking for good bounds for the quantities : u
= P( T
n
where x
I P > 2x ) ,
P - E Tn
n
sup (2. c) will be specified later.
Suppose that the integer n 2 is fixed and denote by Z 1 ,
,Z
the r. v.
2n
(X5 / 2 n) i E 1(n) ; we will also denote T n by T and A(n) by A. The symbol E will denote a sum taken on the set of integers (1,
For finding a bound for u
, 2 n).
, we will consider two cases :
n
First case : Log n
/ A.
By adapting a well known trick of Yurinskii [21] , Ledoux [19] has noticed that a
I
r. v. of the type T
-E
and :
= Z TIk E(
where :
can be written as a martingale :
-E
T
k
(Z
P
I
E( T , Zk )
I
k1'
, a o =(0,
For our needs this decomposition is not precise enough, so we will refine it by setting :
V k = 1,
2n ,
A
k
B
k
=(IZ+ 1
"•
+Z
k- 1
+Z
+ +Z k+1 ' • '
I
>y ) ,
2n
= A
k ' where y > 0 will be chosen later, and : k P
k
E(
TIIP
E ( T
Ak
I
73`
I
1 Bk
k
73`
k
-E( )-E(
The following inequality obviously holds : P( T P -E
TII P > 2x ) P(
In a first step we will bound P( E
ak > x ) + > x).
P(
Pk >
TI! I
A
I k
113 1 B
3
k-1
I ak-1
)
'
33 )
In order to do this we notice for beginning that ( 1 sk sj
211
is a martingale to which we plan to apply the following comparison result of
Dubins and Freedman [3] : ) be a real valued martingale and denote by (Y k ) its incre-
LEMMA 1 : Let (Sk ,
ments. If one defines for every k 1,..., n:
V k = E ( Yk 2
k- 1
) - with a — 0
✓ a >0. V b >0, P(
j = 1,
(
,
) - then :
, n :(Y 1 +...+Yi )
a (V 1 +...+v i ) + b )
1/ (1+ab).
For applying this lemma to our situation we need first a bound for the r. v. 1 this bound will be obtained by an easy change in the computations of the proof of Lemma 1 in [19] :
P IA I 3k )sE(
E (
k ) + p E ( I F( T-Z k)(Z k) IA I 1
T-Zk lIP IA I
+C E(dZ
k
A
)
) k '
k
and :
E ( T
P IA
k
1 a k-l ) E ( T-Zk
) I 1 a, I k-1 ) - P E(IF(T-Z0Z, .1() A k-1
IA
I k - C E (1lZ k IP IA
P
k
I k
a k-1 ) '
From these two inequalities one deduces : p E(
E(T-Z, k )(Z k )
IA I I
C E ( 11Zr k P IA I
3k )
+ p E ( 1 F(T-Zk)(Zk) IAk 113
C p E 1lZ k
k-1 )
The same computation can obviously be done for -
I
cek
I
p E ( I F(T-Zk)(Z) 'AI I +p E ( I F(T-Zk)(Zk) IA
TA k
k ' so :
) + C E ( Z k IP IA 1 a k ) 1
k
)
1 3 k-1 ) C p E 1l Z k
)•
lAk
From this inequality it follows easily that :
E cel2c
ak-1)
8p 2 E(F 2 (T -Z k)(Z k) IA
If one puts now a
p( 1 k 2
E E E(a
n Ak ) '
2 flc-1 ) 8C p
1 2P E
(
P k_
-
one sees that :
2 E( IT Zk 12(P-1)I I 73' ) s 8p 2 sup ) A + 8C IM2a / Log n k k-1 12) Ak 1 sk 1,r_zkl 8(p-1)) 1/4 A+ ,,„2 Kt 2 cr/Log 2 3/4 ou p s 8p a sup (E 1 k 2–
By assumption a) and symmetry there exists L > 0, such that : sup sup
n 1 sk 2n
(E 1T-Z
k 8(p-1)) 114
L.
34
Hence :
8p 2 A 013 " L + 8C 2 K' Z a' /Log n
E E E (oek Z I 3)
and so by assumption c) , one has for n large enough :
E E E ( cy2 k
I
k-1
a3/4
)
Applying now Lemma 1 for a = a - 1/2 and b = 1, one obtains : Œ -1/2 zE(ct,2 .1:= ( Ea, >xt P( E ak >x ) ) k "k 2 kk-1 ) + 2 + P( Ea >x, Ea <Œ k E Œk 1 3 k-1 ) 1 ) k 1/2 u1/4 1/4 a Ax- 1 ) ) s a( From the inequalities : Œ
P(
sup 1
E Z. > y/3 ) 1 sjs k
2n
2 P(
E Z.
>y/3 )
,
an application of Hoffmann-Jibrgensen's Lemma ( [14] Lemma 4.4 ) gives that for every y
81 :
P( Z ak >x ) s 2 (2) s 4 (2) Now , fix y
n E
3/4 3/4
1/2
P
(
P( E Z
sup ( 486, 24 c),
,
2
>y/27 ) s 32 P ( E Z
k
k
>y/81 ).
such that :
32 P( z Z k >181 ) s 1/2
- such a choice is of course possible by hypothesis a) - ; y being fixed, we put : x = y/243 . For such a couple (x, y) one has :
P(
-E
2 P(
T 1 13 > 2x )
(6)
Pk > x )
In the next step of the proof we will bound the right-hand side of (6) by using another martingale result, also due to Dubins and Freedman [3] LEMMA 2 : Let (Sk , a k) 3.11.1 be a real valued martingale and denote by (Y k ) its
increments. Suppose that : n Then:
lY
k
I
1
a. s. .
VuER.VX>O,Vv> 0, X( u+ Y 1 +... -Fy n)
E ch (
where :
e(X/v) ) ,
) s ch (Xu/v ) exp v+ V k
I
+...+V
n
1,.... n ,
Vk = E ( Y
a k-1 )
:
e(x) = exp x - 1 -x . For applying this lemma to our situation, we first notice that by the same computations as for the
r.v•k
one has :
35
p E ( 1 F(T-Zk)(Zk)
I 13 k I
I 'Bk
la k
) + C E ( Z
+ p E ( I F(T-Z k )(Z k) I B I I O
k-1
If for every k = 1, „. , 2 n one defines : 1-p Y = ( y Log n / KT(2p+2C p ) ) k
k
one has : V k = 1,
s 1
1Yk 1
, 2n ,
' Bk
) + C E ( Il Z k
I
B
I k
-K-1 ) .
,
a. s.
Now we want to bound : a
n
=P(EY
k
> xy
1-p
Log n / K 1 (2p+2C ) ) ,
by using Lemma 2. Let's notice that : V
1
(Log n) 2 (8p 2 A + 8C 2 K 12 /Log n) / K' 2 (2p+2C ) 2
+... +V
4 Log n
Hence : a
n
s P( (1 / v+V i +... +V) E Y k > xy 2
Remembering now that p E
'II, 21 and
1-p
Log n / K 1 (2p+2C ) (v+4Log
).
also the assumption i) made on K', one ob-
tains : a
n
s P( (1/ v+V + +V ) E Y > 16 Log n / (v+4Log n) ) . 1' • ' k 2n
Applying now Lemma 2 for X =17 = 4 Log n, one gets : -4 a . s exp 4Log n / ch(8 Log n) s 2 n n So for the integers n n (x) and such that Log n s K 12 / A , one has : 0 -4 u s4n n Now we go to the second case.
Second case : Log n > K 12 / A . It is easy to see that the relation (6) remains true - for the same couples (x, y) and that the only thing to do is to apply Lemma 2 in a different manner.
If one defines : V k = 1,
, 2n ,
yk
= ( KTy l-P / A (2p+2C ) ) r3 k
one has : V k = 1,
, 2n,
Yk I
a . s.
Furthermore : 2 12 2 2 V + +V /Log n)) / A2 (2p+2C p ) 2 s 4 K' / A s K 12 (8p +8C (K 1 • • 2n So un can be bounded by : 2 Pi( (1/ v+V I +... +V n ) E 5k > xlç y1 2
/ (2p-E2c. ) (Av + 4K12)
Now we apply Lemma 2 for X =y = 4K 12 / A and we obtain :
36
u
s 2 exp (4K' 2 / A) /
n
ch (8K' 2 / A)
4 exp - (4K' 2
/ A) .
Putting together the results obtained in the two cases, one gets : Vn
N(x),
un s 4 ( n -4 + exp - (4K' 2 / A)
).
Hence : n1 n and this finishes the proof of Theorem 1.
§ 3. PROHOROV'S LAW OF LARGE NUMBERS IN THE WEAK TOPOLOGY. It is natural to expect that this asymptotic behaviour can be obtained under some "weak topological refinement" of
the hypotheses of Theorem 1. The precise result
is as follows :
THEOREM 2 : Let (X ) be a sequence of independent, centered, r. v. with values k in a real separable, p-uniformly smooth ( 10 : Vk E1NT,
1 X0 s K (k/L 2k)
a. s.
Let's define for every integer n and every f E B' : Z
X(n, f) :-• 2 -2n k
E I(n)
E f2 (X ) k
•
Suppose that assumptions a), b), c) of Theorem 1 are fulfilled and that the following one holds also : YE> 0 , 1TfEBI,
d)
Z n 1
exp ( _ E / X(n, f) ) < + cr .
Then : P( tu : S n(tu ) /n --' 0 weakly ) = 1.
PROOF : By Theorem 1, we know that : P( sup n A standard argument then gives [13] : E
sup n
(7)
Furthermore the one dimensional Prohorov SLLN [20 1 implies : Vf E B',
f( Sn /n) -' 0
a. s.
This property and (7) show that (Sn in , a(X 1 , ... , X n) ) is a weak sequential amart of class (B) (for the definition and the main properties of weak sequential amarts, see for instance [4] V.3 ).
37
The space B being reflexive,
the conclusion of Theorem 2 follows immediately
from a well known convergence theorem of weak sequential amarts due to Brunel and Sucheston [21.
§ 4. PROHOROV'S STRONG LAW OF LARGE NUMBERS.
The sufficient condition for the SLLN in the Prohorov setting can now be guessed easily from Theorems 1 and 2 ; the statement is as follows :
THEOREM 3 : Let (X) be a sequence of independent, centered, r. v. with values r,
in a real, separable, p-uniformly smooth ( 1
,
such
that : 2K>0:
1X k l sK(k/L 2 k)
a. s.
Suppose that the following hold : S
b')
2 -2nP Log n Z k E I(n)
c')
y e >0,
n
Then : S
/n
n
0
in probability.
a')
Xk 2p
-' 0 in probability
E exp (_ C / A(n) ) <+ n 1 0 a. s.
In
REMARK : It is easy to see that b') holds for instance if : -2p n Ln Z X 1 2P 0 in probability. 2 k 1 "lcS n PROOF : By a classical argument [17 ] it is sufficient to consider the symmetrical case. By symmetrization ( see [10] proof of Lemma 1 ) 13') implies : b")
lim 2 -2 " Log n E + k E I(n)
E Xk
The ideas of the proof of Theorem 3 are the same as those used for proving Theorem 1 ; so we will keep the same notations as in the proof of Theorem 1 and we will only detail what needs to be detailed. First one notices [1 1 that from a') it follows that : E T
n
11P
S
n
In
P
0 7 and also :
O.
So the the conclusion of Theorem 3 will be true if we show : x > 0 ,
E n 1
P( T n
- E T n P > ?,x ) < +
.
Without loss of generality it suffices to check (8) only for x E 0, 243 -13 [
(8) Fix
such an x. As previously, we denote by u n the general term of the series involved in (8) ; we will bound u
n
by considering again two classes of indices n, slightly different
38
from those taken in the proof of Theorem 1. Let's suppose - and this of course isn't a loss of generality - that : ✓n
2, VjE I(n) ,
a. s.
1 X. s 2 n /Log n J
For simplicity we put
0 = exp ( -8(2p+2C ) Ix) . P
We consider first the following situation :
First case :
A Log n 0 ,.
It is easy to see that if one puts y = 243 x
1/p
, relation (6) holds for n large enough.
Now we will again apply Lemma 2 ; for this we introduce the following sequence ( Yk ) : ✓ k = 1, .... 2 n
. Y k = (Log n/(2p+2C p ) ) P k
One has : ✓ k = 1, ... , 2 11
I Y k I s 1 a. s.
and : V
+ +V n 1• • ' 2
where : d
( 2(Log n) 2 A + 2Log n dn ) , -4 0, by condition b") .
n
Therefore : ✓n
V 1 +.. .+ V n s 4 0 Log n. 2
n(x) ,
k
Applying now Lemma 2 with y = 4 e Log n and X = 4 y (2p+2C p ) /x , one gets : u
n
s ( 2/ch2Log n) exp 4 0
1 /2
Log n s 4 n
3/2 .
It remains to consider the complementary situation :
Second case : A Log n >
e.
If one chooses now :
A (2p F2C p ) )
Yk = ( 0 /
-
Ok
one has : Vk=1
2
n
,
IY
k
I
1 a. s.
and : ✓ n > n'(x) ,
V +... +V 1 zn
40
2
/
A.
2 Applying finally Lemma 2 for y = 4 0 2 / A and X = y x one gets : 4 2 s (2 / ch(x 3 0 7/8 /2A) ) exp (4x 0 /A) s 4 exp ( - x 3 0 7 /8 / 4,11) . u n Collecting the partial results we obtain : yx E
I), 243 - Pr ,
Y n N( x)
u
n
a
N(x) Er ,
s4(n
a
y (x) > 0 :
+ exp (- y
(X)
/
A ))
and (8) immediately follows from hypothesis c') .
,
39
REMARK : Hypothesis b') in Theorem 3 seems at first glance somewhat surprising and artificial. To shed light on its meaning we will now give some corollaries of Theorem 3 which will show that b') is a very weak hypothesis. COROLLARY 1 : Let (X k ) be a sequence of independent, centered, r. v. with
values in a real, separable, p-uniformly smooth ( 1
K > 0 : Vk
a. s.
Suppose that the following hold : 1)
n -13
lixk
E 1
2)
in probability
_4 0
[
n
eXp(-
V
A(n) ) <+
n> 1 Then :
S
0
In
n
a, s.
The proof of this result is very easy. First : z 2p 2 -2np -2np s (c/Log n) 11 X II k
kEI(n.)
E k E I(n)
X
k
P
and so hypothesis b') of Theorem 3 is fulfilled by applying 1). By symmetrization one has : n-P
E 1
EX P3
0;
n
the space B being of type p, it follows that the sequence ( S n /n ) converges to 0 in L(B). All the assumptions of Theorem 3 being fulfilled, this ends the proof of Corollary 1. In the case p = 2, Corollary 1 reduces to Prohorov's SLLN proved in [1l] . Let's stay for a moment in this situation p 2 for making precise the difference between the hypotheses of Theorem 3 and those of the result in [11] , One observes first that the following easy corollary of Theorem 3 holds : COROLLARY 2 : Let (X k) be a sequence of independent, centered, r e v, with
values in a real, separable, 2-uniformly smooth Banach space (B, that :
EK>O: y k E N
K (k/L 2 k)
I! X k II
a. s.
;
Suppose that the following hold : a') S n / n
0 in probability
( 1 / n 2 1, 2 n )
c`) Then :
6
> 0 ,
E 1 sk< n E n> 1
J X k 11
-4 0 in probability / A.(n) ) <
exp ( S
2
n
/n
12, .
0 a. s.
such
40
The gap between hypothesis b') in the above statement and assumption 1) in Corollary 1 is clear. It is easy to give examples of sequences of r • v. which belong to the domain of application of Corollary 2, but not to the one of Corollary 1 ; the sequence considered in the last section of [18 ] provides such an example. In [11] it has been noticed that in the special case of Hilbert spaces - which are of course 2-uniformly smooth - condition 1) in Corollary 1 is necessary for the SLLN in the Prohorov setting. More generally, is it possible to simplify the hypotheses of Theorem 3 by making additional cotype restrictions on B ? Godbole has characterized the spaces of cotype q in terms of SLLN for the q / kg -1 ) ( [6] Theorem 2.1 ) ; unfortunately his result will of no k help in our situation. sequence ( X
The ( partial ) result we are able to prove is as follows : COROLLARY 3 : Let 3/2 s p s 2 and 2, s q s 2p - 1 ; consider a sequence (X k ) of
independent, centered, r. v, with values in a real, separable, p-uniformly smooth Banach space (B, 11 1),
of cotype cl.
Suppose that the following hold : i)a- K>0 :VkEN, 11 X E n>1
ii) V e > 0 , Then :
S
n
k
11 s K (k/L k) 2
exP ( - E /
/ n ,-. 0 a. s.
MO
) S
4.)
a. s.
n
/ n -4 0 in probability.
PROOF : Of course the only thing to do is to check that the weak law of large num_ bers implies the strong one. As Sn / n -4 0 in probability, it is sufficient to consider the symmetrical case. By cotype q and assumption i), one has : lim n -ci Z E r Xk I q , n -4 + ee lskri Hence : -2p n Ln 2
E 1 sksn
E 11 X
k
o.
2 P s K 21)-(1 n -ci (L2n)1 _2p
E 1 sksn
E 11 X k d cl ,
and Corollary 3 immediately follows, by application of Theorem 3.
REMARK : Let's suppose that conditions i) and ii) of Corollary 3 are fulfilled for a sequence of symmetrically distributed r. v. X k . If the weak law of large numbers holds, then, by Godbole's result n -cl implication holds : S
n
In
' 0
in probability
Z Iskn
= n -cl
1 Xk 1
E 1 skn
q
1 X
-4 0
k
a. s. . So the following
q' 0
a. s. ;
this shows the very special geometric nature of these spaces which are both p-uniformly smooth ( 3/2 s p s 2 ) and of cotype q (2sqs 2p - 1 ) .
41
§ 5. APPENDIX: SOME LAWS OF LARGE NUMBERS OF KOLMOGOROV-BRUNK TYPE.
In [9] and [12] the SLLN of Kolmogorov and Brunk are studied in 2-uniformly smooth spaces under hypotheses requiring both assumptions on the strong and on the weak moments of the r. v. . Results stronger than the classical ones known for type 2 spaces are obtained. By applying - in a more elementary manner as above - the martingale technique developed in section 2, Kolmogorov-Brunk type SLLN can also be obtained in
p-uniiormly smooth ( 1
THEOREM 4 : Let (X ) be a sequence of independent, centered, r. v. with values
k
in a real, separable, p-uniformly smooth ( 1
1
)•
Suppose that :
) There exists a sequence of positive numbers (d.) i, J If j E rN , a. s. x.J .5 cl• J ii) S / n •-• 0 in probability . n
converging to 0, such that :
I
If one defines for every integer n :
I"(p,n) = 2 211P
E j E I(n)
J
1 2X. P
then the following implication holds :
( a k integer, k
1:
E n 1
( Mn) + lip, n) ) k <+ 113 )
=S n / n --, 0 a. s. .
As an obvious corollary of Theorem 4 one has the following result which can be compared with the well known law of large numbers in type p spaces of Hoffmann-
JSrgensen and Pisier [ 15 ] : COROLLARY 4 : Let (X k) be a sequence of independent, centered, r. v. with values in a real, separable, p-uniformly smooth ( 1
a)
S
n
/ n -• 0 in probability
b)
E k 1
c)
1,11 :
Then :
k -2 P E X E n 1
2p k
< +13
(An)) i
<+,3 .
S
n
/n -. 0
a. s. .
42
Bounded laws of large numbers and laws of large numbers in the weak topology can of course also be considered in the Kolmogorov-Brunk setting. We only give an example of such a result :
THEOREM 5 : _ Let (X k ) be a sequence of independent, centered, r. v. with values in a real separable, p-uniformly smooth ( 1
b) k c)
Ej
Then :
k -2 P E X 1 1 :
E n 1
k 1 2P ‹+
12
( A (n) )i < + CC
.
P( sup II S n / n 11 < + 0) ) n
Let's conclude by mentioning an application of the above results. It is well known that all the classical situations in which one can conclude that a Banach space valued, symmetrically distributed r. v. X - and it is always possible to reduce to that case - satisfies the LIL,
are the union of a Prohorov type BLLN
and of a SLLN ( 1-16] , [7] ) or of a law of large numbers in the weak topology [8]
. Therefore it is not surprising that Ledoux's recent necessary and sufficient
condition for the LIL in uniformly convex spaces [19] can also be obtained as a corollary of Theorem I and Theorem 5. The computations for proving this observation are left to the reader.
REFERENCES.
[1]
DE ACOSTA, A. : Inequalities for B-valued random vectors with applications to the strong law of large numbers. Ann. Prob. 9 (1981) , 157-161
[2]
BRUNEL, A. et SUCHESTON, L. : Sur les amarts faibles à valeurs vectorielles. C.R. Acad. Sc. Paris 282, Sér. A (1976), 1011-1014
[3]
DUBINS, L. E. and FREEDMAN, D.A. : A sharper form of the Borel-Cantelli lemma and the strong law. Ann. Math. Stat. 36 (1965), 800-807
[4]
EGGHE, L. : Stopping time techniques for analysts and probabilists. Cambridge University Press - Cambridge 1984
[5]
ENFLO. P. : Banach spaces which can be given an equivalent uniformly convex norm, Israel J. of Math, 13 (1972). 281-288
[6]
GODBOLE, A. : Strong laws of large numbers and laws of the iterated logarithm in Banach spaces, PHD, Michigan State University 1984
[7]
HEINKEL, B. : Relation entre théorème central-limite et loi du logarithme itéré dans les espaces de Banach. Z. Wahrscheinlichkeitstheorie very/. Gebiete 49 (1979), 211-220
43
[8]
HEINKEL. B. : Sur la loi du logarithme itéré dans les espaces ré fl exifs. Séminaire de Probabilités 16 - 1980/81 - Lecture Notes in Math 920, 602-608
[9]
HEINKEL, B. : On the law of large numbers in 2-uniformly smooth Banach spaces. Ann. Prob. 12 (1984), 851-857
[10]
HEINKEL, B. : The non 1. i. d. strong law of large numbers in 2-uniformly smooth Banach spaces. Probability Theory on Vector Spaces III - Lublin 1983 Lecture Notes in Math 1080, 90-118
[11]
HEINKEL, B. : Une extension de la loi des grands nombres de Prohorov. Z. Wahrscheinlichkeitstheorie verw. Gebiete 67 (1984), 349-362
[12]
HEINKEL, B. : On Brunk's law of large numbers in some type 2 spaces. to appear in "Probability in Banach spaces 5" - Medford 1984 - , Lecture Notes in Math
[13]
HOFFMANN-JQ5RGENSEN, J. : Sums of independent Banach space valued random variables. Studia Math 52 (1974). 159-186
[14]
HOFFMANN-JORGENSEN, J. : Probability in Banach space. Ecole d'été de Probabilités de St Flour 6 - 1976 - Lecture Notes in Math 598, 1-186
[15]
HOFFMANN-JORGENSEN, J. and PISIER, G. : The law of large numbers and the central limit theorem in Banach spaces. Atm. Prob. 4 (1976), 587-599
[16]
KUELBS, J. : Kolmogorov law of the iterated logarithm for Banach space valued random variables. Ill. J. of Math. 21 (1977), 784-800
[17]
KUELBS, J. and ZINN, J. : Some stability results for vector valued random variables. Ann. Prob. 7 (1979), 75-84
[18]
LEDOUX, M. : Sur une inégalité de H. P, Rosenthal et le théorème limite central dans les espaces de Banach. Israel J. of Math. 50 (1985), 290-318
[19]
LEDOUX, M. : La loi du logarithme itéré dans les espaces de Banach uniformément convexes. C. R. Acad. Sc. Paris 300, Sér. I, n° 17 (1985), 613-616
[20]
STOUT, W. F. : Almost sure convergence. Academic Press, New York 1974
[21]
YURINSKII, V. V. : Exponential bounds for large deviations. Theor. Prob. Appl. 19 (1974). 154-155
ON THE
SMALL BALLS
CONDITION IN THE
CENTRAL LIMIT THEOREM
IN UNIFORMLY CONVEX SPACES
Michel Ledoux Département de Mathématique, Université Louis-Pasteur 7, rue René-Descartes, F-67084 Strasbourg, France
Introduction. Let E
be a Banach space. By random variable with values in E
we mean a measurable map X from some probability space (C4T,AD) into E equipped
a(E)
with its Borel
such that the image of 1P
probability measure on S(E) . Denote by (X ) n n copies of X
and, for each n 1 , S n
with values in E if the sequence
X1
by X defines a Radon
a sequence of independent Xn . A random variable X
is said to satisfy the central limit theorem ( CLT CS
in short)
converges weakly to a Gaussian Radon probability
nvn e
measure on E . In his remarkable work on the Glivenko-Cantelli problem, M. Talagrand [T] observed the following characterization of the CLT for random variables with a strong second moment which relates the central limit property to conditions on small balls : if X
takes its values in E and 11E(
X
< oo , then X satisfies
the CLT iff, for each g > 0 , S 11m Lnf W[
n
n
> 0 .
■/171
(Actually, as detailled in [T] , this equivalence holds in the more general setting of non-separable range spaces and in the framework of empirical processes.)
Although it seems rather difficult to verify these conditions on small balls, the preceding property is intriguing since it reduces a central limit property in Banach spaces to some kind of weak convergence on the line by taking norm. This property also lies at some intermediate stage since, as we will see below, a random
45
variable X K in
E
with values in
E
satisfies the CLT iff there exists a compact set
such that
lim ink IP( --21-1 EK1 > 0 n (S
and the sequence
/fn)
n E
is stochastically bounded as soon as for some
M>0
11S lim inf IP( n-'
n <M 1171
> 0 .
M. Talagrand (oral communication) raised the question whether the equivalence
he proved holds without the strong second moment assumption which is not necessary in general for the CLT . In this note, we answer this question in a positive way in uniformly convex spaces. Precisely, we will establish the following result :
THEOREM 1
. Let
with values in
(i)
E • Then X satisfies the CLT iff
lim t 2 F( t
a uniformly convex Banach space and X a random variable
be
E
X
>t3=o
oc)
and (ii) for each E > 0 , lim inf TP( n cx)
Is
F
< C
3>o.
/71
This result will follow easily from a new quadratic estimate of sums of independent random variables in uniformly convex spaces obtained in [L2]
Preliminary results. We begin this section by a characterization of the CLT which follows easily from the concentration's inequality of M. Kanter [K] . I
am
grateful to Prof. X. Fernique for useful informations on this result.
PROPOSITION 2 . Let X
be a random variable with values in a Banach space
Then X satisfies the CLT iff there is a compact
lim ink IP n
E K
> O .
set
K in
E
E.
such that
(1)
46
Further, the sequence (S
//17)
n)
n E
is stochastically bounded iff for some M > 0
IS I < M
11m 1sf IPC fl - Oo
> 0 .
(2)
Proof. The necessity of (1) and (2) implies the stochastic boundedness of X is symmetric. There exist 8 > 0
is obvious. Let us first show why (2)
CS n /N51) n and k
o
• Assume to begin
E
with
that
such that for all integers k k
o
and n S
IPC
nk
<M
Firc
>8.
By Kanter's inequality
dsnd
3
•(1
kip(
> 1,1,./Tc ))
1
8
sin and thus
!
Is
> m sA- 3
kip(
_2_ 48
JR
2
It follows that the sequence (S
ECIlx cy <
for all
co
n
is stochastically bounded and also that
/WI) n ET
ce <2 . In the non-symmetric case, let
copy of X ; the symmetric random variable
X- X'
EC
X
<
implies that X
op
be an independent
satisfies (2) (with 2M instead
of M ) and hence the preceding conclusions apply to that
X'
X- X I
. In particular, we have
; therefore the strong law of large numbers combined with (2) must be centered. Hence the conclusion to the second part of Propo-
sition 2 holds by classical considerations involving Jensen's inequality.
The first
part is established in the same way.
Since in cotype 2 spaces, random variables X
such that (S 4/Ti) n n
is
stochastically bounded satisfy the CLT [P-z] , the previous proposition yields immediately the following corollary.
COROLLARY 3 . Let E
be a Banach space of cotype 2 and X a random variable
with values in E . Then X satisfies the CLT iff (2)
holds.
47
We now turn to the small balls condition. Since for a centered Gaussian Radon probability, each ball centered at the origin of positive radius has positive mass, it is clearly necessary for a random variable
1S 1 --E—
for each C > 0 , lim inl
<
X
CLT
to satisfy the
that
E 3 > o .
(3)
n 1121
M. Talagrand [T]
is also sufficient for
showed that when 1E(1 XH 3 < oo , (3)
X
CLT • For the sake of completeness, we reproduce here Talagrand's
to satisfy the
proof of this result ; it will illustrate the idea we will use next in uniformly convex spaces.
THEOREM 4 . Let
IEC
that
42 1
be a random variable with values in a Banach space
X
< co . Then
Proof. Let C > 0
X
satisfies the
and
be fixed
CLT iff (3)
8 . 8(e) > 0
E
such
holds.
be such that
1S 6
< c 1 >
lim inf n
Choose a finite dimensional subspace
E
map
Et
E/Ii and
T(X)11 2 1
.
H
of
E
quotient norm given by
8.E 2 . For each
n
, 11T(S n)11 -
T
such that if
Ircx) T(S)
denotes the quotient
= d(x,H) , then
11
can be written as a
martingale
!T(S)
IECHT(S n )D
= E
di
i=1
withincrementsd.,i = 1,...,n , such that, for each Efd2i 3 s lEf
IT( xi )
2
i
1
[Y] ) and thus by Chebyschev's inequality
(cf
ITC
-11 )11 _
RUT(
-2-1 )1111 > E 1 5 E -2 ECHT(X) xfil
Since
e
1 t
lim inf 13 (
cto
11T(
--11
)fJ
<
E 1 >
2
1
s ô.
48
and hence, intersecting,
lim sup TEC T( n 00 X
)d1 < 2c
AT:1
therefore satisfies the CLT
by classical arguments (cf [P2] )
A short analysis of this proof shows the central rae of the martingale trick leading to the quadratic estimate and of the integrability condition
EC
4
2
3
<
CO
providing tightness at some point. An improved version, in uniformly convex spaces, of the previous martingale argument of Yurinskii was recently obtained
in [L2]
in some work on the law of the iterated logarithm. It will allow us to establish Theorem 1 which thus characterizes in those spaces the CLT through the small balls condition (3)
and a moment condition which is necessary for the CLT .
Recall that a Banach space E is uniformly convex if for each C > 0 there is a
8 = 8 (c) > 0 such that for all x,y
Itx _ y one has 1 - ---2
with
in E
c,
Y = 1 and 1 x y
> 8 . According to a well-known fundamental result of G. Pisier
[P1] , every uniformly convex Banach space E is p-smooth for some p > 1 i.e. admits an equivalent norm (denoted again
)
with corresponding modulus of
smoothness
p(t) = satisfying
sup ( kix + ty
p(t)
1
-ty0 - 1 ,
+
Kt' for all t > 0
= 1 '}
and some positive finite constant K .
This p-smooth norm is uniformly Fréchet-differentiable away from the origin with derivative
D : E
(01
and F(0) , 0 , then 1 F (x) C > 0 (cf
P-1 D(x/I
E* such that if F (x) = =
P-1
for all x
X
) for x / 0
in E and, for some constant
DI-J] ),
F (x) -F (y)11
C x
y p -1
for all x,y in E .
(4)
The following lemma was the key point in the proof of the main result of EL2] . It will allow to achieve our wish in the next section.
49
LEMMA 5 . Let E
be a p-smooth Banach space for some p > 1 with norm 1
satisfying (4) . Let also (Y.)
be a finite sequence of independent bounded hen HSH11P
E-valuedrandomvariablesandlet
1E(
1 1)/
can be
written as a martingale
1 1)- ECI with increments d
1 = 1,...,n , such that, for each i
2 r 2 2p lEtF (S -Y1)(-Yi))
r Etd3 . 1
2c2 E rjj y. d2p 1 CH
1 11
where C is the constant appearing in (4) .
Before turning to the proof of Theorem 1 , let us point out that a quotient of a p-smooth Banach space E is also p-smooth, and, if norm of E , property (4) holds true for any quotient norm of
denotes the p-smooth with uniform
constant C .
Proof of Theorem 1 . We may and do assume that E is equipped with a p-smooth
1.1
for some p > 1
for which (4)
and Lemma 5 hold. By the previous remark,
these will also hold for every quotient norm with uniform constant C . We assume moreover p <2 .
Condition ( i)
is well-known to be necessary for X to satisfy the CLT
( EA-A-C] , EP-Z] ). Let us show the suffiency part of the theorem and assume first that X is symmetric. Proposition 2 and (ii)
imply that the sequence 2
(- /1/ 1-1 nEN
is stochastically bounded and thus, from the integrability results of LP2] , we know that
sup E C(
n ,, P ) I
KP < co .
N/71
Let
e > 0
be fixed. For each n ,define
(5)
50 X.
u. 1
= u(n)
=
-- 2--
/17_
I,t1 tilX i 1 S A 1 ' i -
n andsetUri = . Eu.;(i) and 1 1=1 number 8 = 8(e) > 0 such that Urnif IF(
n
< e
Un
Since the sequence
(1i)
combine to imply the existence of a real
>6
(S /A) n n E
does not contain an isomorphic copy of
co
[P-Z]
Theorem 5.1 of
is pregaussian, that is, there exists a Gaussian random variable the same covariance structure as
IEC
T( G )11 2 1
s
e2 P
E
and
ensures that G in
X
E with
X . The integrability of Gaussian random vectors
allows then to choose a finite dimensional subspace denotes the quotient map
(ii)
is stochastically bounded under
H of
E
such that if
T
E E/H 2 p2 ic2 P 1 )
We now apply Lemma 5 to the sum
-
(7)
T(U ) n
in
E/H ; F p will therefore denote below
the Fréchet derivative of the quotient norm of
E/H . For each
n , we have by
orthogonality,
1E01 T(un)1P - F[IT(un)P3 1 2 1 n n 2p2 E IE(F 2p (T(Un -u.1 ))(T(u i ))1 + 2 0 2 r lENT(u i )11 2P 1 1 .=.1 i=1 n 2(p-1)3 + 2 0 2 n 7E( u 1 (n)11 2P 1 S 2p2 n-1 7E (1 T(G) 2 1 E lEf U - u n il 1=1
(8 )
since by independence 2
sup 1E flx*(T(x)) I 1 lEaT(U n - ue (P-1) 1 1E(F2p (T(Un -u ))(T(u i ))1 s n-1 x* E (E/ )* H 1 (where the supremum runs over the unit ball
*
sup
)
E(Ix* (T(X))1 2 1 =
H 1 Now, by symmetry and
sup
(E/H )1- of the dual of
2 21 1E(Ix* (T(G))1 1 s laT(G))1
x* e(E/ )* I-1 1 (5) , for each
E/H ) and
i = 1,...,n ,
51
E[ n u.
(
(3E(
12 C p 1) 1 ) 1/2 (p 1 ) -
-
U
K .
n
Further, n
TECdu i (n)
2P )
13 ( 7., 11> t
n i-P
dt 2P
0 1
r( X >
n
dt 2 P
so that lim
nIE( u(n)
0
n
by
(i) and dominated convergence. These observations and (8)
lim sup IP( n 00
I
therefore imply that
T(Un )d P - 1E( T(un )
e
-2p
,2 ( p-1) 2] ( 22 p I\ TEt T(c)
s
(by (7) ) and since (6) holds, by intersection,
11m sup
n
1E (
1T un) (
<
2e
cho
which easily implies that X satisfies the CLT
In the general case, let X' random variable
X - X'
(using (i)
one more time).
be an independent copy of X ; the symmetric
satisfies (i)
and (Ii) and thus the CLT and therefore
so does X .
Conclusion. It is an open problem to know whether Theorem 1 holds true in any Banach space ; since we used the fact that random variables satisfying (10 pregaussian in spaces which do not contain an isomorphic copy of c
o
are
, a general
statement should probably include the condition
(iii) X
is pregaussian
as an additional (necessary) assumption. It would also be interesting to know in what spaces, random variables satisfying (i) (and possibly (iii) ) verify the CLT iff the sequence
CS /1Ti) n n
ix is stochastically bounded (which is weaker
52
than (ii) ). At the present, only trivial situations in which this happens (such that cotype 2 spaces or spaces satisfying A-Ros(2) [L1] like L (1
p
spaces) have been described.
References.
[A-A-G]
de Acosta, A., Araujo, A., Giné, E. : On Poisson measures, Gaussian measures, and the central limit theorem in Banach spaces. Advances in Prob., vol. 4, 1-68, Dekker, New York (1978).
[H-J
On the modulus of smoothness and the G -conditions
Hoffmann-4rgensen, J.
in B-spaces. Aarhus Preprint Series 1974-75, 11 ° 2 (1975). [K]
Kanter, M. : Probability inequalities for convex sets. J. Multivariate Anal. 6, 222-236 (1978) .
[L1]
Ledoux, M. : Sur une inégalité de H.P. Rosenthal et le théorème limite central dans les espaces de Banach. Israel J. Math. 50, 290-318 (1985).
[L2]
Ledoux, M. : The 'Law of the iterated logarithm in uniformly convex Banach spaces. Trans. Amer. Math. Soc. (May 1986).
[Pl]
Pisier, G. : Martingales with values in uniformly convex spaces. Israel J. Math. 20, 326-350 (1975).
[R2]
Pisier, G. : Le théorème de la limite centrale et la loi du logarithme itéré dans les espaces de Banach. Séminaire Maurey-Schwartz 1975-76, exposés 3 et 4, Ecole Polytechnique, Paris (1976).
ER-Z]
Pisier, G., Zinn, J. : On the limit theorems for random variables with values in the spaces L
[T]
p
(2
p < 00) . Zeit. ftir Wahr. 41, 289-304 ( 1 97 8 )-
Talagrand, M. : The Glivenko-Cantelli problem. Annals of Math., to appear (1984).
[Y]
Yurinskii, V.V. : Exponential bounds for large deviations. Theor. Probability Appl. 19, 154-155 (1974).
SOME REMARKS OF
GAUSSIAN
AND
ON THE
UNIFORM CONVERGENCE
RADEMACHER FOURIER QUADRATIC FORMS
M. Ledoux
M.B. Marcus
Département de Mathématique
Department of Mathematics
Université Louis-Pasteur
Texas A & M University
67084 Strasbourg, France
College Station, Texas 77843, U.S.A.
1. Introduction ,00
Let
(
g j n n=0
be an i.i.d. sequence of normal random variables with mean
zero and variance 1 and let (Eno° nn=0
be a Rademacher sequence, i.e. a sequence
of i.i.d. random variables where P(E
0
= 1) = P(E
0
= -1) - 1 . Let
be a sequence of complex numbers satisfying E la m,n m,n and Rademacher Fourier quadratic forms as follows : X (s,t) =
Xe(s,t) =
Eagge m n m n m
i(ms+ nt)
iCins E a CCe m,n m n m
nt
,
12
< co • We define Gaussian
12
(s,t) E
[0,2i 7
E
[0,27J
(s,t)
12
.
We are concerned with the uniform convergence a.s. of the stochastic processes X (s,t)
and X (s,t) . (By convergence we mean e
N n-1 i(ms+nt) lim EEa gge m,n m n N n=1 m=0 and similarly for Xe (s,t).) It is known that if these series converge in probability then they converge uniformly a.s. and in
L P of their sup-norm. This
Professor Marcus was a visiting Professor at the University of Strasbourg when this work was initiated. He is very grateful for the hospitality and friendship that was extended to him while he was there. His work on this paper was also supported by a grant from the National Science Foundation of the U.S.A..
54
is given in
[2]
forms (cf. also
(g'3
Let
respectively for Gaussian and Rademacher quadratic
[3]
and
[9] ). and
fe , }
be independent copies of
from the decoupling inequalities that for all
p
as in (1.1) and (1.2) with finitely many non zero
(1.3)
1
and
a
m,n •
EllX g (s,t)11 P
E11 E a g g' e gms nt) II P m,n m n m
ElIx e (s,t)11 P —
Ed
[e
and
n
X (s,t)
n
.
It follows
and
X (s,t) c
and
(1.4)
E a E E'e i(Ins+nt) 11 P m,n m n m
where the constants of equivalence only depend on sup (s,t)E [0,27 ] 2
.H
and
indicates
.1 . We will show how (1.3) and (1.4) follow from the usual
decoupling inequalities in the Appendix.
f x(S
We associate with
and
(s,t) E[0,27]2
(X e (s,t)3
] (s,t)E [0,27 2
the
pseudometric 121 e i(ms+nt) d((s,t),(s',t')) = Ç E l a m,n' ' m
d
where
(s,t) , (s',t')
E
10,277 2 . As usual let
1 entropy of [0,2n 2 radius
c > 0
N([0,27] 2 7 d;c)
denote the metric
d , i.e. the minimum number of open balls of
respect to
in the pseudometric
e i(ms t +nt 1 )12)1/2
d
that covers
,2 [0,2n] . A number of people
have recognized that
(1.6)
J 1 ([0,27 ] 2 ,d) =
co I log N([0,27] 2 ,d;e) dc <
op
0 is a sufficient condition for the uniform convergence a.s. of (1.1) or (1.2). It appears for example in
[4]
(stated in a somewhat stronger form involving
majorizing measures). In fact C. Borell pointed this out to us and raised the question, can this sufficient condition be improved ? The suspicion that (1.6) can be improved is reasonable because there are some obvious cases in which
oo 217 1 2 ,d) 1 /2 (E0 7
0
(
log N ([0,27 2 ,d; e)) 1/2 de <
co
55
is sufficient for the uniform convergence a.s. of X (s,t) or X (s,t) . One of c these is when a
is a product ; (details will be given in § 2). Nevertheless,
m,n
no smaller function of the metric entropy than the one appearing in (1.6) suffices as a sufficient condition for the uniform convergence a.s. of X (s,t) or X e (s,t) . To be more precise, for all functions f : R +
1R+ such that f(0) . 0 , f is
increasing and lim f(x)/x - 0 (and also satisfies a weak smoothness condition 00
which will be given in §2), we give examples of Gaussian and Rademacher Fourier quadratic forms which are not uniformly convergent a.s. but for which oo ( 1 .7)
f Clog N C[0,27] 2 ,d;e)) de < cm
J([0 27] 2 ,d) '
.
0
These examples also show that contrary to our experience with Gaussian processes no condition on the metric
d
alone can give necessary and sufficient conditions
for the uniform convergence a.s. of (1.1).
Pertaining to our study, X. Fernique [6] has obtained an important corollary of the deep recent result of M. Talagrand [13] on the existence of a majorizing measure for bounded Gaussian processes. Fernique's corollary deals with stationary vector valued Gaussian processes ; using this corollary (in a form adapted to our needs, Theorem 3.1 of this paper) and Theorem 1.1 of [8,Chapter I] we obtain the following theorem which extends the equivalence between Gaussian and Rademacher Fourier series to quadratic forms.
6 (s,t)1 (s, t ) E Theorem 1.1. Let (X (s t g ' )/(s,t)E[0,27]2 and tX as given in (1.1) and (1.2). If only finitely many of the coefficients
0 , 2 Tr ] 2
a
m,n
be are
non zero, then for all p 1
(1.8)
EdX g (5,t)11 P
where, as above,
11.d =
EliX e (s,t)H P
sup (s,t)E [0,211J
.
and the constants of equivalence depend
only on p . In particular, X (s,t) converges uniformly a.s. iff X (s,t) does and (1.8) holds in this case.
56
Developping further the arguments of the proof of Theorem 1.1 , we characterize the a.s, uniform convergence of X (s,t) and Xe(5t) in terms of one-dimensional g entropies following Fernique's characterization of boundedness and continuity of vector valued stationary Gaussian processes. Assume for notational convenience that a
m,n
= 0
(1.9)
n and set for s, s', t, t' E [0,27] ,
if m
d (s,s') = d((s,0),(s',0))
d
and
2
(t,t') = d((0,t),(0,t'))
and for j = 1 , 2 d ) 1/ 2([0 ' 27]' '
(1.10)
Recall that the space C
=
a.s.
(log N([0,27],d .c)) 1 /2 de . 0 , of a.s. continuous random Fourier series, is defined
as the space of all sequences of complex numbers
E a g e
int
n n
a - fa 1°° n n= 0
in
4 2
such that
t E [0,27]
7
converges uniformly a.s. equipped with the norm
Fa-
= E
sup I Eag e int n n n t E [0,2n1
The dual space C * a.s.
of
Ca.s.
.
has been carefully described by G. Pisier as a
space of multipliers M(2, 2 ) ; we refer the reader to [10] , [8]
for further
details. We will not be directly concerned with this here although various interpretations of what we obtain can be given in the language of [10] .
Define for each m
(1.11)
A l = (a
and n the one-dimensional sequences
°11° m,n m=0
Set further for each T
in C* a. s.
d.(t,t') = ( E 1 J
and
A
co . (a m,n n=0
2
and j
T, A i > n
1,2
12 l e i nt I
-e
int'12 ) 1/2 7
t, t' E [0,27]
whenever it is defined. As a corollary of the results of Talagrand and Fernique we obtain :
57
Theorem 1.2. Let
X9. (5 7 t)l (s,t)c [0,2n:2 be as given in (1.1) with only
finitely many non zero coefficients am
E
,n
. Then for
( E lam,n't 2 ) 1/2 m
(1.12)
+
j = 1 or 2 /J[0,27 ],d ) 1/ 2
J /2 ([0,2u],d 1 ) + 1 sup
1 T, J , /_ ([0,27j,d) i
T E c* a. s. 11T11 ••-1 where the constants of equivalence are numerical constants and defined as in
(1.10) with
T
d_
instead of
, T, J/ ([0,27j,d) 2
is
d_ .
We note that by considering Cauchy sequences Theorem 1.2 gives necessary and sufficient conditions for uniform convergence a.s. of also of
X (s,t)
and, by Theorem 1.1 ,
X (s,t) . Actually it is easily seen that, by Theorem 1.1 of
Ea ,Chapter
]
(1.12) also implies (1.8) . It is customary to leave out the "diagonal" (i.e. the terms
a
m,m
) when
studying random quadratics forms. For one thine (1.3) and (1.4) are no longer true, in general, if amm / 0 . Furthermore, the diagonal terms can be handled separately. We write
(1.13)
a 2 e im(s+ t) = E a m,m m
a
e im(s+ t) m,m
(a2 a m,m m
E,2 e im(s +t)
By standard symmetrization argument and Theorem 1.1 [8,ChapterI] , the second series on the right in (1.13) converges uniformly a.s. if and only if
Ea g e
rn
imu
m,m m
u E [0,2rr]
converges uniformly a.s.. The first series on the right in (1.13) is a deterministic Fourier series and the dichotomy betwee n unboundedness and uniform convergence a.s. is no longer relevant. Obviously in the Rademacher case one only gets on the right in (1.13) •
e E a m m,m
im(s+t)
58 2. Random Fourier quadratic forms and entropy
The sufficient conditions that we know for the uniform convergence a.s. of (1.1) and (1.2) follow immediately from well known results and techniques. We will present them here for the convenience of the reader.
the Fourier quadratic forms (1.1) and
Theorem 2.1. If J 1 (10,2n1 2 ,d) < co (1.2) converge uniformly a.s. and
(2.1)
E
sup
C ( ( E
,1X (s,t)1
m
(s,t) E [0 , 2n1 c- g
j cr0,21112 ,d) )
lam,n 12N2 I
where C is a numerical constant. (2.1) is also valid with X replaced by X . Conversely, if either of the Fourier quadratic forms X a.s. then J
V2
([0 2n] 2 ,d) < œ
e
converge uniformly
.
'
12 < op . One can I be complex numbers satisfying E lb m,n m
Proof. Let fb
easily
or X
m,n
constant C > 0 such that
(2.2)
p(
>
gg
Eb
in
12 ) 1 /2 )
E lb m< n
C exp ( X/C) , V X > 0
m ' 11
An estimate similar to (2.2) holds when fg
n
I
is replaced by (c I n
(see e.g. [1] ).
Using these estimates Theorem 2.1 follows from the usual extensions of Dudley's theorem as presented for example in [11] or [5]
since we have that
X (s,t) - x (st,t , )
Eexpce
g
d((s,t),(s',t')) for some u > 0
and similarly with X replaced by X .
For the converse we note that by (1.3) the uniform convergence a.s. of (1.1) implies that of
E ( E a g, ) g e m n m,n n m
ims
s E [0,27]
59
in which, to simplify the notation, we take
independent from
E a gi l m,n n
a
m,n
= 0
m n • Since (g
if
m
is
and is symmetric, the uniform convergence a.s. of
(1.1) implies, by Theorem 1.1 [,Chapter I] , that of
E ( E lam,11 12 1 ; 1/2 -rn e ims n m
7
S E [0,2U]
7
which is equivalent to
J 1/2 ([0,27] 7 d 1 ) <
(2.3) where d
is defined in (1.9). Similarly we sec that
1
(2.4)
OD
J 1/2 ([0,2n],d2 ) <
OD
By the triangle inequality the metric
d((s,s 1 ),(t,t'))
d 1 (s,s 1 )
d
defined ir (1.5) satisfies
+ d2 (t,t') .
Therefore (2.5)
N([0,271 2 1 d; e)
1\1([0,27],d 1 ; e/2) N(E0 7 27] ,d2 ; c/2)
and thus we see by (2.3), (2.4) and (2.5) that the uniform convergence a.s. of (1.1)
Exactly the same argument applies if (1.1) is
implies J 1/2 ([0,2u1 2 7 d) < ai replaced by (1.2) .
In an analogous fashion one can show that
r/2 ([0 ' 271r
a)
co (log N([0,271 r :a;
=
e)) r/2 de
<
co
0
implies the uniform convergence a.s. of r-dimensional tetrahedral (i.e. restricted to the subset of a
for which m
and Rademacher Fourier forms. Here, as above, distance in
L
2
d
1
< m
2
<
r
attention is
) Gaussian
denotes the corresponding
.
We now give examples which show that if one wishes to give metric entropy conditions for the uniform convergence a.s. of (1.1) or (1.2) solely in terms of the metric
d , then the sufficient condition of Theorem 2.1 can not be improved.
60
Let f
R+ R+ , f(0) = 0 , be an increasing function satisfying lim x/f(x) =
lim f(x) = cc . Furthermore we assume that for some C > 0 and all n X
CD
op ,
f(2n )
El2n)
(2.6) n=n
o
2
o
co large enough
o
C
n
2
•
n0
(Such a condition will follow if for some positive finite numbers M
and K ,
f(2xK 1(x)forallx>14.)Let(11.1be a sequence of integers such that + N i _ l and let (1).1 be a sequence of positive numbers. Their precise
N. > N 1 +
a) 2 2 Define values will be specified later but we assume already that E b. N. < op -J J=1 J co (ms + nt ) 12 X (s,t) = (s 7 t) E 10,27] Z b. 7 (2.7) g-g,e .
j=i
3 m,n E i(j)
m
1 , I(j) - ( n : N
1
+ ... + N. J-1
n < N
+ ... + Ni 3 ,
1
N.0 =0.Notethat(j)=N..we will show that for some appropriate choice of J (N.3 and (b_l depending upon f , the process X defined by (2.7) does not g J J converge uniformly a.s. even though op J. f ([0 ' 2712,d) = where
d
12 f(logN([0 7 2rrj ,d;e)) de < op
0
is the metric associated with X
as given in (1.5).
By independence and Jensen's inequality we see that for each J
E II E j=1
g g e i(ms +nt)11
E m,n E 1(j) m
mn
1
E sup b. Ed g g e j J m,n E I(j) m n
i(ms+nt)11
m
and, by Remark 1.3 [8,Chapter V]] , for each j
E
E gg e mn m,n E I(j) m
(ms 4- nt
( E
sup
g 2 t E [0,273 n E I(j) n
eint12
- VT N. )
C N. log F.
(Throughout this example C will denote a strictly positive constant, possibly changing from line to line.) Thus we have that
61
(2.8)
sup J >
Ell E b_
E
j=1
i(ms+nt)
g g e
m,nE 1(j) m
d
C sup b J.N J:Log N. j>
m n
To obtain an upper bound for the left side of (2.8) we note that co d((s,s 1 ),(t,t 1 )) = ( E b2.
e
(ms + nt)
e
2 i(ms' + nt t) 120/
I )
3 m,n E i(j)
j=-1
m
+
5(s,s?)
where 8 denotes the one-dimensional metric given by
co .5(t,t') = ( Z
b 2 N.
j=1
Z
l
eint
eint'i2)1/2
3 n EI(j)
It follows that oo 2S f(2 log N ([0,2n],8;e)) de 0
J ([0 2712,d) f '
We now estimate this entropy. For
sup 8(u,0)
u(s) =
lu I
and each j we define
s > 0
sup
'13 .(s )
Jul
s
( le s nE 1(j)
mu
- 1 1 2 ) 1/2 •
Following [8, p.123-124 and Lemma 3.6 Chapter II] we see that Co f( 2 log N ([0,27],8 ;
de
E))
0 oo 1 1 , , 2 .1/2 C ( ( E b.2 N. ) + f f (2 log 7 ) da(s) ) j=1
0
co
cx)
+
C(. (E b2. N2
E b j=1
j=1
1
'J f (210g J O
) d&/s) )
Furthermore,
1
co f (2 log
) a& _Cs)
0
z f(21.(+2) (.& _(2 -k ) -
_(2-(1+1) ))
k=0 oo
f(2) j.(1) +
Now for each
•
and
E L(2 -k ) (f(21(+2) - f( 2k)) • k=1
s > 0 N.-1
(2.9)
13..(s)
sup ( E lp inu - /1 2 ) 1/2 +
lu]
i(N i +...+N. )u sup
lu]
VT le
'
3-1
— 11
62
(s)
Let us denote by
the first term of the right hand side of (2.9). It is easy
k ,
toseethat N .-1
(2 -k )
2 -k (
n2 ) 1/2
2 -k N 3/2
n=0 for
j
that for
j
and (2.6) we see
q'.(2 -1 )
sufficiently large. Using these estimates for sufficiently large
[log2Nj]
oo E '?.(2 -k ) (f(2k+2) - f(2k)) k=1 J
E k=1
2 A-. (f(2k+2) - f(2k)) J op
E
+
3/2 -k , .421(+2) -f (2k)) 2 J
N.
k=[log N.]+1 2 j C
k
By a similar decomposition of the sum on op E
j
k=1
[(N 1
+...+N. ) 2
-k
J-1
co
j
k=1
( N.2 J
-k
we cret
A 2] (f(2k+2) - f(2k))
A 2) (f(21+2)
2(21))
c
f (3logN.) .
Putting this all together we see that
(2.10)
Since
eo C ( ( E b j=1
.1 f ([0,27 1 2 ,d)
f
co
N2. ) 1/2 +
E b .N _f (3logN ) ) . J J j= 1
lim f(x)// x - 0 , we can first choose
satisfies
x
[N.1
such that
co
.2 j f(3logN.) lim IogN. and then take =
.2
\ \
N-f(31-ogN.))
j > 1 •
For these choices, we see from (2.10) that
12 J ([0,2n j ,d) < en
even though,
by (2.8) and the integrability properties of Gaussian quadratic foins, X (s,t) g does not converge uniformly a.s..
63
Thus (1.6) can not be replaced by (1.7), with any increasing function f satisfying
lim -4
f(x) //x = 0
and (2.6), as a sufficient condition for the
CO
uniform convergence a.s. of (1.1). Note that exactly the same argument applies for the corresponding Rademacher Fourier quadratic form, i.e., with (c c 3 m n (gmgril
replacing
in (2.7). 12 . In some cases J 1/2 ([0,27J ,d) < œ is necessary and sufficient for the
uniform convergence a.s. of (1.1) and (1.2). One of these, which is trivial, is when the coefficients
(am,nI
vanish outside some one—dimensional set of indices.
We write (1.1) in the form
(2.11)
E
where m < n
akgnagnice ik(ms + nt)
( s ,t) E [0,271 2
are non—negative integers. Using Gaussian decoupling we see that this
series converges uniformly a.s. if and only if z
(2.12)
e ik(ms + nt)
k "Oic mnk
,
(s,t) E [0,2171 2
converges uniformly a.s. where (g;a 3 is an independent copy of f (gmk grilk )
is an independent symmetric sequence in k
. Since
it follows from Theorem 1.1
[8,Chapter I] that the series in (2.12) and consequently (2.11) converges uniformly a-s. if and only if J holds
ifmkgnk/
1/2
([0 211] 2 ,d) < oo . Once again exactly the same argument '
in (2.11) is replaced by (e c ) mk nk *
The finiteness of
1 J1/2([0'27]2 ,d)
marginal processes formed from X (s,t)
Theorem 2.2. Let X (s,t) the processes X (s,t ) o t oE [0,27]
also implies the continuity a.s. of the and X (s,t) . c
be as given in (1.1). Then if
j 1/2 (1° ' 2712 ' d) <
and X (s ,t) are uniformly convergent a.s. for each fixed g o
E [0,27] . Conversely if X (s,t ) and X(s t) converge o g 0 12 uniformly a.s. for some t E [0,2n] and s E [0,2rr] then J 1/2 (10,27J ,d) < oo . o o and s
o
This theorem is also valid if X (n,t)
is replaced by X (s,t) .
64
be as defined in (1.9) . Let us note that
Proof. Let d l (s,s') and d 2 (t,t')
(2.13)
N([°,271,di ;20
j = 1,2 .
N([0,27 ],d;c)
We will show this for j = 1 . The proof when j = 2 is completely similar. Let 1 ,
(s k' t k ) ' k
, N([0,27: 2 ,d;C)
that cover [0,27 2 in the metric
be the centers of the balls of radius
d . For a fixed k consider
.
B c,k = U ((s,0) E [0,27 1 2 : d((s10),(skttk)) < c 1 Let (;'.
k'
0)
be some fixed element in B
c,k
. It follows from the triangle inequality
that B
,°)) < 26 e t k C U ((s,0) E [0,271 2 : d((s,0),(e k
since
d((s,0),(s k ,t k )) + Since d
1k
that if J/
(2. 1 4)
E
2
) = d((s,0),(
(To ' 277 2 ,d)
(
m n
<
k
,0)) we have verified (2.13) when j = 1 . It follows
co
then
J1/ /2
12)V2 gme ims
la
([0 27] d ) < ' 1
co
and this implies that
s E [0,217 1
m'n
converges uniformly a.s.. By Theorem 1.1 E8,Chapter I] , the uniform convergence a.s. of (2.14) is equivalent to that of
E ( E a g'e m,n n m n
int
°) g e m
ims
s E [0,2n]
which, by decoupling inequalities similar to (1.3), implies the uniform convergence = 0 for all a.s. of X (s,t ) . (To avoid confusing notation assume that a m,n o m n
in all these series.) A similar argument shows that
X (s t) g o
converges
uniformly a.s.. The converse follows from the proof of Theorem 2.1 since in the relevant part of the proof of Theorem 2.1 the only property of the continuity of X (s,t) that is used is that its marginats are continuous. All the above statements remain valid when X (s,t) is replaced by X (s,t) . c The following corollary is immediate.
65
Corollary 2.3. Assume that a £2
. Then J
/2
m,n
= a a m n
where (a
n
is a real sequence in
12 ([0,27_1 ,d) < oo is necessary and sufficient for the uniform
convergence a.s. of X (s,t) and Xe(st) .
A1 r , Proof. Since ta2 j E , and because of (1.3) , X (s,t) converges m uniformly a.s. if and only if E a g e ims converges uniformly a.s.. The result m m now follows from Theorem 2.2. The same proof is valid for X e (s,t) . Now let us consider stochastic processes of the form
(2.15)
EaE gge m,n m,n m n m< n
where
a
and
m,n
indexed
i(ms +nt)
(s,t) E [0,27: 2
,
(g 1 are as given in (1.1) but where (e .} is a doubly n m,n
sequence of random variables with P (c- 1 0,0
P( e00 . —1) . ,
,
that
which is independent of (gn } . It follows from Theorem 1.1 [8,Ghapter I
the series in (2.15) converges uniformly a.s. if and only if J 1/2 ([0,2n17 2 ,d) < oo where
d
is as given in (1.5). This observation has an interesting interpretation.
Let (0,,P)
be a probability space on which
(Emyn)
is defined. For each w E 0
we consider the Gaussian Fourier quadratic form
(2.16)
E a E (w)g g e m
The metric
i(ms +nt)
,
72 (s,t) E [0,211J .
d (as defined in (1.5)) is the same for all these processes irregardless
of w E 0 . Furthermore, if J 1/2 ([0,217]
2
d) < œ then for almost all w E
Û
the
series in (2.16) converge uniformly a.s. (with respect to the probability space supporting (g
n
). However, the series in (2.16) do not necessarily converge
uniformly a.s. for all w E 0 . This can be seen from the examples we just gave of Gaussian Fourier quadratic forms which do not converge uniformly a.s. but for which 72 J 1 /([0,27 J ,c1) < oo . Thus, unlike the random Fourier series studied in Es] 2 metric continuity of Gaussian Fourier quadratic forms is not determined by the L
d
given in (1.5). Once again exactly the same argument applies if (gn }
replaced by (En}
in (2.16).
Nevertheless, entropy still can be used to characterize a.s. uniform
is
66 convergence of Gaussian and Rademacher Fourier quadratic forms. However, as we will see in the next section, it is necessary to consider the supremum of classical onedimensional entropies over a family of metrics.
3. Gaussian random Fourier series with coefficients in a Banach space
Fernique's corollary
Theorem 3.1. Let
sequence of elements of
E
sup
t E Lc) ,2n]
(B,11.11) B
of Talagrand's theorem
[6]
[13]
be a Banach space with dual
is the following.
B* . Let
3.cril
be a
with only finitely many non zero terms. Then
dE gx eint H n n n
E!lEgnxiJI
(3.1) E
sup
Eg < x* x > e int l n [0,27] n n
sup
tE x* E B* .
where
Therefore uniform convergence a.s. of random Fourier series with coefficients in a Banach space is characterized through conditions involving
a family of
classical one-dimensional entropies. We will see in the proof of Theorem 1.2 that
(3.1) can be used to obtain a similar result for Gaussian and Rademacher Fourier quadratic forms. Using a set of one-dimensional entropy conditions instead of a single two-dimensional entropy condition to characterize uniform convergence a.s. of random Fourier quadratic forms seems to be necessary, as was shown by the class
of examples described in § 2. Theorem 3.1 sheds some light on these examples. Indeed, the quadratic form (2.7) can almost be realized as a random Fourier series with coefficients in a Hilbert space. Define a sequence
(x
n
of elements in
setting
Vj> 1 , VnEI(j)
where
(eic l
xn
=
r
denotes the canonical basis of
X(t).Egxe int , n nn
b.
/
e
) 1/2
Ni
kEI(j) k
A2 • Consider
tE[0,273 .
02
by
67
Clearly
Hx(t)11 2 = E
n
11 2 +
E < x ,x > g g
mn mn
m n
e i(ITI-11)t
(3.2)
= E
Thus (I)
11X(t)11
+ 2 E b. j
J
j
E
EI(j) m
g g cos((m -n)t) . mn
is closely related to the quadratic form (2.7). For the choices of
§ 2, X(t)
J x_ = (
)
1/2
is unbounded a.s. since for
E
e
kE TO)
( bc*.i d = 1 )
k
we have
E
e int l _ E < x44.- x > n n t E [0,27] n sup
E
b 1/2
sup
t E [0,27]
C (b iN i logN i ) for some absolute constant of the form
1/
E n E T(j)
g e inti n
2
C . Likewise the examples of §2 show that no condition
J f ([0,2171,15) <
oo
(see (1.7)) where
e l m-n)t 1 12 ) 1/2
i(m-n)t 8(t,t') = ( E l< x ,x > 1 2 le m n m,n
is sufficient for the uniform convergence a.s. of the Gaussian quadratic form in
(3.2). (The relationship between Theorem 3.1 and metric entropy conditions is clearly
E
sup
IlEgxe int H n n
EHE g x n n
t E [0,2TT1 n
+
J,/2 ([0,27],d x *) r/
sup
1
where
dx*(t,t')
= ( E< x * x n> 1 2 1e
1nt
e
int'12)1/2
.)
We now use Theorem 3.1 to obtain Theorem 1.1.
Proof of Theorem 1.1. We will give the proof in the case
(1.4) we will prove this theorem with
X (s,t) and X (s,t) g
X'(s,t) -Eaggie m,n m n g m
i ms+ nt)
E
p = 1
By
(1.3) and
replaced respectively by
(s,t) E [0,2n 2
68
and X' (s.,t) =
E
a
m,n
m < n
c c'e
i(ms + nt)
(s,t) E [0,2n 2
n
ITI
where, to simplify the notation we take a
finitQly many of the probability space
(Q,,31,p1) and
(g
are non zero. Let us define
a m,n
(W,Y,P)
and
(g1"1 1
and
(g,,, 1
(e m 1
and
m
E
w E Q
Eg
Ee
Eg i Let
(QX0',3)0 1 ,PXP') .
denote expectation on the product space
us ( E am,ngm(w)e3 as a sequence in
and consider
on the
on the probability space
and denote their corresponding expectation operators by
E c , . Let
us fix
m n . Recall that only
if
n
of elements
[0,2a1 . By Theorem
in the Bahach space of continuous complex valued functions on
3.1 we have E
g'
sup lE ( E am
s,t n m
g (w)e
,n m
ims
) g'e
int i
E
g'
) supl ( E am ,n gm (we s n m
ims. ) g
(3.3) g (w)e ims ) g' sup E sup l E ( E a g' m,nm
e inti
t n m
By Theorem 1.4 [8,Chapter I] we can replace
fe r,1 1
by
(g
(3.3). Doing this and taking expectation with respect to
E sup IX'(s,t) ! s,t
EsuplE(Ea g')ge / m,n n m s m n
ms
m
in the last term in
3
we get
I
(3.4)
sup IE(Ea
+ E sup
t m n
c'e m,n n
int
) g e m
ims
.
Let us denote the first and second term to the right of the equivalence sign in
(3.4) by I I
and II . By Theorem 1.4 [8,Chaptcr I] applied twice
12 ) 1/2 e e us — E sup 1E ( E la s m n m'n c')ce ims IE EsuplE(Ea s m n m'n n m
In analyzing
II
II
sup IX'(s,t)
.
s,t
we repeat the steps of (3.3) and (3.4) and obtain
E sup lE (
a c'e int ) s,t m n m,n n
(3.5)
—E sup IE(Eac teint) t m n m 'n n
me ims
a I m
int )ge ims 1 E, c sup- E suplE( E am,n c'e n m t s m n
69
Denote the first term to the left of the equivalence sign in (3.5) by I' second by
and the
II' . Using the same argument as above we have
g) e'e int I' = E sup lE ( E a t n m m,n m n
— E sup t
I E ( E am,n ETTI:) c ' e int n
E
l
E sup s,t
m
We use Theorem 1.4 [8 ,Chapter I]
II' — E
I
sup E
on
I X'e (s,t) I
.
and obtain
IT'
e sup 1E(Eam,n e nl e s m n
int
)e ims m
I 1
E sup s ,t
X '(s,t)
I
.
Thus we have shown that
(3 .6)
ElIX'(s,t)11
C E
The reverse inequality in (3.6) is obtained by
C
for some absolute constant
1)q(s,t)11
the contraction principle. This completes the proof of Theorem 1.1.
We finally prove Theorem 1.2. Proof of Theorem 1.2. It will be sufficient to show that
E
1IX
(s ,t)11
sup
E
1E
s E [0,2"a]
+ E
sup
1 2)V2 -
la
E
(A i l
j = 1 or 2
where as before a
m,n
1
lam 11
1 2 ) 1/2 gne int l
n m sup
TE C* sE [0,27] a.s.
for
e ims
min
IE(E
sup t E [0,271
(3.7)
E
m n
= 0
if
Eimsl < T,A > g m m
m
m > n
and where the sequences
have been defined in (1.11). Indeed, Theorem 1.4 [8,Chapter I] then clearly
implies (1.12). Following the notation of the preceding proof we show (3.7) for
j = 2
and with the left hand side replaced by
EHX 1 (s,t)11
proof of Theorem 1.1 (cf. (3.3) , (3.4) and the estimate of
by (1.3). As in the I ), we have
70
E supl Xics,t) I s,t
(3.8)
E sup l E ( E la 12) m,n n m s
II
II
m
ims ) g e int n
the second term to the right of (3.8). It is plain that
= E
2 ims E A g e
sup
m m C
where El is the norm on
then follows easily from a new application a.s. . (3.7)
of Theorem 3.1 but this time in
II
2 ge ims l
g e sup E , sup I E ( E a m,n m t n in
+ E
Let us call
1/
-E
g
C
a.s.
2 TEA g rn
. Indeed ,
sup TE C*
m m
ims 2 E sup E < T,A > g e m m s m a.E
I
and by Theorem 1.4 [8,Chapter I] ,
2 E A g m
m m -
= EsuplE(Ea g) 9'e int I t n m m,n m n E sl)-P I E Ç E t n in
am,n
2 ) -1/2_, s Int
Appendix.
Proof of (1.3) and (1.4) : the argument that we give here was shown to us by Gilles Pisier. We will first prove (1.4). It follows from Theorem 2 , 17=1 that for
p > 1
(A.1)
E 11X c (s,t)11 1) -
EH E
a
m
(e e' + e'e m,n mn mn
where the constants of equivalence depend only on
p )ems+nt)d i(
p . Therefore by the triangle
inequality
EllX 6 (s,t)H P
Ej
C
e etei(ms+nt)ilp a m
Therefore, taking (A.1) into consideration, we see that to establish (1.4) we need only show
(A.2)
, i(ms+nt) iip
ee e C'EHEa m
EH
E am,n(em e'l' + eln'e n )e m
71 Let
0
be a real valued random variable uniformly distributed on
[0,27]
and let
D I be an i.i,d. sequence. Let (e'l be an independent copy of (9 1 . We define n n n inen in e' principle for quadratic tn = e t' n = e n . It follows from the contraction forms, Theorem 1 , [7]
(A.3)
a
m< n tn
We replace
that (A.2) holds if
z a ( t t* t' t )■ e icms+ nt)!!p mn mn m,n m
m,n m n
by
tne
ins
° , t'n = tle n
expression on the right in (A.3) where
int s
°
o
t
and
(s-s ,t-t ) in the o o are fixed in [0,27] . These
(s,t)
and
H
o
by
changes do not change the numerical value of this term, i.e.
(e i(ms o +nt o ) t t E a mn m
= E =
,i(nso+mto) t , t ) e - i(ms °+nt 0 ) e “.ms+nt)HP mn
ct t , +e i[(n-m)s o + (m-n)t o l t , t )ems +nt) li p Z Eli a mn m, n mn
m
Finally we note that by Jensen's inequality
27 27 H =
47
2
f
00
H ds dt
o o
2rr 2rr (t + r e 1 l (n-m ) s0+ (111-11)t Jds dt t't m ,n m n 2 o o mn m
Ell Z
a
S
ms + nt)Ilp
= Ell Eam,n m tTn e i ( Ins -1-11 t)HP m
Thus we have obtained (A.3) and consequently (1.4). The same argument works in the Gaussian case. We introduce gill and al") — ig' for g.1) and in place of (t 1 and (t'3 in (A.3) where g = g
n
n
as defined in the introduction and take Since
(7:1 )
n
to be an independent copy of
gn is rotationally invariant the exact same argument as above gives (1.3).
References
[1]
A. Bonami : Etude des coefficients de Fourier des fonctions de
LP (G) . Ann.
Inst. Fourier (Grenoble) 20 , p. 335-402 (1970). [2]
C. Borell
Tail probabilities in Gauss space. Vector space measures and
applications, Dublin 1978. Lecture Notes in Math. 644 , p. 71-82 , Springer
(1979).
.
72
[37
C. Borell : On the integrability of Banach space valued Walsh polynomials. Séminaire de Probabilités XIII. Lecture Notes in Math. 721 , p. 1-3 , Springer (1979).
[4]
C. Borell : On polynomials chaos and integrability. Prob. and Math. Stat. 3 , p. 191-203 (1984).
[5]
X. Fernique Régularité de fonctions aléatoires non gaussiennes. Ecole d'été de St-Flour 1981. Lecture Notes in Math. 976 , P. 1 -74 , Springer (1983).
[6]
X. Fernique : Fonctions aléatoires gaussiennes valeurs vectorielles. Preprint (1985).
[7]
S. Kwapien : Decoupling inequalities for polynomial chaos. Preprint ( 1 985).
[8]
M.B. Marcus and G. Pisier : Random Fourier series with applications to harmonic analysis. Ann. Math. Studies 101 , Princeton Univ. Press (1981).
[9]
G. Pisier : Les inégalités de Khintchine-Kahane d'après C. Borell. Séminaire sur la géométrie des espaces de Banach 1977-78, Ecole Polytechnique, Paris (1978).
[10] G. Pister : Sur l'espace de Banach des séries de Fourier aléatoires presque sûrement continues. Séminaire sur la géométrie des espaces de Banach 1977-78, Ecole Polytechnique, Paris (1978). [11] G. Pisier : Some applications of the metric entropy condition to harmonic analysis. Banach spaces, harmonic analysis and probability, Proceedings 1980-81. Lecture Notes in Math. 995 , p. 123-154 , Springer (1983). [12] M. Schreiber : Fermeture en probabilité des chaos de Wiener. C.R. Acad. Sci. Paris, Série A , 265 , p. 859-862 (1967). [13] M. Talagrand Régularité des processus gaussiens. C.R. Acad. Sci. Paris, Série I , 301 , p. 379-381 (1985). [14] D. Varberg : Convergence of quadratic forms in independent random variables. Ann. Math. Statist. 37
p. 567-576 (1966).
RATES OF CONVERGENCE
IN THE CENTRAL LIMIT THEOREM FOR
EMPIRICAL PROCESSES by Pascal MASSART Université Paris Sud U.A. CNRS 743 "Statistique Appliquée" Mathématiques, Bit. 425 91405 ORSAY (France) -
SUMMARY :
In this paper we study the uniform behavior of the empirical brownian bridge over families of functions F bounded by a function F (the observations are independent with common distribution P). Under some suitable entropy conditions which were already used by Kolnnskii and Pollard, we prove exponential inequalities in the uniformly bounded case where F is a constant (the classical Kiefer's inequality (1961) is improved), as well as weak and strong invariance principles with rates of convergence in the case where F belongs to
L 2+6
(P) with
E 10,1] (our results
improve on Dudley, Philipp's results (1983) whenever F is a Vapnik-Cervonenkis class in the uniformly bounded case and are new in the unbounded case).
1.
Introduction
2
Entropy and measurability
3
Exponential bounds for the empirical brownian bridge
4.
Exponential bounds for the brownian bridge
5
Weak invariance principles with speeds of convergence
6.
Strong invariance principles with speeds of convergence
APPENDIX : 1. Proof of the lemma 3.1. 2. The distribution of the supremum of a d-dimensional parameter
brownian bridge 3. Making an exponential bound explicit.
Key words and 2hrases : Invariance principles, empirical processes, gaussian processes, exponential bounds.
74 1. INTRODUCTION 1.1. GENERALITIES. (X,X,P)
Let
be a probability space and
(x n ) 01 be some sequence
indepen-
of
-
dent and identically distributed random variables with law
brownian
1 n n i E_ i 6 x i
stands for the empirical measure
P
bridge relating to
and we choose to call empirical
the centered and normalized process
Our purpose is to study the behavior of the empirical
F,
where
F
rich
on a
(Q,A,Pr).
enough probability space
Pn
P, defined
is some subset of
brownian
vn - /6
bridge uniformly over
L 2 (P).
More precisely, we hope to generalize and sometimes to improve some classical
results about the empirical distribution functions on of quadrants on ]R
d
),
in the way opened by
R d (here
Vapnik, &rvonenkis
F
is the collection
and Dudley
.
In particular, the problem is to get bounds for:
(1.1.1) where I
Pr
( Hv n !! F > t)
4 F stands for the
mations of convergence,
vn
for any positive
uniform norm over
F
t,
and to build strong uniform
gaussian process indexed by
by some regular say
,
F with some speed
approxiof
(b n ) .
First let us recall the main known results about the subject in the classical case described above.
1.2.
THE CLASSICAL BIBLIOGRAPHY.
We only submit here a succinct bibliography in order to allow an easy comparison with our results (for a more complete bibliography see Concerning the real case
P
and are optimal
(d=1) ,
[26]) .
the results mentioned below do not depend on
:
1.2.1.
(1.1.1) is bounded by to
Dvoretsky, Kiefer
and
C exp(-2t 2 ),
where
Wolfowitz [24] (C < 4/2
C
is a universal constant according according to
[17]).
75
1.2.2. The strong invariance principle holds with
b n - log(n) ,
Komlos,
according to
In
Major and
Tusnady [37] .
In the multidimensional case (d
_> 2).
1.2.3.
(1.1.1) Kiefer
C(E) exp (-(2-E)t 2 ),
is bounded by
[34] .
In this expression
E
E > 0,
for any
cannot be removed (see
[35]
according to
[28]).
but also
1.2.4. The strong invariance principle holds with b
n
= n
1 2(2d-1) Log(n),
according
[8].
to Borisov
P
This result is not known to be optimal, besides it can be improved when
[0,11 d .
uniformly distributed on
1.2.5.
If
d
-
In this case we have
is
:
2:
The strong invariance principle holds with
b n - (Log(n))2 , /6
according to
Tusnady [50]. 1.2.6. If d > 3 :
—
The strong invariance principle holds with to
Csbrgb
and
1.2.5
1 3 2(d+1) (Log(n)) 2 , b n -n
Révész [14] .
and
1.2.6
are not known to be optimal.
Let us note that even the asymptotic distribution of (the case where in
according
d
- 2
and
P
ky n il F
is the uniform distribution on
is not well known
[0,1] 2
is studied
[12]). Now we describe the way which has already been used to extend the above results.
1.3. THE WORKS OF VAPNIK, hRVONENKIS, DUDLEY AND POLLARD. Vapnik
V
and
Cervonenkis
rally called V..-classes
-
introduce in
[51]
some classes of sets
for which they prove a strong
large numbers and an :exponential bound for
(1.1.1) •
-
which are gene-
Glivenko-Cantelli
law of
76
P.
Assouad
[40]
also
studies these classes in detail and gives many examples in
[3]
(see
for a table of examples).
P-Donsker
The functional
classes (that is to say those uniformly over which some
central limit theorem holds) were introduced and characterized for the first time by Dudley in
[20]
and were studied by Dudley himself in
[27]
Some sufficient (and sometimes necessary, see bounded) conditions for of entropy conditions
F
to be a
P-Donsker
[21]
and later by Pollard in
F
in case
[44].
is uniformly
class used in these works are some kind
:
Conditions where functions are approximated from above and below (bracketing, see
[20])
are used in case
F
is a
P-Donsker
restricted set restricted set of laws on respect to the
Lebesgue
X (P
F
is a
P-Donsker
of laws including any finite support law (the
-
belongs to some
is often absolutely continuous with
v .. Kol icnskil
measure in the applications) whereas
conditions are used in case
bility assumptions
P
class whenever
class whenever
V.L-classes
the classes of sets of this kind, see
are
P -
and Pollard's
belongs to some set under some measura-
[21]).
In our study we are interested in the latter kind of the above classes. Let us recall the already existing results in this particular direction. Whenever
F
is some V..-class and under some measurability conditions, we have:
1.3.1.
(1.1.1) Alexander in
is bounded by
[1]
and more precisely by
C(F) where
C(F,E) exp(-(2-dt 2 )
for any
in
E
]0,1],
according to
:
(i+t2)2048(D+1) exp(-2t 2 )
D stands for the integer density of
F
(from
,
in
[2] *
Assouad's
,
terminology in
[3]) .
1.3.2. f
(1.1.1)
is bounded by
4e'
D
E
\i=0
2 ))p(-2t ex 2 ) , according to Devroye in [16].
* Our result of the same kind (inequality 3.3.1°)a) in the present work) seems to have been announced earlier (in [41]) than K. Alexander's one.
77
1.3.3.
1 The strong invariance principle holds with b n -.= n
2700(D1), + according to
Dudley and Philipp in [23]. Now let us describe the scope of our work more precisely.
2, ENTROPY AND MEASURABILITY.
From now on we assume the existence of a non-negative measurable function F such that !f! < F,
for any f
in F.
v We use in this work Kolcinskii's entropy notion following Pollard [44] and the same measurability condition as Dudley in [21] . Let us define Kolcinskii's entropy notion. Let
p
and p (p) (X) F 2.1.
be in [1,+.[ . A(X) stands for the set of laws with finite support for the set of the laws making FP integrable.
DEFINITIONS. Let c
be in ]0,1[ and Q
N ( P ) (c F Q) F "
be in 4. 0 (X) .
stands for the maximal cardinality of a subset
G
of
F
for which:
(!f g!P) > EPQ( FP)
Q holds for any f,g c-net of (F,F)
in
G
with fg (such a maximal cardinality family is called an
11P) (.,F,Q) . relating to Q). We set NP ) (.,F) = sup QEA(X)
Log(N F(p)(.,F)) is called the (p)-entropy function of (F,F) . The finite or infinite quantities :
d(F) = inf fs>0 ; limsup 0
s
E N
(p) F
(c ' F) < col
e(F) =inf fs>0 ; limsup c s Log(N ( P ) (c,F)) < col F F 6-* 0 are respectively called (p)-entropy dimension and (p)-entropy exponent of (F,F).
78 Entropy computations. from that of a uniformly bounded family as
F
We can compute the entropy of
:
follows
- (T 1 (F>0) '
Let I
fE Fl ,
then
:
( ) N( F . ,' F) < N 1 P ( ' I) Q
For, given FP
so
E A(X)
in
A(X),
either
Q(F)=0
and so
qP ) (.,F,Q) = 1,
or
Q(F) > 0,
:
and then
Q(F P )
NPP ))(. ,I 1 ( 1 Q(F) n
N F( P ) (.,F,Q)
Some other properties of the
(p)-entropy
are collected in
The main examples of uniformly bounded classes with finite
[40]. (p)-entropy
dimension
or exponent are described below.
2.2.
COMPUTING A DIMENSION According to Dudley
:
THE V..-CLASSES.
[20]
on the one hand and to
Assouad [3]
on the other we
have
d (1 P (S) = pd whenever
[3]).
S
is some
[45] .
COMPUTING AN EXPONENT Let d
Dk
with real density
d
(this notion can be found in
Concerning V..-classes of functions, an analogous computation and its appli-
cations are given in
2.3.
V.C.-class
See also
:
HOLDERIAN
THE
a
be an integer and
[21]
for a converse. FUNCTIONS.
be some positive real number.
We write
8
for the greatest integer strictly less than a.
Whenever
x
belongs to
P„c1
for the differential operator
Let Let
II.11
be some norm on
k to A d , lk a kl k k ' Bx 1 1 ...x d d
and
stands for
f
and
Rd
Ad be the family of the restrictions to the unit cube of
13-differentiable functions
k 1 ”' +lc d
such that
:
Rd
of the
79
max sup
ID
!k!<13 x€Rd Then, according to
k “x)1 + max sup 1D k f(x)-D k f(y)! < 1 . !k!=i3 x#y Hx-y!1
[36]
on the one hand and using Dudley's arguments in
the other, it is easy to see that
[19] on
:
e (I P ) (Aa,d )
a
Measurability considerations.
Durst
M P-PE
and Dudley give in
[21]
an example of a V..-class
S
such that
1
So some measurability condition is needed to get any of the results we have in view. So from now on we assume the following measurability condition (which is due to Dudley
[21]) to be fulfilled :
(M)
.(X,X)
is a
Suslin space
. There exists some auxiliary Suslin space from Y onto F such that : (x,y) F
and we say that
T(y)(x)
(Y,y)
is measurable on
is image admissible
Suslin
via (Y,î)
and some mapping
(XxX, X
T
V)
.
This assumption is essentially used through one measurable selection theorem
[47]
which is due to Sion
(more about
Suslin spaces is given
in
[13]).
2.4. THEOREM. Let H on
X.
be some measurable subset of
Then
A
rable mapping from
XxX .
We write
A for its projection
is universally measurable and there exists a universally measuto
A
Y
whose graph is included in H .
A trajectory space for brownian bridges. We set
: l(F) =
We consider
: 1 °3 (F)
F-*
; h oT
is bounded and measurable on
as a measurable space equipped with the
(Y,Y)1 . a-field generated
80
by the open balls relating to
1 °3(F)
a-field because
11.11
(which is generally distinct from the Borel
is not separable).
P
This trajectory space does not depend on
[20 1 ) but only on the measurable representation
any more (as it was the case in
(Y,T)
of F .
From now on for convenience we set :
(Q,A,Pr) X
where on
\ (X ,Xw ,P )
(X ,X ,P )
8([0,1]), P oe
X)
[0,1] , B([0,1]) for the Borel
stands for the Lebesgue measure on
[0,1] and
duct
(ex [0,1],X°3
G-field
for the completed probability space of the countable pro-
of copies of
(X,X,P) .
The following theorem points out how
1;3 (F)
is convenient as a trajectory
space.
2.5. THEOREM. For any a in R n , Moreover , setting on
(F,p p )1, U b (F)
I a iX. i=1
is measurable from
u b (F) =
:F4- IR
is included in
l(F).
Q
to
l_T(F).
;his uniformly continuous and bounded Provided that
(F,p p )
is totally boun-
ded this inclusion is measurable.
L. 2 (P)
Where
is given the distance p (f-g), with 2 Gp : f÷P(f 2 )-(P(f)) 2 . For a proof of 2.5. see [21] (sec. 9) and [40] where it is also shown that many reasonable families (in particular
A
et ,u
and the "geometrical"
V.C.-classes) fulfill (M) . 2.6. REMARK. Since whenever
F
fulfills (M) it follows from [21] (sec. 12) that
UP n -P!! F -4-0 a.s.
(1) N F (.,F) < . and therefore : sup
(2)
NF
(E,F,Q) < NV q)
- ,F)
for any
E
in
]0,1[ .
( QEP2) F (X) This implies that the local behavior of the entropy function is unchanged when taking the sup in 2.1. over the set of any reasonable law .
81 3. EXPONENTIAL BOUNDS
FOR THE EMPIRICAL BROWNIAN BRIDGE
u
We assume in this section that for some constants any
f
in
F;
U - v-u
we set
F-u = {f-u ,
and
and
v,u
for
f E Fl .
The following entropy conditions are considered
:
a) d U(2) (F-u) < 00 b) e U(2) (F-u) < 2 Using a single method we build upper bounds for following two situations
1°) F.
Observe that
(1.1.1)
that are effective in the
: U2 ; ilap2 H F < -4-
nothing more is known about the variance over
In this case we prove some inequalities which are analogous to
quality
2°)
Hoeffding's
ine-
[30]. We assume that
H4H F
This time our inequalities are analogous to Bernstein's inequality (see Bennett
[5]) . 3.1.
DESCRIPTION OF THE METHOD.
N =mn .
We randomize from a sample which size is equal to Dudley's
[20]
or
Vapnik
following an idea from
In Pollard's
v and
Cervonenkis [51] symetrization
Devroye [16] ,
we choose a large
Effecting the change of central law type inequality, we may study
Pn -P N
: P-* P N
instead of
technics,
m=2
but here,
m.
with the help of a Paul
P n -P
[44],
where
Pn
Lévy's
stands for the
randomized empirical measure. Choosing some sequence of
-
measurably selected
-
nets relating to
PN
whose
mesh decreases to zero and controlling the errors committed by passing from a net to another via some one dimensional exponential bounds, we can evaluate, conditionally to
PN ,
the quantity
HP n -PN I! F .
82
Randomization . N=nm ( m
Setting from [1.n]
[1,N]
into
w
is an integer), let
be some random one-to-one mapping
whose distribution is uniform (the "sample w is drawn without
replacement"). The inequalities in the next two lemmas are fundamental for what follows :
3.1. LEMMA. For any
in
, we set
S = N
Sn =
i=1 N
\2
1=1 andU N -(max.)) - ( min 1 1
(.)) ; 1
10)
i=1
the following three quantities are, for any
S positive E , lower bounds for- Log
J
Pr (!lf
SN T1-1 > E)):
2n E 2 UN
nE
2°)
2 + EU N N
2 3°)
nE
2
2
2 ma N
These bounds only depend on
(U N ,aN ).
through numerical parameters
Bound 3°) is new ; concerning 1°) (due to Hoeffding [30]) , Serfling's bound is better (see [46 1 ) but brings no more efficiency when
m
is large.
The proof of lemma 3.1. is given in the appendix.
i From now we write
P n for the randomized empirical process
n
. 6 n i=1 No)
The
inequality allowing us to study the randomized process rather than the initial one is the following :
83
3.2. LEMMA. The
random elements
Besides, whenever
UPn-PNF andn-P F are measurable. Mcs2PI ! F —< p 2 , the following holds
2 19 2 2 , ) Pr( !P n -13 a E n any positive c and any a in (1
for
For a
proof
> E)
, n' F > (1-a) Tr E)
P n-PN
, where n = N-n .
this lemma see [16] using Dudley's measurability
of
arguments in
[21] (sec. 12) . Statement of the results.
3.3. THEOREM. The Pr
following quantities
( !v n !! F > t) 1°)
a)
h)
where k 2°)
are, for
any
positive
t
and
ri , upper bounds
for
:
if
d (2) (F-u) = 2d
if
2 0 1-1,F (1) (1 + L) 3(c1-1-1-1) exp (-2 U2 (2) < 2 , e (F-u) =
,
I
)
t k+n t2 0n,F (1) exp (O 1,F (1) (0) ) exp (-2 -7) U2 6-c C( 777 ) (when c increases from 0 to 2 so does k). Suppose a) if
that !!720 F < a2 , with G < U, then d u(2) (F-u) = 2d , 2 3 ( d + fl) ex p( o n,F (1) (2)- 4(d+n) (1 + L) U 02
h) if
0 n,F (1) exp(0
2(0- 2 +(3U+t)) ) in
4 2) (F-u) = c < 2 ,
11,F (1) (.g)U
where p - 2c(4-C)
t2
(when
5 (; ) 2n r
+1-, ) e Xp
t
2(0.2 + increases from 0 to 2 so does 2p) .
2 (3u ()P+fl +t)))
84 The constants appearing in these bounds depend on
N (2) (.,F-u)
F
only through
n .
and of course on
Comments.
2.2.,
From section
F
d (2) 1
the assumption
is some V..-class with real density Thus
bound
the factor
1')
1°)
0(F,) t 2nd
1°)
in
a) improves on
1.2.3. 1°)
moreover the optimality of
d.
F
but is less sharp than
on
F - A a,d Bakhvalov proves
[0,1] d
then
(2t2)i
2.3.
then, from section in
[4]
P
that if
in the real case
;
exp (-2t 2 )
we have
:
(*)
d e (2) 1 (F) = a- .
In other
stands for the uniform distribution
:
1_
Hn H
a
d
F > C n2 1°)
Thus we cannot get any inequality of the
e (2) 1
1.2.1.
i=0
Suppose that
,
a) is discussed in the appendix where we prove that
1'H- cc
respects,
in another connection
is the collection of quadrants on R d )
( Ilm n r F > t) > 2
Pr
1.3.1. ;
a) is specified in the appendix.
d-1 lim
is typically fulfilled whenever
a) is sharper than those of
In the classical case (i.e. bound
(F) <
surely
or
2°)
type in the situation where
(F) > 2 . The
border line case
:
,
For any modulus of continuity in the same way as
Ad by changing
greatest integer for which
(1)(u) u -I240
It is an easy exercise, using
we can introduce a family of functions
u÷u a
q)
and defining
as the
holds.
Bakhvalov's
!v
into
A ,d
method, to show that
:
> C (Log(n))Y
d
provided Of course
that
e 1(2)
4)(u) = u 2 (log(u -1 )) Y (Ad) 4D,
)=2
and
P
is uniformly distributed on
and we cannot get bounds such as in theorem
[0,11 d .
3.3.
(*) So, there is a gap for the degree of the polynomial factor in the bound 3.3.2 ° )a) between 2(d-1) and 6(d+n) .
85
But the above result is rather rough and we want to go further in the analysis of the families Then the
around the border line.
Acp,1
(2)-entropy
A a, 1 concerning the Donsker
plays the same role for
property as the metric entropy in a Hilbert space for the Hilbert ellipsoids concer-
pregaussian
ning the
1
1 (i)
A(1),1
:
property, that is to say that the following holds
P-Donsker
is a functional
2 (2) (Log(N 1 (E,A ))) q), 1
class whenever 10
dc
<
co
.
1 (ii) A
(1),1
X-Donsker
is not a functional
Log(N (2) (E,A (1), )) 1 1
and in this case we have
(E Log(c)Y
(i)
follows from Pollard's central limit theorem in
(ii)
follows from a result of Kahane's in
qb(u) -
In fact, if we set
e (t) n n KnLog(n)
E
t --,- E n>1
V 1 Log(u)1 u )1 ,1
belongs to
,
[32]
about Rademacher trigonometric series.
(E n !W(e n )!)
(E n )
with
being independent of
L n>1
>
A (1),1
W(e nLog(n) n )1
W
some Wiener process provided that of G
.
So
(*) We write
A
f
(lb, 1
is not
g
, when
brownian
W(1)
pregaussian
0 < lim
that
1,
where
(E n )
we may write (We n ))
1 nLog(n)
bridge G for
N(0,1)
(ii)
is proved.
(fg 1 ) <ï
:
•
is some and
:
(IW(e n )!) -
is almost surely unbounded on
The same property holds for any
66
p K -1K4-
L 2 ([0,1]),
E IW(e)
By the three series theorem the series almost surely and therefore
p.
p K , the following holds
So that, with probability more than NI
[32]
(27nt) .
Let us consider a standard Wiener process on as
[44].
we have from
cos
(*)
22 .
with some probability
e n (t) = /2
is a Rademacher sequence and
.ci , (u)
class whenever
u )2 ILog(u)I
(
diverges to infinity
A
1
f-)-G(f) + fW(1)
is
random variable independent
(fg -1 ) <
86
"An upper bound in s?:tuation 2°) is also an oscillation control". If we set
= {f-g ; G (f-g) < o , f,g E Fl , it is not difficult to see that:
G
N
314. U
thus changing
2U
into
situation 2°) hold with
(2) G +U) 2U (" a and d
G
U 2d
into
' if necessary the upper bounds in
the constants being independent of
instead of F ,
a
because of 3.4. . In particular if
F
d, we set :
is a V.0.-class with real density
A(a,n,t) = Pr ( Hy n !! Ga >
.
At it is summarized in [231 Dudley shows in [20] that is small enough, o = 0
n > 0(t
and
(!Log(0
-r
)
A(a,n,t) < t
whenever t
r > 8 .
with
Applying 3.3.2°) a) improves on this evaluation for then
A(c,n,t) < t
t
whenever
t
is small enough, o
and
- "iLog (t)1 ) t
n > 0
-4 bound 2°)a) depends on
In order to specify in what way the constant in
F,
we indicate the following variant of 3.3.2°)a) .
3.5. PROPOSITION. If we assume that o
E
1
]0,1[
in
Ha 2PII
]0,1[ that
in
N
(2)
(E,F-u) < C (Et E) -2d for any
< a2
with
F—
depending only on
E
o
g
not exceeding
and a constant
K
E
]0,1[
in
and some
U, then there exists some depending only on
C such
that : Pr ( Hy n !! F >
From now on
L
t 2 14d ( -d o -4d
t2 2 U(31.1+t),) 2(a +
x+-Log(xve).
stands for the function
3.6. COROLLARY. Let
(D n ) .
(F ) n
be some sequence of V.. -classes fulfillino(M) with entire densities
Then (with the above notations) Pr (
t ; whenever
G
2 n
= o(1/(D n L(D n )))
and
an 2
0
Gn - 0 (070 .
> t)
0
for any positive
87 (Provided that
in D n = o(--) such a choice of o Ln ' n
does exist).
Comment. { v n (f), f E F n } [38] (Lemma 2) and applying 3.6. the process i admits finite dimensional approximations whenever D n = o (Lon g(n)) and provided According to Le Cam
that Le Cam's assumption (Al) is fulfilled. This result improves on Le Cam's corollary of proposition
< 21 is needed.
for some
y
Proof of
3.6. F
Let
4, V.C.-class with entire density
be a
w > d (or w > d if d
show that, for any
(2)
in
]0,1[ ,
(2) (E,F) N1 C
C 31 e 5D
2 3D
in
w-D , K =
d
.
it is easy to
:
2w
(2D) D .
: e
for any
c
Hence, for any
1'
is "achieved"), we have
with in particular when
So from Stirling's formula we get
tant
and real density
1+(112 LogE) exp (2w) (1 + 2!LogE )w (E,F)
N1
E
D
[20] (more details are given in [401)
Using Dudley's proof in
for any
D n = 0(n -Y )
3 where
]0,1[
10, 1-1 i/-
in
we have
and some universal cons-
:
N 1(2) (E,F) < C 3 (2e) 5Dc -4D — 1 thus, applying
3.5. to the class
G
yields
3.6.
œn We propose
below
another variant of
inequality
3.3.2°)a), providing an
alternative proof of a classical result about the estimation of densities.
3.7. PROPOSITION. --
If we assume that some positive bound of Pr
d U(2) (F -u) = 2d < co and
( Ily n H F > t)
o V,F,r1W
t 2 3(d+I)
(n(1 + 7))
In the situation where
3.3.2°)a) .
is, for any positive
U
t, given by :
a _4(d+n) exp ( (0)
t2 2 U 2 (o + -- (CLLn (IL + c5)+t)))
VT' in is large this inequality may be more efficient than
88 Application to the estimation of densities : minimax risk.
K
Let
K q)
where and
M
= 11)(y'M )
(y)
kxk
is some
V.L-class
[45]
(.-x),
for any
P
Now if we assume that
Rk ,
measure on
is a
We set
K
E
ME
Rk
2
1 1 E- 2,2]
, xE R k
: 10,1[
in
C
where
w
and
depend only on
is absolutely continuous with respect to the
f
the classical kernel estimator of its density
n
K
into
that the class
of functions and so
N 1(2) (c ' K) < CE -w
(x) = h -k P n
with fixed
tp
Lebesgue
:
h
M
and
is
k.
so that
f 2 (x)dx <
.
T = E(f n )
Proposition
3.7.
gives a control of the random expression
F = th -k/2 K(' -x )
x E Rk 1 , G IL Ln
So, if we assume that
Pr
R
matrix.
K
where
Rk
y in
for any
is some continuous function with bounded variation from
Pollard shows in
is a
Rk :
be the following kernel on
= C
> h -k —
(vçj D n > t) < 0(n a ) tI3 exp
and
c2'
U = h - k/2
C2 >
where
we get, setting
2 (c 2
fn -T
by choosing
:
il .[K2(x)dx. j
D -n stip!f f(x)1 : - x (x)n
t2 0 (J_Ln )
t
)
41" for any
t
in
[1 + 04
and some positive a and
E (vrj D n ) < T + 0(n œ )
T
exp
2(c 2
13 .
T2 0 ( LLn ) %KJ
for any
T
in
We choose
[1, + .[,
provided that
T = 0(/g) ,
thus
nh k >
:
E(D) n
0 ((-15) 2 )
Hence, after an integration:
T In7
)
89 f
Provided that
T-f can be evaluated so that the minimax risk associated to the uniform R k and to 9 can be controlled with the same speed of convergence as
expression distance on in
[29] ,
belongs to some subset of regular functions G , the bias
via an appropriate choice of h
.
3.8, SKETCHES OF PROOFS OF 3.3., 3.5., 3.7.
= (fT?, fEF1
First, by studying the class G
u=0
and
v=1.
s,13
and positive
We set
3.3..
Let us proof theorem
parameters such as
: N =
:
p
a,
(in
F,
instead of
[42]) .
we may assume that
All along the proof we need to introduce
]0,1[) ; r, m
; a
(in
]1,+.1); q
(in
(in
10,21)
which are all chosen in due time.
and y
,
mn
(More details are given in
and
E = --
E' ' (
1 - 1)
(1-a)
E .
1/171
We write Pr (N)
(x
(.)
The
IP n
(
r
( Iv n
- P N II > EI)
> t)
for short.
will follow, via
3.2., from
a bound for
which is at first performed conditionally on
(x
1'
...,xN ) .
chain argument. Let
(T i ) i>1 be
a positive sequence decreasing to zero.
For each integer j of
.4
instead of
and
' xN )
A bound for Pr Pr
for the probability distribution conditional on
2.4).
A projection
2 < T. — J
2
a
T.-net F. can be measurably selected (with the help
7i may be defined from
F onto
Fi so that
holds.
Then
Il(P n - P N ) ° (Id - 7 r )I
<
n
j>r+1 So,if(.)is na
- P N ) ° (7. j
a positive series such that
j>r+1 Pr
where
A and
B
(N) (P nN
are the
I
>
E')
<
A
n. < p —
+ B
(x1''"'xN)-measurable variables
:
- Nr Pr (N) (r( P n - P N ) 7 r > (1 - P) C) B = E N2 Pr ( '11J' n -P N) 0 (7,-7J-1 J - )r > fi. J s') j>r+1
A
J-1 we get
:
90
stands for NV ) (TF))
N-
(where
is the principal part of the above bound and
A
B
is the sum of the error
terms.
1 ° ) or 2°) of Lemma 3.1. are needed to control
Inequalities whether case Bound
A
according to
1°) or 2°) is investigated.
3°) in Lemma 3.1. is used to control
B ,giving : ,2 2
2. exp ( 3.8.1. 4m 1 j2 - 1) j>r+1 N J 1 I1 7-71 Choosing fl- = ( j - 1) -a and r - 2 + '(- ) I, (so E fl- < p holds whenever Lu j>r+1 J > 2), the control of the tail of series 3.8.1. is perfb-rmed via the following B < 2
E
-
:
elementary lemma 3.8.2. Lemma. Let
tp
[r, +0,4 -* R. Provided that
following inequality holds
where
q)
is an increasing convex function, the
:
1 E exp (- 11)(j)) < exp (-t ( r)) j>r+1 cl (r) tpi 'd stands for the right-derivative of tp . We choose
13
1
13 -
under assumption h)
.
3.3. in case 1°).
Proof of theorem
We choose
under assumption a) and
a
= t -2 , m = [t 2 i and
T.
A
and apply
.
J
VF
< 2 M r e 10 exp (-2t 2 (1-2p))
3.1.1°), then :
P EIN -a.s.
Under assumption a). Considering the type of inequality we are dealing with we may assume that .
J
N < C t2d j 2((1+1)d (instead of N. < C' t 2d' j2(a+l)d'
J
We choose
, p = t -2
and a A
and
for any
d'
> d) .
—
- Max (2, 1 +
4d
, so :
< GnF(1) ( 1 +t 2 ) 3(d+n) exp (-2 t 2 ) — ,
P NN -a.s.
91
whenever
oN
0 n,F (1) exp (- 2 t 2 )
B t 2 > 7+4d(a+1) .
Now the above estimates are deterministic, so using
3.2., theorem 3.3. is proved in situation 1°)a) .
Lemma
3.5. note that, setting a
With the idea of proving proposition
, under the hypothesis in 3.5. , that
method gives
with
Pr(
= 2 , the above
v n !>t) is bounaed by :
-1 4d (2+t 2 ) 12d exp (- 2t 2 ) K 1 (E o t) K 1 depending only on C , whenever t 2 > 7+12d.
Under assumption b).
We may suppose that N J We set for y
< exp (C t
p = t-2 Y , where
> y(d -
and
tion when a = +.0
13 > 1
(1+2y(P.:1Hf ) ) = 2(1-y) , then we choose a large enough y( C)
to hold, where
: 2(1-y( ))= k).
(Namely
- 0
A
j (a+5) )
So
is the solution of the above equa-
:
(1) t k+n ) exp (-2 t 2 ) n,F (1) exp (0 n,F
and
B = o (1) exp (-2 t 2 ) n,F t 2 >+5+ C t C 2 213+2 —
whenever
So theorem
Proof
3.3. is proved in case 1').
of tneorem 3.3. in case 2 0 ).
We set
(4)- a and choose m=(,0 (1 , a =
The variable A
P - (P -G1 and
is this time controlled with the help of
2
problem is to replace
3.1.2°), so now the
2
GN by up
In fact, let O N be the
(x l ,...,xN )-measurable event :
2 2 2 aN2 - a 2p '' > sl , where a(f) =P N (f ) - (P N (f))
N
Each term of the following estimate is studied in the sequel Pr where
A'
= E (A II
j - (erh3)
T
c)
( and
rP n -P N
> E') < Pr
B' = E (B 11
c)
(a N )
+ A' + B'
for any
:
f
in
F.
99
Bounding
is a prColeT of
Pr (ON )
F2
For, setting
_
type
1')
f E F} , we have :
a2p < IPN - Pll 9 + 2 11PN - P F(2) 2 (2) Since N (.,F ) —1 ( 12 ,F) and F 2 fulfills (M), we may use the bounds < N 1 2 in 3.3.1°), so, choosing s , we get : Pr (ON ) < C o exp V2nm
IIGN2
The evaluation of
kJ12\1 A' <
A'
< a2 +s
and .8'.
holds on
2 N r exp (5 (42-q ) exp (
Moreover
-t
:
gives
2
,
0 > 2 .
whenever
2 (0 2 + ( t +j -c1/2
: B
Now
Orcl , thus applying 3.1.2°)
< 2
, 22 (j...1) 23 ) N.2 exp (- nc 4 a j>r+1
the proofs are completed as in case
a = Max (2,1 + 4d T1 ) large enough for
under assumption a) and
1-
< p + n and (3 > 1
To prove proposition
q-2
11, choosing this time o = (2-c)
and
)+ 1) • 1,,,itn
+
- 1
et
to hold.
3.5 we choose a = 2, so
A' < 2 C c o-2d e ' a-2d (4) 2d (2qt,o26d ) exp (
t
2
02+
2 1.6.1
B' < 2 C 2 c-4d a-4d (4)4d (2+T 2) 12d exp ( o whenever
4
)
cp 2 > 8 + 12d .
Besides, using
3.8.3., we get :
Pr (ON ) whenever
2 2 (cp -8)
2 4 —> 7+12d,
(1-
2 2 I2d < 2 K1 4 ) -4d () 4d (2 + TiF) exp (- 2;2 ) which completes the proof of proposition
3.5
via Lemma
Proof of proposition 3.7. We assume that
u=0
and
v=1 .
Inequality
3.3.2°)a) may
be written
:
3.2.
93
t
o
Pr (
:
v
> t) < K a
n
t2
2
-a
M
1 ( t ) OE2 exp ( -2
2 (02
a
(3+t) ) ) in
2
whenever -- > 5 . G
2 —
Defining the following sequences by induction : a. = 5 J+1 b- = with a
1
o
Pr (
and b
0
/6
,
v ri ll > t) < Ka i a
44d+
( 1
ii,71
/171
1 a
1-,
G
1 )
j , 1-
no
AT)
we call - 1
2b
(2
M. the following inequality :
,2 t2 t 2 '1 2 , ) whenever L'.> 5 (7) exp ( 2 — \ 2(c2 +b. + -11.-)/ G c
in
J
from M. by the The, assuming that M- holds, it is possible to deduce M j+1 J J same way as 3.3.21a) from 3.3.1°)a) (technical details are given in [421) . Then inequalities (M.) hold by induction. J =
LLn + [77]
Using inequality Mo , where
and a few calculations yield proposition 3.7. .
/4.
EXPONENTIAL BOUNDS FOR THE BROWNIAN BRIDGE
2 We assume that P(F ) < section still hold
. We want to show that the bounds in the preceding
for the brownian bridge.
4.1. THEOREM If e
(2)
(F) < 2 , then there exists some version G
of a brownian bridge rela-
ting to P whose trajectories are uniformly continuous and bounded on (F,p p ) Moreover, setting c = e Pr ( G 0
p F
fl,F
(2)
(F) ,
2 2 2 if o II < a < P(F ),
an upper bound for
> t) , is, for any positive t and 11 , given by :
(1) exp (0
) fl,F
2 t ( p(F 2 ) p2+ -n ( t ) 2p-ç+n + ( I)2p+n,) e xp H --2) 2c
(2) or, if more precisely d F (F) = 2d <
, by :
4.1.1.
94
2
0 fl,F (1) (P(F )) p
where
2d +ri
2 t2 t—2)2d+fl exp (- --2) 2a a
(1
a
is defined in the statement of theorem
4.1.2.
3.3.
Comments.
4.1.
In the framework of theorem
brownian
bridge is an easy consequence of the proof of
well known result (see than the more general written
:
Pr
(
G
F
[18 ] ) .
Moreover the bounds in
Fernique-Landau-Shepp
t 2 ,) , --2 2o
> t) < C(u) exp
regular
the existence of a
4.1.1., 4.1.
>
a
but is of course a
are in this case sharper
[25])
inequality (see
for any
version of a
f OP
that can be
F
Proof of theorem 4.1.
If
F
is countable :
The calculations are similar to those of the proof of theorem
(F,F) relatng to P
of course a sequence of nets in
is directly given
the following single inequality is used instead of Lemma
4.2.
then
V
gaussian
be a real and centered
Moreover
3.1.
random variable with variance
> s) < 2 exp (-
,2 2v 2
for any positive
The choice of parameters being the same as in the proof of
2
.
v2
: Pr (V
T-
but here
:
LEMMA Let
3.3.
2 a
--2(a+)
j
P(F )
),
4.1.1.
4.1.2.
and
are proved.
Since
s.
3.3.2°)
4.1.1.
oscillation control, the almost sure regularity of Gp follows from
(except
is also an
Borel-Cantelli.
The general case.
(F,p r )
Since
constructa
is separable
regular version of
the familiar extension principle may be used to
brownian
on a countable dense subset of F. this version.
,
bridge on
Inequalities
F
from a regular version defined
4.1.1.
and
4.1.2.
still hold for
95
Comment. The degrees of
The optimality of bound 4.1.2. is discussed in the appendix.
and in 4.1.2. ; the reason is
the polynomial factors are different in 3.3.1.2°)a)
that bound 3.1.3°) is less efficient than bound 4.2. .
5, WEAK INVARIANCE PRINCIPLES WITH SPEEDS OF CONVERGENCE
We assume from now that
P(F
2+6
) < co
6
for some
in 1 0,1] .
Using the results in sections 3 and 4, we can evaluate the oscillations of the empirical brownian bridge and of a regular version of the brownian bridge over F ,
E k -valued processes
so we can control the approximations of these processes by some (where
E
k
is a vector space with finite dimension
The Prokhorov distance
k ).
between the distributions of these two processes is estimated via an inequality from
Dehling [15] allowing reasonable variations of
k
with
Oscillations of the empirical brownian bridge over F
from 3.3.2°)a)
F
■,),1 over
The oscillations of
n
.
are controlled with the help of a truncation
(the proof in this case is straightforward ) on the one hand and of
a slight modification in the proof of 3.3.2°)b) (truncating twice) on the other hand. We shall not give any proof of the following theorem (the reader will find it in [421) .
5.1. THEOREM. = P(F
We set
2+6
then an upper boundfor
) .
If we assume that
Pr ( Iv rj F > 0
H52! P F — <
2 6
is, for any positive
2 < P(F ) t
given by : a) If
42) (F) = 2d < co ,
0 (1)n7d
( t ) 8d ex _ ( P
t2
802 ,, ,,,LLna, ■ ) + 128 n -6/2 o-26 t-2+6
whenever the following condition holds :
with
such that
AG
>
1,
t 2 /02>1,
96
n 5/2 cy 2+26 t -6 (2) (F) = eF
b) If
> 64 p 5
5.1.1.
—
< 2 2 exp
OF(1) exp (O n,F (1) ( -10 ) 2P+n (1+
240+ 0 11,F (1) exp (- 116 q ) 2-2p+fl )+ 011 (1) (1+p5)2 ( n () 2-2 13- fl) -6/ 2 ( G-2-6+a-5) + 0(1) 1_1 5 n -5/2 CT -26 t -2+6 (p
for any positive
is defined in the statement of
n
3.3.) whenever 5.1.1. and the following hold :
n 5/4 CT 2+5
5.1.2.
Remark.
Note that
Yukich in [54] also used KolCinskii-Pollard entropy conditions to
prove analogous results to theorems
3.3. and 5.1., but our estimates are sharper
because of the use of randomization from a large sample as described in section
3 .
Speed of convergence in the central limit theorem in finite dimension. We recall below a result that is due to same direction is due to
Dehling [151 (the first result in the
Yurinskii [53]).
5.3. THEOREM. Let
(X.)
We write and Let
G
be a sample of centered
1
n
for the distribution of the normalized sum of these variables
gaussian distribution whose covariance is that of X 1 . k be an euclidian pseudo-norm on R and 7 2 be the Prokhorov distance
for the centered I
2
that is associated to
7
where
P k -valued random variables.
K
2
HI 2
(F n' G)
.
If
E ( IIX 1 II 2 +6 ) = p
r3 1 4 1/4 ,i1/2, (1 + IL(n -612 k-1 IA)! k 1 p )
is a universal constant.
Weak invariance principles for the empirical brownian bridge. In order to build some regular versions of
brownian bridges with given projection
97
on a finite dimensional vector space (or further in section 6 on a countable product of such spaces), we need two lemmas.
LEMMA
5.3.
(Berkes, Philipp [6 ] ) .
Let R 1 ,R 2 ,R 3 be Polish spaces,
Q 1 and Q 2 be some distributions respecti-
vely defined on R 1 xR 2 and R 2 xR 3 with common marginal on R 2 .
Then there exists
a distributions Q on R 1 xR 2 xR 3 whose marginals on R 1 xR 2 and R 2 xR 3 are respecQ 1 and Q 2 .
tively
Remember that l(F)
The following lemma is fun-
is generally not separable.
damental to avoid this difficulty (see [23]) .
The space
Q
to be mentionned below
is defined in Section 2.
LEMMA
5.4.
(Skorohod [48])
Let R 1 ,R 2 be Polish spaces and Q ginal is
q
on R 2 .
If
V
be some distribution on R 1 xR 2 with mar-
is a random variable from
q , then there exists a random variable Y
tribution of (Y,V)
is
from
Q
to R 2 whose distribution
Q
to R i such that the dis-
Q .
Concerning our problems of construction the point in the sequel is that the
distribution on
l oe (F)
T
of a regular version
of a
brownian bridge is concentrated on
a separable space. Now we can state some weak invariance principles for the empirical brownian
bridge with speeds of convergence. 5.5. NOTATIONS.
From now
[0,1] x IR +
y and 13 are positive functions that are respectively defined on and
[0,2]
by :
x
Y(x,Y) - 8 + 2y (4 1 x)
and
13(z) -
--
where, as in the statement of theorem 3.3. , p(z)
2(1-p(z)) z(2-2p(z)+z)
2z(4-z) 4 + z(4-z)
98
5.6.
THEOREM Under each of the following assumptions there exists some continuous version on
(F,p p)
of a
where
e (2)
brownian (B n )
(a n ) and and a)
d (2) are defined (2) (F) = 2d < If d
T <
a')
If
in Section
W ) (E,F) <
42)(F) =
d
T
FEL 2+6 (P)
and that
2) :
= 5n = 13(n-1)
C E-2d
(LE-
)
1,d
E
for any
in
]0,1[ .
= 13n = 0((Ln) (1/ 2)+d
<2 un = 0((Ln) -T )
for any
( lk) n -G (pn) ii p > a n )gn
y( 6,d)
an h) If
that Pr
G
are defined hereunder (we recall that
an for any
P,
bridge relating to
and
0((Ln) -5 )
13n
s.
and any positive
Proof of theorem 5.6.
be an oscillation rate (depending on
Let a
F
of
on a
a-net
We approximate Setting d
into
ted.
2d
vector space Writing
uniformly over
, pp (f,g) <
irco
,
by
GG
and
[49] ,
B = ucc (Fn, ,GG ) .
5.1.
liv n -vnaHa il F <
to
G
on the
ofvnIF(G)
be the corresponding
gaussian
G
a
(changing
can be
evalua-
G(a)
distribution.
with respective distributions
: ( 11 v n
11 F ( G ) > B) <
k-dimensional
1.1i F(G) and applying (Q' ,A' ,Pr' ) and two
there exists a probability space and
(o)
such that
be a projection
vn 0110.
we may apply theorem
be the distribution
Pr' where
GI
F
for the Prokhorov distance associated to
theorem
l oe (F(a))
7 0.
a
F n,a
1 °3 (F(a))
random variables on
vn
if necessary), hence the quantity
Besides, let
Strassen's
relating to
a
and
P
F(a)
G= {f -g
n)
B
F n,G
and G
99
5.3. ,
So, using lemma of a
brownian
5.4.
applying lemma
Q
with
Pr(
Gp
bridge
we may ensure the existence of some continuous version
with
V:
CO-*
\) nF(G) 1
=
,
( HvO G > 0
Pr
5.2.
and
C-
is used to control
a k < N F(2) (2,F)
noticing that
C
Moreover
Pr
B
is constructed on
.
( dG p l G > a
(with
we get
:
and
F(a) )
2.6. .
is evaluated with the help of theorem
t
.
t)
1 4 F(cy) < 114 2
according to remark
are completed via an appropriate choice of
6,
Gp
( Hyll -G p F > 2t+B) < A+B+C
a Theorem
we may assume that
and then,
t v n F(a) -G pIF(g) 1 F( a ) - d(v il -GF)0 110 11 F , Pr
A
G(a) - G piF(G)
such that
lry niF(0) -G p F(o rt F(a) > B)
Hence, noticing that
where
P
relating to
a
4.1.,
so the calculations
.
STRONG INVARIANCE PRINCIPLES WITH SPEEDS OF
CONVERGENCE.
The method to deduce strong approximations from the preceding weak invariance principles is the one used in
[43]
to prove theorem
2:
the weak estimates are used
locally, giving strong approximations with the help of maximal inequalities and via
Borel-Cantelli
lemma.
Maximal inequalities. As was noticed in from the one given in
[23], [10]
the proofs of the following inequalities may be deduced
and in
[32] .
Notation. We set
X. J
= 6 X. - P J
for any integer
j
100
6.1.
LEMMA (Ottaviani's inequality).
Sk
We set
•E j.1
j .
X.
Then, for any positive a
(1-c)
Pr (max
ilso F
>
the following inequality holds:
,
( 1%11
2a) < Pr
F> a)
k
where
max
Pr
lis k r F
(
>
a)
.
k
:
available
6.2. LEMMA Let
bles
(Paul Lévy's inequality).
be independent and identically distributed
(Yi)1
(B,
where
trical
then
where
k
:
normed
is a
)
11S 1,11 > "
Pr (max
vector space.
a) <
2
Pr
B valued -
If we assume that
( 11 S n i>
random varia-
Y1
is
symme-
a) holds for any positive
,
Strong approximations for the empirical brownian bridge.
6.3.
THEOREM.
(Y i ) j>1
Under each of the following assumptions some sequence versions of
brownian
defined on
0
a) if
such that
<
a')
1
H En j=1
(X.-Y.)
- 0(n -(1 )
(X -.)I
a.
s.
F
, N F(2)(e,F) < CE -2d (1+LE -1 ) d
F e F(2)„ = < 2 , n E i=1
y(.,.)
and
13(.)
for any E in
((Ln) (1/2)+d
1
- 0(Ln)
(X.-Y.)11
F
13 < (3() .
Where
may be
2(14--y(777 '
1 0-1 for any
(F,pp ),
d(F) = 2d <
if, more precisely
h) if
that are continuous on
:
1 1 En -VT1 j=1 for any a
P
bridges relating to
of independent
are defined in
5.5.
a.s.
](),1[
(Ln) (5/4)+(d/2) ))a.s. ;
101
For a proof of 6.3., see [42].
Comments. When passing from weak invariance principles to strong ones, the speeds of convergence are transformed as follows within our framework : (i)
n -y ,n
2(1+-y))
in case a). (ii)
Ln -/2
Ln
in case h) . Transformation (ii) appears in theorem 6.1. (under 6.3 .) from [23], it is not
(i) in the same theorem (under 6.4.).
the case for transformation
On the contrary transformation (i) is present in finite dimensional principles and appears to be optimal in that case : more precisely, the rate of weak convergence towards the gaussian distribution for 3-integrable variables is ran-
ging about
n-1/2
when the rate of strong convergence is ranging about
n-1/6
(see
[39] for the upper bound and [9] for the lower bound), in the real case .
Application to V1% classes. -
1 =1
Applying theorem 6.3. with real density 0(n -a )
in the case where F is a V..-class with
d, we get a speed of convergence towards the brownian bridge that is
for any
a
<
1
18+20d
.
This improves on
in the classical case of quadrants in
6.4. INVARIANCE PRINCIPLES IN
1.3.3.
but is less sharp that 1.2.4.
Rd .
C(S).
Following an idea from Dudley in [21] (sec.11), the study of the general empirical processes theoretically allows one to deduce some results about random walks in eeneral Banach spaces.
As an application of this principle
C(S)
metric space (SK) and the space ped with the uniform norm
1.
CO
equipped with the Lipschitz-norm:
.
Let
let us consider a compact
of real continuous functions on
X
S,
equip-
be the space of Lipschitz-functions on
S
102
N(c,S,K)
We write
K(s,t) > c
cardinality
for the maximal
for any st in
ix(t)-x(s)
+ sup t#s
' X -*
L
K( s ,t )
F = {65 ,sES}
fills
1.
(X,
(M) .
L. ) is a
Suslin
F =
and
Q
Moreover, for any distribution
N F(2) (.,F) < N(.,S,K) .
S
such that
:
H L
.
space (but is not Polish in general), so
2 Q((6 s - 6 t ) )< so
of
R.
We may apply our results through the following choices
Then
R
of a subset
2 K
(2)
in p 2 (X) we have
:
2 (5,t) Q(F )
11.11 ,0 =
Besides
F ful-
.
Therefore, considering a sequence (X)>i of independent and identically distributed
C(S)-valued
X l (s)-X l (t)1 with
6.3.
E(M 2+6 ) < .
:
random variables such that <M
E(X 2+6 1 (t o )) < .
and
K(s,t)
,
for one
to
Lipschitzian
N(.,S,K)
S,
in
theorem to get speeds of convergence towards the
structure depends on
s,t
for any
S.
in
we can apply some
gaussian
5.5.
or
distribution, whose
(the central limit theorem for such uniformly
processes as above is due to Jain and Marcus in
[31])
APPENDIX
1.
3.1.
PROOF OF LEMMA
First let us recall
Hoeffding's
lemma (see
(291) .
Noeffding's Zemma. Let
S
be a centered and
E(exp(tS)) < exp We may assume that
.
drawing
for each
i
w
[u,v] -valued ( t2,or-u)‘ 2 ■
[1,n] .
,
for any
is chosen as follows
with uniform distribution in
\ )
8
random variable, then
-
t
in
:
R .
:
a partition I
(J 4 ),,,,_ '
such that
iJ i l =m
103
J i - with uniform distribu-
independently in each
w(i)
• then, drawing an index
J
The following evaluations are conditional on
tion - •
J ,,,.,
not depend on We set
giving 3.1. .
S S n N Z - -- - -N n
place transform of
Then setting
but the last bound will
and we write A
for the logarithm of the conditional La-
Z . z
. = I
in R :
c. , we have , for any s
m iEJi
1
J
n A(s)
Log
-
7
(1
eXp (.-n (C.j - Z•))) i
m jEJ i
i =1
then, since the logarithm is a concave function :
SN A(s) < n Log (,1
fl) - n AN ()
exp q ( ,i j=1
QN for the uniform distribution on
where, writing
the logarithm of the Laplace transform under
Cramer-Chernoff transform of
Z
Q N of
fc.1 ,...,41 1 ,
AN stands for
x _.- x-E n (x) .
S'El
is larger than that of 7
'N
Therefore the
S'n
(-i ) under e n
-
N
Entgn
'N where S
n
stands for the sum of
n Li.d. random variables with common distribution
QN • Then, Hoeffding [29] and Bernstein [5] inequalities yield 3.1.1°) and 3.1.2°) .
In order to prove 3.1.3°) we may assume that S N = 0 (otherwise changing.i
nto
SN j - Tr) • Then, applying Hoeffding's lemma to the conditionally centered random variables
.. and setting u. = min
1 l i (t) - LogE
J
jEJ i
. J
and
v i - max . , we get : jEJ.I J
(exp (t( x(i) -y)) <
(v.-u.) 2 2 18 1 t , for any t in R.
Hence
n 2 s, s A(s) - E l() < --i n — 2 i-1 8n
n
2 s (v.-u.)2 < --1 1 — 2 1.1 4n
and therefore :
2 A(s) < 4s4.1 m 4 yielding 3.1.3°) via Markov's inequality.
n E i=1 jEJ.1
2
104
2. THE DISTRIBUTION OF THE SUPREMUM OF A d-DIMENSIONAL PARAMETER BROWNIAN BRIDGE. We use Goodman's work in
[28] to give a lower bound of the probability for the
brownian bridge to cross a barrier.
supremum of a
Notations. We set of
11'
I
= [0,1] and write for any integer s
Moreover, for any
.
in
I
d
d,
, we set
l d for the element
(1,...,1)
p(s) = s l ...s d .
A.1. Theorem. Let d
process, then, on the one hand
(1)
Wd be some standard d-dimensional parameter Wiener
be an integer and
WAS) < t II4d (1 d ) = at) < h d (a,t)
Pr (sup sEI
:
d
for almost any real number
a (in
Lebesgue sense) and any positive
h d (a,t) = 1 + exp(2t 2 (a-1)) and on the other hand
Proof
d-1 141 (2t2(a-1))i Z (-1)' i!1 1-c.,11 (a) i=0
:
Wd (s)-p(s) W d (1 d ) > t) < d-1 (2t 2.1 d 1 =0 1' sEI
(ii)
Pr (sup
of
t, where
. exp(-2t 2 )
.
theorem A.1.
If d-2 the whole proof is contained in
[28] .
[28] yields the following inequality : 0 Ind(s) < t 1Wd (1 d )=at) < j (1-exp(2t 2 r)) dF t,d _ 1 (a,dr) A.2.
Otherwise, proceeding exactly as in
H d (a,t) = Pr(sup sEI d where
a-1
d-1 IW F t,d-1 (a ' r) = Pr(W at ) d-1 (s)-rtp(s) < t, vsEI d-1'(1 d-1 )'---" We want to proceed by induction.
It is enough to notice that
:
A.3. Lemma. Pr(W k (s)+ap(s) < t, VsEIk 1Wk( 1 k)+= r3) for any integer
k , any positive a
Pr(W k (s) < t, vsEI k W k (1 k ) =
and almost every 8
in
R
(in
Lebesgue sense).
105
Proof of
A.3.
Since
W
is a regular
k
gaussian
process, it is enough to show that the expec-
tation and covariance functions of the processes
W k (1 k )+a
same conditionally to respectively Since
W k is gaussian,
E(W k (s)
W k (.)+p(.)
W k (.)
are the
W k (1 k ) .
and
W k (1 k )=y)
and
and
E(W k (s)W k (s l ) W k (1 k )=y)
are
respectively linear and quadratic functions of y, then the knowledge of
E(q(1 k ) W(s) q(sI))
with
l+m+n <4, yields :
E(W k (s)W k (1 k )-y) -, p(s) y , E(W k (s)W k (sTW k (1 k )=y) = p(s)p(s I )y where
sns' = (s l As il )...(s k Aq) ,
+ (s A s') - p(s)p(s')
A.3. . A.1.(i) .
Let us return to the proof of
Using lemma
2
A.3., we get : F t,d-1 (a ' r) = H d-1 (a-r t) '
SO
:
F t,d-1 (a ' r) < h d-1 (a-r ' t) Then, integrating by parts, inequality
A.2. becomes :
0 H d (a,t) < F t,d _ 1 (a,0) - exp(2t 2 r)Ft,d_1(a,r) l 0 a-1 + 2t 2 f
exp(2t 2 r)F t,d _ 1 (a,r) dr
a-1 hence H (a d '
t) < 2t 2 .r°
exp(2t 2 r) h d i (a-r,t) dr :
But an easy calculation yields
0 2t 2 r1 exp(2t 2 r) hd-1(a-r't) dr = h d (a ' t) Ja-1 Therefore inequality
A.I. (i) is proved by
A.1.(i) holds when
induction (it is shown in
[33]
p.
284
that
d=1) .
In order to proof (ii), we notice, following pr (
converges weakly in
.
C(I d ) towards
wd E.I o < wd ( 1 d )
[7], p.84, that : <
E)
the distribution of the
brownian
bridge
:
106 W d - p(.)W d (1 d )
whenever
c
converges to
0.
(i) gives :
So, inequality
Wd (s) - p(s)Wd (1 d ) > t) > 1 - h d (0,t)
Pr (sup
sEI
d
(ii) is proved.
therefore
Comment. Theorem in
A.1. was proved by ourself (see [40] and [41 1 ) but also by E. Cabana
[11] * . In another connection, inequality
t 2h(d)
with
h(d) 7 d-1
A.1. (ii) ensures that some polynomial factor
cannot be removed in bounds
-
3.3.1°)a) and 4.1.2. .
3, EXPLICITING AN EXPONENTIAL BOUND. The calculations yielding
3.3.1°)a) are slightly modified here, where the entro-
py condition a) is replaced with a more explicit one.
A.4.
Theorem.
If we assume that
F
is
[0,1]-valued and that
E 2) m(2)( c,F), < ,1+1/Log( N (1+Loq(c -2 )) d C -2d ) "1 then, an upper bound for Pr ( nl F 7 t) is, for any \) a
l ■
4H(t) exp(13) exp(-2t 2 )
+
for any
t
in
E
in
]0,1[
[1,+.[ , given by :
4H 2 (t) exp(-(t 2 -5)(Lt) 2 )
where
H(t) = K6/5 exp(16d) (1+Lt2) 5d t6d Proof of
A.4.
In the proof of
Lt 22 + 1 , then LLt 2 oN A < 2H(t) exp(13) exp(-2t )
3.3.1°)a) we choose a
=
B < 2H 2 (t) exp(-(t 2 -5)(Lt) 2 ) whenever *
P 01 -a.s.
t 2 _> 6+4d , yielding A.4. via lemma 3.2. .
Thanks to
M. Wchebor and J. Leon for communicating this reference to us.
107
Comment.
Assumption a') is typically fulfilled whenever case d of
F
is a V.L-class.
In that
may be the real density of F (if it is "achieved") or the integer density
F (see the proof of 3.6.).
REFERENCES
[1]
ALEXANDER, K.
[2]
ALEXANDER, K. PitobabiZity inequaatiez o
(1982). Ph. D. Dis6ettation, Mas. Int. Tech.
(1982).
empcaZ ptoce66ez and a Zaw itoLated togatithm. AnneLE6 0S P/L0babitity (7984),v0Z.12,N°4,1041-1067.
[3]
ASSOUAD. P. Denzi-0. et dimenzion,Ann. /n5t. Fou&L, G/Lenobte 33, 3 233-282.
[4]
BAKHVALOV, N.S. On apptoximate catcutation 06 muttipte integtat6 (in Ru/mian). Vatnik Mo6k , Set. Mat. Meth. A&tton. Fiz. Khim. (1959) 4, 3.78.
[5]
BENNETT, G. Ptobabiaty inequaUtie/s Sot 6(110 06 independent ,tandom vat-LabZez. J. AMA. Stati}st. A6ZOC. (1962), 57, 33-45.
[6]
BERKES, 1. PHILIPP, W. Apptoximation theoterro Sot independent and weakty dependent tandom vecto. Ann. PfLobabiUty (1979), 7, 29-54.
[7]
BILLINGSLEY, P. Convetgence o pir_obabitity mea&ftez. Witey, New Yotk.
[8]
BORISOV, I.S. Ab6tAadtis oS the Colloquium on non poutametAid 6tAti.zticat.
(1983)
,
in6elLence, Budapat (1980), 77-87. [9]
BREIMAN, L, On the tait behavLot 06 601)16 06 independent tandom vaniabtez. Z. WaAAchein. Vetw. Gab. (1967), 9, 20-25.
[10]
BREIMAN, L. Ptobabitity. Reading Mas. Addizon-WezZey (1968).
[11]
CABANA, E. On the ttanzition deity o a muttidimenziona. pakameten_ WieneA. p/Locu6 with one baiftiek. J. Appt. Pkob. (1984), 21, 197-200.
[12]
CABANA, E., WSCHEBOR, M. The two- panametet Bkownian btidge. Pub. Univ. Simon Botivat.
[13]
COHN, D.L. (1980). Mecoute theoty, Binkhalset, Bos -ton (7980).
[14]
CSORGO, M. , REVSZ P. A new method to pfLove S-t-A,m6en type Zaw4 O4 invatiance pAincipZe II. Z. WarL6chein. VeAw. Gab. (1975), 31, 261-269.
[15]
DEHLING, H. Limit thecm_em6 zurms og weakty dependent Banach pace vatued tandom vatiabta. Z. WaA6chein. VeAw. Gab. (1983), 391-432.
[16]
,
0E PROVE, L. Bounds Sot the uniSotm deviations c),
J. o( MuttivaA. Anat. [17]
(1982),
empikicat mecoukez.
12, 7z-79.
INCHI PU, A un,i6otm bound So 't the tait ptvbabitity QS Kamogotov-Smiknov 6tatistic6. The Anna e6 oSA Stati5tin (1985), Vot.73,N ° 2, 821-826.
108
[18]
DUDLEY, R.M. The 6izez 04 compact 6ubet6 o4 HitbeAt pace and con.anuity o4 Gatiimian pfLocez6eis. J. Functionat Anaty6-iis (1967), 1, 290-330.
[19]
DUDLEY, R.M. Metkie ent/Lopy o4 orne cZa6u g zet with die/Lentiat boundaAia. J. AppAoximation Theoky (1974), 10, 227-236.
[20]
DUDLEY, R.M. CentAat timit theotem6 4oA. empiAicat mea6une6. Ann. PfLobabitity (1978), 6, p. 899-929 ; co/L/Lection 7 (1979), 909-911.
[21]
DUDLEY, R.M. ccivit Ftoun 1982
[22]
DUDLEY, R.M., DURST . EmpiAicat PtocezLez, kpnik-elEvonenkiis cta26u and Poison pucez6eis. Puba. and Math. Stat. (Wtoctaw) 1, 109-115
Lectuke Not
n Mathematin n° 1097 .
(1981).
[23]
DUDLEY, R.M., PHILIPP, W. InvaAiance pkinciptez 4ot LLOTO 06 BanaCh 6 pace Z vatued t-Landom element/5 and empiAicat_ p/Eoce66e6. Z. WaAAchein. VeAw. Gab. (1983), 82, 509-552.
[24]
DVORETZKY, A., KIEFER, J.C., WOLFOWITZ, J. Aymptotic minimax chatactelL 04 the 6amp1e diztAibution Çuncton and o4 the ctaz6icat muttinomiat ezti,matvA. Ann. Math. Sat. (1956), 33, 642-669.
[25]
FERNIQUE, X. Rëguta/Eité, de p/Loce46u4 gati/mienz. /nvent. Matk. ( 1971 ), 12, 304-320.
[26]
GAENSSLER, P., STUTE, W. Empicat p/Locu6ez : a 6EoLvey g /Le6uit's OIL independent and identicatty dts.ftibuted Aandom vatiabtez. Ann. Pkoba. 7, 193-243.
[27] GINE, M.E., ZINN, J. On the cent/Eat timit theo/Lem 40it empi/Eicat p/Eoce46ez. Annatz g Pubabitity (1984), vot. 12, n ° 4, 929-989. [28]
GOODMAN, V. DitAibution utimation4 4oit 4unctionat6 o4 the two pafrumete/L Wiene/L p1OCe.56. Anna tis o4 Pubabitity (1976), vot. 4, n ° 6, 977-982.
[29]
HOEFFDING, W. Pubabitity inegtial_Lae.z on um g bounded )tandom vaAiabtez. J. Amet. Stati.4t. A)S60C. (1963), 5E, p. 13-30.
[30]
IBRAGIMOV, I.A., KHASMINSKII, R.Z. On the non- panametAic denAity Zap. Naucha. Semin. LOMI 108, p. 73-87 (1981). In Ru/mian.
[31]
JAIN, N., MARCUS, M.B. CentAat timit theotem 4oA. C(S)-vatued nandom vaAiabtez. J. Functionat Anatyi6 ( 1975), 19, p, 216-231.
[32]
KAHANE, J.P. Some Aandom 6elEiez g 6unctionz. Lexington, Ma. D.C. Heuth (1968).
[33]
KARLIN, S. TAYLOR, H.M. A Pl./Ezt couue in Stochaztic Pnocaza (1971). Academic Pke66, New-Vo/Lk.
[34]
KIEFER, J.C. On ta/Lge deviation4 g the empinicat d.4. o4 vectot chance vaAiabtu and a taw o4 itefLated toganithm. Paci4ic J. Math. (1961), 11, 649-660.
[35]
K/EFER, J.C., WOLFOWITZ, J. On the deviationz g the empinic dizt/Eibution 4unction o4 vectok chance vatiabta. T/Ean6. Anie/L. Math. Soc. ( 7958 ), 87 p. 173-186.
imata.
109
[36]
KOLMOGOROV, A.N., TIKHOMIROV, V.M. E-entiLopy and E-capacity g zetz in Aunctionai z pace. AmeA. Math. Soc. TAanzt. (SeA. 2) (1961), 17, 277-364.
[37] KOMLOS, J., MAJOR, P., TUSNADY, G. An appuximation o6 paAtiat 6um's oA independent RV' z and the 6ampte OF. T. Z. WaAzchein. VeAw. Geb. (1975), 32, p. 111-131. [38]
LE CAM, L. (1983). A iLemaith on empinicat meazuuz. In A Feztzchei(lt goit Pilch L. Lehmann in Horion. o W Sixty-FiAth Binthday (1983), 305-327 WadzwoAth, Belmont, CaZiAoAnia.
[39]
MAJOR, P. The appAoximation o6 paktiat zumz oA independent RV'. Z. Wwochein.
VeAw. Geb. (1976),
35, 213-220.
[40]
MASSART, P. Vitezze de conveAgence dam te théotéme de ta timite cent/Late poun te pir_ocezzuz empiAique. The. de 3e cycte n ° 3545 de t'UniveAzité. de Pana-Sud (1983).
[41]
MASSART, P. Vitezzez de conveAgence daris e théoiLLme cent/a timite pOuh da pAoca6u4 empiAiquez. Note aux C.R.A.S., t. 296 (20 juin 1983) SéAie 1, 937-940,
[42]
MASSART, P. Ratez oA conveAgence in the centAat timit theOhem 4ok empiticat pucezzez (Apnit 1985),zubmitted to Ann. Int. Heniti PoincarLé.
[43] PHIL/PP, W. Atmozt 6UAR invaniance pAinciptez 40A 6uft16 or 8-vatued !iandom vatiabtez. LectuAe Notez in Mathematin 709, 171-193. pAocezzez. J. AmtAatian
[44]
POLLARD, D. A centAat timit theokem AoA empiAicat Math. Soc. Sen.. A (1982), 33, 235-248.
[45]
POLLARD, D. Ratez o4 6thong uni4Ohm convehgenCe (1 9E2).
PkepAint.
[46] SERFL1NG, R.J. Pubabitity inequatitiez goir_ the z um in zampting without keptacement. Ann. Stat. (1974), vot. 2, n ° 1, 39-48. [47]
SION, M. On uni6mmization g 6Q..t.4 in topotogica Soc. (1960), 96, 237-245.
[48]
SKOROHOD, A.V. Theony Pitob. Appt.
[49]
STRASSEN,
V. The Ann. Math.
z pace. TAanz. Anie. Math.
(1976), 27, 628-632.
exatence g wiobabitity meazunez with given Stat. (1965), 36, 423-439.
maAginatz.
[50] TUSNADY, G. A Aemaitk on the appiLoximation og the zampte OF in the muttidimen6ionat cause. Peniodica Math. Hung. (1977), 8, 53-55. [51]
VAPNIK, U.N. hRVONENKIS, A.Y. On the uni4oAm conveAgence oA ulative 6tequenciez og event to theik ptobabititiez. Them. Pub. Appt. (7971), 16, 264-28 .
[52]
YURINSKII, V.V. A zmootking ne. diztance. Theny Pub. Appt.
[53]
YURINSKII, V.V. On the ehhoh
Them Pub. Appt.
[54] YUKICH,
'y gok. eztimatez (1975), 20, 7-10.
oA the
Civy-Ptoholtov
oA the gauzzian appuximation don_ convaution4. (1977), 22, 236-247.
J.E. UnigoAm exponentiat boundz AoiL the notmatized empirLicat pfLocezz 0985) . PiLepnint.
MEAN SQUARE CONVERGENCE OF WEAK MARTINGALES
Mariola B. Schwarz air Mathematische Statistik
Institut
Universitgt Göttingen Lotzestr. 13, D-3400 Geittingen
[4]
In
it was shown that the mean square convergence of vector-valued martin-
gales in spaces of Rademacher type or cotype 2 is closely related to the following property of a Banach space valued martingale Radon measure space
on
'Y f
(B,B) (8
the Borel
f = (f ) n
there exists a Gaussian
a-algebra of subsets of the Banach
B) such that
(*)
(dx)
x*f1'22 = r lx* (x)1 2
for every
x E B . (*)
We give a characterization of the class of martingales satisfying
1
not containing
uniformly and having an unconditional basis
convergence of the series
where E 1 1 S(ef) 1 ' e n 2 n ' n=1
(e)
n
for spaces
(e ) in means of n S
is the dual basis and
is the standard square function. Furthermore we give necessary (resp. sufficient) conditions for the of martingales in spaces of type
L -convergence 2
2 (resp. cotype 2).
These conditions characterize Banach spaces of Rademacher type or cotype 2 . Throughout, Borel
B
denotes a separable Banach space,
a-algebra of subsets of
A family
[f
n
respect to the filtration an
and
(0,3,P)
the dual space,
8
the
a probability space.
B-valued random variables forms a weak martingale with
1 n E NI of
Pettis integrable on
B
B
n and
n E NI (Pettis-)
A
B-valued weak martingale
M
, such that the inequality IX f !
I'Lf 1 n
E N1 M
if for every
E(f
n E N , f
1 a ) = f k
n
for
is
measurable,
n n ksn (see e.g.
[3]).
is uniformly bounded if there is a constant
*
holds a.e. for every
* * x E B and every
n E N . Note that if
f ,--- [f
1 n E N1
is a
B-valued (weak) martingale then n * * * * x f= (x f 1 n E 1\11 is a real martingale for each x E B . n
n E 1\11 is a n I with difference sequence Pi 1 n E 1\11 . In the following
f = [f
* x f=
B-valued uniformly bounded weak martingale
n
IOWA 1. Let
(e ) be an unconditional basis in B n nE N basis. Assume that B does not contain l n uniformly. Then the following conditions are equivalent
and
(e n )
nE N
the dual
111
a) the series EllS(e
n
2
e
is convergent in
n
b) there exists a Gaussian measure
r
yf
on
B ;
(B,B)
such that for each
x
E B
ix(x)12 B * * * * Proof. x f = (xf ) is for eachxEB areal martingale with increments n * Thus the square function S of x f is given by * 1/2 S(x 0 ...= (E 1 x* (dk )1 2 ) . k * * Let the functional T:B X13-, 12 be given by * * * * * * * T(x ,y ) = E E x (dk ) y (dk ) , x ,y E B . k T
*
.
is well defined everywhere in the Cartesian product
B x B since f is of * * T is positive (i.e., T(x ,x ) > 0 for every B*) * * * * x E and symmetric (i.e., T(x * ,y* ) T(y ,x ) for all x ,y E B* ) Since f is uniformly bounded we have * * * * T(x ,y ) (E E x () 2 ) 1/2 (E y (dk) 2 ) 1/2
weak second order. Furthermore,
= IS(x1S(y f) 2 2
= b[*6 2 1 H*0 2 M1 x which means that T
,
l y*
is bounded.
** B . Using a theorem of Banach (see * e.g. [1 ] ), which states that an element u of B belongs to the image of B in * ** the natural embedding of B into B , if the functional u(v), v E B , is * * is contained in B . continuous in the B-topology on B , we show that TB T
We can consider
Assume that
E
* B
as a map of
into
a) is satisfied. Since
* 1/2 S(e:f) 11 2 en =E (E(E(e* (111. )) 2 ) 1/2 e = E (Te *(e)) en n k n n n
the series
*, *, 1/2 n le n ))
E
By theorem 2.1. in
B , i.e., * * Tx (x ) =
lx* (x)1
T 2
B .
is a covariance operator of a Gaussian measure
Y(dx) .
B
Since
11x* f1 2
[2]
is convergent in
11S(x* f)11 = Tx* (x* )
the implication a) Conversely, if
b)
follows.
b) is satisfied, the series
)) 1/2 e E 11S(e* 011 e = E (Te*n(e* n n n n n n 2 is convergent in
B
as
T
is a covariance operator of
'y
'y
on
112
We recall that a Banach space does not contain
n
uniformly if it is of certain
• Banach spaces of Rademacher type 2 are of some Rademacher
Rademacher cotype r <
cotype r <03
1
(see, e.g., F5]).
Using Theorem 5.3. and 5.4. of THEOREM 1. Let
(e ) nE N n
[4] , Lemma 1 and the above remark we get B
be an unconditional basis in
and
(e ) n
N
the dual
basis. Then
i) B series of
f
is of cotype 2 if and only if for every
E HS(e
e 2 n
n
B
in
the convergence of the
is a sufficient condition for the L
2
convergence
f . ii) B
E 1 1 S(e n n
the convergence of the series
is of type 2 if and only if for every f
B
e n in —
4
is a necessary condition for the
L
2
convergence of
f •
COROLLARY 1. In Hilbert space (and only in this space) the convergence of the series
E11S(e f)I e n 2n
is a necessary and sufficient condition for a martingale
N
be the family of all
for each
x
*
lx* (x)1 2
B
E
THEOREM 2. Let
such that
y f (dx)
(e ) n
F2] N
p > 0 • Assume that
constants
(B,S)
for which there
* B .
Using Theorem 2.2. in
and
f = (f)
B-valued weak martingales
y f on
exists a Gaussian measure
1x* f1 1, 2 = 2
to
L -norm.
converge in the Let
f
2
c l (p)
and
and the techniques of the proof of Lemma
be an unconditional basis in
B
c 2 (p)
does not contain
c2 ( P )
(e)
flE N
the dual basis
l n uniformly. Then there exist
such that for each
c i (p) 11E S(e*f) 2 ePsr n n n • B
B ,
1 we get
f
E
(dx) e Z11 S(e*n f)1 2n
•
REFERENCES
[1]
S. BANACH :
Théorie des opérations linéaires, Warszawa 1932.
f- 2 1 S.A. CHOBANYAN AND V.I. TARIELADZE : Gaussian characterizations of certain Banach spaces. J. Mult. Anal. 7, 1977. [3]
K. MUSIAL :
Martingales of Pettis integrable functions. Lecture Notes in Math.
794, 1979. F4]
NGUYEN DUY TIEN : On Kolmogorov's three series theorem and mean square convergence of martingales in Banach spaces. Theor. Prob. Appl. 24(2), 1979.
[5]
W.A. WOYCZYNSKI : Geometry and martingales in Banach spaces. Advances in Prob., Dekker, 1978.
METRIC ENTROPY AND THE CENTRAL LIMIT THEOREM IN
BANACH SPACES
Institut
§1.
J. E. Yukich Recherche Mathématique Avancée Université Louis Pasteur 7 rue René Descartes 67084 Strasbourg, France de
INTRODUCTION
The intent of this article is to study the relationship between (i) the central limit theorem in Banach spaces, (ii) the Donsker property for unbounded classes of functions, especially subsets of the Banach dual, and (iii) metric entropy with LP bracketing, p > 1 . Before exposing the main results, let us first set in Place the framework for empirical processes, the setting of this paper. Throughout, take (A,A,P) to be a probability space and x i , i> 1 , the coordi-
nates for the countable product (A ,A ,P ) of copies of (A,A,P). nth empirical measure for P is defined as
The
n P(B) = n
7 1
, BA
j=1 fx4131 Given a class F<7.1 L 2 (A,A,P) of real-valued functions with envelope F F (x) = sup If(x)1 , let S:=t (F) be the space of all bounded realf.GF
valued functions on
F;
equip
S
with the sup norm, i.e., for s
S
let := sup fcF
We note that When F
Is(f)1 •
(S, H H) is a Banach space, non-separable for F infinite. F is finite P a.e. the function-indexed emnirical process
v(f) (to)
n 1/2
f(dP n -dP) (w) , w
e Acc ,
f e F ,
that F is a is a random vector v n with values in S . Recall [7,8] P-Donsker class if v n converges in law in S to a Gaussian process G P ' indexed by F , with a.s. bounded p r -uniformly continuous sample paths (abbreviated BUG). Here, 2
*
p P fg) :=
(f-g) 2 dP - (f(f-g)dP)
2
.
Current eldress:Lehigh University,Math.,Bethlehem,Penn.18015-USA
114
G
necessarily has mean 0 and the same covariance as
cov(G p (f), G p (g)) = f fg dP
f f dP
v
n
g dP .
Clearly, a necessary condition for F to be P-Donsker is that F be G p BUC , i.e., that the limiting Gaussian process Gp can be chosen to be BUC. Define the mapping
h:
S
by setting
h(x)(f) = f(x) - f f dP lefillingtherandollivariablesXj=h(x.), Dudley foreach has shown [6] that if F is Gp BUC then the P-Donsker property for F means that X 1 satisfies the central limit theorem in (s, H 11) abbreviated X i ( CLT . (Recall that if X is a random variable with values in a Banach space B and if (Xn ) 11cN is a sequence of independent copies of X , then X satisfies the CLT if L( 1=1
X./VT) 1
B . See [12,16] for an exposiIn this way we see that central tion of the CLT in Banach spaces.) limit theorems for the empirical process \) 1 (f) , f eF , can be viewed as central limit theorems with respect to the norm H h on the Banach space S . converges weakly to a Radon measure on
Throughout this paper let (B, H 11) denote a not necessarily separable Banach space , B * the dual space, and )4, the unit ball of B * . Now a subset H of B *1 is called a forming subset (see [7]) for B if and only if H x11 = sup Ih(x)1 for all xc. B . Clearly, hrH by the Hahn-Banach theorem * is always a norming subset, a fact ' B1 which we shall exploit in the second section. In this way, limit theorems in B can be viewed as limit theorems for empirical measures on B , uniformly over a class of functions, * such as B i , since for .CgB and x(1),...,x(n) E B +6
x (n)
)(f) = f(x(1)+...+x(n) ) .
In particular, We note that X (with L(X) = P) if and only if B1 is P-Donsker.
satisfies the CLT in
B
115
The relationship between limit theorems for empirical processes and limit theorems for Banach space valued random variables has been studied by several authors. For example, Dudley has shown [7] that the Jain-Marcus CLT for C(S)-valued random variables [13] is in fact a conDudley has also sequence of Pollard's CLT [17] for empirical processes. shown [7] that the Fortet-Mourier strong law of large numbers [15] in separable Banach spaces is a consequence of the DeHardt-Dudley law of large numbers for a class of functions [3,7]; we will return to this implication later on and will also provide a short and simpler approach.
(f) often involve a metric entropy condition on the index set F . In this article we will consider metric entropy with LP bracketing, defined as follows [7]: Limit theorems for
n
Definition 1. Given f,g : A-0- 1R, define the bracket [f,g] := Let p 1 , such that f(x)
FCL P (A,A,P) .
Let NM
:= Nr1 (c,F,P)
>0
mirdm:3
and such that Vf cF 3 l<j<m such that fr [f i ,f;] [fm ,f;1 ] (f is referred to as metric entropy - f.) 1) dP<E P 1 . Now log N ( P ) (c) [ J J with LP bracketing.
f
We recall that metric entropy with
1, 1 bracketing has been espe-
cially useful in the study of empirical processes; see e.g. the works of DeHardt [3], Dudley [4,7], Dudley and Philipp [8], Alexander [1], Borisov [2], Gin é and Zinn [11], Pyke [18], Koreinskii [14], and Yukich [20,22, In particular 23]. See also the recent monograph by Gaenssler [10]. 8 N (1) has been used to describe the weak convergence of n (f) [4,7, ]. [ 1 bracket One of the main ideas running through this article is that L ing is not a natural condition for describing weak convergence; L 2 On the other hand L 1 bracketing bracketing is a more natural choice. is a more natural condition for describing laws of large numbers. These points will be discussed in the following sections. Finally, we will also use metric entropy without bracketing which has enjoyed wider use, and which is defined as follows:
Definition 2. Let ff F 3 1<j
f(f-f.) P dP <E P } . ferred to as the metric entropy of F . < m
with
such that for all ,f such
N ( P ) (c,F,PI := Now
logN4 P ) (E,F,P)
is re-
116
Having set in place the framework for this paper, We now proceed Section to the main results, a summary of which was announced in [21]. two discusses the relationship between (i) and (iii) of the first paragraph, section three studies the relationship between (ii) and (iii), and section four considers metric entropy with L 2 bracketing.
§2.
METRIC ENTROPY AND THE CLT FOR IIII UNIT BALL OF THE BANACH DUAL
Our starting point is the following result of Dudley [7], which adds to Mourier's well known law of large numbers [15]. Theorem 1.
(cf. section 6.1 of [7]I
X., X 1 ,... he a sequence of i.i.d. random variables with values in a separable Banach space (B,11 t1). The following are equivalent: (i) X satisfies the strong law of large numbers (SLLN),
(ii) ElH < (iii) M(1) [
]
Let
and
(E B * P)
VE > O .
'
It is reasonable to ask whether analogous results hold for those X satisfying the CLT . The following theorems answer this query in the affirmative, showing that the study of the CLT in separable Hilbert spaces
11
N ([ 2 ] (E,F1,P) , where
is conveniently studied through the use of
*
15 the unit ball or the Hilbert dual.
In fact, in the same way that the equivalence of
X CLT
and EX}
0 , I1X1! 2 < co
characterizes, modulo an isomorphism, separable Hilbert spaces, the following results show that the same is true for the equivalence of the conditions B
1
is P-Donsker and inf N (2) (E B * P) ' s >0 [ 1
c°'
P
centered.
117
This is contained in the following results.
X be a random variable with vaiues in (B, 11 ) which need not be separable; L(X) = P Then for all p >1 the following are equivalent: (i) EIIXIIP < , and Theorem 2. Let
,
(ii)
If
inf N ( P ) (a , B 1* ,P) < oc. c>0 P is tight then the following are equivalent:
N ([ P )
(E,q,P) <
Vs > 0 .
Remarks Using the equivalence of (iii) and (iv) witn p = 1 we directly obtain the double implication of Theorem 1 without using (1)
the SLLN property of (2)
X .
As is well known
[12,16] the moment condition El general neither necessary nor sufficient for X e CLT ;
X11
2
is in
it may also be
inf N ED) (a,F,P) 0 necessary nor sufficient for F to be P-Donsker. The interest of Theorem 2 stems from the fact that when F is B 1* these are in fact equivalent conditions and they consequently share the same properties. easily seen that the entropy condition
The next result is essentially a consequence of Theorem 2.
Theorem 3. Let
X
Banach space (B,I[ of P: (i)
be a centered random variable with values in a Consider the following properties P . 1) ; L(X)
X eCLT ,
(ii) B 1* is a P-Donsker class of functions,
(iii)
l XI I
Co
(iv)
N (2) (E * P) [ ] 'B
(v)
inf N 2 L I s>0
Va > 0 ,
c B,P) < Co .
We have: (a)
P is tight and all equivalent. If
B
is a Hilbert space then (i)-(v) are
118
(h)
If B is separable, then the equivalence of (i), (ii), (iii), and (v) is equivalent to the fact that B is isomorphic to a Hilbert space.
For the proof, we need only observe that when B is a separable Hilbert space, the equivalence of (i), (ii) and (iii) stems from the introductory remarks. For separable type 2 Banach spaces B we have sharp metric entropy conditions insuring the P-Donsker property for B 1* .
Theorem 4. Let (B,11 II) a centered law. If
be a separable type 2 Banach space and
P
(2) inf N [ ] (E ' B * P) < c>0 * Conversely, if P is a law on any then B 1 is a P-Donsker class. separable Banach space B and if Iq is P - Donsker, then for all p<2 , N (p) [
(E B
P) < cc' VE >0 .
For the The proof of Theorem 4 follows at once from Theorem 2. first part we need only use implication (ii)-4(i) with p = 2 ; for the second we use implication (iii) (iv) with p < 2 together with the fact that if X cCLT then EIXI <œ Vp< 2 , see e.g. [16],
Proof of Theorem 2. We will first show (ii) (j) Let c> 0 and m := m(c) =N ([ P ) (c, B *1 , P) . By definition there is a collection of brackets [fj ' with
' j = 1,
f(x) < f(x) < fi(x)
,m , such that given any
f EB *1
1 < k < m
VX E B
and
f
(f I-' (x)-fk (x)) P dP(x))< E P .
* * and thus for fixed xc B we B 1 iNow s a naming subset of B f(x) runs over all of the values between - NI and rxd as See that f runs over B *1 . This implies that for any fixed x , the sum of the
119
y
differences of the brackets
(f -.'.(x)-f.(x))
must be at least as large
j=1 Indeed, suppose this were not the case.
as 2 1k11 that
ji =
Then
3
xo
e
B
such
f - (x 0 ) - f (x ) < 2 lixol 0
Thus there exists an
-11x011 < cï < rx01
and a
6> 0
such that the open
interval (a-6, a -1- 6) is disjoint from the union of closed intervals
M
_
,U 1 [f.(x )X(x )] . J 0 J 0 j= an f B7 such that
Moreover, by the Hahn-Banach theorem, there exists
f(x 0 ) = a
there exists 1< j<m
This leads to a contradiction since
such that
f.(x) < f(x) < f(x)
vxc B .
Thus, we have shown that
HX11 <
e(X) - f(X)
a.s.
j=1 For all
p > 1
there exists a constant
HxdP <_
j=1
Ellx11 13
<
such
((e-f.)(X))P a.s. J
Integrating this with respect to P
N ([ P )] (c ' B * P)
C := C(p,m)
and using the definition of
shows that
c.m. EP <
completing (ii)=0(i) . it suffices to consider the single bracket def i (x) = - 114 and f(x) = ilxil and to take E = 2 fIlx11 P dP(x).
To show (i)--4(ii) fined by
P is a tight measure then we may show the stronger implication (iii)-(iv); our proof is inspired by the proof of Proposition 6.1.7 of [7]. Let p> 1 be fixed and note that for all E> 0 there If
is a compact set KB such that
f
11x1I P dP < c 13 / 4
The elements of
BI , restricted to
K , form a uniformly bounded
120 equicontinueus family and hence this family is totally bounded for the sup norm on K by the Arzel- Ascoli theorem. Take f , , fm e B m < , such that Vf B i 1 ' I!f - f 11 for some
K
j .
g5. +Ill = f.3 + c/ 4
4 Let on
Then for every
ilgi ,m
gi = fi
on
K , g i+m (x) = Ilxil
f B i* ,
g5
E/4
K , g i (x) = - 11x11 for
if Hf -5 11 K < c/ 4,
< (2c/4)P + 2 j BK
I1x P
for
x K , for all then
x K , j = 1,
gi < < gi
,m .
and
d.p
< (c/2)P + EP/2 < c P . Thus, NIP/
(,14,P) < 2m <
proof of Theorem 2.
§3.
, proving (iii)=-4 (iv) and completing the Q.E.D.
THE P-DONSKER PROPERTY AND METRIC ENTROPY The results of the above section show that
N (2) [
i s often an
appropriate tool for describing the P-Densker property for above results also suggest that
N (p) [
(c F P)
Bi ;
the
13+2 , will not in _
general be a satisfactory tool to describe the P-Donsker eroeerty for LP(A,A,P) . unbounded classes F This is indeed the case: the following propositions show even in the presence of an envelope condition on F , that N EP) (c,F,P) , [ + 2 , can not possibly give sharp results. This may be seen by considering suitably chosen subsets of the dual to (r i L), the Banach space of bounded functions on IN equipped with the sup norm. More
ll
precisely, these subsets are classes of functions of the form
121
F
fyj
:=
a,$
a.s.1 :s.= 0 j 3 Ai 3
or
1) ,
where A j , j >1 , is a sequence of disjoint subsets of A with p. := P(A) =j -5 for some $> 1 and a j ..-j c4 for some 0< a < $ - 1 . Since elements of Fag define measures on 11\1' , we may view Fa, $ as a subset of the dual to (C,Ir
.
We will consider the cases p =1 , p >2 and l< p< 2 in this order; much of what follows may be found in [19]. Our first proposition shows that a theorem of Dudley, which is recalled below, is far from being the "best possible".
Theorem (cf. Theorem 3.1 of [S]). Suppose that F has envelope F F c LP(A,A,P) for some p> 2 . Suppose that there exists y , 0<',( < 1- 2/p and M< such that N(1) (e" F P) < exp(ME — [ for E
small enough. Then F is a P-Donsker class.
Proposition 1.
For all 2 < p< 4 there exist P-Donsker classes F with
FF e LP(A,A,P)
(1) and N [ ] (E,F,P) > 2 E -Y
where y is any number less
than 1/2. Proof. Fix 2
a> 0 such that a+ T 2 < Let $ = 2a + 2 + T and let F := F a0g as
and choose
and a(p 2) < 1 .
T
and
above. Since $ >pa + 1 it is easily verified that F F e LP . Also, by Theorem 2.4 of [5], F is a P-Donsker class since $ > 2a + 2 implies a./177 < 3
.
It remains to find a lower bounflçor N [ ] (E,F,P) .
3
If the individual terms a.p. are greater than or equal to E for all ,1 j and only Now a.p. > 2 0 . 3 _<jo =j o(e) , then N (1) (e F, P) — [ »a-$ Let y = 1/(8-a)— e and thus we may take 1 . if j c o By taking a and T small enough it is clear that y 1/(a+T+2) . Q.E.D. may assume any value less than 1/2 . (1) The next proposition shows that N [ ] can not provide sharp results.
In particular, if 2
N (1) [ ] (e,F, P)
2
for
if F
is such that F F e LP
and
(p-2)/(p-1) < y < 1/2 , then F may or may
122
not be a P-Donsker class of functions.
Proposition 2. F
2
For any
F F E LP and the P-Donsker property.
N (1)
with envelope
), > (p-2)/(p-1) , there are classes
and [
F
( E '
]
E P) < —2 --'( which do not satisfy
'
Proof. Let 2 0 such that p <3-6 . Let . Now a = (1-6)/(p-2) 2a+ 2 , and consider the class F=F a, F F E LP but by Theorem 2.4 of [5], F is not P-Donsker. Now if j K E -1/ ( 3-a-1) for some suitably large constant K a.p =
then j>jo
j a 6
3 j
N (1) (c F, P) < 2•2 i° ' [
and thus
j>jo
Setting -r =1/ ( -04-1) = (P - 2)/(1) - 1-6) and letting y +(p-2)/(p-1) , giving the desired result.
0 we see that
3
Q.E.D.
We conclude this section by exploring the relationship between
P-Donsker classes and p > 2
1
and
N ( P ) (c F, P), p> 1 . We consider the cases [ in this order. We first show that N (r P )] , p > 2 ,
will not in general furnish sharp results for the P-Donsker property.
Proposition 3. If with FF LP
q >2
F
p < q/2 +1 , then there are classes
and
P)
and N
2E -Y
F
> q , such that
may or
may not be a P-Donsker class.
FF
Proof. We first show that there are P-Donsker classes
F
p< q/2 + 1 , such that
arbitrarily
large.
Let
2a + 2+6
q> 2
3 (e F ) P) > 2 " for y > q
N [(g)
p< q/2+ 1
and
6.
LP,
6 > 0 , a =2/q-2) ,
becomes then N ( g ) (c" F P) a (3 [ arbitrarily small. To see that this is
F := F
and show that if
arbitrarily large for
be fixed, let
with
a c.lp >cg for all J a j(El p i. c g if and only if
actually so, note that if the individual terms
j <j 0 (E) , then jq a-13 >Eg
if and only if
Nr1 (c,F, P) > 2E that
F
N ([ g )] (E,F, P) >2 j° .
jg 6_ /(13 g a)
; moreover
is P-Donsker and
y= q/st co
F F E LP , since
On the other hand, if N
F
Now
as
F
y= q/ ( 3-qa) 6 ,k 0 .
then
Finally, we note
(3 > pa 1 .
F, P) < 2
[
may not be P-Donsker, even if
If
for
is of the form
y> q >2 , then Fag .
Indeed,
123
let 0
J then
( .›. 3—Jo
a j 1 A. )
If there is a j o=j (e) o
F= F
dP _ < cg
N
e F P) < 2.2 j° . Since the A. are disjoint, this inequality [ (" g E is satisfied whenever Y j qa < E q , i.e., whenever j 0 j>j Letting a+ 0 Thus, if y = q/(g-qa-1) , then N F P) < 2E -'Y . — [ Q.E.D. we obtain y + q , completing the proof. ]
N (q) [
Finally, we consider Proposition
4.
F P) (e"
with
1
is fixed, 2
If 1
q(p-2)/(p-q) < < q/2 , then there exist classes F F P) < N (q) (E " [
, such that
F
with
FFE L P ,
may or may not be P-Donsker.
Proof. Let 1 < q < 2 and 2< p < 4-q be fixed. Let a < 1/(p-2) be is not P-Donsker and fixed and let g := 2a +2 . Then F := F F F E LP . The computations of the second hL.lf of the proof of Proposition 3 show that if y q/(g-qa-1) = q/((2-q)a+1) , then Letting at 1/(p-2)
shows that
N ([ 1(e,F, P)
y +q(p-2)/(p-q) , as desired.
In a similar way, the first half of the proof of Proposition 3 Let 6,a,g be as in the proof shows that F may not be P-Donsker. of Proposition 3. As a +0 and 6+ 0 we see that ytct/2 , as
desired.
§4.
METRIC ENTROPY WITH
Q.E.D.
L
2
BRACKETING
Dudley (c.f. Theorem 6.2.1 of [71) has shown that if formly bounded class of functions on (A,A,P) with
F
is a uni-
124 f /log N (1) j o [
(E2 ,F,P) dc <
(1)
,
then F is a P-Donsker class. The above sections suggest that (2) (E,F,P) is a more natural choice for the description of the PN [ 1 Donsker property; we are thus lead to the following Conjecture 1. Let
F F E L 2 (A,A,P); suppose that
FCL 2 (A,A,P) with
(2)
/log N ([ 2) (E,F,P) dE
0 Then
F
is a P-Donsker class of functions.
Note that this conjecture should be compared with Pollard's
central
limit theorem [17] where NE1 iR replaced by a random entropy. Also, F, (2) is generally weaker than (1). for uniformly bounded Now the following theorems support the conjecture and show that Recall that N( 2 ) condition (2) is actually necessary in some cases. is given as in Definition 2. Theorem 5. Let P be a probability measure on JR with a density f(x) such that (f(x)+f(-x)) is decreasing for x large; let H be the It! < 1} class of functions { x-4- e itx The following are equivalent:
(i)
/log Nri (E 2 ,H,P) dc<
,
0 (ii)
/log N 2)] (E,H,P) dc<
,
0 (iii) j /log N (2) (E,H,P) dE< Co , and 0 (iv)
H
is P-Donsker .
The proof of Theorem 5 closely parallels the proof of Theorem 1 of [22] where the equivalence of (i), (ii), and (iv) is demonstrated; we do not provide the details here. Our final theorem should be regarded as a generalization of the Borisov-Dudley-Durst theorem [2,7,9] which characterizes when the class of all subsets of
IN + is p-Donsker.
125
Theorem 6, Let
fm
m > 1 , be a sequence of positive functions on IR 1/2 with disjoint support; let Hf m 11 2 = (ffm2 dP) and suppose m 22
f : Ac2 N+ 1 .
mA m The following are equivalent:
(1) (ii)
H fmil 2 f
(iii) F
A ogNri (E,F,P) dc< Co ,
and
is P-Donsker.
Proof. (i)
Observe that standard Gaussian processes techniques give (iii), see [19] for details.
Let us prove (i)4(ii)-, this will essentially be a consequence of the Borisov-Dudley-Durst theorem. 2 + Take IfmU2= M ‹. and let Q be a probability measure on N defined by Q({10)
If-mil 2 /m .
Now we claim that
N
(2) (1) 2 IN + (E,F,P) = N (s /M, 2 ,Q) [ 1 [
(3)
where, as usual, the metric entropy for a class of sets is taken to be the metric entropy for the class of respective indicator functions. To show (3) we show that every bracket for the class to a bracket for
2 N+ and vice versa.
a bracket [hi, h.] for 1
hT1 =
f incA : m 1 for some sets A.1 that
and and
By minimality of
F corresponds (2)
F will necessarily have the form +
11. / = mEA.-1- f m 1 N+ + + . A./ , A.c:A. belonging to 2 1— 1 '
Now observe
126
f
(hI-hi) 2 dP = f(
2 f ) dP mEADA1 m 2 fmdP
ri 1 1
2 Hfmll 2 mcA +. \ A. (1
M
• 1
- 1 A.1
)dQ . A.1_
+ - 2 1/2 Thus, if If(h.-h-) dP1 N (1 ) (6 2 /M, 2t\T+, Q); to show the reverse in[ [ equality we just need to trace these steps backwards. By the Borisov-Dudley-Durst theorem, we have
J
/iog N (1) (6 2 2 N+ , Q) de < 0 [
‘@41
1
11 2
<
CC
(4)
Combine (3) and (4) to obtain (i)4=> (ii) ; this completes the proof of Q.E.D. Theorem 6.
§5.
FINAL REMARKS
Having shown the efficacy of metric entropy with L 2 bracketing, much still remains to be done. Apart from resolving the validity of (2) proves useful in the conjecture, it remains to be seen whether N E ] studying the bounded and compact laws of the iterated logarithm CUL). Based on the results of [20,23] it seems reasonable to make the following
127 Conjecture
2.
Let F CL 2 (A,A,P) with
Fi,c L 2 (A,A,P) ; suppose that
lg o N C2) [ ] (e F P l 1
5
/
dc <
.
.0 / logloglog N (2) (s,F,P)
[ 1
Then
lim
l yn (f) < sup ffF Vloglog n
a.s.
Were it true, this conjecture would establish sufficient metric entropy conditions for the bounded LIL weaker than those provided by [8]. Whether N (2) provides general rates of convergence for v (f) more refined than those previously established by L 1 bracketing techniques (see e.g. [1,8,14,23]) is also an open question. Note added in proof: After the writing of this paper I was informed that Dr. M. Ossiander has obtained a proof of the conjecture announced in section 4; at the time of writing, however, I have not yet seen Ossiander's proof.
REFERENCES 1. Alexander, K. S. (1984). Rates of growth and sample moduli for weighted empirical processes indexed by sets, preprint. 2.
Borisov, I. S. (1983). Problem of accuracy of approximation in the central limit theorem for empirical measures, Siberskij Matematicheskij Zhurnal, V. 24, no. 6, pp. 14-25 = Siberian Mathematical Journal, July issue, 1984, pp. 833-843.
3.
(1971). Generalization of the Glivenko-Cantelli DeHardt, J. theorem, Ann. Math. Statist., 42, pp. 2050-2055.
4.
Dudley, R. M. (1978). Central limit theorems for empirical measures, Ann. Prob. 6, pp. 899-929; correction 7, (1979) pp. 909-911.
128
5.
Dudley, R. M. (1981). Donsker classes of functions, Statistics and Related Topics (Proc. Symp. Ottawa 1980), pp. 341-352, New York, Amsterdam: North Holland.
6.
Dudley, R. M. (1984). An extended Wichura theorem, definitions of Dansker class, and weighted empirical distributions, preprint.
7.
Dudley, R. M. (1984). A course on empirical processes, Lecture Notes in Mathematics, no. 1097.
8
Dudley, R. M. and Walter Philipp (1983). Invariance principles for sums of Banach space valued random elements and empirical processes, Z. Wahrschein. verw. Geb., 62, pp. 509-552.
9.
Durst, Mark and R. M. Dudley (1981). Empirical processes, VapnikChervonenkis classes and Poisson processes, Prob. Math. Statist. (Wrocyaw), 1, no. 2, pp. 109-115.
10.
Gaenssler, P. (1983). Empirical processes. Institute of Mathematical Statistics Lecture Notes - Monograph Series 3.
11.
Giné, E. and J. Zinn (1984). Some limit theorems for empirical processes, Ann. Prob.,12, pp. 929-989.
12.
Jain, N. (1977). Central limit theorem and related auestions in Banach space, Proceedings of Symposia in Pure Mathematics, vol. 31, pp. 55-65.
13.
Jain, N. and M. B. Marcus (1975). Central limit theorems for C(S) valued random variables, J. Functional Analysis, 19, pp. 216-231.
14.
Koreinskii, V. I. (1981). On the central limit theorem for empir ical measures, Theor. Probability Math. Statist., 24, pp. 71-82 Tear. Verojatnost. i. Mat. Statist., 24, pp. 63-75.
15.
Mourier, Edith (1951). Lois des grands nombres dans un espace de Banach, C. R. Acad. Sci. Paris, 232, pp. 923-925.
16.
Pisier, G. (1975). Le the6rème de la limite centrale et la loi du logarithme itéré dans les espaces de Banach, suite et fin, Sém. Maurey-Schwartz 1975-76, Exposé IV, Ecole Polytechnique, Palaiseau.
17.
Pollard, David B. (1982). A central limit theorem for empirical processes, J. Australian Math. Soc., Ser. A. 33, po. 235-248.
18.
Pyke, Ronald (1984). Asymptotic results for empirical and partialsum processes: A review, The Canadian Journal of Statistics, 12, no. 4, pp. 241-264.
19.
Yukich, J. E. (1982). Convergence of empirical probability measures, Ph.D. thesis (M.I.T.)
20.
Yukich, J. E. (1984). The law of the iterated logarithm and the empirical characteristic function, preprint.
21.
Yukich, J. E. (1985). Théorème limite central et entropie mdtrique dans les espaces de Banach, C. R. Acad. Sci. Paris, to appear.
22.
Yukich. J. E. (1985). Weak convergence of the empirical characteristic function, Proc. of Amer. Math. Soc., to appear.
23.
Yukich. J. E. (1985). preprint.
Rates of convergence for classes of functions,