converges to
Z
choose an orthogonal matrix K
K 1 Mn Z
K -'--Z
so that
.
When we apply this theorem to the sequence that if
such
K 1Yi K
Applying the latter result to the matrices
.
we deduce that converges to
s(M)e1
Mn e1 <Mle1,e1>
lim
Sn = Yn...Y1
then
Y11Y2....
we find
converges a.s. to some direction
(this improves Theorem 111.4.3). Choosing
x
equal to some basis
Z
1 61
we see that the directions of the rows of e. 1 a.s. to the same limit. vector
Sn
converge
VI.4. Regularity of the invariant measure
We shall now see, following Guivarc'h and Raugi, that although the p-invariant distribution
is often singular, it always has some
v
regularity.
Suppose that
PROPOSITION 4.1.
T11
Then there exists
on
is contracting, strongly
T > 0
irreducible and that, for some
is finite.
f exp T R(M) dp(M)
,
p-invariant distribution
such that the
a > 0
v
P(]Rd) satisfies -
Sup d
and
u
dv(x) <
f
ycP(7R )
Proof any
:
If
y
are two orthogonal unit vectors, we have, for
x
Ix A y I
2
1
=
I
12
IIx
- <x,y>2 2 <x,u>2
.
Hence 11-11
Il
I<x,u>l
llxAy11 Since, by Theorem 1.2, there exists some
Sup
f(
lxII {
the proposition is clear
-
}a dv(x) <
.
Under the above hypotheses, there exist
such that for any
C > 0
such that
I<x,u>
Ilu 11=1
COROLLARY 4.2.
a > 0
x
in
P(]Rd) and
v({Y ; 6(x,Y) < E}) < C
Ea
x
and radius
v(B) = J
a(x,Y)
B 6(x,y)a
e d(x,Y)-a
dv(Y) < E:(' 1
and
.
Proof : This follows readily from the fact that if center
a > 0
e > 0
dv(Y)
B
is the ball with
1 62
As a consequence let us prove that the dimension of
V
is
positive. First recall the definition of the Hausdorff dimension of a
A
subset
(P(1Rd),6). The reader is referred to
of the metric space
Billingsley [5]
for details. For any
I > 0
and
E > 0
let
ks(A,E) = inf E (diam B.) i
where the infinimum extends over all countable coverings of A by closed balls
of diameter less than
B,
i
E
,
and let
R(A) = lim R,(A,E) E4O
A
The Hausdorff dimension of
dim A = sup{(
is defined by
i (A) = W} = inf{ : R(A) = 01
:
13
dim V = inf{dim A ; A is a Borel subset such that V(A) J01. Now if
v(A) # 0
E v(Bi)
V(A) Za(A) >_ v(A)
so that
and if
and
(B
<
)
i
is such a covering we have
C E (diam Bi)a
dim A ? a . This proves that
dim v2_! a > 0.
According to Ledrappier [46] one may conjecture that, for
2 x 2
matrices,
dim v = - 2y f
y) (x) dv(x) du(g)
Log d(B
(9)
(This is actually proved in [46] but for a weaker notion of dimension).
We have seen (see 11.7.2) that plane
H = {z E
IC
;
Imz > 0}
g.z when
g = [ a
d
]
.
is just the action of 1R U {-} p
, then
If
.
V
GP
Sl(2,1R) acts on the upper half
by the formula
az + b cz + d
Also that the extended action on the real axis Sl(2,]R) on
P(1R2)
if we identify
P(1R2) with
is the closed subgroup generated by the support of
is carried by the so-called limit set
L = {X E ]RU {oo} ; ](gn) C G11
L
of
Gu
,
i.e.
, his g n i = X} n_ a_
(This follows from the fact that
L
is closed and
G
- invariant or
u
1 63
converges
directly from Corollary 11.7.2 which asserts that to a random variable with law
V)
.
L may be arbitrarily small
It is known that the dimension of
V may be (and is often)
(see e.g. Beardon F3]). In particular singular.
It is worth rephrasing the above corollary in the context of projective transformations.
Let
COROLLARY 4.3.
G1(2,1R) such that
be a probability on
1i
strongly irreducible and contracting. If for some eT £(M)
Tu
is
,
is the probability measure on
is finite and if v
dp(M)
T > 0
]R
f
which satisfies
J
f(ax+b) dp(a,b,c,d) dV(x) = J f(x) dv(x) cx +d
for any bounded continuous function
a > 0
such that
C > 0
and
]R, then there exist
on
f
(i) For any
in
y
]R
J Ix-yl-a dv(x) < C (
(ii)
J
(1+x2)a/2 dv(x) < C
.
In particular the distribution function
F(x) = v(--,x]
is H6Zder
continuous.
Proof and
:
The mapping
'Y(°°)
= e1
]R U {oo} > P(]R2) defined by P(x) =xe1 +e2
T :
sends
v
onto the unique
P(]R2). Therefore, by Proposition 4.1 such that for any real
,
p-invariant measure on
there exist
y d(T(y),T(x))-a
dv(x)
<
C
dv(x)
=
C
J>R
and
( J
]R
But
d(1(y),Y(x))
II (x e1+e2) A (yet +e2) II Ix e1 +e 211
Ily et + e 211
C > 0
and
a > 0
1 64
Ix-y I /1+J
and
6(1'('),T(x)) =
1+yZ
I(Xe1 +e2) A e1 II
1
1+xZ
1+xZ The proposition follows immediatly.
satisfies a law
v
Finally we notice that the probability measure
of pure type with respect to the natural measure md . Recall that
]R U [-I, m2
On the projective line
and
d
P(]R ),(see II.5.1).
is the unique rotation-invariant distribution on
can be identified with the Cauchy
distribution and is equivalent to the Lebesgue measure.
PROPOSITION 4.4.
Let
be a probability measure on
p
has only one p-invariant distribution v
- or
V
and md
- or
v
is purely discrete.
Proof : Write
component of
v
on
P(IR)d). Then
is absolutely continuous w.r.t.
-either
for the discrete (resp. continuous) P * V2
It is readily seen that
.
and
are mutually singular ,
(resp. v2)
V1
V
G1(d,]R) which
is continuous.
Therefore the equality *
V = V entails that thus
.
But
v1 + u
V2
and
11 * v2
have the same mass,
By uniqueness either
V = V2
or
v2 > }1 * v2
V2 = 11 * V2
*
V = u
Suppose now that
V
.
v2
is continuous and write
V = V1
Va.c
.
(resp. V ) for s
its absolutely continuous (resp. singular) component w.r.t.
for any M
Gl(d,]R)
in
Mmd is equivalent to
and
m
.
d
Since
(see 11.5.1)
(
u * Va.c =
I
M Va.c dp(M)
is absolutely continuous, entailing as above that
11 * Va. c
uniqueness of the continuous or
Example 4.5.
V
p-invariant measure either
V
= V
a.c
By
is absolutely
is singular.
It was already noticed by Kaijser in [38] that is it easy
to exhibit singular invariant measures. Consider for instance two non negative
2 x 2
matrices
1 65
[ai
bt
c,
dt
M1
Idet Mil = 1
such that
show that if
a2
1 d2 J
L c2
M2
ci > 0
,
b2
and
> 2
d
i = 1,2
for
.
Let us
is the Bernoulli measure defined by
P
u({M1 }) = P({M2}) = 1/2 then the
11-invariant distribution
We shall consider
V
on
prove that the Lebesgue measure of the support For
i = 1,2
is singular.
P(1R2)
]R U {-} and
as a measure on the projective line
V
and
x
in IF, U {m}
V
of
S
is zero.
set
a.x + b. i
q.(x) =
i
c.x + d.
i
a2
a = Max(a1,
It is readily seen that if
.([O,a]) under
is contained in
T.
,
[O,a]
implying that
Therefore
.
(x) I =
.
Now, since for
<
2
1
12
di
(cix + d
we see that if
p = Min(d1,d2)
then
o Oi o
(Oi
, i )
...
is an interval of length smaller than
A = U {(c 1
,
p n a . Hence if
... , 4i ) [O,a] n
A
then the Lebesgue measure of
V(A)=(11n * V) so that
S
[o,a]
n
2
1
;
,in = 1,2}
i1
is smaller than
=_L_
2n p n a
.
But
,...,¢i )-1(A)) = 1 n
E
2n it,...
is contained in A . This shows that for any
n > 0
<_ 2np-na . Since p > 2 , m(A) = 0
m(A)
Exercise 4.6.
Let
11
be a probability measure on
a unique invariant distribution v
V
is the smallest closed subset
M
Exercise 4.7. (or
[O,a]
x ? 0
any
of
then
[0,a]- is invariant
is contained in
S
bt
b2) c2' d1' d2
c1
in
Tu
Gl(d,]R) which has
P(]Rd). Show that the support
on S
of
P(]Rd) which satisfies
.
(a) Suppose that
P
is a probability measure on
G1(d,]R)
S1(d,]R)) which has a bounded density with compact support w.r.t.
166
the Haar measure. Show that (b) Deduce that if
has a bounded density.
V
is absolutely
V
is not singular then
P
continuous.
Show that the Cantor distribution is
Exercise 4.8.
some non-irreducible probability measure
VI.5. An example
Let
(5.1)
:
p
p-invariant for
Gl(2,IR).
on
Random continued fractions
{An, n
1}
>_
be a sequence of independent random
positive integers with a common distribution
p , defined on some
1P). Consider the 2 x 2 matrices
(0, A
0
[1
Yn and call
p
A
n
their distribution. For any
in the projective line
x
]R U {co} , 11 Al + A2+ 1
+An+x Notice that Z
with values in
converges a.s. to a real random variable [0,1]
The distribution
.
V
of
is the
Z
p-invariant distribution on the projective line. This follows readily from the fact that the
A n's
are positive integers and from the
elementary properties of continued fractions (see e.g. Billingsley [5]). (Much less obvious is the fact that this also holds for arbitrary real i.i.d.
A
n
under an integrability assumption, which is an immediate
consequence of Theorem 3.1). Let us show that
V
is singular as soon as the
a.s. constant (in this case
Tu
A,'s i
are not
is strongly irreducible and
contracting, cf. Exercise 5.4). This result is due to Chatterji C1?. First recall that any real number
t
in
(0,1)
may be expanded
in a continued fraction, i.e. t = lim (a1 (t)
n-
where we define inductively
;
a2(t)
...
;
an(t))
1 67
1
(a 1) =
When
a1+(a2;...;an)
is irrational this expansion is unique. In particular since
t
A1(w),...,An(w)
law of
V
V-a.s. unique. As a consequence the
is continuous, the expansion is
under
1
(at;...;an) =
a1
under
is the law of
]P
a1(t),...,an(t)
V
Consider now the mapping
T
(0,1) -> (0,1)
:
defined by
T(t) = t - [t] Since T
(a1(T(t))
;
a2(T(t))
;
a3(T(t))
;
...) = (a2(t)
;
a3(t)
;
...)
,
is equivalent to the shift on the sequences of integers. The 0-1
law thus implies that
is invariant and ergodic under
V
(1+x)-1
well known that the measure under
T
dx
on
(0,1)
(see e.g. Billingsley). Therefore either
w.r.t. the Lebesgue measure or a1(t), a2(t), ...
v = 1
(0,1)
(Log 2) -1
.
But it is
is also ergodic V
is singular
(1+x)-1 dx
.
But
are not independent under this latter distribution
(elementary computations). This proves that
(5.2)
T
V
is singular.
Let us outline a proof of Ledrappier's formula (9) in this
simple situation, which is in this case due to Kinney and Pitcher [43]. Let
Jn(w) = Y1 (w)...Yn(w) 0,1] It is readily seen that we may write
Mn = Y1...Yn =
where
qn = Angn-1 + qn-2
[Pn_i
Pn
L qn_1
qn
The length .
IJn(w)I
of
Jn(w)
IJn(w)I = IMn(w)'0 - Mn(w)'1I =
But, since
q
n
+
qn2 Therefore, if
'
2 qn-1
]E(Log A1)
associated with
n,. n
{gn_I(w)(gn(w)+qn-1(w))}-
5 q n-1
2
lim
satisfies
(Yn)
15
gn-1(gn
+q n-1)
2 + 2 - qn-1 qn
is finite and if
y
is the Lyapunov exponent
then
Log IJn(w)I = - lim n Log IIMn(w)e2112 = -2y
On the other hand
IF a.s.
168
V(Jn(W)) = V({t e (0,1)
n II
p({A()})
i=1
i
(Recall that under
V
a1(t),...,an(t)
,
distributed according to if
p(i) = pi
a1 (t) = A1(u),...an(t) = An(W)})
;
are independent r.v.
p). Making use of the law of large numbers,
,
n
n Log V(Jn(W)) = 7E (Log p(A1)) E
pi Log pi
a.s.
i=1 (Note that since
Log x < x-1
E pi Log i = ]E(Log A1)
and since
is
finite
IEpiLogpil =EpiLog Z + 2EpiLogi < (E 2)-1 + 2 1E (Log A1) i pi i is also finite).
Therefore
Ep Logp
Log V(Jn (W) )
i
lim
n- Log J. (W) = In other words if for any
in
t
In(t) = is a (0,1)
;
2y
i
]P
a. s.
we set
(0,1)
a1 (s) = a1 (t) ,...,an(t) = an(s) }
then
Log V(In (t) )
him n-
Ep. Logp. 2Y
Log IIn(t)
1
V-a. s.
Onecan deduce from this formula (see Kenney and Pitcher [43]) that
dimV= -
E pi Log pi 2y
In order to recover Ledrappier's formula (9) it suffices to note that if
M = (0
1)
then n
\)(M-In(t)) = p(a)
p(ai(t)) = p(a) V(In(t))
ll
i=1
(use the formula
V({s ; ai(s) = a, a2(s) = al(t),
an+1 (s) = an(t)}))
Thus
d (M
N))
= p (a)
J Log d(M dN)
on
(0,I)
and
1V) dV dp(M) = E pi Log pi
169
An interesting example of such a measure is the Minkowski
(5.3)
measure denoted by distribution on
(0,1). We may define it as the
on
?
]R U {oo} where
(0
n)
=
11-invariant
is defined by
p
n
I
n
,
2
In order to describe it let us notice that if
is the Bernoulli
p
measure which verifies i(A) = 11(B) = 1 /2 for
B=( then
0)
p-invariant. (This is proved as follows
is also
?
B=(
11)
ti
:
if
V
is
ti
p-invariant then
Z(6A+6B) * V =v
v=26A *v+2 6B * (26A+Z dB)
and
V
2 6A * v+ 4 6BA * v+ 4 6B 2* v By induction we have for any
n > 0
,
n
V= E - 6 m=1
and, going to the limit
v But
m=1
*v
d
Bn
2n
,
m=1 2m
E L 6
p =
* V+
Bm-1A
2m
V
dBm-1A
, hence
V = ?
by uniqueness of the invariant
Bm-1A
2m
measure).
We now define inductively the Brocot sequence of order p1(n) p2 ( )
q1 (n)< - .. -
po(n) qo(n)
po(0)
For
n
0
p1(0) 1 q1(0) = 1 . The sequence of order
n = 0 , qo(0) = 1 and
is obtained by adding to the sequence of order
pi(n) + pi+1 (n) qi(n) + qi+1 (n)
n
the fractions
i = 0, ..., 2n-1 '
For instance the first three Brocot sequences are order 0
0
1
1
order 1
order 2
1
0
0+1
1
1+1
=
0
0+1
1
1
1
1 +2
3
2
1
1
2
1
1+1 2+1
__
1
1
3
1
n + 1
170
Ifs M
2 x 2
is a
matrix such that
Idet MI = 1
q and
,
it follows from Bezout's theorem that q
M = (P -P q '-q
P) q
Whence, since and
x
B'x =
x+1
11
MB-0 =.P MA-1 = MB-1 = + . Making use of this q q P +q result one sees immediately that if Y1,...,Yn are in {A,B} then
MA-0 =
,
Y ...Y n
Pi (n)
10,11 _
for some (with
0
_<_
i < 2n
Y. = A
or
.
B)
(n) Ci
Pi+1(n)
4 . (n)
1
i + 1(n)
Since there exists only one sequence
Y11 ...,Yn
which satisfies this relation we conclude that
Pi+1 (n)] _
i(n) ' qi+1 (n)
rpi(n)
(In
pi+1 (n)]
)( qi(n)
U
' qi+1(n)
Obviously this formula defines
?
1
2n
)
uniquely.
Related examples may be found in Chassaing, Letac and Mora D,] Exercise 5.5
.
is also taken from this paper (see also Letac and
Seshadri [51]). It gives a non-trivial example where the Lyapunov exponent is explicitely computed.
Let
Exercise 5.4. p
A be a not a.s. constant real random variable, and
be the distribution of
Y= (0
1
A)
1
a. Show that b. Let
Tu
be the
v
is strongly irreducible and contracting.
p-invariant distribution on the projective line
1R U {oo}. Prove that the Lyapunov exponent equal to
y
associated with
f Log x dv(x) , when there exists some
T > 0
]E{IAIT} is finite. Show that this is also true when A integer-valued, as soon as
Exercise 5.5.
[11]
distributions on
[51]. For
]E(LogIAI)
a > 0
,
11
is positive-
is finite.
let
and
as
]R+ defined by
aa(dx) = 2 exp - 2 1 0
dx
va
is
such that
be the two
171
2 (x + X) 1(0,co) dx
va(dx) =
where
j ur-1 exp -
Kr (a) =
a
(u + u) du
0 2
a. Consider two independent random variables the law of
X
(resp. A)
is
va
(resp. Aa)
.
X
and
A
such that
Show that the law of
A)- 1
(X +
is
Va
.
(Hint
:
compute the Laplace transforms).
b. Show that the Lyapunov exponent distribution of
0 (1
1
A)
y
2Ko(a) is equal to
aK
1 (a)
associated with the
SUGGESTIONS FOR FURTHER READINGS
We provide some recent references on limit theorems for random matrices and related topics. We make no claim for completeness but the quoted papers often contain a large bibliography.
(1) Lyapunov exponent for stationary sequences
The main properties of Lyapunov exponents in the stationary case can be found in Ledrappier [46] . Guivarc'h [32] (see also Royer [62] Virtser [69],[70]) gives a criterion ensuring that two given exponents are distinct for markovian products. See Ledrappier [48] for an assumption implying that the exponents are not all equal, in the general stationary setting.
(2) Boundary theory
After the fundamental work of Furstenberg (see [20] , the set of bounded harmonic functions was determined
E221, [23] )
- for absolutely continuous distributions on connected groups by
Raugi [59]
(see also Guivarc'h E311),
- for distributions on discrete groups of matrices by Ledrappier
[47]
.
(3) Limit theorems (3.1)
Onecan find a proof, under our usual irreducibility
assumptions, of - the functional central limit theorem, - the law of iterated logarithm, - the renewal theorem,
173
174
- the local limit theorem,
for the sequence
Log II Sn x II
in Le Page [49], [50]
properties are studied in Guivarc'h [33] The central limit theorem for
Sn
.
Recurrence
(see also Bougerol [9]) written iri the polar and the
Iwasawa decomposition is proved in Raugi [59]
.
References to earlier
proofs and applications can be found in Tutubalin [68]. (3.2)
Properties of the solutions of the difference equation on
d IR
Xn+1 = Yn Xn + Bn Bn
(where
[41] d = 1
is in
IRd
and
and in Le Page [50] ,
Grincevicius [34]
(3.3)
.
Yn
in
G1(d,1R))
are studied in Kesten
Stationary solutions are given in [8]
.
For
proves a central limit theorem.
Without irreducibility assumptions, the central limit
theorem is not yet fully understood. The latest reference is Raugi [60].
(4) Positive matrices
The reader will find in Cohen [14] a nice account of the applications of products of positive random matrices to demography and an extensive bibliography. Kesten and Spitzer [42] study the convergence in distribution of such products.
(5) Linear stochastic differential equation
A goog introduction to this subject is the survey of Arnold and Kliemann [1]
. A nice application is given in Pardoux and Pignol [58].
BIBLIOGRAPHY
[1] ARNOLD, L. and KLIEMANN, W. (1983). Qualitative theory of
stochastic systems. In "Probabilistic Analysis and Related topics", A.T. Bharucha-Reid (ed.), Vol. 3, 1-79, Academic Press, New York.
[2] ARNOLD, L., CRAUEL, H. and WIHSTUTZ, V. (1983). Stabilization of linear systems by noise. SIAM J. Control Optim. (21), 451-461. [3] BEARDON A.F. (1966). The Hausdorff dimension of singular sets of properly discontinuous subgroups. Amer. Journ. of Math. (88), 722-736.
[4] BELLMAN, R. (1954). Limit theorem for non-commutative operations. I. Duke Math. J. (21), 491-500. [5] BILLINGSLEY, P. (1965). Ergodic theory and information. Wiley and Sons. New York.
[6] BIRKHOFF, G. (1957). Extensions of Jentzsch's theorem. Trans. Amer. Math. Soc. (85), 219-227.
[7] BOUGEROL, P. (1984). Stabilite en probabilite des equations differentielles stochastiques lineaires et convergence de products de matrices aleatoires. C.R. Acad. Sc. Paris, (299), Serie 1, 631-634.
[8] BOUGEROL, P. (1984). Tightness of products of Random matrices and stability of linear stochastic systems. To appear in Ann. Probab.
[9] BOUGEROL, P. (1985). Oscillation des produits de matrices aleatoires dont 1'exposant de Lyapounov est nul. To appear. [10] BREIMAN, L. (1968). Probability. Addison Wesley. Reading Massachusetts. 175
176
[11] CHASSAING, P., LETAC, G. and MORA, M. (1984). Brocot sequences and random walks on
Sl(2,1R). In "Probability measures on
groups 7", ed. H. Heyer. Lecture Notes in Math. n° 1064. Springer Verlag. Berlin, Heidelberg, New York, 36-48.
[12] CHATTERJI, S.D. (1966). Masse, die von regelmassigen Kettenbruchen induziert sind. Math. Annalen (164), 113-117. [13] CHEVALLEY, C. (1951). Theorie des groupes de Lie, t.2
:
groupes
algebrigues. Hermann, Paris. [14] COHEN, J.E. (1979). Ergodicity theorems in demography. Bull. Amer. Math. Soc. (3), 275-295.
[15] COHEN, J.E. and NEWMAN, C.M. (1984). The stability of large random matrices and their products. Ann. Probab. (12), 283-310. [16] DEKKING, F.M. (1982). On transience and recurrence of generalized random walks. Zeit. fur Wahrscheinlichkeitstheorie and Verw. Gebiete. (61), 459-465. [17] DUNFORD, N. and SCHWARTZ, J. (1958). Linear Operators, Vol. 1. Interscience.
[18] FELLER, W. (1971). An Introduction to Probability Theory and its Applications. Vol. 2. Wiley. New York, London, Sydney, Toronto. [19] FREIDLIN, M.I.
and VENTSEL, A.D. (1984). Random perturbations of
dynamical systems. Springer Verlag. Berlin, Heidelberg, New York.
[20] FURSTENBERG, H. (1963). A Poisson Formula for semisimple groups. Annals of Math. (77), 335-383. [21] FURSTENBERG, H. (1963). Non-commuting random products. Trans. Amer. Math. Soc. (108), 377-428.
[22] FURSTENBERG, H. (1971). Random walks and discrete subgroups of Lie groups. Adv. Proba. and related topics (1), 3-63. [23] FURSTENBERG, H. (1972). Boundary theory and stochastic processes in homogeneous spaces. In "Harmonic analysis on homogeneous spaces", Symposia on Pure and Applied Math. Willamstone, Mass. [24] FURSTENBERG, H. (1980). Random walks on Lie groups. In "Harmonic analysis and representations of semi-simple Lie groups". J.A. Wolf, M. Cahen and De Wilde (eds.). D. Reidel Publishing
1 77
Company. Dordrecht, Holland. [25] FURSTENBERG, H. and KESTEN, H. (1960). Products of random matrices. Ann. Math. Statist. (31), 457-469. [26] FURSTENBERG, H. and KIFER, Y. (1983). Random matrix products and measures on projective spaces. Israel J. Math. (10), 12-32. [27] GLASNER, S. (1976). Proximal Flows. Lecture Notes 517. Springer
Verlag. Berlin, Heidelberg, New York. [28] GORDIN, M.I. and LIFSCHITZ, B.A. (1978). The central limit theorem for stationary Markov processes Soviet. Math. Dokl. (19), N° 2, 392-394.
[29] GREENLEAF, F. (1969). Invariant means on topological groups. Van Nostrand. New York. [30] GRINCEVICIUS, A.K. (1974). A central limit theorem for the group
of linear transformations of the real axis. Soviet
Math. Dokl.
(15), 1512-1515.
[31] GUIVARC'H, Y. (1980). Quelques proprietes asymptotiques des produits de matrices aleatoires. In "Ecole d'e"te de SaintFlour 7 - 1978", ed. P.L. Hennequin.Lecture Notes in Math. n° 774. Springer Verlag. Berlin, Heidelberg, New York. [32] GUIVARC'H, Y. (1984). Exposants caracteristiques des produits de
matrices aleatoires en dependance markovienne. In "Probability measures on groups 7", ed. H. Heyer. Lecture Notes in Math.
n° 1064. Springer Verlag. Berlin, Heidelberg, New York, 161181.
[33] GUIVARC'H, Y. (1984). Application d'un theoreme limite local a la transcience et a la recurrence de marches de Markov. In "Theorie du Potentel". Lecture Notes in Math. n° 1096. Springer Verlag. Berlin, Heidelberg, New York, 301-332. [34] GUIVARC'H, Y. and RAUGI, A. (1985). Frontiere de Furstenberg, proprietes de contraction et theoremes de convergence. Zeit. fur Wahrscheinlichkeitstheorie and Verw. Gebiete. (69), 187242.
[35] HENNION, H. (1984). Loi des grands nombres et perturbations pour des produits reductibles de matrices aleatoires. Zeit. fur Wahrscheinlichkeitstheorie and Verw. Gebiete (67), 265-278.
1 78
[36] HEWITT, K. and ROSS, A.
(1963). Abstract Harmonic Analysis 1.
Springer Verlag. Berlin, Heidelberg, New York. [37] IKEDA, N. and WATANABE, S.
(1981). Stochastic differential
equations and diffusion processes. North Holland-Kodansha Amsterdam, Takio.
[38] KAIJSER, T. (1970). On distribution problems for random products of non-commutative matrices. Report n° 22. Uppsala University. [39] KAIJSER, T. (1972). Some limit theorem for Markov chains with
applications to learning models and products of random matrices. Report Institute Mittag-Leffler, Djursholm, Sweden. [40] KAIJSER, T. (1978). A limit theorem for Markov chains in compact
metric spaces with applications to products of random matrices. Duke Math. Journ. (45), 311-349. [41] KESTEN, H. (1973). Random difference equations and renewal theory for products of random matrices. Acta Math. (131), 207-248. [42] KESTEN, H. and SPITZER, F. (1984). Convergence in distribution for products of random matrices. Zeit. fur
Wahrscheinlichkeitstheorie and Verw. Gebiete. (67), 363-386. [43] KINNEY, J.R. and PITCHER, T.S. (1966). The dimension of some sets
defined in terms of f-expansions. Zeit. fur Wahrscheinlichkeitstheorie and Verw. Gebiete. (4), 293-315. [44] KUNITA, H. (1984). Stochactic differential equations and stochastic flow of diffeomorphisms. In " Ecole d'ete de
Saint-Flour 12-1982", ed. P.L. Hennequin. Lecture Notes in Math. n° 1097. Springer Verlag. Berlin, Heidelberg, New York. [45] LANG S. (1965). Algebra. Addison-Wesley. Reading, Massachusetts. [46] LEDRAPPIER, F. (1984). Quelques proprietes des exposants caracteristiques. In "Ecole d'ete de Saint-Flour 12-1982", ed.
P.L. Hennequin. Lecture Notes in Math. n° 1097. Springer Verlag. Berlin, Heidelberg, New York. [47] LEDRAPPIER, F. (1984). Poisson formula of discrete groups of matrices. To appear in Israel Jour. Math. [48] LEDRAPPIER, F. (1985). Positivity of the exponent for stationary sequences of matrices. To appear.
179
[49] LE PAGE, E. (1982). Theoremes limites pour les produits de matrices aleatoires. In "Probability measures on groups", ed. H. Heyer. Lecture Notes in Math. n° 928. Springer Verlag. Berlin, Heidelberg, New York, 258-303. [50] LE PAGE, E. (1983). Theoremes de renouvellement pour les produits de matrices aleatoires. Equations aux differences aleatoires. Seminaire de Probabilites. Universite de Rennes. [51] LETAC, G. and SESHADRI, V. (1983). A characterization of the
generalized inverse Gaussian distribution by continued fractions. Zeit. fur Wahrscheinlichkeitstheorie and Verw. Gebiete. (62), 485-489.
[52] LOOMIS, L.H. and STERNBERG, S. (1968). Advanced Calculus. Addison Wesley. Reading, Massachusetts. [53] Mc CRUDDEN, M. and WOOD, R.M. (1984). On the supports of absolutely
continuous Gauss measures on
Sl(2,1R). In "Probability
measures on groups 7", ed. H. Heyer. Lecture Notes in Math.
n° 1064. Springer Verlag. Berlin, Heidelberg, New York, 379397.
[54] MOSTOW, G.D. (1955). Self-adjoint
groups. Ann. of Math. (62), 44-
55.
[55] NAGAEV, S.V. (1957). Some limit theorems for stationary Markov chains. Theor. Proba. Appl. (2), 378-406. [56] NEWMAN, C.M. (1984). The distribution of Lyapunov exponents Exact results for random matrices. To appear. [57] OSSELEDEC, V.I. (1968). A multiplicative ergodic theorem. Trans. Moscow Math. Soc. (19), 197-231. [58] PARDOUX, E. and PIGNOL, M. (1984). Etude de la stabilite de la solution d'une E.D.S. bilineaire a coefficients periodiques.
Application au mouvement des pales d'helicoptere. In "Analysis and Optimization of Systems, Part 2"
;
A. Bensoussan and J.L.
Lions. (Eds.), Lecture Notes in Control and Info. Sci. 63. Springer Verlag.
[59] RAUGI, A. (1977). Fonctions harmoniques et theoremes limites pour les marches aleatoires sur les groupes. Bull. Soc. Math. France, Memoire 54, 127 p.
180
[60] RAUGI, A.
(1980). Quelques remarques sur le theoreme de la limite
centrale sur un groupe de Lie. C.R. Acad. Sc. Paris, (290), 103-106.
[61] REVUZ, D. (1984). Markov chains. North Holland. Amsterdam, New York, Oxford.
[62] ROYER, G. (1980). Croissance exponentielle de produits markoviens de matrices aleatoires. Ann. I.H.P. (16), 49-62.
[63] SAZONOV, V.V. and TUTUBALIN, V.N. (1966). Probability distributions on topological groups. Th. Proba. Appl. (13), 1-45.
[64] SENETA, E. (1981). Non-negative matrices and Markov chains, 2nd ed., Springer Verlag. Berlin, Heidelberg, New York.
[65] SIEBERT, E. (1982). Absolute continuity, singularity, and supports of Gauss semigroups on a Lie group. Mh. Math. (93), 239-253. [66] TUTUBALIN, V.N. (1965). On limit theorems for products of random matrices. Theor. Proba. Appl. (10), 15-27. [67] TUTUBALIN, V.N. (1969). Some theorems of the type of the strong law of large numbers. Theor. Proba. Appl. (14), 313-319. [68] TUTUBALIN, V.N. (1977). The central limit theorem for products of random matrices and some of its applications. Symposia Math. (21), 101-116.
[69] VIRTSER, A.D. (1979). On products of random matrices and operators. Theor. Proba. Appl. (24), 367-377.
[70] VIRTSER, A.D. (1984). On the simplicity of the spectrum of the Lyapunov characteristic indices of a product of random matrices. Theor. Proba. Appl. (28), 122-135.
PART
B
RANDOM SCHRODINGER OPERATORS
INTRODUCTION
Numerous equations arising from one dimensional discrete physical systems lead to the analysis of a linear second order difference operator H, acting on a complex sequence 'Y
n
, n e 7L , by
(H'1' )n = b -1 ((-AY ')n + an 'fn)
In this formula, A is the discrete Laplacian (AY')n ='n+1 +T n-1 and an bn , are two fixed sequences of real numbers with bn > 0,
Tn
'
representing the physical properties of the medium. Generally such operators are associated to "time dependent" equations and we give some typical examples
:
(i) A solution of the Schrodinger equation i
-i) t satisfies a
0(n,t) = Y'n
a
at
= H of the form
HT = a'Y where H is the classical Schrodinger
operator, that is the operator associated to b
n
= 1, Vn E 2Z, and a
n
is
the potential at site n. 2
(ii) A solution of the wave equation bn
iat satisfies a
0(n,t) = 'n
a
2 - _
of the form
at
2
HT = a 'Y where H is the "Helmotz operator" that
is the operator associated to an = 0, Vn E Z Z, and bn is the diffusion coefficient at site n.
(iii) A solution of the heat equation bn (A )n of the form at = ¢(n,t) = Tn a XtsatisfiesH'Y =a4' where H is the Helmotz operator.
Similar equations and operators also appear in quasi-one dimensional systems associated to an infinite wire of finite cross section with R sites. We have only to replace the sites n by (i,n) where i e (1,
..
,
R), (the integer k is called the width of the strip) and
the real sequences an,bn by matrices sequences. It is well known that the spectral properties of the operator 11
183
184
viewed as a self adjoint operator on an Hilbert space, govern the asymptotic behavior of the solutions of the associated time dependent equation. In quantum mechanicsthe number I$(t,n)12 (normalized in a itH
of the time f) n dependent Schrodinger equation, represents the probability of presence of
suitable way) associated to a solution (t,n) = (e
a particle at the site n at time t. Roughly speaking, when T is associated to the continuous spectrum of H then we have the "diffusion" T
behavior
lim
1
T}+ 2T -T
I$(t,n) 2dt = 0, and if T is associated to the
point spectrum of H then we have the "localization" property lim
Sup t
E
I4(t,n)12 = 0.
(See D. Ruelle [53] for more details). In
n,N
deterministic systemswith periodic structure, it is known that only diffusion behavior occurs. But it has been remarked that localization appears when this regular structure is perturbed by impurities or inhomogeneity in the medium. Thus a "metallic" wire suddenly becomes an insulator. We first give below a brief historical survey of the mathematical approaches to this subject.
(1958) that for the
It was first announced by P.W. Anderson [1]
classical multi-dimensional Schrodinger operator with an independent identically distributed family of random potentials, the spectrum has to be pure point for a "typical sample" assuming the disorder "large enough". It was later conjectured by N. Mott and W.D. Twose [46]
(1961)
that in the one dimensional case, this should be true at any disorder. The works of H. Furstenberg, H. Kesten [20]
(1960), H. Furstenberg [19]
(1963), V. Osseledec [49] (1968) provided the essential mathematical
background used in the first rigorous approaches of the subject. It was first proved by H. Matsuda, K. Ishii [44]
(1970), A. Casher
J.L. Lebowitz 110] (1971), L.A. Pastur [50]
(1973), Y. Yoshioka [64]
(1973) that there does not exist an absolutely continuous component in the spectrum of H. An essential step was achieved in 1973 when I. Ja. Goldsheid, S.A. Molcanov, and L.A. Pastur [24] gave the first proof to the conjecture of Mott and Twose (they actually dealed with the continuous case). Their original proof has been later considerably simplified and extended by R. Carmona [7] [52]
(1982),
[8]
(1983), G. Royer
(1983), J. Brossard (1983). In the "discrete" case the same result
has been obtained by H. Kunz, B. Souillard [37]
(1980), J. Lacroix [38]
(1982), F. Delyon, H. Kunz, B. Souillard [13] (1983). Moreover Goldsheid
gave a similar announcement in a strip [23](1981) and the proof can be
185
found in J. Lacroix [39]
(1983) [40]
[41]
(1984).'All these previous
proofs of localization are rather technical and at times the essential guiding principles are not easily understood. Fortunately, in the late of 1984, S. Kotani [36] clarified the situation, giving a rigorous statement to an earlier claim of R.E. Borland [5]
(1963).
Our essential goal is to provide a direct and unified treatment to the foregoing problems, in the general setting of operators H introduced at the begining of this discussion here in after called Schrodinger operators. The essential tool will be the theory of products of i.i.d. random matrices developed in the first
part of this
book. We are mainly concerned with the independent case but a lot of definitions and properties are given in the general ergodic case. The "almost periodic case" is also of great physical and theoretical interest but most of the proofs have nothing to do with random matrices.
Interested readers have to look at the survey of B. Simon [57] where they can also find an extensive bibliography. Since it seems that the theory of random matrices can hardly be used in the multidimensional case (up to now,limiting procedures in strips whose width goes to infinity have not been successful) we restrict ourselves to the one dimensional case and strips.
In chapter I the essential definitions and properties related to the spectral analysis of the deterministic operator H are given. In particular we construct a sequence of "good approximations" of the spectral measure of H and establish the existence of "slowly" increasing generalized eigenfunctions. Moreover the links between the singularity of the spectrum and the fundamental notion of hyperbolic behavior of a product of matrices are pointed out.
In chapter II we define an ergodic family of Schrodinger operators which contains as essential examples the classical Schrodinger operator and the Helmotz operator. Some weak properties of the spectrum of H considered as a subset of JR are given, before introducing the essential
concept of Lyapunov exponent. Positivity of this exponent is carefuly studied since this property is crucial in order to obtain absence of absolutely continuous spectrum. The distribution of states describing
186
the asymptotic behavior of the eigenvalues distribution for the operator restricted to "boxes" is of great physical interest and we discuss in detail its regularity properties together with the links with the Lyapunov exponent (Thouless formula). Kotani's criterion insuring localization property is then introduced in the general ergodic case but its main application to the independent case is discussed in the following chapter. Finaly we give a straightforward application of the central limit theorem on SL(2,1R)
to the asymptotic
behavior of the conductance.
Chapter III is devoted to the proof of the conjecture of Mott and Twose both in classical Schrodinger and Helmotz case. In the first model, Kotani's criterion gives immediately the solution, but in the general case the proof is more involved and requires some Laplace analysis on SL(2,IR) . As a consequence extra properties of the
distribution of states are obtained.
All these foregoing results are generalized in the chapter IV to the case of a strip. Most of the previous proofs in the one dimensional case
can be translated with some care. But some problems
are much more involved, especialy positivity of Lyapunov exponents. General results given in the first part of the book are then very useful. The proof of localization in the general case requires also much more work since Laplace analysis on symplectic groups needs some knowledge about symplectic geometry.
Numerous related topics, non-stationary processes for instance, are not tackled when they don't appear as direct applications of products of i.i.d. matrices, thus we don't intend to provide a complete survey in the theory of random Schrodinger operators. Moreover we are aware of that a lot of pioneer and connected works are not cited since we have focused our attention to a precise mathematical aspect of the subject.
CHAPTER I
THE DETERMINISTIC SCHRODINGER OPERATOR
I.1 The difference equation. Hyperbolic structures
be the linear space of complex sequences T = (Tn) where n
Let
runs through the set of integers FL . The operator H is associated to two
given real sequences a and b with b
n
# 0
Vn c 2Z, and acts on o+by the
formula
(HT )n = bn
1 [T1
- Tn+l + anTJ
For a complex number A every solution of the difference equation HT= A lies in a two dimensional subspace of.ice spanned by the solutions p(A)
and q(A) constructed from the initial values po(A) = q-1(A) = 1, p-1(A) = qo(A) = 0, such that
:
Tn(A) = pn(A) To(a) + qn(A) T-1(a) From now in order to avoid too complicated notations we don't write the variable A in the solutions of the difference equation. A solution Y' of
the difference equation is constructed from initial values To and W-1 by a product of "transfer matrices" Yn defined by
Y
=
ra
n IL
n - Ab n 1
-1
0
187
:
188
n S=YYn-1 n
if n >, 0
Yo
Sn = Yn1 Yn+1 ...
Y
Thus
=S
n+1 J
-1
if if n
oI
n
if n >, 0
n
,
T_
n-1
1
lyn
=S
o
if
n
-1
The transfer matrices Yn and therefore the products So belong to the group SL(2,O) of two by two matrices with complex entries and of determinant one. If A is real then Y. and Sn belong to the subgroup SL(2,]R) with real entries. The construction of the solutions of the difference equation by such products of matrices is the essential link between the two parts of this book.
of SL(2,O) is said "hyperbolic" if the eigenvalues of
A matrix 0
have distinct moduli. If V is an eigenvector associated to the eigenvalue of modulus stricly less than one, then the sequence IlanVll
goes to zero when n goes to + - and for any vector W non proportional
to V the sequence TJII goes to +- when n goes to + -. (The same situation occurs when n goes to -
,
replacing V by the eigenvector
associated to the eigenvalue of modulus strictly greater than one). Let now (sn ) n e 7l ,
that s
n
be a sequence of matrices of SL(2,¢). We say
has an hyperbolic structure in the positive direction if there
exists a non zero vector V of Q2 (called the contractive vector) such = 0. It is readily seen
that lim IIsnVII
n- +-
to V then lim Is
n->+W
WII =+ W
n
that for any W non proportional
(from the determinant property). We define in
the same way an hyperbolic structure in the negative direction. The existence (for some A) of hyperbolic structures for the sequence Sn is the most important question discussed in the following chapters. In particular for a real A the existence of hyperbolic in both directions with the same contractive vector n implies the existence of a real non zero solution Y' of the difference structures for S
lim
equation with
4'
Inl _+W
n
= 0. The next "Osseledec's theorem" is a basic
tool to prove the existence of hyperbolic structures
PROPOSITION 1.1
Let (C.), h
ri
E
i
be a sequence of matrices of SL(2,O)
such that :
(i) lim 1 LogjjC n++. n
n
-1
..
1 II
=Y
189
(ii) n lim-
Log
0
n
Then there exists a non zero vector V such that
lim
n Log
n,+-
n ... 1
VI
I
and for any vector W non proportional to V
lim
...
Log 11 C'
n
1
WII =Y
The proof of this proposition can be found in [49]. The euclidian norm of a matrix of SL(2,¢) is no less than one, thus y is non negative.
When y is strictly positive 0sseledec's theorem asserts the existence of an hyperbolic structure for the sequence sn moreover we see that the sequences 1IsnVII
and 9' n 'n-1 ... Q1 or IlsnWII converge to 0 or
+ - with an exponential rate.
We now give two obvious
(but useful) lemmas related to the
solutions of the difference equation. The Wronskian of $
..yy
and T in obis
the sequence W ( ' ) Wn(T'$)
= '4 $n =
LEMMA 1.2
$n+1 Tn
Let $ and T be two solutions of the difference equation
Hu = Au, then W(T,$) is constant and this constant is zero if and only if $ and T are proportional.
Proof
in
$n
:
S
n
for n < -1
and
N-1 $n-1J ITn+1
$n+1 S
Tn
$n
n
for n,0
Since the determinant of Sn is equal to one the result is obvious.
LEMMA 1.3
Let $ and T be in £, m and n two integers with m S n, then
we have the "Green formula" n E
bk ((HT) k $k - Tk(H$)k) = Wm-1(T,$) - Wn(T,$)
k=m Proof
:
Straightforward computation.
190
Exercise 1.4
:
Let P be a solution of HP = AP, m and n two integers with
0 < n. Prove that if bk is positive for m . k
m < -1
ITn Yn+lI +
Tm Pm-ll =
n then
2
ty
I3 m iI (boI'YoI
+ b-1IT-1I2)
1.2 Self-adjointness of H. Spectral properties
The Green formula of the lemma 1.3 suggests that H should be a
symmetric operator on a suitable Hilbert subspace of ,. From now we assume in all this chapter that the sequence b is strictly positive and bounded away from zero (that is the sequence b-1 is bounded)
We denote by YO the subspace of ou defined by
{P ce/ E bn
ITnI2 < +m}
(when the symbol E has no index of summation it's understood that this index should run through ZL). Endowed with the scalar product E b
subset of
d is an Hilbert space. If D is a dense linear n 4n Tn such that H(D) C `% we denote by (H,D) the linear operator
with domain D. Let Do be the dense linear subset of % of sequences with a finite number of terms different from zero. From the Green formula we know that (H,Do) is a symmetric operator and it's not difficult to see that its adjoint (H,Do)
is equal to (H,D1) where
D1 = {P E 4U/ HY e & _ (P e :%/ E a2 bn1 lPn12 < +m } LEMMA 2.1
If the sequence a is bounded then H is a bounded self adjoint
operator.
Proof
Let a,
HY II = (E bn-1 then by the (E
and
R2 b-1
n
be the bounds of the sequences a, b-1 and P e dam:
pn-1 + Pn-1 - an triangle inequality
Pn12)1/2 :
I2)1/2 + (E b1IY 12)1/2 + (E b-1 a2 IP I2)i/2 n n n n-1 n IT n+1
:
IIHPII. II Y11 (2a + aR) Thus H is a bounded operator on'. The assumption on the sequence b
191
implies that ki G Q2 (2Z) and hence
lim
Inl
= 0 for P e r . By the
P
---
Green formula we see that H is a symmetric operator on
and hence
self adjoint. In many applications the sequence
LEMMA 2.2
is not bounded, so we prove :
a
The operator (H,D1) is self adjoint.
Proof : As remarked in the lemma 2.1, H is a symmetric operator on D1, z
hence (H,D1) is a symmetric extension of (H,Do) with (H,D1) _ (H,Do) the result follows from the theory of self adjoint extensions of symmetric operators 19 1.
From now, when we speak of the operator H, it will be always understood that we actually deal with the self adjoint operator (H,D1). As we shall see in the exercise 2.3 below, the sequence b has not to be bounded away from zero in order that (H,D0) possesses a self adjoint extension, but such an assumption seems reasonable since a solution of HP = AP which is non zero at infinity does not have any physical meaning in general.
Exercise 2.3 (
For n e N let C (b
) 1/2
b
n
be the sequence 1/2
If we assume that E Cn = + n=0 prove that the operator (H,D0) is essentially self adjoint (and (H,D1) Cn = min (bn bn+1
'
-n -n-1)
)
is its self adjoint extension). Hint
:
Use the result of 1.4 to prove that when
a # 0 the equation
HP = AP has no non zero solution in dV2 (see [4] VII theorem 1.3).
We can apply to H the general theory of self adjoint operators
which we may find in [9]. There exists a resolution of the identity E which associates to each borel subset d of 1R the projection Ed of d& and for P, P in QID,oO,T(d) = <Ed4,P> is a bounded complex radon measure on JR
(We denote the positive measurelo
number with
m A
0, R1 = (H - XI)
-
,
a complex
by
01
is a bounded operator on '% and
the spectral theorem asserts that < RO ,P> =
J(t-A)- I
do can be written as the orthogonal sum of three closed subspaces invariant under H (Lebesgue decomposition of p) defined by
:
192
you
r
{ c
db/ °
is absolutely continuous}
16P = {$ c
Y./ °
is pure point}
dl' =
V
e
=
p / °
is singular continuous}
The spectrum Q of H is the union of
G-a,
(]-p, s which are the
spectra of the restrictions of H to the invariant subspaces
Apa, 6 p
A=
We say that the spectrum of H is pure point if we have
%s
in
pp
this case 49 has an orthogonal basis of eigenvectors. In our case this Lebesgue decomposition of 'X can be obtained by the decomposition of a single measure on ]R as it is shown below. Let ek be the orthogonal basis
defined by (ek)n = bnl do (we remark that
of
II ek 112
P e Ob, = 4`n). We denote by °m,n the measure
=
bkl and for
°em,en , by on the
measure an n and a the "spectral measure" a = °° + a-1. For each n in 7L,
pn(X) and qn(A) defined in the first section are polynomial functions in A and thus the operators pn (H) and q (H) are well defined. n
pn (H) e° + qn (H) e 1 = en
PROPOSITION 2.4
The equality is obvious for n = 0 and n = -1 and is proved by -ek+l - ek-1 + ak ek induction for all n from the identity bk H ek = Proof
:
COROLLARY 2.5
For each T in QU the positive bounded measure a
is
absolutely continuous with respect to a. Proof
:
Let 4 be a Borel subset of ]R with °(d) = 0. Then
IIEAe°1I2 = Go (A) = 0 and IIEe 1112 = a-1(d) = 0, thus by the proposition 2.4
:
E,en = pn(H) Ede° + qn(H) E,e 1 = 0. This implies that Ed = 0 and hence
aT(A) = IIEAT112 = 0.
This corollary has a direct consequence for the Lebesgue decomposition of
.
Let aa, °p, as be the three parts of the Lebesgue
decomposition of a.
PROPOSITION 2.6
With the above notations,
, q a,
(T p, (3-s are the
topological supports of the positive bounded measures
°,
°a,
°p,
moreover x is an eigenvaZue of H if and only if °(a) > 0. Proof
:
By Weyl's criterion we know that A eG-if and only if there
a
193
exists a sequence ('n).30 in D1 with lim
= 0 and
11 (H - AI)'Yn11
11 Y.11 = I.
n ->+ Taking in account that for A c IR and 'Y e D1 we have the relation
II (H- AI) Y'112= 1 (t-a)2 daY (t) the result follows for W a
other parts of the spectrum we remark that if Y =
"Lebesgue decomposition" of Y e I then oa = aTa
+
y,P
and a. For the + Ys is the
ay = ors
,
We associate to H the measure valued matrix S given by
:
,
up = aYp.
a-1,0
r
a0,-1
a-1
The spectral measures a
LEMMA 2.7
can be constructed from the
m,n
spectral matrix $ by the formula IPn
pmq )
am,n
$I
qr. Proof
:
Using the expressions of en and em given by the proposition 2.4
it's enough to compute am,n(t) = <E,, em , en >.
Exercise 2.8
Prove that {EAe°, E,e 1 / A is a Borel subset of 7R} spans
a dense subspace of I. Exercice 2.9
Prove that (an + ari+l) and a have the same nul sets.
Some authors work with the operator H restricted to the half axis
N denoted by +H
.
This operator is defined on the sequences `Y = (Tn)n,0
by
(+HP)o =
bo1
(-Pl + ao'Yo)
(+HY)n = (H'Y)n if n , 1 The Hilbert space + db , the spectral measures +am,n , the resolvent +R
are defined in the same way than for H and we set +a = +a o
Exercise 2.10
Prove that
(i) pn (+H) e° = en (ii) For each Y' in
respect to
(n a IS) + ,the
(iv)
+a Y,
is absolutely continuous with
a
(iii) +am,n = Pn pm +a +
measure
o
{ EDe
(m,n a 1V)
/ 4 is a Borel subset of IIt} spans a dense subspace of
+,{
ov
194
For m,n E IN prove the following relations
Exercise 2.11
M
Pn Pm d+a =
n
-2
+
A pn pm d a = bn
(ii)
:
m do
b- 1
n
-1
an dm - bn
-1
bm
The next exercise shows that when
n+1 ({dm
n-1
+ Sm
J mA # 0 then there exists an
hyperbolic structure in the two directions for S , with distinct n contractive vectors. Let +m(A) = J(t-a)-1 d+a(t) for
Exercise 2.12
(i) We define the sequence ( P
for n
m(A) Pn + qn
Prove that H
+T
= A
n e 2Z, by
for n , 0
+'Yn = <+R, eo, en> Tn =
),
m X# 0
+T
-1
+T-1
= 1
(ii) Deduce from (i) that for n 3 0
<+RA e o, en> _ +m (X) pn + qn
(iii) Prove that
+ b n=0
+T.1
2
r
- J m(+m(A))
n
6ma +
Hint
RA -
<+Ra +RA
+
R
= (a-a)
+
R
+
RA and compute
e o, e o>
(iv) The same property holds for the negative half axis and a complex number A with
m a# 0 is not an eigenvalue of H. This provesthe above
assertion about hyperbolic structures. Exercise 2.13
Prove that it's possible to recover the sequences (an)n>,0
and (bn)n>,0 from +a.
Hint
use 2.11.
:
Exercise 2.14
The sequences +T(a), -'V(A) are constructed for the two
half axis in the same way than in 2.12. Prove that the symmetric matrix
n
,
em>
= a
m X O 0) is given for n < m by
(
+`1(m)
`Y (n)
W( T, +y')
(where W is the wronskian)
Exercise 2.15 that n k=0
2
Let A be a real number and n a positive integer. Prove d
d
bk Pk = Pn a Pn+l - pn+1 dl pn
195
1.3 Slowly increasing generalized eigenfunctions
We say that Y'
is a generalized eigenfunction if H'Y = AY for a
E
real A and 'Y is non zero. (When ' E
'Y
is an eigenfunction). A fairly
general theorem about Carleman operators asserts that such sequences
are slowly increasing [4]. Actually it's possible in the simpler case of difference operators to give a direct and easy proof of this result. As we have seen in 2.5, °m
,n
is absolutely continuous with respect to °
(thank's to the inequality I°m,nl ` 2 (Qm + °n)). Let S be a matrix of
density of S with respect to a.
For a almost all real A, the matrix S(A) is symmetric and
LEMMA 3.1 positive.
Proof
:
For each borel subset S of ]R, the real matrix $(5) is symmetric,
positive since ao,-1 = °-1,0' and for real x,y
(x,y) $ (4) xy1 1= 0x e0 + y e1 (5) The conclusion follows easily.
>, 0.
\\1
For a almost all real A there exists a generalized
PROPOSITION 3.2
eigenfunction Y' such that for each E > 0
-T (n) lim InI ±m Ii -+E
=0 ra
For ° almost all X we can write
Proof
O1
K* where K is an
S = K ILO S
orthogonal matrix, a and 6 are two positive numbers with a , S, a +a =1. By virtue of the lemma 2.7 0
nnn =
( P,
4 )K
Pn
[0
SJ
K ( qn
°
/'Y
If we denote by
see that on >, a
the firstcolumnof K and 'n
°
= pn'o +
we
bn' and that the
2 `Yn 0 . We know that on(IR) =
0
qn'Y-1
f2
1+E
sequence bnlis bounded then for any e > 0 we have
E
n#0 n
n
E
n#0 n
,f2 thus
(
< + oo
1+E
o a.e. 'P2
0
This implies that lim
InI >m
n
1+E
° a.e.
do < + m
196
The same result for the operator
Exercise 3.3
H on the half positive
2
axis
for +a almost all A, lim
:
Pn
n->+- n l+e
= 0
Prove that the Proposition 3.2 could be
Exercise 3.4
For a almost
:
all real A there exists a generalized eigenfunction 'Y such that for every
c
we have
t2(TL)
c 22(7L) .
1.4 Approximations of the spectral measures
The spectral measures a are generally obtained as weak limits m,n of spectral measures of the operator H restricted to "boxes". These
limits are independent from the boundary conditions on the boxes and this allows us the construct new approximations which are absolutely continuous with respect to the Lebesgue measure on]R.
A box A is a finite subset [M , N] of 7L , and we say that a sequence of boxes is going to 7L if the associated sequences of M and N are going
to - - and + - respectively.
For a box A anda real number x we define the operator n Hx on the Hilbert space of sequences ('Yn) , n e [M , N], (with the scalar ,gyp
product inherited from 7) by (AHxY)M = bMl (-i'1
(AHx'Y)n = (HT
N=
1 + aM'M)
for M+l
)n
bN1(-T
n
N-1
N-1 + (aN x) Y'N)
Let ^p and "q be the solutions of HW = A'Y with initial values
^qM = 0. It's easy to see that Hx is a self FPM qM-1 = 1' PM-1 adjoint matrix on ^90, its eigenvalues are the roots of the polynom '
^ PN+1
nam X
they are simple and if A is an eigenvalue we have
(A) _
N
APn
Apm(kEM bk Pk)
Let x c1R, m,n a 7L be fixed, then the sequence
PROPOSITION 4.1
converges weakly to a
Proof
:
m,n when A goes to 7l.
Fix A with 3m A
0, x c ]R, n e 7l . A function
n
4'
iA
be extended to
by ^'n = 0 if n 0 A ; with this convention
in
n ax m,n
nt At can
197
( (nH-a
I)n'Y)
(H-aI)nY'
n'YM
+
e
M-1
+
n
TN eN+l
- xn Y' N eN
When
kl - + the sequence ek converges weakly to zero, thus if the Al is bounded when A goes to 2Z (and hence weakly compact) the sequence eM-1 + ATNeN+l sequence AO = ATM x A T eN converges weakly to zero N n A 4'
is bounded by
RA en the sequence
when A goes to 7L . Taking A T =
b-lJIm al-1 . With this choice we have en = (H-A)AP + A and
n
R. e n =
A Y
RA"
If T is any weak cluster point of AT we have
.
'Y = R. en and thus A R. en converges weakly to % en when A goes to TL
Taking in account that I(t-A)_1
_
en a
em> =
d
AOx
n(t)
(t-A)-1 d om,n(t)
I
we obtain the expected result since it is well known that convergence of integrals of the continuous bounded functions t -i
t1X
for mA # 0
implies weak convergence.
LEMMA 4.2
For
m X
Aq eM>_- n N+1
o we have xAqN
eM
A
PN+1 - x PN Proof
C
Let Y` e
n
with
TN+1/- YN-1 "
YM+1
(A Hx
- aI)`Y = e
M
this equation can be written
TM+11 M
1
TM+1 = (aM - AbM) TM
TN-1 = (aN - AbN) TN - x`YN
and we obtain the relation
YNNJ = YN YN-1 ... YM
:
[fr]
L
taking in account that ' N ... YM
from
= `YM.
LEMMA 4.3 Proof
:
If
If
PN+1
L PN
qN+1l
the statement follows
qN
m A > 0, m z . 0 then APn+l - zAPN # 0
APN+1 -
zA PN = 0, z is an eigenvalue of the operator A Hz
(defined in the same way as AHx). For z = x + iy the adjoint operator (AHz)* is given by
:
198
M:n
((^HZ)* `Y)n = (A11zp)n
N-1
((A Hz)" `Y)N = (AHz'Y)N + 2iybN1 TN For an eigenfunction P associated to z
`Y, `Y> = X11TI12 = = aIITII2- 2iylTN12
thus
(3
mA)
IT I
'
'
N12 and this is impossible, the right member of
'N
this equation being strictly positive. We recall without proof some basic facts about the Poisson kernel of the upper half plane. These results are proven for the unit disc in [32]. Let C. be the Cauchy distribution with density 1 II
(t-a)S2+S 2
dt,
z = a + i6 with S > 0. Let f (z) be an analytic function on m z > 0 such that f can be continuously extended to IR U{m}. Then this extension satisfies the Poisson formula
f(z) =
f(t) dCz(t)
:
(Sm z > 0)
J
Furthermore let F(z) be the function associated on m z > 0 to a positive ua+us+up is the bounded measure p by :F(z) _ J(t-z)-1 dp(t). If u =
Lebesgue decomposition of p, then (i) lim 6m F(x+ic) exists for Lebesgue almost all x of IR and is equal t 0
to the density of
a.
(ii) (µs+pP) is supported by the subset {x a IR/lim 5m F(x+iE) _ +m} E+0 In particular if lim im F (x+it) exists and is finite for all real x, Ey0 then p is absolutely continuous with respect to the Lebesgue measure on
IR and its density is given by this limit. Its also useful to remark (
that
dC
J Z dp(t), that is,
m F(z)
m F(z)
is the Poisson integral of
dt
p and the above statements are nothing else than Fatou's theorems. %
We define Aam n as the average of A am n with respect to the Cauchy measure e by the formula A
aM'. (A) =
1am,n(A) dCi(x)
PROPOSITION 4.4
The measure
am
n
has a density with respect to the Apnnpm
Lebesque measure on IR given by
2 II
A2
pN+1+ pN
199
din
In particular we have the .formula 1
Proof
:
A 2 PN +
1
II
nax
Thank's to the relation
Ap np Aax n m M
=
m,n
=
1
2
PN+1
it's enough to
prove 4.4 for m = n = M. By the lemma 4.3, for 3 m i> 0 the function z
nqN+1
z -* -
qN
is continuous on Ci m z3 0 and has obviously a limit _ ZAPN pN+l when IzI -> + m . From the lemma 4.2 and the properties of the Poisson integral
nq
- iA q
N
N+1
(
F(A) = II1 J(t-A)-1 do(t) M
A
A
pN+1 - 1 pN {{
As it is readily seen that for A real limb m F(A + it) _ 64.0
1
A 2 pN+l
+ n Pi2
the result follows from the above remarks.
The sequence
nconverges weakly to a by the proposition 4.1 m,n m,n (this is true for am,n and hence for the averaged Cr
when A goes to 7L
measures). One of our essential goals is to obtain (under some hypothesis) that a
is pure point, and it seems surprising to work m'n n x am which is absolutely continuous (rather than with am,n n which is pure point). But we shall see in the chapter III that this nti
with
choice is well adapted to the proof of. localization in the random case.
Exercise 4.5 (HT)n
Hint
Compute
^
and am,n for the Laplace operator am,n
-fin-1 - Tn+l
pn(A) _ sh(n-M+1)w
A
(where ch w = - 2
sh w d am,n
and for A c ]-2,+2[,
da
Prescribing
Exercise 4.6
cos[(m-n) Arcos - 2 (1-A2) -1 /2 211
boundary values at each endpoint of the box AHy'x
A, we define for x,y a JR the operator
by
(nHy'x 1')M = bMl(-'FM+1 + (aM y)TM)
if M+1 : n < N-1
(AHY,x `F )n = (HT )n
(^Hy'xf)n =
bN1(-TN-l
Prove that for' m A # 0 <^ nRy,x
M
M
+ (aN-x) TN) A
A
qN+1 - x qN A
1 ,
e
e >
- xApN+y(AgN+1 - x'qN)
200
Let +Hy be the operator on the "box" 10, +-[ with the
Exercise 4.7
boundary condition y at the endpoint
0 (see 4.6).(Wh+n y = 0 this is
just the operator +H defined in 2.10. Let +m(A) = J
dto(t)
defined in
2.12.
m(A)
(i) Prove that <+Ry e o, e°> =
1-y+m(A)
X
Hint
use 4.6 with N = 0, x = 0
:
(ii) Prove that the measure
+ay
++'a'
= J
dCi(y) is absolutely continuous
with respect to the Lebesgue measure on R. + MM
Hint : J m
m(a)
is bounded on J m A> 0
1 - i+m(A)
(Remark that Exercise 4.8
mA
> 0 m +m(A) > 0) .
Compute
m(a) for the Laplace operator (exercises 2.12
and 4.5).
1.5 The pure point spectrum. A criterion.
Let A be an eigenvalue of H associated to a normalized eigenfunction f (the eigenspaces are one dimensional !), then
a m,n (X) =
en>
and the density matrix S(A) defined in 1.3 has the form 2 0 Y
1
2 Y
o
+Y
Yo -1
2
-1
TT
Y' 2 1
In this case S(A) is of rank one and each column vector is proportional to
0 1
Therefore, in order to prove that A is an eigenvalue it seems natural to
prove that each column vector of S(X) can be taken as initial values of
a solution of H'Y = A' which is in
a almost
everywhere so this is only an heuristic argument ...). If I is an open subset of 7R, we say that the spectrum is pure point
on I if as(I) = as(I) = 0 (or Gj afl I = (5s (lI = o).
201
Let I be an open subset of]R for which there exists two
THEOREM 5.1
positive constants C and p with p < 1 such that for each box A and ti Im-n
m,n a A,
r"' am
2p (II)
Cp
Then if E bn 1
.
< + -, the spectrum of H is pure point on I and
the associated eigenfunctions faZZ off exponentiaZZy at infinity. Proof
:
On the set of bounded measures on ]R endowed with the weak
topology (that is convergence of integrals of bounded continuous functions), for each positive lowersemicontinuous function f, the map p --' IpI(f) is lowersemicontinuous. Together with proposition 4.1 this
Cp (m-n). Choosing m = 0 and m = -1 in the
implies
lemma 2.7 this can be written :
fIP
n
I
+q
doo
doo,-1I do < Cp InI n
do
doo-1 + q
do
da-1 I
fI
Pn
n
do
do
C P I n+1 I
do
Observing that a sequence in k1(2Z) is necessarily in k2(ZZ) hypothesis E b112 p
nI
the
< +- implies that for a almost all A of I the
two column vectors of the matrix S(pa) can be taken as initial values of a solution of HT = A'Y with 4' e off. These column vectors can't be
simultaneously zero (the trace of S(A) is equal to 1, a a.e.). Hence a almost all A of I is an eigenvalue
and a is pure point on I. For an
eigenvalue AI letting To = ao(X) and W-1 =
ITnI ` Cp n
Go'- 1(A)
we obtain
j
5.2
(i) In the above proof,the hypothesis of 5.1 has only to
hold for m = 0 and m = -1,but as it's easily seen these two statements are equivalent. (ii)The same result holds when we replace ^a^'
mn
by ^ox m,n
(for a fixed x or a bounded sequence x^) in the hypothesis of theorem 5.1, but checking such a property
seems much more difficult,
the measures 'ax being not as tractable as the m,n have'a nice formula (4.4).
n
am,n
for which we
(iii) If the sequence b is bounded (in which case oV is
isomorphic to k2(7L) the assumption
Exercise 5.3
E bn /2
p
ni < +- is vacuous.
Let I be an open subset of It, then if E b /2I a I(I)<+. n n m,n
202
the spectrum of H is pure point on I.
The criterion 5.3 can be obtained as in [37] using the following "Ruelle formula" proved in [53] I I1
T I 12
lm
=
c
lm T-+.
A+7L
1
T
:
T
0
I
I
eitH
l A
c
T I I2 dt
where P is in, IIc is the projection from Pb onto the subspace
V + OVS and 1Ac is the projection operator [1Ae(4)] n = n if n ¢ A
=0 Exercise 5.4
if nrA
Let I be an open subset of R.
(i) Prove that if
for each integer m (or only for m = -1 and m = 0) we
have I IIc EI emll = O,then the spectrum of H is pure point on I. I
(ii)Prove that if for each m (or m = 0 and m = -1) E bn(Iam,nl(I))2 <+m
then the spectrum of H is pure point on I.
eitH'v 112
fT
T
Hint
IlAc J
dt `niAbn(Iam,nl (I))2
0
(iii) Observe that the criterion obtained by restriction of (ii) to
boxes is stronger than 5.3.
1.6 Singularity of the spectrum
Osseledec's theorem 1.1 together with proposition 3.2 about slowly increasing generalized eigenfunctions, are powerful tools to obtain singular properties of the spectrum. The two following lemmas and the
added remark
actually contain essential ideas developed in the sequel.
We assume in this section that lim 1 Log(1 + I a n+±m InI
n
I
+ b. ) = 0.
I
We say that the sequence Yn(a) has an hyperbolic behavior if there exist two strictly positive numbers y+(a) and y_(X) such that
lim 1 LogIISn(a) II = y, (X) n+±- I n I
If the sequence Yn(a) has an hyperbolic behavior, Osseledec's
theorem 1.1 insures the existence of two contractive vectors V +M and V (x)
203
LEMMA 6.1
Let m be a positive continuous measure and A a Borel subset
of IR, such that the sequence Yn(A) has an hyperbolic behavior m
a.e.
on A. Then the spectral measure o is orthogonal to m on A. Proof
:
Osseledec's theorem 1.1 implies existence of contractive
vectors V+(a) and V -(X) m a.e. on A. Such A are eigenvalues of H iff V+(a) and V -(X) are proportional b
n
.
from the hypothesis on the sequence
The measure m being continuous such countable set of A is m
negligible. This implies that each non zero solution of HY = aW is exponentially growing in at least one direction of 2Z for m almost all A
in A. Since for a almost A there exists a non zero solution of HW = aW which is slowly increasing, the conclusion follows.
LEMMA 6.2
Let A be a Borel subset of IR such that the sequence Yn(A)
has an hyperbolic behavior a
a.e. on A. Then the spectral measure a is
pure point on A. Proof
:
Osseledec's theorem 1.1 implies existence of contractive
vectors V+(a) and V -(X) a
a.e. on A. Moreover for a almost all A,
there exists a non zero slowly increasing function W solution of H' =a' and thus V -(X) and V+(a) have to be proportional a
a.e. on A. Hence,
on A, a is supported by the eigenvalues of H (if any).
REMARK 6.3
Lemmas 6.1 and 6.2 can hardly be directly applied to a
deterministic operator H. But we shall see in the sequel that in the "random case" and under broad conditions, for a given measure m, the hypothesis of 6.1 are satisfied for "almost all samples". Thus we could think that occurence of the hypothesis of 6.2 is highly improbable...
Nevertheless we shall see that, in the independent case, this hypothesis holds with probability one amazing fact in localization theory.
!
This is certainly the most
CHAPTER II
ERGODIC SCHRODINGER OPERATORS
II.1 Definition and examples
We now suppose that (a
bn), n E 2Z, is a stationary random process
n '
This means that the real random variables (an(w), bn(w)) are defined on some complete probability space (D, a , F) and that there exists an
invertible measurable transformation S of U leaving F invariant and In general we don't write the such that an+1 = an oS bn+l variable w and when a property depends only upon the common law of the o.e.
,
sequence (an,bn) we omit the index n and speak of the variables (a) and (b). We say that the family H(w) of associated operators on
A
2is
ergodic if a B invariant measurable subset of 0 is of zero or one F measure. It's easily seen that H o 6 = U 1 H U where U is the shift
operator on
2Z , (UT
)= T n-1, n-1'
The most important example in this book will be the "independent case", that is (an,bn) is an independent identically distributed
sequence of random variables with values in l2. Then (C,a,, F) is constructed as the 2Z power of a probability space (1R2, w = {(xn'yn)}, n E 27., we define 0(w) = ((x
For
and an(w) = xn,
n+1' yn+1)} bn(w) = yn. An other example is the "quasi periodic case" where (C,a.,F) is the one dimensional torus [0,211] endowed with the Lebesgue
probability, e is the "rotation" 6(w) = w + 2fl a with a an irrational
number. The sequences an = aegn, bn = b o0n are constructed from two
205
206
real continuous functions a and b on 0. A particular case is the "Mathieu operator" associated to the functions b(w) = 1, a(w) = 6 cosw where 6 is a real number. A degenerated example is given by the "periodic case" where (o,3.
1P)
is the finite set {0,1,2,
..
,
r-1} with
the uniform distribution. The "rotation" modulo r acts by 0(w) = w+l. More general examples of almost periodic or limit periodic cases are also of great interest [57]. Ergodicity does not require any disorder as shows the periodic case (we don't assume any mixing property) nevertheless that is a sufficient assumption to imply ]P a.e. properties
of the spectrum of H(w) as a set. If we want to obtain stronger properties, as absence of absolutely continuous part in the spectrum, some randomness is necessary (the spectrum is absolutely continuous in the periodic case). In some sense the Lyapunov exponent defined in the section 3 is a good indicator for the required disorder as it is shown in the section 5.
Throughout this chapter we assume that H(w) is an ergodic family of random operators.
11.2 General spectral properties
We assume in this section that the random variable (b) is strictly positive and bounded away from zero, then by 1.2.2 for ]Palmost all w the operator H(w) is self adjoint on the Hilbert space
dl-1)(w)
(note
that the scalar product is itself random). The shift operator U is a
unitary map from X-8 onto 'A thus H and H o0 are unitary equivalent.
For 4 , T in d0 o 6 we have the equality < (E6. 0 thus a n
o
LEMMA 2.1
T> = <E6 U¢ , UT> and
B = a n+1
Let A be a Borel subset of]R, m and n two integers, then the
function am,n(A) defined] a.e. on 0 is mesurable. Proof
:
By a monotone class argument it's enough to prove this property
for an open interval A = I
E. For such a A we have
((a-6
am,n(A) = lim lim
t+o 610
1
R J- W
m
cm,
n
e > dt
207
rb thank's to : lim 1 f e1o II
a
By virtue of 1.4.4 we obtain
A- ZZ
II
(am,n(a)+am,n(b))
:
em,en> = lim 1 f(A - (t+ie))
t+ie
am,n(]a,b[) +
m