This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!
9T. The action-angle variables for the second symplectic form are group version of the actionangle variables for the first symplectic structure. Theorem 2.4. Let 7 S be the poles on the spectral curve of the normalized eigenvector ip of the matrix function L £ C9Q . Then the two-form wgr defined by (20) is equal to g+r—l (il) induces a map acting on the curves in £), which in turn induces a map on thepprobability measures on the space of such curves, which we denote by the same letter. By a conformal map we understand a bijection which locally preserves angles, i.e. is analytic or anti-analytic function (so it might change the orientation). Moreover, if we start drawing the interface from the point a, we will be walking around the yellow cluster following the left-hand rule — see figure 1. If we stop at some point a' after drawing the part 7' of the interface, we cannot distinguish the boundary of 0 from the part of the interface we have drawn: they both are colored yellow on arc a'b and blue on arc ba' of the domain Q \ 7'. So we can say that the conditional law of the interface (conditioned on it starting as 7') is the same as the law in a new domain with a slit. We expect the limit law A to have the same property: (B) Markov-type property: The law conditioned on the interface already drawn is the same as the law in the slit domain: A(fi,a,6)|7' = A ( n \ 7 , , a , , 6 ) If one wants to utilize these properties to characterize A, by (A) it is sufficient to study some reference domain (to which all others can be conformally mapped), say the upper half-plane C+ with a curve running from 0 to 00. Given (A), the second property (B) is easily seen to be equivalent to the following: (B') Conformal Markov property: The law conditioned on the interface is a conformal image of the original law. Namely, if G = Gy is a conformal map from C+ \ 7' to C + preserving 00 and sending the tip of 7' to 0, then A (C+, 0,00) |V = G - 1 (A (C+, 0,00)). To use the property (B'), we describe the random curve by the Loewner evolution with a certain random driving force w(t) (we assume that the curve is a.s. an allowed slit). If we fix the time s, the property (B') with the slit 7[0,s] and the map G(z) = gs{z) — w(s) can be rewritten for random conformal map Gt+S conditioned on Gs (which is the same as conditioning on 7') as Gt+s\Gs = Gs (Gt) • Expanding G's near infinity we obtain z-w(t C, if (a) = « , 0 for some ( e H , ||£|| = 1. Elements a £ A, will be called noncommutative random variables. ((oi - C, fJ.a(P) = 0 B, B which is given by the orthogonal projection of L2{M,T) onto L'2(B,r). 4.9. The coalgebra structure and corepresentations for dx-.B are at the heart of some important analytic subordination results in free probability. Here is the simplest example. Let X = X*, Y = Y* be free in (M,T). Then there is an analytic function u : {Imz > 0} - » { I m z > 0} so that T((X ,0. Expand Wj(0,0,^,^) = X ] / dW(t) +udt), two solutions with different initial conditions will meet after some time. Unfortunately, no such characterisation of the strong Feller property is available in the non-linear case, the closest approximation to it being the coupling method described in the next section. However, if the range of the nonlinearity F is contained in the range of Q, the strong Feller property can in some cases be recovered by performing a Girsanov transform to eliminate the non-linearity [15]. In some cases, it is also possible to recover the strong Feller property by adapting Malliavin's proof of Hormander's theorem to the infinite-dimensional case, but to the knowledge of the author, only the particular case of a reaction-diffusion (V*)®9) when p and q are given integers, for the different kinds of matrix groups G. This gives us what we are looking for in a special case, namely when each representation a.i is of the form V®v ®{V*)®q. The two following results allow us to reduce the general case to this particular one. Lemma 5.1. Let G be any compact Lie group. Consider a = (a\,...,ar) and (3 = (/3i,... ,/3r) two r-tuples of representations of G. Assume that, for each i — l , . . . , r , the representation on is a subrepresentation of Pi. Let ipaj be a spin network on Gr. Then there exists J S Endc(/?i <8> • • • <8> /3r) such that ipa,i=ipp,jProposition 5.1. Let G be a compact Lie group. Let a be a faithful finite-dimensional representation of G. Then any irreducible representation of G is a subrepresentation of a®p (g) (a v )® 9 for some integers p,q > 0. In this statement, a v denotes the contragredient representation of a. We use the convention a® 0 = C, the trivial representation. For matrix groups, Proposition 5.1 ensures that every irreducible representation arises as a subrepresentation of some tensor product of a number of copies of the natural representation and its contragredient, that is: any matrix coefficient of a representation of a matrix group is a polynomial function in the entries of the matrix and their complex conjugates. We are now reduced to prove the following result. Proposition 5.2. LetG be a group of the following list: 0(n), SO(n), U(n), SU(n), Sp(n). Let r > 1 be an integer. Let a be a r-tuple of representations of the form V®p <8> (V*)®q, where V is the natural representation of G. Then any spin network tpaj on Gr is a linear combination of products of Wilson loops. (q{0))DqDp" 0. {x) •••))*. 'M £=0+ „ = (Gj1 (• • • (x) •••)Gf)a 6 Diff(a). (17) The gauge and spatially diffeomorphism invariant scalar product is given by ( W n W ^ ' ) > D i f f := [r/ Diff (F')](F). l{x) = e3/2h(ex)eiS^x)/£ where h,S € S(R3). )(l-* w )) 0, 0(v) = 0 for v < 0 and ±i = e ± 7 r i / 2 . Applying to the integral (11) the stationary phase method and taking into account identity (13), we find that u(r, z, t) = e ^ W ' ^ ^ , 0 (25) (for all t). Indeed, it follows from the Newton equation r"(t) = —2V'(r(t)) consequence of (16)) and expression (17) that r"(t) = 4 M 2 r - 3 ( t ) + 4eA'(r(t)) (P -
9r
tj =
J^ 5\nk(ls)A6z(ls).
(22)
s= l
Theorems 2.1 and 2.3 provide a framework for the existence of so-called bi-Hamiltonian structures. It was first observed by Magri that the KdV hierarchy possesses a bi-Hamiltonian structure, in the sense that all the flows of the hierarchy are Hamiltonian with respect to two different symplectic structures. If Hn is the Hamiltonian generating the n-th flow of the KdV hierarchy with respect to the first Gardner-Zakharov-Faddeev symplectic form, then the same flow is generated by the Hamiltonian Hn-\ with respect to the second LenardMagri symplectic form. Periodic chains. The two symplectic structures w and uigr are equally good in the case of a single matrix function L(z), but there is a marked difference between them when periodic chains of operators are considered (see details in [13]). Let Ln(z) € C{D) be a periodic
Algebraic versus Liouville integrability of the soliton systems chain of matrix-valued functions with a pole divisor D, Ln = Ln+x. chains is C(D)®N. The monodromy matrix Tn(z) = Ln+N-.i(z)
Ln+N-2{z)
57
The total space of such
• • • Ln(z)
(23)
is a meromorphic matrix function with poles of order Nhm at the puncture zm, i.e. Tn(z) e C{ND). For different n they are conjugated to each other. Thus the map C{D)®N .—* C(ND)/SLr
(24)
is well-defined. However, the natural attempt to obtain a symplectic structure on the space £(D)®N by pulling back the first symplectic form w on C(ND) runs immediately into obstacles. The main obstacle is that the form UJ is -only well-defined on the symplectic leaves of C{ND) consisting of matrices with fixed singular parts for the eigenvalues at the punctures. These constraints are non-local, and cannot be described in terms of constraints for each matrix Ln(z) separately. On the other hand, the second symplectic form w 9r has essentially the desired local property. Indeed, let Ln be a chain of matrices such that Ln G £(D,D_). Then the monodromy matrix defines a map f:£{D,D-)®N
^C(ND,ND^)/SLr.
The group SL^f of .^-independent matrices gn € SLr,gn
(25)
— 9n+N
actson£(D,£>_)® J V by
the gauge transformation Ln -> gn+iLng'1 (26) which is compatible with the monodromy matrix map (25). Let the space 'Pchain be defined as the corresponding quotient space of a preimage under T of a symplectic leaf CBQ C £(ND,ND-)/SLr Pchain = ( f - 1 ( P 1 C o ) ) / 5 L ^ v . (27) The dimension of this space is equal to dimPchain = N(degD)r(r
— 1) — 2r + 2.
Theorem 2.5. The pull-back by T of the second symplectic form wchain = T*{u>gT), restricted to T _ 1 (£Q>)> *S 9au9e invariant and descends to a symplectic form on Pchain- It can also be expressed by the local expression N
1 Wchain = 2 5 Z
ReS
*«
51 ^ n=l
(^nllSLn(z)
A « * „ ( * ) ) dz,
(28)
where *„+i = i „ $ n , ^n+N — ^nK, ^ = diag(fcj)5lJ. All the coefficients of the characteristic polynomial of T(z) are in involution with respect to this symplectic form. The number of independent integrals equals dimP c hain/2.
3. Periodic chains on algebraic curves The Riemann-Roch theorem implies that naive generalization of equations (1, 2) for matrix functions, which are meromorphic on an algebraic curve T of genus g > 0, leads to an overdetermined system of equations. Indeed, the dimension of r X r matrix functions of fixed
58
IGOR KRICHEVER
degree d divisor of poles in general position is r2(d — g + 1). If the divisors of L and M have degrees n and m, then the commutator [L, M] is of degree n + m. Thus the number of equations r2(n + m — g + 1) exceeds the number r2(n + m — 2g + 1) of unknown functions modulo gauge equivalence (see details in [5]). There are two ways to overcome this difficulty in defining zero curvature equations on algebraic curves. The first way is based on a choice of L with essential singularity at some point or with entries as sections of some bundle over the curve. The second way, based on a theory of high rank solutions of the Kadomtsev-Petviashvili equation, was discovered in [9]. There it was shown that if in addition to fixed poles the matrix functions L and M have rg moving poles of a special form, then the Lax equation is a well-defined system on the space of singular parts of L and M at fixed poles. In [9] it was found, that if the matrix functions L and M have moving poles with special dependence on x and t besides fixed poles, then equation (1) is a well-defined system on the space of singular parts of L and M at fixed poles. A theory of the corresponding systems was developed in [5]. In is instructive to present its discrete analog, that a theory of the discrete curvature equations (2) with the spectral parameter an a smooth algebraic curve. We begin by describing a suitable space of such functions Ln. Let T be a smooth genus g algebraic curve. According to [11], a generic stable, rank r and degree rg holomorphic vector bundle V on T is parameterized by a set of rg distinct points 7 S on T, and a set of r-dimensional vectors as = (als), considered modulo scalar factor as —> \sas and a common gauge transformation a* —»
J GLr.
In [9,12] the data (7,a) = ( 7 s , a s ) , s = l,...,rg, i = l , . . . , r , were called the Tyurin parameters. Let D± be two effective divisors on V of the same degree T>, Below, if it is not stated otherwise, it is assumed that all the points of the divisors D± = J2k=i ^ n a v e multiplicity 1, Pj^~ =fi P*i, k / m. For any sequence of the Tyurin parameters (7(n), a(n)) we introduce the space £ 7 ( n ) >a (n)(D+,D-) of meromorphic matrix functions Ln(q),q € T, such that: 1°. Ln is holomorphic except at the points 7 S , and at the points P^ of D+, where it has at most simple poles; 2°. the singular coefficient Ls(n) of the Laurent expansion of Ln at 7S Ln{z)==±M+0(l),
za = z{ya),
(29)
z — zs is a rank 1 matrix of the form La(n) = Ps(n)aJ(n)
<—> L^(n) =
fi(n)<4(n),
(30)
where /3s(n) is a vector, and z is a local coordinate in the neighborhood of 7S; 3°. the vector af(n + 1) is a left null-vector of the evaluation of Ln at 7s(^ + 1), i-e. a , ( n + l ) L n ( 7 , ( n + l ) ) = 0;
(31)
4°. the determinant of Ln(q) has simple poles at the points p£~,*ya(ri), and simple zeros at the points Pj7,7s(n + 1).
Algebraic versus Liouville integrability of the soliton systems
59
The last condition implies the following constraint for the equivalence classes of the divisors
[D+] - [D-] = J2 Mn
+ !) - 7-(n)] G JF),
(32)
s
where J(T) is the Jacobian of I\ If 2N > g(r + 1), then the Riemann-Roch theorem and simple counting of the number of the constraints (29)-(31) imply that the functional dimension of £ 7 ( n ) i a ( n )(D + ,£)_) (its dimension as the space of functions of the discrete variable n) equals 2V(r — 1) — gr2 + g + r2. The geometric interpretation of £ 7 ( n ) ) Q ( n )(D + ,D_) is as follows. In the neighborhood of js the space of local sections of the vector bundle V1^a, corresponding to (7, a), is the space Ta of meromorphic functions having a simple pole at ys of the form
/(<>= / ^ L + O C 1 ) .
A S £C.
(33)
z - z(7 s ) Therefore, if Vn is a sequence of the vector bundles on T, corresponding to the sequence of the Tyurin parameters (7(71), a(n)), then the equivalence class [Ln] of Ln modulo gauge transformation (26) can be seen as a homomorphism of the vector bundle Vn+\ to the vector bundle Vn(D+), obtained from Vn with the help of simple Hecke modification at the punctures P^", i.e. [L n ]eHom(V n + i,V n (Z? + )). (34) These homomorphisms are invertible almost everywhere. The inverse matrix-functions define the homomorphisms of the vector bundles [L-^GHomO'n.Vn+i (£>_)).
(35)
The total space £JV(-D+, DJ)\ of the chains, corresponding to all the sequences of the Tyurin parameters, is a bundle over the space of sequences of holomorphic vector bundles £(D+,D-)
^{V„}.
(36)
The fibers of this bundle are just the spaces £ 7 ( n ) )Q ,(„)(D + , £>_). Our next goal is to show algebraic integrability of the total space CN(D+, D_) of the 7Vperiodic chains, Ln — Ln+N (see details in [6]). Equation (32) implies that the periodicity of chains requires the following constraint on the equivalence classes of the divisors D±: N([D+]-[D_])
= 0GJ(T),
which will be always assumed below. The dimension of dimCN{D+,
£N(D+,D-)
£>_) = 2NV{r - 1) + Nr2 + g.
(37) equals (38)
E x a m p l e . Consider the case of 1-periodic chains, i.e. the stationary case Ln — L. Let D+ = K. be the zero-divisor of a holomorphic differential dz, and let CK be a union of the spaces Ci(lC, £>_). Then the factor-space CK/SLr is isomorphic to a phase space of the Hitchin system that is the cotangent bundle T*{M) to the moduli space of rank r stable vector ([5]).
60
Let Ln G equation
IGOR KRICHEVER
£N(D+,D-)
be a periodic chain. Then the Floque-Bloch solutions of the V>„+i = Lnipn
(39)
are solutions that are eigenfunctions for the monodromy operator Tn1pn = i>n+N = Wt/jn,
Tn = Ln+N^i
•• •L„+1L„.
(40)
The monodromy matrix Tn(q) belongs to the space of the Lax matrices introduced in [5], Tn G CND+, T~l G £ND~. The Floque-Bloch solutions are parameterized by the points Q = (w, q), q € r , of the spectral curve T defined by the characteristic equation r-l
R(w, q) = det (w • 1 - TnQ(q)) = wr + ^ r ^ w * = 0.
(41)
i=0
The coefficients ri(q) of the characteristic equation are meromorphic functions on T with the poles at the punctures Pj!~. Equation (7) defines an afhne part of the spectral curve. Let us consider its compactification over the punctures Pjf. As shown in [6], in the neighborhood of P£ one of the roots of the characteristic equation has the form w = (z-
z{P+))~N
(c+ + 0(z - z(P+))) .
(42)
The corresponding compactification point of T is smooth, and will be denoted by P£. In the general position all the other branches of w(z) are regular at P£. The coefficients ri{z) are the elementary symmetric polynomials of the branches of w(z). Hence, all of them have poles at P£ of order N. Note that the coefficient ro(z) — detT„ 0 has zero of order N at The same arguments applied to Ln x show that over the puncture Pk there is one point of f denoted by P^ in the neighborhood of which w has zero of order N, i.e., w = {z-z
(P^))N
(c~ + 0(z - Z(P-)))
.
(43)
Let us fix a normalization of the Floque-Bloch solution by the condition that the sum of coordinates ipQ of the vector ipo equals 1, Y%=i ^o = 1- Then, the corresponding FloqueBloch solution ipn(Q) is well-defined for each point Q of T. Theorem 3.1. The vector-function ipn(Q) is a meromorphic vector-function on T, such that: (i) outside the punctures P^ (which are the points of T situated on marked sheets over Pj!r) the divisor 7 of its poles % is n-independent; (ii) at the punctures P£ and P^ the vector-function ipn(Q) has poles and zeros of the order n, respectively; (Hi) in the general position, when T is smooth, the number of these poles equals g + r — 1, where g = NV(r-l)+r{g-l)
+l
(44)
is the genus ofY. Let SD+'D- be the space of the spectral curves, which can be seen as a space of the meromorphic functions r^z) on T with poles of order N at the punctures P£, and such that
Algebraic versus Liouville integrability of the soliton systems
61
r 0 has zeros of order N at the punctures Pk . The Riemann-Roch theorem implies that gD+,D- j s 0 £ dimension dim5 D +" D - = NV{r -l)-(g-
l)(r - 1) + 1.
(45)
The characteristic equation (41) defines a map CN{D+,D-) —> SD+'D-. Usual arguments show that this map on an open set is surjective. These arguments are based on solution of the inverse spectral problem, which reconstruct Ln, modulo gauge equivalence (26) from a generic set of spectral data: a smooth curve T defined by {r;} G SD+'D-, and a point of the Jacobian J(F). Theorem 3.2. The map described above Ln —> {T, 7} descends to a bijective correspondence of open sets CN(D+,D.) I GL? ^ { f e SD+-D~, [7] G J(T)}. (46) Restricted chains. Let us introduce subspaces CN+Q A~ C CN(D+,D-) with fixed equivalence classes of the divisors of Tyurin parameters
of the Lax chains
[ 7 („)] =C7 + n ([£>+]-[£>_]) e J ( r ) ,
(47)
and with fixed determinant detT = A = ro(q)- The subspace of the corresponding spectral curves will be denoted by S& G SD+'D~. The points of 5 A are sets of functions ri(q), i = 1 , . . . , r — 1, with the poles of order N at P£. For the restricted chains the equivalence class [7] G J ( f ) of the poles of the Floque-Bloch solutions belongs to the abelian subvariety Jc{?)=K1(C
+ N(r-l)[D+]/2),
IT, : J(f) — • J(T).
(48)
Corollary 3.1. The correspondence £^DA
I GLNT <-» { f G SA, [7] G Jc(t)}
(49)
is one-to-one on the open sets. Lax equations. In order to treat the zero-curvature equations (2) as a dynamical system on the space of chains, it is necessary to solve first a part of the equations and define Mn in terms of Lm. Unlike the stationary case considered above that can not be done, if Mn has fixed poles outside the punctures P^ . The singular parts of suitable matrix functions Mn at these points can be constructed locally in a way identical to the theory of discrete zero-curvature equations with the rational spectral parameter. Note that detL„ has simple pole at P£. Therefore, the residue of Ln is a matrix of rank 1, and can be written in the form /ifc(n)p^(n), where hk(n) and Pk{n) are r-dimensional vectors Lemma 3.1. Let Ln be a formal series of the form 00
Ln = /i(n)p(n) T A- 1 + £ ^(nJA* i=0
(50)
62
IGOR KRICHEVER
where h(ri),p(n) are vectors and Xi(n)
are
matrices. Then the equations
n, 4>n+lLn = 4>*n,
(51)
where
*; = ££(n)A\
(52)
The equations (51) have unique formal solutions of the form
such that ( $ > „ ) = « $ n ) = 0,
(<#>„) = 1,
(54)
and normalized by the conditions i
Xo(0)=g(-l),
5^x^(0) = 0,
i>0.
(55)
For the proof of the lemma it is enough to substitute the formal series (52) or (53) in (51) and use recurrent relations for the coefficients of the Laurent series. Let us fix a point Po on T and local coordinates in the neighborhoods of the punctures P£. Then the Laurent expansion of Ln at the punctures defines with the help of the previous Lemma the formal series 4>n ,
C = A"ffX(n)AM, # - = A - n f £ c - ( " ) A
(56)
From the Riemann-Roch theorem (see details in [5]) it follows that there is a unique matrix function M„ ' such that: (i) at the points 7 s (n) it has simple poles of the form: ft(n)Q (
Mn =
; f+Q(l),
Z
z.(n) = z(lt(n)),
(57)
Zs\Tl)
where /xs(n) is a vector; (ii) outside of the divisor 7 it has pole at the point P * , only, where M ^
= (z - z{P±)Yl
(58)
(iii) MA ' is normalized by the condition MA ' (-Po) = 0. Note, that although <£„ and
Algebraic versus Liouville integrability of the soliton systems
63
Theorem 3.3. (i) The equations daLn = M£+1Ln
- LnM«,
da = d/dta,
a = (±k,l),
(59)
define a hierarchy of commuting flows on an open set of£jy(D+, -D-), which descends to the commuting hierarchy on an open set of £N(D+,D-) j GL^. (ii) The flows (59) are linearized by the spectral transform and can be explicitly solved in terms of the Riemann theta functions. In general the flows (59) do not preserve the leaves of the foliation CN+c A~ C CN{D+,D-). The linear combinations of basic flows which preserve the subspaces of the restricted chains are constructed as follows. Let / be a meromorphic function on T with poles only at the punctures P^. Then we define 6
K^E^"-
( °)
a
where c£ are the coefficients of the singular part of the Laurent expansion
f = Hi±k,i)i.z-^Pt)Tl
+O0).
(61)
df = d/Otf,
(62)
!>0
Theorem 3.4. The equations 8fLn
= M^+1Ln
- LnMl,
define a hierarchy of commuting flows on an open set of CN+£ A~, which descends to the commuting hierarchy on an open set of £N+c A~ / GL^f. Hamiltonian approach. At first glance the construction of the Hamiltonian theory for the periodic chains goes equally well on an arbitrary genus algebraic curve. The two-form Q(z) on £N(D+,D-) with values in the space of meromorphic functions on T by the formula identical to that in the genus zero case. JV-l
il(z) = J 3 Tr {V-l+l5Ln A 8^n) .
(63)
n=0
Let us fix a meromorphic differential dz on T with poles at a set of points qm. Then the formula w = --^vesgttdz, 1= {js,Pjt,qm}, (64) \ei defines a scalar-valued two-form on CN(D+,D-). This form depends on a choice of the normalization of \I>n. A change of the normalization corresponds to the transformation ^i'n = tynV, where V = V(z) is a diagonal matrix, which might depend on a point z of T. The corresponding transformation of Q has the form: n' = n + S(Tr(lnWv)),
v = SVV'1.
(65)
Let XD+,D- be asubspace of the chains £N(D+,D-) such, that the restriction of 5(\nw)dz to XD+'D- is a differential holomorphic at all the preimages on t of the punctures Pk .
64
IGOR KRICHEVER
T h e o r e m 3.5. (i) The two-form w, defined by (64) and restricted to XD+'D-, is independent of the choice of normalization of the Floque-Bloch solutions, and is gauge invariant, i. e. it descends to a form onV = XD+*D- j GL^. (ii) Let j s be the poles of the normalized Floque-Bloch solution tpn. Then g+r—l
w=
J2
Slnw(%)ASz(%).
(66)
By definition, a vector field dt on a symplectic manifold is Hamiltonian, if the contraction id,Lo(X) = uj(dt, X) of the symplectic form is an exact one-form dH{X). The function H is the Hamiltonian corresponding to the vector field dt. The proof of the following theorem is almost identical to the proof of Theorem 4.2 in [5]. Theorem 3.6. Let da be the vector fields corresponding to the Lax equations (59). the contraction of w, defined by (64) and restricted to V, equals i9auj = SHa,
Then
(67)
where H(±k,i) = res p ± (z-z
(P*))~
(lnw) dz.
(68)
The theorem implies that Lax equations (59) are Hamiltonian whenever the form w is nondegenerate. The spectral map (46) identifies an open set of £N+Q A~ / GL^ with an open set of the Jacobian bundle over <SA C SD+'D-, i.e. £%!&-/GL?
^
SA.
(69)
The fibers of this bundle are isotropic subspaces for u. Therefore, the form w can be nondegenerate only if the base and the fibers of the bundle (69), restricted to V, have the same dimension. Consider the case g > 0. Let dz be a holomorphic differential on T. Then, for each branch of w = Wi(z) the differential 5(\nw)dz is always holomorphic at P * . Hence, V = CN{D+,D-)/GL?. Recall that dimSA
= (r-l)(NV-g
+ l),
dimJ(f)
= (r-l)(NV
+ g-l)+g.
(70)
Therefore, for g > 0 the form w is degenerate on V. For 3 = 1 the space V is a Poisson manifold with the symplectic leaves, which are factor-spaces ?*=£%}&
/GL?
(71)
of the restricted chains. In that case dim S& = dim J c ( r ) = NV(r — 1). As in the genus zero case, the arguments identical to that used at the end of Section 4 in [5] prove that the form OJ is non-degenerate on V*.
Algebraic versus Liouville integrability of the soliton systems
65
Corollary 3.2. For g = 0 and g = 1 the form OJ defined by (64) descends to the symplectic form on V*, which coincides with the pull-back of the Sklyanin symplectic structure restricted to the space of the monodromy operators. The Lax equations (62) are Hamiltonian with the Hamiltonians Hf = ] T xesq(lnw)fdz. (72) The Hamiltonians Hf are in involution {Hf,Hh}
= 0.
Now we are in the position to discuss the case g > 1 mentioned in the Introduction. The space V* is equipped by g-parametric family of two-forms ivdz, parameterized by the holomorphic differentials dz on T. For all of them the fibers J c ( r ) of the spectral bundle are maximum isotropic subspaces. For each vector-field df defined by (62) the equation hf^dz = 5HfZ holds. Equation (10) implies that each of the forms u>dz is degenerate on V*. Let us describe the kernel of w<jz. According to Theorem 3.2, the tangent vectors to J c ( f ) are parameterized by the space A(T,P±) of meromorphic functions / on T with the poles at P^ modulo the following equivalence relation. The function / is equivalent to / i , if there is a meromorphic function F e A(t, P*) on T with the poles at Pj^, such that in the neighborhoods of these punctures the function ir*(f — f\) — F is regular. Let Kdz C A(T,P±) be the subspace of functions such that there is a meromorphic function F on T with poles at P^ and at the preimages n*(qs), dz(qs) = 0 of the zero-divisor of dz, and such that / — F is regular at Pj^. Then, from equations (67) and (72) it follows that: / £ Kdz '—• id,^dz = 0. Let Kdz be the factor-space of Kdz modulo the equivalence relation. Then the Riemann gap theorem implies that in the general position Kdz is of dimension 2(g — l)(r — 1), which equals the dimension of the kernel of u>dz- Therefore, the kernel of u>dz is isomorphic to K-dz- Using this isomorphism, it is easy to show that the intersection of all the kernels of the forms Udz is empty, and thus the family of these forms is non-degenerate.
4. Variable base curves Until now it has been always assumed that the base curve is fixed. Let MA he the space of smooth genus g algebraic curves T with the fixed meromorphic function A, having poles and zeros of order TV at punctures Pjr, k = 1 , . . . ,V. For simplicity, we will assume that the punctures P^ are distinct. The space M.& is of dimension dim At A = 2(D + g — 1). The total space £AT,A of all the restricted chains corresponding to these data and the trivial equivalence class C = 0 £ J(T) can be regarded as the bundle over M.& with the fibers v CN+Q A~ = J C J / O ' A - ^ ) - ^ definition, the curve T corresponding to a point (I\ A) 6 At A is equipped by the meromorphic differential dz = din A. The function A defines local coordinate everywhere on T except at zeros of its differential. Let WA be the form defined by (64), where dz = din A and the variations of Ln and \t n are taken with fixed A, i.e. 1
N 1
WA = - 2 E
res
q€l
~
* E ^ (*;rJi(A)*Ln(A) AtfMA)) din A, / = { 7 . , ^ } .
(73)
n=0
Then WA is well-defined on leaves XA of the foliation on £JV,A defined by the condition: the differential <51nw(A)dlnA restricted to XA is holomorphic at the punctures Pk . This
66
IGOR KRICHEVER
condition is equivalent to the following constraints. In the neighborhood of P fc there are (r — 1) regular branches wi of the multi-valued function w, defined by the characteristic equation (41): wf = c f + 0 ( A = F l ) , *= l,...,r-l. (74) T h e leaves X& are denned by 22?(r — 1) constraints: Scf = 0 i—• cf = c o n s t f .
(75)
Note t h a t the differential 5 In w(A)din A is regular at P * for the singular branches of w, because the coefficients c^ of the expansions (42) and (43) are also fixed due to the equation T h e factor-space P A = XA/GL™ is of dimension d i m P A = 2NV(r - 1) - 2V{r - 1) + 2(2? + g — 1). T h e space <SA C S'A of t h e corresponding spectral curves is of dimension dim<S A = (r - 1)(JVP -g + 1)- 2V(r - 1) + 2(2? + g - 1). T h e second and the third t e r m s in the last formulae are equal t o the number of the constraints (75) and the dimension of MA, respectively. For the case r = 2 the last formulae imply the match of t h e dimensions dim P A = 2 dim <SA. For r = 2 the spectral curves are two-sheet cover of the base curves, and the fiber of the spectral bundle is the P r i m variety JQ(F) = Jprim(r). T h e o r e m 4 . 1 . For r = 2 the form w ^ defined by (73), and restricted to P A is nondegenerate. Ifjs are the poles of the normalized Floque-Bloch solution ipn, then g+i—1
WA =
Yl
g+i—1 5lnw
(ls)
A(51nA(7 s ) =
S= l
^
Shiw(js)
A 6h\w(j°),
(76)
3= 1
where a : F —* T is the involution, which permutes the sheets of F over F. For every function f £ , 4 ( r , P * ) the Lax equations (62) are Hamiltonian with the Hamiltonians Hj = 2_\ res g ( / I n w) d i n A.
The Hamiltonians spectral map.
Hf are in involution.
Their common
level sets are fibers J p r i m ( r ) of the
References 1. I. Krichever, D. H. Phong, "On the integrable geometry of N = 2 supersymmetric gauge theories and soliton equations", J. Differential Geometry 45, 445-485 (1997); arXiv:hep-th/9604199. 2. I. Krichever, D. H. Phong, "Symplectic forms in the theory of solitons", in Surveys in Differential Geometry I V , edited by C.L. Terng and K. Uhlenbeck, International Press, 1998, pp. 239-313; arXiv:hep-th/9708170. 3. L. D. Faddeev, L. A. Takhtadjan, Hamiltonian methods in the theory of solitons, SpringerVerlag, Berlin, 1987. 4. A. G. Reiman, M. A. Semenov-Tian-Shansky, "Integrable systems", chapter 2 in Dynamical Systems VII, Encyclopaedia of Mathematical Sciences vol. 16, V. I. Arnold and S. P. Novikov, eds., Springer-Verlag, Berlin, 1994.
Algebraic versus Liouville integrability of the soliton systems
67
5. I. Krichever, "Vector bundles and Lax equations on algebraic curves", Comm. Math. Physics 229, 229-269 (2002); arXiv:hep-th/0108110. 6. I. Krichever, "Integrable chains on algebraic curves", arXiv:hep—th/0309255 . 7. N. Hitchin, "Stable bundles and integrable systems", Duke Math. Journ. 54, 91-114 (1987). 8. V. Zakharov, A. Shabat, "Integration of non-linear equations of mathematical physics by the method of the inverse scattering problem. II", Funct. Anal, and Appl. 13, 13-22 (1979). 9. I. M. Krichever, S. P. Novikov, "Holomorphic bundles over algebraic curves and non-linear equations", Uspekhi Mat. Nauk 35, 47-68 (1980). 10. B. A.Dubrovin, I. M. Krichever, S. P. Novikov, "Integrable systems", in Dynamical Systems IV, Encyc. Math. Sciences, Springer-Verlag, Berlin, 1990. 11. A. Tyurin, "Classification of vector bundles over an algebraic curve of arbitrary genus", Amer. Math. Soc. Translat., II, Ser. 63, 245-279.(1967). 12. I. M. Krichever, "The commutative rings of ordinary differential operators", Funk. anal, i pril. 12, 20-31, (1978). 13. E. D'Hoker, I. Krichever, D. H. Phong, "Seiberg-Witten theory, symplectic forms and Hamiltonian theory of solitons", arXiv:hep-th/0212313 .
Mathematical quasicrystals: a tale of two topologies ROBERT V. MOODY (U. Alberta)
In quasicrystals, both mathematical and physical, there is an interplay of two notions of order: a local order and a long-range order manifested through diffraction. These two types of order lead to two very different notions of closeness, the local and autocorrelation topologies, which are in principle unrelated. Given a discrete point set in Rd, each one leads to a dynamical hull. It is exactly the context of the cut and project formalism that marries these concepts. In particular we give a new characterization of regular model sets in terms of a mapping relating the two dynamical hulls.
1. Introduction The distinguishing feature of physical crystals and quasicrystals is their point-like diffraction. This phenomenon is a measure of their internal long-range order. It is particularly remarkable for quasicrystals in as much as they do not seem to be based on repetition of a fundamental cell. By contrast, the physical properties and processes of formation of quasicrystals seem to be largely dependent on their local structure and one can suppose that similar local environments must result in similar physics. These two forms of order — long-range and local — can be put into a setting in which they are seen as a manifestation of two very different topologies. In quasicrystals, physical and mathematical, they are married in a remarkable way. This paper is about these two topologies and what is means for them to live in harmonious co-existence. From the very beginning of quasicrystals, it was observed that indexing the Bragg peaks required more than the expected number of parameters: for example, in the icosahedral cases, six rather than the familiar three for crystals. This suggested that the atomic positions of these substances might be modeled by projecting from a lattice lying in a higher dimensional space. Thus arose the cut and project formalism, which has been a mainstay in attempts to understand and model quasicrystals. The very same mechanism has been seen to be an alternative way to describe many of the well-known tiling models and has played an important role in the development of the mathematics of diffraction in aperiodic sets. In all these cut and project systems two things happen: the two topologies are in coexistence and the resulting structures satisfy the Meyer condition. a In this paper we will see that in fact these two properties characterize the cut and project formalism. In more detail, beginning with a discrete subset A in real Euclidean space M.d we construct two dynamical hulls: X(vl) and A(/l), derived from the local and long-range topologies respectively, each of which carries an R d action. The connection between them, when it exists, is the existence of a continuous mapping f3 : X(yl) —> A(A). In the case that A is Meyer, this mapping exists and is 1-1 almost everywhere if and only if A is a regular model a
I n the case of modulated model sets, the Meyer property may be destroyed.
68
M a t h e m a t i c a l quasicrystals: a tale of two topologies
69
set (see Propositions 4.2, 6.1 for the precise statements). In this case A is also pure point diffractive. Perhaps the most remarkable point about this is the intricacy of the almost 1-1-ness. Except for crystals — in which case (3 is an isomorphism — the set of points at which the map is not 1-1 is dense in A(A).
2. Uniformly discrete sets and two topologies We start by assembling the basic definitions for discrete point sets. We work in Rd with the standard metric. b For c € Rd, we let Br(c) be the ball of radius r about c, with Br := Br(0). Definition 2.1. A subset A C Rd is locally finite if its intersection with every ball Br(c) is finite. Let r > 0. A subset A C Rd is called r-uniformly discrete if all the sets x + Br, x £ A, are mutually disjoint. We will denote by D r the set of all r-uniformly discrete subsets of Rd. This space admits an action by Rd — namely the translation action. For A G D r and x G Rd we denote the x-translate of A by x + A. A locally finite subset A C Rd has finite local complexity if for all R > 0, as c runs over Rd, there are, up to translation, only finitely many sets of the form A fl BR(c). A subset of A G D r is called Delone if it is relatively dense: that is, there is an R > 0 so that for all c e R ^ y l n BR(c) =£ 0. The uniform discreteness is the hard-shell condition for atoms, which is very reasonable. It implies local finiteness. Finite local complexity is also natural enough, though one should note that it imposes restrictions on orientational order. Finite local complexity can also be characterized by the property that A—A
is locally
finite.
(1)
A strong form of this, which we will need and discuss later, is the Meyer condition: Definition 2.2. A C Rd satisfies the Meyer condition if A—A
is uniformly discrete (for some r > 0).
(2)
It is a Meyer set if in addition it is relatively dense. We now come to the two notions of closeness of point sets that are the fundamental concepts of the paper. We will define two topologies, each defined by means of a uniformity — a description of the sets of point set pairs that are to be considered close to one another. Definition 2.3. For each pair (s,BR(c)), define
consisting of a positive real number and a ball,
U(s, BR{c)) := {(A, A') e D r x D r : (v + A) n BR(c) = A' n BR(c), b
for some v e Bs}.
(3)
Almost everything in this paper carries over to the setting of separable locally compact Abelian groups with minor changes.
70
ROBERT V. MOODY
These sets form a fundamental system for a uniform structure on D r whose topology has the sets U(s,BR(c))[A] := {A1 € D r : (A, A') € U(s,BR(c))} (4) as a neighbourhood basis of A. We call it the local topology on D r . This uniformity can also be described by a metric, though it is not particularly natural. The intuition is that two sets are close if after a small shift they agree on a large ball. One should note that the uniformity is certainly invariant under translations, but the individual entourages U(s, BR(c)) are not. The second notion of closeness is based on average matching of the two sets. Definition 2.4. Let A, A' € D r . MA MS r #{(A A A') C) Br) d(A, A') := hm sup —^ —-L '- , r—>oo
._. (5)
VO\.\JtSr)
where A is the symmetric difference operator. It is rather easy to see that d is a pseudometric on D r . We obtain a metric by defining the equivalence relation A = A' ^d{A,A')
=0
and factoring d through it: D r = := D r / =
and d : D r = x D r = —> R> 0
(6)
and note that the translation action of Rd is retained on D r = . We call the resulting topology on D r the coarse autocorrelation topology and denoted by j3 the natural mapping (which is an Remapping): /? : D r —+ D r = .
(7)
Proposition 2.1. i) D r is a complete metric space in the local topology [15,18]. ii) D r = is a complete space and the Hausdorff completion o / D r in the coarse autocorrelation topology [13]. The first of these statements is quite intuitive. The second is trickier since it is hard to see how to build up a point set from a Cauchy sequence of sets in which the matching is improving only in density and not in particular patches of space. A simple example shows why we should not expect (3 to be particularly nice with respect to the local and autocorrelation topologies. E x a m p l e 2 . 1 . Let An := Z\{—n, -n + 1 , . . . ,n}. Then {An} converges to 0 in the local topology and to Z in the autocorrelation topology. If instead An := ([—n, n ] n Z ) U { . . . , — n — 4, —n —2}U{n + 2,n + 4,...} then convergence is to Z in the local topology and the sequence fails to converge in the autocorrelation topology.
Mathematical quasicrystals: a tale of two topologies
71
The coarse autocorrelation topology needs to be modified to put it on an equal footing with the local topology. The local topology is obtained as large coincidence after a small shift. As it stands A, A' can be close in the coarse autocorrelation topology only if they have a large statistical coincidence — no small shift involved. We introduce the autocorrelation topology by defining a uniform structure on D r with the fundamental system of entourages U(s, e) := {(A, A') e D r x D r : d(v + A, A') < e for some v £ Bs} . Proposition 2.2. D r is complete in the autocorrelation topology.
3.
Dynamical hulls
In the dynamics of a physical system one considers a (compact) configuration space or phase space and the time evolution of points of this space. Of special importance is the notion of recurrence, in the sense of the time-orbit of a point returning very closely to the initial state. In the geometry of point sets and tilings a parallel construction on points sets is possible, using the translation action of the underlying space as the group action, instead of time. This provides a powerful way of understanding the internal geometry of a given point set. We begin with D r and its translation action by Rd, take a single point set of interest, A £ D r , and then construct the closure of its orbit under translation: Rd + A. Our two topologies provide two separate interpretations of the orbit closure of A e D r : X(A) = Rd + A in the local topology, A(A) = Rd + A in the autocorrelation topology, which are the local and autocorrelation hulls respectively. Local hulls have been used extensively in tilings and the study of aperiodic sets. The autocorrelation hull is the subject of [13]. It is the interplay between these two that we wish to understand. It is useful to note that both X(A) and A(A) can be interpreted as completions of our starting space Rd. In each case one simply provides Rd with a new topology by pulling back of the uniform topology on D r to Rd using the mapping x t-> x + A. In both cases, points that are usually close in Rd remain so. But now also x, x' e Rd are close if (x + A,x' + A) are close in the local (resp. autocorrelation) topology. It is interesting that A(A) has the structure of an Abelian group, a fact that arises from the group structure of Rd and the translation invariance of the autocorrelation entourages. In particular it has a unique probability measure 9A.. In general X(A) does not get a group structure in this way. We need to know what the basic open sets for this new autocorrelation topology of Rd look like. It suffices to consider open sets around 0, and for these a basis consists of the sets U'(s,e)[0) =Bs xPe where P e := {t G Rd : d{t + A,A)<e}. The elements of Pe are called the statistical e-almost periods of A.
(8)
72
ROBERT V. MOODY
Proposition 3.1. Let A G D r . i) X(yl) is compact if and only if A has finite local complexity, ii) A(yl) is compact if and only if, for all e > 0, Pe is relatively dense. Thus both X(yl) and A(A) become dynamical systems. A dynamical system which is the closure of an orbit is minimal if and only if it is uniformly recurrent (for each point x and each open neighbourhood V of x the set of t in Rd for which t.x is in V is relatively dense in Rd). In the case of X(A) this condition on A is called repetitivity. In the case of A(yl) is comes for free with the compactness, though one can see that in fact it amounts to the relative denseness of each of the sets of e-almost periods Pe.
4. Model sets Fortunately there are large families of examples in which we can get a good feel for what X(yl) and A(A) are like. A cut and project scheme is a triple (Rd,H,L) of locally compact Abelian groups in which L is a lattice in Rd x H and for which the natural projections 7TI,7T2 satisfy TTI\^ is injective, and TC2(L) is dense in H: Rd ^-RdxH
^^H.
(9)
Z We let L := ITI{L) and * : L —> H be the mapping -KI O (TTI|J;) —X- By hypothesis, L is a discrete group and the group T := (Rd x H)/L = {(t,t*) \ t G L} is compact. The obvious Md-action on T (x + (t, t*) + L H-> (x + t,t*) + L) makes it into a minimal dynamical system. A regular model set (denned by the cut and project scheme (9)) is a non-empty set of the form A = x + {t G L : t* G W} where W C H is compact and satisfies the conditions
W =W
and
eH(dW)=0
and where OH is Haar measure on H. It is possible to replace the cut and project scheme by one with a smaller H if necessary, so that for u G H, u + W = W if and only if it = 0 [17]. We will assume that this condition holds in what follows. The regular model set A is generic if dW n L* = 0. Generic model sets are repetitive. 0 It is not hard to show that model sets are always Delone sets [11,12] the uniform discreteness coming from the compactness of W and the relative denseness from its non-empty interior. In fact model sets are even Meyer sets, as can be seen by applying the previous sentence to the model set A - A = {t G L : t* G W - W}. Regular model sets have uniquely defined autocorrelations,11 so we can consider the autocorrelation group A(A). A key point is that A(A) and T are isomorphic as topological groups [13], so in fact T(A) for a regular model set has a very natural interpretation — namely the completion of the orbit of A under the autocorrelation topology. c d
Model sets were introduced by Y. Meyer [9] in his study of harmonious sets. T h e autocorrelation can be expressed, using a Weyl-type theorem for uniform distribution [12,17], as an integral over H which is dependent only on W.
Mathematical quasicrystals: a tale of two topologies
73
Proposition 4.1. [13] Let A be a regular model set of the cut and project scheme (9). Then A(A) ~ T(yl) and the isomorphism is also a G-mapping. Now using the torus parameterization of Schlottmann [18] we have a fundamental result connecting our two topologies. Proposition 4.2. [18] Let A be a regular repetitive model set. Then the mapping /3 of (7) provides a continuous surjective W1,-mapping j3 : X(A) —» A(A). This mapping @ is 1-1 almost everywhere, in the sense that the set of elements ofA(A) which there lie more than one point of\(A) is of Haar measure 0.
(10) over
It is interesting to look at the nature of the 1-1-ness condition. We have pointed out that A(A) ~ T. Now T may be viewed as a parametrization of the different sets A(x, y) := x + {t G L : t* G — y + W} that are obtainable from W by changing varying x and y: A(x,y) = A(x',y') if (x, y) = {x',y') mod L. Each point set A' in X(A) is repetitive, since A is, and if )3(A') = (x, y) mod L then x + {t<EL:t*
£-y
+ W°} C A' C x + {t G L : t* G -y + W} ,
that is, it is determined by some set — y + W between the interior of — y + W and — y + W itself. The situation is easy if (— y + dW) D L* = 0 for then there are no points of A coming from the boundary and there is only one possibility for A'. But if t* G (—y + dW) D L* then there are at least two preimages under (3, one of which has t and one which does not. As y varies over H there is a dense set of translates — y + W for which —y + dW meets L*, whence the singular cases are inextricably bound in with the non-singular cases. For the construction of the mapping X(A) see [18]. It was noted by Anderson and Putnam [1] that locally X(yl) has the structure of an open ball in Rd times a Cantor set. To see this let us suppose that 0 G A. The set BT x Z where Z is all the points A' G X(A) for which 0 G A' is an open set neighbourhood of A in X(A). Furthermore, for each finite patch S of points of A containing 0, the set of points A' of X(A) containing S is a both open and closed in the induced topology on Z, so Z is totally disconnected. Z itself is closed in X(/l), hence compact, and it is also perfect, so a Cantor set. The cut and project formalism is sometimes criticized because of its introduction of nonphysical internal dimensions. In fact these internal dimensions are of completely physical origins (that is, determined by the geometry of the model set). We have just seen that T is none other than A(A) which is the completion of R d under the topology that considers the long-range order of A. We also have a remarkably simple description of the internal space H. Consider the subgroup L of M.d generated by the set of differences A — A. Supply this with the topology of the (coarse) autocorrelation — that is the topology for which an open neighbourhood basis of each point t G L is given by the sets t + Pe, e > 0. Then H is the completion of L under this topology [3]. So the internal space is simply a reflection of the almost periodic structure of A.
74
ROBERT V. MOODY
5. Diffraction The diffraction of a distribution of density in space is usually described as the square absolute value of some suitably normalized Fourier transform of this density. Alternatively it is the Fourier transform of the volume averaged autocorrelation. For finite sets of scatterers or for crystals, both lead to the same thing. With quasicrystals one has to be careful — because of issues of convergence, only the second one makes mathematical sense. Formally the definitions are as follows: Let A £ D r be locally finite. We represent this set as a non-zero regular positive Borel measure in the form of a Dirac comb
where 5X is the unit point (or Dirac) measure located at x.e Define wR = w| B and CoR = (UR)~- Then, the measure y(X>
7
"
. .UR**R
•
l
V-
vol(B fl ) ~ vol(B fi )
g
V
is well defined, since it is the (volume averaged) convolution of two finite measures.5 The autocorrelation ~/u of u; exists and is the limit of (7L ') in the vague topology as R —+ 00, if this limit exists [7]. It is then a positive definite measure with 7W = YlteA-A vW^t where the coefficients are given by rj(t)= 'V
lim R-^oo
}
Y
Vol( BR)
^
1= lim R^oo
* VO[(BR)
.#(AD(t K
+ A)nBR). K
'
(11) '
X
'
x-y=t
The second inequality uses the fact that as R becomes large the error caused by the mismatching of BR and t + BR becomes negligible with respect to the volume of BR. The function 7? can be extended from A — A to all of Md by setting r)(t) = 0 for all other t. It is a positive definite function. What is particularly relevant here is the fact that for all t,
d(A,t + A)= lim *{{A*£?BR) R—»oo
= 2(??(0)
- *'» '
(12)
WOl\tiR)
as can be seen from A A (t + A) = (A U (t + A))\A n (t + A). Thus the metric d and the autocorrelation coefficients 77(f) are directly related, whence our choice of name for the autocorrelation topology. The autocorrelation measure 7W is a positive definite, translation bounded measure and, as such, has a well-defined Fourier transform 7^ (also defined on Md), which is a positive and also translation bounded measure [4,7]. e
In a more realistic setting these points might be weighted to represent different scattering intensities or convolved with some atomic profiles. f If / is a function on R d then / is the function / ( x ) = /(—x) and for a measure /i, the measure p, is defined
by M) = Kf) for all /.
Mathematical quasicrystals: a tale of two topologies
75
Definition 5.1. j w is the diffraction of w. The pure point part of this measure is called the Bragg spectrum. The measure w is pure point diffractive if j u is a pure point (also called discrete or atomic) measure on R d . Proposition 5.1. [6] The measure u is pure point diffractive if and only if the autocorrelation measure j u is strongly almost periodic. This key result of the theory introduces a new concept — strong almost periodicity — which by virtue of its appearance here is crucial for the fundamental understanding of pure point diffraction. However it is not particularly intuitive. Fortunately, under the assumption that A — A is uniformly discrete (the Meyer condition), we have Proposition 5.2. [3] The autocorrelation measure 7W of u> is strongly almost periodic if and only if, for all e > 0, Pe is relatively dense. Thus, for our point set A, pure pointedness of the diffraction hinges around the relative denseness of the sets Pe — in other words the compactness of A(A).
6. The main theorem Proposition 6.1. [2] Let A C Rd be a Meyer set with a well-defined autocorrelation. Suppose that j3 : X(A) —> ^(^1) is continuous and 1-1 almost everywhere. Then A differs in density 0 from a regular model set. Furthermore, if A is repetitive then i) ii) Hi) iv) v)
X(7l) is the hull of a regular model set; X(/l) is uniquely ergodic and minimal (strictly ergodic); A is pure point diffractive; X(vl) has pure point dynamical spectrum; X(A) has continuous eigenfunctions.
j3 is a bisection if and only if A is a crystal. Thus model sets are characterized as Meyer sets for which a good map from ~K(A) to A(A) exists. All of ii) through v) are consequences of being a regular model set. It is conjectured that, along with the Meyer condition, collectively they are equivalent to the existence of the existence of the continuous almost everywhere 1-1 map /3. In outline, the proof of proposition 6.1 for repetitive A may be sketched as follows. The Meyer property implies finite local complexity (see equations (1) and (2)) and hence compactness of X(yl) (see proposition 3.1). Since (3 is continuous, A(A) is compact, whence by proposition 3.1 the sets Pe of almost periods are relatively dense. This implies pure pointedness, but more importantly, it provides us with a way to create the cut and project scheme. We let L be the subgroup of Rd generated by the set A — A and give it the group topology for which the sets Pe form a neighbourhood basis at 0. This is a uniform topology and its Hausdorff completion <j> : L —> H provides us with the internal group H. The subgroup L := {(t, (f>(t)) : t E L} of Rrf x H is a lattice and R d , H, L) is the cut and project scheme.
76
ROBERT V. MOODY
Its *-map is the mapping
the model set x + {t £ L : t* £ — y + W} and W = W. The uniform distribution properties of [12] and the almost everywhere 1-1-ness can be used to show that the boundary of W has measure 0. Hence X(A) = X(yl') is the local hull of a regular model set.
7. Additional thoughts The point of the paper is to show the relationship of the nature of quasicrystals to the intertwining of topologies of long-range and local order. Prom a physical point of view such a connection seems to be a basic underlying assumption. If small samples of quasicrystalline materials are supposed to be representative of some materials that can be extended indefinitely in space, and if the long-range order is supposed to be a defining characteristic of these materials then there has to be some notion of convergence in long-range structure on the basis of local convergence of structure. The present version of this work was intended to shed light on the nature of model sets, and as such makes some assumptions that are physically unrealistic. The primary of these assumptions are the treatment of atoms as points of equal scattering intensity and the very strong assumption of the Meyer condition. It might be as well to show what this innocent looking condition implies. Suppose that A is a Meyer set. Then A — A is uniformly discrete. A consequence of this (due to J. Lagarias) is that A — A C A + F for some finite set F. A consequence of that is that A ± • • • ± A (any fixed number of terms) is also uniformly discrete. For a whole circle of equivalences one may consult [10] or the starting point for all these results [9]. Recently Strungaru [14] proved that for any Meyer set (even weighted Meyer sets) the support of Bragg spectrum is relatively dense in R d . In other words all Meyer sets are, in the general sense of a pervasive Bragg spectrum, quasicrystals. The rigidity of Meyer sets, and even finite local complexity, limits the applicability of this work in understanding physical quasicrystals. Its value is more in pointing out the way in which the tension between long and short range order can be resolved in a nice mathematical setting Future work will focus on removing the Meyer condition and relaxing the notion of the point sets under consideration to measures. Generalizing in this way forces one to generalize the local and autocorrelation topologies and will create new versions of X and A. But one expects that some version of proposition 6.1 will emerge. A multi-colour version of the characterization of model sets by dynamical systems appears in [8].
Acknowledgments Thanks to Michael Baake, Daniel Lenz, Nicolae Strungaru, all of whom have contributed substantially to this project. I would also like to thank the Natural Sciences and Engineering
Mathematical quasicrystals: a tale of two topologies
77
Research Council of C a n a d a a n d t h e Banff International Research Station for supporting this research.
References 1. J. Anderson, I. Putnam, "Topological invariants for substitution tilings and their C*-algebras", Ergodic Th. and Dynam. Sys. 18, 509-537 (1998). 2. M. Baake, D. Lenz, R. V. Moody, "A characterization of model sets", in preparation. 3. M. Baake, R. V. Moody, "Weighted Dirac combs with pure point diffraction", preprint arXiv: math.MG/0203030. 4. C. Berg, G. Forst, Potential Theory on Locally Compact Abelian Groups, Springer, Berlin, 1975. 5. N. Bourbaki, Elements of Mathematics: General Topology, Chapters 1-4 and 5-10, reprint, Springer, Berlin, 1989. 6. J. Gil de Lamadrid, L. N. Argabright, Almost Periodic Measures, Memoirs of the AMS vol. 428, AMS, Providence, RI, 1990. 7. A. Hof, "On diffraction by aperiodic structures", Comm. Math. Phys. 169, 25-43 (1995). 8. Jeong-Yup Lee and R. V. Moody, "Characterization of model multi-colour sets", Annales Inst. Henri Poincare, in press. 9. Y. Meyer, Algebraic Numbers and Harmonic Analysis, North-Holland, Amsterdam, 1972. 10. "Meyer sets and their duals", in The Mathematics of Aperiodic Order, ed. R. V. Moody, NATO ASI vol. 489, 1997, pp. 403-441. 11. R. V. Moody, "Model sets: a survey", in From Quasicrystals to More Complex Systems, eds. F. Axel, F. Denoyer and J. P. Gazeau, EDP Sciences, Les Ulis, and Springer, Berlin, 2000, pp. 145-166; arXiv:math.MG/0002020. 12. R. V. Moody, "Uniform distribution in model sets", Can. Math. Bulletin 45, 123-130 (2002). 13. R. V. Moody, N. Strungaru, "Point sets and dynamical systems in the autocorrelation topology", preprint (available on the author's website). 14. N. Strungaru, "Almost periodic measures and long-range order in Meyer sets", Disc, and Comp. Geometry, in press. 15. C. Radin, M. Wolff, "Space tilings and local isomorphism", Geometriae Dedicata 42, 355-360 (1992). 16. W. Rudin, Fourier Analysis on Groups, Wiley, New York, 1962; reprint, 1990. 17. M. Schlottmann, "Cut-and-project sets in locally compact Abelian groups", in Quasicrystals and Discrete Geometry, ed. J. Patera, Fields Institute Monographs vol. 10, AMS, Providence, RI, 1998, pp. 247-264. 18. M. Schlottmann, "Generalized model sets and dynamical systems", in Directions in Mathematical Quasicrystals, eds. M. Baake and R. V. Moody, CRM Monograph Series vol. 13, AMS, Providence, RI, 2000, pp. 143-159.
Strings through the microscope V O L K E R SCHOMERUS (SPhT
Saclay)
Over the last few years, string theory has changed profoundly. Most importantly, novel duality relations have emerged which involve gauge theories of brane excitations on one side and various closed string backgrounds on the other. In this lecture, we introduce the fundamental ingredients of modern string theory and explain how they are modeled through 2D (boundary) conformal field theory. This so-called 'microscopic description' of strings and branes is an active research area with new results ranging from the classification and construction of boundary conditions to studies of 2D renormalization group flows. We shall provide an overview of such developments before concluding the lecture with an extensive outlook on some present and future research that is motivated by current problems in string theory. This includes investigations of non-rational and non-unitary conformal field theories.
1. Introduction During the last years several new elements have entered the picture of string theory and they have inspired many novel ideas in a variety of fields, including high energy physics and cosmology. The stringy image of our world (see figure 1) contains super-gravity propagating in a 10D background with some dimensions being compactified. In addition there are branes stretching out along p + 1-dimensional hyper-surfaces. Excitations of these branes give rise to gauge theory and matter that can propagate along the brane's world-volume. As inspiring as this picture has been for many recent developments in physics, it can also be misleading, especially when applied to some extreme situations in which the space becomes strongly
yi
I J
1 1
/
1
\
/ Figure 1. The picture of modern super-string theory contains closed strings which propagate in a 10D background, p-branes stretching along p + 1-dimensional surfaces and open strings whose end-points move within the branes' world-volume.
78
Strings through the microscope
79
curved or even singular. It is therefore crucial to keep in mind that such an image of the world only arises in limit of string theories in which the involved length scales are long compared to the string length and hence that an important task in string theory is to develop techniques which allow computing stringy corrections to the picture we sketched above. This goal has been a fruitful challenge for more than two decades now and it has lead to many new insights within the last years. Such developments are the main focus of this lecture. To begin with, we shall take a much closer look at the various elements of figure 1, enlarging them so that we can see the strings' finite extension. In the process we shall learn how to model the picture in mathematical physics. This will involve the whole wealth of boundary conformal field theory (CFT), a domain that is also known for its beautiful applications to critical phenomena (see e.g. the lecture of Smirnov for some recent applications). As we scan over the picture of the world, we shall gradually set up a dictionary between its various elements and concepts of 2D boundary conformal field theory. After this introductory part we shall start discussing some of the recent string inspired developments in boundary conformal field theory. Our presentation will follow the traditional devision into rational vs. non-rational conformal field theory. The former is relevant for the study of strings on compact components of the 10-dimensional world. This subject has seen an enormous boost in recent years and by now there exist very powerful techniques that are essentially universal, i.e., apply to very large classes of compact target spaces. Non-rational conformal field theory, on the other hand, is relevant for string theory in non-compact spaces. As we will discuss, such non-compact backgrounds are an essential ingredient in string theory duals of (large TV) gauge theory. They are also vital for studies of time-dependent string backgrounds. Applications to these two domains have pushed non-rational conformal field theory to the forefront of research in string theory.
2. Strings, branes and boundary conformal field theory The main purpose of this section is to zoom in on the elements of the picture we sketched above, to understand what they look like in full string theory and how they are modeled in mathematical physics. 2.1. Closed strings and bulk conformal field theory The first element we are going to enlarge is a junction at which three closed strings come together (see figure 2). Our aim here is to assign a number to this junction which we can interpret as an amplitude for the joining/splitting of closed strings. This number depends on the particular background we consider and on the states <j) of the three closed strings that participate in the process. The latter decorate the three external legs of figure 2 and they are taken from an infinite set of possible closed string modes. When we deal with strings in flat space, for example, states are characterized by a center of mass momentum and an infinite variety of different vibrational modes. To assign an amplitude to the vertex, we consider the latter as an image of a 3-punctured 2-sphere P 1 under the parametrization map (X^(z,z))n=o,i,...,9- Being the parametrization of the closed string's world-surface, XM are components of a 10-dimensional bosonic field
VOLKER SCHOMERUS
Figure 2. The interaction 3-vertex of three closed string modes $ „ is obtained from a 3-point function of a 2D conformal field theory on a closed surface P 1 . The fundamental fields of the 2D conformal field theory arise through the parametrization of the closed string's world-surface.
with action SCS[X} = S[X] = ^Lj
[ dzdz (g^(X)
+ B^(X))
3X»dXv
+ •••
(1)
where the dots stand for additional terms e.g. involving Fermions. If our strings propagate in flat space then the background metric g and B-neld B are constant. This implies that the action is quadratic and computations in the resulting free field theory can be reduced to Gaussian integrals. For more general backgrounds, however, both g and B depend on the coordinates of the background and we are dealing with (a special class of scale invariant) 2D non-linear u-models. Our discussion here has brought us to the first entry in our dictionary: it is claimed that possible closed string backgrounds are associated with 2D conformal field theories on closed surfaces. Closed string modes cf> correspond to fields <J>(z, z) in the conformal field theory. The 3-point functions of such fields are determined by conformal symmetry up to some constants ( * i ( z i , z i ) $2(22,-22) $3(23,23)} =
C(
\z12r"
12
12 A;23
1*23 r " 2 3 |zi3
where z^ = Zi — Zj and the exponents Ay are certain linear combinations of the scaling dimensions Aj of the fields <3>j. Along with the spectrum of these scaling dimensions A;, the 3-point couplings C are known to contain all the information about the conformal field theory. They obey various consistency conditions which might be difficult to solve, starting from the seminal paper by Belavin et al. [9], many solutions have been constructed. In string theory, the 3-point couplings provide the amplitude of the 3-point vertex, i.e., they tell us how likely it is that two closed string modes <j>i, 4>2 combine into a single closed string in the mode
Strings through the microscope
81
objects are very much like extremal black holes, only that they extend in p spatial directions. Since closed string theory is considered as an interesting candidate for a consistent short distance completion of gravity, we are lead to the obvious problem of finding a description for branes in string theory. To answer this question let us now enlarge another element of our image (see figure 3). We claimed above that branes are charged and massive objects. As such, they can interact with various other objects in the bulk, e.g. through exchange of gravitons and higher closed string modes. When such a closed string mode hits the brane or is emitted from it, we obtain the picture shown in figure 3. The parametrization of the closed string world-surface now
Figure 3. The interaction of closed strings with a brane is related to the bulk 1-point function in a 2D conformal field theory on a disc. Different branes correspond to different boundary conditions imposed along the boundary of the disk.
involves a map from a disc to the 10-dimensional background such that the boundary of the disc is embedded into the world-volume of the brane. We ensure this by imposing Dirichlet boundary conditions for all components of (X11) which are associated with directions transverse to the brane. Our discussion here motivates the following general proposal that was first formulated by Polchinski [45]: branes in some closed string background correspond to conformally invariant boundary conditions of the associated conformal field theory. It is well known that boundary conditions in conformal field theory can be characterized by the 1-point functions of bulk fields $. Using once more conformal invariance and a conformal mapping from the disc to the upper half-plane, the 1-point functions are easily shown to possess the following general form
<$CM)r- =
flBUW v _
r|2AA
In other words, conformally invariant boundary conditions are uniquely determined by the 1-point couplings B((p). The latter provide a measure for how strongly a given closed string mode (j> couples to the brane and hence in particular encodes information on the mass and charge of the brane. 2.3. Open strings and boundary fields At this point we are still missing the open strings that we would expect to be around as soon as we introduce branes. This is indeed the case. To argue that open strings are indeed part
82
VOLKER SCHOMERUS
of our present setup, a brief look at lattice spin models with boundaries may be helpful. The latter are closely related to the 2D continuum field theories we are dealing with. Moreover, it is intuitively obvious that such lattice systems contain a set of excitations which can only live along the boundary and which depend on the specific boundary condition we impose. In the 2-dimensional Ising model with free boundary conditions, for example, we can measure the boundary magnetization. Once we fix the boundary spins, however, measurements of this quantity become trivial and hence do not correspond to an observable of the model. In continuum field theory, boundary excitations are described by fields which can only be inserted at points along the boundary, i.e., to so-called boundary fields. These are exactly the objects that we need in order to model open strings. As in the case of closed strings, there exists an infinite number of open string modes tp and to these we assign boundary fields ^f(u) of the corresponding boundary conformal field theory. The spectrum of open string modes depends on the brane we consider, just as the spectrum of boundary fields depends on the boundary condition we impose along the boundary. There is one new set of couplings that comes with the vertex of three open string modes (see figure 4). The computation of such a vertex involves a disc with three fields inserted
V2 Figure 4. The interaction 3-vertex of open string modes ty„ is obtained from a 3-point function of boundary fields in a 2D conformal field theory on the disc or half-plane.
along the boundary. Once more, this amplitude is determined by conformal invariance up to a set of 3-point couplings,
2 3 7/.. tm / , „ &23 |7W l 2 T1122 |lW |Ul3 |A 2 3 23r A
where and the u, are coordinates for the boundary of the upper half-plane. Needless to say that the 3-point couplings O encode the probability of two open string modes ipi,ip2 to combine into a single open string in the mode ip^. This concludes our journey through the various elements of figure 1. Along the way we have learned to characterize boundary conformal field theories through three different sets of couplings, the 3-point couplings C of the bulk fields, the couplings B appearing in the 1-point function of bulk fields and the 3-point couplings O of boundary fields. This is a rather abstract way to think about boundary conformal field theories, especially when compared with the action (1) for closed strings that we started our discussion with. Since it is sometimes helpful to think about conformal field theories in terms of such actions, we
Strings through the microscope
83
would like to mention that in the case of open strings, S[X] gets modified to Sos[X] = S[X] + - ^ - 7 / duAll(X)duX'i(u) Iva! Jan
(2)
where the first term is the same as in formula (1) with E being replaced by a surface with boundary. The new boundary term means that open strings can couple through the velocity duX** of their endpoints to a new background field, namely to a vector field A^(X). We interpret A as a gauge field on the brane and think of the string endpoints as carrying charges. The action (2) is to be supplemented by Dirichlet boundary conditions on X for all directions transverse to the brane. 2.4. Instabilities, dynamics and RG-flows In the last three subsections, our dictionary has received quite a number of entries, but there are a few more that we would like to add in passing. To motivate these additional entries, we note that many of the existing string backgrounds are actually unstable. This applies e.g. to the 26-dimensional bosonic string theory and it is a common phenomenon in the context of branes. In fact, configurations of several stable branes tend to be unstable, the most famous example being the pair of a brane and its anti-brane. In general relativity we can easily detect instabilities of some given static solution to the classical field equations. All we need to study is the spectrum of fluctuations around the solution. Modes that are exponentially enhanced are associated with instabilities. Let us now see how we can find unstable modes in string theory. To this end we consider some static background. Its conformal field theory description involves a product of a time-like free field X and a unitary conformal field theory whose target is the 9-dimensional spatial slice of the static background. This theory possesses an exponentially growing mode if we can add the following conformally invariant terms to the action 5S~
[ dzdzee*x°iz'*)$(z,z)+ JT,
[ duee*x°(u)y(u), JdT,
(3)
where $(z,z) and $f(u) are bulk and boundary fields in the conformal field theory for the spatial slice and the coefficients e$ and e* must be real in order to have an exponential growth with time. But such exponential fields of a time-like free boson possess a positive scaling dimension so that the fields $ and \I> must have scaling dimensions A<j> < 2 and A>j, < 1 if we want SS to be scale invariant. In other words, instabilities of a closed string background and branes therein correspond to relevant bulk and boundary fields, respectively. If we wanted to control the full decay process generated by an instability, we would have to solve the theory with interaction 5S. At present, no example of such a theory has been constructed within 10D string theory. Nevertheless, some insights into aspects of decay processes have been obtained using a proposed relation with renormalization group (RG) flows. This requires, however, that we reduce our ambitions and only try to identify the final state of the decay process rather than the whole dynamical evolution. Let us consider an initial state which is encoded in a CFTi for the 9-dimensional spatial slice of the static background. Next we choose some relevant bulk or boundary field $ or * to trigger the decay. After all radiation has has escaped, we expect our system to settle down in a static
84
VOLKER SCHOMERUS
final state that is again described by a product of a time-like free boson and a CFT2 for the spatial slice of the final stable state. Hence, the dynamics has lead us from some CFTi to CFT 2 with the help of relevant fields $ or \I>. It is obviously tempting to think that CFT 2 is the IR fixed point of the RG flow from CFTi on the RG trajectory that is generated by adding $ or \I>. Supporting evidence for this proposal is strong in the case of boundary instabilities, but the proposal seems much harder to justify when we deal unstable bulk theories. A more thorough discussion of these issues and many further references can be found in the literature [27]. 2.5. Summary: dictionary between ST and B C F T Let us pause for a moment and review all the entries in our dictionary between string theory and 2D boundary conformal field theory: Closed string background X
2D CFT Cx on closed surface E
Closed string mode <j>
Bulk field $(z, z), 2 e E, in Cx
Closed string vertex C
3-point function of bulk fields
Brane B in background
Boundary condition (BC) for Cx => BCFT Cx on open surface £
Open string mode ip
Boundary.field * ( u ) , u G <9£, in Cx
Open string vertex 0
3-point function of boundary fields
Instability of a background
Relevant bulk- or boundary-field
Initial/final state of decay
UV/IR fixed point of an RG-flow
Recall that the last line has the status of a conjecture. To test our dictionary one may form sentences in string theory (or gravity) and check that they translate into meaningful sentences of (boundary) conformal field theory.
3. Rational BCFT and strings in compact backgrounds We are now prepared to begin reviewing results on the explicit construction of boundary conformal field theories. Systematic rational conformal field theory model building usually starts with theories whose target space is a compact group manifold G. It then proceeds to cosets and orbifolds. Among them one finds all known models with interesting applications to statistical physics and string theory. Here we shall explain the most central results of the field using one special example, namely the group G =SU(2)= S3. A few comments on various generalizations and extensions are collected at the end. 3.1. Strings on the 3-sphere Before we look at strings moving on a 3-sphere, let us recall that, to leading order in a', cr-models of the form (1) are conformal invariant if the background fields satisfy
Strings through the microscope
a'Tl^(g) - jH^HJT
85
= 0(a'2).
Here, 7c is the curvature of the background metric g and H = dB. Hence, if strings move in a curved background, then a non-vanishing magnetic field B is unavoidable. With this in mind let us consider strings on a 3-sphere S3 = SU(2). Their world-surfaces are parametrized by a group valued map h : E —> SU(2) and these parametrization fields appear in an action of the form
S[h] = -£- f dzdz tr(h-ldh){h-l8h) 47T J
+—
/ V ^ / i " 1dh)*\
(4)
127T J
The second term is known as the Wess-Zumino-Witten (WZW) term. Locally, it may be rewritten in the form of the second term in equation (1). We also note that the parameter k is a measure for the size of the 3-sphere and that, in the quantum theory, k must be integer. The WZW model possesses a large symmetry, given by two commuting actions of the affine Lie algebra su(2) k . Each of these two algebras is generated by the Laurent modes J%,n G Z, of an su(2)-valued conserved current J = J^t^ with relations {JZ,JZl}=ifl/pJZ+m
+ kn6^6n^m.
(5)
The two affine Lie algebras act on the space of fields in the theory, extending the actions that are induced by the usual left and right translation of the group on itself. It turns out that a wide class of boundary conformal field theories can be written down in terms of data from the representation theory of their infinite dimensional symmetries. The most important such data are the set J of unitary representations, the so-called modular S-matrix, the Clebsch-Gordan multiplicities N of the fusion product and an infinite dimensional generalization of the 6J-symbols which is known as the Fusing matrix F. For the SU(2) affine Lie algebra, explicit formulae can be spelled out without much effort. In this case, the requirement of unitarity leaves us with just a finite number of representations. We label them through j € J = { 0 , 1 / 2 , . . . , k/2}. Their conformal weights are given by AJ = J(J + l)/(k + 2) and for the modular 5-matrix one finds S
Sln ij = \l Ik- TJ7 + 2""
TT(2» + 1 ) ( 2 J + 1 )
T k +^ 2
(6)
•
With the help of the Verlinde formula it is not difficult to compute the following fusion rules TV from the modular S'-matrix, for k-
\i- j\,...,min(i+j,
k-i-j),
otherwise. They are similar to the Clebsch-Gordan multiplicities of the Lie algebra su(2), apart from the truncation which appears whenever i + j > k/2. Formulae for the fusing matrix also exist in the literature. Since they are a bit more involved, we shall not present them here. Let us only mention one property concerning their limiting behavior as we send k —» oo,
hmM^]=U^}.
W
86
VOLKER SCHOMERUS
This concludes our list of representation theoretic data for the affine Lie algebra. We shall see these quantities again in a moment when we write down formulae for the couplings B of closed strings to branes on S3, for their open string spectra and the 3-point vertices O of open string modes. 3.2. Branes on group manifolds Many constructions of conformal invariant boundary theories are ultimately based on fundamental observations made by J. Cardy [13]. When applied to the case at hand, they provide us with a finite set of k + 1 boundary conformal field theories which we label by J = 0 , 1 / 2 , . . . , k/2, just as we enumerate the unitary representations of the corresponding affine Lie algebra. Recall that these boundary theories can be uniquely characterized through the couplings B that appear in the 1-point functions of the bulk fields. According to Cardy's solution, these 1-point couplings are simply given by the matrix elements of the modular .S-matrix,
{ 2 J =
^^
7kw^
(9)
where Aj = j(j + l)/(k + 2). The superscripts a,b = —j,... ,j, placed at the symbol
A ^d,h+1
h^dh,
(10)
where Ad denotes the adjoint action of the group on its Lie algebra. For higher groups G, such a field B is gives a non-trivial potential for the pull-back of the canonical WZW 3-form to the branes' world-volumes. Such 2-form potentials on conjugacy classes of a group G were considered in the mathematical literature. There they appear in connection with deformations of the theory of co-adjoint orbits. Our geometrical interpretation of the boundary theories with 1-point functions (9) may be justified in several different ways. The argument we want to give here is based on the observation [21] <*f (*. *)>J(k) " ~°° /
dn(g) S(ti(g) - tf0) 4>f(g),
(11)
JSV(2)
where dfi(g) denotes the Haar measure on SU(2), ^(g) ~ DJab(g) are the wave functions of the lightest closed string modes and fl is the azimuthal angle on the 3-sphere. In taking the limit, we allowed the boundary label J to depend on the level k and we defined -do := 27rlim J(k)/k. Since 0 < J < k/2, the angle ??o lies in the interval [0,n]. The appearance of the ^-function on the right hand side shows that closed string modes indeed detect a spherical object which is localized at the azimuthal angle •& = i90.
Strings t h r o u g h t h e microscope
87
Figure 5. Maximally symmetric branes on a 3-sphere are localized along conjugacy classes of SU(2), i.e., they are either point-like or they wrap a 2-sphere in S 3 . Such spherical branes exist only for a discrete set of radii.
3.3.
Open strings on group manifolds
Let us now turn to the open strings which can propagate along the k + 1 different spherical branes on 5 3 . According to our dictionary, this means that we want to obtain the set of boundary fields and the 3-point couplings O for each of the above boundary theories. Following general arguments, it is possible to show that the space of boundary fields carries the action of a single affine Lie algebra rather than two commuting actions as in the case of bulk fields. This reflects a similar reduction of the geometric symmetries: whereas there exist two commuting sets of (left and right) translations on SU(2), only a special combination, namely the adjoint action, leaves the conjugacy classes invariant. Under the action of the affine Lie algebra su(2) k , the space Hj of boundary fields decomposes as Hj = HJJ = Q.NjjiV,
(12)
where Vj, j = 0 , 1 / 2 , . . . , k/2, denote irreducible unitary representations of the affine Lie algebra su(2) k , and where NJJ3 are the associated fusion rules (see equation (7)). Note that only integer spins j appear on the right hand side of equation (12) and that in the limit k —• oo, the summation on the right hand side is truncated at j m a x = 2 J. This means that the decomposition of TLj is as close as it can be to the decomposition of Mat (2 J + 1) into su(2) multiplets. In fact, there is a correspondence between the spin j multiplets of matrices Yj £ Mat(2J + l) and ground states in V, C "Hj. It is also worth pointing out the similarity between the labeling of open string ground states and spherical harmonics Y£(ip,6). Only the cut-off at j m a x = 2 J on the spin j does not appear for functions on a 2-sphere. Boundary fields V"? associated with ground states of the open string are labeled by the representation j = 0 , 1 , . . . , j m a x and o = —j,..., j . Their 3-point vertices were found [4] using important previous work by Runkel [53] on minimal models, (^(Ul)^(W2)*cfcM)7 = K 2 r A l 2 | u 2 3 | - A - K 3 | - A - [ : ^ ] F J f c [ ^ ]
(13)
where the symbol in square brackets stands for the Clebsch-Gordan coefficients of su(2). The latter guarantee that both sides of the equation transform in the same way under the action of the zero mode algebra su(2). The non-trivial part of equation (13) therefore concerns the relation of 3-point vertices for open strings with entries of the Fusing matrix.
88
VOLKER SCHOMERUS
We conclude this subsection with a brief remark on an interesting link to non-commutative geometry. Let us observe that the 3-point couplings simplify significantly if we send k to infinity,
<*?(uo ^(uamte))^ 0 0 = {jjjj}[:ikc]-
(14)
Here we have used the property (8) of the fusing matrix. It is straightforward to check that the numbers appearing on the right hand side of this equation arise naturally from the multiplication of ordinary (2J + 1) x (2J + 1) matrices. In fact, the same numbers appear when we rewrite the product Y* • Yjj' as a linear combination of the su(2) multiplets ycfc € Mat(2J + 1). In this sense, the 3-point vertices for open strings on branes in SU(2) provide an infinite dimensional deformation of matrix multiplication. The emergence of non-commutative matrix algebras in the context of open string theory is not surprising. As we have stressed before, the end-points of open strings behave like charged particles which can couple to the vector potential A of the magnetic field on the brane. Hence, the non-commutativity we encounter in the context of open strings is directly related to a similar phenomenon for e.g. electrons in a strong magnetic field. Relations between branes, open string and non-commutative geometry were initially discovered for branes in flat space [14,16,55] and they have been studied extensively for several years [17,59]. 3.4.
Brane dynamics on group manifolds
Now that we have an exact conformal field theory solution for spherical branes on S 3 , we would like to briefly comment on some dynamical processes. Here we shall use the conjectured relation with RG-flows in the 2D boundary conformal field theory. It turns out that the study of boundary renormalization groups flows in models with an SU(2) current algebra is a classical problem of mathematical physics that was first addressed in the context of the Kondo-model. The Kondo-model is designed to understand the effect of magnetic impurities on the lowtemperature conductance properties of a 3D conductor. The latter can have electrons in a number k of conduction bands. If the impurities are far apart, their effect may be understood within an s-wave approximation of scattering events between a conduction electron and the impurity. This allows to formulate the whole problem on a 2-dimensional world-sheet for which the coordinates (u, v) are associated with the time and the radial distance from the impurity, respectively. One can build several currents out of the basic fermionic fields. Among them is a spin current J(u,v). Its Laurent modes satisfy the relations (5) of a su(2)k current algebra. This spin current is the one that couples to the magnetic impurity of spin JM which is sitting at the boundary v = 0, oo
/
duk^{u,Q).
(15)
-oo
Here, the matrices A M ,^ = 1,2,3 form a (2JM + l)-dimensional irreducible representation of su(2) and the parameter A controls the strength of the coupling. Note that the term (15) is identical to the coupling of open string ends with velocity J^iu, 0) to a background gauge field Ap = AM (see formula (2)). Hence, AM may be interpreted as a constant non-abelian gauge field on one of our branes. For M > 1, the interaction is marginally relevant so that, according to the proposal formulated in subsection 2.4, switching on such non-abelian gauge
Strings through the microscope
89
fields represents an instability. Its effect on the brane can be understood by searching for an RG fixed-point along the RG trajectory generated by the term (15). Fortunately, a lot of techniques have been developed to deal with perturbations of the form (15), going back even to the work of Wilson. From the old analysis it is known that nontrivial fixed points are reached at a finite value A = A* of the renormalized coupling constant A if 2 JM < k (exact- or over-screening resp.). These fixed points have been identified through several different approaches and a simple rule summarizing the results of such investigations was formulated by Affleck and Ludwig [1]. This rule can be applied directly to our branes on SU(2) [22] and it shows that e.g. point-like branes carrying a constant U(M) gauge field decay into an extended spherical brane with label JM = {M — l ) / 2 .
Figure 6. Studies of RG flows in models with a SU(2) current symmetry show that point-like branes carrying a constant U(M) gauge field decay into an extended spherical brane with label JM = {M — l)/2.
For large values of k, the result we have just formulated admits an easier derivation using some of the geometric structures we have outlined above. It is widely known that massless open string modes give rise to a gauge theory on the brane to which they are attached. But due to the presence of the B-field on our spherical branes, the open strings detect a non-commutative (matrix) geometry (cf. last subsection). Hence, we are tempted to conclude that a special non-commutative gauge theory should be associated with branes on the 3-sphere. This is indeed the case and the precise form of the relevant gauge theory has been determined following the usual rules of string theory [5], at least to leading order in a'. Once the action of this gauge theory is found, one can search for instabilities and classical ground states. The outcome of such an analysis confirms very nicely the formation of spherical branes from point-like branes that we have described in our discussion of RGfixed points above. This agreement is not accidental. In fact, when k is large, the action of the non-commutative field theory is a functional on the space of boundary couplings whose stationary points approximate zeros of the /3-function. The analysis of RG-fixed points through non-commutative field equations has significant technical advantages over more traditional approaches. These are related to a very efficient book-keeping of the possible boundary couplings through non-commutative variables. 3.5. Overview: various generalizations and extensions The material presented in this section has been generalized in many different directions and we would like to list a few of these developments before we leave the rational boundary
90
VOLKER SCHOMERUS
conformal field theories. a Not surprisingly, results similar to the ones we have reviewed here are also available for all other compact simple simply connected Lie groups. For many of the higher groups, moreover, there exist new families of maximally symmetric branes that are associated with outer automorphisms of G and hence have no analogue in the case of SU(2). Their boundary couplings B and open string spectra were first studied by Felder et al. [21] (see also [25,43]). Results on the open string couplings and dynamics have also been obtained [6]. In addition to the maximally symmetric branes, i.e., those that admit an action of the whole group G, one can impose boundary conditions which break part of this symmetry. Investigations of such branes were initiated in a paper by Moore et al. [39] and a rather systematic construction has been developed more recently [50]. It is worth mentioning that symmetry breaking branes on group manifolds possess interesting applications to defect lines in 2D conformal field theory. With the theory of strings and branes on group manifolds being under such good control, one can start to descend to orbifolds and cosets thereof. Studies of branes and open strings in curved orbifold backgrounds possess a long history [10,11,24,42,49]. The theory of branes in coset models has been treated by many authors (see e.g. [19,23,26,39] for some early contributions and references).
4. Recent progress for strings in non-compact spaces The last part of this lecture is devoted to some developments in the area of non-rational boundary conformal field theory. As indicated in the introduction, these are highly relevant for the study of dualities between string and gauge theory and for time-dependent processes in string theory. Here we shall explain these motivations in some detail and then focus mainly on one particular model, namely on the Liouville field theory. 4.1. String/gauge theory dualities According to an old observation by 't Hooft, structures found in closed string amplitudes are very reminiscent of features in the perturbative expansion of large N gauge theories. This suggests that it might be possible to compute gauge theory amplitudes from string theory. Though the idea has been around for a long time, only a single concrete example was known until 1997: The duality between the quantum mechanics of hermitian matrices and a toy model of string theory with a 2-dimensional target space (see e.g. the lectures of Klebanov [31] and references therein). After branes had entered the stage of string theory, the situation changed drastically. We may grasp the fundamental role of branes for such developments through the following short analysis of figure 7. The image shows a simple string diagram that admits two rather distinct interpretations. We can either think of a process in which a closed string mode is emitted from one brane and propagates a distance Ax through the background before being absorbed by a second brane. Alternatively, the process can be seen as a pair creation and subsequent annihilation of open strings which end on the two involved branes. Though none of these interpretations is distinguished a priori, actual computations may favor one of them, depending on the separation Ax between the branes. If they are very far apart only a
The references provided in the following paragraph are highly incomplete. A more extensive list can be found e.g. in the lecture notes [56].
Strings through the microscope
91
the massless closed string modes contribute significantly to the amplitude and hence one can safely perform the string theory computation in its super-gravity approximation. For short distances Ax between the branes, however, many massive closed string modes start to enter the computation and we better switch to a description in terms of light, i.e., mildly stretched, open strings. This reinterpretation leaves us with a one-loop computation in the gauge theory of massless open string modes. The reasoning we have just gone through hints toward an intimate relation between closed string models and gauge theory. This relation has a few interesting features. Observe, for example, that it does not preserve the underlying space-time dimensions since closed strings propagate in the 10D background while gauge bosons cannot leave the branes' p + 1-dimensional world-volume. In addition, the relation also mixes different loop orders as can be inferred from figure 7. Here, we found a closed string tree level amplitude that contains information about a gauge theory one-loop diagram.
Figure 7. Depending on the distance Ax between branes the shown string amplitude can be approximated by either a tree level computation in super-gravity or a one-loop gauge theory computation.
With all this in mind it seems no longer surprising that branes provide us with many new and concrete dualities between gauge theories and models of closed strings. The most famous example certainly is Maldacena's duality [35] between M — 4 Super-Yang-Mills theory and string theory on AdSs xS5. It involves an SU(N) gauge theory at large JV with 't Hooft coupling A = SYM-W2 on one side and an AdSs space with curvature radius R2/a' ~ y/\ on the other. Hence, if we want to study gauge theory at finite 't Hooft coupling, the curvature of AdS 5 cannot be neglected so that string effects become important. Not only does this bring us back to the main theme of the lecture, it also hides novel challenges arising from the fact that the relevant closed strings propagate on a non-compact curved background. Similar observations can be made for all the other known examples of such dualities between gauge and string theory [2]. Therefore, extending the methods and results reviewed in the previous section to non-compact target spaces becomes an important new task for conformal field theory. Here we shall restrict our attention to the example of Liouville theory. Since it appears as constituent of 2D string theory, Liouville theory enters one side of the aforementioned duality with matrix quantum mechanics. More recent developments show that even this duality, though it pre-dates the discovery of branes in string theory by several years, ultimately arises is the same way as all the modern Ad S/conformal field theory-type dualities.
92
VOLKBR SCHOMERUS
4.2. Liouville theory and its applications Liouville theory may be considered as the minimal model of non-rational conformal field theory. It describes the motion of strings in a single direction with an exponential potential SL[X] ~ f dzdz (dXdX
+ fj,e2bx).
(16)
Here, X is a free bosonic field with background charge (linear dilaton). Note that a change of the parameter fi can be re-absorbed in a constant shift of the field X. Hence, the dependence of Liouville theory on \x is rather trivial. On the other hand, the correlation functions of the model display a very non-trivial dependence on the second parameter b. If we send b to zero while rescaling x = bX and A = b2^,, we recover a model of particles in the potential V{x) = Aexp2x. The full field theoretic model contains both perturbative and non-perturbative corrections to this limit. The latter turn out to render the quantum theory self-dual with respect to the replacement b —* b _ 1 . After these remarks on the model itself, let us briefly comment on two interesting applications. It is well known that a flat space with trivial background fields must be 26-dimensional for bosonic strings to propagate in it. This restriction to 26 dimensions, however, may be circumvented through the presence of non-trivial background fields. An example realizing such a scenario is the famous 2D string theory. It is constructed from a product of a 1dimensional free bosonic field with Liouville theory at b = 1. Obviously, such a closed string theory is of little practical interest, but is provides a valuable toy model for bosonic string theory. Compared to its 26-dimensional relative, the 2-dimensional model has the advantage of being perturbatively stable. The second application is much more recent and also less well tested. It has been proposed [28] that a time-like version of Liouville theory at b = i describes the homogeneous condensation of a closed string tachyon. A short look at the classical action (3) makes this proposal seem rather plausible. Recall that actions of this form describe a tachyon decay with a closed/open string tachyon profile $ / * . If we consider a closed string tachyon which decays homogeneously, then we must choose $ = \i = const. In addition, the parameter e^, is now forced to be eM = 2 so that the interaction term (3) becomes scale invariant. After a Wick-rotation of the field X° = iX, the interaction term looks formally like the interaction term in Liouville theory, only that the we have b = i, just as it was claimed above. Though on first sight this may all seem rather convincing, there are many subtleties hidden in the relation between Liouville theory and tachyon condensation. We will be able to display them more clearly once we have presented the exact solution of Liouville theory. 4.3. The solution of Liouville field theory The solution of Liouville theory on closed and open surfaces has been a major success of the last ten years. We shall briefly spell out some of the most central formulae before we return to the applications. Let us begin with a few simple remarks on the spectrum of the model. It is rather obvious that ground states (fip of closed strings in the exponential potential can be labeled by a real number P > 0 which corresponds to the momentum of an incoming wave in the region X —> — oo where the potential vanishes. The states (j>p are stringy analogues of the particle
Strings through the microscope
93
wave functions
Here, we have also introduced the notation a = (a\ + a.i + a^)/2 and otj = a — a.j. In addition, the special function T is obtained from Barnes's double Gamma function by T(a):=rj1(a|6)6-1)rj1(Q-a|6)r1).
(18)
The solution (17) was first proposed by H. Dorn and H. J. Otto [15] and by A. and Al. Zamolodchikov [66], based on extensive earlier work by many authors (see e.g. the reviews [58,63] for references). Crossing symmetry of the conjectured 3-point function was then checked analytically in two steps by Ponsot and Teschner [46] and by Teschner [63,64]. Different conformally invariant boundary conditions were found and studied in several papers [20,62,67]. As one might suspect, there exist two different types of boundary theories, one describing point-like objects, the other corresponding to space-filling branes with an additional exponential boundary interaction that is felt by the end-points of open strings. It turns out that a point-like brane can only sit in the region where the Liouville potential V becomes large. The simplest such brane possesses a 1-point coupling of the form
Open strings on this brane possess a discrete spectrum without continuous zero modes, in agreement with our geometrical intuition (see [67] for details). Let us remark that the brane with boundary coupling (19) is only one in an infinite set which is parametrized by two positive integers (n,m). The status of the boundary conditions with (n,m) ^ (1,1) as physical branes in Liouville theory, however, is less clear. Extended branes in the Liouville background, on the other hand, obviously possess one continuous physical parameter. It enters the theory through an additional boundary interaction of the form VB ~ Ms e x P bX. Note that constant shifts of the field X have been used before to rescale the bulk parameter fi. Hence, there is no freedom left now to absorb the new boundary parameter /ig- Consequently, coupling constants of the boundary theory display a non-trivial dependence on HB as can be seen in the case of the bulk 1-point coupling, [20] B.(P) = (irw(b2))
' cos(2nsP)-±
_25/4i7rP
"•
Here, the parameter s is related to the boundary coupling ^B by the relation (i2B sin 7r62 = fi cosh2 irsb.
(20)
94
VOLKER SCHOMERUS
Each of these extended branes comes with a continuous spectrum of open string modes. For imaginary values of the boundary parameter s, one may also encounter additional discrete states [62]. The 3-point vertex for open strings on the extended branes has been constructed by Ponsot and Teschner [47]. The formulae are more complicated and we refer the interested reader to the original papers. 4.4. Some applications and extensions In this subsection we would like to comment on some of the applications and possible extensions of the results we outlined above. As we pointed out earlier, Liouville theory is a building block for the string theory dual of matrix quantum mechanics, 'MQM
/? / dt '/
l
-{dtM{t)f + V{M{t)) ,
where M(t) are hermitian TV x TV matrices and V is a cubic potential. To be more precise, the duality involves taking TV and f3 to infinity while keeping their ratio K = N//3 fixed and close to some critical value KC. In this double scaling limit, the matrix model can be mapped to a system of non-interacting fermions moving through an inverse oscillator potential, one side of which has been filled up to a Fermi level at A
r~\ Figure 8. In the double scaling limit, the hermitian matrix model can be mapped to a system of noninteracting fermions moving through an inverse oscillator potential, one side of which has been filled up to a Fermi level at A/t = re — rec ~ g7 •
With a quick glance at figure 8, we conclude that the model must be non-perturbatively unstable against tunneling of Fermions from the left to the right. This instability is reflected in the asymptotic expansion of the partition sum and even quantitative predictions for the mass m ~ a/gs of the instantons were obtained. The general dependence of brane masses on the string coupling gs along with the specific form of the coupling (19) have been used recently to identify the instanton of matrix quantum mechanics with the localized brane in the Liouville model [7,32,40,41]. In this sense, branes had been seen through investigations of matrix quantum mechanics more than ten years ago, i.e., long before their central role for string theory was fully appreciated. Concerning the application of Liouville theory to the condensation of tachyons, there exist a few quite encouraging recent developments. Let us recall that the description of tachyon condensation requires sending the parameter b to b = i and evaluating the correlators for imaginary momenta P. We can at least find one quantity for which this procedure may be carried out easily. It is the coupling of closed strings to a decaying brane, i.e., the 1-point coupling B in the theory (3) with $ = 0 and # = HB- In fact, from the formula (20) we read off that
Strings through the microscope
(exp (iEX°(z, z)) ) ~ ME
-^-
95
E
.
This result may be checked directly [33,60] through perturbative computations [12,44,51]. Other quantities in the rolling tachyon background have a much more singular behavior at b = i. In fact, Barnes's double T-function T2{x\b,b~1) is a well denned analytic function as long as Re b ^ 0. If we send b —> i, on the other hand, T^ diverges. Nevertheless, it can be shown [57] that the function T and the bulk couplings (17) possess a well defined limit. These limits, however, are no longer analytic and hence cannot be Wick rotated. It turns out that there are at least two different limiting theories. The first one occurs for real momenta (Euclidean target space) and it coincides with the interacting c = 1 limit of unitary minimal models that was first discovered by Runkel and Watts [54]. A limiting bulk theory with imaginary momenta (Lorentzian target space) has also been constructed [57]. Future investigations will show whether this does provide an exact description of closed string rolling tachyons. Even though Liouville theory is the only non-rational model that has been solved completely, there exist at least partial solutions for a few other non-compact backgrounds. These include the H£ = SL(2, C)/SU(2) model, a non-compact relative of the group manifold SU(2) and Euclidean version of Ad S3 ~ SL(2,R). The bulk structure constants C of this theory were found several years ago by Teschner [61] and couplings B of closed strings to various branes have been proposed [34,48]. Open string couplings C for non-compact branes in #3", on the other hand, remain unknown. Let us point out that / 7 | model itself suffers from an imaginary magnetic field and hence is unphysical. But its solution represented a key step toward the construction of two rather important string backgrounds. The first concerns the description of strings in Ad S3. In a series of papers [36-38], amplitudes in this Lorentzian version of the H% model have been worked out and interpreted. Another interesting application arises from the coset space H3 /R. This model became famous as the Euclidean 2D black hole [65]. It is a non-compact analogue of parafermions and hence a close relative of N = 2 minimal models. Non-compact parafermions and N — 2 minimal models have been argued to be T-dual to the Sine-Liouville (as reviewed in [30]) and the N = 2 Liouville theory [29], respectively. References to the original literature and recent result on branes in these backgrounds can be found in the recent literature [18,52].
Acknowledgments I wish to thank the organizers of the ICMP 2003 for the opportunity to present this overview.
References 1. I. Affleck, A. W. W. Ludwig, "Critical theory of overscreened Kondo fixed points", Nucl. Phys. B 360, 641 (1991). 2. O. Aharony, S. S. Gubser, J. M. Maldacena, H. Ooguri, Y. Oz, "Large N field theories, string theory and gravity", Phys. Rept. 323, 183-386 (2000); arXiv:hep-th/9905111. 3. A. Yu. Alekseev, V. Schomerus, "D-branes in the WZW model", Phys. Rev. D 60, 061901 (1999); arXiv:hep-th/9812193.
96
VOLKER SCHOMERUS
4. A. Yu. Alekseev, A. Recknagel, V. Schomerus, "Non-commutative world-volume geometries: Branes on SU(2) and fuzzy spheres", JHEP 09, 023 (1999); arXiv:hep-th/9908040. 5. A. Yu. Alekseev, A. Recknagel, V. Schomerus, "Brane dynamics in background fluxes and non-commutative geometry", JHEP 05, 010 (2000); arXiv:hep-th/0003187. 6. A. Y. Alekseev, S. Fredenhagen, T. Quella, V. Schomerus, "Non-commutative gauge theory of twisted D-branes", arXiv:hep-th/0205123. 7. S. Y. Alexandrov, V. A. Kazakov, D. Kutasov, "Non-perturbative effects in matrix models and D-branes", JHEP 0309, 057 (2003); arXiv:hep-th/0306177 . 8. E. Barnes, "The theory of the double gamma function", Philos. Trans. Roy. Soc. A 196, 265 (1901). 9. A. A. Belavin, A. M. Polyakov, A. B. Zamolodchikov, "Infinite conformal symmetry in twodimensional quantum field theory", Nucl. Phys. B 241, 333-380 (1984). 10. M. Bianchi, A. Sagnotti, "On The Systematics Of Open String Theories", Phys. Lett. B 247, 517 (1990). 11. L. Birke, J. Fuchs, C. Schweigert, "Symmetry breaking boundary conditions and WZW orbifolds", Adv. Theor. Math. Phys. 3, 671-726 (1999); arXiv:hep-th/9905038. 12. C. G. . Callan, I. R. Klebanov, A. W. W. Ludwig, J. M. Maldacena, "Exact solution of a boundary conformal field theory", Nucl. Phys. B 422, 417 (1994); arXiv.hep-th/9402113. 13. J. L. Cardy, "Boundary conditions, fusion rules and the Verlinde formula", Nucl. Phys. B 324, 581 (1989). 14. C.-S. Chu, P.-M. Ho, "Noncommutative open string and D-brane", Nucl. Phys. B 550, 151 (1999); arXiv:hep-th/9812219. 15. H. Dorn, H. J. Otto, "Two and three point functions in Liouville theory", Nucl. Phys. B 429, 375-388 (1994); arXiv:hep-th/9403141. 16. M. R. Douglas, C. Hull, "D-branes and the noncommutative torus", JHEP 02, 008 (1998); arXiv:hep-th/9711165. 17. M. R. Douglas, N. A. Nekrasov, "Noncommutative field theory", Rev. Mod. Phys. 73, 977 (2001); arXiv:hep-th/0106048. 18. T. Eguchi, Y. Sugawara, "Modular bootstrap for boundary N = 2 Liouville theory", JHEP 0401, 025 (2004); arXiv:hep-th/0311141. 19. S. Elitzur, G. Sarkissian, "D-branes on a gauged WZW model", Nucl. Phys. B 625, 166-178 (2002); arXiv:hep-th/0108142. 20. V. Fateev, A. B. Zamolodchikov, A. B. Zamolodchikov, "Boundary Liouville field theory. I: Boundary state and boundary two-point function", arXiv:hep-th/0001012 . 21. G. Felder, J. Frohlich, J. Fuchs, C. Schweigert, "The geometry of WZW branes", J. Geom. Phys. 34, 162-190 (2000); arXiv:hep-th/9909030. 22. S. Fredenhagen, V. Schomerus, "Branes on group manifolds, gluon condensates, and twisted K-theory", JHEP 0104, 007 (2001); arXiv:hep-th/0012164. 23. S. Fredenhagen, V. Schomerus, "D-branes in coset models", JHEP 02, 005 (2002); arXiv: hep-th/0111189. 24. J. Fuchs, C. Schweigert, "Symmetry breaking boundaries. I: General theory", Nucl. Phys. B 558, 419 (1999); arXiv:hep-th/9902132 . 25. M. R. Gaberdiel, T. Gannon, "Boundary states for WZW models", arXiv:hep-th/0202067. 26. K. Gawedzki, "Boundary WZW, G/H, G/G and CS theories", arXiv:hep-th/0108044. 27. M. Gutperle, M. Headrick, S. Minwalla, V. Schomerus, "Space-time energy decreases under world-sheet RG flow", JHEP 0301, 073 (2003); arXiv:hep-th/0211063. 28. M. Gutperle, A. Strominger, "Timelike boundary Liouville theory", Phys. Rev. D 67, 126002 (2003); arXiv:hep-th/0301038. 29. K. Hori, A. Kapustin, "Duality of the fermionic 2d black hole and N = 2 Liouville theory as mirror symmetry", JHEP 0108, 045 (2001); arXiv:hep-th/0104202. 30. V. Kazakov, I. K. Kostov, D. Kutasov, "A matrix model for the two-dimensional black hole", Nucl. Phys. B 622, 141 (2002); arXiv:hep-th/0101011. 31. I. R. Klebanov, "String theory in two-dimensions", arXiv:hep-th/9108019.
Strings through the microscope
97
32. I. R. Klebanov, J. Maldacena, N. Seiberg, "D-brane decay in two-dimensional string theory", JHEP 0307, 045 (2003); arXiv:hep-th/0305159. 33. F. Larsen, A. Naqvi, S. Terashima, "Rolling tachyons and decaying branes", JHEP 0302, 039 (2003); arXiv:hep-th/0212248. 34. P. Lee, H. Ooguri, J. w. Park, "Boundary states for AdS(2) branes in AdS(3)", Nucl. Phys, B 632, 283 (2002); arXiv:hep-th/0112188. 35. J. M. Maldacena, "The large JV limit of superconformal field theories and supergravity", Adv. Theor. Math. Phys. 2, 231 (1998), [Int. J. Theor. Phys. 38, 1113 (1999)]; a r X i v : h e p - t h / 9711200. 36. J. M. Maldacena, H. Ooguri, "Strings in AdS(3) and SL(2,R) WZW model. I", J. Math. Phys. 42, 2929 (2001); arXiv:hep-th/0001053. 37. J. M. Maldacena, H. Ooguri, J. Son, "Strings in AdS(3) and the SL(2,R) WZW model. II: Euclidean black hole", J. Math. Phys. 42, 2961 (2001); arXiv:hep-th/0005183. 38. J. M. Maldacena, H. Ooguri, "Strings in AdS(3) and the SL(2,R) WZW model. Ill: Correlation functions", Phys. Rev. D 65, 106006 (2002); arXiv:hep-th/0111180. 39. J. Maldacena, G. W. Moore, N. Seiberg, "Geometrical interpretation of D-branes in gauged WZW models", JHEP 07, 046 (2001); arXiv:hep-th/0105038. 40. E. J. Martinec, "The annular report on non-critical string theory", arXiv:hep-th/0305148. 41. J. McGreevy, H. Verlinde, "Strings from tachyons: The c = 1 matrix reloaded", JHEP 0312, 054 (2003); arXiv:hep-th/0304224. 42. M. Oshikawa, I. Affleck, "Boundary conformal field theory approach to the critical twodimensional Ising model with a defect line", Nucl. Phys. B 495, 533-582 (1997); arXiv: cond-mat/9612187. 43. V. B. Petkova, J. B. Zuber, "Boundary conditions in charge conjugate sl(N) WZW theories", arXlv:hep-th/0201239. 44. J. Polchinski, L. Thorlacius, "Free Fermion Representation Of A Boundary Conformal Field Theory", Phys. Rev. D 50, 622 (1994); arXiv:hep-th/9404008. 45. J. Polchinski, "TASI lectures on D-branes", arXiv:hep-th/9611050 . 46. B. Ponsot, J. Teschner, "Liouville bootstrap via harmonic analysis on a noncompact quantum group", arXiv:hep-th/9911110. 47. B. Ponsot, J. Teschner, "Boundary Liouville field theory: Boundary three point function", Nucl. Phys. B 622, 309-327 (2002), arXiv:hep-th/0110244. 48. B. Ponsot, V. Schomerus, J. Teschner, "Branes in the Euclidean AdS(3)", JHEP 0202, 016 (2002); arXiv:hep-th/0112198. 49. G. Pradisi, A. Sagnotti, "Open String Orbifolds", Phys. Lett. B 216, 59 (1989). 50. T. Quella, V. Schomerus, "Symmetry breaking boundary states and defect lines", JHEP 06, 028 (2002); arXiv:hep-th/0203161. 51. A. Recknagel, V. Schomerus, "Boundary deformation theory and moduli spaces of D-branes", Nucl. Phys. B 545, 233 (1999); arXiv:hep-th/9811237. 52. S. Ribault, V. Schomerus, "Branes in the 2-D black hole", JHEP 0402, 019 (2004); arXiv: hep-th/0310024. 53. I. Runkel, "Boundary structure constants for the A-series Virasoro minimal models", Nucl. Phys. B 549, 563 (1999); arXiv:hep-th/9811178. 54. I. Runkel, G. M. T. Watts, "A non-rational CFT with c = 1 as a limit of minimal models", JHEP 09, 006 (2001); arXiv:hep-th/0107118. 55. V. Schomerus, "D-branes and deformation quantization", JHEP 06, 030 (1999); arXiv: hep-th/9903205. 56. V. Schomerus, "Lectures on branes in curved backgrounds", Class. Quant. Grav. 19, 5781 (2002); arXiv:hep-th/0209241. 57. V. Schomerus, "Rolling tachyons from Liouville theory", JHEP 0311, 043 (2003); arXiv: hep-th/0306026. 58. N. Seiberg, "Notes On Quantum Liouville Theory And Quantum Gravity", Prog. Theor. Phys. Suppl. 102, 319 (1990).
98
VOLKER SCHOMERUS
59. N. Seiberg, E. Witten, "String theory and noncommutative geometry", JHEP 09, 032 (1999); arXiv:hep-th/9908142. 60. A. Sen, "Rolling tachyon", JHEP 0204, 048 (2002); arXiv:hep-th/0203211. 61. J. Teschner, "On structure constants and fusion rules in the SL(2,C)/SU(2) WZNW model", Nucl. Phys. B 546, 390 (1999); arXiv:hep-th/9712256. 62. J. Teschner, "Remarks on Liouville theory with boundary", arXiv:hep—th/0009138 . 63. J. Teschner, "Liouville theory revisited", Class. Quant. Gray. 18, R153-R222 (2001); arXiv: hep-th/0104158. 64. J. Teschner, "A lecture on the Liouville vertex operators", arXiv:hep-th/0303150 . 65. E. Witten, "On string theory and black holes", Phys. Rev. D 44, 314 (1991). 66. A. B. Zamolodchikov, A. B. Zamolodchikov, "Structure constants and conformal bootstrap in Liouville field theory", Nucl. Phys. B 477, 577-605 (1996); arXiv:hep-th/9506136. 67. A. B. Zamolodchikov, A. B. Zamolodchikov, "Liouville field theory on a pseudosphere", arXiv: hep-th/0101152.
Critical percolation and conformal invariance STANISLAV SMIRNOV
(Royal Inst, of Technology, Stockholm)
Many 2D critical lattice models are believed to have conformally invariant scaling limits. This belief allowed physicists to predict (unrigorously) many of their properties, including exact values of various dimensions and scaling exponents. We describe some of the recent progress in the mathematical understanding of these models, using critical percolation as an example.
1. Introduction For several 2D lattice models physicists were able to make a number of spectacular predictions (non-rigorous, but very convincing) about exact values of various scaling exponents and dimensions. Many methods were employed (Coulomb Gas, Conformal Field Theory, Quantum Gravity) with one underlying idea: that in some sense the model concerned has a continuum scaling limit (as mesh of the lattice goes to zero) and the latter is conformally invariant. Recently mathematicians came up with some new (and rigorous) approaches. We will describe some of the progress made, much of it due to Lawler, Schramm, and Werner. We will center on the percolation which has a simple definition, and is now fairly well understood. In Bernoulli site percolation with p G [0,1] each vertex of some graph is declared open (or colored blue) with probability p and closed (or colored yellow) with probability 1 — p, independently of each other. We are interested in graphs, which approximate geometry of the plane, especially square, triangular, and hexagonal lattices. One studies clusters, which are connected subgraphs of a given color. One can also color edges (bond percolation) or, in a planar graph, faces. Since vertices are colored independently, the model has locality property: observables for disjoint areas are independent. However, the model exhibits a complicated behavior. It is now well-known that in these models (both in site and bond cases) a phase transition occurs: there is a critical value pc E (0,1) such that when p < pc, there is no infinite cluster of open vertices, while if p > pc, there is a unique such infinite cluster. The value of pc is lattice-dependent. It was shown by Kesten that for bond percolation on the square lattice pc = 1/2, whereas for site percolation on the square lattice pc ~ 0.59 with exact value still unknown. See the textbooks [5,7] for a good introduction to the subject. The percolation probability 6(p) that the origin belongs to the infinite open cluster is positive for p > pc (but strictly smaller than 1 as the origin might be surrounded by closed vertices), see [5]. Arguments from theoretical physics predict the exact power law for 6: 9(P)-(P-Pc)5/36,
P^Pc
+.
(1)
The exponent 5/36 is supposed to be independent (unlike pc) of the planar lattice involved. Several other predictions were made, for example if CQ denotes the open cluster containing
99
100
STANISLAV SMIRNOV
the origin (which might be empty), the expected number x(p) of vertices in it, provided it is finite, was predicted to behave like X(p):=E(#C0|#Co
p^pc.
(2)
Another power law was predicted for the correlation length:
«*-(£*£
#Co
P-^Pc:
(3)
where \y\ denotes the distance from the site y to the origin. All these results were first conjectured (on the basis of experiments and heuristical arguments) and then predicted (using Coulomb Gas methods, Conformal Field Theory, or Quantum Gravity) by physicists. Now they have been rigorously established for site percolation on triangular lattice by a combination of [8,11-13,18,19]: Theorem 1. Scaling laws (1), (2), (3) hold for percolation on the triangular lattice. Here we restrict ourselves to the discussion of rigorous methods, see [19] for references to the physics literature. Below we sketch some ideas which contributed to the proof. We start with conformally invariant scaling limits for percolation, and then describe their connections to SLE, or Schramm-Loewner Evolution, a way to obtain conformally invariant random curves, introduced by Schramm. We conclude with some open questions.
2. Harmonic conformal invariants For some time it appeared difficult to formulate rigorously that percolation has a conformally invariant scaling limit, cf. [1]. One simple interpretation, which attracted much attention, was suggested by Langlands, Pouliot, and Saint-Aubin in [9]. Given a topological rectangle (a simply connected domain Q, with boundary points a, b, c, d) one can superimpose a lattice with mesh 5 onto 0 and study the probability lis (fi, [a,b], [c,d\) that there is an open cluster joining the arc [a,b] to the arc [c,d] on the boundary of Q. In [9] extensive computer experiments were performed to check that there is a limit II := lim,5_o IXs, which is independent of the lattice, and depends only on the conformal modulus of the configuration CI, a, b, c, d. The latter conjecture authors attributed to Aizenman. The results were conclusively positive, leading to the conjecture that crossing probabilities have a conformally invariant scaling limit. Using Conformal Field Theory Cardy was able to derive in [4] (unrigorously) the exact value of the crossing probabilities. Assuming conformal invariance, he mapped the domain to the upper half plane C+, three boundary points to 0,1, oo, and derived a differential equation for II as a function of the fourth point. The solution turned out to be a hypergeometric function 2F1:
Critical percolation and conformal invariance
101
There are several definitions for the function 2-Fi- It can be written as an integral, leading to the following formula for F: F{U)
= r („ (1-v)) Jo
-2/3
dv
f\v( 1-v))
-2/3
dv.
Jo
Later Carleson observed that the very same hypergeometric function maps the half-plane to an equilateral triangle, and there Cardy's formula takes a particularly easy form. Namely, suppose that we map Q conformally to an equilateral triangle with side length 1, with three points going to vertices a, b, c and the fourth point to some z G ab. Then crossing probability II([a,2], [b, c}) converges to the distance from a to z:
6-+0
I.
Cardy's formula along with conformal invariance was proved in [18] for the site percolation on the triangular lattice: Theorem 2. For the critical site percolation on the triangular lattice, as mesh goes to zero crossing probabilities converge to a limit which is conformally invariant and satisfies Cardy 's formula. We will start by sketching the proof. The method is different from that of Cardy, and the hypergeometric function arises in a different way. As we will see later, SLE gives this function yet another interpretation. For this particular model it is known by results of Kesten and Wierman that pc = 1/2, so each vertex of the triangular lattice with mesh 5 (or equivalently each hexagon in the dual honeycomb lattice) is open or closed with equal probability 1/2. The method utilizes self-duality of the model (there is a horizontal yellow crossing if and only if there is no vertical blue crossing) and the fact that pc = 1/2 (which allows to change the colors to the opposite while preserving the probabilities). Like Cardy, we fix a domain Q and three boundary points a, 6, c, and study the behavior of II(fi, [ab], [cz]) as the forth point z moves. However we allow z to move inside domain D. as well, considering a new function Ha(z) :— H(z,a,b,c,Q,S) which is the probability that a yellow path from the boundary arc ab to the boundary arc ca separates z from be. On the arcs ab and ca this function becomes the crossing probability n , whereas on the arc be it vanishes. One defines functions Hb and Hc symmetrically. By Russo-Seymour-Welsh estimates (see [5,18]) one can bound their Holder norms, so as 5 —• 0 we can choose a uniformly converging subsequence.
102
STANISLAV SMIRNOV
It turns out that these three functions are (approximatively) discrete harmonic, and so their limits are harmonic. The discrete derivative of a function daHa in the direction a is the difference Ha(z') — Ha(z) for two neighboring sites z, z' with z' — z = ad. Let Q denote the area above the lowest (i.e. closest to 6c) yellow crossing from ab to ca. Then Ha(z) = P(z G Q), and so Ha(z') - Ha(z) = F(z' e Q) - P ( z e Q) = P(z' e Q,z $ Q)-F(z
e Q,z' i Q).
The two probabilities on the right hand side have an easy combinatorial interpretation: e.g. in the first case the lowest yellow crossing of hexagons passes though the edge zz', and by duality there is a blue crossing from z to the arc be (which prevents existence of a lower yellow crossing). So the first term is the probability of three multicolored crossings from z to the three boundary arcs (whereas the second is the same for z'). A combinatorial argument, based on the fact that pc = 1/2, and employed earlier by Aizenman, Duplantier, and Aharony [3], shows that such probabilities are independent of the colors of crossings, as long as both colors are present. If we change the colors from yellow-yellow-blue to blue-yellow-yellow, we will have a similar picture, but instead of crossing a yellow path between ab and ca in the direction a we will be crossing a yellow path between be and ab in the rotated by 2TT/3 direction aexp(27ri/3). So probabilities contributing to daHa are identified with probabilities contributing to 9aexp(2iri/3)-^6In this way one arrives to Cauchy-Riemann equations (in the basis of cube roots of 1): daHa
« C>aexp(27ri/3)-£ff> ~ 9a exp(47ri/3)-^c-
(5)
This implies that the scaling limits of the functions H satisfy exactly such Cauchy-Riemann equations. Moreover, on every boundary arc we know the values of one of them: e.g. Ha(a) = 1, Ha = 0 on be. Such boundary value problem has a unique solution, given in the half-plane by a complexification of Cardy's hypergeometric function. It is even easier to see this in Carleson's form: in an equilateral triangle these three functions are linear. Note also that this problem and hence its solution are conformally invariant. Since the resulting limit is independent of subsequence chosen, we conclude that functions H have a scaling limit, which is conformally invariant and satisfies a complexification of Cardy's formula. Once Cardy's formula is established, one can prove some statements about the scaling limit, [18]. For example, consider domain 0 with three boundary points a, b, c and the lowest (closest to be) yellow path from the boundary arc ab to the boundary arc ca. By a priori estimates of Aizenman and Burchard [2] the family of the laws of the lowest crossings (for various values of the mesh) is weakly precompact, so every sequence has a weakly converging subsequence. To prove that there is a scaling limit, it is sufficient to show that this limit is independent of the chosen subsequence. But by Cardy's formula for any curve 77 we know the probability (in the limit) that the lowest crossing went completely below 77, which is exactly the crossing probability for the domain obtained by cutting a part of 0 away along ??. Such events generate (by disjoint unions and complements) the BorelCT-algebraof crossings with Holder topology, and so the limiting law is determined uniquely. Moreover, it is conformally invariant since the events involved are.
Critical percolation and conformal invariance
103
Building upon this, one reasons similarly to show the existence and conformal invariance of the scaling limit for the perimeter of a cluster, i.e. an interface between two clusters of opposite color. It turns out that it is characterized by the same properties as Schramm's SLE(6), so they coincide. Alternatively one can aim from the beginning at proving that the interface converges to SLE(6) — an approach we discuss below. Once this connection is established, one can employ SLE in calculating percolation exponents.
3. Loewner Evolution Loewner Evolution is a differential equation for a Riemann uniformization map for a domain with a growing slit. It was introduced by Loewner in [15] in his work on Bieberbach's conjecture. Loewner Evolution allows to write differentials of various functionals defined for planar domains when domain is perturbed by adding a slit, and it was successfully applied to many optimization problems. There are two standard setups: radial, when the slit is growing towards a point inside the domain, and chordal, when the slit is growing towards a point on the boundary. In both cases we choose a particular Riemann map by fixing value and derivative at the target point. We will restrict our discussion to the chordal case, though radial is equally interesting in our context. Chordal Loewner evolution describes uniformization for the upper half-plane C + with a slit growing from 0 to oo (one deals with another domain fl with boundary points a, b by mapping it to C+ so that a, b — i » 0, oo). Loewner dealt only with slits given by smooth simple curves, but more generally one allows any set which grows continuously in conformal metric when viewed from oo. We will omit the precise definition of "allowed slits" (more extensive discussion in this context can be found in [10]), only noting that all simple curves are included. The random curves which arise from lattice models (e.g. cluster perimeters or interfaces) are simple curves. Their scaling limits are not necessarily simple, they have no "transversal" self-intersections. For such a curve to be an allowed slit it is sufficient to visit no point thrice. Parameterizing the slit 7 in some way by time t, we denote by gt(z) the conformal map sending C+ \ jt to C+ normalized so that at infinity gt{z) = z + a(t)/z + 0(l/\z\2), the so called hydrodynamic normalization. It turns out that a(t) is a continuous strictly increasing function (it is a sort of capacity-type parameter for 7 t ), so one can change the time so that
gt(Z)=z+^+o(jLy
(6)
Denote by w(t) the image of the tip •fit). The family of maps gt (also called a Loewner chain) is uniquely determined by the real-valued "driving term" w(t). The general form of the Loewner's theorem can be stated as follows: L o e w n e r ' s t h e o r e m . There is a bijection between allowed slits and continuous real valued functions w(t) with w(0) = 0. This bijection is given by the differential equation 2 (7) dtgt(z) = — ^ 77T, go(z) = z gt{z)-w{t)
104
STANISLAV SMIRNOV
4. Schramm-Loewner Evolution Loewner's theorem states that a deterministic curve 7 corresponds to a deterministic driving term w(t). Similarly a random 7 corresponds to a random w(t). One obtains S L E ( K ) by taking w(t) to be a Brownian motion with speed K: Definition. Schramm-Loewner Evolution, or SLE(K), taking w(t) = y/RBt, n e [0, 00).
is the Loewner chain one obtains by
It is shown in [16] that the resulting slit will be almost surely a Holder continuous curve. So we will also use the term SLE for the resulting random curve, i.e. a probability measure on the space of curves (to be rigorous one can think of a Borel measure on the space of curves with Holder norm). Different speeds K produce different curves, which become more "fractal" as K increases: we grow the slit with constant speed (measured by capacity), while the driving term "wiggles" faster. For example, for K < 4 the curve is a.s. simple, for 4 < K < 8 it a.s. touches itself, and for n > 8 it is a.s. space-filling (i.e. visits every point in C+). See [16] for the discussion of basic properties of SLEs and references. We follow Schramm to show that this choice of w(t) arises naturally. Schramm decided to describe the scaling limits of cluster perimeters, or interfaces for lattice models assuming their existence and conformal invariance. This naturally led to introduction of SLE in [17].
Figure 1. Critical site percolation on triangular lattice superimposed over a rectangle. Dobrushin boundary conditions produce the interface from the lower left to the upper right corner. The law of the interface converges to SLE(6) when mesh goes to zero.
The reasoning went similarly to the following: consider a simply connected domain Q with two boundary points, a and b. Superimpose a lattice with mesh 5 and consider some lattice model, say critical percolation with the Dobrushin boundary conditions, coloring the vertices blue on the boundary arc ab and yellow on the boundary arc ba. This enforces existence (besides many loop interfaces) of an interface between yellow and blue clusters running from a to b, which is illustrated by figure 1 for a rectangle with
Critical percolation and conformal invariance
105
two opposite corners as a and b. So we end up with a random simple curve (a broken line) connecting a to b inside Cl. The law of the curve depends of course on the lattice superimposed. If we believe the physicists' predictions, as mesh tends to zero, this law converges (in a weak-* topology) to some law A = A(Q, a, b) on curves from a to b inside O. Furthermore, in this setup the conformal invariance prediction can be formulated as follows: (A) Conformal invariance: The law is conformally invariant: for a conformal map
+ s)-i
\GS = (z- w{s) + • • •) o (z - w(t) + •••) = z - (w(s) + w(t)) -\
,
106
STANISLAV SMIRNOV
concluding that w(t + s) - w(s)\Gs = w(t). This means that w(t) is a continuous (by Loewner's theorem) stochastic process with independent stationary symmetric (apply (A) with anti-conformal reflection <j>(u+iv) — —u+iv) increments. Thus w(t) has to be a Brownian motion with certain speed K £ [0, oo): w(t) — sfkBt. So one logically arrives at the definition of SLE, and what we call Schramm's principle. A random curve satisfies (A) and (B) if and only if it is given by SLE(K) with certain K £ [0,oo). The discussion above is essentially contained in Schramm's [17] for the radial version, when slit is growing towards a point inside and the Loewner differential equation takes a slightly different form. To make this principle a rigorous statement, one has to ask the curve to be almost surely an allowed slit. In order to use the above principle one still has to show the existence and conformal invariance of the scaling limit, and then calculate some observable to pin down the value of K. For percolation one can employ locality property or Cardy's formula to show that K = 6 (we will discuss below why SLE(6) is the only SLE matching Cardy's formula). So Schramm concluded in [17] that if percolation interface has a conformally invariant scaling limit, it must be SLE(6).
5. SLE as a scaling limit One way to show that certain random curve coincides with SLE is to determine infinitely many observables, which could prove difficult (for percolation, locality helps to create many observables from just one). Fortunately, the situation turns out to be much nicer: if one can show that just one (nontrivial) observable has a limit satisfying analogues of (A) and (B), convergence to S L E ( K ) (with K determined by the values of the observable) follows. This was demonstrated by Lawler, Schramm and Werner in [14] in establishing the convergence of two related models: of Loop Erased Random Walk to SLE(2) and of Uniform Spanning Tree to SLE(8). We describe how a modification of their ideas gives another proof that percolation perimeter converges to SLE(6). For percolation on a fixed lattice in a domain O with boundary points o, b and Dobrushin boundary conditions consider interface running from a to 6, see figure 1. Already mentioned estimates [1,2] imply that collection of interface laws on lattices with different mesh is precompact (in the weak-* topology on the space of Borel measures on Holder continuous curves). So to show that as mesh goes to zero the interface law converge to the law of SLE(6), it is sufficient to show that the limit of any converging subsequence is in fact SLE(6). Take some converging subsequence, whose limit is a random curve 7 with law A. Though 7 is a scaling limit of a simple curve, 7 itself will a.s. be non-simple. But known a priori estimates [1] show that 7 a.s. visits no point thrice and so is an allowed slit. So we can (by mapping Cl to C+ and applying Loewner's theorem) describe 7 by a Loewner evolution with a (random) driving force w(t). It remains to show that w(t) = V&Bt.
Critical percolation and conformal invariance
107
Add two more points on the boundary, making fi a topological rectangle axby and consider the crossing probability 11,5 (Q, [a,x], [b,y]) (from the arc ax to the arc by on a lattice with mesh 6). Parameterize the curve 7 in some way by time and assume t to be small enough so that 7[0, t] does not reach x,y. The crossing probability conditioned on 7(0, £] coincides with crossing probability in the slit domain Q \l[0,t], (an analogue of the property (A)) we can write by the total probability theorem Us (SI, [a,x], [b,y]) = E(U5 ( O \ 7 [ 0 , t ] , [l(t),x], [b,y])).
(8)
If enough a priori estimates are available, identity (8) also holds for the scaling limit II := lims^o 11(5 °f the crossing probabilities, which we know to exist, be conformally invariant, and satisfy Cardy's formula. Mapping to half-plane and applying conformal invariance for the map gt(z) — w(t) we write n(C+,[0,a:],[c» > y])=E(n(C + \7[0 ) t],[7(t),a:],[cx3 ) y])) = E(IL(C+,[0,gt(x)
-w(t)],[oo,gt(y)
-w{t)]))
•
(9)
By conformal invariance (under Mobius transformation z i-> (z — y)/(x — y)), and Cardy's formula n(C+,[0,a;],[oo,y])=n(c+,
y
,[oo,0]
=F
x-y
x-y
for Cardy's hypergeometric function F. Rewriting in this way both sides of the equation (9) we arrive at
x \ ^-y)
_EFfgt(x)-w(t) \9t(x) - gt(y),
This provides some information about w(t), but it is difficult to use before we get rid of gtTo that purpose we fix the ratio x/(x—y) =: 1/3 (anything not equal to 1/2 would do) and let x tend to infinity: y := — 2x, x —> +00. Using the normalization gt(z) = z + 2t/z + 0(1/z2) at infinity, writing Taylor expansion for F, and plugging in values of derivatives of F at 1/3, we obtain F{l\=EFf
*{x)-v,{t)
3J
\gt(x) = EF
gt(-2x),
x - w(t) + 2t/x + 0(l/x2) (x + 2t/x + 0(l/x2)) - ( - 2 i + 2t/(-2x) +
F(>_^I +
| J , + o(^
r(2/3) ^ E x r(l/3)r(4/3) 3 2 /325« 2
0(l/x2))
(„(,)» - 6 t ) + 0 ( i ) .
108
STANISLAV SMIRNOV
Observing that coefficients by 1/x and 1/x2 on the right hand side should vanish, we conclude that Ew(t)=0, Ew(t)2-6t = 0. (10) Applying the same reasoning to domain Cl \ 7(0, t] relative to Q, \ 7(0, s] gives identities (10) for the increments of w(t). Thus w(t) is a continuous (by Loewner's theorem) process such that both w(t) and w(t)2 — 6i are local martingales so by Levy's theorem w(t) = ^/&Bt, and therefore SLE(6) is the scaling limit of the critical percolation interface. The argument will work wherever Cardy's formula and a priori estimates are available, particularly for triangular lattice, giving the following [18]: Theorem 3. Consider the site percolation on the triangular lattice in a simply connected domain Q with boundary points a and b and Dobrushin boundary conditions. Then the interface running from a to b has a conformally invariant scaling limit, which coincides with SLE(6). 6. Calculations for SLE The value K = 6 appears in the argument above because of the specific values of the conformal martingale considered. The above reasoning also indicates that very few functions can arise as conformal invariant martingales for SLEs. Indeed, this was used by Lawler, Schramm, and Werner to determine the values of various probabilities and expectations which are conformally invariant or covariant. We will sketch one of their calculations, showing that assuming convergence of the percolation interface to SLE(6) one can derive Cardy's formula from the properties of SLE. Of course, Cardy's formula was used to establish the convergence in the first place. But the argument below provides new insight into it and can also be applied in many other situations. The scaling limit of the crossing probability II (C+, [0, x], [00, y]) is equal to the probability that SLE(6) (which is the scaling limit of the percolation interface) touches the interval [a;,+00] before the interval [—00,y]. This can be seen in the figure 1: the vertical gray crossing forces the interface to touch the upper side of the rectangle before it touches the right side. We will calculate similar probability for S L E ( K ) , which is driven by w(t) = y/HBt. Denote this probability by IiK. By conformal invariance (under the Mobius transformation z >—> (z — y)/{x — t/)), it depends only on x/(x — y):
nK(c+,[o,x],[oo,z/]) = n J c + ,
X
x-y
-,1
=:f
H "fe)'
(n>
Denote Xt := gt(x) - y/nBu Yt := gt(y) - y/HBu Ut :='Xt/(Xt - Yt). The function IIK (C + , [0,x], [oo,j/]|7[0, £]) is a local martingale (with respect to the usual filtration for Bt). Applying conformal invariance for the map gt(z) — \fnBt as before, we write
n K ( c + , [0, x], [00, y ]| 7 [o, t]) = n K (c+ \ 7 [o, t], h(t), x], [«>, y\) = n K (C+, [0,gt(x) - y/UBt], [00,gt(y) - yfcBt]) = UK{C+,[0,Xt],[oo,Yt])
=
FK(Ut).
Critical percolation and conformal invariance
109
Thus FK(Ut) is a local martingale. Let us calculate its differential. By Loewner equation, 2 dXt = -^-dt - y/iidBu
2 dYt = —dt -
VndBt.
So by Ito's formula JTT
J
du
d
^t
_
2
Xt+Yt
y/R
~ x7=YtdBt
* = xT=Yt = xtYt(xt-Ytf = 2{Xi
+Y
^-Yt)ds
•X-tit
- yfcdB. = 2j^^ds u t/tU
- yfcdB.,
(12)
~ t)
where we changed the time so that ds = dt/(Xt — Yt)2. Applying Ito's formula to FK (one also has to prove that it is smooth) we get dFK(Ut) = F'(Ut) Uy~™*ds
- V^dB^j
= (2ut^lUut)F'(Ut)
1
+
-^Kds
+ ^ " ( ^ ) ) ds -
yfcF'(Ut)dBs.
Since FK(Ut) is a martingale, the drift term has to vanish, leading to the following differential equation for FK (which is similar to the Cardy's equation):
This equation can be integrated by writing .,
_ , , ,.,
FZ(u)
4 l-2u
( 4,
V
which leads to FK(u) =d+C2
[U (v(l - v)YA/K Jo
dv.
From obvious boundary conditions FK(0) = 0, FK(1) = 1 we derive the values of constants, arriving at FK(u)=
fU (v(l-v)yi/Kdv
I f
{v(l-v)yi/Kdv.
For K = 6 (and only for this value of K) solution coincides with Cardy's hypergeometric function. For K < 4 there is no solution, reflecting the fact that SLE is then a.s. a simple curve which a.s. doesn't touch the boundary.
7. Scaling exponents Calculations for SLE like the one above and knowledge about percolation can be combined to establish the values of mentioned scaling exponents. The method is explained in [19], and proceeds as follows.
110
STANISLAV SMIRNOV
For a variety of percolation models Kesten has reduced in [8] most of the predictions about behavior near pc to establishing the so called one- and four-arm exponents for critical percolation. Namely, one has to prove that on the lattice with mesh 5 the probability of a yellow path extending distance > 1 from the J-neighborhood of the origin is x 55/is, while similar probability for two yellow and two blue paths simultaneously is x <54/3, 6 —• 0. Kesten's proof is quite involved and based on differential inequalities. These probabilities can be expressed in terms of the percolation interfaces, and with some technical work, it can be shown that they have the same asymptotics as similar probabilities for the interface scaling limit, which is SLE(6) by [18]. The required 5/48 and 4/3 exponents for SLE(6) are established by Lawler, Schramm, and Werner in [11-13].
8. Conclusions We described advances related to percolation on triangular lattice in the plane. It is interesting that the mathematical progress did not come along the lines of theoretical physics arguments, but from the different new directions. But as the physicists' approaches before, all of the described techniques use conformal invariance in an essential way and therefore are two-dimensional in nature. So unfortunately the progress made does not seem to have implications in higher dimensions. As for the other planar models, the Schramm-Loewner Evolution describes all possible conformally invariant scaling limits for interfaces. Calculation for other values of K is not much different from the case K = 6: most problems can be reduced via stochastic analysis to solving ordinary or partial differential equations. Of course, some of the resulting PDEs might be non-integrable. But it seems that more problems can be solved in this way than by theoretical physics' methods. For example, the so called "backbone exponent" for percolation which was out of reach for physicists was proven by Lawler, Schramm, and Werner [13] to coincide with the main eigenvalue of a certain PDE. Unfortunately, this eigenvalue does not seem to admit a nice formula. However some difficulties might arise with technical points in calculations: SLE(6) shares the locality property of percolation, so writing estimates is easier. Moreover, the relation between various scaling exponents, which was rigorously established for percolation by Kesten is mostly missing for other models, where it might be technically more difficult. The remaining part, establishing that other models have conformally invariant scaling limits, seems even more challenging. Lawler, Schramm, and Werner [14] have shown that the Loop Erased Random Walk converges to SLE(2), while a related model, perimeter of the Uniform Spanning Tree, converges to SLE(8). Recently Schramm and Sheffield defined a new random curve, Harmonic Explorer, which converges to SLE(4). In all these cases, like in percolation, some observables were shown to be discrete harmonic functions with various boundary conditions implying existence of a conformally invariant harmonic limit. There are two differences though: for these models discrete observables turn out to be exactly harmonic (for percolation only approximatively), and the proofs (similar to one sketched above for percolation) rely heavily on Loewner Evolution, whereas for percolation one can construct the scaling limit without appealing to SLE. Despite extensive predictions, similar problems are open for other critical lattice models in
Critical percolation and conformal invariance
111
t h e plane. P r o b a b l y there one should also look for discrete harmonic or analytic observables. A few cases which stand out are: — Other critical percolation models: the interface should also converge to SLE(6). Bond percolation on square lattice and Voronoi percolation share some duality properties of the site percolation on triangular lattice, so one might start with those. — Ising model at critical temperature: the interface should converge t o SLE(3). Here Kenyon's approach to dimer models provides some insight, but technical estimates are still missing. — Self Avoiding R a n d o m Walk: it should converge t o SLE(8/3), which also describes Brownian Frontier or percolation lowest crossing. Detailed study of SLE(8/3) by Lawler, Schramm, and Werner is discussed in Lawler's paper in this volume. There are also conjectures for t h e full spectrum of Q-state P o t t s models a n d FortuinKasteleyn models. We refer t h e reader t o expository works [6,10,20] for more details.
Acknowledgments T h e author is a Royal Swedish Academy of Sciences Research Fellow supported by a grant from the K n u t and Alice Wallenberg Foundation. He was also supported by the Goran Gustafssons Foundation. Many of the discussed results are due to Greg Lawler, Oded Schramm, and Wendelin Werner.
References 1. M. Aizenman, "The geometry of critical percolation and conformal invariance", in STATPHYS 19 (Xiamen, 1995), World Sci. Publishing, River Edge, NJ, 1996, pp. 104-120. 2. M. Aizenman, A. Burchard, "Holder regularity and dimension bounds for random curves", Duke Math. J. 99, 419-453 (1999). 3. M. Aizenman, B. Duplantier, A. Aharony, "Path crossing exponents and the external perimeter in 2D percolation", Phys. Rev. Let. 83, 1359-1362 (1999). 4. J. L. Cardy, "Critical percolation in finite geometries", J. Phys. A 25, L201-L206 (1992). 5. G. Grimmett, Percolation, Springer-Verlag, Berlin, second edition, 1999. 6. W. Kager, B. Nienhuis, "A guide to stochastic Lowner evolution and its applications", J. of Statistical Physics, to appear; arXiv:math-ph/0312056 . 7. H. Kesten, Percolation theory for mathematicians, Birkhaiiser, Boston, 1982. 8. H. Kesten, "Scaling relations for 2D-percolation", Comm. Math. Phys. 109, 109-156 (1987). 9. R. Langlands, Ph. Pouliot, Y. Saint-Aubin, Conformal invariance in two-dimensional percolation, Bull. Amer. Math. Soc. (N.S.) 30, 1-61 (1994). 10. G. F. Lawler, Conformally Invariant Processes in the Plane, book in progress, available from http://www.math.Cornell.edu/~lawler/books.html. 11. G. F. Lawler, O. Schramm, W. Werner, "Values of Brownian intersection exponents I: Halfplane exponents", Acta Mathematica 187 237-273 (2001). 12. G. F. Lawler, O. Schramm, W. Werner, "Values of Brownian intersection exponents II: Plane exponents", Acta Mathematica 187, 275-308 (2001). 13. G. F. Lawler, O. Schramm, W. Werner, "One-arm exponent for 2D critical percolation", Electron. J. Probab. 7, 13 pages (2001). 14. G. F. Lawler, O. Schramm, W. Werner, "Conformal invariance of planar loop-erased random walks and uniform spanning trees", Ann. Prob., to appear; arXiv:math.PR/0112234 .
112
STANISLAV SMIRNOV
15. K. Lowner, "Untersuchungen iiber schlichte konforme Abbildung des Einheitskreises, I", Math. Ann. 89, 103-121 (1923). 16. S. Rohde, O. Schramm, "Basic properties of SLE", Ann. Math., to appear; arXiv:math.PR/ 0106036(2001). 17. O. Schramm, "Scaling limits of loop-erased random walks and uniform spanning trees", Israel J. Math. 118, 221-288 (2000). 18. S. Smirnov, "Critical percolation in the plane: Conformal invariance, Cardy's formula, scaling limits", C. R. Acad. Sci. Paris 333, 239-244 (2001). 19. S. Smirnov, W. Werner, Critical exponents for two-dimensional percolation, Math. Res. Lett. 8, 729-744 (2001). 20. W. Werner, "Random planar curves and Schramm-Lowner Evolutions", in Lecture notes from the 2002 Saint-Flour Summer School, LNM, Springer, to appear; arXiv:math.PR/0303354 .
The energy of charged matter* JAN PHILIP SoLOVEjt
(Princeton)
In this talk I will discuss some of the techniques that have been developed over the past 35 years to estimate the energy of charged matter. These techniques have been used to solve stability of (fermionic) matter in different contexts, and to control the instability of charged bosonic matter. The final goal will be to indicate how these techniques with certain improvements can be used to prove Dyson's 1967 conjecture for the energy of a charged Bose gas — the sharp N7/5 law.
1. Introduction It is my aim in this contribution to review the main techniques developed to study the problem of stability of matter or rather stability or instability of ordinary matter. By ordinary matter I mean a macroscopic collection of charged particles (nuclei and electrons) interacting solely through electrostatic or electromagnetic forces. The problem of stability of (ordinary) matter interacting through electrostatic Coulomb forces was first formulated by Fisher and Ruelle [18] and proved in the seminal papers of Dyson and Lenard [12,13], although, of course the problem of stability of individual atoms goes back to the origin of quantum mechanics. An important assumption needed is that either the negatively or the positively charged particles are fermions, i.e., obey the Pauli exclusion principle. Without the fermionic assumption there is no stability as proved by Dyson in [11]. The importance of the Pauli exclusion principle for stability had been pointed out in the celebrated work of Chandrasekhar [5] on gravitational collapse and stability of white dwarfs. It was, however, not until the work of Dyson and Lenard that the importance of the exclusion principle for the stability of ordinary matter was emphasized. In [11] Dyson makes a very precise conjecture regarding the nature of the instability without the exclusion principle. It is my ultimate goal here to discuss the proof of this conjecture, which for the main part, is joint work with E. H. Lieb. Since the work of Dyson and Lenard there has been a lot of activity in the area of stability of matter. Most celebrated is the work of Lieb and Thirring [31] giving an elegant proof with a bound of the correct order of magnitude. Several variations of the model have been studied. Relativistic effects and magnetic interactions have been included. I shall review some of these results below. In recent years the attention has turned to studying the effect of quantizing the electromagnetic field, what is often referred to as non-relativistic quantum electrodynamics. I will not get into this recent development here. •Work partially supported by NSF grant DMS-0111298, by EU grant HPRN-CT-2002-00277, by MaPhySto — A Network in Mathematical Physics and Stochastics, funded by The Danish National Research Foundation, and by grants from the Danish research council. © 2003 World Scientific. This article may be reproduced in its entirety for non-commercial purposes. tOn leave from Department of Mathematics, University of Copenhagen, Universitetsparken 5, DK-2100 Copenhagen, DENMARK.
113
114
JAN PHILIP SOLOVEJ
2. Charged m a t t e r in q u a n t u m mechanics The systems we discuss here are all given by iV-particle Hamiltonians (N being a positive integer) of the form
j=l
l
'
l
J
'
3
Here Zj £ R is the charge of particle j , Xj € M is the coordinate of particle j , and Tj is the kinetic energy operator for particle j . We shall here consider the kinetic operator to be of one of the following four types. (1) The standard non-relativistic kinetic energy T
3 = 3
-7T—&3-
2mj
3
(2) The relativistic kinetic energy
Tfel = ^/-c2Aj + my - mjC2. (3) The magnetic kinetic energy
(4) The magnetic Pauli kinetic energy
Above, rrij > 0 refers to the mass of particle j , A : R3 —> R3 is the vector potential for the magnetic field, and
The Pauli kinetic energy acts on spinor valued functions (spin 1/2 particles), see the discussion of the relevant Hilbert spaces below. The term U in (1) represents the magnetic field energy
U = ^- f |Vx A{x)\2dx. °7T JR3
We are using units in which Planck's constant h = 1 and the fundamental unit of charge e = 1. In these units the physical value of the speed of light c is approximately 137 or more precisely the reciprocal of the fine structure constant a. Many objections may of course be raised about the Hamiltonian HN- Even with the relativistic kinetic energy operator the Hamiltonian is not truly relativistically invariant. For spin 1/2 fermions the appropriate kinetic energy operator would rather be the Dirac operator. Also the magnetic field is treated classically as opposed to as a quantized field.
The energy of charged matter
115
Unfortunately, we do not today know how to describe a mathematically consistent fully relativistic model in which to even formulate the problem of stability. As is well known the Dirac operator is not bounded from below and introducing quantized fields requires a renormalization scheme. These questions are being actively studied at present, but we shall, as already mentioned, not discuss them here (see [4,16,25]). One approach to the Dirac operator would be to restrict to positive energy solutions for the free Dirac operator. For the non-magnetic case this would lead to a problem essentially identical to the problem with the relativistic kinetic energy T R e l . In the magnetic field case this is somewhat more difficult (see [27]). We now turn to the important question of which Hilbert Space the operator HN acts on. The operator will be unbounded, but we shall always consider the Priedrichs's extension of the restriction to C°° functions with compact support. Since the problem of stability is a question of lower bounds, the Priedrichs's extension is the correct setting. We begin by discussing the Hilbert spaces for just one particle, the one-particle space. We consider, in general the one particle spaces to be the square integrable functions corresponding to particles with q internal states, i.e., H\ = L 2 (R 3 ;C 9 ), where the non-negative integer q may be different for the different particles. In terms of spin, this corresponds to particles of spin (q — l ) / 2 . The many particle space of interest is then wphysica1=
( /\Hl)®[ V /
(2)
where K is a positive integer. In order for the Hamiltonian HN to act on the above Hilbert space we must require that all the masses and charges of the particles j = 1 , . . . , TV — K are the same. Put differently, these particles are identical fermions. For simplicity we assume that rrij = 1 and Zj = — 1, for all j = 1 , . . . , N — K. (Except for the sign of charge this simply amounts to a choice of units). Thus the fermions are negatively charged. We assume the remaining particles to be positively charged, i.e., Zj > 0 for j = N — K + 1 , . . . ,7V. Moreover, all the fermions must be described by the same kinetic energy operator. The positively charged particles could, in principle, have any kinetic energy operators and even simply vanishing kinetic energy (equivalent to infinite mass). Of course, the Pauli kinetic energy operator requires the spin of the particle to be 1/2, i.e., q = 2. Since fermions have half-integer spin one would maybe like to restrict the negatively charged fermions accordingly, i.e., the corresponding q to be even. For the discussion here this restriction, however, plays no role. The reader may ask why one does not consider the situation when all or some of the positively charged particles are fermions or bosons. In fact, stability in this case is a simple consequence of the situation discussed above, since considering fermions or bosons simply means restricting to certain subspaces. To study what happens when we ignore Fermi statistics all together we shall consider all particles to be described by the standard non-relativistic kinetic energy T and all to have mass rrij = 1. Moreover, we assume that all the charges Zj are either +1 or —1. We indeed consider the charge of each particle to be a variable. Put differently, we consider the TV-particle Hilbert space
116
JAN PHILIP SOLOVEJ
N
HN =
(3)
x{l,-l»,
where the set { 1 , - 1 } refers to the charge variable. We define the ground state energy as the bottom of the spectrum of the Hamiltonian. We will not here address the question of whether this is and eigenvalue or not, i.e., whether there really is a ground state. When a magnetic field is present we consider the vector potential as a dynamic variable and we also minimize over it. Thus -^physical
= inf inf speo^phys C*\HN
— inf inf < — '
_,—
feffnw
physical JV
\{0}
(4)
and EN = inf specWjv HN = mil
* 6 C 0 ° ° n ^ \ {0}
}•
(5)
Here CQ° refers to smooth functions of compact support. In (4) we minimize over all smooth vector potentials for which |V x A\ € L 2 (R 3 ). Of course the minimization over the vector potential is only relevant if the kinetic energy really depends on A otherwise the minimum simply occurs for A = 0. The energy E^ on the space where we consider the charges as variables is precisely the same as we would get if we calculate the energy for any fixed choice of charges and afterward minimize over this choice. Note, in particular, that the charge variable commutes with the Hamiltonian H^. The reason for including the charges as variables is that the Hamiltonian is then fully symmetric in all N particles and not just in the positively or negatively charged particles separately. Strictly speaking the energies in (4) and (5) are only defined as the bottom of the spectrum if the rightmost expressions are finite (i.e., not —oo). It is in this case that we can define the operators as Priedrichs's extensions. The property that the ground state energies are bounded below is often referred to as stability of the first kind. It holds except in the cases when the fermions are described by the kinetic energy operators T R e l or Tp&uh. In these two cases stability of the first kind requires that max{zj}/c = max.j{zj}a is small enough in the case of T R e l or that q vaaxj {ZJ} / c2 = qmax.j{zj}a2 is small enough in the case of ^Pauli
3. Stability and instability of matter Stability of matter is a stronger statement than stability of the first kind. It means that there exists a constant C G R such that ^physical >
_CN
(6)
for all N, i.e., that the total binding energy per particle is bounded. Theorem 3.1 (Stability of Matter). On the space W^ hysical stability of matter (6) holds with a constant C that depends only on q and max{zj} if any one of the following situations hold:
The energy of charged matter
117
— the ferrnions are described by the standard non-relativistic kinetic energy T or by the magnetic kinetic energy T M a s ; — the ferrnions are described by the relativistic kinetic energy T R e l and qa(— qc~1) is small enough and ma,Xj{zj}a < 2/n; — the ferrnions are described by the Magnetic Pauli kinetic energy T P a u l i and qa{= qc~l) and qmaxj{zj}a2 are small enough. The case of the standard non-relativistic kinetic energy is the situation first settled by Dyson and Lenard [13] and later by Lieb and Thirring [31] with a constant of the correct order of magnitude. There was also a proof by Federbush [14]. Stability for the magnetic kinetic energy is an immediate consequence of the non-magnetic case and the diamagnetic inequality. In fact, the ground state energy is achieved without a magnetic field. Stability for the relativistic kinetic energy was first solved by Conlon [6] and then improved by Fefferman and de La Llave [17]. The version formulated here (which is sharp with respect to the bound 2/n) is due to Lieb and Yau [32]. The case of one electron and one nucleus had been studied previously by Herbst [22] and Weder [36] and the case of one electron and several nuclei by Lieb and Daubechies [10]. A proof of stability for the magnetic Pauli kinetic energy was first published by Lieb, Loss, and Solovej [26], but had been previously announced, (although with weaker bounds) by Fefferman (only later published in [15]). The result as formulated here is optimal in the sense that if either qa or qva.ax.j{zj)al are large then stability does not hold. This was realized in a series of papers [20,24,33]. In the case without Fermi statistics Dyson [11] proved that there is no stability and he, in fact, conjectured the following result. Theorem 3.2 (The asymptotic energy of a charged Bose gas). For the energy EN defined in (5) we have the asymptotics
j^= i " f {5./i v ^-*'/>h o '/* 2 = 1 }'
(7)
where
The reader may wonder why the theorem refers to a charged Bose gas when, in fact, no statistics was enforced in the definition of the Hilbert space HN- The reason is that the ground state energy on HN is the same as the ground state energy one would get if restricted to the fully symmetric subspace (i.e., the bosonic subspace). This is a fairly simple consequence of the facts that the Hamiltonian is fully symmetric in all variables and that the expected energy of any trial state does not increase if we replace it by its absolute value. It is a simple consequence of the classical Sobolev inequality (see below) that the right side of (7) is finite. Dyson proved in [11] that EN < —CN7/5, but not with the correct constant. His method and conjecture was inspired by the Bogolubov approximation, which had been previously used by Foldy [19] to calculate the energy asymptotics for the high density onecomponent Bose plasma (bosonic jellium). The Bogolubov approximation is usually applied to bosonic systems, it is therefore important that we can think of our system as such.
118
JAN PHILIP SOLOVEJ
In their original work on stability of matter [12] Dyson and Lenard obtained as a corollary a lower bound on the Bose energy EN > —C'N5/3 (see also Brydges and Federbush [3]). The correct exponent 7/5 was however first proved by Conlon, Lieb and Yau in [7], where one finds the lower bound E^ > —C"N7/5, but again not with the correct constant. Dyson's conjecture was finally proved in two papers establishing respectively an upper and a lower bound. An asymptotic lower bound of the form (7) was proved by Lieb and Solovej [30]. The corresponding asymptotic upper bound will appear in Solovej [34]. The rigorous calculation of Foldy's high density asymptotics for the energy of the onecomponent plasma can be found in Lieb and Solovej [29] (lower bound) and Solovej [34] (upper bound). The main goal here is to review the techniques used to prove Dyson's conjecture and to sketch the main steps in the proof. Many of the techniques used were developed to study stability of matter and I will therefore use the opportunity to briefly review these applications as well. The main strategy in proving the lower bounds in Theorems 3.1 and 3.2 is to estimate the Hamiltonian below by a simplified Hamiltonian for which one can get a fairly explicit lower bound. For Theorem 3.1 the simplified Hamiltonian is a non-interacting mean field type Hamiltonian. For Theorem 3.2 the simplified Hamiltonian is a non-particle conserving Hamiltonian, which in the language of second quantization is quadratic in creation and annihilation operators (a quadratic Hamiltonian). We first discuss how to treat the simplified Hamiltonians.
4. One-particle operators and quadratic Hamiltonians The most basic estimate on the ground state energy of a one-particle Hamiltonian, given in the next theorem, is a simple consequence of the Sobolev inequality / |VV>|2 > C(J |V'|6)1^3) C > 0 for functions i/) on K3. Theorem 4.1 (Sobolev estimate). Let V be a locally integrable function on R 3 , such that the Schrodinger operator —A + V may be defined as a quadratic form on C°° functions with compact support. Then
-\A + V>-CsJ\V(x)t/2dx where Cs > 0 and \t\- = max{—1,0}. This theorem of course immediately implies that the many-body operator N
1
sr--Ai+v(Xi) i=l
z
has the lower bound -CsN J \V(x)\_ dx. On the fermionic subspace we, however, have the much stronger Lieb-Thirring inequality [31].
The energy of charged matter
119
Theorem 4.2 (Lieb-Thirring inequality). On the fermionic space /\ integer) we have the operator inequality
H\ (M a positive
M
J T - 1 A , + V(Xi) > -Cvrq J
\V(x)fJ2dx,
where CLT > 0 V is a locally integrable function on R 3 . For the relativistic operator T R e l and the magnetic Pauli operator T P a u h one has similar inequalities on the space f\M H\{q = 2 for T P a u l i ). For T R e l (with m = 1) Daubechies [9] proved M
J2T™
+ V(xi)
> -Cuqc~3
/ \V{x)\idx
- Mc2.
(8)
For T P a u l i (with m = 1 and 2 = 1) the inequality AT ^-7;Pauli
+
K(a
* =1
,.)
/" 5 /2
3
> -Cft's J l ^ ( ^ ) | - ^ - ^ s c - /
2
ft \3/4 2 (^y |V x A | ^ j
f t \ X/4 (^y \V(x)\idxj
(9)
can be found in [26]. When studying bosonic systems one may, as explained above, use the Sobolev estimate to get a lower bound on the energy. This will in general not give the best dependence on the number of particles. E.g., for the charged gas this would lead to a lower bound —CN5^3 as in [3,12]. To get the sharp dependence —CN7/5 a more precise treatment of the interplay between the kinetic energy and the Coulomb potential is needed. This brings us to Bogolubov's method for calculating the energy of a Bose gas. In the Bogolubov approximation one assumes that most particles form a Bose condensate in a momentum zero state. The main contribution to the ground state energy will come from the correlation between two non-condensed particles with opposite momenta. This effect is most easily explained using the formalism of creation and annihilation operators. The following theorem gives a rigorous formulation of Bogolubov's method in the simple case used by Foldy in [19]. Theorem 4.3 (Bogolubov's method). Assume thatb± ± are four (unbounded) operators defined on a common dense domain on a Hilbert space, such that their adjoints are also defined on the same domain. Assume moreover that in the sense of quadratic forms on this domain we have the commutator identities [b*T,,z,XA
= [&r',,'A,,] = [h>,-,K,+] = [bT',+X,-]
=0,
forallz,z',T,T'
= ±,
and [6T)Z,b*iZ] < 1,
for allz,r
= ±.
Then for all t,g+,g- > 0 we have (again in the sense of quadratic forms) 1
XI r,2=±l
b
r,zK,z+
Yl
V9T9^zz'(b*+,zb+,z'+b*-,zb-,z'+K,zb-,z'
+b+,zb-,z')
z,z'=±l
> -(t + g++g.) + y/{t + g++g-)2 - {g+ +g-)2.
120
JAN PHILIP SOLOVEJ
The operator inequality in this theorem follows simply by completing squares (see also [30]). The importance of the lower bound is emphasized by the fact that it is sharp if the operators b±,± axe truly annihilation operators. In Bogolubov's method the operators are however not exactly annihilation operators. Rather one should think of b±tZ as the operator that annihilates a particle with charge z and momentum ± (here only the sign of the momentum is important) and creates a particle in the condensate (i.e., with momentum 0) with charge z. The value t represents the kinetic energy of the particles with momentum ± and g± represent the strength of the Coulomb interaction.
5. Reduction to a simplified Hamiltonian The reduction to a simplified Hamiltonian requires controlling the Coulomb interaction. It is often convenient to replace the Coulomb potential by the Yukawa potential
W
=* ^ ,
M>O.
Replacing the Coulomb potential YQ by a Yukawa potential Y^ with fj, > 0 amounts to introducing a long distance cutoff in the potential. It is easy to control this replacement since YQ — VM has positive Fourier transform (is of positive type). Hence N
J2
2
ZiZjiYo-Y^Xi-Xj)^-^^,
l
t=l
since (Y0 - VM)(0) = fi. In order to bound the many-body Hamiltonian H^ below by a one-body Hamiltonian one must estimate the two-body potential by a one-body potential. Theorem 5.1 (Onsager's correlation estimate). Given particles with charges at positions xi,..., XN, let
zi,...,Zff
Di = min{|:rj — Xj\ | ZiZj < 0 } ,
i.e. Di is the shortest distance from, the particle at Xi to a particle with the opposite charge. Then N
( 1
1
\
N ziZj Y^Xi - Xj) > - ]T>? ^12 (AM)2 + 2 (AM) + IJ Y^Di). l
Above Di depends only on Xi and on the positions of all the particles with the opposite charge of the particle at a^. This theorem allows us to separate the original Hamiltonian HN in a one-body Hamiltonian for all the negatively charged particles (with a one-body potential depending on the positions of all the positively charged particles) and a one-body Hamiltonian for all the positively charged particles (with a one-body potential depending on the positions of all the negatively charged particles). The proof of this theorem essentially goes back to Onsager [35] (for (i = 0), who addressed a stability question for classical charged matter. In [30] it is being used to introduce a short distance cutoff in the potential (i.e., to replace YQ hy YQ — Y^ for large /x). The one-body potential is controlled using the Sobolev estimate Theorem 4.1.
The energy of charged matter
121
If both the negatively and positively charged particles are fermions, we may use Onsager's estimate together with the Lieb-Thirring inequality in Theorem 4.2 to prove stability of matter. This special case was treated by Dyson and Lenard in their first paper [12], where they also used a version of Onsager's correlation estimate, but did not have the Lieb-Thirring inequality at their disposal. In order to prove stability of matter as formulated in Theorem 3.1 one may use the following stronger correlation estimate due to Baxter [1]. Theorem 5.2 (Baxter's correlation estimate^.> Assume as before that all negatively charged particles have charge —1 Then
E T^-^- E (n-z^iziVDr1. l
l
3
>
i=l,Zi<0
The importance here is that the sum on the right is only over negatively charged particles. Thus the right side is a one-body potential for the negatively charged particles thinking of the positively charged particles as fixed. A sharper (although more complicated to state) estimate was given by Lieb and Yau in [32]. The estimate of Lieb and Yau is sharper in the sense that the coefficient 1 + 2z above may be changed to z, but additional bounded errors are needed. Remark 5.1 (The proof of stability of matter). A proof of stability of matter in the standard non-relativistic case may proceed as follows. We use Baxter's estimate to arrive at N
HN>
^2
(Ti-(l
+ 2max{zi})(D-1-R-1))-N(l
+
2msxx{zi})R-1
i=l,zt<0
> - C i V ( l + 2max{z i }) 2 , where we have used the Lieb-Thirring inequality Theorem 4.2 for the potential V = (l + 2ma,x{zi})(-D-1
+ i?" 1 )
and we chose R ~ (1 + 2max{zj}) _ 1 . The proof of stability of matter by Lieb and Thirring [31] used the No-binding Theorem of Thomas-Fermi theory (see Lieb and Simon [28]) instead of Baxter's estimate. If the fermions are described by the relativistic kinetic energy T R e l or the magnetic Pauli operator T P a u h the proof of stability is not quite as simple. Note in particular that a potential that has a Coulomb singularity is not integrable to the fourth power and one can therefore not directly use the inequalities (8) or (9). The reader is referred to [32] and [26] for the detailed proofs of Stability in these cases. 5.1. Many-body localization techniques In many aspects of many-body theory the method of localizing has been a very useful technique. Dyson and Lenard used it heavily in their papers [12,13]. We here discuss the two problems connected with localizing a charged system, i.e., the localization of the interaction and the localization of the kinetic energy.
122
JAN PHILIP SOLOVEJ
The simplest way to approach the localization of the kinetic energy is to do a Neumann localization. In fact, let — A ^ u be the Neumann Laplacian for a unit cube centered at y GM3. Then in terms of quadratic forms defined on smooth compactly supported functions we may write f ,,,-, -A= / -A^dy, (10) where —A is the Laplacian on all of R 3 . Of course, there is a rescaled version to cubes of other sizes. The structure of this identity, i.e., that we have written the original operator as an integral over localized operators is characteristic for the way we shall write localizations in this section. The idea is then that for each value of the integration parameter one has a localized problem that one will estimate. Afterward the estimate is integrated. Another approach is to use a smooth localization. Let x be a C°° function with compact support. Let Xyix) = x(x ~ v)- Then we have the identity
( | x2) (-A) = j
-Xy&Xy dy-J VX2,
(11)
which is often referred to as the IMS localization formula (see [8]). Instead of writing the localization as an integral one often uses a sum instead, but then one must introduce a partition of unity. For the problem of the charged Bose gas it turns out that we cannot use either of these localization methods. In this case we cannot completely localize the kinetic energy operator. We must still have some kinetic energy to control the variation between localized regions. The solution is to write the kinetic energy as a sum of a a high energy part and a low energy part. We then localize only the high energy part and use the low energy part to control the large distance variation. The result can be found in [30]. We shall state a simplified version here. Assume that \ with 0 < x < 1 is smooth has support in a unit cube centered at the origin in R 3 and that i / l - x 2 is also smooth. Let as before Xy(x) = x(x ~ ]))• Let a*(y) be the creation operator for a particle in a normalized constant function in the unit cube centered at y e R 3 . Let Vy be the orthogonal projection operator projecting on functions orthogonal to constants in the cube. Then for any bounded set fid3 and any 0 < s < 1 we have the following many-body kinetic energy localization estimate on the symmetric tensor product space 0 s y m I/ 2 (R 3 ) (the fully symmetric subspace of the full tensor product space)
(1 + £ (x, s)) £
-A, > ^
^v
• ] C ( ^ « o ( i / + ej)ao(v + ej) + 1/2 - y/<%(y)aQ(y) + 1/2
dy
3=
3vol(fi),
(12)
where ei,e 2 ,e3 is the standard basis in R 3 and e(x,s) —» 0 as s —» 0. Note that this is an explicit many-body bound in the sense that the right side of the operator inequality is not a one-body operator. The above estimate generalizes immediately to the situation where a charge variable is present. In this case there should be two creation operators, one for charge
The energy of charged matter
123
+ 1 and one for charge —1. There are rescaled version of (12), which hold if we replace the unit cube by cubes of other sizes. The first sum on the right side of (12) is the localization of the high energy part of the kinetic energy. High energy means much larger than s - 2 . The second sum on the right side of (12) represents the kinetic energy due to the variations between localized regions. Using the Cauchy-Schwarz inequality one sees that for a state *€®£mL2(R3)wehave
*> \Ja*o(y + ejK(y + ej) + 1/2 - y/<%(y)a0{y) + 1/2 j * > (V(*. K(y + tjH(y + **) +1/2)*) -
/ v
(*, (oSfoKte) +1/2)*)) • (13)
This allows us to think of the last term on the right side of (12) as a discrete Laplacian acting on the function y -» ^(*,(a5(y)Oo(y) + l / 2 ) * ) .
(14)
For the Bose gas we shall conclude that most particles are in the constant function state (the zero momentum state). More precisely, this means that ($,a,Q(y)a0(y)'$) is almost the expected total number of particles in the unit cube centered at y. The expectation of the last term on the right side of (12) will therefore essentially give a contribution equal to the Laplacian of the square root of the density (assuming that we can approximate the discrete Laplacian by the continuous Laplacian). We finally come to the discussion of the localization of the interaction. For the Coulomb or Yukawa interaction this can be done using a method of Conlon, Lieb, and Yau [7]. Let as before x be a smooth function supported in the unit cube centered at the origin. There is an LJ > 0 depending on \ s u c n that for all \i > 0 we have
a
X2J
53
z z
i jYn(xi-Xj)>
53
Xyi^Y^+Uxi-x^Xyix^dy-Nw.
(15)
Note that the effect of the localization function Xy o n the right side is that for each fixed value of the integration parameter y only particles that live in the unit cube centered at y interact with each other. Again it is easy to see how this estimate changes under rescaling. A very elegant version of this estimate was given by Graf and Schencker [21].
6. T h e lower b o u n d in Dyson's conjecture We shall now describe the main steps in the proof of the lower bound in Theorem 3.2. The reader should look in [30] for details. In each step we shall ignore certain errors and we shall not explain in details how these errors are estimated. In the detailed proof the errors are, in fact, only estimated at the very end when the main contributions have been identified. It is first of all important to understand that there are two relevant length scales in the problem. A long scale LQ, which is the diameter of the Bose gas and a short scale £o which is the distance on which the Bogolubov pairs interact. It will turn out that LQ ~ N~1^5 and to ~ N-V5.
124
JAN PHILIP SOLOVEJ
Step 1: We first localize the whole system into a large cube of size L » LQ ~ N~1/5. On the other hand L should not be too large in order to allow us to control the volume error in (12). This first localization is done using the Conlon-Lieb-Yau estimate (15) and an IMS-localization (11) of the kinetic energy. Note that after rescaling the error in (15) will be 2 2 2 NUJ/L < UJN6/5. The IMS localization error (i.e., N / ( V x ) / / X ) will be NL~ < AT 7 / 5 . Step 2: We do a second localization into boxes of size £, where £Q <SL £ <§C L0. This localization is done using the many-body kinetic energy localization (12) together with the Conlon-Lieb-Yau estimate (15). This time the error in using (15) is Nui/£ <§: NU)/£Q ~ UJN7'5.
The result of this localization is that the total ground state energy is estimated below by a sum of two terms. One term is the discrete Laplacian term discussed in the previous section. The other term is the local energy, which will eventually lead to the effective Hamiltonian for which we may apply the Bogolubov approximation. Step 3: Before proceeding with the analysis of the local short scale energy we introduce long and short distance cutoffs in the interaction. This is done as explained in the beginning of Section 5. Step 4: This is the final step in the reduction to the effective Hamiltonian. The localized two body potential is of the form W(x,x') = Xy(x)V(x — x')xy(x'), where V is a cutoff Coulomb potential, i.e, V = Y^ — Yv for some appropriate \x
5Z
E
zz
'{ua®Uf3,Wu0®us)a*azapz,aSz,ayz.
z,z'=±l af3-8
The fourth step in the proof is to show that one may ignore (for a lower bound) all terms in this sum which do not have precisely two of the parameters a, /3,7,5 equal to zero. Moreover one may also ignore terms for which a = j = 0 or (3 = 5 = 0. This step in the proof is rather technical and uses heavily that we have been able to introduce cutoffs in the potential. The resulting two body interaction may be written
577^3 /
V
^^^'(KPJ+P^
+ b-PJ-P,z' + hXp,zh-v,f + b+PJ-p,*') * '
where V denotes the Fourier transform of the potential V, u± denote the total number of positively and negatively charged particles in the cube respectively, and b;,z =
(£3vz)-1/2az(VyXyeinaoz,
where a*(/) creates a particle of charge z in the state / and as before Vy is the projection orthogonal to constants. It is easy to see that for fixed p the operators &±p>z satisfy the conditions in Theorem 4.3 on the Foldy-Bogolubov method. It is also easy to see that the localized kinetic energy resulting from (12) may be estimated
The energy of charged matter
125
below by
where t(p) = |£ 3 p 4 (p 2 + is-2)-1. It is now immediate from the Foldy-Bogolubov method in Theorem 4.3 that the local ground state energy is estimated below by 2(2TT) :
/ (t(P) + Vyv[p)) - ^(t(Py + 2t(p)vyv(p)dP
where vy = u+ + v~ (which depends on the location of the cube, although this had been suppressed for u^). If we now ignore the various cutoffs (which may be justified) and simply replace t(p) by ^£3p2 and V(p) by 4tr/p2 we see that the above integral gives —Jvy ^~ 3 / 4 = -J(vy/t3)5/4£3, with J defined in Theorem 3.2. Step 5: In order to control the errors encountered in reducing to the effective Hamiltonian as well as in treating the discrete Laplacian term we must conclude that most particles are in the condensate, as explained in the previous section. To do this, we shall localize the number of excited particles. To be more precise, we may think of the Hilbert space as being a direct sum over subspaces with a definite number of particles in the condensate. The Hamiltonian is not diagonal in this representation. We want to conclude however that restricting any given state to only a small finite number of the subspaces will not significantly change its expected energy. This is achieved using the following result from [29]. Theorem 6.1 (Localization of large matrices). Suppose that A is an NxN Hermitian matrix and let Ak, with k = 0,1,...,N — 1, denote the matrix consisting of the kth supraand infra-diagonal of A. Let ip € CN be a normalized vector and set dk = (ip,Akil') and A = (ip,Aip) = ]Lfc=o dk- (ty need not be an eigenvector of A.) Choose some positive integer M < N. Then, with M fixed, there is some n € [0, N — M] and some normalized vector <j> € C ^ with the property that <j)j = 0 unless n + 1 < j
M-l
Af-l
2
(0,^)
(16)
k=M
where C > 0 is a universal constant. (Note that the first sum starts with k = 1.) Step 6: The final step is to combine the two parts of the energy described in Step. 2. The local energy was (approximately) bounded below by —J(vy/(.3)5^4£3 which when integrated over y and normalized by the volume of the cube gives
-jf
(-y/^f'dy. JR3
The other part of the energy is essentially the kinetic energy of the function in (14) (where we had actually ignored the charged variable). Using the result of Step 5 that most particles are in the condensate we may write this kinetic energy as (approximately)
v
U>( ^
/3
2
dy.
126
JAN PHILIP SOLOVEJ
If we use that the total number of particles is N we have the condition that /M3 uy/£3 dy = N. Thus if we define >(y) = N~4/5^l
i/5y/£3 we see by a straightforward scaling argument
that J 4>(x)2dx = 1 and that the ground state energy is approximately bounded below by N7/5 (\
f
(V
Mx))5'2
dy
which is of course bounded below by minimizing over all <j> as in (7).
7. The upper bound in Dyson's conjecture To prove an upper bound on E^ of the form given in Dyson's conjecture Theorem 3.2 we shall construct a trial function using as an input a minimizer <j> for the variational problem on the right side of (7). That minimizers exist can be seen using spherical decreasing rearrangements. Define <j>o(x) — N3/w
*=X[{l-\i)1/AexJ-\l
+ \0aZ+ + \0al_-
£
£
^ « ' < z < « < ) |0),
(17)
where a*a z is the creation of a particle of charge z = ± 1 in the state (f>a, |0) is the vacuum state, and the coefficients Ao, Ai,... will be chosen below satisfying 0 < Aa < 1 for a ^ 0. It is straightforward to check that \P is a normalized function. Dyson used a very similar trial state in [11], but in his case the exponent was a purely quadratic expression in creation operators, whereas the one used here is only quadratic in the creation operators a*az, with a ^ 0 and linear in a^±. As a consequence our state will be more sharply localized around the mean of the particle number. Consider the operator oo
a=l
,2 a
A straightforward calculation of the energy expectation in the state VP gives that ( *, £
HNV ) = X20 /"(V0o) 2 + Tr ( I T ) + 2A* Tr ( / c ( r - v T ( r + l ) ) ) ,
(18)
where K. is the operator with integral kernel K{x, y) = (t>0{x) \x - 2/| _1 <j>0{y). Moreover, the expected particle number in the state \£ is 2AQ + Tr(r). In order for * to be well defined by the formula (17) we must require this expectation to be finite.
The energy of charged matter
127
Instead of making explicit choices for the individual functions
(2TT)- 3
/
JWxR3
f(u, \p\)V£0 \0U
where P f is the projection orthogonal to (j>o, ^ l P ( i ) = exp(ipx) xe(x - u), and
1/
^ + IftrAfoodQ'
_A
We note that T is a positive trace class operator, T<po = 0, and that all eigenfunctions of r may be chosen real. These are precisely the requirements needed in order for V to define the orthonormal family (f>a and the coefficients Aa for a ^ 0. We use the following version of the Berezin-Lieb inequality [2,23]. Assume that £(£) is an operator concave function on K + U {0} with £(0) > 0. Then if Y is a positive semi-definite operator we have Tr(YZ(T)) > (27r)- 3 |e(/Kbl)) {KvV&YV&B^) dudp.
(19)
We use this for the function £(t) = ^/t(t+l). HY = I then (19) holds for all concave function £ with £(0) > 0. Of course, if £ is the identity function then (19) is an identity. This reduces proving an upper bound on the energy expectation (18) to the calculations of explicit integrals. After estimating these integrals one arrives at the leading contribution (in TV) A
2
/ ^ )
+ fj
2
( 5 P 2 + 2A O 2 0 O ( U ) 2 ^) f(u, \p\) - ^2\l4>0(ufv,f(u,
\p\)(f(u, \p\) + 1) dpdu
= A2|(V^0)2-j/(2A2)5/^/2, where J is as in (7). If we choose A0 = \/N/2 we get after a simple rescaling that the energy above is TV7/5 times the right side of (7) (recall that
128
JAN P H I L I P SOLOVEJ
We construct a trial function \&' as above, b u t with an expected particle number N' chosen appropriately close t o but slightly smaller t h a n N. Using t h a t we have a good lower bound on the energy EN for all N we may, without changing the energy expectation significantly, replace \1>' by a normalized wave function ^ t h a t only has particle numbers less t h a n N. Since the function N H-> E^ is a decreasing function we see t h a t the energy expectation in the state \I> is, in fact, an upper bound t o E^.
References 1. J. R. Baxter, "Inequalities for potentials of particle systems", Illinois J. Math. 24, 645-652 (1980). 2. F. A. Berezin, Izv. Akad. Nauk, ser. mat, 36 (1972). English translation: USSR Izv. 6 (1972). and F. A. Berezin, "General concept of quantization", Coram. Math. Phys. 40, 153-174 (1975). 3. David Brydges, Paul Federbush, "A note on energy bounds for boson matter", Jour. Math. Phys. 17, 2133-2134 (1976). 4. Luca Bugliaro, Jiirg Frohlich, Gian Michele Graf, "Stability of quantum electrodynamics with nonrelativistic matter", Phys. Rev. Lett. 77, 3494-3497 (1996). 5. Chandrasekhar, Subramanyan, Phil. Mag. 11, 592 (1931). 6. Joseph G. Conlon, "The ground state energy of a classical gas", Comm. Math. Phys. 94, 439458 (1984). 7. Joseph G. Conlon, Elliott H. Lieb, Horng-Tzer Yau, "The AT7/,S law for charged bosons", Comm. Math. Phys. 116, 417-448 (1988). 8. H. L. Cycon, R. G. Froese, W. Kirsch, B. Simon, "Schrodinger operators with application to quantum mechanics and global geometry", in Texts and Monographs in Physics, SpringerVerlag, Berlin, 1987. 9. Ingrid Daubechies, "An uncertainty principle for fermions with generalized kinetic energy", Comm. Math. Phys. 90, 511-520 (1983). 10. Ingrid Daubechies, Elliott H. Lieb, "One-electron relativistic molecules with Coulomb interaction", Comm. Math. Phys. 90, 497-510 (1983). 11. Freeman J. Dyson, "Ground state energy of a finite system of charged particles", Jour. Math. Phys. 8, 1538-1545 (1967). 12. Freeman J. Dyson, Andrew Lenard, "Stability of matter. I", Jour. Math. Phys. 8, 423-434 (1967). 13. Freeman J. Dyson, Andrew Lenard, "Stability of matter. II", Jour. Math. Phys. 9, 698-711 (1968). 14. Paul Federbush, "A new approach to the stability of matter problem. I, II" Jour. Math. Phys. 16, 347-351 (1975); ibid. 16, 706-709 (1975). 15. Charles L. Fefferman, "Stability of matter with magnetic fields", CRM Proc. Lecture Notes 12, 119-133 (1997). 16. Charles Fefferman, Jiirg Frohlich, Gian Michele Graf, "Stability of ultraviolet-cutoff quantum electrodynamics with non-relativistic matter", Comm. Math. Phys. 190, 309-330 (1997). 17. Charles Fefferman, Rafael de la Llave "Relativistic stability of matter. I", Rev. Mat. Iberoamericana 2, 119-213 (1986). 18. Michael Fisher, David Ruelle, "The stability of many-particle systems", Jour. Math. Phys. 7, 260-270 (1966). 19. Leslie L. Foldy, "Charged boson gas", Phys. Rev. 124, 649-651 (1961); errata, ibid. 125, 2208 (1962). 20. Jiirg Frohlich, Elliott H. Lieb, Michael Loss, "Stability of Coulomb systems with magnetic fields. I. The one-electron atom", Comm. Math. Phys. 104, 251-270 (1986). 21. Gian Michele Graf, Daniel Schenker, "On the molecular limit of Coulomb gases", Comm. Math. Phys. 174, 215-227 (1995).
The energy of charged matter
129
22. Ira W. Herbst, "Spectral theory of the operator (p 2 + m 2 ) 1 / 2 - Ze2/r", Comm. Math. Phys. 53, 285-294 (1977). 23. Elliott H. Lieb, "The classical limit of quantum spin systems", Comm. Math. Phys. 3 1 , 327-340 (1973). 24. Elliott H. Lieb, Michael Loss, "Stability of Coulomb systems with magnetic fields. II. The many-electron atom and the one-electron molecule", Comm. Math. Phys. 104, 271-282 (1986). 25. Elliott H. Lieb, Michael Loss, "Stability of a model of relativistic quantum electrodynamics", Comm. Math. Phys. 228, 561-588 (2002). 26. Elliott H. Lieb, Michael Loss, Jan Philip Solovej, "Stability of matter in magnetic fields", Phys. Rev. Lett. 75, 985-989 (1995). 27. Elliott H. Lieb, Heinz Siedentop, Jan Philip Solovej, "Stability and instability of relativistic electrons in classical electromagnetic fields", Jour. Stat. Phys. 89, 37-59 (1997). 28. Elliott H. Lieb, Barry Simon, "The Thomas-Fermi theory of atoms, molecules and solids", Adv. in Math. 23, 22-116 (1977). 29. Elliott H. Lieb, Jan Philip Solovej, "Ground state energy of the one-component charged Bose gas", Comm. Math. Phys. 217, 127-163 (2001); erratum, ibid. 225, 219-221 (2002). 30. Elliott H. Lieb, Jan Philip Solovej, "Ground state energy of the two-component charged Bose gas", preprint, 2003. 31. Elliott H. Lieb, Walter E. Thirring, "Bound for the kinetic energy of fermions which proves the stability of matter", Phys. Rev. Lett. 35, 687-689 (1975). 32. Elliott H. Lieb, Horng-Tzer Yau, "The stability and instability of relativistic matter" Comm. Math. Phys. 118, 177-213 (1988). 33. Michael Loss, Horng-Tzer Yau, "Stability of Coulomb systems with magnetic fields. III. Zero energy bound states of the Pauli operator", Comm. Math. Phys. 104, 283-290 (1986). 34. Jan Philip Solovej, in preparation. 35. Lars Onsager, "Electrostatic interaction of molecules", Jour. Phys. Chem. 4 3 , 189-196 (1939). 36. R. A. Weder, "Spectral analysis of pseudodifferential operators", J. Functional Analysis 20, 319-337 (1975).
Entropy production and convergence to equilibrium for the Boltzmann equation CEDRIC VILLANI (ENS
Lyon)
In this short paper, I shall address recent developments in the study of convergence to equilibrium for the Boltzmann equation, linked with the study of Boltzmann's H Theorem and entropy production. The presentation closely follows my lecture at the ICMP 2003 Conference in Lisbon. For pedagogy, I will use an informal style, at the cost of losing rigor and precision; but details and rigorous discussion can be found in the research papers quoted within the text.
1. A "simple" motivating problem In an empty box (modelled as a smooth open connected subset il of R 3 ) throw N ~ 1020 "independent" small particles (say billiard balls of radius r ~ l/\/N), according to some nice probability density fo(x,v)dxdv in phase space. Here "phase space" means the coordinate space of positions and velocities. Of course it is impossible to have really independent particles, since they have to exclude each other, but let us forget about this issue. The density of particles, or empirical measure, is given by the formula
fl(dxdv) :=
1
N
jjj^6^^)i=l
If N is very large, and the particles are (close to) being independent, some versions of the law of large number imply that at time 0, Jlo — fo(x,v) dvdx, in a sense which will not be made precise here. Now, at positive times the system evolves according to Newton's equations, with interaction between the particles, so the balls acquire new positions and velocities. Can one predict a good approximation of the density of particles at large times?? Here would be a physicist's guess. Let (ft)t>o be the solution of Boltzmann's equation starting from the initial datum /o. One expects (i) flt ~ ft (with very high probability), for given t, if TV is large enough; (ii) ft ~ Gaussian when t is large enough; B U T statement (i) cannot hold true when the time t is very, very large — because of Poincare's recurrence theorem, for instance. In fact, the Boltzmann approximation loses its validity under very, very long periods of time. Here "large enough", "very high", "very very large" should be quantified in terms of /o, interaction, shape of the box, number N of particles. Our seemingly simple problem actually appears to be tremendously complicated! To this date, neither statement (i) nor statement (ii) has received mathematical justification in a quantitative way. Statement (i) is related to Lanford's 1973 famous theorem (a
130
Entropy production and convergence to equilibrium for the Boltzmann equation
131
good account of this theorem can be found in Cercignani, Illner and Pulvirenti [3]). This talk is all about statement (ii). So the central question which will occupy this paper is the following: Can one establish quantitative rates of convergence to equilibrium for solutions of the Boltzmann equation? Before entering into the bulk of the subject, I should stress that the presentation which follows is quite informal and tries to conceal the extreme technicality of the subject. More precise information can be obtained in recent papers by the author, some of them in collaboration with Desvillettes [5,6,16] (all of them available from http://www.umpa.ens-lyon. f r / ~ c v i l l a n i ) . These references themselves contain pointers to various parts of the mathematical literature.
2. The Boltzmann equation The Boltzmann equation models the evolution of a dilute "chaotic" gas made of many particles interacting by binary, elastic, localized, instantaneous collisions. A lot of references and information about the mathematical theory of the Boltzmann equation can be obtained in the author's long survey paper [15]. The unknown in Boltzmann's equation is the time-dependent distribution function of the gas, in phase space: ft(x,v), where t = time; x = position 6 fl c R n (say n = 3); v = velocity. The quantity ft(x,v) can be thought of as the density of the gas around position x and velocity v, at time t. The Boltzmann equation itself reads (BE)
^ +
v-Vxf = Q(f,f). transport collisions
The collision operator Q may seem ugly to the non-familiar reader:
Q(f,f)=
[
f
\f(v')f(v'.)-f{v)f{y.)]B{v-v.,o)dcTdv*.
Here we used the notation ,
u + u* 2
\v — v,\ 2 '
, *
v + v* 2
\v-vJ 2
The reader can think of (v',i>*) as the velocities of two particles before they collided; after the collision, their velocities have become (v, v*). Moreover, B is the Boltzmann collision kernel, e.g. B = \v — v*\ (hard sphere kernel in dimension 3). The Boltzmann equation is not complete without b o u n d a r y conditions (for x £ dQ). Here are some simple examples of boundary conditions: bounce-back
specular reflection
f(x,v) = f(x,-v)
f(x,v) =
f(x,Rxv)
periodic
132
CEDRIC VILLANI
3. From kinetic to hydrodynamic description Most equations in fluid mechanics are written in terms of hydrodynamic models rather than kinetic models. From a kinetic description one can always define a hydrodynamic description, with the following recipes: p(t,x
:=
f dv
u(t,x
:= -
F(t,x :=
fvdv
(mean velocity),
f (v —u) <S> (v — u)dv
T(t, x := — t r P np
E(t,x
(density),
:= Jf\±dv
(tensor pressure field),
(temperature), = pl^f
+
^pT
(energy).
Besides their importance in physics, these fields will play a key role in the sequel.
4. Mathematical status of the Boltzmann equation The mathematical theory of the Boltzmann equation is still far from completion, and so far only scattered pieces of theory exist. Below are listed some of the most significant pieces; many details and references can be found in the author's discussion [15] of the Cauchy problem for the Boltzmann equation. — Existence of weak ("renormalized") solutions in great generality (for short-range interactions this is a celebrated theorem by DiPerna k, Lions (1989), which can be found in Cercignani, Illner and Pulvirenti [3]; for long-range interactions this is a recent result by Alexandre and the author [1]). All the rest is under restrictive assumptions: — A good theory of solutions close to equilibrium: this theory was started by Ukai (1974) and then continued by many authors since then. Among a great numbers of contributions, we highlight the recent works by Guo [8-10]. — A rather good theory for small solutions in the whole space (Kaniel-Shinbrot 1978, Illner, Toscani, . . . ) . — An excellent theory for spatially homogeneous solutions, i.e., no x variable (Carleman 1933, Arkeryd 1972, . . . ) . — It is still an outstanding problem to construct a decent theory for classical solutions in the large. Why is it so difficult to deal with the full Boltzmann equation? Here below are listed some of the main reasons: — the collision operator Q is quadratic; — Q is complicated, and fine analysis with it is quite a challenge;
Entropy production and convergence to equilibrium for the Boltzmann equation
133
— Q acts only on the velocity-dependence, not on the position, which introduces a fundamental degeneracy; — v • V x and Q get along awfully: virtually any trick which works well for one, is a disaster for the other. To give an idea of the present state of attempts towards a theory of smooth solutions in the large, here below is an example of conditional regularity result, worked out by the author: Under reasonable assumptions on B (say "hard potentials without cut-off", for instance) and periodic boundary conditions, I F one has the following a priori estimates: (i) the density and energy fields bounded from above, (ii) the density is bounded below (no vacuum!): p > 5 > 0, (iii) the pressure tensor is bounded below, in the sense of matrices: P > XIn (A > 0), T H E N one can construct very nice solutions: (1) all Sobolev norms (with derivatives in both x and v) of / are bounded; (2) all u-moments (J f\v\k dvdx) are bounded. N B : Condition (iii) above is rather natural, in the sense that / integrable =>• P > 0 />a.e.
5. B o l t z m a n n ' s H T h e o r e m : Let us define the H functional as the opposite of Boltzmann's entropy: H(f):=
f
f log f dvdx =
-S(f).
JClxRn
Let (ft)t>o be a nice solution of the Boltzmann equation. Then, (i) dH/dt < 0. More precisely, there exists a functional D>0,
jtH(ft) = -JD(ft(x,-))dx
acting on L 1 (R"), s.t.
<0.
entropy production (ii) Assume B > 0 a.e. Then D(f) = 0 iff f is a
Maxwellian:
Statement (ii) can be reformulated in terms of solutions of the Boltzmann equation: (ii1) The entropy production vanishes at time t iff ft is a local Maxwellian dynamical state": ft(x,v)
= MpuT(v),
p = p(x),
u = u(x),
= "hydro-
T = T(x).
To the H Theorem one can add the following complement: (iii) Assume appropriate boundary conditions and appropriate geometric conditions on £1; then, up to normalization of conservation laws, the B.E. admits a unique solution which is hydrodynamical (locally Maxwellian) at all times, (iv) Corollary: This is the only stationary state.
134
CEDRIC VILLANI
Main example: Assume n = 2 or 3, specular reflection, 0, connected and not axisymmetric, | 0 | = 1, total mass 1, total energy n/2. Then the only hydrodynamical solution of the Boltzmann equation is the global Maxwellian e ft(x,v) = M(v) = (2TT)"/ 2
6. Convergence to equilibrium Let / be a nice solution of B.E., with nice bounds, uniform in time. It is an easy task to prove convergence to a stationary state (for this you just have to combine the H Theorem with a bit of functional analysis). What is much, much more tricky is to find constructive bounds on the speed of convergence. Wrong strategy: Indeed,
Start by linearizing close to equilibrium and perform a spectral study.
— The "natural" estimates of Sobolev and moment bounds are not strong enough to allow the "natural" linearization of the Boltzmann equation in L2{M~l). — How long do we have to wait until the solution is sufficiently close to equilibrium, that it makes sense to linearize? This problem is not of technical nature but intrinsic to linearization. In fact linearization can only be understood as a complement to other "fully nonlinear" techniques.
7. Short history The problem of speed of convergence was pioneered by Kac [11] (1954), who tried to treat it in the spatially homogeneous case as the limit of a many-particle problem. On this occasion he introduced — "Kac's problem" about spectral gap in large dimension; on this problem there has been a number of recent contributions by Diaconis & Saloff-Coste, Janvresse, Maslin, Carlen & Carvalho & Loss, and the other; in particular, it is profitable to consult Carlen, Carvalho & Loss [2]; — the concept of "propagation of chaos"; — one of the first continuous mean-field limits; . . . but did not get anywhere about rates of convergence. Then McKean [12] (1966) introduced information theory into the game, for an oversimplified one-dimensional model called Kac's caricature. In the 1990's, Desvillettes, Carlen & Carvalho, Toscani & Villani obtained strong results in the spatially homogeneous case. These works are reviewed in a recent work [16] by the author, which will be discussed later on in this paper. The result to be presented in the sequel is about explicit rates of convergence in the spatially inhomogeneous case, for smooth solutions; most of it was worked out by the author in collaboration with Desvillettes.
Entropy production and convergence to equilibrium for the Boltzmann equation
135
8. Main difficulties In dealing with the problem of convergence to equilibrium one encounters difficulties which are reminiscent of the already mentioned difficulties for the Cauchy problem: — The complexity of Q, again. — The fact that the entropy production vanishes for hydrodynamical states, which can be seen as a reflection of the degeneracy of the collision operator. In particular, —• local Maxwellians may be a nuisance here! This is very different from the problem of hydrodynamical limit,
| £ + t , . V x / = |^Q(/,/),
Kn^O.
In that limit, on the contrary, one hopes (and in some cases proves) that the solution stays very close to be a local Maxwellian at each time. — One has to understand, and translate into equations, the crucial role of the transport operator v • Vx and the boundary conditions, which help selecting the equilibrium but do not produce any entropy! Remark: There are some common points between the way the problem was just formulated, and the hypoelliptic regularity problematic. The analogy can actually be pushed rather far.
9. M a i n result (Desvillettes & Villani, 2002) Here is our main result, formulated in a rather informal way; a precise statement can be found in the original research paper [6]. Let f be a solution of the Boltzmann equation such that (i) all Sobolev norms and moments of f are bounded, uniformly in t; (ii) ft(x,v) > K0 exp(-A0\v\i°). Then, after suitable normalizations, \\ft — M||£°o = 0(t~°°),
with explicit constants.
Remarks: — It was shown recently by Mouhot [13] that (i) => (ii) under realistic assumptions, at least for periodic boundary conditions. — In the spatially homogeneous case, say for "hard potentials with cut-off", a complete regularity theory can be worked out, and then such a theorem applies to prove convergence to equilibrium like 0(£~°°), see Mouhot and Villani [14]; but even if we take the regularity bounds for granted, going from spatially homogeneous to spatially inhomogeneous is still a monster headache; — The theorem as it stands applies almost verbatim to Guo's solutions, but does not need / to be close to equilibrium, and does not rely on linearization. — The usual strategy to attack the problem of convergence to equilibrium is to try to show that / looks like a hydrodynamic state in large time, then identify this state. Our proof goes somehow the opposite way, as we shall see.
136
CEDRIC VILLANI
10. Strategy of proof The functional used to measure the distance to equilibrium is the Kullback information of / with respect to M: H(f | M) = H(f) - H(M). This simplified form is due to the conservation of energy, actually the more general definition of the Kullback information is #(/!$) = / / l o g - , which is nonnegative as soon as / / = / g. Then our proof goes as follows: (1) Use a "sharp" quantitative H Theorem. (2) Establish a quantitative version of "instability of the hydrodynamic behavior". (3) Couple both features by — "additivity of the entropy" (total entropy = hydrodynamic entropy + kinetic entropy), — interpolation and geometrical inequalities. (4) Get a system of differential inequalities involving closeness to global and to local Maxwellians. Let us examine these various points in more detail one after another.
11. Quantitative H t h e o r e m Let / = f(v) be a distribution in velocity space. Let p, u, T be the associated density, mean velocity, temperature. This defines a Maxwellian distribution MpuT — Mpur{v). We know: £>(/) = 0 iff / = MfpuT. Our goal is to establish an inequality roughly looking like D{f) > "distance (/, MpuT)". Here D is the functional of entropy production, whose explicit form is
D(f) = \j(f(v')M)
- f(v)f{v,j)
log j^jf^Bdadvdv*
.
Presently, a good notion of "distance" is given by the Kullback information
H(f\MfpuT)=
I J
fVygf-dv. M
puT
Strictly speaking, this is not a distance (although often called Kullback-Leibler distance), but it behaves in some respects like the square of a distance. Cercignani's conjecture (around 1980): Under suitable positivity assumptions on the collision kernel B, holds the functional inequality D(f) > K H(f | MfpuT), where K depends on p and T — maybe on other a priori estimates on f as well.
Entropy production and convergence to equilibrium for the Boltzmann equation
137
To study this conjecture, it is sufficient to assume p = 1, u = 0, T = 1. In any case, it is false most of the time (Bobylev, Wennberg, Cercignani). The first positive results approaching it were established by Carlen & Carvalho (1992), with subsequent considerable improvement in Toscani & Villani (1999). A recent result by the author [16] establishes a sharp variant of Cercignani's conjecture which can be used in the spatially inhomogeneous problem under smoothness/moments/ lower bounds a priori estimates. The paper can be consulted as well for a review of the abovementioned previous results by Bobylev, Cercignani, Carlen, Carvalho, Toscani, Wennberg and the author. Theorem
(Villani, 2002):
(i) Let B — 1 -\-\v — u*|2 [nonphysical assumption!]; then Cercignani's conjecture holds true: D(f) > K H(f \ M) with
T*(f) := max eigenvalue of P. (ii) For any reasonable B (ex: \v — v*\), if f lies in all Sobolev spaces, has all moments finite, and satisfies a lower bound f > Koexp(—Ao\v\9°), then D(f)>K£(f)H(f\M)1+s.
Ve>0,
Here are some very sketchy elements about the proof of this theorem. To go from (i) to (ii), the key is to establish the tricky non-concentration estimate {f'fl-fU)\ogf-^do-dvdv*=0(5n-*H{f\M)1-*),
/ J\v—v.\<8
J J*
where 5 and e are arbitrarily small, and the constants depend on Sobolev norms of / as well as a (stretched exponential) lower bound. The fact that the exponent of 8 in the r.h.s. is close to n is not so important, but it is important that the exponent does not approach 0 as e —* 0, and that the exponent of H(f | M) be arbitrarily close to 1. This is what ensures that in the end the loss of exponent in the final inequality is arbitrarily small. Next, here is a vague sketch of proof for (i). The argument involves the relative Fisher information, well-known in information theory and statistics:
I(f\M) = Jf\Vvlog{f/M)\2dv, and the Ornstein-Uhlenbeck regularization semigroup (let's call it (St)t>o) generated by the PDE dtf = Avf + V„ • (/«). Note that {d/dt)H{Stf \ M) = -I(SJ | M). The key identity, partially algebraic, is a commutator identity: let £(F,G) := (F — 0)\og(F/G). Then
dt t=o
[St,£] = -J,
J(F,G)
= \V\ogF - V l o g G | 2 ( F + G).
138
CEDRIC VILLANI
At the end of a quite intricate chain of inequalities, one can derive the following representation formula for a lower bound on D: /•+00
D{f)>K Jo
r
e" 4 "' / Jm2n
J(StF,StG)dvdv*dt,
where F(u,u») = f{v)f(v*) is a tensor product and G(v,v*) is the average w.r.t. a of f(v')f(v'*): ^ o n l y depends o n » | t ) » and \v\2 + |v*|2. This conflict of symmetries is the key: it implies, after another chain of inequalities, /
J{StF,StG)dvdv*
> KI(Stf
| M).
To conclude, one uses the inequality e-4ntI(g\M)>I(S2ntg\M) (a particular case of the Blachman-Stam inequality from information theory). The final result is
= • D(f) >K J
I(S{2n+1)tf
\M)dt=
^ - j - y H(f | M).
The complete proof is quite tricky and relies on the precise form of the collision operator; but once it is done, we can forget almost everything about the precise form of the Boltzmann collision operator, and only recall that -jtH(f\M)>KeH(f\MfpuT)^. In the spatially homogeneous case, this would be the end of the game: -jtH(f
| M) > K£H(f
| M)1+£
==*• By Gronwall, H(f \ M) = 0(t~i) and we are done. But in the spatially inhomogeneous case, p, u, T depend on x, so the entropy production does not control H(/ | M)!!! In fact, if / happens to be hydrodynamical at some time, then the instantaneous entropy production is zero this shows that we cannot aim at anything in Gronwall style.
12. Instability of hydrodynamic behavior Here is the main idea which we worked out with Desvillettes: If f approaches a local Maxwellian, not global, then f will depart "transversally" from the space M. of all local Maxwellian distributions. This vague statement will be quantified only on the average, and holds true only for generic local Maxwellian. In fact: — VT 7^ 0, or dev(u) ^ 0 = » departure from M; — symmetrized gradients of u =4> departure from the subspace of M with uniform temperature;
Entropy production and convergence to equilibrium for the Boltzmann equation
139
— gradients of p =$• departure from the subspace of M with uniform temperature and zero velocity field. In the above, we introduced the deviatoric part of the velocity field u, denoted by dev(u). It is defined as (Vit + ( V u ) r ) div(ti) 2 n i.e., the traceless part of the symmetric Reynolds tensor. It is well-known in hydrodynamics and plays an important role here too. Here is a schematic picture of the dynamics summarizing the above statements. The flow is supposed to represent in a fuzzy way the Boltzmann flow; the surface drawn here stands for the infinite-dimensional manifold of locally Maxwellian distributions; this manifold is unstable in some sense, except along a certain sub-manifold (no gradients of temperature, no deviatoric part); this sub-manifold itself is unstable, except for a sub-sub-manifold (not represented here; no gradients of temperature, no gradients of velocity); this sub-sub-manifold is in turn unstable, except for the global equilibrium M.
/
T=const dev(u)=0
The question is how to turn this picture into quantitative bounds. We used the following recipe (adapted from Desvillettes &; Villani [4]): compute a lower bound for d2 Let us answer some of the most immediate questions about that strategy: W h y the second derivative? 1) to estimate the speed of departure from M; since it is likely to vanish at second order in time if it ever vanishes, a lower bound on the second derivative will prevent it to vanish too much; H(f\MfpuT)
140
CEDRIC VILLANI
2) by applying the equation twice, we shall have the transport operator v • V x enter the equations twice, and this will have the same effect as a A^; this is the way we recover "ellipticity". (Cf. the Chapman-Enskog procedure to go from the Boltzmann equation to the compressible Navier-Stokes system: the appearance of the viscosity in that process can be traced back to the repeated appearance of the transport operator.) W h y a square norm? 1) for smoothness: the square norm is easier to handle, than, say, the norm itself; 2) because H(f\g) anyway behaves somewhat like a square norm. In fact it is wellknown that ,, . ,,2 H{f | g) > KZJhL. W h y not look at (d2/dt2)H(f | M j u T ) ? Because it is very difficult, if not impossible, to control a Kullback information from above by something (almost) quadratic without using stronger estimates, like L°°(M~1 dvdx). Now that the goal is more clear, we can implement the strategy: after monster computations, we find ^
11/ ~ MfpuT\\h
> Kjj^*T\2
+ I tev(u)\2)dx - Ce(f) ||/ - MfpuT\\^H(f
|M)1^,
where Ce(f) depends on smoothness/moments of/ at high enough order (interpolation . . . ) , and a lower bound. Now there is no gradient of p in the r.h.s.! This reflects the existence of quasi-equilibria (the sub-manifold referred to above), for which the entropy production vanishes at high order in time (typically, order 4 instead of 2), but p is not uniform. To remedy this, we establish two "similar" inequalities with MpuT replaced by M
lu(T) —* presence of / | V s y m u | 2 d a ; ; M'p0l —• presence of / \Vp\2 dx.
We don't write down these inequalities explicitly here, the reader will find them all in Desvillettes & Villani [6].
13. Putting both features together It remains to "glue" together our quantitative H Theorem on one hand, and the instability of hydrodynamical regime on the other. This is not trivial, because these results are expressed in terms of different functionals. 1) First tool: Additivity of the entropy. H(f\M) where
We use formulas of the style
= H(f\MpfuT)
+
H(p,u,T)
Entropy production and convergence to equilibrium for the Boltzmann equation
H(p,u,T)= I
plogj^dx
= J(p]ogp-p + l) + J p^ 4>(T)=T
141
+^ ^
+
^T)),-^^)],
-logT-1.
There are two other "similar" formulas involving Mfu,T), 2) Second tool: "Geometrical" inequalities.
We use two types of inequalities:
j |VT| 2 dx > K{Q) \\T - (T)p\\
Poincare inequality: Korn inequality:
Mp01.
/ | V sym w| 2 dx > /C(fi) / | Vu| 2 dx.
—> For specular reflection (which results in the tangency condition: u tangent to the boundary), the Korn constant /C(0) is positive iff fl not axisymmetric! 3) Third tool: Relating entropies and L2 norms. H(f | MfpuT) > K£(f)(\\f
For this we use interpolation:
- MfpuT\\2L2)1+e,
y"(plo g/ 9-p + l ) < C ( / ) | | p - l | | 2 2 ,
etc.
etc.
At the end of the day, we obtain a system of 4 "nonlinear" differential inequalities (first and second-order in t) coupled by identities and inequalities. To deal with this system, we need a replacement for Gronwall's lemma. What will play this role is the following lemma: Lemma:
Let h(t) > 0 satisfy Vt € (ti, t2),
h"(t) + Ah(t)l~E
> a > 0,
for some e < 0.1. Then, a2n^iy
either t% —1\ is small:
t^ — t\ < 50 1 ! A21-12-') 1 r a ~ (\ 1 \ inf I —, —-^ ). KJ dt > 2 or h is large on the average: t2-hJ/ h(t) ~ 100 \A 'A J tl It is not completely the end: to apply this lemma, we need the differential inequalities to be valid on "not too short" time intervals, and therefore first rule out rapid oscillations of hydrodynamic quantities.
14. D a m p i n g of hydrodynamic oscillations This damping is the last step in our proof, and enables us to rule out the possibility of rapid oscillations in the hydrodynamical quantities in long time. Because M is stationary and / is smooth, one can establish inequalities of the style
142
CEDRIC VILLANI
/
plogpdx
etc.
(we obtain three such new differential inequalities).
15. Conclusion of t h e proof After putting together all the pieces of the puzzle, we prove: for e < 0.01, the whole system of differential inequalities implies H(ft | M) =
0{t-1'™*).
This concludes the proof of the theorem. Let us now make a few remarks.
16. R e m a r k 1: Quasi-equilibria If p, u, T are such that VT = 0,
dev(u) = 0 ,
u • n = 0,
then the entropy production around the local Maxwellian MpuT vanishes up to order (almost) 4 in t. Do there exist nontrivial u's? This question amounts to ask whether the conformal group C(fi) is nontrivial. The answer to this question is not immediate, and depends on the dimension and the geometry of the domain: — n = 2, Q simply connected: C(fi) is always 3-dimensional (!); — n = 3: C(Q) is nontrivial iff fi = conformal image of an axisymmetric domain (this lemma was explained to us by Ghys; it is related to (but does not rely on) some famous results in the differential geometry, like the Liouville and Obata-Ferrand theorems). Remark:
Grad "proves": for fi non-axisymmetric, \dev(u)\2dx>K
\Vu\2dx
( u - n = 0).
This is obviously false in view of the above, but can probably be saved under more conditions on the domain. A good estimate of this kind would lead to better estimates in dimension 3 than in dimension 2 . . . [This criticism does not aim at diminishing the merit of Grad, whose intuition in the problem of convergence to equilibrium is actually very impressive.]
17. R e m a r k 2: Role of t h e geometry of fJ How does the shape of the domain affect the speed of convergence to equilibrium? This of course is a very natural and physically relevant question. In our proof, the shape of O only affects the values of Poincare and Korn constants! It is therefore of great interest to have explicit estimates on these constants.
Entropy production and convergence to equilibrium for the Boltzmann equation
143
For specular boundary conditions (u • n = 0), the positivity of the Korn constant K.(Cl), f \Vsymu\2 dx > /C(fi) f |Vu| 2 dx, quantifies departure of CI from axisymmetry. If ft is convex, /C(fi) is bounded below in terms of G(Q) — Grad's number (here for n = 3): G(n) := inf2 inf j||V sym M||?,2 (n) ; V • u ~ 0, curl(w) = a, u • n = o\ . Estimating this number turns out to be related to a Monge-Kantorovich minimization problem! (Desvillettes & Villani [5]) G(Sl) > K inf
W2(Cn,CT''af,
where C = Lebesgue measure; "sym; a" means "symmetrized around axis (g,a)"; W2 is the 2-Wasserstein distance,
W^{n, v)2 = infr#^=t/ / \x — ^(a;)! 2 d^(x).
18. Remark 3: Are there oscillations in the kinetic entropy?? The proof suggests that H(f | M* T), which measures how close / is from being hydrodynamic, may show strong time-oscillations. On some integrable baby models (Gaussian diffusion semigroups . . . ) , this is true. But does this tendency persist for the Boltzmann equation?? I do think that this may be the case, and to back this theory I invite the reader to take a look at the following numerical simulations performed by Filbet on a simplified geometry (periodic boundary conditions, 1-dimensional in position, 2-dimensional in velocity) with an accurate spectral code [7] (the length L is the size of the periodic box, the Knudsen number is of order 0.25): Local Relative Entropy Global Relative Entropy
0.1
Local Relative Entropy Global Relative Entropy
0.01 0.001 0.0001 1e-05 1e-06 1e-07 1e-08
0.2
0.4
0.6
0.8
L=l Figure 1. Time-evolution of the "local" (kinetic) and "global" entropies.
144
C E D R I C VILLANI
In these experiments, the initial d a t u m was chosen to be in local equilibrium. Since this is a logarithmic plot, the convergence seems to be in fact exponential, as could be expected, and it is faster for a small box. W h a t is much more surprising is the fact t h a t , in t h e first picture, the dynamics behaves almost in a spatially homogeneous way after some time, contrary to the possibly natural guess t h a t the hydrodynamical regime would be attained first. In t h e second picture, t h e oscillations in t h e entropy production are quite well-marked, and definitely suggest t h a t the solution oscillates between states where it is quite close t o hydrodynamic, and states where it is not at all. This kind of behavior in any way is a clear indication t h a t things in the spatially inhomogeneous context can be much, much more subtle t h a n in the spatially homogeneous one. No such weird behavior is ever observed in numerical simulations of the spatially homogeneous Boltzmann equation.
References 1. R. Alexandre, C. Villani, Comm. Pure Appl. Math. 55, 30-70 (2002). 2. E. Carlen, M. C. Carvalho, M. Loss, "Determination of the spectral gap for Kac's master equation and related stochastic evolutions", to appear in Acta Math. 3. C. Cercignani, R. Illner, M. Pulvirenti, The Mathematical Theory of Dilute Gases, Springer, New York, 1994. 4. L. Desvillettes, C. Villani, Comm. Pure Appl. Math. 54, 1-42 (2001). 5. L. Desvillettes, C. Villani, ESAIM Control Optim. Calc. Var. 8, 603-619 (2002). 6. L. Desvillettes, C. Villani, "On the trend to global equilibrium for spatially inhomogeneous kinetic systems: the Boltzmann equation", preprint, available from h t t p : //www. umpa. ens-lyon. fr/"cvillani. 7. F. Filbet, G. Russo, J. Comput. Phys. 186, 457-480 (2003). 8. Y. Guo, Comm. Math. Phys. 231, 391-434 (2002). 9. Y. Guo, Arch. Ration. Mech. Anal. 169, 305-353 (2003). 10. Y. Guo, Invent. Math. 153, 539-630 (2003). 11. M. Kac, Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, 1954-1955, vol. Ill, 1956, pp. 171-197. 12. H. P. McKean, Jr., Arch. Rational Mech. Anal. 21, 343-367 (1966). 13. C. Mouhot, "Quantitative lower bounds for the full Boltzmann equation", preprint (2004). 14. C. Mouhot, C. Villani. "Regularity theory for the spatially homogeneous Boltzmann equation with cutoff", preprint, available from h t t p : / / w w w . u m p a . e n s - l y o n . f r / ~ c v i l l a n i . 15. C. Villani, in Handbook of Mathematical Fluid Dynamics, vol. I, North-Holland, Amsterdam, 2002, pp. 71-305. 16. C. Villani, Comm. Math. Phys. 234, 455-490 (2003).
Aspects of free probability DAN VOICULESCU (U. California at Berkeley) Free probability theory provides a probabilistic framework for quantities with the highest degree of noncommutativity. This brings out the ties among von Neumann algebras of free groups, the large TV limit of random multimatrix models, the operators of the Boltzmann Fock space and combinatorics of noncrossing partitions. Many concepts of classical probability theory have free probability counterparts. In particular, the free analogues of entropy and of Fisher's information will be one of the main focuses of the talk. The analysis of the free variables involved, relies on free difference quotient derivations, which give rise to bialgebras in the class with derivation comultiplication, which appears to be selfdual. Duality for such bialgebras underlies the free entropy and analytic aspects of the free Markov property.
1. Introduction Free probability theory is a probabilistic framework for quantities with the highest degree of noncommutativity. In particular, this means that the same laws of randomness occur in different situations with maximum noncommutativity. Here, after a brief look at the theory as a whole, we will go into somewhat more detail about the free analogue of entropy and about certain free analysis questions related to the free difference quotient derivations. The subject originates in the study of operator algebras and we shall discuss links to random matrices and to duality for the class of bialgebras with derivation comultiplication. Besides a few references quoted in the text, at the end of each of sections 2, 3 and 4 there are brief bibliographical notes to connect with the list of references at the end.
2. Free probability 2.1. A game, which is rather often played, is to modify an axiom of a basic theory and to examine the consequences. In the case at hand the basic theory is noncommutative probability (a.k.a. quantum probability) theory. The modification occurs in the definition of independence, the independent quantities, instead of commuting, will be highly noncommuting in general. 2.2. The framework for noncommutative probability theory will be similar to that for quantum mechanical quantities. A noncommutative probability space will be an algebra A, 1 G A, of bounded operators on a Hilbert space H. endowed with an expectation functional
145
146
DAN VOICULESCU
2.3. A family of subalgebras (Ai)i£i, 1 € A%, in (A,
Aspects of free probability
147
let £(h)£ = h<£>£, h €.H, £ € TH be the left creation operators. Then if (7ii)iei are pairwise orthogonal subspaces in H, the family of sets of operators
ui = {e(h),f(h)\ he?U},
iel,
will be free w.r.t. the vacuum expectation ?(•) = (-1,1) where Ti®° = CI. 2.10. Some of the nicest examples of freeness are in von Neumann algebras with faithful trace-states (M,T). This means the algebra M is weakly closed in the weak operator topology, T € M => T* e M, T(ST) = T(TS) and T(T*T) = 0 iff T = 0. In case such M is a factor, i.e., Centre(M) = C l and M is infinite-dimensional, M is called a factor of type Hi and the trace-state turns out to be unique. 2.11. If G is a group and, A(g)e/l = egh is the left regular representation on £2(G), then the von Neumann algebra L(G) generated by X(G) has a natural trace r(-) = (-ee,ee). In case G has infinite conjugacy classes (abbreviated i.c.c.) L{G) is a factor of type Hi. Suppose G = Gi * G2 is a free product of subgroups. Then the von Neumann subalgebras generated by A(Gi), A(G2) are free in (L(G),T) (and are isomorphic to L{Gi),L{G2)). 2.12. The large N limit of n-tuples of N x N random matrices also produces freeness and von Neumann algebras with trace states. Let Tjtff = T?N be such a n-tuple which is also hermitian. Then the GNS-construction of operator algebra theory can be used to construct a von Neumann algebra with faithful trace-state (M,T), generated by a n-tuple Tj = T*, 1 < j < n so that r(Th ... Tip) = lim A T ^ T r ^ . . . Tip,N) ((j an ultranlter and assuming the limits exist and yield bounded operators). 2.13. Theorem ([33, Cor. 2.14]). Let TjtN = T*N, 1 < j < n, n = n x + . . . + np be n-tuples of random matrices with classical distributions /ijv € Prob(A / ljy)". Let (Ui,... ,Up-i) e J7(A r ) p_1 act on (A\,..., An) by conjugating Aj byUq ifni + ... + n Q _i < j < ni + . . . + nq (if ni + . . . + n p _i < j < n, then Aj stays the same) and assume /xjy is invariant. If additionally || supp£t./v||oo < R then in the large N limit von Neumann algebra the sets Qs = {Tj I ni -I
h n s _i < j < ni H
(- n s } ,
1 < s
148
DAN VOICULESCU
i.i.d. random variables in the classical theory. Moreover the large iV-limit von Neumann algebra, i.e., the von Neumann algebra generated by Si,...,Sn is isomorphic to L(F(n)) where F(n) is a free group on n generators. 2.15. The fact that L(F(n)) is asymptotically generated by a n-tuple of Gaussian random matrices has been useful both ways. It has added to the understanding of the large N limit of these random matrices, while at the same time it has been the source of much progress on the L(F(n))'s. For instance if P = P* = P2 G L(F(oo)), P ^ 0 is a projection then L(F(oo)) and PL(F(oo))P are isomorphic (the case T(P) G Q was done by the author in [25] and then completed by Radulescu also for T(P) G R\Q [14]). This scaling result, which concerns von Neumann's fundamental group of L(.F(oo)) uses in an essential way the symmetry and scaling properties of Gaussian random matrices and their interplay with asymptotic freeness. 2.16. The Hi-factors (M, r) of von Neumann are a wonderful world of geometries of linear subspaces, corresponding to the selfadjoint projections P = P* = P2 £ M with dimensions T{P) G [0,1]. The most basic and smallest such M is the hyperfinite Ili-factor, which by a deep theorem of Connes can be defined as the Ili-factor L(G) obtained from any countable amenable i.c.c. group. Free probability theory and, in particular, the results obtained via the asymptotic random matrix realization has made the extended class of free group factors (i.e., factors PL(F(n))P for some projection P G L{F{n))) the second important class of Ill-factors. For instance, some of the deepest results in subfactor theory are about subfactor inclusions of hyperfinite Hi-factors N C M. On the other hand not all inclusion data (the standard invariant) have hyperfinite realization, but it has been shown (combined work of Shlyakhtenko, Ueda and Popa) that there are always realizations as subfactors of a free group factor. 2.17. Note also that like the hyperfinite class which has an extension to type III there is an emerging class of "free type III" factors (Shlyakhtenko, Radulescu). 2.18. At this time it seems that such "free" (or "free group") von Neumann algebras may arise from mathematical models in physics either via creation and annihilation operators on the full Fock space or from some associated subfactor inclusion without hyperfinite realization or, last but not least, from random multimatrix models. The last possibility is demonstrated for instance by the large N 2D Yang-Mills QCD (work of I. M. Singer, M. Douglas, R. Gopakumar and D. Gross). It seems natural to formulate the following 2.19. Guess: the large N limit of random multimatrix models with distributions daN = cN exp(-JV Tr P(Ai
,...,An))d\,
P a polynomial, give rise to von Neumann algebras in the class of free group factors. 2.20. Bibliographical notes. Expositions of free probability and more references are in [38] and [35]. For the combinatorial approach see [20]. The first paper on free probability theory is [24]. Applications of free probability and especially of the random matrix model [26] to von Neumann algebras are in [25], [14], [7], [15], [17], [13], [23]. About connections with some models in physics see [6], [9], [19].
Aspects of free probability
149
3. Free entropy 3.1. Shannon's differential entropy of a n-tuple of numerical random variables ( / i , . . . , fn) with joint distribution p(t\,..., tn) dt\... dtn is given by H(fi,...,fn)
= - /
p(ti,...,tn)\ogp(ti,...,tn)dt1...dtn.
By free entropy we mean a quantity x(-^i> • • • ,Xn) where Xj = X* G (M,T) (a von Neumann algebra with faithful trace state) which has similar properties to H under the free/classical parallelism, i.e., under free independence instead of classical independence, semicircle law instead of Gauss law, etc. There are two approaches: one based on microstates, the other on the free analogue of Fisher's information. 3.2. The Boltzmann formula S = klogW provided the idea for the first approach to free entropy. In view of the asymptotic freeness results for random matrices it was natural to look for matricial microstates. Let r # ( X i , . . . , Xn; m, k, e) be the set of n-tuples (Ai,..., An) G (M%a)n, || A,-1| < R, 1 < j < n, so that \k-1Trk(Ah...Aip)-T(Xil...Xip)\<e for all 1 < ij < n, 1 < j < p, 1 < p < m. Then x(-X"i> • • •, Xn) is defined by taking sup inf inf R>0
e>0 m£N
of limsup (fc _ 2 logvoir(. • •) + — logfc) fc-^oo ^
where vol is the Euclidean volume on structure.
2
(Msha)n
/
w.r.t. the Hilbert-Schmidt Hilbert space
3.3. If n = 1, Tji(X;...), R > ||X|| is approximately a tube around the unitary orbit {UAU* | U G U(k)} of a microstate A. This rough idea underlies the (rigorous) proof of the formula X(X) = J J log \s ~ t\ Ms) dfx{t) + ^ + ~ log 2TT where // = Hx is the distribution of X. 3.4. If n > 1 and Xi,...,Xn are free, then the asymptotic freeness result for random matrices implies that T(X\,..., Xn;...) is close in measure as k —> oo to a product r j 1 < < n r ( X J ; . . . ) . This "explains" the fact that, assuming x{Xj) > —oo (1 < j < n) we have
x(xu...,xn) iff X\,...,
= x(x1) + --- + x(xn)
Xn are free.
3.5. There is also a change of variable formula X(F1(X1,...,
Xn),...,
Fn(Xu ...,Xn))=
log \det\(J(F))
+ X(*i,
...,Xn)
150
DAN VOICULESCU
where F\,...,Fn are noncommutative power series, |det| the Kadison-Fugles positive determinant in Mn ®M® Mop, J(F) <E Mn
= -oo.
(The condition is equivalent to M not being a factor and n > 1.) 3.9. Free entropy has had important applications to the solution of problems on von Neumann algebras. For instance, the question whether every separable Hi factor M has a Cartan subalgebra (i.e., a maximal abelian von Neumann subalgebra A C M, so that its normalizer {u e U{M) \ uAu*} generates M), was answered in the negative by showing that L(F(n)) is a counterexample. The question means, roughly, whether all separable Hi factors arise from measurable ergodic equivalence relations, i.e., originate in ergodic theory. The idea of the proof we gave is that on one hand L(F(n)) is generated by a semicircular system Si,...,Sn for which x(5i>• • • >Sn) = nx(Si) > ~°°! while on the other hand we showed that the existence of a Cartan subalgebra implies that x(^i) • • • >^n) = — °° f° r any generator T\,... ,Tn. Several other results in this direction were obtained by L. Ge, M. Stefan, D. Shlyakhtenko, K. Jung and K. Dykema. 3.10. An important problem for free entropy is the variational problem of maximizing X(T1,...,Tn)-T(P(T1,...,Tn))
where P is a given selfadjoint noncommutative polynomial. It is the natural generalization of the n = 1 case, which is equivalent to the classical problem of maximizing / /
log \s - t\ dfx(s) dn(t) - J P(t) dn(t).
3.11. The second approach to free entropy is based on a free analogue of the Fisher information. Classical Fisher information can be defined as the derivative of entropy under an independent Gaussian increment:
lH(f
+ e1/2g)\e=0=FiSher(f)
Aspects of free probability
151
where / , g are independent and g is (0,1) Gaussian. A function can be recovered (up to a constant) from its derivative, similarly the notion of entropy from the Fisher information. If / has distribution p(t) dt where p is smooth with compact support, then „'\2
Fisher(/) = f
SOI*
L2(p)
where Jj is the differential operator of derivations acting on test-functions in L2(R,pdt)
p
and
\dtj
is the statisticians score function. The Fisher information naturally extends to the free nvariable context. Besides the usual dictionary entries Gauss/semicircle, independent/free, etc., there is an additional entry for the derivative. 3.12. The free difference quotient replaces in free probability (more generally in "free analysis") the usual derivative. Let 1 G B c M be a selfadjoint subalgebra and let X = X* G M. Assume there is no nontrivial algebraic relation between B and X, so that the algebra generated by them identifies with B(X), the one-variable polynomials with noncommuting (except scalars) coefficients in B. We define the derivation dx..B : B{X) ^ B{X) ® B(X) by dx-.BboXbiX,..
.,bn = ]>J
b0X,...,bj-1®bjX,...,bn.
l<j
In case Xj = Xj € M, 1 < j < n, satisfy no nontrivial algebraic relation, they generate a unital algebra isomorphic to C ( X i , . . . , X n ) , and putting Bj = C{Xi,..., XJ-I,XJ+I, ..., Xn) we have C ( X i , . . . ,Xn) = Bj(Xj). We define the j - t h partial free difference quotient derivation dj:C(X1,...,Xn)^C(X1,...,Xn)®2 tobedxy.Br Note that in case B = C, C(X) identifies with C[X] (the commutative polynomials) and under the isomorphism C[X]®2~C[X,F] the derivation dx-.c corresponds to the difference quotient v
;
X-Y
This explains our terminology. 3.13. The conjugate variable of X w.r.t. (a.k.a. free score) is defined by J(X
: B) = d*x,Bl <8> 1
152
DAN VOICULESCU
(if the adjoint exists). Here dx-.B is viewed as an unbounded operator defined on B(X) C L2(M,T)
taking values in L2(M,T)
® L2(M,T)
and L2(M,T)
is the noncommutative L2-
space for the scalar product (mi, 1712) = Tfm^mi). Similarly we define Jk=d'kl®l
=
J{Xk:Bk)
when given Xj = Xj G M, 1 < j < n. The existence problem for conjugate variables is overcome by semicircular perturbations. If S G M is (0,1)-semicircular and free with B(X) then \\J(X + eS:B)\\ < 2 e " 1 (i.e., J(X + eS : B) is actually in M). The free Fisher information of a n-tuple X\,..., in M is **(X1,...,Xn)= ]T r(J2).
Xn
l
The corresponding free entropy is defined by x%Xu...,Xn)
J™(-^t-$%X1+t1/2S1,...,Xn+t1/2)n}dt
= ^log27re +
where Si,..., Sn is a semicircular system free with {Xi,..., Xn}. x* is sometimes called the nonmicro states free entropy or micro states-free free entropy. Several classical results in this context carry over to the free setting (Cramer-Rao inequality, Stam inequality, informationlog-Sobolev inequality, etc.). 3.14. The conjugate variables are also related to the free entropy defined in the matricial microstates approach. If Pk — P* £ C ( X i , . . . ,Xn) then —X(Xi+ePi,...,Xn
+ ePn) 6=0
provided Ji,...,Jn
= £ r(JkPk), l
exist.
3.15. The theory of x* is incomplete due to some unresolved technical problems like the continuity problem for $*(Xi +t^2Si,..., Xn + t1/2Sn) (only right semicontinuity has been proved). 3.16. An important question is unification, i.e., proving x = X* (maybe under certain conditions). For n = 1 this is easily verified. For n > 1 only the inequality x < X* has recently been established by P. Biane-M. Capitaine-A. Guionnet [2]. Though it is the easier of the two inequalities comparing x a n d X* t n e proof is quite difficult. The approach relies on the connection to large deviations for a n-tuple of Gaussian random matrices. Here x is related to the quantity to be estimated while x* to the rate function. This brings to the proof some powerful probabilistic techniques from large deviations and stochastic analysis. 3.17. We have also studied a notion of mutual free information for a pair A, B of von Neumann subalgebras in (M,T). The approach is similar to the free Fisher information approach to x* • I n * m s c a s e dx-.B is replaced by a new derivation dA..B : A V B -» (i4 V B)®2
Aspects of free probability dA-.Ba — a ® 1 - 1
153
OA-.BB = 0
where A V B is the algebra generated by >1 U 5 and its is assumed there is no nontrivial algebraic relation between A and B. 3.18. Bibliographical notes. The basic references for most of the material about entropy are the author's papers [29-33]. A survey of free entropy is in [36], where the unification problem and variational problem are briefly discussed. Applications to Neumann algebras are in [30], [8], [21], [18]. Some large deviations papers connected to entropy are [5], [2]. Further work about free entropy is in [3] and [12].
free also von free
4. Duality for t h e coalgebra of dx-.B 4 . 1 . The free analysis questions around free difference quotient derivations are perhaps best understood within the bialgebra duality framework to which we turn now. We have coassociativity relations: {dx-.B ® i d ) O dx-.B = ( i d ® 9 x : s ) O dx-.B
and (di
(M ® id)
o (id ®<9).
This structure has the remarkable feature of being selfdual. Let A' denote the dual of the vector space A. If (A,fi,d) is a GDQ ring then (A',d',fj,*) is a (topological) GDQ ring ("topological" refers to using a topology on A' and a topological tensor product). The one-line proof is that dualizing the relation for d o \i we have: tf o 8' = (9* ® id) o (id gi/x*) + (id «><9*) o (/i* ® id).
154
DAN VOICULESCU
4.4. The corepresentations (grouplike elements) of (A, fi, d), i.e., matrices a €E M.n{A) so that (d id.Mn)a — a ®Mn a-> are important in understanding the dual. If there is X € A so that dX = \®1 then elements of the form where j3j € A4„(kerd), are corepresentations. In the (M,T) context where we have many invertible elements, the above formula provides a good supply of corepresentations. Actually, we will use only those with /33 = pi = / , i.e., generalized matricial resolvents
a=
((3-X®In)-\
Roughly, the idea for a duality transform is to consider a map A'
9 if -* 0 ( y >
a
) (a) e 0
Mdlm a a
with a running over some set of corepresentations. We will do better by having the range of the map in a natural topological GDQ ring and the transform a morphism. 4.5. Let B be a C*-algebra (i.e., a selfadjoint algebra of operators on Hilbert space, closed in the norm-topology) and assume 1 € B. A fully matricial B-set is fi = (Cln)n>i where fi„ C Mn(B) are such that fim+„ n (Mm(B) © Mn(B)) = 0 m © Q.n and (AdS®IdB)(ft„)=fi„ where S € GL(n,C) and (Ad S)(T) = STS"1. CI is open if each Cln is open. Similarly, we define a scalar fully matricial analytic function on fl, to be / = (/n) n >i where / „ : 0 „ —* .M„(C) is analytic, /m+n|fim©fi„
=
fm © fn
and / „ o (Ad S ® id B ) = Ad S o fn. The set of such / will be denoted by A(fl). Fact ([37]). A (f2) is a topological GDQ ring. In case Q. = Cl* A(Q,) has a natural involution and a "dual positivity" relation. Here Cl* = (f^)„>i. For the definition of the comultiplication and dual positivity we refer to [37]. In case B = C, and fln = {Te Mn(C) | a(T) C Qi} then the comultiplication on the f\-component
of / is just the difference-quotient
h{z\)- hi**) z\ - Z2
on analytic functions. 4.6. Returning to B(X) C M, we can ensure that dx-.B is closeable in a suitable topology by a free semicircular perturbation X —» X + eS, in which case dx-.B will a l s o be defined on the
155
Aspects of free probability
ring A obtained by adjoining to B(X) the matrix-coefficients of all matricial B-resolvents (/? - X
\0-X®In
invertible}.
Then p(X;B) = (pn(X;B))n>i is an open fully matricial B-set we shall call the fully matricial B-resolvent of X. The duality transform A'Bip->((tp®id.Mn)(»
-X®
In)~l)n>i
e
A(p(X;B))
is then a GT>Q-ring morphism intertwining involutions and dual positivities. In case B = C, this boils down to the n = 1 component and to the map t)'xdp{t)
Meas (o-(X)) 9 p -> f (z -
where the right-hand side is C(p)(z) the Cauchy transform of p defined on C\a(X). dual multiplication, when defined, is such that C(p1#p2)(z)
=
The
C(p1)(z)C(p2)(z)
and the comultiplication
c(dp)(Zl,Z2) =
c
^ Z\ -
c
^ .
Z2
4.7. The dual multiplication underlies the definition of conjugate variables. Indeed, we have T(*J(X : B)) = (T ® r ) o dX:B(») = T#T. 4.8. The B-resolvents are quite natural in free probability. Indeed the "conditional probabilities" amount to replacing C by an algebra B, i.e.,
+
Y-ZI)-1)=T((X-U(Z)T1).
ls a The key to this result is the fact that the conditional expectation EVN(X+Y)\VN(X) GDQ-ring homomorphism of the bialgebra of dx-.c to the bialgebra of dx+Y-.c and hence maps corepresentations to corepresentations. (The subordination result is essentially due to the author, was followed by the realization of P. Biane that it was the consequence of
156
DAN VOICULESCU
an operator-valued subordination result and was t h e n taken u p again by us, generalized t o B-resolvents and explained via the bialgebra structure.) 4 . 1 0 . T h e use of matricial B-resolvents has been essential in some remarkable random matrix results obtained via free probability methods. One example is the recent result of U. Haagerup and S. Thorbjoernsen ([10]) t h a t for any noncommutative polynomial and the Gaussian i.i.d. n-tuple of hermitian random matrices limjv->oo \\P(TitN,...,Tn,N)\\ = | | P ( 5 i , . . . ,-Xn)|| almost surely. T h e other example is the work of D. Shlyakhtenko [16] on Gaussian random band matrices. 4 . 1 1 . B i b l i o g r a p h i c a l n o t e s . T h e main references for this section are [35], [37]. Concerning the combinatorics and the classical difference quotient bialgebra see [11]. For conditional free probability, freeness with amalgamation, etc., see [24], [27]. T h e analytic subordination work is in [28], [1] and [35].
Acknowledgments T h e author was supported in part by N S F Grant DMS-0079945. He t h a n k s Mrs. Faye Yeager (U.C. Berkeley) for typing the manuscript. P a r t of the work on these notes was done while the author held an International Blaise Pascal Research Chair, from the State and lie de France Region, managed by the Foundation de l'Ecole Normale Superieure and visited the Institut de Mathematiques de Jussieu. T h e author's participation in the International Congress of Mathematical Physics was supported in part by funds from the Clay Mathematics Institute as a CMI Emissary.
References 1. P. Biane, "Processes with free increments", Math. Z no. 1, 143-174 (1998). 2. P. Biane, M. Capitaine, A. Guionnet, "Large deviation bounds for matrix Brownian motion", Invent. Math. 152, 433-459 (2003). 3. P. Biane, R. Speicher, "Free diffusions, free entropy and free Fisher information", Ann. Inst. H. Poincare Probab. Statist. 37, 581-606 (2001). 4. E. Brezin, C. Itzykson, G. Parisi, J. B. Zuber, "Planar diagrams", Coram. Math. Phys. 59, 35-51 (1978). 5. T. Cabanal-Duvillard, A. Guionnet, "Large deviations upper bounds for the laws of matrixvalued processes and noncommutative entropies", Ann. Probab. 29, 1205-1261 (2001). 6. M. R. Douglas, "Stochastic master fields", Phys. Lett. B 344, 117-126 (1995). 7. K. J. Dykema, "Free products of hyperfinite von Neumann algebras and free dimension", Duke Math. J. 69, 97-119 (1993). 8. L. Ge, "Applications of free entropy to finite von Neumann algebras II", Ann. of Math. 147, 143-157 (1998). 9. R. Gopakumar, D. J. Gross, "Mastering the master field", Nucl. Phys. B 451, 379-415 (1995). 10. U. Haagerup, S. Thorbjoernsen, "A new application of random matrices: Ext(C£(F2)) is not a group", preprint (2002). 11. A. S. Joni, G.-C. Rota, "Coalgebras and bialgebras in combinatorics", in Studies in Applied Mathematics 6 1 , Elsevier North Holland, 1979, pp. 93-139. 12. A. Nica, D. Shlyakhtenko, R. Speicher, "Some minimization problems for the free analogue of the Fisher information", Adv. Math. 121, 282-347 (1999). 13. S. Popa, D. Shlyakhtenko, "Universal properties of L(Foo)", in Subfactor Theory, preprint (2002).
Aspects of free probability
157
14. F. Radulescu, "A one-parameter group of automorphisms of L(Foo) ® B(H) scaling the trace", C. R. Acad. Sci. Paris t. 314, Serie I, 1027-1032 (1992). 15. F. Radulescu, "Random matrices, amalgamated free products and subfactors of the von Neumann algebra of a free group of noninteger index", Invent. Math. 115, 347-389 (1994). 16. D. Shlyakhtenko, "Random Gaussian band matrices and freeness with amalgamation", International Math. Res. Notices 20, 1013-1025 (1996). 17. D. Shlyakhtenko, "Free quasi-free states", Pacific J. Math. 177, 329-368 (1997). 18. D. Shlyakhtenko, "On prime factors of type III", Proc. Nat. Acad. Sci. 97, 12439-12441 (2000). 19. I. M. Singer, "On the master field in two dimensions", Functional Analysis on the Eve of the 21st Century, Essays in Honor of the 80th Birthday of I. M. Gelfand, Progress in Math. 131, 263-283 (1995). 20. R. Speicher, "Combinatorial theory of the free product with amalgamation and operator-valued free probability theory", Mem. Amer. Math. Soc. 627 (1998). 21. M. B. Stefan, "The indecomposability of free group factors over nonprime-subfactors and abelian subalgebras", preprint. 22. G. t'Hooft, "A two-dimensional model for mesons", Nucl. Phys. B75, 461-470 (1974). 23. Y. Ueda, D. Shlyakhtenko, "Irreducible subfactors of index A > 4", preprint (2000). 24. D. Voiculescu, "Symmetries of some reduced free product C-algebras", in Operator Algebras and Their Connections with Topology and Ergodic Theory, Lecture Notes in Math 1132, Springer, 1985, pp. 556-588. 25. D. Voiculescu, "Circular and semicircular systems and free product factors", in Operator Algebras, Unitary Representations, Enveloping Algebras and Invariant Theory, Progr. Math. 92, Birkhauser, Boston, MA, 1990, pp. 45-60. 26. D. Voiculescu, "Limit laws for random matrices and free products", Invent. Math. 104, 201-220 (1991). 27. D. Voiculescu, "Operations on certain noncommuting operator-valued random variables", Asterisque no. 232, 243-275 (1995). 28. D. Voiculescu, "The analogues of entropy and of Fisher's information measure in free probability theory", Comm. Math. Phys. 155, 71-92 (1993). 29. D. Voiculescu, "The analogues of entropy and of Fisher's information measure in free probability theory II", Invent. Math. 118, 411-440 (1994). 30. D. Voiculescu, "The analogues of entropy and of Fisher's information measure in free probability theory III: The absence of Cartan subalgebras", Geom. Fund. Anal. 6, 172-199 (1996). 31. D. Voiculescu, "The analogues of entropy and of Fisher's information measure in free probability theory V: Noncommutative Hilbert transforms", Invent. Math. 132, 182-227 (1998). 32. D. Voiculescu, "The analogues of entropy and of Fisher's information measure in free probability theory VI: Liberation and mutual free information", Adv. Math. 146, 101-166 (1999). 33. D. Voiculescu, "A strengthened asymptotic freeness result for random matrices with applications to free entropy", International Math. Res. Notices No. 1, 41-63 (1998). 34. D. Voiculescu, "Lectures on free probability theory", Lectures on probability theory and statistics, Ecole d'ete des Probability's de Saint-Flour XXVIII, Lecture Notes in Math. 1738, Springer, 1998, pp. 280-349. 35. D. Voiculescu, "The coalgebra of the free difference quotient in free probability theory", International Math. Res. Notices 2, 79-106 (2000). 36. D. Voiculescu, "Free entropy", Bull. London Math. Soc. 34, 257-278 (2002). 37. D. Voiculescu, "Free analysis questions I: Duality transform for the coalgebra of dx-.a", to appear in International Math Research Notices (2004). 38. D. Voiculescu, K. J. Dykema, A. Nica, Free random variables, CRM Monograph Series 1, Amer. Math. S o c , Providence, RI, 1992. 39. E. P. Wigner, "Characteristic vectors of bordered matrices with infinite dimensions", Ann. Math. 62, 54 (1955).
This page is intentionally left blank
Invited session talks
This page is intentionally left blank
Condensed matter physics Session organized by J. YNGVASON (Wien) and Y. AvRON (Haifa)
This page is intentionally left blank
Adiabatic transport, Kubo formula and Anderson localization in some lattice and continuum models A.
ELGART
(Courant
Institute)
The different explanations of the Quantum Hall Effect rely on the validity of the linear response theory for a system that has infinite extent. We will present recent results on the adiabatic charge transport in this context for two dimensional lattice (joint work with M. Aizenman and J. Schenker) and continuum (joint work with B. Schlein) models of a non-interacting electron gas. It is proved that if the Fermi energy falls in the localization regime then the Hall transport is correctly described by the linear response Kubo formula. The localization condition is set forth by the fractional moment method, which is by now extended also to continuum models (joint work with M. Aizenman, S. Naboko, J. Schenker and G. Stoltz). In the present talk, besides localization criteria, we will discuss some ideas — Nenciu's asymptotic expansion, generalized space-momentum inequalities, and finite speed of propagation estimates — which enter the proof.
1. Introduction Since the discovery of the Quantum Hall Effect (QHE) by von Klitzing in 1980 [16] a great variety of different problems associated with the behavior of a two-dimensional electron gas in the presence of a magnetic field has been studied both experimentally and theoretically. In particular, the remarkable phenomenology associated with the QHE was generated by a number of theoretical physicists, notably by Laughlin [17], Thouless et al. [20], and Halperin [14]. Von Klitzing realized that a two dimensional electron gas at very low temperatures and strong magnetic field displays a quantization of the Hall conductance, that is the conductance measured in the direction transversal to the applied current. Specifically, the graph of the Hall conductance as a function of the magnetic field is a staircase function, where at the plateaux the value of the conductance is an integer multiple of e2/h, with astonishing accuracy. The robustness of the quantization is especially intriguing because on the microscopic level the samples have a lot of impurities, and are characterized by different concentration of electrons and by different geometries. The topological nature of the quantization, which explains such insensitivity, was first suggested by Thouless [20], and was understood latter in terms of topological invariants: Chern numbers [6] and Fredholm indices [9]. Two pictures were introduced for a description of QHE: "Edge currents picture" and "Bulk currents picture". The edge current picture [14] suggests that the Hall current flows in the narrow regions along the sample boundaries, so that the Hall voltage drops entirely in these regions. On the other hand, the description in terms of bulk currents [4,6] suggests that the Hall voltage drops gradually across the sample, and one can think about a system with an infinite extent. The natural proposal raised by Halperin suggests that in reality one should expect an intermix of these two pictures, and that the edge and the bulk conductance should coincide. Such a link between the edge and the bulk conductance was recently
163
164
A. ELGART
rigorously established by Kellendonk, Richter, and Schulz-Baldes [15], using methods of non commutative geometry, and shortly thereafter by Elbau and Graf [11], with functional analytic techniques, provided that the bulk conductivity is equal to the index of the pair of projections associated with the model and that there is a spectral gap at the Fermi energy. Here we present a derivation of the bulk conductivity for QHE models from first principles. As far as we are aware of, the analogous theory of the edge quantization has yet to be constructed. As was mentioned above, for the bulk models we can assume that the system has an infinite extent to begin with, and is characterized by two parameters, u> and A (we will use the atomic units e = m = h = 1 throughout the paper). The parameter w describes the frequency, or the rate at which the system is driven by the external field, while A describes its strength. There are two natural quantities that can be studied here: the Hall conductance ae and the Hall conductivity ay. In the first case the parameter A is equal to the total voltage drop AV across the sample and ae is a coefficient of proportionality in the (expected) Ohm's law: / = aeAV, where J is a Hall current. In the second case A is associated with the strength E of the (linear) electric field, and oy relates the current density j with E: j = ayE. In (and only in) two dimensional systems two quantities are expected to coincide: ae = ery.a Our results deal exclusively with the Hall conductance in 2D, with underlying Hilbert space being £ 2 (Z 2 ) or £ 2 (R 2 ). If one hopes for the linear relation / = aeAV to hold, and expects a total of N electrons to be transported during one cycle, a natural constraint on the two parameters w and A = AV arises: The total charge transported in the system is an integral over the time of the current which should behave as a*-1 • AV. Accordingly, we are forced to keep the latter relation fixed in the weak field regime (AV —> 0), where the linear relation between the current and the field is naturally anticipated13. Such a coupling is encoded in the adiabatic setup [8], where the Hamiltonian Ho of the system in equilibrium is perturbed by an external field of the form eg(et)Ai, with t describing a (physical) time, g G C^(0,1), e being an adiabatic parameter (eventually we will be interested in the limit e —> 0), and A standing for some (fixed) perturbation. In order to describe the electric field applied in the x\ direction, we choose Ai in the form of a multiplication operator depending on the variable x\ only (we will use the jargon "switch function in xi direction") in the coordinate basis, with a profile satisfying Ai(xi) = 0 for very negative values of the coordinate xi, and Ai(^i) = 1 for very positive values of x\. The corresponding voltage drop across the sample is AV = eg(et) at time t. This time dependent electric field will generate the Hall current, flowing in the x-i direction, which we then measure by detecting electrons crossing some given line perpendicular to x%. The spatial location of the line is essentially irrelevant in the limit e —> 0, since one expects that the current will be incompressible. In particular, one can sample the currents along different lines, and then compute some weighted average. This observation suggests the choice of the current operator (i.e., the observable whose quantum expectation gives the value of the current) in the form I := [H(t), A2], where A2 is another switch function, but in x? direction. The above picture, being very accessible from the mathematical viewpoint, is well tied by a b
For a cubic sample of size L in d dimensions aeL2~d ~ cry. T h e theory becomes simpler if the linear response limit, AV —* 0 is undertaken before the DC frequency limit, ui —» 0. However, such a setup does not describe the charge transport due to the above argument. For recent developments in this direction, see [10].
Adiabatic transport, Kubo formula and Anderson localization ...
165
the Faraday's law with the experiment of von Klitzing, where the voltage drop is induced by applied currents. The resulting formula for the conductance
2. Results Let us first glue together what was discussed in the introduction. We start with an equilibrium system on 1? or R 2 , described by some Hamiltonian Ho, acting on the corresponding Hilbert space (?{I?) or £2(R2). We then perturb it by time dependent external field, so that the full Hamiltonian is H := Ho + eg(et)A\, and we are interested in computing the number of electrons, which will cross the sample in the transversal direction X2 as a result of the perturbation (we will use the terminology "excess charge" for this quantity, and use the notation (Qe))- We assume that initially (at any time t < 0) the system is in an equilibrium state, described by the density matrix p(Ho). Then the instantaneous state pe(t) is given by the solution of the Heisenberg equation: idtPt{t) c
= [H{t),Pt{t)}\
Pe(0) = p(ffo).
(1)
Of course the natural definition of the localized regime in this context will be the constancy of ae in the corresponding range of the (Fermi) energies. Accordingly, the description of localized regime, given by (A2,A4') should be called a sufficient condition for localization. However, it is believed that in the context of random Schrodinger operators there is a jump discontinuity in the behavior of the transport exponents, so that for two dimensional systems either (Al) is satisfied or the system is characterized by at least diffusive behavior (see [13] for some rigorous results in this direction). This fact, together with the formal computation for conductance [1], indicate that in 2D (Al) can be also viewed as the necessary condition for localization, and therefore we omit the adjective "sufficient" from the description of (A2).
166
A. ELGART
In order to compute (Qe) we first consider the quantum mechanical expectation of the excess current (Ie)(t), given by (Q(t) := ti(Pe(t) - p(H0))I(t)
= tv(p£(t) - P(H0))[H(t),
A 2 ],
(2)
which should be viewed as a difference between the total current (coming from the pe part), and the equilibrium current (associated with p(H0) part). We then integrate (Ie) over the time in order to get (Qe). If the Ohm's law Ie(t) = aeAV{t)
= eg(et)
holds, then /•OO
/»1
{Qe) =&e
eg(et) dt = ae / g(s) ds. Jo Jo The results, established in two consequent works with Aizenman and Schenker (for the lattice case), and with Schlein (for continuum), state that under the assumptions (Al)-(A3) below Theorem 2.1. The following relation holds: lim(Qe)=K ->°
£
[ g(s)ds, Jo
(3)
with K:=2ntTp(H0)[[Aup(H0)},[A2,p(Ho)}},
(4)
which can be therefore identified with the conductance ae. Remark 2 . 1 . The formula for K, introduced in the theorem, is known as the Kubo formula for conductance. This object has a number of important properties: (1) the stability under deformations of the switch functions Ait2, that can be viewed as robustness with respect to the changes in the voltage distribution inside the sample and the incompressibility of the Hall current; (2) as a function of the Fermi energy, Ep, K remains constant inside the localization regime, described by the assumption (A2) — which explains the staircase form of
Adiabatic transport, Kubo formula and Anderson localization ...
167
the underlying Hamiltonian in order to control the trace norms, associated with quantum mechanical expectations. On the other hand, the Landau type models have been and remain the main prototype for the studies on QHE, therefore driving our motivation to prove the Kubo formula for them. We believe however that the theorem can be proven in a more general setup of Schrodinger operators, under the localization condition (A4') stated below, using methods derived for the lattice case and available results for the continuum. 2.1. Assumptions For the lattice case we set forth the following conditions: (Al) Locality of Hamiltonian: A self adjoint operator H on f 2 (Z 2 ) is said to be short range if there are fi > 0 and A < oo such that (x\H\y)\
<,Ae~^-
for all x ^ y.
(5)
(A2) Localization at the Fermi energy Ep: Let Hu be a random self-adjoint operator on £ 2 (Z 2 ). We say that Hu satisfies the fractional moment condition at E € R if for some 0 < a < 1, /x > 0, and A < oo, E
sup \ee[-l,l]
1
H.-E-ie
(6)
For the continuum we introduce the family of Landau Hamiltonians (A3) H0 acting on £ 2 (R 2 ): / B \2 / B \2 Ho = [Pl - —X2J + [P2 + —Xt) + V , where the potential V is smooth and relatively weak: || V||„ i00 < Cn for some integer n large enough (n = 5 will do), and ||V||oo < B/2 (which means that there is a spectral gap between the different bands associated with the Landau levels, and we choose Ep to fall into one of these gaps). What we really expect to be good assumptions in the continuum case are (A3') Ho is a Schrodinger operator on £2(R2) with sufficiently smooth potential — analog of (Al). (A4') Localization at Fermi energy Ep: E
sup Xz Ho-E-ie W-i,i]
Xw
where Xz is the indicator function of the cube of size one centered at z — analog of (A2). This condition corresponds to the natural generalization of the AizenmanMolchanov criteria for continuum Schrodinger operator, developed in a joint work with Aizenman, Naboko, Schenker, and Stolz [3].
168
A. ELGART
3. Basic tools Let us first highlight some physically and mathematically interesting features of the problem, and then describe natural tools required to overcome the corresponding difficulties. (1) Importance of the localized regime. The justification of the Kubo formula becomes a relatively easy task in the case when the Fermi energy EF falls into a gap (use of Nenciu expansion and propagation bounds below solve the problem). However, from the physical viewpoint, it is important to solve the problem in the absence of a gap. Indeed the widths of the Hall plateaux correspond to the band of localized states, since in the gap the filling factor remains constant (the integrated density of states does not change). The absence of the gap in the spectrum suggests that (naively) there is no intrinsic time scale associated with such a system. (2) Interesting topology. As was already stressed above, the Kubo formula is associated with certain Fredholm index, which already suggests that the effect comes from non trace class perturbation of Hamiltonian HQ. In fact it is the product of the density p£(t) — P{HQ) with the current operator I(t) which makes equation (2) trace class. (3) Non trivial time dependence. The total variation in the Hamiltonian H(t) is of order e, but we are interested in the cumulative effect during the time e _ 1 , therefore the simple perturbation theory will not work. The tools that help to tackle these difficulties are as follows: (1) We will argue below that the time scale is nevertheless present here, and comes from the density of the states in the vicinity of the Fermi energy. (2) Finite speed of propagation, equation (8) below, will be crucial in studying the trace class properties of the corresponding operators. Obtaining such a bound is a simple task on the lattice under (Al), but requires a substantial work for the continuum case, basically due to the time dependence of H(t). (3) The Nenciu expansion below (in the presence of the gap) and the adiabatic evolution introduced by Avron, Seiler, and Yaffe [8] are very handy tools for working with (slowly) time dependent problems. (4) In addition to these three items, one has to develop the various trace class estimates for the continuum case, where we have to tackle also the infrared and ultraviolet problems typical for Schrodinger operators. 3.1. Finite speed of propagation Let us start with an explanation of why such bounds are important. Morally, the density pt{t) — p0 that appears in equation (2) behaves like -if
Ue(t,s)[Aupo]Ue(s,t)g(es)ds, Jo
where Ue is the unitary propagator, satisfying: ieU€(s,t) = H(s)Ue(s,t),
Ue{t,t) = \.
(7)
Adiabatic transport, Kubo formula and Anderson localization
169
If the kernel of po is sufficiently localized (as it happens when Ep lies in a gap or (almost surely) in the localized regime (A2,A4')), then [p0, Ai] looks like a bump in the x\ direction. On the other hand [H0, A2] looks like a bump in the i 2 direction, which suggests that ||[Ai, / o 0 ][jy 0 ,A 2 ]||! < 00. If we substitute equation (7) into equation (2), a similar expression will appear, but with Uc lying between the two commutators. Finite speed of propagation below guarantees that also with Ut we get a trace class operator. We will use the following notation: (xi) = (I + 2; 2 ) 1 / 2 . Lemma 3.1. (Finite speed of propagation in continuum) Fix n,m £ N/2, and assume that V £ ^2(n+m),oo(IR2)- Then there is a constant D = D(n,m) such that, for i = 1,2, \\(xi)n(H(s)+irUc(s,t)(H(t)
+ i)-m-n(xi)-n\\
(8)
for all s , t £ l . 3.2. Nenciu expansion We want to solve the following Heisenberg equation iepe(s) =
\H(s),pe(s)},
where pc(0) is a spectral projection of the operator H(0). Naturally, one can look for asymptotic series pe(s) ~ B0(s) + eBxis) + e2B2(s) + ••• . Substitution into the Heisenberg equation leads to a sequence of differential equations iBj(s) = [H(s),Bj+1(s)]
j=0,...
(9)
However, these equations do not have a unique solution. The idea of Nenciu was to use an extra constraint, namely the fact that p 2 (s) = pe(s), so that: 3
Bj(s) = £
Bm(s)Bj_m(s)
j = 0,...
(10)
m=0
It turns out that the system of hierarchical relations 9, 10 has a unique explicit solution (Nenciu [18]). 3.3. Adiabatic evolution and effective gap Without the spectral gap, Nenciu expansion does not work, since for every power of e in the expansion we have to pay a price in singularity of Bj. However it is clear that pe(t) is not singular at all, so a different method is needed in the absence of a spectral gap. With introduction of a fictitious adiabatic evolution, suggested by Avron, Seiler and Yaffe [8], the problem essentially reduces to the control of the following object: Po Pe(l) (1 ~ Po) • Let us use the notation P A for the spectral projection onto the energy interval A around Ep • Then the following result holds:
170
A. ELGART
L e m m a 3.2. (1) po(l - PA)pe(l)
(1 - po) = 0(e f c A- f c ) for any k e N.
(2) If Ep falls in the localized regime then P&pe = PAQ + 0(ekA~k), Q is bounded and has (a.s.)
exponentially
localized
where the operator
kernel.
This result suggests t h a t one can create an "effective gap" of width A , as far as A > e. T h e price one has t o pay for t h e creation of this gap (i.e., t h e corresponding error) is dictated by the density of t h e states in t h e A-interval around the Fermi energy, namely by trxo-Pe- T h e interesting open question here is whether this upper bound on t h e error (clearly, t r Xo-Pe —* 0 as e —> 0) is also sharp. T h e belief of t h e author is t h a t t h e answer t o this question is yes, and it comes from the close similarity t o another time dependent problem. It t u r n s out t h a t an a t o m subjected t o t h e quantized radiation field and slowly varied external field behaves very differently with and without spectral gap between t h e ground state and a.c. spectrum [5]. In t h e presence of t h e gap t h e error of t h e adiabatic evolution is 0(ek) for any k, b u t in its absence t h e error is of some fixed power of e, which depends entirely on t h e density of t h e states in t h e vicinity of the ground state.
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19.
M. Aizenman, G. M. Graf, J. Phys. A: Math. Gen. 3 1 , 6783 (1998). M. Aizenman, A. Elgart, J. H. Schenker, preprint. M. Aizenman, A. Elgart, S. Naboko, J. H. Schenker, G. Stolz, preprint. H. Aoki, T. Ando, Solid State Commun. 38, 1079 (1981). J. E. Avron, E. Elgart, Comm. Math. Phys. 159, 399 (1999). J. E. Avron, R. Seiler, Phys. Rev. Lett. 54, 259 (1985). J. E. Avron, R. Seiler, B. Simon, Comm. Math. Phys. 159, 399 (1994). J. E. Avron, R. Seiler, L. G. Yaffe, Comm. Math. Phys. 110, 33 (1987). J. Bellissard, A. van Elst, H. Schulz-Baldes, J. Math. Phys. 35, 5373 (1994). J. M. Bouclet, F. Germinet, A. Klein, J. H. Schenker, in preparation. P. Elbau, G. M. Graf, Comm. Math. Phys. 229, (2002). A. Elgart, B. Schlein, to appear in Comm. on Pure and Applied Math.. F. Germinet, A. Klein, preprint. B. I. Halperin, Phys. Rev. B 25, 2185 (1982). J. Kellendonk, T. Richter, H. Schulz-Baldes, Rev. Math. Phys. 14, 87 (2002). K. v. Klitzing, G. Dorda, M. Pepper, Phys. Rev. Lett. 45, 494 (1980). R. B. Laughlin, Phys. Rev. B 23, 5632 (1981). G. Nenciu, Comm. Math. Phys. 152, 479 (1993). R. E. Prange, S. M. Girvin (eds.), The Quantum Hall Effect. Graduate Texts in Contemporary Physics, Springer-Verlag, New York, 1987. 20. D. J. Thouless, M. Kohomoto, M. P. Nightingale, M. den Nijs, Phys. Rev. Lett. 49, 405 (1982).
Transport in adiabatic quantum pumps G. M. G R A F (ETH Zurich), A. ELGART (Courant
L.
SADUN
(U. Texas), K.
SCHNEE
Institute),
(Schrodinger Institute,
Vienna)
A quantum pump is an externally driven device coupled to reservoirs at equilibrium with one another. We consider transport phenomena when the electrons are independent and the driving is slow compared to the dwell time of particles in the pump. The charge transport associated with a given change of pump parameters is characterized in terms of 5-matrices pertaining to time-independent junctions. In fact, several transport properties (charge, dissipation and, at positive temperature, noise and entropy production) may be expressed in terms of the matrix of energy shift which, like Wigner's time delay to which it is dual, is determined by the S-matrix. We discuss transport at a semiclassical level, including geometric aspects of transport such as the question of charge quantization. On the analytical side we present an adiabatic theorem on transport for open gapless systems.
1. Introduction An adiabatic quantum pump is a time-dependent scatterer connected to several leads. The idealized setting is as follows: a pump, whose internal configuration varies slowly in time in a prescribed manner, is connected to n channels, along each of which independent electrons of charge e = 1 can enter or leave the pump. We assume that electrons do not have spin nor ^
^
1 2
Figure 1. The pump proper with n channels.
degrees of freedom transverse to the channel and may be thought of as a (non-relativistic) particle moving on a half line. The incoming electron distribution, at zero temperature, is a Fermi sea with Fermi energy /z common to all channels. As a rule, this does not apply to the distribution of the outgoing electrons, as their energies may have been shifted while scattering at the pump. Because of this imbalance a net current is flowing in the channels. The expected charge transport is expressed by the formula [9,11,12] dQ.
=
±{{dS)S')n
•
(1)
Here S = {Sij) is the n x n scattering matrix at energy [i computed as if the pump were frozen into its instantaneous configuration. A change of the configuration is accompanied
171
172
G. M. GRAF, A. ELGART, L. SADUN, K. SCHNEE
by a change S —> S + dS of the scattering matrix and by a net charge dQj leaving the pump through channel j . We recall that the definition of the S matrix involves the comparison of the dynamics with that generated by a reference Hamiltonian, e.g. a configuration with channels disconnected from the pump. Equation (1) is of a form well-known in thermodynamics for describing exchanges between a system and a reservoir, like dW = —pdV for the reversible work exchanged through a piston. In fact, dQj depends on the change of the system, but not on its rate, provided it is slow. Moreover dQj is expressed through S which, like p and V, is determined statically and from outside the system. As it is appropriate for a pump,
V
dQj = ^- tr((dS)5*) = -^-dlogdetS = - d £ .
r~~f
27T
Z7T
Here tr(-) denotes the trace of a n x n matrix, in contrast to Tr(-) applying to operators on the (single particle) Hilbert space. As a result, the Kirkhoff law only holds when integrated over a cycle: / £ ) " = i dQj = 0. Derivations of equation (1) and of related transport properties have been given in various settings and at different levels of rigor [3,7,9,11,12,16].
2. Semiclassical description and quantization of charge transport A semiclassical description of the effect of the pump on the electrons may be given pretending that they have (i) classical positions and momenta as long as they freely move in a channel and (ii) that these phase space coordinates are changed according to quantum mechanical scattering when they traverse the pump. As for (i), canonical phase space coordinates of a particle are conveniently chosen as its time i 6 R of passage at some fiducial point, e.g. at the connection x — 0 of the channel [0, oo) with the pump, and its energy E G [0, oo). The aspect (ii) can be described in terms of the frozen scattering matrix S(E) of the pump in effect at the time it is being traversed by the electron. The scattering matrix so acquires a parametric time dependence, S(E,t). Indeed, the shift induced by the scattering on the coordinate t is the well-known Wigner time delay [14,21] T
(E,t)
= -i-^(E,t)S*(E,t).
(2)
More precisely its diagonal element Tjj(E, t) has the meaning of the average time delay of a particle exiting channel j . In fact, consider an incoming wave packet J dke~i(-kx+u(-h^
Transport in adiabatic quantum pumps
173
the probability \Sji\2 for the particle to have come from channel i, we find for the average delay n
n
£|%|2(arg%)' = I m ^ % ^ =7^, i=l
i=l
Similarly, the scattering process affects the coordinate E by the energy shift [4] AC
£(E,t) = i~(E,t)S*(E,t).
(3)
While this claim appears plausible in view of the duality between equations (2, 3) one may find it puzzling because each matrix S(E, t) originates from a frozen Hamiltonian and hence from an energy-conserving dynamics. Of course, the dynamics of the electron in an operating pump is generated by a time-dependent Hamiltonian H(t) and the appropriate (dynamical) scattering operator Sd is that relating it to the reference Hamiltonian HQ. Moving the time origin from 0 to r results in H (t) —> H(t + r) and in a unitarily equivalent scattering operator Sd{r), Sd{T) = e-iHorSdeiH°T. (4) Written as an equation of motion, dSdjdr = — i[Ho, Sd(r)}, the equation can be reorganized as [17] H0 = Sd(t)H0S*d(t)
+^S*d(t),
assuming unitarity of Sd. Conjugation by the scattering matrix takes outgoing observables to incoming observables. The last term on the r.h.s. is thus to be identified with the energy shift [17]. The relation to its frozen counterpart (3) emerges in the adiabatic limit and can be clarified by means of coherent states [5] or pseudodifferential operators [6]. The net charge leaving the pump through channel j in the time interval [0, T] is the difference between the outgoing and incoming contributions:
Qj = ^J
dE' J dt'p{E)- — j
dEj
dtp(E),
(5)
where p is the occupation of incoming states, e.g. p(E) = Q(p, — E) in the zero temperature situation considered in section 1. In the first integral E is given through the map
$ : (E',f) ~ (E,t) =
(E'-£ij(E',t!),t'-Tjj(E',t)),
which describes the effect of the pump on the energy and the time of passage of an electron in terms of the outgoing data (E',t'). The denominator 2TT is the size of the phase space cell of a quantum state. By linearizing in the small energy shift £jj we obtain Qj{t)
=
1 ~2^J
r°°dE
P'(E)£n(E,t),
(6)
(' = d/dt), which reduces to 1 Qi{t) =—SjMt), 2TT
i.e., to equation (1), at zero temperature.
(7)
174
G. M. GRAF, A. ELGART, L. SADUN, K. SCHNEE
An alternate expression for the transported charge may be derived from equation (5) by using $ as a change of variables in the first integral. Its Jacobian is 1 — Q,jj(E',t'), where Cljj is the divergence of the displacement (£JJ,TJJ). Remarkably, ttjj is a diagonal entry of the commutator of time delay and energy shift Q = i[T,£\.
(8)
The result is the following balance equation fT
2-KQJ^
Jo
/»oo
dtp(0)£j:j(0,t)
-
Jo
dEp(E)Tj:i{E,t)
/*oo
t=T t=0
+
pT
dE dtp(E)Q.j:i(E,t). Jo Jo
(9)
The first term on the r.h.s. describes the release and trapping from bound states. It can in fact be seen [6] that £jj(0,t) consists of delta functions peaked at times t when an eigenvalue is branching off or merging with the continuum at E = 0. The middle term describes the depletion of the outgoing flow as a result of a time delay increasing over time, since effectively no charge is exiting during a time Tjj\\z£. The last term describes electrons that are being reshuffled between leads without delays, i.e., consistently with Kirkhoff's law, since Y^j=i tyj = trfi = 0 by equation (8). In addition to (7), but without explanation, we mention further transport properties (pointwise in time) [4,6] which can be expressed in terms of the energy shift: (i) Dissipation is the part of the energy flow Ej into lead j except for the part, p.Qj, which can be recovered from the lead (or reservoir) by reclaiming the transported charge. The expression at zero temperature is Ej(t) - (iQjit) = i j ; ( f 2 ) W (/i,t) > 0. (ii) The entropy and noise currents are given as the difference between the outgoing and incoming entropy (or noise) currents. The expressions are sJ(t,^) = ^-k((S%j-(£jj)2)>0, <we
k = {2 e n t r ° P y ; 16 noise,
(10)
the results being valid for small temperatures / 3 _ 1 provided however that /3 remains small w.r.t. to the adiabatic time scale.
3. Quantization of charge transport Early work [20] on pumps showed that a system which is both space and time periodic with filled bands of electrons exhibits quantized transport at zero temperature and in the adiabatic regime for reasons related to Chern numbers. In our case, where the pump is spatially compact, equation (9) allows for a comparison with that situation. Assuming that the system has no bound states, the charge transported during a period T is Qj = w~ I &jj(E,t)dE lit Jc
A dt,
(11)
where the integration is now over a finite cylinder (E, t) 6 C — (M mod T) x [0, /x]. Actually, since £(0, t) = 0 by the assumption just made, the bottom of the cylinder may be pinched
Transport in adiabatic quantum pumps
175
to a point and the cylinder turned to a disk. Consider now the complex line bundle with base C and fiber C C n spanned by the state feeding channel j \i)(E,t)) = S*(E,t)\j),
|j) = (*«)(i=i,...n).
The connection Pjd, Pj(E,t) = \tp(E,t)){ip(E,t)\, Ejjdt — TjjdE and curvature
(12)
has connection 1-form i[(dS)S*]jj —
-i(dS A dS*)j:j = dEjj Adt + dEA dTj:i = Slj:jdE A dt, which exhibits the integrand of equation (11) as a Chern character. However the analogy with [20] ends here and does not provide Chern numbers as a rule, since the manifold C has a boundary and, besides, equation (12) provides a global section, trivializing the bundle. There is an exception to this rule, however. The bundle is the pull-back of the canonical line bundle L over P C " - 1 via 7r o ip, where ip : C —> C" is given by equation (12) and 7r : C™ H-> PC™ -1 is the Hopf map. Hence, Qj = (27r) _1 L 0 1 I U C ) w, where (27r)_1u; is the first Chern character of L. The exception thus is: If d~K{%j){C)) = 0 , i.e., if \ip((i,t)) is constant in t up to a phase along the period, then Qj is an integer. This last conclusion can be reached more quickly from equation (7) since, by Sjj(/j,,t) = —i(il>\ip), the charge Qj =
4. An adiabatic theorem The basic equation (1) may be given a more firm footing on the basis of the Schrodinger equation. We sketch here (see [7] for full details) the mathematical framework. The singleparticle Hilbert space is given as H = H0®L2(R+,Cn),
(13)
where states in L2(R+,Cn) = ®" = 1 L 2 (R + ), resp. in Ho, describe an electron in one of the leads j = 1, n, resp. in the pump proper, cf. figure 1. Let Ho : Ji —> H denote the projection onto H.Q. We consider Hamiltonians H on H satisfying Hi/> = -d2ijj/dx2
for V € C^°(M + ,C n ),
||(if + i)~ m n 0 ||i < C for some fixed C> 0 and m £ N, <7pp(ff) n (0, oo) = 0. The first assumption states that particles in the channels are free. The second one, where || • ||i denotes the trace class norm over H, essentially requires that the pump proper contains
176
G. M. GRAF, A. ELGART, L. SADUN, K. SCHNEE
finitely many states below any fixed energy. The third one forbids embedded eigenvalues. These assumptions guarantee that the pump proper is finite, but place no restriction on its constitution except that it should leak at all positive energies. Since the pump configuration is supposed to change slowly in time t, we consider the evolution of the electrons in an adiabatic limit, where s = et is kept fixed as e > 0 tends to 0. In terms of the rescaled time coordinate s, called epoch, the propagator Ue(s,s') on H satisfies the non-autonomous Schrodinger equation idtUe(s, s') = £-xH{s)Ue{s,
s'),
UE(s', s') = 1,
where the Hamiltonians H(s) are of the above kind with H(s) — H(s') bounded and smooth in s. It is convenient to start the system from an equilibrium state. This is done by positing its 1-particle density matrix to be of the form p{H{so)) at some initial epoch so- Here p is a function of bounded variation with suppiip C (0, oo). In particular, an admissible choice is the Fermi sea, where p(E) = 9(p, — E). The time evolution then acts as p(H(s0)) ^ Pe(s) = U£(s, so) p(H(s0)) Ue(s0, s).
(14)
It should be noted that P£(s) is not an equilibrium state for H(s). In a single channel R + 9 x, the operator detecting a particle past some fiducial point x = a is the (smooth) characteristic function F(x > a). For technical reasons to be described below we found it preferable to use \A\ instead of x, where A is (px + xp)/2 (or a related operator) on the subspace L 2 (R+) of a chosen channel j and 0 on the remaining subspaces in equation (13). The current operator then consistently is Ij(a) = i[H(s),F(\A\ > a)], and Tr(Pe(s)Ij(a)) is its expectation value. The parameter a may be vaguely interpreted as the position of the ammeter. The fundamental result equation (1) may thus be given the following reformulation [7,19]. T h e o r e m 4 . 1 . Under the above assumptions we have for s > SQ
lim l i m e - ^ P e O O J ^ o ) ) = - — / o—too £-»o
2ir J0
dp(E)( — S*) \ as
,
(15)
/ jj
where S(E, s) is scattering n x n matrix (fiber) of the scattering operator for the pair (H(s),Ho) with Ho being any fixed reference Hamiltonian. The double limit is uniform in s £ I, I being a compact interval, whence it carries over to the transfered charge f Jo
dt'Ti{Pe{et')Ij{a))
= e-1 f ds'Tr(F £ (s')^(a))Jo
We shall conclude this section with several remarks about the theorem and its derivation. They are mostly heuristic, but may serve as a guide through the complete proof. The limit a —» oo is done in order to obtain a scattering description. Though a (positive energy) particle may require a large time (not epoch) interval to complete scattering, its length remains bounded in e small. Therefore the adiabatic limit e —> 0 is to be performed first. Equation (15), like equation (6), reduces to equation (1) at zero temperature. The above result is a statement about the adiabatic behavior of certain open, gapless systems, as explained shortly. Typically, adiabatic theorems are concerned with the evolution
Transport in adiabatic quantum pumps
177
Ue(s,so)PUe(s,so)*, where P is the spectral projection of H(so) onto a separated part of its spectrum [8,18] or on an embedded eigenvalue [2,10], and possibly with its deviation from the corresponding projection of H(s). The same is true here, see equation (14), except that P = p(H(so)) corresponds to a gapless part of the continuous spectrum. In fact, the theorem is implied by the following statement, which probes that deviation: Split the current 7,-(o) = i[H,F(\A\ > a)} = i{H,F(A < - a ) ] + i[H,F{A > a)] = ir(a) + I+(a) into incoming and outgoing parts. Then, in the double limit of equation (15), e-1Tr(Pe(s)I-(a)) 1
e- Tr(Pe{s)I+(a))
= e-1Tv(p(H(s))lr(a))
+ o(l),
1
= e" Tr (p(H(s))I+(a))
+ Tr([S^(s),
p(H(s))}I+(a))
+ o(l),
where oo
/
dtteiHMtH{s)e-iH^)t.
(17)
-oo
According to these estimates the state (14) deviates from equilibrium only in its outgoing part and may be approximated on that part by p(H(s))+s[S^(s),p(H(s))}
+ ..-.
(18)
The asymmetry w.r.t. the incoming part, p(H(s)), is of course due to the assumption s > s0, i.e., to the fact that the current measurement occurs after the preparation of the state. It should moreover be noted that in the sum of the r.h.s. of equations (16) the equilibrium contributions cancel, Tr(p(H(s))(I~(a) + I+(a))) = 0. The theorem also represents an instance where linear response theory can be justified rigorously, cf. [13], without there being a dissipation mechanism. Indeed, by the chain rule for wave operators, we may pretend that the reference Hamiltonian used in equation (15) is H(s) itself and thus prove the result with (dS/ds)S* replaced by ds>S(s',s)s<=s, where S(s',s) is the scattering operator for the pair of autonomous Hamiltonians (H(s'),H(s)). However, as explained in connection with equation (4), the relevant object is the dynamical scattering operator, now relating the ^-dependent Hamiltonian H(s+et) — H(s) +sH(s)t + ••• to H(s). In linear response approximation it equals Sd = 1 + eS^(s) + ••-, and equation (18) comes from Sdp(H(s))S^ and can be rewritten by means of [S^(s),p(H(s))}
= -ids,S(s',s)s<=3p'(H(s)),
(19)
with oo
/
iH t dte
^ H(s)e-iHMt.
(20)
•oo
Here, 1 + (s' — s)ds>S(s',s)a>=s + ••• is the Born approximation for S(s',s). Formally, equation (19) follows for ^(A) = e1^*, (t e U), by a change of variables in equation (17) and hence for general functions p. The importance of equation (19) is twofold. First, it reduces matters to frozen scattering data and their derivatives, as required by the theorem. Second, it makes it clear why only the variation dp(E) matters and, in particular, why for the Fermi sea p(A) = 8(n — A) only states at the Fermi energy p, contribute to the current. As a final remark we mention that the integrals (17, 20), once multiplied with if {a), can be shown to converge using propagation estimates.
178
G. M. G R A F , A. ELGART, L. SADUN, K. SCHNEE
Acknowledgments We t h a n k J. Avron, without whom t h e results presented here would not have been possible. This work is supported in p a r t by the Texas Advanced Research P r o g r a m and N S F Grant PHY-9971149; t h e E U grant H P R N - C T - 2 0 0 2 - 0 0 2 7 7 .
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22.
A. Andreev, A. Kamenev, Phys. Rev. Lett. 85, 1294 (2000). J. E. Avron, A. Elgart, Coram. Math. Phys. 203, 445 (1999). J. E. Avron, A. Elgart, G. M. Graf, L. Sadun, Phys. Rev. B 62, R10618 (2000). J. E. Avron, A. Elgart, G. M. Graf, L. Sadun, Phys. Rev. Lett. 87, 236601 (2001). J. E. Avron, A. Elgart, G. M. Graf, L. Sadun, J. Math. Phys. 4 3 , 3415 (2002). J. E. Avron, A. Elgart, G. M. Graf, L. Sadun, arXiv:math-ph/0209029. J. E. Avron, A. Elgart, G.M. Graf, L. Sadun, K. Schnee, arXiv:math-ph/0209029. J. E. Avron, R. Seiler, L. G. Yaffe, Comm. Math. Phys. 110, 33 (1987). P. W. Brouwer, Phys. Rev. B 58, 10135 (1998). F. Bornemann, Lecture Notes in Mathematics, vol. 1687, Springer, 1998. M. Biittiker, A. Pretre, H. Thomas, Phys. Rev. Lett. 70, 4114 (1993). M. Biittiker, H. Thomas, A. Pretre, Z. Phys. B 94, 133 (1994). A. Elgart, these proceedings. L. Eisenbud, Dissertation, Princeton University, 1948, unpublished. L. D. Landau, E. M. Lifshitz, Statistical Mechanics, Pergamon Press, 1978. L. S. Levitov, H. Lee, B. Lesovik, J. Math. Phys. 37, 4845 (1996). P. A. Martin, M. Sassoli de Bianchi, J. Phys. A 28, 2403 (1995). G. Nenciu, Comm. Math. Phys. 152, 479 (1993). K. Schnee, Dissertation, ETH-Zurich, 2002, unpublished. D. Thouless, Phys. Rev. B 27, 6083 (1983). E. P. Wigner, Phys. Rev. 98, 145 (1955). D. R. Yafaev, Mathematical Scattering Theory, AMS, 1992.
One-dimensional behavior of dilute Bose gases in traps ROBERT SEIRINGER, ELLIOTT
H. LIEB (Princeton), JAKOB
YNGVASON
(U. Wien)
We report on a rigorous analysis of the ground state of a dilute Bose gas in an external confining potential in the limit where the confinement becomes very strong in two dimensions and as a result the system behaves essentially one-dimensional. We show that the system is well described by a one-dimensional model of Bosons with delta function interaction that was solved long ago by Lieb and Liniger. We explicate 5 parameter regions in which various types of behavior of the system occur. This is relevant for the interpretation of recent experiments on ultra-cold atomic gases in strongly elongated traps.
1. Introduction Recently it has become possible to do experiments in highly elongated traps on ultra-cold Bose gases that are effectively one-dimensional [1-4]. These show peculiar features predicted by a model of a one-dimensional Bose gas with delta function two-body interaction analyzed long ago [5,6], like quasi-fermionic behavior [7], the absence of Bose-Einstein condensation (BEC) in a dilute limit [8-10], and an excitation spectrum different from that predicted by Bogoliubov's theory [6,11,12]. The theoretical work on the one-dimensional behavior of the system in elongated traps has so far been based either on variational calculations, starting from a 3D pseudo-potential, or on numerical Quantum Monte Carlo studies. In contrast, in [13] a rigorous study of the behavior of the system, starting from the basic Schrodinger equation, was undertaken, and we give a summary of this work here. The complicated nature of the problem can be seen from the fact that for an interaction potential with a hard core, as we consider here, the true ground state wave function does not approximately factorize in the longitudinal and transverse variables (otherwise the energy would be infinite) and the effective one-dimensional interaction potential can not be obtained by simply integrating out the transverse variables of the three-dimensional interaction potential. Nevertheless we are able to demonstrate rigorously that the one-dimensional behavior really follows from the fundamental Schrodinger equation. It is also important to delineate, as we do here, precisely what can be seen in the different parameter regions. The proofs of our assertions are too long to be given here and can be found in [13]. We emphasize that everything can be rigorously derived from first principles.
2. Setting and main results We shall now describe the setting more precisely. It is convenient to write the Hamiltonian in the following way. We choose units such that H = 2m = 1, where m denotes the particle mass. The Hamiltonian is than given by JV
HN,L>r>a = '£(-Aj+Vrx(Xf)
+ VL(zj))+
j=l
Yl l
179
«a(|xi-x,-|)
(1)
180
ROBERT SEIRINGER, ELLIOTT H. LIEB, JAKOB YNGVASON
with x = (x, y, z) = (x-1, z) and with ^ ( x ^ ^ K ^ x
1
/ ^ ,
VL{z) = ±V{z/L),
U o (|x|)
= lt;(|x|/a).
(2)
Here, r,L,a are variable scaling parameters while V-1, V and v are fixed. The interaction potential v is supposed to be nonnegative, of finite range and have scattering length 1; the scaled potential va then has scattering length a. (See, e.g., [14] or [15] for the definition of the scattering length.) The external trap potentials V and V1- confine the particles in the longitudinal and the transversal directions, respectively, and are assumed to be locally bounded and tend to oo as |z| and Ix-1! tend to oo. To simplify the discussion we find it also convenient to assume that V is homogeneous of some order s > 0, namely V(z) = \z\s, but weaker assumptions, e.g. asymptotic homogeneity [16], would in fact suffice. The case of a simple box with hard walls is realized by taking s = oo, while the usual harmonic approximation is s = 2. It is understood that the lengths associated with the ground states of —d2/dz2 + V(z) and —Ax + y- L (x J -) are both of the order 1 so that L and r measure, respectively, the longitudinal and the transverse extensions of the trap. We shall always be concerned with the ground state of the Hamiltonian (1) and with large particle number, N 3> 1, which is appropriate for the consideration of actual experiments. The other parameters of the problem, o, r and L, will satisfy a
= N J l * o ( x , x 2 , . . . , X i V )| 2 d 3 x 2 • • -cftcjv •
(3)
On the average, this 3D density will always be low in the parameter range considered here (in the sense that the mean distance between particles is large compared to the scattering length a). However, the effective ID density can be either high or low. Note that, in contrast to 3D gases, high density in ID corresponds to weak interactions and vice versa [5]. In parallel with the 3D Hamiltonian (1) we consider the Hamiltonian for n Bosons in ID with delta function interaction and coupling constant g > 0, i.e., H™=ir-dydz]+g j=l
Y,
Sizi-zj).
(4)
l
We consider this Hamiltonian for the Zj in an interval of length £, and are interested in the thermodynamic limit t —> oo, n —> oo with p = nji fixed. The ground state energy per particle in this limit is independent of boundary conditions and can, according to [5], be written as elD(p) = P2e(g/p), (5) with a function e(t) determined by a certain integral equation. Its asymptotic form is e(t) K \t for t < 1 and e(t) -> TT 2 /3 for t -> oo. Thus elD(p) » \gp for g/p « 1
(6)
4D(P) « (?r 2 /3)p 2 for g/p » 1.
(7)
and
One-dimensional behavior of dilute Bose gases in traps
181
This latter energy is the same as for non-interacting fermions in ID, which can be understood from the fact that (4) with g = oo is equivalent to a Hamiltonian describing free fermions. Taking /oeJD(p) as a local energy density for an inhomogeneous ID system we can form the energy functional oo
/
( | V v ^ ) | 2 + VL(z)p(z) + p{zfe{glp{z)))
dz.
(8)
-oo
Its ground state energy is obtained by minimizing over all normalized densities, i.e.,
E1D(N,L,g)=hdU\p] : p > 0, J" p(z)dz = N\ .
(9)
Using convexity of the map p — i > p3e(g/p), it is standard to show that there exists a unique minimizer of (8) (see, e.g., [15]). It will be denoted by p}p£„g- We also define the mean ID density of this minimizer to be 1
r°°
P=xJ_oo(p1N,L,g(z))
i
dz.
(10)
Note that, in general, this mean density depends on g besides N and L. The order of magnitude of p in the various parameter regions will be described in the next section. Our main result relates the 3D ground state energy of (1), ECiM(N,L,r,a), to the ID 1D density functional energy E (N,L,g) in the large TV limit provided r/L and a/r are sufficiently small. For this to be true the parameter g has to be chose appropriately, of course. To give the precise statement, let e x and b(x±), respectively, denote the ground state energy and the normalized ground state wave function of — A x + V ± (x- L ). The corresponding quantities for -A1 + V ^ x 1 ) are e x / r 2 and ^ ( x - 1 ) = ( l / r ) 6 ( x ± / r ) . In the case that the trap is a cylinder with hard walls 6 is a Bessel function; for a quadratic V1- it is a Gaussian. In any case, 6 is a bounded function, and in particular b £ L 4 (R 2 ). We can therefore define 9 by
l6(x±)l4rf2x± = 8™ / I M x ^ l 4 ^ .
9-^rJ
(ii)
Our main result is: Theorem 2.1. Let N —> oo and simultaneously r/L —> 0 and a/r —> 0 in a way such that r2p • min{p, g} —> 0. Then l m
EQM(N,L,r,a)-Ne±/r2 E">{N,Ltg)
= 1
-
(12)
An analogous result holds for the quantum mechanical ground state density. Define the ID QM density by averaging over the transverse variables, i.e.,
Let L := N/p denote the extension of the system in ^-direction, and define the rescaled density p by PlN,L,g{z) = jP(z/L).
(14)
182
ROBERT SEIRINGER, ELLIOTT H. LIEB, JAKOB YNGVASON
Note that, although p depends on N, L and g, ||p||i = ||p]|2 = 1, which shows in particular that L is the relevant length scale in z-direction. We have T h e o r e m 2.2. In the same limit as considered in Theorem 2.1,
in weak Ll sense. Note that because of (6) and (7) the condition r2p • min{p~, g} —> 0 is the same as eJ D (p) « 1/r2,
(16)
i.e., the average energy per particle associated with the longitudinal motion should be much smaller than the energy gap between the ground and first excited state of the confining Hamiltonian in the transverse directions. This is sufficient for ID behavior, it is not necessary to have a ~ r, as might have been supposed. This is an intrinsically quantum-mechanical phenomenon.
3. Discussion We will now give a discussion of the various parameter regions that are included in the limit considered in Theorems 2.1 and 2.2 above. We begin by describing the division of the space of parameters into two basic regions. This decomposition will eventually be refined into five regions, but for the moment let us concentrate on the basic dichotomy. In earlier work [15,16] we proved that the 3D Gross-Pitaevskii formula for the energy is correct to leading order in situations in which iV is large but a is small compared to the mean particle distance. This energy has two parts: The energy necessary to confine the particles in the trap, plus the internal energy of interaction, which is N4:irap3D. This formula was proved to be correct for a fixed confining potential in the limit N —> oo with a3p3D —> 0. However, this limit does not hold uniformly if r/L gets small as N gets large. In other words, new physics can come into play as r/L —> 0 and it turns out that this depends on the ratio of a/r2 to the ID density, or, in other words, on g/p. There are two basic regimes to consider in highly elongated traps, i.e., when r < L . They are — the ID limit of the 3D Gross-Pitaevskii regime; — the 'true' ID regime. The former is characterized by g/p
One-dimensional behavior of dilute Bose gases in traps
183
In both regions the internal energy of the gas is small compared to the energy of confinement. However, this in itself does not imply a specifically ID behavior. (If a is sufficiently small it is satisfied in a trap of any shape.) ID behavior, when it occurs, manifests itself by the fact that the transverse motion of the atoms is uncorrelated while the longitudinal motion is correlated (very roughly speaking) in the same way as pearls on a necklace. Thus, the true criterion for ID behavior is that g/p is of order unity or larger and not merely the condition that the energy of confinement dominates the internal energy. We shall now briefly describe the finer division of these two regimes into five regions altogether. Three of them (Regions 1-3) belong to the weak interaction regime and two (Regions 4-5) to the strong interaction regime. They are characterized by the behavior of g/p as N —> oo. In each of these regions the general functional (8) can be replaced by a different, simpler functional, and the energy Elu(N,L,g) in the theorem by the ground state energy of that functional. Analogously, the density in Theorem 2.2 can be replaced by the minimizer of the functional corresponding to the region considered. The five regions are • Region 1, the Ideal Gas case: In the trivial case where the interaction is so weak that it effectively vanishes in the large TV limit and everything collapses to the ground state of —d2/dz2 + V(z) with ground state energy e", the energy E1D in (12) can be replaced by Ne^/L2. This is the case if g/p
This case is
oo
/
(I V^p(z)\2
+ VL{z)p{z) + \gp{zf)
dz ,
(17)
-oo
corresponding to the high density approximation (6) of the interaction energy in (8). Its ground state energy is EGP(N,L,g) = NL~2EGP(1,1, NgL), by scaling. • Region 3, the I D TF case: N~2 < g/p < 1, with p ~ (N/L)(NgL)~1/(-s+1), where s is the degree of homogeneity of the longitudinal confining potential V. This region is described by a Thomas-Fermi type functional oo
/
{VL{z)p{z) + \gp{z)2)dz.
(18)
-oo
It is a limiting case of Region 2 in the sense that NgL » 1, but a/r is sufficiently small so that g/p -C 1, i.e., the high density approximation in (6) is still valid. The explanation of the factor (NgL)1^3+1') is as follows: The linear extension L of the minimizing density of (17) is for large values of NgL determined by VL(L) ~ g(N/L), which gives L ~ (NgL)1/(-s+1^L. In addition condition (16) requires gp < r~ 2 , which means 1/( s+1) that Na/L(NgL) < 1. The minimum energy of (18) has the scaling property ETF(N, L, g) = NL-2{NgLy/(s+^ETF(l, 1,1). • Region 4, the LL case: g/p ~ 1, with p ~ (N/L)N~2^s+2\ functional
described by an energy
oo
/
(VL(z)p(z) + p(z)3e(g/p(z)))dz. -oo
(19)
184
ROBERT SEIRINGER, ELLIOTT H. LIEB, JAKOB YNGVASON
This region corresponds to the case g/p ~ 1, so that neither the high density (6) nor the low density approximation (7) is valid and the full LL energy (5) has to be used. The extension L of the system is now determined by V L ( £ ) ~ (N/L)2 which leads to L ~ LN2^s+2\ Condition (16) means in this region that Nr/L ~ NsKs+2)r/L -> 0. Since Nr/L ~ (p/g)(a/r), this condition is automatically fulfilled if g/p is bounded away from zero and a/r -> 0. The ground state energy of (19), £ LL (iV, L,g), is equal to Nj2Ehh(l, 1,5/7), where we introduced the density parameter 7 := (7V/L)AT-2/(s+2).
(20)
• Region 5, the GT case: g/p » 1, with p ~ (N/L)N~2^s+2\ described by a functional with energy density ~ p 3 , corresponding to the Girardeau-Tonks limit of the the LL energy density. It corresponds to impenetrable particles, i.e, the limiting case g/p —> 00 and hence formula (7) for the energy density. As in Region 4, the mean density is here p ~ 7. The energy functional is oo
/ with minimum energy EGT(N,L)
(VL(z)p(z) + (ir2/3)p(z)3)dz,
(21)
•00
= A^7 2 £^ GT (1,1).
As already mentioned above, Regions 1-3 can be reached as limiting cases of a 3D GrossPitaevskii theory. In this sense, the behavior in these regions contains remnants of the 3D theory, which also shows up in the the fact that BEC prevails in Regions 1 and 2. This is what we discuss in the next section.
4. Bose-Einstein condensation In this final section we remark about BEC in the ground state. We are able to prove its occurrence in Regions 1 and 2. It probably also occurs in part of Region 3, but we cannot prove this and it remains an open problem. BEC means that the one-body density matrix 7 Q M (x, x'), which is obtained from the ground state wave function * 0 by QM 7
( x , x') = N J * 0 ( x , x 2 , . . . , XJV ) * 0 ( x ' , x 2 , . . . , x w )*
(22)
factorizes approximately, for large N, as Nip(x)ip(x') for some L2-normalized function ip. This, in fact, is 100% condensation and this is what we prove occurs in Regions 1 and 2. The condensate wave function ip is the square-root of the minimizer of the ID GP functional (17) times the transverse function 6 r (x ± ). This is stated in the following Theorem. Its proof is similar to the work in [17]. Theorem 4 . 1 . In the limit N —> 00, r/L —» 0 with NgL fixed, lim ^ V
M
( ( r x \ L*), ( r x ' \ Lz')) = 6(x x ) 6(x' X ) $GP{z) 4>GP(z>)
(23)
in trace norm. Here 4>GP is the square root of the minimizer of the GP functional (17) with N — 1, L = 1 and interaction parameter NgL.
One-dimensional behavior of dilute Bose gases in traps
185
B E C is not expected in Regions 4 and 5. In fact for a homogeneous gas of I D impenetrable bosons Lenard showed [8] t h a t t h e largest eigenvalue of 7 ^ M grows only as Nll2 and, according t o [18,19], this holds also for impenetrable bosons in a quadratic t r a p potential.
Acknowledgments E. H. L. was partially supported by the U.S. National Science Foundation grant No. PHY 01 39984. R. S. was supported by the Austrian Science Fund. References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19.
K. Bongs et al., Phys. Rev. A 6 3 , 031602 (2001). A. Gorlitz et al., Phys. Rev. Lett. 87, 130402 (2001). M. Greiner et al., Phys. Rev. Lett. 87, 160405 (2001). F. Schreck et al., Phys. Rev. Lett. 87, 080403 (2001). E. H. Lieb, W. Liniger, Phys. Rev. 130, 1605 (1963). E. H. Lieb, Phys. Rev. 130, 1616 (1963). M. D. Girardeau, J. Math. Phys. 1, 516 (1960). A. Lenard, J. Math. Phys. 5, 930 (1964). L. Pitaevskii, S. Stringari, J. Low Temp. Phys. 85, 377 (1991). M. D. Girardeau, E. M. Wright, J. M. Triscari, Phys. Rev. A 6 3 , 033601 (2001). A. D. Jackson, G. M. Kavoulakis, Phys. Rev. Lett. 89, 070403 (2002). S. Komineas, N. Papanicolaou, Phys. Rev. Lett. 89, 070402 (2002). E. H. Lieb, R. Seiringer, J. Yngvason, arXiv:cond-mat/0304071 and arXiv:math-ph/0305025 . E. H. Lieb, J. Yngvason, Phys. Rev. Lett. 80, 2504 (1998). E. H. Lieb, R. Seiringer, J. Yngvason, Phys. Rev. A 6 1 , 043602 (2000). E. H. Lieb, R. Seiringer, J. Yngvason, Comm. Math. Phys. 224, 17 (2001). E. H. Lieb, R. Seiringer, Phys. Rev. Lett. 88, 170409 (2002). T. Papenbrock, Phys. Rev. A 67, 041601(R) (2003). P. J. Forrester, N. E. Frankel, T. M. Garoni, N. S. Witte, arXiv:cond-mat/0211126.
Semiclassical dynamics of an electron moving in a slowly perturbed periodic potential STEFAN T E U F E L , GIANLUCA PANATI
(Tech. U. Miinchen)
The semiclassical dynamics of a quantum particle in a slowly perturbed periodic potential are discussed. Based on recent results [16] we explain how the well known semiclassical model of solid state physics is related to the Schrodinger equation. We also present the less known first order corrections to the semiclassical model and discuss their relation to the quantization of the Hall conductivity.
1. From q u a n t u m dynamics t o t h e refined semiclassical model A basic problem of solid state physics is to understand the motion of electrons in the periodic potential which is generated by the ionic cores. While this problem is quantum mechanical, many electronic properties of solids can be understood already in the semiclassical approximation [1,11,20]. It is common lore in solid state physics that for suitable wave packets, which are spread over many lattice spacings, the main effect of a periodic potential Vr on the electron dynamics corresponds to changing the dispersion relation from the free kinetic energy Efvee(k) — \ k2 to the modified kinetic energy En(k) given by the n t h Bloch function. Otherwise the electron responds to slowly varying external potentials A,
k = -V>(r) +rx
B(r),
(1)
where K — k — A(r) is the kinetic momentum and B = curlA is the magnetic field. (We choose units in which the Planck constant h, the speed c of light, and the mass m of the electron are equal to one, and absorb the charge e into the potentials.) The corresponding equations of motion for the canonical variables (r, k) are generated by the Hamiltonian Hsc(r,k) = En{k-A(r))+<j>(r),
(2)
where r is the position and k the quasi-momentum of the electron. Note that there is a semiclassical evolution for each Bloch band separately. In a recent work [16], we use adiabatic perturbation theory in order to understand on a mathematical level how these semiclassical equations emerge from the underlying Schrodinger equation i ds i/>(y, * ) = ( £ ( - iV y - A(sy))2 + Vr(y) +
(3)
in the limit e —> 0 at leading order. In addition, the order e corrections to (1) are established, see equation (7). In (3) the potential Vr : Kd —> R is periodic with respect to some regular lattice T generated through the basis {71,... ,7
r = lx e Rd •. x = Yfj=iaj a
186
for s o m e
« e zd|
Semiclassical dynamics of an electron moving in a slowly perturbed periodic potential
187
and Vr( • + 7 ) = Vr(-) f° r all 7 S T. The lattice spacing defines the microscopic spatial scale. The external potentials A(ey) and >(ey), with A : Rd -> R d and <> / : R d -> R, are slowly varying on the scale of the lattice, as expressed through the dimensionless scale parameter £, e
(4)
with initial conditions tpe(x) = e~d/2ip(x/e). If Vr = 0, then the limit e —> 0 in equation (4) is the usual semiclassical limit with e replacing H. The problem of deriving (1) from the Schrodinger equation (3) in the limit e —* 0 has been attacked along several routes. In the physics literature (1) is usually accounted for by constructing suitable semiclassical wave packets [11,13,20]. The few mathematical approaches to the time-dependent problem (4) extend techniques from semiclassical analysis, as the Gaussian beam construction [5,8], or Wigner measures [2,7]. Separation of time-scales plays a fundamental role in the understanding of the dynamics of many physical systems. Bloch electron are not an exception. We have shown [9,16] that (4) can be fruitfully studied using adiabatic perturbation theory [15,18]. The results obtained in this way constitute not only a derivation of the semiclassical model (1) with general electric and magnetic fields, but they allow to compute systematically higher order corrections in the small parameter e. It turns out that the electron acquires a fc-dependent electric moment An(k) and magnetic moment Mn(k). If the n t h Bloch band En(k) is nondegenerate and isolated, with Bloch eigenfunctions i/jn(k,y), the electric dipole moment is given by the Berry connection An{k) = i(i>n{k)y^n{k)),
(5)
and the magnetic moment by the Rammal-Wilkinson term Mn(k)
= i (ViMfc), x(Hper(k)
- En(k))Vi>n(k))
•
(6)
For sake of a simpler exposition, we focus on the case d = 3 and use the vector product notation (a x b)i = ei:>lajbi for vectors a, b G R 3 . The inner product in L 2 (M 3 /T) is denoted by (•, •) and Hpel(k) is H of (3) with <j> = 0 = A for fixed Bloch momentum k, see (11). Note that En, An and Mn are T*-periodic functions of k, where T* is the lattice dual to T. Hence one can as well think of them as functions on the domain M* = R 3 /T*, the first Brillouin zone. As our main result [16] we find, cf. Theorem 2.1 in this note, that the semiclassical
188
STEFAN TEUFEL, GIANLUCA PANATI
equations of motion including first order corrections read r =
VK(E„(K)
k = -VP (#r)
-eB{r)
•
A1„(K))
- e k x £!„(«) ,
- e J3(r) • M „ ( K ) ) + r x B(r),
(7)
with fin(fc) = V x An(k) the curvature of the Berry connection. In terms of the eigenprojectors Pn(k) of Hpei(k) corresponding to the eigenvalues En{k) one finds that f U * ) = i Tr(P„(fc) [VP„(/c), xVPn(k)}),
(8)
and Mn(k)
= i Tr (Pn(fc) VP„(fc) x (H0(k) - En(k)) VP„(fc)) .
(9)
This shows immediately that iln(k) and M.n{k) do not depend on the choice of the phase of the Bloch function tpn(k), i.e., are invariant with respect to the Bloch gauge, and therefore have an intrinsic physical meaning. Notice, in parentheses, that the equations (7) are in Hamiltonian form with respect to a non standard, e-dependent symplectic form[16]. The equations of motion (7) were discovered only recently by Niu et al. [4,17] using coherent state solutions of equation (4). As the most striking application note that they provide a simple semiclassical explanation of the Quantum Hall Effect. To this end one adds in (3) a strong uniform magnetic field BQ, i.e., the vector potential AQ{X) = ^BQ X X. If its magnetic flux per unit cell is rational, then the Hamiltonian in (3) is still periodic at the expense of a larger unit cell and replacing the usual translations by magnetic translations. The semiclassical equations of motion (7) remain formally unaltered, only En(k) now refers to a magnetic Bloch band. Let us specialize (7) to two dimensions and take B(r) = 0,
jn= f JM'
dkr{k)= [
dk(WkEn(k)-£±nn(k))
JM*
= -£± f
dk£ln(k).
JM-
It is well known that fM, dfc £ln(k) is the Chern number of the magnetic Bloch bundle and as such an integer [19]. A more detailed exposition of the previous argument and further applications related to the semiclassical first order corrections are presented in Chapters 12 and 13 of ref. [3]. Among the latter are the anomalous Hall effect [10] and the thermodynamics of the Hofstadter model [6].
2. Transport of observables and Wigner functions: mathematical results We now come to the precise relationship between the semiclassical equations (7) and the Schrodinger dynamics (3), and we outline how this problem fits in the general framework of space-adiabatic theory. For the unperturbed periodic Hamiltonian HpeT = - A + VT
Semiclassical dynamics of an electron moving in a slowly perturbed periodic potential
189
the separation between slow and fast degrees of freedom is realized by the Zak transform (UMk,y)
:= Y, e- i(y+7) - fc V(2/ + 7), (*,y) £ R 2d ,
(10)
which defines a unitary operator U : L2(RJ) -» L2(Mfe*) ® L2(Td) * L2{M*k,L2(Td))
=-. H.
The transformed operator is fibered, i.e., UH^U-1
+ k)2 + Vr(y) = HpeT(k).
= l(-iS7y
(11)
Hper(k) is an operator-valued multiplication operator acting on the vector valued functions in L2(M£,L2(Ty)). Its pointwise spectrum defines the Bloch band structure through Hper(k)i>n(k)
=
En(k)i)n(k).
If Pn(k) denotes the eigenprojection corresponding to aBlochband En(k), then the projector Pn = JM, dkPn(k) commutes with Hper, thus defining a subspace which is invariant but not necessarily spectral. Let us now consider the perturbed Hamiltonian #m = §( - iVx - A{ex)f
+ Vr{x) + 4>{sx)
(12)
appearing in (3), where the index m reminds us of the microscopic spatial scale. Its Zak transform is easily computed to be Hi:=UH^U-1
= ^-iWy
+ k-A(ieVl))2
+ Vr(y) + ci>(ieVl),
where iV£ is the gradient on L2(M£, L2(Ty)) with suitable boundary conditions. In contrast to the unperturbed case, this operator is not fibered. However, instead of an operator-valued multiplication operator, one can make sense of it as an pseudodifferential operator with operator-valued symbol. Indeed, Hez is the Weyl quantization H(k, ieVj[) of the unboundedoperator valued symbol H(k, r) = \ ( - i V x + k- A(r)f
+ Vr(x) + <j>(r),
which acts on the Hilbert space Hi = L2{Td). It is crucial that, even in the perturbed case, to each isolated Bloch band En there corresponds an almost invariant subspace HnH. Only for states which start in this subspaces and thus, by construction, remain there up to small errors for long times, the semiclassical equations of motion (7) can have any significance. We now introduce some further notation and assumptions to prepare the statement of the main result. The flow of the dynamical system (7) is denoted by $£ : R2d —> M.2d or in canonical coordinates (r, k) = (r,K + A(r)) by $te(r,k) =
($lr(r,k-A(r)),&eK(r,k-A(r))+A(rj).
190
STEFAN TEUFEL, GIANLUCA PANATI
The existence of the smooth family of diffeomorphisms $* is not completely obvious from (7) alone, but follows from the Hamiltonian formulation of (7). For the formulation of the semiclassical limit it is more convenient to switch to the macroscopic Schrodinger equation (4). Its Hamiltonian is denoted by Hs = I ( - ieV* - A(x))2 + Vr(x/e) +
(13)
Under the following assumption on the potentials He is self-adjoint on H2' Assumption. Let Vr be infinitesimally bounded with respect to —A and assume that <j> € C£°(R d ,R) and Aj £ C^°(Rd,R) for any j £ { 1 , . . . ,d}. Here C£°(Rrf) denotes the space of smooth functions which are bounded together with all their derivatives. <0> , equipped with the metric dc
Finally we introduce the Frechet space C = C^ induced by the standard family of semi-norms \\a\\a = \\daa\\00,
a£N2d,
and the subspace of T*-periodic observables Cper = {aeC:
a(r, k + 7*) = a(r, k) V 7 * e T*} .
We abbreviate dc{a) := dc{a,0). Our main result [16] on the semiclassical limit of (4) is the following Egorov-type theorem. Theorem 2.1. Let En be an isolated, non-degenerate Bloch band. For each finite timeinterval J c K there is a constant C < 00, such that for all a € Cper with Weyl quantization a — a(x, —ieVx) one has (e i f f £ t/e~e-iHH/e
<
a°<£>o)n* B(L 2 (R d ))
eCdc{a),
(14)
and n^(eiff£'/eae-iffEt/£-ao^)^
<e2Cdc{a).
(15)
R e m a r k 2.1. The corresponding statement in [16] does not make explicit the dependence of the error on the observable a. However, the more precise version formulated here is a standard consequence of the Calderon-Vaillancourt theorem and the fact that composition with $ £ is a continuous map from C into itself. 0 Theorem 2.1 provides a semiclassical description of the evolution of observables. The most direct way to turn it into a description for the semiclassical evolution of states is via duality, i.e., via the Wigner function. Recall that according to the Calderon-Vaillancourt theorem there is a constant C < 00 depending only on the dimension d such that for a € C one has \(ip,aiP)L2{Rd)\< Cdc(a) HVII2. (16) Hence, the map C B a *-^> (ijj, aip) £ C is continuous and thus defines an element wf of the dual space C, the Wigner function of ip. Writing (V>, aip) =: (wf, a)C',c =• /
dgdp
a{q,p)wf(q,p)
(17)
Semiclassical dynamics of an electron moving in a slowly perturbed periodic potential and inserting into (17) the definition of the Weyl quantization for o e
191
S(R2d)
one arrives at the usual formula wf{q,v)
= ~ 3 JRd dte^riq
+ e£/2) 1>{q - e£/2)
(18)
for the Wigner function. Direct computation yields \\wt\\L2(R2d)=e-d(27rrd/2\\i;\\l2m. Therefore, wf £ L2(R2d) for all e > 0, which explains the notion of Wigner function. Although wf is obviously real-valued, it attains also negative values in general. Hence, it does not define a probability distribution on phase space. However, it correctly produces quantum mechanical expectations of semiclassical observables via (17). With this preparations we obtain the following corollary of Theorem 2.1, which says that the Wigner function of the solution of the Schrodinger equation (4) is approximately transported along the classical flow of (1) resp. (7). Corollary 2.1. Let En be an isolated, non-degenerate Bloch band. Then for each finite time-interval I C K there is a constant C < oo such that for t £ I, a £ Cper and for i>o e IlnL2(Rd) one has ( ( i ^ ' - i ^ o * , , ' ) , * ) ^
<sCdc(a)
||Vo|
<e2Cdc(a)||Vo| Here ipt = e~lH
t e
^ ipo
25
the solution of the Schrodinger equation (4).
As mentioned, the strategy of the proof of Theorem 2.1 is based on adiabatic methods, i.e., on separation between slow and fast degrees of freedom. In particular one realizes that Hz = UH'U-1
= ~ ( - iV„ + k - A(ieVJE))2 + Vr(y) + »(ieVp
is given by the Weyl quantization of an unbounded-operator valued symbol H{k, r) = ~ ( - i V x + k- A(r))2 + Vr(x) + 0(r), which acts on the Hilbert space H[ = L2(Td) for the fast degrees of freedom. This representation allows one to apply a suitable generalization of space-adiabatic perturbation theory [14,15] in order to construct the following objects. — Almost-invariant subspaces. To any isolated non-degenerate Bloch band En corresponds an orthogonal projection n ^ € B{H) which almost commutes with Hz, i.e., \\[Hi,Iln}\\=0(e°°). Hence, the subspace U.nH is approximately invariant under the time-evolution generated by i ? | .
192
STEFAN T E U F E L , GIANLUCA PANATI
— I n t e r t w i n i n g u n i t a r i e s a n d effective H a m i l t o n i a n . There exist a unitary operator U% : WnU -» L2(M*), such t h a t || (e- , H S« - f / r e - 1 ^ 4 K)K
|| = 0(e°°(l
+ \t\)),
where hn is the Weyl quantization of a semiclassical symbol. Its principal symbol hnfi = En(k — A(r)) +
References 1. N. W. Ashcroft, N. D. Mermin, Solid State Physics, Saunders, New York, 1976. 2. P. Bechouche, N. J. Mauser, F. Poupaud, "Semiclassical limit for the Schrodinger-Poisson equation in a crystal", Comm. Pure Appl. Math. 54, 851-890 (2001). 3. A. Bohm, A. Mostafazadeh, H. Koizumi, Q. Niu, J. Zwanziger. The geometric phase in quantum systems, Texts and Monographs in Physics, Springer, Heidelberg, 2003. 4. M. C. Chang, Q. Niu, "Berry phase, hyperorbits and the Hofstadter spectrum: Semiclassical dynamics and magnetic Bloch bands", Phys. Rev. B 53, 7010-7023 (1996). 5. M. Dimassi, J.-C. Guillot, J. Ralston, "Semiclassical asymptotics in magnetic Bloch bands", J. Phys. A 35, 7597-7605 (2002). 6. O. Gat, J. E. Avron, "Magnetic fingerprints of fractal spectra and the duality of Hofstadter models", New J. Phys. 5, 44.1-44.8 (2003). 7. P. Gerard, P. A. Markowich, N. J. Mauser, F. Poupaud, "Homogenization limits, Wigner transforms", Commun. Pure Appl. Math. 50, 323-380 (1997). 8. J. C. Guillot, J. Ralston, E. Trubowitz, "Semi-classical asymptotics in solid state physics", Comm. Math. Phys. 116, 401-415 (1988). 9. F. Hovermann, H. Spohn, S. Teufel, "Semiclassical limit for the Schrddinger equation with a short scale periodic potential", Comm. Math. Phys. 215, 609-629 (2001). 10. T. Jungwirth, Q. Niu, A. H. MacDonald, "Anomalous Hall effect in ferromagnetic semiconductors", Phys. Rev. Lett. 88, 207208 (2002). 11. W. Kohn, "Theory of Bloch electrons in a magnetic field: the effective Hamiltonian", Phys. Rev. 115, 1460-1478 (1959). 12. P. L. Lions, T. Paul, "Sur les mesures de Wigner", Revista Mathematica R>eroamericana 9, 553-618 (1993). 13. J. M. Luttinger, "The effect of a magnetic field on electrons in a periodic potential", Phys. Rev. 84, 814-817 (1951). 14. G. Nenciu, V. Sordoni, "Semiclassical limit for multistate Klein-Gordon systems: almost invariant subspaces and scattering theory", Math. Phys. Preprint Archive mp_arc 01-36 (2001). 15. G. Panati, H. Spohn, S. Teufel, "Space-adiabatic perturbation theory", Adv. Theor. Math. Phys. 7 (2003). 16. G. Panati, H. Spohn, S. Teufel, "Effective dynamics for Bloch electrons: Peierls substitution and beyond", to appear in Comm. Math. Phys. (2003). 17. G. Sundaram, Q. Niu, "Wave-packet dynamics in slowly perturbed crystals: Gradient corrections and Berry-phase effects", Phys. Rev. B 59, 14915-14925 (1999). 18. S. Teufel. Adiabatic perturbation theory in quantum dynamics, Springer Lecture Notes in Mathematics 1821, 2003. 19. D. J. Thouless, M. Kohomoto, M. P. Nightingale, M. den Nijs, "Quantized Hall conductance in a two-dimensional periodic potential", Phys. Rev. Lett. 49, 405-408 (1982). 20. J. Zak, "Dynamics of electrons in solids in external fields", Phys. Rev. 168, 686-695 (1968).
Dynamical systems Session organized by M. VlANA (Rio de Janeiro) and H.
ELIASSON
(Paris)
This page is intentionally left blank
Linear quasi-periodic systems — reducibility and almost reducibility L. H. ELIASSON (U. Paris 7) Reducibility is a central notion for linear quasi-periodic systems. We present a perturbative result about a slightly weaker property — almost reducibility — and we discuss this notion and its relation to reducibility.
1. Introduction We consider two types of quasi-periodic linear systems: time-continuous systems x'(t) = A(d + tw)x(t),
t£R,
(1)
with A : Td —» gl(RD), and time-discrete systems x(t + l) = A(9 + tu)x(t), d
D
*GZ,
d
d
(2)
d
with A : T -> Gl(R ). T is the d-dimensional torus K /Z . The mapping A is at least continuous, but we shall frequently assume that it is smooth and even analytic. w G M.d is the frequency vector. For the system (1) we can assume without restriction that the components of u> are rationally independent, i.e., (n,w)/0,
Vn€Zd\0.
This condition defines a dense Qs of full measure. Frequently we shall assume that w satisfies a Diophantine condition, i.e., for some 0 < K, 0 < r |(n,o;)| > - ^ - V n £ Z d \ 0 . (3) \n\T A vector w with rationally independent coefficients is said to be Liouville if it is not Diophantine. The set of Liouville vectors is a dense Qs (if d > 2), but almost all vectors are Diophantine. For the system (2) we shall assume that (w, 1) is rationally independent and, frequently, that (u, 1) satisfies a Diophantine condition. The flow of (1) is the mapping on T d x GL(RD) DC(K,T):
{0,X)»(0
+ tu,xe{t)X),
(4)
where x$(t) is the matrix solution of (1) defined by xg(0) — Id — the fundamental solution or monodromy matrix. x$(t) verifies the co-cycle property: xe{t + s) = xe+su,(t) xe{s),
\/t, s ^6 £ Td.
For the system (2) the flow has the same form (4) but in this case the fundamental solution xe(t) is defined by xg(l) = A{9) and the cocycle property.
195
196 Time-1-map.
L. H. ELIASSON
If xe{t), t € R, is the fundamental solution of (1) then xe(t),
t G Z,
is the fundamental solution of (2) with the new A(9)=xe(l). (This follows from the co-cycle property.) Clearly such an A(ff) is necessarily homotopic to Id, so we cannot obtain all time-discrete systems in this way. But by suspension we can obtain all time-discrete systems, even those that are not homotopic to Id. Indeed for a given A(9) of class Cr we can construct a matrix A(6,8d+i), also of class C r , such that the fundamental solution x$(t) of x'(t) = A((9,9d+1)
+ t(u,l))
x(t),
verifies x$(l) = A{9). (It is not known if this result holds also in the analytical category.)
2. Reductibility in t h e sense of Lyapunov Lyapunov introduced the notion of reducihility. He said that the system (1) is reducible if its fundamental solution can be represented as xe(t) = z(t)etB, for some B in gl(RD) and for some z(t) £ GL(ED) \z(t)\ and
(5) with
\z{t)~l\
bounded (uniformly in t). It is easy to see that (5) is equivalent to z'(t) = A(6 + tw) z(t) - z(t)B.
(6)
Proposition 2.1. If (1) is reducible for some 6, then it is reducible for all 9 and B{6) = B. Proof. If (1) is reducible for some 6 then clearly it is reducible for all 6+tu) with B(6+tu>) = B. Let first tn be a sequence such that 9 + tnu -> 9 in Td. By taking a subsequence we can assume that z(tn,9)
-» C.
Let now w'(t)=A(0
+ tLj)w(t)-w(t)B,
w(0) = c.
Since the solution depend continuously on the equation it follows that Z(t + tn)->w(t),
Z(t + tn)'1
-»
Hence, both \w(t)\ and |u;(t) _ 1 | are bounded uniformly in t.
Wit)-1. •
Linear quasi-periodic systems — reducibility and almost reducibility
197
The discrete system (2) is reducible if its fundamental solution can be represented as xe(t) = z(t)B\
(7)
for some B in Gl(RD) and for some z(t) as before. The equation (7) is equivalent to z(t + l) = A(6 + tLj)z{t)B-\
(8)
Proposition 2.2. (1) is reducible if, and only if, its time-1-map is reducible. Proof. We only need to show the implication to the left, so let's assume that the time-1map. Define now simply ze(t) = xe(t) e~tB. By assumption zg(t) is bounded on t G Z, hence on t € M.
•
Reducible systems which are not constant exist of course. For example, if A(8) € so(R n ) then all solutions of (1) are bounded and (1) is therefore reducible. 3. Obstructions t o reducibility Let v be a non-zero vector in Rd and consider X(9,v) = limsup - log\xe(t)v\.
(9)
t—»oo t
This number, which measures the exponential growth of the solution xg(t)v, can take only one out of at most D values
Xi(e) < • • • < xd(e).
(io)
These are the characteristic numbers of Lyapunov. By the theorem of Oseledec, the lim sups are limits for a.e. 8 £ Td and all v £ M D , and these limits are independent of 6 (for a.e. 6). These numbers are known as the Lyapunov exponents. Irregular systems.
The characteristic numbers of (1) always verify 1 /"*
V A J - ( 0 ) > lim - /
Tr(A(6 + SUJ) ds
(11)
— see [19]. The system is said to be regular if we have equality in (11), and it follows from the theorem of Oseledec that (1) is regular for almost every 6. It is clear that if (1) is reducible (for some, and hence for all, 6) then the system is regular for all 6. But there are almost-periodic linear systems that are non-regular [11,17]. Non-uniformly hyperbolic systems. (w+(8)
(1) is hyperbolic if the spaces
= {v£RD
: \(6,v) D
< 0}
1 W_(0) = { » £ l : X(9,v) > 0}
198
L. H. ELIASSON
span RD for a.e. 9, i.e., W+(6) + W-(e)=RD,
a.e. 0 <E T d .
We say that (1) is uniformly hyperbolic if there exists A+ < 0 < A_ such that \x8(t)v\ < cste.e a +M, tA
\x6{t)v\ < cste.e -|w|,
Vt > 0,Vu G W+{6), Vt < 0,Vw e W_(6>).
If it is not uniformly hyperbolic, we say that it is non-uniformly hyperbolic. Clearly a reducible system cannot be non-uniformly hyperbolic. There are many examples of non-uniformly hyperbolic systems. The best studied are the time-continuous Schrodinger equation Heu(t) = -u"{t) + V{9 + tu)u{t) = Eu(t),
t € R,
and the time-discrete Schrodinger equation Hgu(t) = -(u{t + 1) + u(t - 1)) + V{6 + tuj)u{t) = Eu(t),
t e Z.
It is known that these equations are uniformly hyperbolic if and only if E does not belong to the spectrum of Hg a [12], so if the Lyapunov exponent is positive for an E in the spectrum then the corresponding system is non-uniformly hyperbolic. For the discrete Schrodinger equation there are many analytic [3,10,21] as well as nonanalytic [4,9,20] examples of non-uniformly hyperbolic systems. Time-continuous systems are less well studied but also in this case there are analytic [9, 21] and non-analytic [1] examples. Non-uniformly hyperbolic systems which are not Schrodinger are constructed in [22]. Non-polynomial growth. A reducible system with zero Lyapunov exponents has solutions which are either bounded or have polynomial growth. Examples of systems with solutions with non-polynomial growth are given in [5].
4. Reductibility For quasi-periodic systems it is natural to consider a more special notion which goes back to Floquet. We say that (1) is reducible in the sense of Floquet, from now on simply reducible, if it is reducible in the the sense of Lyapunov with z(t) = Z(tu),h Z : (2T) d -> GL{RD) continuous. In this case (6) takes the formc d„Z(6) = A{6)Z{6) - Z(6)B. a
T h e spectrum of Hg is independent of 8. B y (2T) d we mean that Z is 2-periodic in each variable. c By du we means the directional derivative on T d in the direction u>.
b
(12)
Linear quasi-periodic systems — reducibility and almost reducibility
199
Another way to state this is to say that the change of variables = Z(9r1x)
(6,x)~(6,y
(13)
transforms the system (1) to the constant coefficient system y'(t) = By(t). In the time-discrete case equation (8) takes the form Z{e + u) =
A(6)Z(6)B-1
and the transformation (13) transforms (2) to the constant coefficient system y(t+l)
= By(t).
Reducibility is a stronger notion than reducibility in the sense of Lyapunov since it takes into account the "rotational" behaviour of the solutions. Periodic systems are always reducible, but quasi-periodic are not. Consider for example
and assume that d
I
t
a(0 + su) — (a) ds
is unbounded. Then (1) is not reducible. However, if the integral is bounded, which is always the case if w is Diophantine and a is sufficiently smooth, then the system is indeed reducible. Another example is (2) with A(0\ = ( y>
C0S
^
sin e
( )\
sin
V- (0) cos(0y"
This is an analytic system but not reducible unless the system is periodic. Ergodicity.
On a compact group like Td x SO(R3)
the Haar measure is invariant under the flow of (1). If the system is reducible then it has an invariant foliation and uncountably many ergodic measures. Examples of analytic systems with Diophantine frequency vector that are uniquely ergodic, and hence not reducible, are given in [6] — see also [18] for similar results for smooth systems with w Liouville. The examples [5,6] show that reducibility is not an open property not even in an analytic topology. In general the structure of the set of reducible system is badly understood. There are some relevant results in C°-topology — for SL (K2) [2] and for the Schrodinger equation [8] — and in C°°-topology — for T x SU{C2) [13] and for T x SL(R2) [14]. (a) is the mean value of the function a.
200
L. H. ELIASSON
5. A l m o s t r e d u c t i b i l i t y We say that (1) is almost reducible if, for all e > 0 there exist Z : (2T) d -> GL(RD)
continuous
and B G gl{RD) such that duZ(0) = A(0)Z(6) - Z(0)(B + A(0)),
B G gl(RD),
(14)
and such that | i | and I Z P H Z - 1 ! < £ .
(15)
This property implies that the system, for arbitrarily long time, behaves as a reducible system. For example, (14) and (15) imply \xe(t) - Z{6 + tu)etBZ{6)-l\
< e\t\e^tB\
\t\ < - . e
A consequence is that an almost reducible system, like a reducible one, never can be irregular nor non-uniformly hyperbolic. However, it may have asymptotic behaviour that is different from that of a reducible system. For example, an almost reducible systems may have nonpolynomial growth [5] or may be uniquely ergodic [6]. The reason is that such behaviour can occur close to constant coefficients and any system close to constant coefficients is almost reducible, as follows from the theorem described below [7]. Assumptions. Let A : Td -> gl(RD),
\A\r = sup \A(x)\ < oo, \Sfx\
and A0£gl(RD),
U£DC{K,T).
Theorem 5.1. There exists a constant £o = £o(r, K, r, \Ao\,D,d) \A-A0\r
< £0,
then for all e > 0 there exist JZ :(2T)d ^GL{RD),
\Z\p < oo
such that duZ(6) = A(6)Z(0) - Z(6)(B + A(6)), with
{
|Z|o, | Z - 1 | o bounded, \A\P,
\ZA\p<e.
such that, if
Linear quasi-periodic systems — reducibility and almost reducibility
201
Notice that the transformed system B -f- A(6) is defined only on (2T) d , so one cannot iterate this statement without loosing "periodicity". This is an essential complication in the proof. Notice also that the domain of analyticity of Z, measured by p, depends on e and may go to 0 as e —> 0. A result of similar flavour was proven in [15] for certain compact groups. It follows from this theorem that the set of almost reducible systems contains an open neighbourhood (in | | r -topology) of the constant systems. Question 5.1. Is the set of almost reducible systems an open set? contained in the closure of its interior?
Or perhaps, is it
Question 5.2. Is the set of almost reducible systems dense among all systems? Among all systems without exponential growth (all Lyapunov exponents being zero)? Among all systems with bounded solutions?
6. A r e m a r k a b o u t t h e proof and t h e second Melnikov condition 6.1. Floquet exponents Suppose that (1) is reducible with Z{6) and B verifying (12). The eigenvalues of B are the Floquet exponents. Proposition 6.1. The real parts of a(B) are unique — these are the Lyapunov exponents of (1). The imaginary parts of a(B) are determined modulo
{i(n,W):nGZd}. Proof. Let xe(t) = Z{9 + tLo)eBtZ(9)-1.
Then
lim -J- /
xe(t)e-^dt
has some finite and ^ 0 component only if Re(7) £ Re(a(B))
and
S(7)eS((7(S)) + | i ( n , W ) : n G Z d
• The following example shows that the imaginary parts are not unique in general. Example 6.1. If B = l
0 -(n,u)
for some n £ Z d , then 0
Zfc(*)=expi
(k,6) }
Q
verifies xe(t) = Zk(e + tuj)exp(t( for any k £
Q
_k
{n A
o)
0
*'u))\Zk{0)
-l
202
L. H. ELIASSON
6.2. Reduction of Floquet exponents. Let a = ( a i , . . . , ar>) be the eigenvalues of B and let a =
(ai,...,aD)
be D arbitrary complex numbers submitted to the conditions6
{
ak = &j
if ak = atj,
&k = &j
if ak = a j .
We define B(a) be "the same" matrix as B but with the eigenvalues a replaced by the eigenvalues a. It is well-defined by the first condition and real by the second condition. Take now for any eigenvalue ak a vector mk &\lid such that (mk=mi
if ak=
\mk
if
= -mi
an,
ak=a~i,
and define Y{6) = exp B i m (0) = exp[S - B(a - i(m, 0))]. Then Y : (2T) d -> Gl(RD),
Y(0) = 1,
and verifies duY{6) = BY{0) - Y{6)B(a -
i(m,u)).
Hence, (0,x)~(6,y
=
Z(er1x)
transforms the constant coefficient system x' = Bx with Floquet exponents a to a new constant coefficient system with Floquet exponents a = a — i(m,Lo). 6.3. The second Melnikov condition The eigenvalues of the operator ad B : X -> [B, X] are precisely all the differences of the eigenvalues of B. B is said to satisfy a "second Melnikov condition" if there exist K' > 0, T > 0, such that for all ak, a.\ we have \ak-ai-i{n,u)\>—-, e
z denotes the complex conjugate of z.
K'
VneZd\0.
(16)
Linear quasi-periodic systems — reducibility and almost reducibility
203
This condition is central in perturbation theory for reducibility and for lower-dimensional tori. Usually it is sufficient that, for each "scale" of the perturbation theory, this condition is fulfilled for n in some finite, but sufficiently large, domain in Zd. Another important point is that condition (16) is not required to hold for n — 0. If w satisfies the Diophantine condition (3), for each atk,ai there can be at most one vector n = rikj € Z d in the domain
I-I*"'-(£)* violating (16). Of course, there may be no such n at all in which case condition (16) is fulfilled in \n\ < N'. Let U = {(fc, I) : n,k,i exist}. Then one verifies that there exist vectors nik G | Z d such that nk,i = mlc-mi,
V(k,l)eU,
and ma = m/3
if a = /?,
ma = —771/3 if a = /9. Using these vectors we can construct Y(0) = exp B , m (0) which transforms the system x' — Bx to a new system y' — By with Floquet exponents a = a — i(m,Lj). By construction, these exponents will satisfy the Melnikov condition in some finite but large domain \n\ < N" PS N'. This operation will permit us to carry out the perturbation theory to any order of accuracy and, hence, obtain almost reducibility. The price to pay is that one has to use transformations which are close to an "exponential" of the type exp B , m (0).
(17)
The transformation Z in the Theorem is therefore not close to the identity but to a finite product of such "exponentials".
7. Reducibility and almost reducibility An infinite product of exponentials (17) will typically not converge. Therefore, if the (appropriate) Melnikov condition is violated at infinitely many "scales" of the perturbation theory we cannot expect reducibility. On the other hand, if it is violated only at finitely many "scales" we will get reducibility. This analysis lies behind the following conjecture.
204
L. H. ELIASSON
Conjecture 7.1. For u> £ DC(K,T) Ax{&) in \AX-A0\r
and for any generic one-parameter family of analytic <e0(r,K,T,\A0\,D,d),
the system x'{t) = Ax{9 + tw) x{t) is reducible for a.e. A. For reasonable one-parameter families of such systems, reducibility holds for a very large set of parameter values. For almost all parameter values reducibility has been proven for the Schrodinger equation [5] and for certain one-parameter families on certain compact groups [16]. On the other hand, not all systems close to constant coefficients are reducible. Conjecture 7.2. For w £ DC(K,T)
and for any generic analytic A{6) in
\A-Ao\r
<s0(r,K,T,\A0\,D,d)
the system x'(t) = A(9 + tuj)x(t) is non-reducible and, in case of a compact group, uniquely ergodic. This has been proven in the case Td x SO (R 3 ) [6]. Question 7.1. Are "almost all" almost reducible systems reducible? Question 7.2. Is the generic almost reducible system, non-reducible?
References 1. K. Bjerklov, "Positive Lyapunov exponents for a class of 1-D quasi-periodic Schrodinger equations — The continuum case", manuscript (2003). 2. J. Bochi, "Genericity of zero Lyapunov exponents", Ergod. Th. & Dynam. Sys. 22, 1667-1696 (2002). 3. J. Bourgain, M. Goldstein, "On non-perturbative localization with quasi-periodic potential", Ann. of Math. 154, 835-879 (2000). 4. L. H. Eliasson, "Discrete one-dimensional quasi-periodic Schrodinger operators with pure point spectrum", Acta Math. 179, 153-196 (1997). 5. L. H. Eliasson, "Floquet solutions for the one-dimensional quasi-periodic Schrodinger equation", Comm. Math. Phys. 146, 447-482 (1992) 6. L. H. Eliasson, "Ergodic skew systems on Td x SO(3,R)", Ergod. Th. & Dynam. Sys. 22, 1429-1449 (2002). 7. L. H. Eliasson, "Almost reducibility of linear quasi-periodic systems", Proceedings in Symposia in Pure Mathematics 69, 679-705 (2001). 8. R. Fabbri, R. Johnson, R. Pavani, "On the nature of the spectrum of the quasi-periodic Schrodinger operator", Nonlinear Analysis 3, 37-59 (2002). 9. J. Frohlich, T. Spencer, P. Wittver, "Localization for a class of one-dimensional quasi-periodic Schrodinger operators", Comm. Math. Phys. 132, 5-25 (1990). 10. M. Herman, "Une methode pour minorer les exposants de Lyapunov", Comment. Math. Helvetici 58, 453-502 (1983).
Linear quasi-periodic systems — reducibility and almost reducibility
205
11. R. A. Johnson, "The recurrent Hill's equation", J. Diff. Equations 46, 165-193 (1982). 12. R. A. Johnson, "Exponential dichotomy, rotation number, and linear differential equations with bounded coefficients", J. Diff. Equations 6 1 , 54-78 (1986). 13. R. Krikorian, "Global density of reducible quasi-periodic cocycles on T 1 x SU(C2)", Ann. of Math. 154, 269-326 (2001). 14. R. Krikorian, "Reducibility, differentiable rigidity and Lyapunov exponents for quasi-periodic co-cycles on T x SL(R2)", manuscript (2003). 15. R. Krikorian, "Reductibilite des systemes produits-croises a valeurs dans des groupes compacts", Asterisque 259 (1999). 16. R. Krikorian, "Reductibilite presque partout des flots fibres quasi-periodiques a valeurs dans des groupes compacts", Ann. Sci. Ecole Norm. Sup. (4) 32, 187-240 (1995). 17. V. M. Millionscikov, "Proof of the existence of irregular systems of linear differential with almost periodic coefficients", Diff. Equations 4, 203-205 (1968). 18. M. Nerurkar, "On the construction of smooth ergodic skew-products", Ergod. Th. & Dynarn. Sys. 17, 311-326 (1988). 19. V. V. Nemytskii, V. V. Stepanov, Qualitative theory of differential equations, Princeton University Press, Princeton, New Jersey, 1960. 20. Ya. G. Sinai, "Anderson localization for the one-dimensional difference Schrodinger operator with a quasi-periodic potential", J. Stat. Phys. 46, 861-909 (1987). 21. E. Sorets, T. Spencer, "Positive Lyapunov exponents for Schrodinger operators with quasiperiodic potential", Comm. Math. Phys. 142, 543-566 (1991). 22. L. S. Young, "Lyapunov exponents for some quasi-periodic cocycles", Ergod. Th. & Dynam. Sys. 17, 483-504 (1997).
On discrete Schrodinger operators with stochastic potentials W. SCHLAG (Caltech)
1. Introduction Given an ergodic map T : Td -> Td, a potential Vx(n) = v(Tnx), analytic and nonconstant, define the Hamiltonian Hx:=-Az
where v : Td -> R is
+ XVx.
(1)
This is a version of the well-known Anderson's model. In the late 1950s, Phil Anderson predicted that random impurities could turn conductors into insulators. In mathematical terms, he predicted that random potentials should lead to pure point spectrum with rapidly decreasing eigenfunctions — at least for large disorders. Instead of random potentials, one considers here potentials given in terms of deterministic dynamics. The "randomness" is given by the parameter x £Td. Basic questions: Does (1) display pure point spectrum for large A, and does Anderson localization take place (exponentially decaying eigenfunctions)? One can also ask about dynamical localization. This refers to the property sup t ||(l + |ra|) A e tti? ^o||2 < °o for all A > 0, provided tpo decays very rapidly. Other topic of interest is the limiting distribution of the eigenvalues of Hx restricted to \—N, N] as N —> oo (this is known as the integrated density of states or IDS). Other very interesting questions concern the presence of a.c. spectrum for small A, as well as the structure of the spectrum (Cantor set). We cannot review the long and rich history of this subject here, but rather refer the reader to the monographs by Carmona and Lacroix [9] as well as Figotin and Pastur [11] for this purpose. Another resource which is closely related to this note is the forthcoming book by J. Bourgain [3]. Let us merely mention the fundamental results by Dinaburg, Sinai [10] (a.c. spectrum for small A), Fiirstenberg [14] (positivity of the Lyapunov exponent), Goldsheid, Molchanov, Pastur [15] (A.L. for the one-dimensional model), Frohlich, Spencer [12] ("multiscale analysis" in the random case), Aizenman, Molchanov [1] ("fractional moment method"), Frohlich, Spencer, Wittwer [13] and Sinai [18] (A.L. for the quasiperiodic model), as well as the many deep and important contributions by Avron, Bellissard, Campanino, Carmona, Delyon, Eliasson, Figotin, Gordon, Jitomirskaya, Kirsch, Klein, Kotani, Last, Martinelli, Oseledec, Pastur, Ruelle, Simon, Simon-Wolff, Souillard, Stolz, Thouless, Wegner. We now consider the eigenvalue equation (HxiP)n = -4>n+1 - V„_i + \v(Tnx)ipn Examples of maps T are: the shift Tx = x+u)(modZd),
206
= E1>n.
(2)
the doubling map Tx = 2a: (mod 1),
On discrete Schrodinger operators with stochastic potentials
207
and the skew-shift T(x, y) = (x + y,y + ui) (mod I?). It follows from the covariance relation HTx = UHXU* with U = left shift that Y,x := spec(H x ) is constant for a.e. x. The same also holds for the spectral parts E'ac x '
25". ££ c .
Recall the following basic objects in the study of (2). The monodromy matrices and Lyapunov exponent are defined as 1
Mn(x,\,E):=Y[ Ln(X,E):=L(\,E):=
\\v(T*x)-E
f n JTd
-1 0
log\\Mn(x,\,E)\\dx
lim Ln(\, E) = Urn - log ||Mn(a;, A, E)\\,
where the last relation holds for a.e. x by Kingman's subadditive ergodic theorem. Furthermore, by the Ishi-Pastur theorem E£ c C {E : L(\,E) = 0}, whereas Kotani's converse statement is meas [{E : L(X, E) = 0} \ S"c] = 0, a.e. x. Let Tx = T^x := x + w (mod 1) be the one-dimensional shift. We denote the Lyapunov exponent by L(u, A, E) to emphasize the dependence on w. The following theorem is proved in [4]. Theorem 1.1 (Bourgain-Goldstein). Suppose i n f ^ s L(w, A, E) > 0. Then for a.e. w and a.e. x the Hamiltonian Hx displays Anderson localization, as well as dynamical localization. N o t e : This does not explicitly require large A. If v{x) = cos(2nx) (the almost Mathieu operator), then Theorem 1.1 applies for A > 2 (by Herman's result L(o;, A, E) > log(A/2) for A > 2). Jitomirskaya [16] proved this result for the almost Mathieu operator and all Diophantine to and a.e. x. The proofs of both Theorem 1.1 as well as [16] are nonperturbative, and are based on the transfer matrix formalism. The following theorem is from [17]. Theorem 1.2 (Goldstein-S.). Suppose to is Diophantine (which means here that \\nuj\\ > nQogn)" forn> 1, a> I). If L(w, E) > 0 for all Ei < E < E2, then L(LJ, •) as well as the integrated density of states N(u>,-) are Holder continuous in E € [.Ei,^]N o t e : The integrated density of states (IDS) is the deterministic limit N{w, E) = lim — ^ — - # { 1 < j < 2N + 1 : EJN) < E} where E= Ej (x,w) are the eigenvalues of Hx restricted to [-N, N]. Thouless's formula relates the Lyapunov exponent L and the IDS N: L(u,E)=
f\og\E-E'\N(w,dE').
Sinai's work on large disorder [18] shows that the IDS can be no better than Holder-^ continuous, and Bourgain [3] obtained the exponent ^ — for the almost Mathieu operator and large
208
W . SCHLAG
A. In [17], the Holder exponent depended on the Lyapunov exponent. Bourgain refined [17] to obtain a Holder exponent that is uniform in L{E). Bourgain and Jitomirskaya [7] modified the method from [17] to show that L{u, E) is jointly continuous in w, E at every point (UQ,E), U>Q G K \ Q as well as continuous in E for every w (without assuming that L is positive). The skew-shift on T 2 is defined to be Tu(x,y) = (x + y,y + u>) (modZ 2 ). Note the quadratic dependence on n in the nth iterate T£(x, y) = (x + ny + n{n - l)w/2, y + nu>) (mod Z 2 ). It is conjectured to lead to L(w, X,E) > 0 for all A > 0. The following theorem is proved in [5]. Theorem 1.3 (BGS). Fix e > 0. Then there exist Ao(e) and Q(A,e) C T 3 such that meas[T 3 \ fi(A, e)] < e and for all X > Ao(e) and (w,x,y) € Q(\,e), {HUtXty%j))n := —tf)n+i ~ ipn-i + Xv(T™(x,y))ipn displays Anderson localization. This theorem is related to the following "quantum kicked rotor" model:
idtV(t, x) = ad^(t, x) + ibdx^{t, x) + V(t, x)V(t, x) where V(t,x) = KCOS(27TX) ^ n e Z <5(£ — n). Let its unitary evolution operator be denoted by S(t). Then the monodromy operator is defined as W — 5(1) = Uam-n{TTnx) + 4>n-m(Tnx) for m 7^ n, with c/> exponentially decaying and small (for K small). Therefore, H is a long-range version of the skew-shift Hamiltonian from Theorem 1.3. In analogy with Theorem 1.3, one can show that Anderson localization for H holds for small K and most a, b. Hence, W also possesses an orthonormal basis of exponentially decaying eigenfunctions (where the exponential decay refers to the Fourier coefficients). More precisely, Bourgain proved the following result in [2]. Theorem 1.4 (B). For small K and (a,b) outside a set of small measure, one has: Let W(t,x) = (S(t)ty(Q,-))(x) be the solution of the kicked rotor equation. 7/*(0, •) is a smooth function on T, then \&(i) is an almost-periodic H 1 (T)-valued function and sup t ||^(t)||jyi(T) < oo. In recent work, Bourgain and Jitomirskaya have obtained a version of Theorem 1.4 that applies to a.e. choice of the parameters (a, b). Theorem 1.3 is proved by means of the transfer matrix formalism, whereas Theorem 1.4 cannot be obtained this way because of long-range interactions. One uses instead an approach that originated in the proof of the following theorem from [6]. Theorem 1.5 (BGS). Let v : T 2 —> K be analytic, nonconstant on any vertical and horizontal line segment. Let (Hxyip)nm — —AZ2tpnm+Xv(x+nu)i,y+mui2)ipnm for(n,m) £ Z 2 . Then for all e > 0, A > Ao(e) the operator Hxy displays Anderson localization for all (x,y,u>i,uj2) 6 T 4 up to a set of measure at most e.
On discrete Schrodinger operators with stochastic potentials
209
2. Transfer matrix formalism As in the case of truly random potentials, it is of basic importance to control the probability that a given energy E is close to an eigenvalue of the Hamiltonian H restricted to a box. In other words, one needs to estimate meas [x € Td : dist(H^,E)
< e~p]
where A = [-N, N] and p is relatively large (say, » (\ogN)A). By self-adjointness, this is the same as bounding the norm of the Green function \\(H£ — (i? + i 0 ) ) - 1 | | = | | G A ( I , i£)||. By Cramer's rule, one has f[-N,n-l] (Z) E) f[m+l,N] (x, E) f[-NtN](x,E)
GA(x,E)(n,m) where /[p>Q](a;, E) = det{Hf
— E). Moreover, there is the well-known relation
Mn(x,E)
=
f[i,n](x,E) .f[i,n-i](x,E)
-f[2,n]{x,E) -/[3in](x,£)_ '
Recall that Kingman's theorem implies that jf log ||Mjv(a;, E)\\ —> L^(E) for a.e. x e T. A quantitative version of this statement are the following estimates on the deviations: For large N meas[x : | log \\MN(x,E)\\ - N LN{E)\ > Na] < e~NT (3) for some fixed 0
u{x) = 2_J log \e(x + nu) — 1| = / log \z — C,\ ^(d() n=\
^N
2 r
where z = e(x) = e ' « ) ^ = 5Z n=1 <$ e (_ nw ). The left-hand side can be thought of as a Riemann sum. It is standard to estimate the error between such sums and their mean via Koksma's inequality: Let S := {xn}„=1 C T; N
y}/(*n)- / f{x)dx
J
n=l
°
where DN(S) = s u p J C T | # { n : xn e J } — N\ J||. The problem here is, of course, that log \e(x) — 1| £ BV. A more direct approach is to observe that u(x) is the Hilbert transform of a sum of saw-tooth functions. More precisely, let fo(x) = — \ — x for — ^ < x < 0 and fo(x) = \ — x for 0 < x < \. Then, with Ji being the Hilbert transform, (x)=n(J2fo(-
+ nu>))(x)
N
\HBMO
DN({ncj}%=1).
< C n=l
210
W . SCHLAG
Since for a.e. u> and N large Djv({nw}^ =1 ) < N£, the John-Nirenberg inequality implies meas [x : \u(x) - (u)\ > Na] < exp [ -
Na~e].
Observe the analogy of this estimate with the large deviation theorem (3). The transition to the noncommutative case is typically accomplished by means of two devices: — subharmonic extensions of log ||Mjv(£, E)\; — the so-called "avalanche principle" (this requires positive exponents L(E) > 0). The former is well-known and straightforward: Since v is analytic, u(x) = log ||MJV(x, E)\\ extends to a neighborhood of the unit circle as a subharmonic function. Such functions have a Riesz representation: u{z) = J\og\z-C\[x{dQ
+ h{z)
with some measure /x > 0, and a harmonic function h. Here \\fi\\ ~ TV, but there is a lot of cancellation in the integral to ensure, for example, that ||u||BMO < (logN)A. This latter property can only be captured by means of invoking the underlying dynamics. More precisely, one obtains the structure of u as a sum of shifts of another function by means of the following Avalanche Principle (Goldstein-S.): n.
Let {An}»=1 Then
e S*(2,R). Suppose \\An\\ >fi>N, N-l log ||Mjv|| + J2 n=2
\\An+1An\\
N-l l0
S WAn\\ ~ E l 0 § l l ^ n + 1 ^ n=l
< y/JI\\An+1\\
||A„|| for all
where MN = Ajq • • • A\. In the work by Bourgain, Goldstein, and the author, this device has been used to establish the following properties: — positivity of the Lyapunov exponent for large disorder; — inductive proof of large deviation theorems; — Holder continuity of the integrated density of states. We now give a typical application of the avalanche principle. Write o MnN(x,E)= Y[ Mn(Tjnx,E), where n ~ (logN) A . j=N-l
Suppose, for 0 < a, r < 1 fixed, > na] < e""'
meas [x : \\og\\Mn(x,E)\\-nLn(E)\
(4)
and the same for 2n, where Aa > 100. Furthermore, suppose Ln(E) > L,2n{E) > 1 for all E, and Ln(E) - L2n{E) < 1/100. Then up to a set of measure < CNe~n" ~ N~", and for n large, one has for all j \\Mn(Tjnx)\\
> en-n°
> e n / 2 =: /x,
||M n (r°' + 1)n a;)M n (r^x)|| > e^L^{E)-n"
>
> ^\\Mn{T^+^nx)\\
e2nLn(E)-2n°
\\Mn{T'nx)\\.
On discrete Schrodinger operators with stochastic potentials
211
Prom the avalanche principle, one concludes that up to a set of measure < N " , N-2
\og\\MnN(x,E)\\
+ £
N-2
- Y, logHManCr^z.JSOII
\og\\Mn(T^x,E)\\
(5)
Typically, the sums are uniformly in x close to their means (since they represent averages over very long orbits). The conclusion then is that u(x) •= — log||MTlW(a;,JE;)|| =u0(x)+
ui(x),
where \\u0 - (u0)\\oo < CN~1+e =: e0, and ||ui||i < CN~90 =: ei. Provided the Riesz mass of the subharmonic extension of u(x) is < N20, the splitting lemma (see below) yields 20 \\U\\BMO < C(e0 + \/N Ei) < CN~1+e. By means of the John-Nirenberg inequality, this implies that meas[x
: | log ||M n i V (a;,£)|| -nNLnN{E)\
> (nN)a] < Ce" •cN"
which is (4) for N instead of n (provided T < a — s). The same argument applies to 2nN. Moreover, averaging (5) over x yields that \LnN(E)
+ Ln(E) - 2L2n(E)\
<
CN~\
which implies LnN(E) > 1 — 2{Ln(E) — L2n(E)) — CN~l. Continuing inductively leads to positivity of L(E) as well (4) for all n. The following result is the aforementioned splitting lemma from [5]: Lemma 2.1. Let u be subharmonic on a neighborhood of T with Riesz mass N. Suppose u = u0+ui onT with ||UO||L°°(T) = £o and ||MI||LI(T) = £i- Then ||U||BMO < C(e0 + -JWe[). To motivate this statement, consider N points Zj — e(xj) in T. Suppose that P(z) = Ylj-iiz — Zj) satisfies sup| z | = 1 \P(z)\ < eT. We claim that DN{{XJ}^=1) < C^/NT where -DJV is the usual discrepancy (see above). Proof. u(x) = log |P(e(x))| = W ( £ ? = i /o(- - a;,-))- Set F := £ f = 1 /o(- - Xj). Let KN be a smooth bump function KN{8) = K(N6), K>0,fK = l,supp/f c [-.01, .01]. Then
(KN*fo)(--jj)--<M-)<(KN*fo)(;
+ 1j) +
Using JTu = 0, ||u||i < CT, F = Ti~1u = —Hu, one now has \\F\\00<2\\H-1(u*KM)\\00
+
CN —
CN + ^
Setting M = \/N/T gives H-FHoo < CVNT easy to check that H-^Hoo ~ DN({XJ}JLI)>
(and thus also desired.
as
||U||SMO
+
<
CVNT).
CN —
Finally, it is •
212
W . SCHLAG
Note that this argument shows that if u(z) = flog\z - Qdfj,(C) with supp(/i) C T, sup T w < (u) + T (here (u) = 0 and thus ||u||i < r ) , then ||U||BMO < C\/IHI r - A small variation of this argument proves Lemma 2.1 above. Final remarks on the transfer matrix formalism: — For the shift and skew-shift, large deviation theorems for u(x) = log ||Mjv(a;,£)|| are obtained via (a) subharmonicity (b) almost invariance u(x) ~ u(Tx). The latter can be either be the simple invariance property sup x |u(a;) — u{Tx)\ < C (suffices for the shift) or the avalanche principle (needed for the skew-shift). The main difference between these cases is that the Riesz mass of u in the former is TV, in the latter iV2 (or Nc for higher-dimensional versions of the skew-shift). The avalanche principle requires positive exponents, and the first step (but only that one) is perturbative. This means that both the initial positivity of the Lyapunov exponent, as well as the large deviation theorem at the initial scale are obtained by making the disorder very large. It is conceivable that the required information at the first step could also be provided by a numerical calculation. We would like to emphasize that for the plain shift, large deviation theorems do not require positive exponents and are non-perturbative. This allows Bourgain and Goldstein [4] to prove localization on the basis of L(E) > 0 alone. — The avalanche principle gives positive Lyapunov exponents for large disorders under very general circumstances. But this either requires the Riesz mass of log ||Mjv(a:, E)\\ to grow at most like Nc (so that the necessary large deviation theorem can be obtained simultaneously), or one needs to prove the LDT by other means at all scales. One case where the Riesz mass grows exponentially rather than polynomially is the doubling map Tx = 2x (mod 1). For this model, one proves the LDT by means of well-known subgaussian bounds for sums of martingale differences, see [8]. — In recent work, Goldstein and the author have obtained large deviation theorems for the entries of the monodromy matrix rather than the entire norm. Recall that the entries are the determinants f^(x,E).
3. Localization The large deviation theorems from the previous section only control the probability of a single resonance at a fixed energy E, i.e., meas [xGT
: dist{E,H^'N])
< e~N"].
It is well-known that this does not suffice in order to prove Anderson localization. Rather, one needs to control the probability of double resonances. More precisely, let DCA,C C T denote the set of u> with \\ntj\\ > r4^ for all n ^ 0 and define 5N := meas[w € DCA,C : d i s t ^ s p e c ^ j 1 ' " 1 ) ) <
e~n\
G[k,k+N}{®,w,E) is bad for some E and \k\ ~
Nc].
Here n ~ (logiV)' 4 , and G[kik+Nj(x,oj,E) is good if both \\G[k:k+N](x,uj,E)\\ < eN" (a < 1 fixed) and \G[ktk+N](x,u,E)\(jJ) < e~~
On discrete Schrodinger operators with stochastic potentials
213
lower bound on the Lyapunov exponents. One then needs to show that V . 82i < oo. By means of standard methods (resolvent identity plus the polynomial Shnol-Simon bound on generalized eigenfunctions) this ensures that for a.e. LO £ DCA,C the operators Ho,w display Anderson localization. Finally, letting c —> 0 in the Diophantine condition allows one to extend this to a.e. w £ T, thus proving Theorem 1.1. To estimate 5N, note that SN <
Yl
meas
[ w G DCA,c • log\\MN{k(j,Lj,E)\\
<
NLN(w,E)-Na, for some |fc|~JV c ].
Bespec(<-')
The set on the right-hand side is the projection onto the w-axis of the intersections fijv := U|fc|~Arc SN n Ik of the lines £k := {(w, ku) : UJ £ T} with SN:={(u,x)£DCA,cXT
: log \\MN(x,u>,E)\\ < NLN{u,E)
-
Na).
The measure of SN is very small by the LDT, but this by itself does not preclude QJV = T (consider the case where S^ is one of the lines Ik)- We need to know that the intersections of SN with any horizontal line consist of a small number (say Nc many) connected components. This property is given by the Milnor-Thom bound of dc on the number of connected components of semi-algebraic sets of degree d, at least if the potential v is a trigonometric polynomial (the case of general analytic v then follows by approximation). Indeed, if v is a trigonometric polynomial, then the horizontal sections of SN are contained in semialgebraic sets of very small measure and degree Nc, as can be seen from the fact that the entries of MN are polynomials in w of degree < Nc'. From this complexity bound and the LDT bound get Spj < Cn meas[fi/v] < N~T for some r > 0 by means of the following lemma from [4]. L e m m a 3.1. Let 5 c T for every x £ T. Then
2
be such that {u> £ T : (u>, x) £ <S} consists of at most M intervals
meas[w £ T : (u,ku) £ S for some \k\ ~ N] < CJVc(meas[«S])*
+CMN - l
Concluding remarks — Localization can be obtained by this method for shifts of any dimension, as well as the skew-shift. In the latter case the main difficulty is the LDT (only known for large disorders). — The long-range case as well as the Laplacian on Z 2 cannot be treated by the transfer matrix formalism, so no LDT or avalanche principle are available. In those cases one needs to develop estimates on the probability that a given Green's function is bad (in the spirit of Frohlich-Spencer's multiscale method), which again relies on subharmonic function arguments and reductions to smaller scales. — Establishing Holder regularity of the IDS requires a sharp LDT meas [x £ T : | log \\MN(x, E)\\ - NLN(E)\
> 5N] < e'^
N
.
214
W.
SCHLAG
This L D T is known, but only for the shift on T. Averaging the avalanche principle over x by means of this L D T yields \LN{E)
+ Ln[E) < CN~1+e
T h u s \L(E) - LN(E)\
\LN(E) Hence \L(E) - L(E')\
- 2L2n{E)\
< N~1+e
where n ~ log TV.
and
- LN(E')\
< C\E - E'\a,
< CN~1+£
+ ec*n\E
-
E'\.
for some a > 0, as desired.
Open problems — Holder regularity of IDS for shifts on Td with d > 2. Does t h e IDS become more regular as the number of frequencies increases? Currently, t h e results deteriorate with the number of frequencies. Related question: Is there a L D T of t h e form meas [x G T : | log \\MN(x,
—
— — —
E)\\ - NLN(E)\
> 6N] < e~Cs
N
for shifts in higher dimensions? Determine the Holder exponent for t h e IDS. More precisely, can one get Holder | — provided the potential has only non-degenerate critical points? This is known (Bourgain) for the almost Mathieu model and large disorders. Prove a version of Theorem 1.5 on Zd, d > 3. Positivity of the Lyapunov exponent for small disorders for the skew-shift. Positivity of the Lyapunov exponent and Anderson localization for all positive disorders with Tx = 2x (mod 1), see [8].
References 1. M. Aizenman, S. Molchanov, "Localization at large disorder and at extreme energies: an elementary derivation", Coram. Math. Phys. 157, 245-278 (1993). 2. J. Bourgain, "Estimates on Green's functions, localization and the quantum kicked rotor model", Ann. of Math. (2) 156, 249-294 (2002). 3. J. Bourgain, "Green's function estimates for lattice Schrodinger operators and applications", to appear in Annals of Mathematical Studies, Princeton. 4. J. Bourgain, M. Goldstein, "On nonperturbative localization with quasiperiodic potential", Annals of Math. 152, 835-879 (2000). 5. J. Bourgain, M. Goldstein, W. Schlag, "Anderson localization for Schrodinger operators on Z with potentials given by the skew-shift", Coram. Math. Phys. 220, 583-621 (2001). 6. J. Bourgain, M. Goldstein, W. Schlag, "Anderson localization for Schrodinger operators on Z 2 with quasi-periodic potential", Acta Math. 188, 41-86 (2002). 7. J. Bourgain, S. Jitomirskaya, "Continuity of the Lyapunov exponent for quasiperiodic operators with analytic potential", dedicated to David Ruelle and Yasha Sinai on the occasion of their 65th birthdays, J. Statist. Phys. 108, 1203-1218 (2002). 8. J. Bourgain, W. Schlag, "Anderson localization for Schrodinger operators on Z with strongly mixing potentials.", Comm. Math. Phys. 215, 143-175 (2000). 9. R. Carmona, J. Lacroix, Spectral theory of random Schrodinger operators., Birkhauser, Boston 1990.
On discrete Schrodinger operators with stochastic potentials
215
10. E. I. Dinaburg, Y. G. Sinai, "The one-dimensional Schrodinger equation with quasiperiodic potential" M i . Anal. i. Priloz. 9, 8-21 (1975). 11. A. Figotin, L. Pastur, Spectra of random and almost-periodic operators, Grundlehren der mathematischen Wissenschaften 297, Springer, 1992. 12. J. Prohlich, T. Spencer, "Absence of diffusion in the Anderson tight binding model for large disorder or low energy", Comm. Math. Phys. 88, 151-189 (1983). 13. J. Frohlich, T. Spencer, P. Wittwer, "Localization for a class of one-dimensional quasi-periodic Schrodinger operators", Comm. Math. Phys. 132, 5-25 (1990). 14. H. Fiirstenberg, "Noncommuting random products", Trans. AMS 108, 377-428 (1963). 15. I. Ya. Goldsheid, S. A. Molchanov, L. A. Pastur, "A pure point spectrum of the stochastic one-dimensional Schrodinger equation", Funkt. Anal. Appl. 11, 1-10 (1977). 16. S. Jitomirskaya, "Metal-insulator transition for the almost Mathieu operator", Annals of Math. 150 (1999). 17. M. Goldstein, W. Schlag, "Holder continuity of the integrated density of states for quasi-periodic Schroinger equations and averages of shifts of subharmonic functions", Ann. of Math. (2) 154, 155-203 (2001). 18. Y. G. Sinai, "Anderson localization for one-dimensional difference Schrodinger operator with quasi-periodic potential", J. Stat. Phys. 46, 861-909 (1987).
Non-zero random Lyapunov exponents versus mean deterministic exponents for a twist like family of diffeomorphisms of the two sphere MICHAEL SHUB* (U.
Toronto)
We consider a monotone twist map of the unit two sphere in three space and consider the family of dynamical systems given by composing the twist maps with the group of orientation preserving orthogonal transformations of three space. We compare the Lyapunov exponents of random products in this family with the mean of the Lyapunov exponents for random products restricted to a small neighborhood of the individual elements. We prove these two numbers are asymptotic as the twist tends to infinity. Moreover, we present numerical evidence that the same is true for the mean of the Lyapunov exponents for the deterministic systems. If the experimental evidence is borne out, these families would present examples of an analytic family of volume preserving maps of the two sphere containing twist maps and with positive entropy. This is joint work with Francois Ledrappier, Carles Simo and Amie Wilkinson. We compare the results with results on a similar problem concerning random products of linear maps obtained with Jean-Pierre Dedieu.
1. Introduction What I am reporting on is joint work with Keith Burns, Charles Pugh, Amie Wilkinson [2], Jean-Pierre Dedieu [4] and Francois Ledrappier, Carles Simo and Amie Wilkinson [6]. Almost all of the contents here are taken from these three papers without specific attribution. The material may be found in much expanded form in these references. Any errors which may have crept in are mine alone. One of the major achievements of the uniformly hyperbolic theory of dynamical systems is the work of Anosov, Sinai, Ruelle and Bowen on the ergodic theory of uniformly hyperbolic systems. Anosov proved that smooth volume preserving globally hyperbolic systems are ergodic. Sinai, Ruelle and Bowen extended this work to specifically constructed invariant measures for general uniformly hyperbolic systems, now called SRB measures. The ergodicity of these measures asserts that although particular histories are difficult to compute the statistics of these histories, the probability that a point is in a given region at a given time, is captured by the measure. Much of the work in dynamical systems in recent years has been an attempt to extend the results of the uniformly hyperbolic theory to more general systems. One theme is to relax the notion of uniform hyperbolicity to non-uniform or partial hyperbolicity, and then to conclude the existence of measures sharing ergodic properties of the SRB measures. The Proceedings of the Seattle AMS Summer Symposium on Smooth Ergodic Theory and Its Applications contains much along these lines. In particular, you can find the survey of recent progress on ergodicity of partially hyperbolic systems [2] included there. Also there is the more recent [8]. 'Partially supported by an NSF Grant.
216
Non-zero random Lyapunov exponents versus!mean deterministic exponents ...
217
The work on the ergodicity of partially hyperbolic systems may be considered as an elaboration of the Hopf argument for the ergodicity of the geodesic flow on surfaces of constant negative curvature. The crucial point being, of course, the existence of at least some hyperbolicity so that stable and unstable manifolds exist and ergodic averages are a.e. constant along them. In this paper we attempt to study some mechanisms which will ensure the existence of some hyperbolicity in terms of the existence of some non-zero Lyapunov exponents.
2. Unitarily invariant measures on GL n (C) Let Li be a sequence of linear maps mapping finite dimensional normed vector spaces Vi to Vi+i for i £ N. Let v £ Vr0\{0}. If the limit lim £ log ||Lfc-i... £o(w)|| exists it is called a Lyapunov exponent of the sequence. It is easy to see that if two vectors have the same exponent then so does every vector in the space spanned by them. It follows that there are at most dim(Vo) exponents. We denote them Aj where j < k < dim(Vo). We order the Aj so that Aj > Aj+i Thus it makes sense to talk about the Lyapunov exponents of a diffeomorphism / of a compact manifold M a t a point m £ M, \j(f,m) by choosing the l sequence L; equal to T / ( / ( m ) ) . Given a probability measure fi on GL n (C) the space of invertible n x n complex matrices we may form infinite sequences of elements chosen at random from \x by taking the product measure on GL„(C) N . Thus we may also talk about the Lyapunov exponents of sequences or almost all sequences in GL„(C) N . Oseledec's Theorem applies in our two contexts. For diffeomorphisms / Oseledec's theorem says that for any / invariant Borel measure v, for v almost all m £ M, / has dim(M) Lyapunov exponents at m, Xj(f,m) for 1 < j < dim(M). For measures fi on GL„(C) satisfying a mild integrability condition, we have n Lyapunov exponents r\ > r • • • > rn > - c o such that for almost every sequence . . . gu • • • gi £
(*)
218
MICHAEL SHUB
Theorem 2.1. If /J, is a unitarily invariant measure on GL„(C) satisfying (*) then, for fc = 1 , . . . ,n, k
k
£ log ^(,4)1 d/i(^) >£*-;.
/
MeGL„(C)
i=1
i=1
By unitary invariance we mean fx(U(X)) = fJ.(X) for all unitary transformations U G U„(C) and all ^-measurable X G GL„(C). Thus non-zero Lyapunov exponents for the family, i.e. non-zero random exponents, implies that at least some of the individual linear maps have non-zero exponents i.e eigenvalues. For complex matrices we have achieved part of our goal. Later we will suggest a way in which these results may be extended to dynamical systems. Corollary 2.1.
f>g+1^(^)1 d»(A)>J2ri+•
f JAeGhn(C)
i=1
i=1
Theorem 2.1 is not true for general measures on GL„(C) or GL„(R) even for n = 1. Consider
and give probability 1/2 to each. The left hand integral is zero but as is easily seen the right hand sum is positive. So, in this case the inequality goes the other way. We do not know a characterization of measures which make Theorem 2.1 valid. In order to prove 2.1 we first identify the right hand summation in terms of an integral. Let Gn>fe(C) denote the Grassmannian manifold of k dimensional vector subspaces in C™. If A £ GL„(C) and Gn^ G G n| k(C), A\Gntk the restriction of A to the subspace G„,fc. Let v be the natural unitarily invariant probability measure on Gnife(C). The next theorem is a fairly standard fact. Theorem 2.2. If /j, is a unitarily invariant probability measure on GL n (C) satisfying (*) then k
f^n=
[
i=1
•M€GL n (C)./G„, fc eG„, fc (C)
[
\og\det(A\Gn,k)\dv(Gn,k)dn(A).
We may then restate Theorem 2.1. Theorem 2.3. If /i is a unitarily invariant probability measure on GL n (C) satisfying (*) then, for k = 1 , . . . ,n, k
f JA€GLn(C)
V l o g 1^(^)1 rf/x(A)> / ,_,
JAeGhn(C)
/
]og\deb(A\Gn,k)\MGn,k)dn(A). JGn,k£Gn,k(C)
Theorem 2.3 reduces to a special case. Let A G GL„(C) and // be the Haar measure on U n (C) (the unitary subgroup of GL n (C)) normalized to be a probability measure. In this case Theorem 2.3 becomes:
Non-zero random Lyapunov exponents versus mean deterministic exponents ... Theorem 2.4. Let A € GL n (C). Then, [
J2log\\i(UA)\dLi(U)>
JUGVn(C)
219
forl
log|det(>l|Gn,fe)|di>(Gn,fc)-
JG„,*eG„, fc (C)
i=1
We expect similar results for orthogonally invariant probability measures on GL„(R) but we have not proven it except in dimension 2. Theorem 2.5. Let fi be a probability measure on GL,2(R) satisfying g G GL 2 (R) ->log + (||5||) and log + (||5 _ 1 ||) are fi-integrable . a. If n is a §0>2(R) invariant measure on G L ^ R ) then, f ]oS\\1(A)\dn(A)= JAeGh+(R)
[ f logWAxWdS^x) JAEGL+(R) Jxes1
dfi(A).
b. If n is a S02(M) invariant measure on GLJ(M), whose support is not contained in R0>2(R), i.e. in the set of scalar multiples of orthogonal matrices, then f JAeGh^
\og\X1(A)\dn(A)> (R)
f
\og\\Ax\\dS1(x)dfi(A).
[ 1
JAeGl.2 (R) JxeS
Here G L ^ R ) (resp. GL^"(R)) is the set of invertible matrices with positive (resp. negative) determinant.
3. M e t h o d of proof A flag F in C" is a sequence of vector subspaces of C": F = (Fi, F 2 , . . . , Fn), with Fj C F J + I and dimF, = i. The space of flags is called the flag manifold and we denote it by F„(C). An invertible linear map A : C™ —> C" naturally induces a map A$ on flags by At(FuF2,...
,F n ) = (AFUAF2,...,
AF n ).
The flag manifold and the action of a linear map A on F n (C) is closely related to the QR algorithm, see [10] for a discussion of this. In particular if F is a fixed flag for A i.e. A$F = F, then A is upper triangular in a basis corresponding to the flag F, with the eigenvalues of A appearing on the diagonal in some order: X\(A, F), ..., Xn(A, F). Let V^ = {(U,F) G U n (C) x F„(C) : (UA)tF = F). We denote by IIi and n 2 the restrictions to VA of the projections U„(C) x F„(C) —> U n (C) and U n (C) x F n (C) —* F„(C). VA is a manifold of fixed points. We use the diagram
U„(C)
F n (C)
in order to transfer the right hand integral in 2.4 over F n (C) to an integral over U„(C).
220
MICHAEL SHUB
4. A n a l t e r n a t e m e t h o d of proof We give an alternate proof of Theorem 2.5 a in [6]. Let A € GL„(C) or GL„(R) and fi be the Haar measure on U„(C) or SO(n) respectively normalized to be a probability measure. Let G denote GL n (C) or GL n (R). Now we interpolate between random products and deterministic powers by changing /i. Let 5 > 0 and Gs be the delta neighborhood of the identity in G. For g £ G, GggA is a neighborhood of gA in GA. We normalize Haar measure restricted to Gs and push it forward to G$gA. Let us call this measure ns,g- Let ri (S, g) be the largest random exponent for this measure. Now it is not too difficult to see the following propositions. Proposition 4.1. lim^oT\{6,g) =
X\(gA).
Now let vs,g be the stationary measure on G n ,i induced by ns,gProposition 4.2. n(5,g) = / P e G ( n ) 1 ) log \\(A\P)\\ dv5,g. Let i/g = J„pG vs,g dfi. It follows that: Proposition 4.3. JgeG Xi(gA) d/j, = l i m ^ 0 / P e G („,i) l o S \\(A\P)\\
d
"s-
Now n = / p G G / n !) log ||(A|P)|| dv(G(n, 1)). So a comparison of / G G X\(gA) d/j and r\ can be achieved via an understanding of the relationship between i/(G(n, 1)) and i/g. These and the next proposition are proven in [6]. Proposition 4.4. For G = 50(2), i/(G(ra, 1)) = vs for all S > 0. Thus we have another proof of Theorem 2.5 a. The general situation is different. The inequality in Theorem 2.4 for k = 1 is strict when n > 2 unless A is an isometry. So for G = SU{n) the measures v& favor the expanding directions of A as S —* 0. Experimentally the same seems to hold for SO(n) when n > 2. We pursue this line of argument for a dynamical system analogue.
5. A dynamical systems analogue We now use notation in agreement with that in [6]. The content of this section is mostly drawn from [6]. The families we consider are not obtained from a specific set of equations, but from the following construction. Let / : S2 —> S2 be an area-preserving diffeomorphism of the round sphere, and let 50(3) be the isometry group of 5 2 . Let F={9°f
I 2 €50(3)}
be the left 50(3)-coset of / in Diff (5 2 ), and let v be the push-forward of Haar measure on 50(3) to T. Provided that / is not itself an isometry, the family T has nonzero random Lyapunov exponents with respect to v. The question [6] addresses is whether these random exponents can somehow be connected to the Lyapunov exponents of individual members of J7, at least on u-average.
Non-zero random Lyapunov exponents versus mean deterministic exponents ...
221
To test whether there might be such a connection, we chose / to be a twist map, all of whose Lyapunov exponents are zero. The resulting family T has similarities to the standard family on the 2-torus. The key difference is that the maps in T are "integrable", in the sense that the phase space is foliated by invariant curves, while the standard family is not integrable. The dynamics of the individual elements of J- and how they depend on parameters is an interesting topic, but in [6] we only study some key properties in the case of small twists. We mainly focus on two quantities, the random exponent R{v) and the average exponent k{v), which we now define. Let /z be Lebesgue measure on S2 normalized to be a probability measure. Suppose for now that v is an arbitrary Borel probability measure supported on a subset T of DiffM(S'2), the space of //-preserving diffeomorphisms of 5 2 . For / G T, the largest Lyapunov exponent of / at x G S2 is also found by computing the limit: hm-log\\Txr\\
= >
(1)
n-»oo n
which exists for //-almost every x by the subadditive ergodic theorem. We define the average exponent of f to be A ( / ) = / \1(f,x)d^(x) Js2 and the average exponent of v to be
(2)
A(v) = J\(f)Mf)-
(3)
Rather than iterate a single diffeomorphism / G J-, we might choose instead a sequence of diffeomorphisms {/i, / 2 l . , . } c f and form their composition: /
( n )
:=/n°/n-l"-°/l.
If the sequence is chosen independent and identically distributed with respect to v, then almost surely the limit lim - log \\Txf™ || =: R(x, (/Of, v)
(4)
n—»oo n
will exist, for /i-almost every x. (This too follows from the subadditive ergodic theorem, applied in the appropriate context). Further, the integral of R(x, (/i)i°,^) with respect to fi is almost surely independent of the sequence (fi)f- We define the random exponent of v to be this integral: R{y)=
f R(x,(fi)?>,v)dn{x) (5) Js2 (see Kifer for an introduction to the subject of random diffeomorphisms and their exponents) . The quantity A.(v) is mysterious from a computational perspective, but useful from a dynamical one. The quantity R(v) is relatively easy to estimate and is usually positive, unless v is fairly degenerate [3]. In [2] we asked: Question 5.1. Is there a positive constant C — perhaps 1 — such that A(v) > CR(v)l
222
MICHAEL SHUB
Some motivation for Question 5.1 can be found in the linear algebra results above. This is the question we now consider here. We return to the specific family of diffeomorphisms mentioned earlier. For e > 0, we define a one-parameter family of twist maps fe as follows. Express S2 as the sphere of radius 1/2 centered at (0,0) in R x C, so that the coordinates (r, z) G S2 satisfy the equation |r| 2 + |z| 2 = 1/4. In these coordinates define a twist map fe : S2 —> S2, for e > 0, by fe(r,z)
= (r,exp(27ri(r +
l/2)e)z).
Let F n (C) e be the orbit SO(3)fs. Let v be the push-forward of Haar measure on 50(3). We denote the resulting random and average Lyapunov exponents by R(e) and A(e), respectively. To summarize the numerical results obtained in [6], it seems that the inequality A(e) > R(e) is satisfied for large e and even that A(e)/R(e) —> 1 when e —• oo. But the inequality goes the other way for small e. We give heuristics to explain why the inequality is satisfied when "e = oo". We also present an outline of why the same inequality is not satisfied for small e. A partial analysis of the maps of the form g o fe is carried out in [6]. Our analysis of the case "e = oo" is carried out in analogy with the alternate proof of section 4. We interpolate the quantities R(e) and A(e) by a third quantity R(e, S), which measures the exponents of the "in-between" process in which each element of F n (C) e had added noise in a 5-ball. For each g e 50(3) we consider a 5 diffused process around gf. The induced process on T\S2 has a unique stationary measure mffi£i5,which is the unique fixed point of a "simple" linear operator. The measures m 9i£i 5 have the following properties: — ms,e,i5 is stationary for the 5-diffused process about gfe; — mg,e,i5 i s absolutely continuous with respect to m, with smooth density; — m s,e,s projects to Lebesgue measure /x on S2. The measure mE^ is defined by me,s = /
rng:e,5dg,
JSO{3)
where, for each g e 50(3), Jn S)£i j is the measure above. In analogy with the propositions of section 4 we prove that the random exponent R(e, S) satisfies limsupi?(e, 8) < A(e), R(e,6)= R(e)=
f
logHT/vlldm^t;),
[
log||r/i/||(fm(«).
JTiS2
Non-zero random Lyapunov exponents versus mean deterministic exponents . . .
223
If it is the case t h a t m€is —> m as 5 —> 0, then it would follow t h a t : A(e) > lim sup R(e, 5) «-»o = limsup/ 5—*0
= f =
JTxS2
log
\\Tfv\\dmeiS(v)
log\\Tfv\\dm(v)
R(e).
Hence the properties of this measure m£ts are potentially quite interesting with regard t o Question 5.1. For fixed 5 we show t h a t the 'simple" linear operator defining the measures mgtet$ have a limit as e —> oo and t h a t t h e fixed points at infinity have t h e required property. This is suggestive, b u t does not prove the theorem we would like.
References 1. D. V. Anosov, "Geodesic flows on closed Riemannian manifolds of negative curvature", Proc. Steklov. Inst. Math. 90 (1967). 2. K. Burns, C. Pugh, M. Shub, A. Wilkinson, "Recent Results about Stable Ergodicity", in Proceedings of Symposia in Pure Mathematics vol. 69, Smooth Ergodic Theory and Its Applications (A. Katok, R de la Llave, Y. Pesin, H. Weiss, eds.), AMS, Providence, R.I., 2001, pp. 327-366. 3. A. Carverhill, "Furstenberg's theorem for non-linear stochastic systems", Probability Theory and Related Fields 74, 529-534 (1987). 4. J. P. Dedieu, M. Shub, "On random and mean exponents for unitarily invariant probability measures on GL(n,C)", to appear in Asterisque. 5. I. Ya Gol'shied, G. A. Margulis, "Lyapunov indices of a product of random matrices", Russian Math. Surveys 44, 11-71 (1989). 6. F. Ledrappier, M. Shub, C. Simo, A. Wilkinson, "Random versus deterministic exponents in a rich family of diffeomorphisms", to appear in JSP. 7. V. I. Oseledec, "A multiplicative ergodic theorem. Lyapunov characteristic numbers for dynamical systems", Trans. Moscow Math. Soc. 19, 197-231 (1968). 8. "Stable ergodicity", preprint. 9. D. Ruelle, Ergodic Theory of Differentiable Dynamical Systems, Publications Mathematiques de 1'IHES, volume 50, 1979, pp. 27-58. 10. M. Shub, A. Vasquez, "Some linearly induced Morse-Smale systems, the QR algorithm and the Toda lattice", in The Legacy of Sonia Kovalevskaya, Linda Keen ed., Contemporary Mathematics vol. 64, AMS, 1987, pp. 181-193.
This page is intentionally left blank
Equilibrium statistical mechanics Session organized by D.
BRYDGES
(Vancouver) and E.
OLIVIERI
(Rome)
This page is intentionally left blank
Dimensional reduction for isotropic and directed branched polymers JOHN Z. IMBRIE (U.
Virginia)
I will describe an exact relation between self-avoiding branched polymers in D + 2 continuum dimensions and the hard-core continuum gas at negative activity in D dimensions (joint work with David Brydges [1,2]). Our results explain why the critical behavior of branched polymers should be the same as that of the i
1. Introduction and main results In this article we describe some beautiful identities which connect branched polymer models with repulsive gases in lower dimensions. For ordinary (isotropic) branched polymers (BP) the reduction in dimension is 2, and our results [1,2] provide a rigorous version of [3] (in which the critical behavior of BP in D+2 dimensions is connected with that of the Yang-Lee edge in D dimensions). For directed branched polymers (DBP), the reduction in dimension is 1, and our results [4] parallel earlier work on directed lattice animal a (which have been related to dynamical models of hard-core lattice gases [5] and to the critical dynamics of the Yang-Lee edge [6,7]; see also exact results in [8-10] and the review [11]). Forest-root formulas provide a unified picture for dimensional reduction for both BP and DBP. They are used to interpolate in the extra dimensions. They have other applications as well: we show how the one-dimensional forest-root formula generalizes some formulas of [12], used for interpolating in cluster expansions. We now define generating functions for BP and DBP. Let T be a tree graph on { 1 , . . . , TV}. For BP, the kth monomer is at position yk S RD+2 and we write yk = (wk,xk) £ C x RD. For DBP, it is at yk = (tk,xk) e E + x S, where S is either RD or IP'. While we must use continuous coordinates for BP, we allow discrete spatial coordinates for DBP, and for some models the time coordinate is discrete as well. For a unified treatment with DBP we consider here rooted BP (in contrast to [1,2,13], where BP are defined mod translations. Fix the vertex 1 as the root, with y\ = 0. For each pair (i,j) = ij, we define = (*•• x - ) = f(Wi~wJ>Xi~xA> 1 (\ti — tj\,Xi — Xj),
forBP
> for DBP.
^x
Each link of T connects a vertex j to a vertex i where i is one step closer than j to the root along T. For DBP, we require tj > ti. a
Animals (for which loops are allowed) are generally believed to have the same critical behavior as BP (where loops are not permitted). Likewise for directed animals and DBP.
227
228
JOHN Z. IMBRIE
The weight associated with each configuration depends on a linking weight V(y) and a repulsive weight U(y). The generating functions are written as oo
N
»
v
N=l
'
oo
•MILXK
T
JV
^
)
ji£T
ji£T *< T
( 2 )
/•
(N ~ 1)! ^
V
+ xRD)s-i
^
.X^
Here each pair {i, j } appears exactly once, either in rij;gT o r m IljigT- We a s s u m e that U —* 1 as yji —> oo, so that the repulsion vanishes at infinity. For BP, {/ is evidently invariant under rotations in K-° +2 ; for DBP we require U(t,x) = U(t, —x). In order to get dimensional reduction, we require V(t)=2U'(t),
forBP,
V(t,x) = U'(t,x),
for DBP,
'
where prime denotes the i-derivative. Note, however, that in the BP case t denotes a squared-radius variable, whereas for DBP t is a "time" variable. For positive weights, we require that U and V are positive. Furthermore, we assume that V is an integrable function of y so that (2) is well-defined. We relate these generating functions to the density of a repulsive gas in D dimensions. Let A c S and define the grand canonical partition function ,N
r
/ N=0
'
JA
N
T\dxi »=1
Ki<j
where Uy = U(\xij\2|)2 \ for BP and Uij = U(0,Xij) for DBP. Then define the pressure and density: P
^
=
A
1
^ |A) 1 ° g Z H C ^ - ) ;
PKC
^
= Z
TzP^'
^
T h e o r e m 1.1. For all z such that the right-hand side converges absolutely,
PHc(z) = \~2*ZBP\to)' y—ZDBp{-z).
(6)
We can also prove a dimensional reduction formula for correlations. Let N
i
N
6
Xi
p( ) = Yl ^~ ^ t=l
where x,xt £ RD and y,y{ £ RD+2
p{y) = Yl5(y~Vi)>
(7)
i=l
for BP or R+ x S for DBP. Then the density-density
Dimensional reduction for isotropic and directed branched polymers
229
correlation functions of t h e three systems can b e written as
GBp(0,y;z) = £ ^
K * ~ '•)• Y A C X I " ) " -
N
°°
GDBP(0, jj; z)
P® II [<%^toil2)] II U{\Vji\2),
/
7^-TTTE 1
=£
1
jXi eX r
L L
• MT
r
y^^—^ £ /
G H c ( 0 , X; Z) = j h n (p(0) p{x))HCA
p(y) [ ] [<%n^)] II ^ ) .
(8)
.
Here (-)HC,A is the expectation in the measure for which Znc(z)
is the normalizing constant.
T h e o r e m 1.2.
GHc(0,
\-2Trfd2wGBp(o,y;-~), x\z) = < V X; Z) = / 0 °° dt GDBP{0, V, -z),
Here d2w is defined as dudv,
wl
where y = (w, x) e C x
RD,
where y = (t, i ) e R
S.
(9) +
x
where w = u + iv.
2. Examples A n a t u r a l B P example is obtained by letting U(t) = fl{t — 1), where $ is the usual step function. T h e n ^(|2/ij| 2 ) = 5(|j/y| — 1), and we have a model of hard spheres linked together with kissing conditions determined by t h e tree graph T. See figure 1. One can also consider soft repulsion of t h e form U(t) — e~v(-t\ This is particularly interesting when D — 0, because the sine-Gordon representation allows one to write oo
/
r
I
dip 2
(10)
J2TTV
exp -L (ze* + ±
Figure 1. A branched polymer in R2 (left) and a directed branched polymer in Z 2 (right).
230
JOHN Z. IMBRIE
(the t axis). Alternatively, if we let U(t, x) = ^{t + \x\ — 1), then we obtain models of hard diamonds (when |ar| = X^ a =i \xa) or hard double-cones (when \x\2 = Yla=i \x"a) distributed uniformly in contact with the positive surface of the monomer it is linked to (subject to the constraint of nonoverlap with other monomers). In D = 1, the pressure of the hard-sphere gas is computable, it is p{z) = LambertW(z) = -T(-z), where T(z) = that
Y1°N=I ZNNN~1/N\
(11)
is the tree generating function [2]. So Theorem 1.1 implies
ZBP(z) = - i - 0 H C ( - 2 7 r * ) = ] T
(2nz)NNN ^
N=l 00
ZDBP(Z)
= -PHC{-Z)
= Yl
(12)
zN]yN N\
•
N=l
Thus we have exact expressions for the volume available to BP and DBP of size N. One can also consider lattice DBP examples by taking U(t,x) = 1 — J(a;)'!?(l — t), where I(x) is the indicator function of a set of "neighbors" in the lattice, such as {x : \x\ < 1}. Since V(t,x) = I(x)8(t — 1), the set determines which sites a link can jump to, with t always increasing by 1. This model is closest to the standard examples of DBP. See figure 1 for a configuration in two dimensions. The factors rijieT U{yji) enforce nearest-neighbor exclusion for monomers on the same level (same value of t). But there is a subtlety when a monomer at level t has n neighbors at level t — 1 in the polymer. In this case, we can write the [/-factors as •# n - 1 (0), which should be interpreted as ~ = j •&n~1d,d (this can be seen by approximating t? with a smooth function and integrating over t). Monomers which are separated by more than one unit in the t direction do not interact, since U(t, x) = 1 for £>1. By Theorem 1.1, the generating functions of these models equate to the density of the nearest-neighbor exclusion models in D dimensions associated with the weights U(0,x) = 1 — I(x). In D = 1, the pressure can be calculated explicitly [16, equation 2.16] p(*)=lnQ + i v T + 4 ^ ,
(13)
so the generating function is d X •7 (\ < \ l ( ZnMz) = -zTzP(-z) = - [j—
^ V* [27V-1]!! 2 ^ - ^ - I) = E . Wl
(14)
which gives an explicit enumeration of the number of DBP with N monomers. 3. Critical e x p o n e n t s As these reduction formulas are valid out to the edge of convergence of the generating functions, one can deduce the values of the BP and DBP critical exponents by looking at solvable repulsive gas models in low dimension. Let us define an exponent OJHC from the
Dimensional reduction for isotropic and directed branched polymers singularity of the pressure of the hard-core gas, p(z) ~ (z-zc)2~aHC. exponents 7 B P and 7DBP can be denned from ZBP ( - ^ )
~ (z - z.)1-^,
231
Likewise, susceptibility
ZDBP(-z) ~ (z - zc)1-^.
(15)
Note that zc is negative. By Theorem 1.1, the singularities must be the same, so aHc(£>)=7Bp(-D + 2 ) = 7 D B p C D + l).
(16)
Closely related to the susceptibility exponents are the counting exponents. If one defines CJV, d/v from 7
ZBP(Z) =
Z
OO
OO
CNZN
T Y .
>
"•Z N=l
Z
°BP(2)
= T dNzN,
(17)
N=l
then #BP, #DBP a^e determined from the asymptotic behaviors CN
~(~S)~N
N 6BP
~
>
dN~(-zc)-NN-e™*.
(18)
The exponent #BP is usually denned from counting unrooted BP, the difference being a factor N which is produced by z-^. We see that the unrooted generating function Y^, CNZN is related to the pressure p(z) of the repulsive gas [1]. From (16) and the relations #BP = 3 — 7 B P ,
#DBP=2-7DBP,
(19)
one can determine #BP, #DBP from a n o In D = 0, we have «HC = 2 because Znc{z) = 1 + z, so the "pressure" has a logarithmic singularity. For D = 1, one can compute «HC = 3/2 from the solutions for the pressure (11), (13), which have a square root singularity. For D — 2 there is an exact solution for the hard-hexagon model [17], which has GJHC = 7/6 [5,18]. This model works as the starting point for one of our lattice DBP models (with D +1 = 3). Our construction for BP does not work as described above for the hard hexagon model. Nevertheless, one expects the same value of anc for hard spheres. Hence the exponents #BP {D + 2), #DBP {D +1) are determined for D = 0,1,2 (rigorously for D = 0,1). See the table below. Theorem 1.2 connects the Green's functions for the three models, so it is clear that the exponents for the divergence of the correlation length are all equal. For DBP this gives information only on the transverse correlation exponent ^fj B P , which describes the vanishing of the rate of decay in the x directions. The exponent vuc can be computed in D = 1 or more generally determined from a n c via hyperscaling (Dis^c = 2 — CCHC)- Theorem 1.2 also implies that rjnc = VBP [2], and one can compute ?7HC in D = 1, or more generally determine it from Fisher's relation (77 = 2 — 7/^). In D = 2 it can be obtained from the conformal field theory of the Yang-Lee edge [19]. The repulsive gas singularity at negative activity is believed to be in the same universality class as the Yang-Lee edge [16,20]. One way of seeing this is by writing down the sine-Gordon representation for the repulsive gas (see (10) for the D = 0 case). The interaction is elip, whose lowest order term at the critical point is iip3, the Yang-Lee edge interaction. The Yang-Lee edge exponent a can be equated with 1 — anc- This leads to the Parisi-Sourlas
232
JOHN Z. IMBRIE
relation 9BP(D + 2) = 2 + a(D) [3] and its analogue for DBP: 6DBp(D + 1) = 1 + a{D) [6,7,21]. The table below summarizes what we know about exponents for these models (the last line gives the presumed mean-field exponents and upper critical dimensions; see [22,23] for results on BP in high dimensions). OLUC
HC dim£>
DBP BP = 7BP dimD + 1 dim.D+2 = 7DBP
= #DBP
#BP
^BP
= ??BP
a =
— ^DBP
0
1
2
2
0
1
1
2
3
3 2
1 2
3 2
1 2
-1
l 2
2
3
4
7 6
5 6
11 6
5 12
4 5
1 6
MFTD>6
D>7
D>8
1 2
3 2
5 2
1 4
-1
0
1 2
4. Forest-root formulas The key ingredients for proving Theorems 1 and 2 are a pair oi forest-root formulas, which we use to interpolate in the extra dimensions. Let / ( t ) depend on a collection of non-negative real variables t = {£ij}i 0 when any tt —• oo. In the BP case, the t's are functions of another set of variables: i^ = \wi — Wj\2, tj, = |WJ| 2 , with each u>i G C. The forest-root formula is
The sum is over forests F and roots -R (i? is any subset of { 1 , . . . , N} and F is a loop-free graph on { 1 , . . . , N} such that each connected component or tree of F has exactly one root in R). See figure 2. The expression /( F ' f l ) denotes the iVth partial derivative of / with respect to *$,-, ij G F and U, i £ R. The simplest example is when N = 1, F = 0, i? = {1}, in which case (20) reduces to the fundamental theorem of calculus: /(0)=
f'(t)^
= -
f(t)dt.
(21)
In the DBP case the i's are the underlying variables, and t%j = |£« — t,-|. Our second forest-root formula is
/(°) = E / (F,fl)
,/M
n [-*»•] n [-^ -^)]/(F'fi)(t)+ r+fi
(22)
jiGF
As in (2), each link in a tree of F connects a vertex j to a vertex i which is one step closer to the root for that tree. The integration region is {tr > 0, r G R and £, > t,, ji G F } . For iV = 1, (22) is just /(0) = - /0°° /'(*) df.
Dimensional reduction for isotropic and directed branched polymers
233
tree tree
tree tree forest Figure 2. Example of a forest.
4.1.
P r o o f of t h e m a i n r e s u l t s
Let us see how these forest-root formulas lead to a proof of Theorem 1.1. Let N
f(t)=g(t1/e)l{g(eti)
J ] Utj,
(23)
l
where Utj is either f/(|yy| 2 ) = £/(*»j + \%ij\2) for B P or U(Uj,Xij) for D B P (we suppress the dependence on { x y } in / ( t ) ) . Here g is any smooth function which decreases to 0 and satisfies g(0) = 1. For each ji e F, Uji is differentiated and becomes t h e linking weight Vji (Vji/2 for B P ) . For each r € R a g is differentiated. One finds t h a t (up t o terms which vanish as e —> 0) all the trees of F decouple due to the large separation in the ^-direction (the distribution of tr for an n-vertex tree is essentially —(g(etr)n)'dtr or —(g(etr)n)'cPw/ir, which are very spread-out probability measures). One tree has its root fixed at 0 by a factor — (g(tr/e))'dtr or — (g(tr/e))'d2w/Tr, which converges to a <5-measure at 0. T h e others cancel with the normalization Znc(z), so t h a t
p z)
^ =}%»a
z c{z) l
oo
N
.
N
» ~ £ ML ndx^
-2-7rZBp 1 Similarly, one can show t h a t pnc(z) = ~^DBP(—Z). For each link there is a factor — ^ (BP) or —1 (DBP), and this leads to the scaling of Z D B P and Z B P and their arguments in Theorem 1.1. Further details of this argument are in [2]. Theorem 1.2 can be obtained by differentiating (6) with respect to a source at x (see [1] for details). 4.2.
Decoupling expansions
It is interesting to look at these forest-root formulas from the point of view of decoupling expansions. Complete decoupling occurs only at t = 00, so it is important to note t h a t the
234
JOHN Z. IMBRIE
trees of F still interact in (20),(22). However, if we start with a function h which depends only on {Uj}, and let / ( t ) = h(t) f l i = 1 (e£i) as above, then in the limit e \ 0 we obtain (from (22), for example) a forest-root formula where all the trees are decoupled:
Mo) = E / F
JK
II[-d(tj-*i)]>*(F)(t).
(25)
+ jieF
Note that if a tree has n vertices, the sum over its root gives a factor n. This appears in — (g(etr)n)' which, as noted above, is a spread out probability measure. Thus the root sums disappear (one still needs to select a root for each tree, however — take the one with the smallest label in { 1 , . . . , N}, for example). One is tempted to change variables here, to Sji = tj — ti for ji € F. Then h^ has to be evaluated at tki = \tk — U |, where tk is a height variable obtained as the sum of s parameters along the tree joining the root to k (tki = oo if k and I are not in the same tree). The result is a new decoupling formula which has some similarity with Theorem III.2 in [12]. In fact, the rooted Taylor forest formula of [12] can be obtained from (22) by a limiting procedure similar to what we did in making a lattice model of DBP. Let H(w) depend on {toy}, a set of decoupling parameters in [0,1] (no assumption that Wij = Wi—Wj). Make a change of variable Wij = $(1 — s^), where $ is a smooth, monotone approximation to the step function. Apply (22) with Sji =tj —t^ for ji £ F. The result of this is that for each kl £ F, s^i is determined as the height difference tki- As Sji € [1— £, 1+e] for ji € F, the height variables tj. become discretized and actually measure the number of steps from the root to k. Then if the height difference is 2 or more, Wki = 0, and if the difference is 1, all Wjk linking j to the next level down are equal to Wji, where ji G F. Thus we obtain
n dw ^ (F) ( w )>
tfw^z/
(26)
which is Theorem III.2 of [12]. We have switched from s-derivatives and integrals to wderivatives and integrals, noting that under a change of variables ®M- -ds is invariant (except for the loss of the minus sign because ^7 < 0). 4.3. The two-dimensional forest-root formula The two-dimensional forest-root formula (20) is proven using a supersymmetry argument in [1]. Replace each variable ti in / ( t ) with _ dwj A dvjj Ti = WiWi -\ — , 2m
(27)
and each t%j with y = WijWij +
dwi A diDi ——-. 2m
(28)
Then / ( r ) is defined by its Taylor series. A "localization" formula / / ( r ) = /(0) JcN
(29)
Dimensional reduction for isotropic and directed branched polymers
235
holds, which becomes the forest-root formula when expanded out. This can be proven by deforming the problem to the independent case (21), using ideas from [24]. See [14] for an alternative argument, which uses the linearity of (29) to reduce to the case where / is an exponential (a Gaussian calculation) [25]. See also [26]. 4.4. Proof of the one-dimensional forest-root formula As mentioned above, the TV = 1 case is just the fundamental theorem of calculus. For N = 2, consider / ( t i , /2, £12) and use subscripts 1, 2, 12 to denote partial derivatives. Then /»oo
/(0) = - / ds(f1(S,s,0) + f2(s,s,Q)). (30) Jo Apply the N = 1 formula to the fa term (integrating with respect to x2 — £1), and to the fa term (integrating with respect to x\ — x2). The result is /*oo
/(0) = /
/>oo
dtx\
Jo
/>oo
d(t2 - i 1 )(/ 1 , 2 + / M 2 ) + / Jo
Jo
/*oo
dt2
d(t2 - tiX/2,2 + J 2 , 12 ),
(31)
Jo
since ^- = 1 for t2 > t, and ^- = 1 for tx > t2. The two / i j 2 terms combine to form /E2 dtidt2fit2, which is the term R = {1,2} of (22). The other two integrals are the terms i?={l},{2}. We prove the general case by induction on N. Begin as above with f(0) = -
dflV/fe(s>...,s)0,...,0)I Jo
(32)
fc=i
where the integral is along the diagonal, t\ = t2 = • • • = t^. Consider one of these terms, say k = N, and apply (22) in the variables it = U — tjv, i = 1, • • •, N — 1, keeping £jv = s fixed:
fN(tN,...
,tN,0,... ,0) = 53 JNI (F,R)
R
+
l[[-dir} n [-d{ti - ii)\fg>k\t). r£R
(33)
jieF
Note that when computing the derivative of f^ with respect to ir, there will be a term /^)7and also a term JN,TN (coming from the dependence on trN = tr — t^ = ir). Thus each (F, R) on { 1 , . . . , N — 1} gives rise to 2'fll terms, each of which can be assigned a unique (F, R) on { 1 , . . . , TV}. R consists of N, together with each r £ R with an /jv,r term. F consists of F, together with rN, r € R when r gives rise to an /N,VN term. Observe that each root in R ceases to be a root in R if it is connected by a bond rN in F. We obtain in this way all (F, R) with N € R, and each satisfies the condition that each tree of F contains exactly one root. As a result, /(°)=E
E
k=l (F,R):keR
/„[-<***] n R
+
i€iJ\{fc}
H(
(34)
ji€F
It is evident that if we take the term (F, R) of (22), and consider the subset of the integration region for which tk = min r £ fli r , we obtain the term k, (F,R) of (34). This completes the proof.
236
JOHN Z. IMBRIE
Acknowledgments I t h a n k David Brydges and J o h n Cardy for discussions which improved my understanding of these problems. Research supported by N S F grant PHY-0244884.
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12.
13.
14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26.
D. C. Brydges, J. Z. Imbrie, Ann. Math., to appear; arXiv:math-ph/0i07005. D. C. Brydges, J. Z. Imbrie, J. Statist. Phys. 110, 503 (2003); arXiv:math-ph/0203055. G. Parisi, N. Sourlas, Phys. Rev. Lett. 46, 871 (1981). J. Z. Imbrie, "Dimensional reduction for directed branched polymers", preprint. D. Dhar, Phys. Rev. Lett. 5 1 , 853 (1983). J. L. Cardy, J. Phys. A 15, L593 (1982). N. Breuer, H. K. Janssen, Z. Phys. B 48, 347 (1982). D. Dhar, Phys. Rev. Lett. 49, 959 (1982). P. Di Francesco, E. Guitter, J. Phys. A 35, 897 (2002); arXiv: cond-mat/0104383. Sumedha, D. Dhar, J. Phys. A 36, 3701 (2003); arXiv: cond-mat/0303450. M. Bousquet-Melou, Discrete Math. 180, 73 (1998). A. Abdesselam, V. Rivasseau, "Trees, forests and jungles: a botanical garden for cluster expansions", in Constructive physics, Lecture Notes in Phys. vol. 446, Springer, Berlin, 1995, pp. 7-36; mp.arc: 94-291, http://www.ma.utexas.edu/mp_arc-bin/mpa?yn=94-291 . J. Frohlich, "Mathematical aspects of the physics of disordered systems", in Phenomenes critiques, systemes aleatoires, theories de jauge, Part II (Les Houches, 1984), North Holland, Amsterdam, 1986, pp. 725-893. J. Z. Imbrie, Ann. Henri Poincare 4, 421 (2003); arXiv:math-ph/0303015. J. L. Cardy, J. Phys. A 34, L665 (2001); arXiv:cond-mat/0107223. S.-N. Lai, M. E. Fisher, J. Chem. Phys. 103, 8144 (1995). R. J. Baxter, Exactly solved models in statistical mechanics, Academic Press Inc., London, 1982. A. Baram, M. Luban, Phys. Rev. A 36, 760 (1987). J. L. Cardy, Phys. Rev. Lett. 54, 1354 (1985). Y. Park, M. E. Fisher, Phys. Rev. E 60, 6323 (1999); arXiv:cond-mat/9907429. H. E. Stanley, S. Redner, Z. R. Yang, J. Phys. A 15, L569 (1982). T. Hara, G. Slade, J. Statist. Phys. 59, 1469 (1990). T. Hara, G. Slade, J. Statist. Phys. 67, 1009 (1992). E. Witten, J. Geom. Phys. 9, 303 (1992); arXiv:hep-th/9204083. D. C. Brydges, J. Wright, J. Statist. Phys. 5 1 , 435 (1988); J. Statist. Phys. 97, 1027 (1999). J. L. Cardy, "Lecture on branched polymers and dimensional reduction"; arXiv:cond-mat/ 0302495.
Random path representation and Ornstein-Zernike theory of fluctuations DMITRY I O F F E
(Technion, Haifa)
Rigorous approach to the Ornstein-Zernike theory of fluctuations is based on a random walk/random line type representation of semi-invariants (high temperature models) and of phase boundaries (low temperature two-dimensional models) and on renormalization procedures which set up the stage for a probabilistic analysis along the lines of the thermodynamic formalism of one-dimensional systems associated to countable Markov shifts. Recent results include sharp asymptotics of correlation functions for sub-critical percolation models/high-temperature finite range Ising models in any dimension and an invariance principle for low temperature twodimensional interfaces.
1. Introduction Classical Ornstein-Zernike theory [12] leads to a sharp asymptotic description of point-topoint radially symmetric density correlation function G on R d for simple fluids away from criticality. The original computation hinges on three main ad hoc assumptions: A l The interaction between particles at different points of Kd is either direct or mediated by a direct interaction with a third particle: G(x) = F(x) + f
F(y) G(x - y) dy,
(1)
where F is the direct correlation function. A2 J F(x) dx < 1, which is an expression of non-criticality. A 3 The direct interaction F is a short range one. Then there exists a decay exponent £ > 0 and a number c < oo, such that the asymptotic behaviour of G is given by G(x) = _ £ _ e - « W ( l + o(l)).
(2)
In probabilistic terms (2) could be (in the case of non-negative F) understood as follows: Define the Laplace transforms (t G M.d and (•, •) is the corresponding scalar product) W(t)=
f
e{t'x)F{x)dx
and
jRd
G(t) = f JRd
e^G(x)dx.
(3)
By A 3 function F is defined and analytic on Rd. Furthermore, by A2 the G-integral in (3) has positive radius of convergence £ > 0 (no coincidence — it is the same £ as in (2)), and for every \t\ < £,
237
238
DMITRY IOFFE
Thus ¥(t) = 1 on {t : \t\ = £}. In particular, given x G Rd and the "dual" direction t0 = to(x) = £x/\x\, the function
foiv) = e^v)F(y) is a probability density on Rd. Let Vi.V^,... be the i.i.d. steps of an Revalued random walk, each V* being sampled from /o. Then the renewal formula (1) could be developed as oo
k
*iWG{x) = Y,®h{Vx fe=i
+ --- + Vk = x),
(5)
l
where ®i/o is the A;-fold convolution of /o- Because of the radial symmetry the expectation Eo^fc = VlogF(io) is co-linear with x. The OZ formula (2) follows now from classical local limit results and the Gaussian summation formula. It is important to stress that (2) is a consequence of the pre-assumed random walk type structure of the two-point function G. The purpose of this lecture is to explain recent attempts to recover the Ornstein-Zernike formula directly from the picture of microscopic interactions in various lattice models of Statistical Mechanics including weighted lattice self-avoiding walks [7,8], sub-critical short range Bernoulli percolation models [3] and sub-critical finite range ferromagnetic Ising models [4]. In all these models two-point functions enjoy random line type representations and the corresponding versions of the Ornstein-Zernike formula (2) could be reinforced with invariance principles for these random lines [5,6,10]. Similar results hold for phase separation lines in the nearest neighbour 2D Ising model below the critical temperature [6] and, apparently, pertain to a much larger class of low temperature 2D models in the Pirogov-Sinai regime [9]. It is remarkable that by and large the original assumptions A 1 - A 3 indeed perceive the mathematical structure of the twopoint functions. However, rigorous verification of an appropriate form of A3 comes as the major renormalization step involved in the proof and, furthermore, in general it appears to be natural to build the corresponding local limit analysis not on the results for sums of independent steps (as generated by the renewal assumption A l ) , but rather on the general thermodynamic formalism of one-dimensional systems generated by Ruelle's operator for shifts over countable alphabets (of irreducible random path-type objects).
2. Bernoulli bond percolation 2.1. The OZ formula Each nearest neighbour bound of Zd is independently open with probability p and closed with probability 1 — p. The two-point function is defined as Gp(x) = P p (0 <-> x),
Random path representation and Ornstein-Zernike theory of
fluctuations
239
where the event {0 <-> x} implies the existence of a connected chain of open bonds leading from the origin to x. Define now the (direction dependent) inverse correlation length ip{x) = - lim -
logG([nx\).
(6)
n—»oo n
A fundamental result of the percolation theory [2,11] implies that £p is an equivalent norm on Rd for every p < pc(d), where pc is the percolation threshold. Theorem 2.1. [3] For any d > 2 and for any p < pc(d) the asymptotic behaviour of G(-) is given by
G(z) = M ^ e - « > M (1+ 0 (1)), where n(x) = x/\x\ G S d _ 1 is the direction ofx and ^p is a positive locally analytic function onSd~l. The qualitative picture behind Theorem 2.1 is that under the conditional measure P p ( • |0 <-> x) the common connected cluster C0<x has an essentially forward one-dimensional structure: The meaning of the word "forward" is encoded in the geometry of £p: Being an equivalent norm on Rd it is a support function of a compact convex set K-P={t
: (t,y)
My e Rd) .
Alternatively the set K,p could be described as the closure of the domain of convergence of the series t
_> Y,e(t'v)G(y)-
(7)
y
Given x G Rd \ 0 a boundary point t0 e dJCp is said to be dual to x if £ p (x) = {t0,x). Given such tQ and a small enough 5 > 0 let us say that a vector y e Rd is forward if {to,y) > (1 - S)£p(y). Let C0 be the convex cone of forward vectors. The forward structure of the connection cluster is quantified on large enough renormalization scales K < oo. With such scale K fixed the irreducible decomposition of CotX is depicted on figure 1:
Hxiui
KxiUi)
(\ 'ill.
"I
T-Cx{u„) Figure 1. Irreducible decomposition Co,* = C L U Ci U ... U C„ U C R : u 0 = 0, un+i = x and u i , . . . , u„ are cut points of Co,x such that C, C KB(ui) +C$ for i = 1,..., n - 1 and CR C KB(un) + CY
240
DMITRY IOFFE
Namely, let us say that ti e Co,j is a cut point if 0 < (u, x) < (x,x) and Hx (u) D Co,x = u, where TLx{u) = {v : (x, v — u) = 0} and Co,x is embedded in Rd with all its open bonds. In the irreducible decomposition depicted on figure 1 un is the first (largest x-projection ) cut point of Co,x s u c n that C R = CQ>X d{v : (v — un) > 0} satisfies C R C KB(un) + Cs, where B(u) is the unit R d -ball centred at u. Accordingly, u„_i is the first cut point after un such that C„_i = Co,x H {v : (v — un) > 0} \ C R satisfies C„_i C KB{un^\) + Cs and so on. The collection U = {u$ = 0, u\,... ,un,un+\ = x} of irreducible cut points could be viewed as a trajectory of a Zd-valued random walk with steps Vj_, = ui, VR = x — un and Vi = V{Ci) = Uj+i — Ui\ i = 1 , . . . , n — 1 being the displacement along the end-points of the cluster Ci. The crucial OZ assumption A 3 on the short range nature of direct correlations finds its rigorous expression in the following Lemma 2.1. There exists a finite renormalization scale K = K(d,p,5) stants c\, C2 > 0 such that
and positive con-
P(max{diamCt,diamCi,...,diamCfl} > a \ 0 <-> x) < ci|x|e~ C2a ,
(8)
uniformly in x G Rd. In particular the left-most and the right-most clusters CL and C R could be ignored on the CLT scale (as \x\ —> oo). The remaining intermediate clusters C\,..., Cn-\ have identical geometric structure. In the sequel we shall refer to them as irreducible. Let to S dKp be a dual point to x, £p(x) = (to,x). Define:
QP(y)=
£
e(*°^Pp(C).
C irreducible : V(C)=y
It readily follows from Lemma 2.1, the irreducible decomposition of CojX on figure 1 and a characterisation of K.p in terms of the convergence of the series in (7) that Q p is a probability distribution on Z d with exponentially decaying tails. Similarly, the (in general nonprobabilistic) weights O L ( S / ) = Y,
e (t0,!/) P P ( C L )
and
QR(y) =
CL:0-»?/
£
e^Pp(CR)
CR:-y—0
also have exponential tails. In view of Lemma 2.1 the irreducible decomposition of CotX could be recorded (for generic directions of x) as oo
e*'<*>G(aO = o (e- C3 l*l) + £ Q L ( y ) Q R ( * ) £ y,z
n
(g)Q„ (V, + • • • + Vn = x - y - z),
(9)
n=l 1
and the statement of Theorem 2.1 follows by classical local limit computations. 2.2. Diffusive scaling of the connection cluster C0t\_Nxj The geometry of K.p is related to a local limit analysis based on the representation formula (9) through (7). In particular, it follows [3] that dK,p is locally analytic and, in addition,
Random path representation and Ornstein-Zernike theory of
fluctuations
241
at each point t0 G dK,p the principal curvatures Ki,... ,Kd-i are strictly positive. These principal curvatures explicitly come into picture on the level of the invariance principle: Let x G S d _ 1 . By the very construction all the steps VL, V i , . . . , Vn-\, VR. in the irreducible decomposition of Co^iVxj n a v e positive ^-projection. Let CM — £N{VL, VI, • • •, VL) be the polygonal interpolation through the points of U. By construction, the cardinality # { £ J V D Ti-hNx) — 1 for every h G [0,1]. Thus the polygonal line CN could be represented as a function on [0,1]. Define C^(h) = £jv l~l Ti-hNx a n d consider the following rescaled centred version of the latter:
The random function
Bd~l
any p < pc{d) and any x £Rd
the weak limit of 4>N
are independent Brownian bridges on [0,1].
Since by Lemma 2.1 the Hausdorff distance djy(Col[./vxj,£Ar) vanishes under the above diffusive scaling, Theorem 2.2 could be equally considered as an invariance principle for the whole connection cluster Co, [JVXJ .
3. Ferromagnetic Ising models 3.1. OZ formula Let J = {Jv}vezd> be a collection of nonnegative real numbers such that Jv = J^v and Jv = 0 if \v\ > R, where R is some finite number. To avoid trivialities we assume that {v : Jv > 0} generates Zd. The (formal) Hamiltonian of the corresponding finite range ferromagnetic Ising model on Zd is given by H
(°) = ~~ Yl
J
y-x°-xOy .
(10)
In the high temperature region (3 < (3C = f3c(3,d) there is the unique infinite volume Gibbs state ( -)p and, furthermore [1], the inverse correlation length £.p(x) = - lim -log(f7O0-|„xj) is an equivalent norm on Rd \ 0. In the sequel we shall use JCp to denote the convex set supported by £3. Theorem 3.1. In any dimension d>l and for any ferromagnetic model (10) the asymptotic decay of the two-point correlation function in the high-temperature region j3 < j3c{3,d)
242
DMITRY IOFFE
is given by
M^^0k^"-><-<(l+om),
(11)
where nx = x/\x\ € S rf_1 is the unit vector in the direction of x, and the function typ is strictly positive and analytic. As a byproduct of the proof of the theorem above we infer that dICp is locally analytic and strictly convex. In two dimensions Kp is reminiscent of the Wulff shape (by duality it is precisely the low temperature Wulff shape in the case of nearest neighbour interactions). By the analyticity of dK.p, no roughening transition is possible in such low temperature 2D models where the theory applies [5]. The relation between the curvature of dKp and the variance of fluctuations is discussed in Subsections 3.2. The proof of Theorem 3.1 is based on a careful analysis of the geometry of the following high temperature random line representation of the two point-function: q
{o-0O-x)/3 = Yl
^ '
(12)
A:0-»x
where the sum runs over a family of admissible self-avoiding paths, see e.g. [4,13] for details. As in the percolation case the crux of the matter is to develop an irreducible decomposition of such paths on a large enough renormalization scales and then to reinterpret (crocrx)p in terms of the associated random walk whose steps are just displacements along the successive irreducible pieces. The new difficulty arising now is that is not possible anymore to write down an OZ equation: Indeed, for the Ising model the weights do not factorize: qp(XUf) ^ qp(X) 9/3(7), where AII7 is the concatenation of two (compatible paths). Consequently local limit analysis of {o-Qax)p cannot be based on the results about sums of independent random variables. However, it happens that the weights qp possess a number of nice properties which give rise to essentially the same local limit structure as in the independent case (see e.g. §1 of [5]). The crucial role is played by the strict exponential decay of the two-point function and by the following exponential decoupling property: For any pair of compatible paths A and 77 define the conditional weight qp(\ | 77) = qp(X U rf)/qp{r}) where A H 77 denotes the concatenation of A and 77. Then, there exists C4 < 00 and 0 € (0,1) such that, for any four paths A, 77, 71 and 72, with A H 77II71 and AII77II72 both admissible,
qp{X\nUl2)
)
fr!
(13)
xGX
3/G71U72
As in the case of Bernoulli percolation the notion of irreducibility is spelled out in terms of the geometry of dICp. Given a finite renormalization scale K, a small number 5 > 0, a target point x € Z d and the direction to £ dKp dual to x, let us say that ui is a break point of a path A = (u\,..., un) if {ut} - \r\Hx{ui), and {ui+i,...,un}
cKB(ii)+C0,
Random path representation and Ornstein-Zernike theory of
fluctuations
243
where, as in the percolation case, Co is the cone of forward directions. A path A is irreducible if it does not contain break points. If 7 = Ai II • • • II An is the the irreducible decomposition of 7, 9lv ul c fe_/ i4
Y^ 5 2
~ ^ s0 .
( )
which, in view of the decoupling property (13), already implies uniform estimate on the ratios of conditional weights. As a consequence, the irreducible representation
(
]C
g/?(AiII...IIA n )
A:0-*x A=AiII...IIA n Afcirreducible, k=l,...,n n-1
= J2 n>l
£ A:0-»x A=AiU...nA„ At irreducible, A;=l
q0(Xn)Hq0(Xk\Xk+1U---UXn)
(15)
fe=l n
can be interpreted in terms of thermodynamic formalism of one-dimensional systems associated to Ruelle's operators for shifts over countable alphabets (of irreducible paths) with uniformly Holder continuous potentials and, accordingly, classical local limit results come into play through the analytic perturbation theory of simple poles of the corresponding resolvent operators [4]. 3.2. Invariance principle for high temperature models As in the case of the Bernoulli percolation the irreducible representation (15) elucidates the effective random walk structure of the two-point function (o-Qcrx)p: Path weights q@ give rise to the probability measure Qx,p on the space of all connection paths 7 : 0 1—» x in their irreducible representation 7 = Ai I I . . . An; Qx,/3(7) = Qx,/3 (Ai I I . . . A„) =
rq0
(Ai I I . . . A„)
(O"0<Jx//9
Let Vi = V(Xi) be the displacement along the i-th irreducible piece Aj. Lemma 3.1. For any dimensional > 2, for any (3 < (3c(d) and for any positive number 5 > 0 there exists a finite renormalization scale K = K(d,P,S) and positive constants Ce,cr > 0, such that QX:/3 (max{diam(Ai), V 2 ,..., Vn} > a) < c 6 |x|e" C 7 a . (16) uniformly in x € Z d . Let x € S d _ 1 and consider the family of measures Q[Nx\,0 a s N —> 00. Let 7 = AiII... A„ be the irreducible decomposition of a connection path 7 from 0 to [Nx\. As in the percolation case we use CN = £jv [7] to denote the linear interpolation through the vertices 0,V1,V1+V2,...,V1
+ --- +
Vn=[Nx\.
By Lemma 3.1 there exists c 8 > 0 such that
244
DMITRY IOFFE
vanishes as N —> oo. Thus 7 and £#[7] are indistinguishable on scales much larger than log N, under the diffusive scaling in particular. As in the percolation case define: (j>N{h) = - = (CN(h) VN
hNx),
with 4>N being understood as a function from [0,1] to the tangent space to dtCp at the dual to x point to, £/3(x) = (t0,x). Theorem 3.2. [6] For any d > 2, any (3 < 0c(d) and any x £ S'* -1 the weak limit of 4>M under Q[Nxit0(-) is
(^/irlB\...,^r1Bd-1),
where B1,..., Bd~l are independent Brownian bridges on [0,1], and « i , . . . , KJ,-I are principal directions of curvature of dK,p at to. I would like to stipulate that the results above go beyond finite range ferromagnetic models and, in fact, reflect large scale limit properties of a whole class of random line type objects once a (12)-like decomposition of the corresponding partition function is available. A set of general explicit conditions on path weights is formulated in [5]. In particular, appropriate modifications of these techniques lead to a proof of the invariance principle for phase separation lines in the 2D Ising model up to the critical temperature [6] and, more generally, to a theory of interfaces in the two dimensional low temperature models in the Pirogov-Sinai regime [9].
Acknowledgments Part of the work reported in this paper has been accomplished during my stay at Isaac Newton Institute for Mathematical Sciences in the framework of the programme "Interaction and growth in complex stochastic systems".
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13.
M. Aizenman, D. J. Barsky, R. Fernandez, J. Stat. Phys. 47, 342 (1987). M. Aizenman, D. J. Barsky, Comm. Math. Phys. 108, 489 (1987). M. Campanino, D. Ioffe, Ann. Probab. 30, 652 (2002). M. Campanino, D. Ioffe, Y. Velenik, Prob. Theor. Rel. Fields 125, 305 (2003). M. Campanino, D. Ioffe, Y. Velenik, to appear in Advanced Studies in Pure Math. (2003). L. Greenberg, D. Ioffe, preprint (2003). O. Hryniv, D. Ioffe, to appear in Mark. Proc. Rel. Fields (2003). D. Ioffe, Mark. Proc. Rel. Fields 4, 323 (1998). D. Ioffe, Y. Velenik, in preparation. E. Kovchegov, to appear in Mark. Proc. Rel. Fields (2003). M. V. Menshikov, Sov. Math. Doklady 33, 856 (1986). L. S. Ornstein, F. Zernike, Proc. Acad. Set. (Amst.) 17, 793 (1915). Ch. E. Pfister, Y. Velenik, Comm. Math. Phys. 204, 435 (1999).
Construction of a 2-d Fermi Liquid HORST KNORRER
(ETH-Zentrum, EUGENE
Zurich), J O E L FELDMAN (U. of British TRUBOWITZ (ETH-Zentrum, Zurich)
Columbia),
The temperature zero renormalized perturbation expansions of a class of interacting many-fermion models in two space dimensions have nonzero radius of convergence. The models have "asymmetric" Fermi surfaces and short range interactions. One consequence of the convergence is the existence of a discontinuity in the particle number density at the Fermi surface. Here we describe the main results and highlight some of the strategy of the construction.
1. T h e r e s u l t s The concept of a Fermi liquid was introduced by L. D. Landau in [16-18] and has become the generally accepted explanation for the unexpected success of the independent electron approximation. An elementary sketch of Landau's well known physical arguments can be found in [4, pp. 345-351]. More thorough and technical discussions are presented in [1, pp. 154-203] and [22]. Roughly speaking, at temperature zero, the single particle excitations of a noninteracting Fermi gas become (almost stable) 'quasi-particles' in a Fermi liquid. The quasi-particle spectrum has the 'same structure' as the noninteracting single particle excitation spectrum and the quasi-particle density function n(k) still has a jump at the 'Fermi surface'. The quasi-particle interaction at temperature zero is encoded in Landau's /-function / ( k p , k ' F ) . It is well known that there are a number of potential instabilities that can drive an interacting Fermi gas away from the Fermi liquid state. See, for example, [21, §1.2,4.5]. One of the most celebrated is the BCS instability for the formation of Cooper pairs leading to superconductivity in 2 and 3 dimensions. This is a potential instability for any time reversal invariant system [15,19]. Another important instability is the Luttinger instability. There are solvable models in one space dimension that exhibit qualitatively different behavior from that of a three dimensional Landau Fermi liquid. In particular, the quasi-particle density function n(k) is continuous across the 'Fermi surface' but has infinite slope there. These systems are called Luttinger liquids. For a rigorous treatment of Luttinger liquids in one dimension, see [5] and the references therein. Anderson [2,3] suggested that a two dimensional Fermi gas should exhibit behavior similar to a one dimensional Luttinger liquid. [11, Theorem 1.4] rigorously shows that this is not the case for a specific class of models. In particular, we show that, at temperature zero, the density function rc(k) has a jump discontinuity across the Fermi surface [11, Theorem 1.5]. The existence of the Landau f-function and its basic regularity properties follow directly from [11, Theorem 1.7]. For results concerning Fermi liquids at strictly positive temperature, see [7,8,23,24]. The class of models that we consider is somewhat unusual in that the Fermi surface survives the turning on of all sufficiently weak short range interactions. To motivate the
245
246
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
class, consider a gas of fermions with prescribed, strictly positive, density, together with a crystal lattice of magnetic ions. The fermions interact with each other through a two-body potential. The lattice provides periodic scalar and vector background potentials. As well, the ions can oscillate, generating phonons and then the fermions interact with the phonons. At the present time our result is restricted to d = 2 space dimensions. But we believe that the difficulties preventing the extension to d = 3 are technical rather than physical. Indeed, there has already been some progress in this direction [6,20]. To start, turn off the fermion-fermion and fermion-phonon interactions. Then we have a gas of independent fermions, each with Hamiltonian
#o = ;^-(iV + A(x)) 2 + £/(x). We assume that the vector and scalar potentials A, U are periodic with respect to some lattice r in R 2 . Note that it is the magnetic potential, and not just the magnetic field, that is assumed to be periodic. This forces the magnetic field to have mean zero. By convention, bold face characters are two component vectors. Because the Hamiltonian commutes with lattice translations it is possible to simultaneously diagonalize the Hamiltonian and the generators of lattice translations. Call the eigenvalues and eigenvectors e ^ k ) and ^ ^ ( x ) respectively. They obey flo<7^,k(x) = e„(k)0„,k(x), ^ , k ( x + 7) = e ' < k ^ , k ( x ) ,
V7er.
The crystal momentum k runs over R 2 / r # where r#
= {b G M2 | (b,7)
G 2TTZ for all 7 G T}
is the dual lattice to T. The band index v G N just labels the eigenvalues for boundary condition k in increasing order. When A = U = 0, £„(k) = ^ (k — k„,k) 2 ^ or s o m e b„, k e r # . In the grand canonical ensemble, the Hamiltonian H is replaced by H — /j,N where N is the number operator and the chemical potential fi is used to control the density of the gas. At very low temperature, which is the physically interesting domain, only those pairs v, k for which e„(k) « /J, are important. To keep things as simple as possible, we assume that £„(k) ss n only for one value v$ of v and we fix an ultraviolet cutoff so that we consider only those crystal momenta in a region B for which |e^0 (lc) — JJL\ is smaller than some fixed small constant. We denote E(k) = e„ 0 (k) — fi. When the fermion-fermion and fermion-phonon interactions are turned on, the models at temperature zero are characterized by the Euclidean Green's functions, formally defined by
(2) The action Aty,$)
= -fdk
(ik0 - £(k)) V^Vfc + V(V, V0-
(3)
247
Construction of a 2-d Fermi Liquid The interaction V will be specified shortly. We prefer to split A - J dk(iko — •E(k))^fcVfc a n d write
(lM)) =
Q + V where Q
J/(M)e*^>n fc , g dV> fc , g d&, g
f eVWrf)
duett,®
where due is the Grassmann Gaussian "measure" with covariance C(k)
1 ik0 - E(k)'
We have here dropped some factors of 2ir. We will continue to routinely drop various unimportant constants throughout this article. We now take some time to explain (2). The fermion fields are vectors i>k
V'fe.T .V'fea.
Tpk = [>l>k,t
4>k,l]
whose components ipk,a,'4>k,(r, & = (fco,k) e K x B, a € {T,j}, are generators of an infinite dimensional Grassmann algebra over C That is, the fields anticommute with each other, (7)
(7)
(7)
(7)
We have deliberately chosen ip to be a row vector and ip to be a column vector so that
•
V'feV'p
ipk,-[ipPA V'fe.T^P.l .V'fe.iV'p.T V'fcj^pj.
In the argument fc = (fco,k), the last d components k are to be thought of as a crystal momentum and the first component k0 as the dual variable to an imaginary time. Hence the \ / - T in iko - E(k). Our ultraviolet cutoff restricts k to B. In the full model, k is replaced by (i/,k) with v summed over N and k integrated over Rd/T*. On the other hand, the ultraviolet cutoff does not restrictfcoat all. It still runs over R. So we could equally well express the model in terms of a Hamiltonian acting on a Fock space. We find the functional integral notation more efficient, so we use it. The relationship between the position space field ip(x), with x = (t,x) running over (imaginary) time x space, and the momentum space field tpk is really given, in our single band approximation, by
1>(x) = Jdke
:k0t~
Vo,k (x)lAfe-
We find it convenient to use a conventional Fourier transform, so we work in a "pseudo" space-time and instead define i>(x) Under suitable conditions on the real one.
I
dkellc-xil>k-
,,k(x), it is easy to go from the pseudo space-time ip(x) to
248
H O R S T KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
For a simple two-body fermion-fermion interaction, with no phonon interaction, V = - -
^2
J
dtd^dyu{-x.-y)i>a{t,-x.)^}a{t,y)\[)T{t,y)i}T{t,y).
T h e general spin independent form of t h e interaction is 1 V("0,^) = -
4
f ]T T,r6{T,i}
m
Y[dxiV(xi,X2,X3,Xi)ll>
dki5{ki
+k
2
- h - k4) il>kli>k3
Spin independence is imposed purely for notational convenience. I t plays no role. T h e function V(xi,X2,X3,X4), or equivalently (fci, A^l^lfe, ^4), can implement b o t h t h e fermionfermion and fermion-phonon interactions. Its precise value does not concern us. We just assume H y p o t h e s i s 1 . 1 . The interaction is weak and short range. That is, V is sufficiently near the origin in QJ, which is a Banach space of fairly short range, translation invariant functions V(xi,X2,X3,X4). See [11, Theorem 1.4] for %3's precise norm. For some results, we also assume t h a t V is "fc 0 -reversal real", V(Rxi, where R(xo,x)
Rx2, Rxs, Rxi) — V(xi,x2,x3,X4)
(4)
— (xo, —x), and " b a r / u n b a r exchange invariant", V(-X2,-Xi,-X4,-X3)
= V(XI,X2,X3,XA).
(5)
If V corresponds t o a two-body interaction w(xj — X3) with a real-valued Fourier transform, then V obeys (4) and (5). Our goal is t o prove t h a t perturbation expansions for various objects converge. These objects depend on b o t h -E(k) and V and are n o t smooth in V when -B(k) is held fixed. However, we can recover smoothness in V by a change of variables. To do so, we split E(k) = e(k) — Se(V, k ) into two parts and choose Se(V, k ) t o satisfy an implicit renormalization condition. This is called renormalization of t h e dispersion relation. Define t h e proper self energy E(p) for t h e action A by t h e equation
(^o-e(p)-E(p))
S(P-q)=
J
e
^
)
W
k
a
T h e counterterm Se(V, k) is chosen so t h a t E ( 0 , p ) vanishes on t h e e(p) = 0 } . We take e(k) and V, rather t h a n t h e more natural data. T h e counterterm 5e will be an o u t p u t of our main theorem. Banach space £. While t h e problem of inverting t h e m a p e i-> E well understood on a perturbative level [13], our estimates are not so nonperturbatively. Our main hypotheses are imposed on e ( k ) .
d
^
•
Fermi surface F = { p | E(k) and V, as input It will lie in a suitable = e — Se is reasonably yet good enough t o do
Construction of a 2-d cFermi Liquid
249
Hypothesis 1.2. The dispersion relation e(k) isia real-valued, sufficiently smooth, function. We further assume that (a) the Fermi curve F = {k e R 2 | e(k) = 0} \s a simple closed, connected, convex curve with nowhere vanishing curvature. (b) Ve(k) does not vanish on F. (c) For each q € R 2 , F and —F + q have low degree of tangency. (F is "strongly asymmetric".) Here —F + q = {-k + q | k € F}. Again, for the details, see [11, Hypothesis 1.12]. It is the strong asymmetry condition, Hypothesis 1.2.c, that makes this class of models somewhat unusual and permits the system to remain a Fermi liquid when the interaction is turned on. If A = 0 then, taking the complex conjugate of (1), we see that £„(—k) = e„(k) so that Hypothesis 1.2.c is violated for q = 0. Hence the presence of a nonzero vector potential A is essential. We shall say more about the role of strong asymmetry later. For now, we just mention one model that violates these hypotheses, not only for technical reasons but because it exhibits different physics. It is the Hubbard model at half filling, whose Fermi surface looks like
This Fermi curve is not smooth, violating Hypothesis 1.2.b, has zero curvature almost everywhere, violating Hypothesis 1.2.a, and is invariant under k —> —k so that F = —F, violating Hypothesis 1.2.c with q = 0. To give a rigorous definition of (1.2) one must introduce cutoffs and then take the limit in which the cutoffs are removed. To impose an infrared cutoff in the spatial directions one could put the system in a finite periodic box Rd/LT. To impose an ultraviolet cutoff in the spatial directions one may put the system on a lattice. By also imposing infrared and ultraviolet cutoffs in the temporal direction, we could arrange to start from a finite dimensional Grassmann algebra. We choose not to do so. Our goal is to prove that formal renormalized perturbation expansions converge. The coefficients in those expansions are well-defined even without a finite volume cutoff. So we choose to start with x running over all R 3 . We impose a (permanent) ultraviolet cutoff through a smooth compactly supported function U(k). This keeps k permanently bounded. We impose a (temporary) infrared cutoff through a function ve(kQ + e(k) 2 ) where f £ («) looks like
£
250
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
When e > 0 and v£{k,Q + e(k) 2 ) > 0, \ik0 — e(k)| is at least of order e. The coefficients of the perturbation expansion (either renormalized or not) of the cutoff Euclidean Green's functions G2n;e{x\,(T\,
• • • ,yn,Tn)
= ( J \ VV* (^i)V'n (Vi)
where SfW,$)ev«>>i»drc.M) (/)£
"
. , , ^ M
J^^li^MJ)
Cf(k)„e(fcg + e (k) 2 )
H r )
' ' ~ ifc0-e(k) + Je(k)
are well-defined. Our main result is Theorem 1.1. [11, Theorem 1.4] Assume that d = 2 and that e(k) ,/iii/iis Hypothesis 1.2. There is — a nontrivial open ball B C 93, centered on the origin, and — an analytic function V G B — i > 5e(V) € £, that vanishes /or V = 0, such that: — for any e > 0 and n G N, the formal Taylor series for the Green's functions Ginfi converges to an analytic function on B; — as £ —> 0, G2n\e converges uniformly, in x\,...,yn and V G B, to a translation invariant, spin independent, particle number conserving function Gin that is analytic in V. If, in addition, V is kg-reversal real, as in (4), then <$e(k; V) is real for all k. Theorem 1.2. [11, Theorem 1.5] Under the hypotheses of Theorem 1.1 and the assumption that V G B obeys the symmetries (4) and (5), the Fourier transform G2(fco,k) = j(teoAe'(-fe»I»+k'x)
G 3 ((0,0, T), (x0,x, T))
= J dx0 d"xe«-fco*o+k-*)
ik0 - e(k) - E(fc)
G 2 ( ( 0 ; 0 j i ) ; (a . 0) X j i } )
when U(k.) = 1
of the two-point function exists and is continuous, except on the Fermi surface (precisely, except when fco = 0 and e(k) = 0). The momentum distribution function n(k) = lim [ ^
eik°r
is continuous except on the Fermi surface F. IfkG
G2(k0,k)
F, then lim n(k) and lim n(k) exist e(k)>0
and obey lim n(k) -
k—+k e(k)<0
lim n(k) = 1 + 0{V) > \ .
k—*k e(k)>0
2
e(k)<0
Construction of a 2-d Fermi Liquid
251
Theorem 1.3. [11, Theorem 1.7] Let Gi{k1,k2,h,ki) (spin dropped from notation) he the Fourier transform of the four-point function and 4
G^{ki,k2,k3,ki)
= Gi(ki,k2,k3,k4)Y\ /=1
l > G2{ke)
its amputation by the physical propagator. Under the hypotheses of Theorem 1.2, Gf has a decomposition G±(ki,k2,k3,ki)
=
N{ki,k2,kz,ki) +
l . / ^ i + fe k3 + ki
2^1
2
'
2
1 T/k3
+ k2 ki+k4
'fc2 ~ ^V \
with — N continuous, — L(qi,q2,t) continuous except at t = 0, — lim L(qi,q2,t) continuous, to—>0
— lim L(qi,q2,t)
continuous.
Think of £ as a particle-hole ladder
2. Blocking t h e Cooper channel We now discuss further the role of the geometric conditions of Hypothesis 1.2 in blocking the Cooper channel. When you turn on the interaction V, the system itself effectively replaces V by more complicated "effective interaction". The (dominant) contribution
to the strength of the effective interaction between two particles of total momentum t Pi+P2=:qi + q-i is
stuff
dk-r ' [tfc0 - e(k)] [i(-ko +10) - e ( - k +1)] ' / Note that [ik0 - e(k)] = 0 <=» k0 = 0, e(k) = 0 4=^ k0 = 0, k € F, [i(-k0 + k) - e(-k + t)] = 0 *=> k0 = t0, e ( - k + t) = 0 <=> fc0 = t 0 , k G
t-F.
252
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
We can transform jf^z^^j locally to jfc0Lfcx by a simple change of variables. Thus jg~^y^ is locally integrable, but is not locally L2. So the strength of the effective interaction diverges when the total momentum t obeys to = 0 and F = t — F, because then the singular locus of ik -efkl coincides with the singular locus of i , k + f y_ e /_ k , t-,. This always happens when F = —F (for example, when F is a circle) and t = 0. Similarly the strength of the effective interaction diverges when F has a flat piece and t / 2 lies in that flat piece, as in the figure on the right below. t - F
F
On the other hand, when F is strongly asymmetric, F and t — F always intersect only at isolated points. A "worst" case is illustrated below. There the antipode, a(k), oik € F, is the unique point of F, different from k, such that the tangents to F at k and a(k) are parallel.
k + a(k) - F / * " * " Y k{
) a(k)
For strongly asymmetric Fermi curves, [iko - e(k)j [i(-ko +10) - e ( - k + t)] remains locally integrable in k for each fixed t and strength of the effective interaction remains bounded.
3. Power counting and nonperturbative bounds The proofs of Theorems 1.1, 1.2 and 1.3 are quite technical. The whole construction is given in a series of papers [10]. In the first paper [11] of the series, the main difficulties and our strategies to overcome them are described. Here, we concentrate on one aspect, namely the need to use both position space and momentum space arguments and the problems created by the interplay between them. The Green's functions G2n are constructed using a multiscale analysis and renormalization. The multiscale analysis is introduced by choosing a parameter M > 1 and decomposing momentum space into a family of shells, with the j t h shell consisting of those momenta k obeying \iko — e(k)| w 1/MJ'. Correspondingly, we write the covariance as a telescoping series C(k) = J27Lo C^(&) where, for j > 1,
253
C o n s t r u c t i o n of a 2-d Fermi Liquid
is the "covariance at scale j " . By construction C^\k) vanishes unless y/k% + e(k) 2 is of order M~J, and ||C (j) (fc)|| L ~ ss MK We consider, for each j , the effective interaction at scale j
Wj(0,0, V, VO = log i - 1 e^C+v(V.+C^+C) d/XCw_, (C) c-}) where the source term
Wi+1(4>, <£, >,$ = log i y e*J<+w>«<*'*+M+Sd»cU)
(C, 0 -
(6)
The recursion relation (6) is the renormalization group map. The main difficulties in controlling it already arise when (j) =
dx
i---dxndyi
•••dynw2n{xl,...,xn,
n>0
yi,...,yn)
V'(x 1 ).--'!/'(a;n)^(2/i)---V'(yn) dpi • • • dpn dqi---
dqn S(pi + ---+pn-qi
qn)
U>2n{Pl, • • • ,Pn, qi, • • -,qn)
$ P i • • • V»p„ 1pqi---
i>qn
with the position space kernels w2n(xi,... ,xn, t / i , . . . , y n ) translation invariant and antisymmetric in the x and y variables separately. Similarly, write W j+ i(O,O,V,'0)
=
Y1
dxi • • • dxndyx • • • dynw'2n(xi,...
n>0"
ip(xi)
• • dpn dqi---
• • • ip(xn)
,xn, ip(yi)
dqn 5{pi + ---+p„-qi
yi,...,yn) •••
ip(yn)
q„)
n>0J W2n(pi,
• • • , P n . 9 1 , • • • . 9n) $ p i • • • 4>pn VV ' ' " ^qn
Then w'2n can be written as a sum of values of connected directed Feynman graphs with vertices w2,W4,... and propagator C^\ See [14, Chapter 3]. By power counting we mean finding simple j-dependent bounds on appropriate norms of w2n such that all diagrams contributing to w'2n fulfil analogous bounds with j replaced by j + 1. Also, two and four-legged vertices should remain bounded as j —> oo. a The choice of an appropriate system of norms is an important issue in our construction. To treat all orders in perturbation theory, but without worrying about convergence of the series, the supremum norm in momentum space ||W27J||L°° seems to be the most convenient norm. a
T o achieve this, special effects like renormalization and the special geometry of the Fermi surface, mentioned above, are also used.
254
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
To illustrate power counting, observe that every diagram can be built by successively applying one of the two following procedures. (i) Contraction, that is connecting two disjoint vertices
it
_i+
r c = _»_
0 For example, let
If, in the contraction described above, ipi has 2r legs and tp2 has 2s legs, then Tc has 2(r + s — 1) legs and r c ( p i , • • • ,pr+*-l,<7l> • • • ,Qr+a-l) = ipi(p1,...,pr,qi,.. with k =pi -)
\-pr-qi
.,qr-i,k)
C w (fc)
qr-\ = qr-\
||fc||L« < c | | ^ l | | L o . | | 0 2 | | L ~
\~Qr+s-i ~Pr+i
(7)
Pr+s-i- Therefore
W i t h c = \\C^{k)\\Loo
&Mj.
If, in the tadpole formation described above,
,pn-i,k,
qi,...,
qn-i,k)
C(j)(k).
Therefore
HftlU- < &II0IU- with b= ||C ( %)|| L i « M~j, since C^\k) is supported in a region having volume of order M -2 - 7 so that ||C^(fc)|| L i ss M-^||C^)(fc)|| L oc wAf--'. In general, if one has a system of norms || • || for vertices having arbitrarily many legs, we call c a contraction bound with respect to those norms if, in the contraction described above,
l|rc||
Construction of a 2-d Fermi Liquid
255
Similarly, we call b a tadpole bound if, in the tadpole formation described above,
l|rt||<*IMIStandard power counting can be phrased in this language as follows. If one assumes that W = 0 ( ^ i )
for all n,
(8)
then every graph contributing to w'2n is again of order JJ^=T • For example, if such a graph r has two vertices, W2nx and u>2n2, then there are r = n\ + n2 — n connecting lines and the norm of V is bounded by
c&-+—"-Willie || =
0(d^J^—'Ji_1Jt_1)
A general graph may be bounded by building it up one vertex at a time. In the case of the supremum norm in momentum space, c « M? and b « M~j so that c6n1_i « M^n~2\ Problems with the convergence of the perturbation expansion can arise when one builds diagrams from one vertex by forming a large number n of tadpoles.
There are n! choices for connecting the outgoing arrows to the incoming arrows. For each choice one gets a diagram whose norm can be bounded by 6n||
kn) = / dxi • • • dxn eikl-Xl • • • eik"-Xnf(Xl,. = = —
dx1---dxneiklXl---eiknXn:^ dxi---dxnf{xi,...,xn)
1 r = —- / axi • • • dxn jlxi,..., n\ J
^
..,xn) sgmr f(xn(l),...
] T s g n o - e ^ 1 ^ 1) xn) JJ[ki,...
kn\ x\,...,
,xAn))
. . . g*" : n' a 'o , (n)
xn),
256
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
where D(ki, ...kn; xu. ..,xn)
= det [e**'" 1 ']. J=h
n
.
The Euclidean length of each column of this matrix is y/n. Therefore, by Hadamard's estimate, ||-D||Loo < (%/"•)" s o that H/IIL-
< ^ll/IUi < ^ l l / l l n •
This suggests that the L1 norm in position space — or rather, to break translation invariance, ||w2n||i,oo = max / dx2- --dxn \w2n(xi,. • • ,x2„ xx J — should be used as the principal norm for estimating the effective interaction. Whenever one encounters a situation in which one forms all possible tadpoles between n incoming and n outgoing legs of a kernel
.,yT+3_i)
dudv!
C{3){u - u') ip2(u',xr+i,...,
where C^(x) is the inverse Fourier transform of C^\k).
yr+s-i),
It follows that
||r c ||l,oo < | | £ 0 ) ( a O | | L i llVllkoo |l>2||l,oo-
A naive computation, given in the next paragraph, gives a bound on ||(?(J')(a;)||£i that is of order M2K A more refined argument, sketched in [11, §11.7], gives a more realistic bound of order M3j/2. In any event, M3-7/2 > ||C^^(A;)[|L~ and naive power counting in position space does not coincide with power counting in momentum space. It is easily seen that \\d^(x)\\Loo is a tadpole bound for the || • || li00 norm. Clearly, \\C^(x)\\Loo < ||C^(A:)|| L i, so that we again have a tadpole bound of order 1/M J . Substituting c = 0(M3j/2) and n b = 0(1/Mi) into (8) yields the requirement that ||tU2n||i,oo be order M^ ~^. In particular the norm of the four point function would have to decrease like l / V A P as j increased. This is absurd, since the original interaction V is, at each scale, the dominant part of the four point function.
Construction of a 2-d Fermi Liquid
257
We sketch the standard calculation that gives the naive bound on ||C^)(a;)|| L i. For a multi index 5 = (60,81,62) of non negative integers, write |<5| = 60 + 61 + 82 and x6 = x^x^x5^. Then, integrating by parts \5\ times,
(w)'^™*7m\^*ir°(w)' has volume of order 1/M2-? and \\^C^(k)\\L~>
since the support of §^C^(k) AP'. Therefore
1+ f—V
is of order
0
1+
(wwnw)*)*"™- ^)-
Dividing by n„=o 1 2^ + (Xv/Mj)2) and integrating over R 3 gives the bound \\C^(x)||Li = 0(M2i). To overcome the mismatch between position and momentum space we use, as in [12], a hybrid of the supremum norm in momentum space and the L 1 norm in position space: We cover the support of C^\k) by a union of "sectors" that are short enough to not feel the curvature of the Fermi curve.
One Sector Then we write u) 2 „(fci,..., k2n) as a sum of terms for which each variable hi is supported in a single sector and apply the position space L1 norm to each term. In this way one constructs a norm which allows one to implement the Pauli exclusion principle and has good power counting properties in two space dimensions. For details, and for the obstacle hindering the extension of this procedure to three space dimension, see [11, §11.8]. It is an intriguing question whether it is possible to implement the Pauli exclusion principle by an argument which only uses momentum space variables. The would almost surely dramatically simplify the proof of Theorems 1.1-1.3.
4. A model problem In this section, we formulate an elementary question about permutations that may be connected with implementing the Pauli exclusion principle in momentum space. Let pi,...,pn,si,... ,sn be real numbers. For a permutation ir € Sn, set e(n;p,s)
=
Sgn7T
if
p„{1)
< Si, p,r(i) +p7v(2)
0
otherwise.
< S2, •••,
Pv(l)
H
*rPTx(n)<Sn,
Question 4.1. Is there a constant K such that, for all natural numbers n and all p = (Pi,---,Pn),
s = (si,...,sn)
in M.n,
258
HORST KNORRER, JOEL FELDMAN, EUGENE TRUBOWITZ
The following slight variation of Question 4.1 is directly related to cancellations between Fermionic diagrams. Question 4.2. For all natural numbers n and all p = (pi,... ,pn), q = (<7i, • • •, qn) and s = (si,... ,sn) in R" and all pairs of permutations TT,TT' £ Sn, set
s(-K,n';p,q,s)
sgnTTSgnTr' = <
if p 7r(1) + • • • + p 7 r W - ^-(i)
•<3V(<) ^ s «
for all £ = 1 , . . . , n — 1,
0
otherwise.
Is there a constant C such that, for all n and all p, q, s, ]T
e(7r,ir';p,q,s)
?
7T,7r'6S„
Observe that pv{l) + I-!?„(„) - <2V'(i) ?„/(„) = p\-\ +pn -qi qn < sn either for every pair n, ir' of permutations or for no such pair. To illustrate how Question 4.2 is connected with cancellations between fermionic diagrams, let C(k) be a covariance that depends on one real variable k and is Schwartz class. We consider a kernel that is obtained from n four-legged vertices by iterated contraction. Each vertex has the kernel S(pi +P2 — qi — qi) in momentum space. Pi ,.
<
T
P'Z <
ft
T<
Pn #<#
^\
<
I
The (amputated) value of this diagram is a momentum conserving delta function times
(9)
-qi)C(k+pi
+p2-qi
- 92) • • -C(k +px H
+pn-i
- Qi - •
qn-i)-
Denote by S(k) the sum (with appropriate fermionic signs) of all diagrams obtained from ip by forming tadpoles from all legs pt to all legs qj. That is S k
()
=
Yl
SgI17r
/
dpi
' ' ' dPn
• • • ,P,r(n)) C(pi)
•••
C(pn).
There are n! terms in the sum. Applying the tadpole estimate term by term gives \\S(k)h~ Equation (9) immediately
^nlWCWWhWtpUoo.
(10)
< ^ ( ^ H J o o 1 and we get the "perturbative estimate" l|5(fc)||L«
The Fourier transform of Equation (9) also shows that, in position space, IMkoo^llc-WII^1.
(11)
Construction of a 2-d Fermi Liquid
259
Let ^
=
Sgn71 E " 7r,7r'€5n
^
S
g n7r '<£(fc>P7r(l), • • • ,Pn(n),Qn'(l),
• • -,Qn'(n),k
+ Epi -
Eft)
be the antisymmetrization of ant||L~ < — T — ||p a n t ||l,oo < — n — I M k o o (12) H1.00 < — ^ n — II? ant Combining (10), with (p replaced by ip , (12) and (11) gives the "nonperturbative estimate" ||5(fc)||Lcc < const 2 " \\C(k)\\h WC^WzJ1-
(13)
However, this argument uses position as well as momentum space. If the answer to Question 4.2 were positive, one could get an estimate like (13) without passing to position space. Recall that 1
^
n—1
= ^12
H
S
Sn7r S§n7r' I I C(fc
7r,7r'eSn
For each r = p„{1) +
+PT(1)
+ • • • +P*(e) -
9TT'(1)
Q*'W)-
t=l
h pn{e) - gw,(1)
C(k + r)= fdt5{k
qn,{€),
+ r-t)C(t)=
f dt6(t - k
-r)C'(t),
where 9 is the Heaviside step function. Consequently, ^ant
where n-l
0 ( t i , . . . ,f„_i; fc,p,g) = ^
sgn?r s g n V f J 0(**-fc-(p,r(i)+- • •+p7r(«)-g7r-(i)
?*'(
If the answer to Question 4.2 were positive, it would follow, setting si = ti — k, that ||6|U-
It would then follow from (14) that
w^w^^^wcmiT1.
as)
Again, combining (10), with tp replaced by <pant, (15) and (11) would give the "nonperturbative estimate"
Iis-Wlli-
260
H O R S T KNORRER, J O E L FELDMAN, E U G E N E TRUBOWITZ
Acknowledgments This research of J. F . was supported in part by the Natural Sciences and Engineering Research Council of C a n a d a and the Forschungsinstitut fur Mathematik, E T H Zurich. H. K. thanks the Institute for Advanced Study, Princeton for its hospitality.
References 1. A. A. Abrikosov, L. P. Gorkov, I. E. Dzyaloshinski, Methods of Quantum Field Theory in Statistical Physics, Dover Publications, 1963. 2. P. W. Anderson, "'Luttinger-liquid' behavior of the normal metallic state of the 2D Hubbard model", Phys. Rev. Lett. 64, 1839-1841 (1990). 3. P. W. Anderson, "Singular forward scattering in the 2D Hubbard model and a renormalized Bethe Ansatz ground state", Phys. Rev. Lett. 65, 2306-2308 (1990). 4. N. W. Ashcroft, N. D. Mermin, Solid State Physics, Saunders College, 1976. 5. G. Benfatto, G. Gallavotti, Renormalization Group, Physics Notes, Vol. 1, Princeton University Press, 1995. 6. M. Disertori, J. Magnen, V. Rivasseau, "Interacting Fermi liquid in three dimensions at finite temperature: Part I: Convergent Contributions", Ann. Henri Poincare 2 733-806 (2001). 7. M. Disertori, V. Rivasseau, "Interacting Fermi liquid in two dimensions at finite temperature. Part I: Convergent Attributions", Comm. Math. Phys. 215, 251-290 (2000). 8. M. Disertori, V. Rivasseau, "Interacting Fermi liquid in two dimensions at finite temperature. Part II: Renormalization", Comm. Math. Phys. 215, 291-341 (2000). 9. H. Dym, H. P. McKean, Fourier Series and Integrals, Academic Press, 1972. 10. J. Feldman, H. Knorrer, E. Trubowitz, "A two dimensional Fermi liquid", Parts 1-3; "Single scale analysis of many fermion systems", Parts 1-4; "Convergence of perturbation expansions in fermionic models", Parts 1, 2; "Particle-Hole Ladders"; arXiv:math-ph/0209040 to arXiv: math-ph/0209049, to appear in Comm. Math. Phys. and Rev. Math. Phys. 11. J. Feldman, H. Knorrer, E. Trubowitz, "A two dimensional Fermi liquid, Part 1: Overview", arXiv:math-ph/0209047, to appear in Comm. Math. Phys. 12. J. Feldman, J. Magnen, V. Rivasseau, E. Trubowitz, "An infinite volume expansion for many fermion Green's functions", Helv. Phys. Acta 65, 679-721 (1992). 13. J. Feldman, M. Salmhofer, E. Trubowitz, "An inversion theorem in Fermi surface theory", Communications on Pure and Applied Mathematics LIII, 1350-1384 (2000). 14. A. L. Fetter, J. D. Walecka, Quantum Theory of Many-Particle Systems, McGraw-Hill, 1971. 15. W. Kohn, J. M. Luttinger, "New mechanism for superconductivity", Physical Review Letters 15, 524-526 (1965). 16. L. D. Landau, "The theory of a Fermi liquid", Sov. Phys. JETP 3, 920 (1956). 17. L. D. Landau, "Oscillations in a Fermi liquid", Sov. Phys. JETP 5, 101 (1957). 18. L. D. Landau, "On the theory of the Fermi liquid", Sov. Phys. JETP 8, 70 (1959). 19. J. M. Luttinger, "New mechanism for superconductivity", Physical Review 150, 202-214 (1966). 20. J. Magnen, V. Rivasseau, "A single scale infinite volume expansion for three-dimensional many fermion Green's functions", Mathematical Physics Electronic Journal 1, no. 3 (1995). 21. W. Metzner, C. Castellani, C. Di Castro, "Fermi systems with strong forward scattering", Adv. Phys. 47, 317-445 (1998). 22. P. Nozieres, Theory of Interacting Fermi Systems, Benjamin, 1964. 23. W. Pedra, M. Salmhofer, "Fermi systems in two dimensions and Fermi surface flows", in Proceedings of the 14th International Congress on Mathematical Physics, Lisbon, 2003, JeanClaude Zambrini (ed.), World Scientific, 2005. 24. M. Salmhofer, "Continuous renormalization for fermions and Fermi liquid theory", Comm. Math. Phys. 194, 249-295 (1998).
The restriction property for conformally covariant measures GREGORY
F. LAWLER (Cornell)
Theoretical physicists have used conformal invariance to predict exact values of critical exponents for two-dimensional lattice systems at criticality. This paper discusses recent work with Oded Schramm and Wendelin Werner that rigorously classifies the collection of conformally invariant (or covariant) limits. There are two important one-parameter families, the Schramm-Loewner evolution with parameter K (SLEK) and the restriction families with exponent a. These two families agree at only one value (K = 8/3, a = 5/8), but we describe a close relation between these families when a > 5/8 and K, = na = 6/(2a + 1).
1. Introduction Many planar lattice models in statistical physics at criticality are expected to have continuum limits that are conformally invariant or conformally covariant. This belief allowed for a number of critical exponents for such models to be predicted nonrigorously. Although the arguments were phrased in terms of conformal field theory, in essence they boil down to the fact that there are only a limited number of possibilities for such limits. In the last few years there has been much progress in the rigorous understanding of the possible conformally invariant limits. This work has focused on finding measures on curves or "hulls". There are two different, but related, assumptions one can make on families of measures that are conformally invariant or covariant — the Markovian assumption (which leads to the Schramm-Loewner evolution) and the restriction property (which leads to restriction measures/families). The latter property and its relation to the former property, i.e., its relation to SLE processes, has been studied in detail in [16]; see also [19]. This paper summarizes the major results of those papers.
2. Schramm-Loewner evolution (SLE) Suppose i/£>(z,w) is a collection of probability measures, indexed by all simply connected proper subdomains D of C and distinct points z,w 6 dD, on curves 7 : [0, t 7 ] —> D with 7(0) = z,7(t 7 ) = w. We will consider two curves to be the same if one can be obtained from the other by an increasing reparametrization. Suppose that this collection is conformally invariant, i.e., if / : D —> D' is a conformal transformation, then
fouD(z,w)
=
uD,(f(z),f{w)).
If this is true, then we need only specify VD{Z,W) for one choice of D,z,w, say when D = H — {x + iy : y > 0}, z — 0, w = 00. The Schramm-Loewner evolution (SLE) is obtained by imposing another condition which is sometimes referred to as a the "Markovian" assumption. It is the assumption of independent, identically distributed increments in the space of conformal maps. Let v =
261
262
GREGORY F. LAWLER
Z/H(0, 00)
be a probability measure on paths (modulo reparametrization) 7 : [0,00) —> H with 7(0) = 0,7(4) —» 00, £ —> 00. For the moment, let us consider the case when the measure is supported on simple (non-self-intersecting) paths 7 with 7(0,00) C H. Suppose we have viewed the path 7[0, t] and we wish to determine the distribution of "/[t, 00) given 7[0,£]. Let Ht = H \ 7(0, £] and let gt : Ht —> H be a conformal transformation with 9t(l(t)) = 0,<7t(°°) = 00. Then the Markovian assumption is: given 7[0,£], the measure on 5t(7[*,oo)) is v. Conformal invariance and the Markovian assumption restrict the measure v to a twoparameter family of measures. Let gt : Ht —» H be the unique conformal transformation with gt(z) = z + o(l) as z —> 00. Then gt has an expansion at infinity,
a
gt{z)=z+
M+o[^y
a)
for a continuous, increasing function t \-* alt). By reparametrizing 7 if necessary we may assume that a(t) = 2t. Then the assumptions imply that gtlz), as a function of t, satisfies the initial value problem 2 9t{z) = -T-,—jr, 9o(z) = z, (2) gtlz) - Ut where Ut = fJ-t + \fHBt and Bt is a standard one-dimensional Brownian motion. If we make the additional assumption that the distribution of v is symmetric about the imaginary axis, then [x = 0, and we are left with the one-parameter family called (chordal) SLEK. While it is not difficult to show that, given Bt, (2) can be solved to give the maps gt and the corresponding domains Ht, it is more difficult to show, but true, that for each K > 0, this gives a measure on curves 7 [14,20]. However, the curves 7 are simple only for K < 4. In this paper, we will only need to consider K < 8/3 for which the paths are simple. It is known [4,20] that for these K, the Hausdorff dimension of the paths is 1 + («/8) (actually this is known for all K € [0,4) U (4,8]).
3. Restriction measures If D is a simply connected proper subdomain of C and z,w are distinct boundary points, then a hull connecting z and w in D is a closed set K with z, w £ K; K\ {z, w} C D; and such that D\K has exactly two connected components. If 7 : [0, t 7 ] —> D is a curve with 7(0) = z, 7(£ 7 ) — w, 7(0, £7) C D, then the hull generated by 7 is the union of 7 with all the connected components of D \ 7 whose boundary does not intersect dD \{z,w}. Suppose HDIZ,W) is a collection of measures (not necessarily probability measures) indexed by D,z,w, supported on hulls connecting z and w in D. If z,w are nice boundary points (say, dD is analytic in neighborhoods of z,w), we assume that 0 < \nolz,w)\ < 00, where | • | denotes total mass. Let / ^ ( z , w) be the corresponding probability measure so that MD(Z,W) = \VDIZ,W)\
We say that {nolz,w)}
n%lz,w).
is a restriction family if it satisfies:
• conformal covariance: if / : D —• D' is a conformal transformation, then
fon#lz,w)=nUflz),flw));
The restriction property for conformally covariant measures
263
• restriction property: if D' C D and dD,dD' agree in neighborhoods of z,w, then fiD'(z,w) is IJ,D(Z,W) restricted to hulls in D'. We call a probability measure ju# = /jg(0, oo) a (half-plane) restriction measure if there is a restriction family /J,D(Z,W) such that if / : M —> D is a conformal transformation with /(0) = z, /(oo) = w, then /z5(z,to) = / o / i # . This family is unique up to a multiplicative constant; we can specify it uniquely by requiring |//D(—1,1)| = 1. Here D denotes the unit disk. There is a one-to-one correspondence between half-plane restriction measures and restriction families with |/io(—1,1)| = 1. A probability measure P = /x# on hulls K connecting 0 and oo is specified by giving the probability P ( V A ) for all events of the form VA = {K : K(~\M
264
GREGORY F. LAWLER
• If P a ,P& are given with realizations Ka,Kb, then P a + 6 = P „ x P j where (KajK/,) is identified with the hull generated by Ka U Kb- In particular, the hull generated by the union of k independent Brownian excursions gives Pfc. For this reason, we think of P„ as the measure induced by "a Brownian excursions". • To show that P5/8 is given by chordal SLE8/3 it suffices to show that the probability that a chordal SLES/3 path avoids A is $^(0) 5 / 8 . Once we have this, we see that that distribution of the hull generated by eight independent SLE8/3 paths is the same as that of the hull generated by five Brownian excursions.
4. Brownian loop soup A loop is a continuous map 7 : [0,t 7 ] —> C with 7(0) — 7(i 7 ). Any loop generates a hull by "filling in". The Brownian loop measure is a measure on hulls a that is (a constant multiple times) the scaling limit of the following measure on simple random walks, see [17]. • For every lattice spacing 5, consider the set of all nearest neighbor paths w — [WQ,W\, ..., u>2n] on the lattice 51? with wo = U2n• Consider two paths to be equivalent if they traverse the same points in the same direction, i.e., w' ~ w if u' = [WJ,UJ+I, ..., W2n ) u'i,... ,UJ-I,WJ]. • Give each such equivalence class of loops weight ( l / 4 ) 2 n . Call this measure /4°°P• For each 0 < a\ < 02 < 00, let /j}oop(ai,a2) be (a fixed multiplicative constant times) the limit as 5 —» 0+ of Hgop restricted to loops whose diameter is between ai and a^. Consider this as a measure on hulls whose diameter is between a\ and a-i• Let ._ ^ M '°°P( a , i/a), M ioo P = M ioo P(0j ^ The measure /Joop can also be defined directly using Brownian bridge measures [19]. If D is a domain, we define /j,p°p to be fj}oop restricted to loops (or, equivalently, hulls) that stay in D. The family of measures / i ^ o p satisfies the restriction property by construction. This is an infinite measure since small loops get high measure; however, if D is bounded and a > 0, the fip°p measure of the set of hulls of diameter at least a is finite. Not so obviously, these measures are also conformally invariant. P r o p o s i t i o n 4 . 1 . [19] If f : D —> D' is a conformal transformation,
then
The following converse is also true: any family of measures //£> on hulls in D that is conformally invariant and satisfies the restriction property must be a multiple of the Brownian loop measure. The Brownian loop soup with intensity A is a realization of a Poissonian point process with intensity measure Xp}oop. More specifically, it is a collection of random variables N(V) indexed by subsets V of C, the space of hulls, with the properties: • If VnV a
= 0, then N(V) and N(V)
are independent with JV(VW') =
N(V)+N(V).
Although we are only considering this measure as a measure on hulls, one can consider this measure as a measure on loops with parametrization, see [19].
The restriction property for conformally covariant measures • If fi]oop(V)
< oo, then N(V) has a Poisson distribution with mean
265 Xfj}oop(V).
The random variable N(V) denotes the number of hulls in the realization in the set V. A realization of the Brownian loop soup is a countable collection £ of hulls. If D is a domain, then we can write £ = Co U CD± where Co denotes the set of hulls in £ that lie in D and £ D J_ = £ \ Co- For each D, the realizations Co and CD± are independent. Proposition 4.1 can be rephrased as follows: if f : D —> D' is a conformal transformation and £ is a realization of the Brownian loop soup with intensity A, then {/ o 7 : 7 e Co} is a realization of the Brownian loop soup in D' (with intensity A).
5. Constructing restriction measures For any a > 0, let *
a =
6 2^Tl'
. ,B _ , 2a(8a-5) Aa = a ( 8 - 3 « 0 ) = - ^ T r .
(4)
Note that a — t > na is a decreasing function and a 1—> Aa is an increasing function. Also, K5/8 = 8/3,A 5 / 8 = 0. For each a > 5/8, consider the following procedure to produce a random hull K connecting 0 and 00 in H: • Let 7 be a chordal SLEKa path connecting 0 and 00. Since Ka < 8/3, this is a simple path with 7(0,00) C H. • Let £ be a realization of the Brownian loop soup with intensity Aa independent of 7 (if a = 5/8, then £ contains no loops). Let £ 7 be the set of loops in £ that stay in H and intersect 7(0, 00). • Let K be the hull generated by 7[0,oo)U
(J L. Lee-,
Then it can be shown [16] that for any A as before, P{KnA
= (D} = $'A(0)a.
(5)
In other words, K has the distribution of P a . It follows immediately from the construction that K is supported on simple curves for a = 5/8 and is not supported on simple curves for a > 5/8. The proof of (5) focuses on a particular martingale derived from the chordal SLE path. Suppose 7 and £ are given, and let Ht = H \ 7[0,t]. Let £ 7 ( s , t ) = £ # ± \ CH± be the set of loops in £JH that intersect j(s,t] but do not intersect 7(0, s]. Let Ks,t be the (compact) hull generated by j[s,t]U ( J L, Lecy(s,t)
and let Kt = Kott. Let Tt denote the cr-algebra generated by j(s), 0 < s < t. Let Mt = P[K n A =
266
GREGORY F. LAWLER
If gt is the map as in (1) and gt{z) = gt(z) —gt(l(i)), soup and the Markovian property for SLE imply
then conformal invariance of the loop
Mt = 1{7[0, t ] n i = i } l { ^ I e £ 7 (0, t) with L n A / 0} P{K n gt(A) = 0}. Using stochastic calculus, it can be shown that Mt = l{-y[0,t}nA
= 0} 1{^L e Cy(0,t) with L n i ^ 0}$§ t(j4) (O) a
is a martingale, provided that we choose parameters Ka, Xa as above. Also, it can be shown that Moo = Moo = l{7[O,oo)n,4 = 0} l { ^ L e £ 7 ( 0 , o o ) with L n A ^ 0 } . We then conclude that Mt = M t , and, in particular, P{K D A = 0} = M 0 = M 0 = $' A (0) a . The hulls of the restriction P a can be built from a skeleton SLEKa curve with Brownian loops added. It may seem strange at first that the skeleton curve gets "thinner" as a increases, but this has a natural interpretation in terms of conjectures for Laplacian random walks. The fact that the hulls should get "fatter" as a increases (since, for example, the Pa+b hulls can be obtained from unions of independent P a and P& hulls) is reflected in the fact that A0 increases. For a = 1 (« a = 2), this construction differs from the construction using Brownian excursions. In fact, this shows that the hull of a Brownian excursion can be obtained by starting with an SLE2 path and adding Brownian loops. This is the "inverse" of the construction of loop-erased random walk, where one starts with a random walk and erases loops. In [14] it is proved that the scaling limit of loop-erased random walk is SLE2- The argument in that paper is self-contained using a discrete martingale somewhat similar to the martingale above. The construction shows that the Hausdorff dimension of the boundaries of the clusters K is the same as the Hausdorff dimension of planar Brownian motion, which is known to be 4/3 (see [12,13]).
6. Right-side restriction If K is a hull in H connecting 0 and 00, let K* denote K with the "left-side" filled in, i.e., K* = K U DK,I where DK,I is the connected component of HI \ K whose boundary contains (—oo,0]. The hull is determined by the "right-side" boundary, which we denote by 8RK. The restriction measure P a induces a measure P + , which can be considered as a measure on hulls "filled in on the left" (K H-> K*) or on curves from 0 to 00 (K i-> ORK). We will say that a probability measure P + on left-filled hulls or on curves is a right-side restriction measure if (3) holds if all such A, A' such that A' n (-00,0) = 0 . Here, VA denotes the event { F n i c i \ i } = {dRK n l c i \ 4 UK* is generated by K and A n (-00,0) = 0, then this is the same as {K DM C M\A}. As in the case of restriction measures, it can be shown that a measure is a right-side restriction measure if and only if there is an a > 0 such that
pt(yA) = ^(or, for all "right-side" compact hulls A. For a > 5/8, we construct P + directly from P a ; however, right-side restriction measures exist for all a > 0. This is proved by giving two
The restriction property for conformally covariant measures
267
different constructions of the hull. We first note that the "addition" rule works for right-side restriction — if K* and K£ are independent left-filled hulls with distributions P+,PjJ", then the left-filled hull generated by K* U K£ has distribution P „ + 6 . The first method will construct P + for all a G (0,1). The addition rule allows us then to construct P + for other values of a (of course, we already have a construction for a > 5/8). Let Zt denote a "reflected Brownian excursion" which, roughly speaking, is a Brownian motion starting at 0 conditioned to go to infinity without leaving H U (—oo, 0], and which is reflected at angle 6 off (-oo,0]. This can be obtained by first defining a process in the wedge W(6) = {rei0' : 0 < 9' < TT - 9} that acts like a Brownian excursion in the second component and a Brownian motion reflected in the first component (i.e., horizontally) off dW{9). If we map W{9) to M by z— i » zn'^~6', then the image of this process is the reflected Brownian excursion. It can be checked that if A is a right-side compact hull such that M \ A is simply connected, then P{Z[0, oo) n A = 0} =
&A(0)a,
where a — 1 - (O/n). In other words, if K* is the hull generated by left-filling the hull generated by Z[0,oo), then K* has the distribution P + . Since P+ is unique for each a, this gives some surprising equivalences of distributions. For example, the right boundary of the hull generated by a reflected Brownian excursion at angle 6 = 37r/8 has the same distribution as SLEs/3. The second method to construct P + focuses on the right boundary. The right boundary of a Brownian path does not have double points (this is not obvious but can be verified by considering an appropriate intersection exponent as in [5]). For this reason, one might hope to give a description of the simple curve rj that gives the right boundary under P + . Suppose that T] is parametrized by capacity and let gt : H \ 77(0, t] —> - H be the conformal transformation as in section 2. Let Wt = gt(v(t)) a n d let Ot be the image of the "left-hand" side of the origin under gt. For t > 0, the Loewner theory tells us that dOt/dt = 2/(Ot — Wt)Assume that Wt satisfies the stochastic differential equation
dWt = - ^ p - + yfcdBu
(6)
Wt — Ut
where Bt is a standard Brownian motion. Theorem 6.1. [16] If p > — 2 and Wt satisfies (6), then the distribution o/r/[0,00) is that of the right hand boundary under P+ where (3p+10)(2 + p) fl = 32 ' This theorem can be used to show that P a does not exist for a < 5/8. If it were to exist, P a would be symmetric about the imaginary axis and hence the probability that i lies in the domain to the right of the hull would be at most 1/2. However, (6) can be used to show that this probability under P + is strictly increasing in a. Since a = 5/8 corresponds to a distribution on simple curves that is symmetric about the imaginary axis and these curves do not go through i (with probability one), the probability is 1/2 for a = 5/8. Hence, it is strictly less than 1/2 for a < 5/8. Therefore P + cannot be coming from any P a for a < 1/2.
268
GREGORY F. LAWLER
7. Relations to conformal field theory The relationship restriction hull = SLEK U Brownian loops has given a new way to understand the idea of "central charge" in conformal field theory. In some sense SLE and restriction hulls are to these conformal fields what Brownian motion is to harmonic functions — a measure on paths/clusters from which one can define the functions/fields but which inherently provide more information than just the functions/fields. Duplantier [7] was the first to predict that SLEK paths should correspond to conformal fields with central charge
This relationship is now being understood rigorously, [9,10]; see also [2,3] as well as [8] and references therein, for nonrigorous treatments. Note if a > 0, then A0 = — cKa, where na, Xa are as defined in (4). Suppose a > 0, and K* denotes the left-filled hull distributed according to P + . If 0 < x\,X2,. •. ,xn < oo, define B^{xi,x2,
lim e- 2 n P{K* n [xuXl
...,xn)=
+ V2d] ^ 0 ; . . . ; K* n [xn,xn + V2d] ^ 0}.
e—>0+
These can be computed recursively by BQ ^n+l\xi
x
1i
=
—
•••i
o "m
= 1 and
x
n)
\x^i
• • ' ' xn)
~
(Xj — x)2
/ .
z
X
B^(Xl,...,xn)
. . J= l
^B%\x1,...,xn)+J2xN~2*-NB^(x1,...,xn),
= X
N>\
where $>N denotes the operator n
i =l
Note that
{^N}
satisfy the commutation relation [*^,
*M]
= (N — M)
$N+M-
8. Lattice models In [15] it is noted that if self-avoiding walks have a conformally covariant limit then the limit measure should be a restriction measure supported on simple curves. It has now been proved that the only such measure is that given by SLES/^. Critical exponents for SAW can be interpreted in terms of scaling exponents for the limiting measure, and these scaling exponents can be determined exactly (and rigorously) for SLEs/3. This not only gives another heuristic definition of the critical exponents for planar SAW, it also shows what the limiting distribution should be. Monte Carlo simulations [11] support the conjecture that SLEs/3 is the limiting distribution for SAW.
The restriction property for conformally covariant measures
269
There is a one-parameter family of processes called Laplacian random walks that might correspond to the one parameter family of continuous processes SLEK. A simple random walk excursion in the discrete upper half-plane is a random walk conditioned not to return to the real axis. Given that the process is at j + ik the probabilities of jumps of size i, —i, 1, —1 are (fc + l)/(4fc), (fc- l)/(4fc), l/(4fc), l/(4fc), respectively. Given any finite A C I? let pA(z) be the probability starting at z that an excursion never enters A. The (half-plane) Laplacian random walk with exponent a (or Laplacian-a walk for short) is the process Sj such that P{Sn+1
= wn+1 | [S0l...,S„]
= [wo,...,wn]}
-
PA
>"+l)a
E|„-UB|=IPAB(I/)°'
where An = {UQ, . . . , W „ } . It is straightforward to show that the loop-erased excursion is the Laplacian-1 walk. Conjecture: The scaling limit of the half-plane Laplacian-a walk is chordal SLEKa where na is as in (4). • Roughly speaking, the Laplacian-a has transition probabilities that are proportional to the probability that a excursions all avoid the past. This is reminiscent of the P a measure which can be considered the measure of a Brownian excursions. Prom this perspective it is natural that the Laplacian-a walk should get "thinner" as a increases. • The case a — 1,K 0 = 2 is known rigorously since the Laplacian-1 walk is the looperased walk (this has been proved for the radial case, but the chordal case should work similarly). • The conjecture states that the half-plane self-avoiding walk should have the same limit as the Laplacian-(5/8) walk. This is the unique value of a such that (at least in the limit) the process determining the transition probabilities (a excursions) is the same as the Laplacian-a walk. This idea is reflected in the idea of the infinite self-avoiding walk. • In the case a = 0, Ka = 6, this relationship can be seen on the honeycomb lattice by considering the transition probability of the "percolation exploration process".
Acknowledgments This paper focuses on joint work with Oded Schramm and Wendelin Werner. I would like to thank Michael Kozdron for his comments on an earlier draft of this paper. The author was partially supported by the National Science Foundation.
References 1. A. Belavin, A. M. Polyakov, A. B. Zamolodchikov, "Infinite conformal symmetry in twodimensional quantum field theory", Nuclear Phys. B 241, 333-380 (1984). 2. M. Bauer, D. Bernard, "SLEK growth and conformal field theories", Phys. Lett. B 543, 135-138 (2002). 3. M. Bauer, D. Bernard, "Conformal field theories of stochastic Loewner evolutions", preprint (2002). 4. V. Beffara, "The dimension of the SLE curves", preprint (2003).
270
GREGORY F. LAWLER
5. K. Burdzy, G. Lawler, "Non-intersection exponents for random walk and Brownian motion. Part II: Estimates and applications to a random fractal", Annals of Probability 18, 981-1009 (1990). 6. J. L. Cardy, "Conformal invariance and surface critical behavior", Nucl. Phys. B 240 (FS12), 514-532 (1984). 7. B. Duplantier, "Higher conformal multifractality", J. Stat. Phys. 110, 691-738 (2003). 8. B. Duplantier (2003), "Conformal fractal geometry and boundary quantum gravity", preprint. 9. R. Friedrich, W. Werner, "Conformal fields, restriction properties, degenerate representations and SLE", C. R. Acad. Sci. Paris Ser. I, Math. 335, 947-952 (2002). 10. R. Friedrich, W. Werner, "Conformal restriction, highest-weight representations and SLE", preprint (2003). 11. T. Kennedy, "Monte Carlo tests of SLE predictions for 2D self-avoiding walks", Phys. Rev. Lett 88, 130601. 12. G. Lawler, O. Schramm, W. Werner, "Values of Brownian intersection exponents II: plane exponents", Acta Math. 187, 275-308 (2001). 13. G. Lawler, O. Schramm, W. Werner, "Analyticity of intersection exponents for planar Brownian motion", Acta Math. 189, 179-201 (2002). 14. G. Lawler, O. Schramm, W. Werner, "Conformal invariance of planar loop-erased random walks and uniform spanning trees", Annals of Probab., to appear (2001). 15. G. Lawler, O. Schramm, W. Werner, "On the scaling limit of planar self-avoiding walks", in Fractal Geometry and Application, A Jubilee of Benoit Mandelbrot, AMS Proc. Sympt. Pure math, to appear (2002). 16. G. Lawler, O. Schramm, W. Werner, "Conformal restriction. The chordal case", J. Amer. Math. Soc. 16, 917-955 (2003). 17. G. Lawler, J. A. Trujillo Ferreras, "Convergence of random walk loops to the Brownian loop soup", in preparation. 18. G. Lawler, W. Werner, "Universality for conformally invariant intersection exponents", J. Europ. Math. Soc. 2, 291-328 (2000). 19. G. Lawler, W. Werner, "The Brownian loop soup", to appear in Probab. Th. Rel. Fields (2003). 20. S. Rohde, O. Schramm, "Basic properties of SLE", to appear in Annals of Math (2001). 21. O. Schramm, "Scaling limits of loop-erased random walks and uniform spanning trees", Israel J. Math. 118, 221-288 (2000). 22. B. Virag, "Brownian beads", preprint (2003). 23. W. Werner, "Random planar curves and Schramm-Loewner Evolutions", in Lecture Notes of the 2002 St-Flour summer school, Springer, to appear.
Fluid dynamics and nonlinear PDEs Session organized by S. KUKSIN (Edinburgh) and J. P . ECKMANN (Geneve)
This page is intentionally left blank
Birkhoff normal form for some quasilinear Hamiltonian PDEs DARIO BAMBUSI
(U. Milano)
Consider a Hamiltonian P D E having an elliptic equilibrium at zero. Assuming a suitable condition on the frequencies we will construct a canonical transformation putting the system in Birkhoff normal form up to a small reminder. In the case of quasilinear systems the normal form will be used to describe the dynamics of smooth solutions of small amplitude. Applications to the water wave problem and to quasilinear wave equations will be given.
1. Introduction During the last fifteen years perturbation theory of Hamiltonian partial differential equations has been quite extensively studied and remarkable results have been established. In particular the existence of quasiperiodic solutions has been proved through suitable extensions of KAM theory [6,8,11,17,18,20]. However very little is known on the behavior of the solutions lying outside KAM tori, solutions which correspond to the large majority of initial data. In the finite dimensional case they are described by Nekhoroshev's theorem, whose extension to PDEs is at present a completely open problem a . However there is a particular situation where the full power of Nekhoroshev theorem is not needed: the neighborhood of an elliptic equilibrium point, where (in the finite dimensional case) Birkhoff normal form theorem gives a quite precise description of the dynamics. In the present paper we will present a result partially extending Birkhoff normal form theorem to PDEs. Assuming a suitable nonresonance condition we will show that for any positive integer M there exists a canonical transformation putting the Hamiltonian in Birkhoff normal form up to a reminder having a zero of order M + 1 at the origin. In the case of quasilinear systems [14,15] the normal form will be used to show that smooth initial data with norm e give rise to solutions remaining 0(eM) close to a torus up to times of order c" 1 . The theory will be applied to the equations of the water wave problem [9] and to quasilinear wave equations in arbitrary space dimension. For the water wave problem we will obtain that the transformation putting the system in normal form is well defined, but we will not deduce any dynamical consequence. For the quasilinear wave equation we will also deduce that smooth solutions remain close to finite dimensional tori for long times. Some of the results presented here were proved in [2]. a
See however the papers [1,5] where the dynamics close to finite dimensional small amplitude tori is studied.
273
274
DARIO BAMBUSI
2. The finite dimensional case Consider a finite dimensional Hamiltonian system with a linearly stable equilibrium point. It is well known that there exist coordinates in which the Hamiltonian takes the form H = H0 + P
(1)
ffo(p,<7):=f>^^
(2)
where
3= 1
and P has a zero of order at least three at the origin. Assume that P is of class C°°, then the following theorem holds. Theorem 2.1 (Birkhoff). For any positive integer M > 2 there exist a neighborhood of the origin UM and a canonical transformation TM '• MM -* K2Ar which puts the system in Birkhoff normal form, namely such that HoTM
= H0 + Z + RM
(3)
where Z Poisson commutes with Ho, namely {Ho; Z} = 0 and RM is small, i.e. \XRM(p,q)\
(4)
where XRM is the Hamiltonian vector field of RM (
DRM
9RM\
^-{-da-'^p-) Remark 2.1. In the case where the frequencies are nonresonant namely they fulfill u) • k ^ 0,
Vfc G Z"\{0} ,
(5)
the function Z can be shown to depend on the actions Ij := (Pj + ?f )/2 only. In the nonresonant case one can immediately deduce some dynamical consequences: Corollary 2.1. Assume that condition (5) holds, then for any M\,M?, with M := 1M\ + M2 > 2 there exists CM with the following property. Consider a solution z(t) corresponding to an initial datum ZQ = (p°,q°) fulfilling e : = \ZQ\ < tM •
Then there exists a smooth torus TZo such that d(z(t),TZ0)
for \t\ < {CMe)-M\
(6)
Moreover, up to the same time one has \z(t)\<2e,
\Ij(t)-
lj(0)\
(7)
Birkhoff normal form for some quasilinear Hamiltonian PDEs
275
The main difficulty in order to extend theorem 2.1 to the infinite dimensional case is due to the appearance of small denominators. In the finite dimensional case one has that, having fixed M, [0 < |fc| < M + 2, u • k ^ 0] implies \u> • k\ > 7 , (8) a condition needed in the proof of Birkhoff's theorem. In the infinite dimensional case (N = 00) the set considered in (8) is infinite and in all interesting cases one has inf
\oj • k\ = 0 .
0<|fc|<M + 2
This makes impossible a straightforward generalization of Birkhoff's theorem to PDEs.
3. A n abstract result for t h e infinite dimensional case Consider a (formal) Hamiltonian system of the form (1) with
HoM^vj^tt^.
(9)
To define precisely the phase space consider the Banach space (?s of the sequences such that
{XJ}J>\
\x\2s--=Y.i2Sx2j<00> and denote Va := £% x (?s. If z = (p,q) S Vs is a phase point, we will denote by \z\2s := bis + Ms (* ne square of) its norm and by BS(R) the open ball of radius R in VsFor any positive (large) N denote by u / ^ := (OJI, . . . , LJN) the truncation of length N of the frequency vector. We assume that HI) For any positive (large) r there exist a = a(r) and 7 = 7(7") > 0 such that for any N and VA; € ZN, with 0 < |fc| < r + 2 one has either
w(Ar) • k = 0 or
\JN)
• k\ > - £ - ,
where |fc| := \kx\ + ... + |fcjy|. H2) There exists so, and, for any positive r there exists dr with the following properties: for any s > So there exists an open neighborhood of the origin Us+dr C Vs+dr s u c h that the Hamiltonian vector field of H is defined on Us+dr and fulfills XH € Cr+2(Us+dr,Vs)Definition 3.1. Let /V be a positive integer, a functional Z will be said to be in iV-normal form if it is independent of all the variables {Pj,Qj}j>N and {Z,H0}=0.
(10)
For fixed N we will denote
HTM-J:^^.
(ID
276
DARIO BAMBUSI
Theorem 3.1. Assume H1-H2 and fix a positive M. Then there exist constants s', s*, i?» > 0, a function N(e), and an analytic canonical transformation T : i?s„(i?„) —• Vs, that puts the Hamiltonian in normal form up to order M. Precisely such that = H^N) + Z + TZ
HoT
(12)
where Z is in N(e)-normal form. Moreover, for any s > s* there exists Rs such that T restricts to an analytic map from BS(RS) to Vs and, for any z G Ba+a>(Rs+s<) the following estimate holds \Xn(z)\a
(14)
H2') There exists d, and, for any s > so a positive i?s+d, such that the map B3+d(Rs+d)
3 Z H A{z) G
is of class C°°. Moreover g is smooth, i.e. g G
B{Vs+d,Vs) C°°(Bs+d(Rs+d),Vs+d).
For any e small enough consider the set of the functions C £ C°([0, T], 7's+d)nC'1([0, T], Vs) fulfilling sup \C(t)\,+d+ sup \((t)\s<e (15) te[o,T] tg[o,T] and the linear time dependent equation z = A(0))z.
(16)
H3) There exists v > 1 such that the evolution operator U(t, s) associated to equation (16) exists and fulfills the estimate sup 0
\U(t,r)\e>
"+
P
<Me^T, +
with some constants M, (3 independent of £, T, e.
(17)
Birkhoff normal form for some quasilinear Hamiltonian PDEs
277
H4) g has a zero of order at least v + 1 at the origin. Due to the form of the Hamiltonian, one has A(z) = A0 + B(z), with A0 = XHo and B{.) an operator valued map vanishing at zero. In assumption H3 we are thinking of the case where B has a zero of order v at the origin. In the semilinear case one has B = 0, and therefore H3 is automatic. Remark 3.1. Using Kato's theory [14] one can prove that under the assumptions H2', H3, H4 the dynamics of the system is locally well posed in any space Vs with s > SQ + d. We fix now the notations concerning the objects we are going to compare. Given an initial datum ZQ we consider the corresponding solution z(t) of the equations of motion (14) of the original system. Then we consider the solution ZM{t) of the finite dimensional normalized system z = X W{z) + Xz(z) with initial datum Tl1^T~1(zo), where IIJV is the projector on the first TV" modes. Finally we define the approximating solution za(t) := T(zN(t)), which is the solution of the normalized system in the original coordinates. Theorem 3.2. Assume HI, H2t', H3, H4 and fix M > 2, then there exists s' » 1, and, for any s large enough, there exists es with the following property. If the initial datum is smooth and small enough, precisely if e:= \z0\s+s> < e s ,
(18)
then ds(z(t),za(t))
for\t\<~,
(19)
ofVs.
Remark 3.2. The above theorem is essentially an averaging theorem, indeed the time of validity of its description of the dynamics is of the same order of magnitude as the inverse of the size of the perturbation. Remark 3.3. When the frequencies are nonresonant the solution of the normalized system lie on an invariant torus, and so the same is true for za(t). From this one can conclude that there exists a finite dimensional smooth torus TZo with the property that, up to the considered times ones has ds(TZ0,z(t))
4. Applications 4.1. The water wave problem The water wave problem consists in describing the motion of the free surface of a fluid subjected to the gravitational force. Here we will consider the case of a fluid lying in a two dimensional domain of infinite depth. We will study space periodic solutions. In terms of
278
DARIO BAMBUSI
the velocity potential ip(x, y) and of the profile of the surface rj(x), the equations of motion are given by A(p = 0, 9? - > 0 ,
0 < x < 2n, — oo < y < r](x),
(20)
y -> - o o ,
(21) 2
(22)
Vt =
(23)
Zakharov [22] pointed out that this is a Hamiltonian system. The corresponding Hamiltonian function is the energy of the fluid, and conjugated variables are given by the wave profile 77(0;) and the velocity potential tp(x,ri(x)) at the free surface. It was shown by Zakharov that close to the equilibrium solution in which the fluid is at rest and the surface is horizontal the Hamiltonian has the form (1) with Wfc = \fg\k\-, k £ Z\{0}. So it is easy to verify that the frequencies fulfill the assumption HI. The regularity of the nonlinear part P can be proved by classical theory of elliptic equations. So assumption H2 holds. It follows that according to theorem 2.1 there exists a canonical transformation putting the Hamiltonian in normal form up to any finite order. We point out that in this problem nothing is known on the Lyapunov exponents of the system, and therefore we are not able to deduce any (rigorous) conclusion on the dynamics of the system. The first terms of the function Z were computed explicitly in a series of paper [9,10,12,13], with the surprising outcome that up to order four HQ + Z is integrable. The dynamics of this integrable system has also been studied in detail. However as far as we know the rigorous existence of the normalizing transformation was, up to now, established only for the transformation putting the system in third order normal form [9]. 4.2. A quasilinear wave e q u a t i o n Consider the n dimensional parallelepiped TZ with sides of length Li, namely n-.= {x = (x1,...,xn)
6R" :
0<Xi
and the formal Hamiltonian system defined by the Hamiltonian
H(u,v)=
[
|Vu|2
mu2
„„
_
'
dnx,
(24)
where u is a function on TZ vanishing at its boundary, v = u is the momentum conjugated to u, and W is a smooth function. The corresponding Hamilton equations have the form (25)
utt — Au + mu — bij(u, Vu)didjU + g(u, Vu) = 0
(with summation convention over the indexes i,j and suitable functions &,.,• and g) which should be supplemented with Dirichlet boundary conditions. In order to fit the abstract scheme we introduce the phase spaces Ts composed of the functions (u,v) £ Hs+1 © Hs with s := ns/2, fulfilling the compatibility conditions (-A)HHI=0'
°^3^
(-A)jv\
au
0,
0<j<
S+ l
1.
(26)
Birkhoff normal form for some quasilinear Hamiltonian PDEs
279
Passing to Fourier coefficient it is easy to see that Ts is isomorphic to Vs. In order to satisfy the nonresonance condition HI remark that the frequencies depend parametrically on the sides Li of the domain and on the mass m > 0: W
0 W „ ) = \/ [T3I)
+•••+ \T3n)
+rn.
(27)
Theorem 4.1. [See [2]J For any M there exists a set N C R" + 1 with full measure, such that, if (Li,..., Ln, m) £ TV" then the frequencies are nonresonant and fulfill the condition HI. Assume that the potential W is even in each of its arguments and that it has a zero of order v + 2 at the origin; then H2', H3, H4 hold and therefore theorem 3.2 applies. Consider an initial datum of the form u(x,0) = eu0(x),
u(x,0) = ev0(x).
(28)
Theorem 4.2. [See [2]] Fix M, assume that the parameters belong to M and that the potential W is even and smooth; then there exists s' with the following property. If |"o|s+s' ,
|^o|s+s' <
1
then, for small enough e there exists a finite dimensional torus TZOie such that one has ds(z(t),TZ0,e)
for\t\<{CMe)-v
.
(29)
5. Concluding remarks The main limitation of theorem 3.2 rests in the time of validity of its prediction. Indeed, in the finite dimensional case and also in some infinite dimensional cases [3,4] one has that the solution remains very close to a 'quasi-invariant' torus up to times 0(e~Ml) for any M\. So it is natural to ask whether a similar result holds also in the case considered by theorem 3.2. To discuss this point consider the particular case of equation (25) given by utt - (1 + uvx)uxx +mu = 0
(30)
with Dirichlet boundary conditions on [0, n]. According to the previous theory one has that for almost all values of the mass m the frequencies fulfill HI. Theorem 4.2 applies provided v is even. Now, it has been proved by Klainerman and Majda [16] that in the case m = 0 the solution of (30) with initial data (28) develops a singularity in the second derivative at a time of order e~v. In the case m = 0 the nonresonance condition of our theorem is violated, so Klainerman and Majda's result is not a counterexample to theorem 3.2, but we think that it poses serious doubts on the validity of our description for longer times scales. Finally we recall that up to now very little was known concerning equations in more than one space dimensions [4, 6, 8,19] and on the dynamics of quasilinear equations. We also mention the papers [7,21] that are strongly related to the present one.
280
DARIO BAMBUSI
Acknowledgments This work has been partially supported by INTAS-00.221 project and by "Gruppo Nazionale di Fisica Matematica" of "Istituto di Alta Matematica".
References 1. D. Bambusi, Math. Z. 130, 345 (1999). 2. D. Bambusi, "An averaging theorem for quasilinear Hamiltonian PDEs", Annales Henri Poincare, to appear. 3. D. Bambusi, Comm. Math. Phys. 234, 253 (2003). 4. D. Bambusi, B. Grebert, Forme normale pour NLS en dimension quelconque, CRAS Paris, to appear. 5. D. Bambusi, N. N. Nekhoroshev, Acta Applicandae Mathematicae 70, 1 (2002). 6. J. Bourgain, Ann. Math. 148, 363 (1998). 7. J. Bourgain, J. Anal. Math. 80, 1 (2000). 8. J. Bourgain, "Green Functions Estimates for Latticee Schrodinger Operators and Applications", preprint (2003). 9. W. Craig, "Birkhoff normal form for water waves", Mathematical problems in the theory of water waves, F. Dias, J. M. Ghidaglia, J. C. Saut (eds.), Contemporary Mathematics 200, AMS, 1996. 10. W. Craig, C. Sulem, J. Comp. Phys. 108, 73 (1993). 11. W. Craig, C. E. Wayne, Comm. Pure Appl. Math. 46, 1409 (1993). 12. W. Craig, P. A. Worfolk, Physica D 84, 513 (1995). 13. A. I. Dyachenko, V. E. Zakharov, Phy. Lett. A 190, 144 (1994). 14. T. Kato, Lect. Notes Math. 448 (1975). 15. T. Kato, Abstract Differential Equations and nonlinear Mixed Problems. Scuola Normale Superiore, Pisa, 1985. 16. S. Klainerman, A. Majda, Comm. Pure and Appl. Math. 33, 241 (1980). 17. T. Kappeler, J. Poschel, RAM & KdV, Springer, 2003. 18. S. B. Kuksin, Funct. Anal. Appl. 21, 192 (1987). 19. S. B. Kuksin, PMM U.S.S.R. 53, 150 (1989). 20. S. B. Kuksin, Analysis of Hamiltonian PDEs, Oxford University Press, 2000. 21. K. Matthies, A. Scheel, Trans. Amer. Math. Soc. 335, 747 (2002). 22. V. E. Zakharov, Appl. Mech. Tech. Physics 2, 190 (1968).
Coupling stochastic PDEs MARTIN HAIRER*:{U: Warwick) We consider a class of parabolic stochastic PDEs" driven by white noise in time, and we are interested in showing ergodicity for some cases where the noise is degenerate, i.e., acts only on part of the equation. In some cases where the standard Strong Feller / Irreducibility argument fails, one can nevertheless implement a coupling construction that ensures uniqueness of the invariant measure. We focus on the example of the complex Ginzburg-Landau equation driven by real space-time white noise.
1. Introduction In this work, we consider the long-time behaviour of stochastic partial differential equations of the type dX(t) = AX(t)dt + F(X)dt + QdW(t), (1) where A is the generator of an analytic semigroup on a Hilbert space H, W is a cylindrical Wiener process on Ti, Q : H —> H is a bounded operator, and F : D{F) —> Ti is a suitable nonlinearity. We refer to the monograph by Da Prato and Zabczyk [2] for a number of conditions on A, Q and F that ensure the well-posedness of (1), as well as the existence of a unique stochastic flow $>t(X) that yields the solution of (1) at time t with initial condition X£H. One associates to (1) a semigroup Vt acting on bounded measurable functions f : Ti —> R, as well as its dual semigroup Vf acting on Borel probability measures \x by
(Vt
(V;n)(A) =
E(^T\A))).
Prom an intuitive point of view, Vt
fT
and thus guarantee the existence of an invariant measure (since every accumulation point of RTH is an invariant measure under some minimal regularity assumptions, see e.g. [3]). The present paper focuses on the question of the uniqueness of the invariant measure for (1). We start in Section 2 by giving a very short review of two of the main methods used to tackle this question. We then proceed in Section 3 by describing how several kinds of *Work supported by the Swiss National Science Foundation.
281
282
MARTIN HAIRER
coupling methods have recently been applied to this problem. We conclude by explaining in detail how to apply some of the results explained in Section 3 to the complex GinzburgLandau equation driven by real-valued space-time white noise.
2. Two general m e t h o d s Until recently, two general methods dominated the literature about ergodicity results for the type of parabolic SPDEs considered here. We refer to these two methods as the "dissipativity method" and the "overlap method". The reader interested in a more detailed overview of these two methods is referred to the excellent review paper by Maslowski and Seidler [14], which also contains a more complete list of references. 2.1. The dissipativity method In its most crude form, this method assumes that A and F satisfy the dissipativity condition (X - Y, A[X -Y)
+ F(X) - F(Y)) < -c\\X - Y\\2 ,
(2)
for some positive constant c and for all X and Y in the domain of A. Under further (rather weak) regularity assumptions on F, (2) implies that any two solutions X(t) and Y(t) of (1) driven by the same realisation of the noise process W converge exponentially toward each other. This immediately implies that if /i* is an invariant measure for (1) and /i is any measure on H with sufficiently good decay properties, one has Vty, —* /i* in the topology of weak convergence. Of course, many variants of this method are available, in particular one may wish to measure the distance between solutions by a Lyapunov function V which is different from || • ||2 and more adapted to the problem at hand [13]. It is also possible to formulate conditions analogous to (2) that imply exponential convergence in the case where H has only a Banach space structure [2]. Unfortunately, there seems to be no way of applying the dissipativity method to situations where the deterministic part of the equation induces chaotic behaviour of the solutions, which is the situation of interest in the study of most problems related to turbulence. 2.2. The overlap method The "overlap method" is based on the following classical theorem by Doob [5]: Theorem 2 . 1 . Let Vt be a Markov semigroup which is irreducible and has the Strong Feller property. Then Vt admits at most one invariant measure. Recall that the strong Feller property means that the semigroup maps bounded measurable functions into continuous functions. Irreducibility means that (V?n)(A) > 0 for every fj,, every t > 0, and every non-empty open set A. The conditions of Theorem 2.1 imply that VIii and V^v have a non-zero "overlap" for any two probability measures fi and v, i.e., there exists a positive measure 6 such that Vf/J. — 6 and V^v — 5 are both positive measures. Combining this with the well-known fact from ergodic theory that if Vt admits more than one invariant measure, at least two of them must be mutually singular yields the
Coupling stochastic PDEs
283
result. If one furthermore assumes some bounds on the dissipativity of the equation (in a much weaker sense than in the previous subsection, one basically needs to get bounds on the hitting time of some compact set), the techniques exposed in the monograph of Meyn and Tweedie [17] allow to translate them into bounds on the convergence of an arbitrary initial measure toward the invariant measure. This convergence then takes place in the total variation distance, which can be defined between two measures \i\ and /Z2 on H by ||/ii-M2||TV=
>f
u(n2\{(x,x)\xen}),
(3)
where C(/xi,/i2) is the set of measures on H2 with marginals Hi and /x2. Notice that this distance does not take into account the topology of the space H. One can interpret it as measuring the maximal probability for two random variables with respective laws /L*I and /^2 to be equal. For a finite-dimensional SDE with smooth coefficients, the Strong Feller property is a consequence of the hypoellipticity of the operator dt + L, where L is the generator of the Markov process. A very efficient criteria for hypoellipticity is given by Hormander's theorem [10]. However, no satisfactory formulation of Hormander's theorem is available yet in the infinite-dimensional setting, so the Strong Feller property is usually proved by other means there. One efficient tool for proving that the strong Feller property holds for an infinite-dimensional system is given by the Bismut-Elworthy-Li formula [8]. In one of its formulations, this formula is given by
{DVtf){X)h = -t E((p O $ t )(X) J
(Q-i{D*,){X)h,dW(s))\
.
Here, the notation (Df)h is used to denote the directional derivative of the function / in the direction h. The main feature of this formula is that it yields bounds on DVt
284
MARTIN HAIRER
type equation with a covariance operator Q having a kernel of finite co-dimension has been treated by this method so far [7].
3. The coupling method In this section, we present one way to apply coupling techniques to the problem of ergodicity for stochastic PDEs. Recall that a coupling for a pair of measures Hi and fi2 is a measure /i on the product space with marginals Hi and /J.2- In the context of stochastic PDEs, we call a coupling for (1) a family of stochastic processes (X(t),Y(t)) indexed by their initial conditions (X(0),F(0)) G H2 and such that X{t) and Y(t) both solve (1). The basic idea of any coupling method is to introduce correlations between X(t) and Y(t) in such a way that \\X(t) — Y(t)\\ —> 0 as t —> oo. The precise type and speed of this convergence then determines the topology and the speed of convergence of Pt*/z to the (unique) invariant measure /i*. Although coupling methods have been used to prove ergodicity results for stochastic evolutions since the late thirties (see e.g. [4]), they seem to have been applied successfully to stochastic PDEs only recently. Actually, both the dissipativity and the overlap method can be interpreted as special cases of couplings. In the dissipativity method, one drives X(t) and Y(t) with the same realisation of the noise process and relies on the dissipativity of the equation to drive both processes toward each other. In the overlap method, one first discretises time (by looking at integer times, say) and then uses the maximal coupling for the transition probabilities. The transition probabilities P(X, Y, •) for the coupled process are thus given by the coupling that realises the infimum in (3) with ^i = P(X,-) and fi2 = P(Y, •). This coupling can easily be shown to exist and to be unique. (Here, P denotes the time 1 transition probabilities for (1).) In other words, the maximal coupling will try as hard as it can to force X(t) and Y(t) to become equal and, once it succeeds, they will be guaranteed to remain equal for all subsequent times. If one starts X in the invariant measure /i* and Y in an arbitrary measure H, one has \\V*H — M*I|TV < P { T > £}, where r is the first time at which X(T) = Y(r). In a series of recent papers, E, Mattingly, Sinai [6,16] and Kuksin, Shirikyan [11,12] realised that, loosely speaking, it is possible in certain cases to split the state space of (1) into two spaces Ti = W+ © H- and to combine the dissipativity method on 7i_ with the overlap method on H+. Let us call 7i+ the "unstable modes" and H- the "stable modes" and assume that H+ is finite-dimensional. In both cases, the authors focused on the example of the 2D Navier-Stokes equation on a torus with periodic boundary conditions, forced by a noise with a covariance operator Q that has the important property that n + <2iT + is invertible on Ti+. (We denote by H± the orthogonal projections on H±.) Under this condition, they constructed a coupling with the following properties. On H+, it behaves like the maximal coupling, i.e., it maximises the probability for the H+ components X+(t) and Y+ (t) to become equal. On H~ on the other hand, it drives X- and YL with identical realisations of that component of the noise process. The space H- is chosen in such a way that the dynamic then tends to steer X- and Y_ toward each other. (In the typical situation of a semilinear parabolic equation, the space H- would consist of Fourier modes with sufficiently high wave number.) The main difficulty of this approach comes from the fact that, unlike in the situation of
Coupling stochastic PDEs
285
the overlap method, having X+(t) = Y+(t) at a certain time does not allow one to ensure that X+(s) = Y+(s) for all subsequent times s > t. The reason is that one has in general X- (t) ^ y_ (t) and therefore, if one drives X and Y with the same realisation of the noise, the influence of the stable modes will tend to drive the unstable modes away from each other. Call Wx and Wy the noises driving X and Y respectively. The (finite-dimensional) processes X+ and Y+ are then solutions of SDEs of the following type: dX+(t) = g(X+,X^)
dt + U+Qn+
dY+{t) = g(Y+,Y-)dt
+
dWx{t),
Il+Qn+dWy{t).
Since T1+QU+ is invertible by assumption, one notices that the Girsanov transform
dwv(t) = d\vx(t) + (n + Qn+)- 1 ( 5 (x + ,x_) - 5 ( y + , Y_)) dt,
(4)
allows to have X+(t) = Y+(t) for all times. Of course, dWy as defined by (4) is not a Wiener process anymore, so this is not an acceptable coupling. It is nevertheless possible to construct a coupling that gives positive mass to the event (4). Furthermore, as long as it is satisfied, the difference \\X- — Y_|| converges to 0, so the difference between dWx and dWy in (4) becomes smaller and smaller. The coupling can be constructed in such a way that it therefore becomes more and more likely for the event (4) to be satisfied. By carefully estimating these probabilities, one can show that the random time r — inf{£ > 0 | X + ( s ) = Y+(s)Vs > t} is almost surely finite. Estimates on r and on the speed at which \\X- — Y_ || —> 0 then immediately translate into estimates on the speed at which V*fi converges to the invariant measure. Another method for constructing couplings for an equation of the type (1) is to construct two operator-valued functions G\ and G2 such that G\G\ + GiG*i = Identity,
(5)
and to consider the couple of equations dX(t) = AX{t) dt + F{X) dt + QGx{X, Y) dW^t) + QG2(X, Y)
dW2(t),
dY{t) = AY(t) dt + F{Y) dt + Qd(X,
dW2(t),
Y) dWx{t) - QG2{X, Y)
where W\ and W2 are two independent cylindrical Wiener processes. It is clear that this is a coupling for (1) for any choice of the Gi satisfying (5). It was shown in [18] that, for a certain class of semilinear parabolic equations, it is possible to choose the Gi in such a way that the stopping time r = inf{t > 0 | X(t) = Y(t)} is almost surely finite, therefore obtaining convergence toward the invariant measure in the total variation distance. We finally turn to the coupling technique developed in [9]. This technique is very close in spirit to the one exposed in [11,12,16]. However it allows in some cases to treat a situation where the noise acts in a degenerate way on the unstable part of the equation. The idea exposed in [9] is to look for a Banach space 8 c H with norm || • ||* and for a function G : S 2 —> H with the following properties: P I The solutions of (1) are almost-surely B-valued and they satisfy E||«(*)||£ < C(*)(l + ||«(0)||J).
286
MARTIN HAIRER
P 2 The function G is bounded by ||G(u,u)|| < C\\u - v\\e(l + \\u\\* + \\v\\*)N for some positive constants C, s, N, and for all u and v in B. P 3 If u is the solution of (1) driven by dW and v is the solution of (1) driven by dW = dW + G(u, v) dt, then ||u(t) — u(£)||* —> 0 exponentially as t —• oo. The main result of [9] is then: T h e o r e m 3.1. Consider (1) and assume that there exists a function G and a Banach space B C H satisfying P1-P3. Then, (1) has a unique invariant measure and, for every initial condition in 7i, the solution converges to the invariant measure at an exponential rate. R e m a r k 3 . 1 . The convergence takes place in a Wasserstein distance which is the same as the one usually used in the dissipativity method. The topology defined by this distance is slightly stronger than the topology of weak convergence. There does not seem to be a general recipe for finding a suitable function G. On the other hand, when the nonlinearity is small (not in the sense of "arbitrarily small" like in perturbation theory, but in the sense that the noise gets transmitted to the whole phase space through the linear part of the equation and that the nonlinearity does not affect this behaviour), we will see in the following section that it is not too difficult to find a good function G.
4. T h e complex Ginzburg-Landau equation We now focus on the following equation, which is a stochastic perturbation of the complex Ginzburg-Landau equation: du{x,t) = ((l + ia)Au{x,t)
+ (l + i(3)u(x,t)-u(x,t)\u(x,t)\2)dt
+ dW{x,t).
(CGL)
In this equation, a and ft are two real-valued parameters, x G [—L,L], A denotes the Laplace operator, and u(-,t) is assumed to satisfy periodic boundary conditions. Furthermore, ^f(x,t) formally denotes space-time white noise, i.e., it is a distribution-valued Gaussian process with covariance E^—(x,t)
—
(y,s))=6(x-y)6(t-s).
It is a standard result (see [2]) that (CGL) has a unique H = L2([—L, L])-valued solution. We denote again by Vt and Vf the semigroups associated to it. The remainder of this paper is devoted to the proof of the following result: T h e o r e m 4 . 1 . For every pair (a, (3) satisfying the conditions 3 |/?l
*2'
24/32
^ 3 6 / ^ ( 1 5 - / ^ '
(6)
the equation (CGL) has a unique invariant measure and all solutions converge exponentially to it.
Coupling stochastic PDEs
287
Remark 4.1. The conditions on a and on /? are not sharp, not even within the framework of the method of proof presented here. Proof. All the calculations in this proof will be performed at a formal level. A rigorous justification is relatively easy by using the regularising properties of the equation (CGL). It is indeed a standard result that the solutions of (CGL) are almost surely continuous functions of space and of time. In particular, condition P I holds for B equal to the space of continuous functions equipped with the supremum norm. Denoting the real and imaginary parts of a complex-valued function u by ur and ut, (CGL) becomes dur(x, t) = (Awr — aAui + ur — (3ui — ur(v% + u\)) dt + dW(x, t), dui(x, t) — (AUJ + aAur +Ui + (3ur — v.i(v% + u?)) dt. Our aim is to give an explicit expression for a function G satisfying properties P 1 - P 3 of the previous section. Denote by v = vr + i Vi the solution of (CGL) driven by the noise process dW — dW + Gdt and define g = v — u. One then has for g: dgr = (Agr - aAft + gr - /Jp» - vr(vf + v?) + ur(v% + uj)) dt + G(u, v), 2
dSi = (A0i + aAgr + & + /3gr - Vi(v? + vf) + Ui(u T + uf)) dt.
(7a) (7b)
The most "dangerous" part of this equation is the linear instability. Before we proceed, we therefore consider the following simplified equations: —
= gr-PQi
+ G,
— = gi + /3gr.
(8)
Let us introduce the variable £ = f3gr + 3£; and rewrite the second equation as
Choosing G = -5ft.+ ( ^ - | ) f t ,
(10)
we obtain for £ the equation ^ = —£. Therefore, £ converges exponentially to 0, which in turn implies by (9) that Qi converges exponentially to 0. Since gr is a linear combination of £ and Qi, it converges exponentially to 0 as well. To characterise this convergence more precisely, we introduce the norm
M2 = 8 + ^8+
%&•*•
A straightforward computation shows that, with the choice (10) for G, one has
^ for some constant 7 > 0.
= -2#-J^<-7M a ,
(ii)
288
MARTIN HAIRER
We now turn to the full equation (7), which we rewrite as - ^ = Agr - aAft - Agr - -Qi + G2(u, v), at p
(12a)
-J5- = Agi + aAgr + ft + /3gr - {ut + gi)(urgr + u^i + g2. + g2) - gi{u2r + u2), (12b) where we defined G = Gi + G2 with Gx = -5gr + [p - - ) & + vr(v2 + v2) - ur(u2r + u2). Notice that G\ satisfies P 1 - P 2 with || • ||, replaced by the supremum norm. By analogy with (11), we introduce the norm \Qr\\2 +a\\Qi\\2 + b(Qr,Qi),
a-
15 . 6 „2 , "~g
that dur
<-i\y\f-Tx-T2,
dt where we defined
Ti = (2 + ab)\\Vgr\\2 + (2a - a6)||Vft|| 2 + 2(6 + aa - a)(Vgr,
Vft),
and T2 = -{2gr + bguG2) + (2agt + bgr, (ut + gi)(urgr + Uigi + g2 + gf) + gi(u2 + u2)) . Due to the second condition in (6), T\ is always positive. It thus remains to find a function G2 satisfying P 1 - P 2 and such that T2 is always positive. Since G2 is multiplied by 2gr + bgi, we can choose it in such a way to replace every occurrence of gr by —^Qi in the above expression, thus yielding ia-b2/ ( o „ o T2 = — - — ( Q U \ur + 2u;-
b / b2\ -UiUr + [2 + —jgiUi+
/ b2\ \1 +
2
b \ \ —)gi--urgijgi).
Since 4a — b2 is positive, positivity of T2 is equivalent to the matrix -b 8
-b
4 + ^_
-b 4 + ^2
4A -L+ 6) :
being positive definite. Since T is positive definite for b = 0, it remains so until the first value of b for which d e t r - ^ - ( 6 4 + 1262-64) = 0 . 64 Therefore, under the condition \b\ < 4, (i.e., |/3| > | ) , one has g^lel 2 < —7||e||2 almost surely. Applying Theorem 3.1 concludes the proof of Theorem 4.1. • R e m a r k 4.2. Note that the same proof goes through if one multiplies the cubic term in (CGL) by a factor (1+17) with 7 £ R small enough.
Coupling stochastic PDEs
289
References 1. S. Cerrai, "Smoothing properties of transition semigroups relative to SDEs with values in Banach spaces", Probab. Theory Relat. Fields 113, 85-114 (1999). 2. G. Da Prato, J. Zabczyk, Stochastic Equations in Infinite Dimensions, University Press, Cambridge, 1992. 3. G. Da Prato, J. Zabczyk, Ergodicity for Infinite Dimensional Systems, volume 229 of London Mathematical Society Lecture Note Series, University Press, Cambridge, 1996. 4. W. Doeblin, "Expose sur la theorie des chaines simples constantes de Markoff a un nombre fini d'etats", Rev. Math. Union Interbalkanique 2, 77-105 (1938). 5. J. L. Doob, "Asymptotic properties of Markoff transition probabilities", Trans. Amer. Math. Soc. 63, 393-421 (1948). 6. W. E, J. C. Mattingly, Y. Sinai, "Gibbsian dynamics and ergodicity for the stochastically forced Navier-Stokes equation", Comm. Math. Phys. 224, 83-106 (2001). Dedicated to Joel L. Lebowitz. 7. J.-P. Eckmann, M. Hairer, "Uniqueness of the invariant measure for a stochastic pde driven by degenerate noise", Comm. Math. Phys. 219, 523-565 (2001). 8. K. D. Elworthy, X.-M. Li, "Formulae for the derivatives of heat semigroups", J. Fund. Anal. 125, 252-286 (1994). 9. M. Hairer, "Exponential mixing properties of stochastic PDEs through asymptotic coupling", Probab. Theory Related Fields 124, 345-380 (2002). 10. L. Hormander, The Analysis of Linear Partial Differential Operators I-IV, Springer, New York, 1985. 11. S. Kuksin, A. Piatnitski, A. Shirikyan, "A coupling approach to randomly forced nonlinear PDEs II", Comm. Math. Phys. 230, 81-85 (2002). 12. S. B. Kuksin, A. Shirikyan, "A coupling approach to randomly forced nonlinear PDEs I", Comm. Math. Phys. 221, 351-366 (2001). 13. B. Maslowski, "Uniqueness and stability of invariant measures for stochastic differential equations in Hilbert spaces" Stochastics Stochastics Rep. 28, 85-114 (1989). 14. B. Maslowski, J. Seidler, "Invariant measures for nonlinear SPDEs: Uniqueness and stability", Archivum Math. 34, 153-172 (1999). 15. B. Maslowski, J. Seidler, "Probabilistic approach to the strong Feller property", Probab. Theory Related Fields 118, 187-210 (2000). 16. J. C. Mattingly, "Exponential convergence for the stochastically forced Navier-Stokes equations and other partially dissipative dynamics" Comm. Math. Phys. 230, 421-462 (2002). 17. S. P. Meyn, R. L. Tweedie, Markov Chains and Stochastic Stability, Springer, New York, 1994. 18. C. Mueller, "Coupling and invariant measures for the heat equation with noise", Ann. Probab. 21, 2189-2199 (1993). 19. J. Zabczyk, "Structural properties and limit behaviour of linear stochastic systems in Hilbert spaces", in Mathematical Control Theory, volume 14 of Banach Center Publ., PWN, Warsaw, 1985, pp. 591-609.
Long time behaviour of periodic stochastic flows V.
KALOSHIN*
(Caltech), D.
DOLGOPYAT*
(Maryland), L.
KORALOV*
(Princeton)
We consider the evolution of a set carried by a space periodic incompressible stochastic flow in a Euclidean space. We report on three main results obtained in [8-10] concerning long time behaviour for a typical realization of the stochastic flow. First, at time t most of the particles are at a distance of order \/t away from the origin. Moreover, we prove a Central Limit Theorem for the evolution of a measure carried by the flow, which holds for almost every realization of the flow. Second, we show the existence of a zero measure full Hausdorff dimension set of points, which escape to infinity at a linear rate. Third, in the 2-dimensional case, we study the set of points visited by the original set by time t. Such a set, when scaled down by the factor of t, has a limiting non random shape.
1. Introduction We study of the long-time behaviour of a passive substance (or scalar) carried by a stochastic flow. Motivation comes from applied problems in statistical turbulence and oceanography, Monin k Yaglom [18], Yaglom [21], Davis [7], Isichenko [11], and Carmona & Cerou [3]. The questions we discuss here are also related to the physical basis of the Kolmogorov model for turbulence, Molchanov [17]. 1.1. Local properties of the dynamics The key element of our analysis is presence of nonzero Lyapunov exponents. The physical mechanism of turbulence is still not completely understood. It was suggested in [19] that the appearance of turbulence could be similar to the appearance of chaotic behaviour in finite-dimensional deterministic systems. Compared to other cases, the mechanism responsible for stochasticity in deterministic dynamical systems with nonzero Lyapunov exponents is relatively well understood. It is caused by a sensitive dependence on initial conditions, that is, by exponential divergence of nearby trajectories. It is believed that a similar mechanism can be found in many other situations, but mathematical results are scarce. Here we describe a setting where analysis similar to the deterministic dynamical systems with nonzero Lyapunov exponents can be used. We consider a flow of diffeomorphisms on an n-dimensional torus T" (or more generally on a compact manifold), generated by solutions of stochastic differential equations driven by a finite-dimensional Brownian motion. d
dxt = J2 Xk(xt)
o d6k(t) + X0(xt)
dt,
(1)
fe=i *V. K. was partially supported by American Institute of Mathematics Fellowship, NSF, and Courant Institute. tD. D. was partially supported by NSF and Sloan Foundation. *L. K. was partially supported by NSF postdoctoral fellowship.
290
Long time behaviour of periodic stochastic flows
291
where XQ,XI,. .. ,Xd are C00 smooth divergence free space periodic vector fields on R" with period one and 6(t) = (6i(t),..., 6d(t)) is a standard Revalued Brownian motion. We show that the presence of non-zero exponents combined with certain non-degeneracy conditions (amounting roughly speaking to the assumption that the noise can move the orbit in any direction) implies almost surely chaotic behaviour in the following sense: — Exponential, in time, decay of correlations between the trajectories with different initial data. — Equidistribution of images of submanifolds. — Central Limit Theorem, with respect to the measure on a "rich enough" subset, which holds for almost every fixed realization of the underlying Brownian motion. In order to illustrate the last point, let us consider a periodic flow on R n , and let v be a Lebesgue probability measure concentrated on an open subset. As a motivating example one may think of an oil spot on the surface of the ocean. The ultimate goal could be to remove the oil or at least to prevent it from touching certain places on the surface of ocean. Thus, we wish to predict the properties and the probability laws governing the dynamics of the spot in time. Let vt be the measure on R" induced from v by time t map of the flow. We shall show that almost surely ut is asymptotically equivalent to a Gaussian measure with variance of order t. In other words, for a sufficiently large positive R for large time 99 percent of the oil spot is contained in the ball of radius Ry/i. See Theorem 2.1 below for an exact statement. 1.2. Global properties of the dynamics Global and local properties are intimately related. Ballistic points As has been demonstrated for a large class of stochastic flows with zero mean, under some mixing conditions on the flow, the displacement of a single particle is typically of order \Ji for large t. On the other hand, it has been shown in the work of Cranston, Scheutzow, Steinsaltz, and Lisei [5], [6], [13], [20] that in any open set there are points which escape to infinity at a linear rate. We sharpen these results for the stochastic flow (1) and show that the linear escape points form a set of zero measure and full Hausdorff dimension (see Theorem 2.4 below). L « = ( i 6 l " : liminf t -> o o - ^ >
I
/
t
0
1
J
(2)
for a.e. realization {6(t)}t>o we have HD(Lg) = n. S h a p e of poisoned a r e a Consider the planar case n = 2. Denote the original set by fl c K2. The evolution of the set under the action of the flow will be denoted by Qt • We study the set of "poisoned" points, that is those visited by the image of O before time t,
Wt(ft) = ( J n , . 3
292
V. KALOSHIN, D. DOLGOPYAT, L. KORALOV
As shown in [5] and [6], the diameter of this set grows linearly in time almost surely. It turns out that Wt(fi), if scaled down by t, converges almost surely to a nonrandom convex set B e l 2 independent of fl
«U*
(3)
Now we give the precise statements.
2. Statement of the results Consider a stochastic flow of diffeomorphisms on M.n generated by a finite-dimensional Brownian motion (1). We impose several assumptions on the vector fields Xo, X\,..., Xd, which are stated in the next section. All those, except the assumption of zero drift, are nondegeneracy assumptions and are satisfied for a generic set of vector fields XQ, X\,..., Xj. The main results are the following: Central Limit Theorems Let v be a probability measure on M", which has compact support, such that for some positive p it has finite p-energy
In particular, this means that the Hausdorff dimension of the support of v is positive (see [14, sect. 8]). Theorem 2.1. [8] Let v be a probability measure with finite p-energy for some p > 0 and with compact support, and let assumptions (A)-(D) below be satisfied. Let xt be the solution of (1) with the initial measure v. Then for almost every realization of the Brownian motion the distribution of •% induced by v converges weakly as t —* oo to a Gaussian measure on W1 with zero mean and some variance D. We also prove Central Limit Theorems for a 2-point motion: Theorem 2.2. [8] Let conditions (A)-(C) be satisfied. Let x\ and x\ be the solutions of (1) with different initial data. Then for some value of the drift v the vector -y;(xt ~ v^i xt ~ v^) converges as t —> oo to a Gaussian random vector with zero mean. and an m-point motion for any positive integer m: Theorem 2.3. [8] Let the conditions (A), (Bm), and (C) be satisfied. Let x\,..., x™ be the solutions of (1) with pair-wise different initial data. Then for some value of the drift v the vector \{x\ — vt,... ,x™ — vt) converges as t —> oo to a Gaussian random vector with zero mean. Actually all these results can be stated more generally in terms of additive functionals of one, two, and m-point motion on a compact manifold (see [8]). In the proof of the above results we essentially use the work of Baxendale-Stroock [1,2]. Baxendale also proposed an application of results of Meyn-Tweedie [15], which helped us to simplify the proofs.
Long time behaviour of periodic stochastic flows
293
Ballistic points Cranston-Scheutzow-Steinsaltz [5,20] proved that the set of ballistic points L$ is uncountable. Here is an extension of their result. Theorem 2.4. [9] Let assumptions (A)-(D) he satisfied. For almost every realization of the Brownian motion {@(t)}t>o we have that points of the flow (1) with linear escape to infinity Le form a dense set of full Hausdorff dimension HD{Le) = n. Shape of poisoned area Theorem 2.5. [10] Let the original set Q, C R 2 be bounded and contain a continuous curve with positive diameter and let assumptions (A)-(E) below he satisfied. Then there is a compact convex non random set B, independent of £l, such that for any e > 0 almost surely (1 - e)tB C Wt(n) C (1 + e)tB
(5)
for all sufficiently large t. 2.1. Nondegeneracy assumptions In this section we formulate a set of assumptions on the vector fields XQ, X\,..., Xj. Recall that XQ, X\,..., Xd are assumed to be periodic and divergence free. We shall assume that the period for all of the vector fields is equal to one. (A) (Strong Hormander Condition for xt) For all i £ R" we have Ue(X1,...,Xd){x)
= M2,
where Lie(Xi,... ,Xd)(x) is linear span of all possible Lie brackets of all orders formed out of X\,..., Xd at x. Strong Hormander Condition for xt basically implies that probability that a point after time one gets into an open set is positive. Denote the diagonal in T n x Tn by A = {(a;1,re2) eRnxRn
: x1=x2
(mod 1)}.
(B) (Strong Hormander Condition for the two-point motion) The generator of the twopoint motion {(xj,x2) : t > 0} is nondegenerate away from the diagonal A, meaning that the Lie brackets made out of (Xi(x1),Xi(x2)), . . . , (Xd(x1),Xd(x2)) generate T x iT n x Tx2Tn = Rn x R". (B m ) (Strong Hormander Condition for the m-point motion) The generator of the mpoint motion {{x},. ..,x™) : t > 0} is non-degenerate away from the generalized diagonal A (/= )(T"), meaning that Lie brackets made out of (Xi(x1),... ,Xi(xm)), ..., (Xd(x*),..., Xd(xm)) generate T x iT" x . . . x TxmTn = R n x . , . x l " ( m times). To formulate the next assumption we need additional notations. Let Dxt : T Xo R n —> TXtRn be the linearization of xt at t. We need the Strong Hormander Condition for the process {(xt, Dxt) : t > 0}. Denote by TXk the derivative of the vector field Xk thought as the map on TRn and by SRn = {v e TW1 : \v\ = 1} the unit tangent bundle on Rn. If we denote by Xk(v) the projection of TXk(v) onto TvSRn, then the stochastic flow (1) on R n
294
V. KALOSHIN, D. DOLGOPYAT, L. KORALOV
induces a stochastic flow on the unit tangent bundle SRn defined by the following equation: d
dxt = ^ X f c ( x t ) o dOk(t) + X0(xt)
dt.
fc=i
With these notations we have condition (C) (Strong Hormander Condition for (xt,Dxt))
For all v £ SRn we have
Ue(X1,...,Xd)(v) Let LxkXk(x)
TvSRn.
=
denote the derivative of Xk along Xk at the point x. Notice that 1
d
is the deterministic component of the stochastic flow (1) rewritten in Ito's form. Below we show that conditions (A)-(C) guarantee that the flow (1) has Lyapunov exponents and at least one of them is positive. We require that the flow has no deterministic drift, which is expressed by the following condition. (D) (zero drift) d / i1 JL
r
\
-^2LXkXk+X0\(x)dx
= 0,
(6)
We further require that /
Xk(x)dx
= 0,
k = l,...,d.
(7)
The last assumption is concerned with the geometry of the stream lines for one of the vector fields X\,..., Xd in the 2-dimensional case. Fix a coordinate system on the 2-torus T 2 = {x = (xi,x2) mod 1}. As the vector fields have zero mean and are divergence free, there are periodic stream functions Hi,... ,Hd, such that Xk(x) = (—H',H'). We require the following (E) (Morse condition on the critical points of Hi) non-degenerate.
All of the critical points of Hi are
Functions with this property are called Morse functions. generic function is a Morse function (see e.g. [16]).
It is a standard result that a
2.2. Nondegeneracy assumptions (A)—(C) imply positive Lyapunov exponents For measure-preserving stochastic flows with conditions (C) Lyapunov exponents A i , . . . , AdimM do exist by multiplicative ergodic theorem for stochastic flows of diffeomorphisms (see [4, thm. 2.1]). Under condition (A) the sum of the Lyapunov exponents is equal to zero. On the other hand, Theorem 6.8 of [1] states that under condition (A) all of the Lyapunov exponents can be equal to zero only if for almost every realization of the flow (1) one of the following two conditions is satisfied (a) there is a Riemannian metric p' on T n , invariant with respect to the flow (1) or (b) there is a direction field v(x) on T n invariant with respect to the flow (1).
Long time behaviour of periodic stochastic flows
295
However (a) contradicts condition (B). Indeed, (a) implies t h a t all the Lie brackets of {(Xie(x1), Xk(x2))}f=1 are tangent to the leaves of t h e foliation {(x\x2)
G T n x T " : p'(x\x2)
= const.}
and don't form the whole tangent space. On the other hand (b) contradicts condition (C), since (b) implies t h a t all the Lie brackets are tangent to t h e graph of v. This positivity of X\ is crucial for our approach.
References 1. P. Baxendale, "Lyapunov exponents and relative entropy for a stochastic flow of diffeomorphisms", Probab. Th. & Rel. Fields 81, 521-554 (1989). 2. P. Baxendale, D. W. Stroock, "Large deviations and stochastic flows of diffeomorphisms", Prob. Th. & Rel. Fields 80, 169-215 (1988). 3. R. Carmona, F. Cerou, Transport by incompressible random velocity fields: Simulations and Mathematical Conjectures, AMS, Providence, 1999, pp. 153-181. 4. A. Carverhill, "Flows of stochastic dynamical systems: ergodic theory", Stochastics 14, 273-317 (1985). 5. M. Cranston, M. Scheutzow, D. Steinsaltz, "Linear expansion of isotropic Brownian flows", El. Comm. Prob. 4, 91-101 (1999). 6. M. Cranston, M. Scheutzow, D. Steinsaltz, "Linear bounds for stochastic dispersion", Ann. Prob. 28, 1852-1869 (2000). 7. R. Davis, "Lagrangian Ocean Studies", Ann. Rev. Fluid Mech. 23, 43-64 (1991). 8. D. Dolgopyat, V. Kaloshin, L. Koralov, "Sample path properties of the stochastic flows", Ann. Probability 32, 1-27 (2004). 9. D. Dolgopyat, V. Kaloshin, L. Koralov, "Hausdorff dimension of linear escape points for periodic stochastic dispersions", J. Stat. Physics 108, 943-971 (2002). 10. D. Dolgopyat, V. Kaloshin, L. Koralov, "A limit shape theorem for periodic stochastic dispersion", Comm. in Pure and Appl. Math. 57, 1127-1158 (2004). 11. M. Isichenko, "Percolation, Statistical topography and transport in random media", Reviews in Modern Physics, 1992. 12. R. Leandre, "Minoration en temps petit de le densite d'une diffusion degeneree", J. Func. Analysis 74, 399-414 (1987). 13. H. Lisei, M. Scheutzow, "Linear bounds and Gaussian tails in a stochastic dispersion model", Stock. Dyn. 1, 389-403 (2001). 14. P. Mattila, Geometry of sets and measures in Euclidean spaces. Fractals and rectifiability, Cambridge Studies in Advanced Mathematics 44, Cambridge University Press, Cambridge, 1995. 15. S. Meyn, R. Tweedie, "Stability of Markovian processes III: Foster-Lyapunov criteria for continuous-time processes", Adv. Appl. Prob. 25, 518-548 (1993). 16. J. Milnor, "Morse theory", Ann. of Math. Studies, no. 51, Princeton University Press, Princeton, N.J., 1963. 17. S. Molchanov, "Topics in statistical oceanography", in Stochastic Modeling in Physical Oceanography, Boston, Birkhauser, 1996, pp. 343-380. 18. A. Monin, A. Yaglom, Statistical fluid mechanics: mechanism of turbulence, MIT Press, Cambridge, MA, 1971. 19. D. Ruelle, F. Takens, "On the nature of turbulence", Comm. Math. Phys. 20, 167-192 (1971). 20. M. Scheutzow, D. Steinsaltz, "Chasing balls through martingale fields", Ann. Probability 30, 2046-2080 (2002). 21. A. Yaglom, Correlation Theory of Stationary and Related Random Functions, vol. 1: Basic Results, Springer-Verlag, New York, 1987.
Local in time and space nonlinear stability of pulses in an unstable medium Gumo
SCHNEIDER
(U. Karlsruhe),
HANNES U E C K E R
ROBERT L. P E G O (U. Karlsruhe)
(Maryland),
Burgers equation and the KS-perturbed KdV-equation arise as amplitude equations for small amplitude long waves on the surface of a viscous liquid running down an inclined plane. Burgers equation describes the system in case when the trivial solution, the so called Nusselt solution, is spectrally stable. If the inclination angle is increased the Nusselt solution becomes unstable via a sideband instability and the KS-perturbed KdV-equation takes the role of Burgers equation. It is the purpose of this paper to discuss certain aspects of the dynamics of these two equations and the consequences for the inclined plane problem. Our main issue is the analysis of the KS-perturbed KdV-equation 6\u=—d^u— ^dx(u2)— e(d^+d^)u. In this equation the dynamics is dominated by traveling pulse-trains, although the individual pulses are unstable due to the long-wave instability of the flat surface. In order to add a piece to the understanding why the dynamics of this system is dominated by individually unstable pulses, we prove a nonlinear local in space and time stability result for the individual pulses. The identification of a twodimensional locally attractive structure near these pulses allows one to explain the generation of shelves behind excited pulses.
1. Self similar decay of perturbations in the stable case Burgers equation dtu - d\u -
dx(u2)
arises as an amplitude equation for small amplitude long waves on the surface of a viscous liquid running down an inclined plane [2,5,12]. It describes this system in case when the trivial solution, the so called Nusselt solution which possesses a parabolic flow profile and a flat top surface, is spectrally stable. This is the case when the inclination angle 9 which serves as a bifurcation parameter in this physical problem is below a critical value 0C; see figure 1. In Burgers equation there is a self-similar decay of spatially localized perturbations to the trivial solution. In order to explain our result for the inclined plane problem we recall this result for Burgers equation first. By the Cole-Hopf transformation ip(x,t) = eI-oou(x't>dx Burgers equation reduces to the heat equation dtip = dxtp. For the heat equation it is well known that solutions with initial conditions limx^-ooip(x,0) = 1 and limx-^oo ip(x, 0) = A + 1 > 0 satisfy, under very weak assumptions, lim ip(xVi,t)
t—+oo
= 1+
Aerf(x)
with rate 0(l/y/i). Using the fact that the inverse Cole-Hopf transformation is given by u(x,t) = dxip(x,t)/ip(x,t) it follows that solutions u with spatially localized initial condi-
296
Local in time and space nonlinear stability of pulses in an unstable medium
297
Figure 1. The inclined film problem: A fluid of height y — h(x,t) runs down a plate with inclination angle 6 subject to constant gravitational force g.
tions in Burgers equations satisfy A prf'(r)
d
rA(x)
t—>oo
with a rate 0(l/\/i). The limiting profile satisfies \imx^±00 f^(x) = 0. Therefore, the renormalized solutions to spatially localized initial perturbations converge towards a nonGaussian limit. This kind of dynamics in Burgers equation persists under higher order perturbations of the equation. Lemma 1.1. [1] For b > 0, C\ > 0 there exists a Ci > 0 such that the following holds. Let H(u,dxu) = 0 ( | u P l | + \uP2(dxu)P3\) for \u\ + \dxu\ -> 0 with pi > 4, p2 + 2p 3 > 4 and let UQ be an initial condition of dtu = d2xu + dx{u2) + H(u, dxu) with Hwoll/f2 = ||ito/021|jf2 < C2, where p(x) = \/\ + x2. u\t=\ = UQ satisfies
(1)
Then the solution u of (1) with
\\Vtu(xVi,t) - rA(x)\\Hl < c.t-1/2^ for each t € [1,00) and an A > —1. This observation is also true in the full physical problem and so the Nusselt solution turns out to be nonlinearly stable in the sense that spatially localized perturbations decay in a self-similar way to zero, as predicted by Burgers equation. Such a nonlinear stability result is a nontrivial task since the linearization around the Nusselt solution possesses continuous spectrum up to the imaginary axis and since the Navier-Stokes equations with the free top surface which describe the inclined plane problem form a quasilinear system [11]. In detail, the inclined plane problem is completely determined by the evolution of the free top surface y = h(x,t) and the velocity field u = u(x,y,t) £ R2 in the interior £l(t) = {(x,y) \ 0 < y < h(x,t)} of the fluid. Theorem 1.1. [10] Let b € (0, \) and assume that the trivial solution, the Nusselt solution (hN,ujv) is spectrally stable, i.e., let 6 < 6C. Then for all C\ > 0 there exists a C2 > 0
298
GUIDO SCHNEIDER, ROBERT L. PEGO, HANNES UECKER
such that the following holds. Let (ho,uo) £ # f W x ^1(^(1)) w^1 ll(^Oilto)||j:/3xi/2 — £2Then there exists a unique solution (/ijv + h,UN + u) with (h,u)\t=i = (ho,uo) of the inclined plane problem and an A > — 1 such that, in suitable coordinates,
t^Ht^x^)
- rA(X)
m
+
t^u(t1/2x,y,t)-rA(x)(2y
(2)
Hi
Note that the estimate implies sup|ft(a;,i)| +
sup
\u(x,y,t)\
= 0(tb
2
).
2. T h e K S - p e r t u r b e d KdV-equation as amplitude equation If the inclination angle is increased the Nusselt solution becomes unstable via a sideband instability. Above the threshold of instability after some rescaling the Kuramoto-Shivashinsky (KS)-perturbed Korteweg-de Vries (KdV)-equation dtu = — d%u — -dx{v?) — e{d2x + d^)u,
u = u(x,t)
e
x e
i>0,
(3)
where 0 < e « \J8 — 6C
3. Existence of pulse solutions For e — 0 we have the well known KdV-equation for which there exists a two-dimensional family Mo = {u(x, t) = uc(x - ct + x0) : x0 G R, c > 0},
uc(y) = 3c sech2
(-y/cyj
of solitary waves. For e > 0 we have an amplitude/speed selection principle, and there exists a unique velocity ce = 7/5 + 0(e) and a one-dimensional family Me
{u(x, t) = u£(x — ct + x0) : x0 €
Local in time and space nonlinear stability of pulses in an unstable medium
299
of solitary waves for (3), with \\us - uCe||oo = 0(e) [7]. In particular ||U £ ||L°° = 0(1) for e -> 0, and \ue(y)\ < Ce'PoM with constants C and f30 > 0 both 0(1) for e -> 0.
4. Linear stability analysis The perturbation v of u£ in the comoving frame, i.e. u(x,t) satisfies dtv = Bev - dy(u£v) - ^dy(v2)
= Lev -
= ue(x — ct) + v(x — ct,t), \dy(v2)
with B£v = dyV - edyV - edyV + cdyv,
y = x - ct.
For £ > 0 the asymptotic rest state u = 0 for y —• ±oo is unstable due to the continuous spectrum of its linearization Be, i.e. Bjkx
= \{k)e[kx,
with
A(fc) = e(fc2 - fc4) + ick - ifc3.
Since Lc is a relatively compact perturbation of Be, both operators possess the same essential spectrum. Hence also ue is unstable [6]. In the KdV case (e = 0), the discrete spectrum of the linearization vt = L0v (considered in L 2 (R)) consists of a double eigenvalue 0 with eigenfunction dXouc which is called translation mode in the following and the generalized eigenfunction dcuc which is called speed/amplitude mode in the following. In [8] it is shown that for e > 0 sufficiently small the Jordan-block present for e = 0 vanishes and that the operator Le has the simple eigenvalue Ai = 0 with eigenfunction
A(ifc)^\ A(ifc—a)
\ +';>
Figure 2. The spectrum of Lc,a eigenvalue for Le,a.
x+
for e > 0 and a = 0, a > 0. The resonance pole x for L £ | o becomes an
300
GUIDO SCHNEIDER, ROBERT L. P E G O , HANNES U E C K E R
To deal with the unstable continuous spectrum of Le and to extract the local in space behavior of (3) near u£ we introduce the weighted variable w — veay, with a > 0 chosen below. This gives dtw = LEiaw for the linearized system where Le,a = -{dy-
of -e[(dy
- a)2 + (dy - a) 4 ] + c(dv - a) -
dy{uev).
The first effect of the weight is that the essential spectrum of Le is shifted to \a(k)
= -(ifc - o) 3 - e((ijfc - a) 2 + (ifc - a) 4 ) + c(ik - a).
In particular, for e > 0 sufficiently small, there exist values of a > 0 such that the essential spectrum of L£ifl is completely in the left complex half plane and O(l) away from the imaginary axis. In detail, if we choose such an a > 0, then there exist constants a > 0 and <7i > 0, both 0(1), such that Re Aa(fc) < —a — a\k2. Secondly, for suitable a — 0(1) not too large, the resonance pole A# becomes an eigenvalue for Le>a since the function ip# = eay4># decays exponentially to 0 for both y —> ±oo. Thus, the spectrum of the operator L £j0 consists of an eigenvalue zero, an eigenvalue which is negative of order 0(e) and the rest is strictly left of {Re z = — a} for some a > 0 independent of e. Therefore, on the linear level the evolution of w is dominated by the modes associated to ip± and if;#, i.e., we obtain w(y, t) = ci^i(y) + e A *'c # V#(j/) + 0(e-^2),
(4)
with c\ and c# determined by initial data.
5. The local nonlinear stability result In order to make a nonlinear result out of (4) the v and w variables have to be considered simultaneously. This is due to the fact that in the pure w-formulation of (3) the nonlinear term is given by dy(e~ayw2) which is not bounded from H1 to L2 due to the factor e~ay. Then due to the linear instability of the pulses in the v variable only a local in time stability result can be expected which coincides with the experimental observations of the dynamics of the system. In order to prove the result we write the nonlinear equation for w as dtw = Leiaw + -avw -
-dy(vw),
where we expressed the nonlinearity using the unweighted variable v. By this choice the nonlinearity in the w equation is linear in w, i.e. H e - ^ l l t f i = ||H|tfi < ||u|MM|i/i. Next, in the equation for v due to the decay rates of ue for \y\ —> co the term dy(u£v) will be replaced by G(w)(y) = dy(u£(y)e~ayw(y)), i.e. we rewrite the equation for v as dtv = BEv - G(w) -
\dy{v2).
Local in time and space nonlinear stability of pulses in an unstable medium
301
According to the linear stability analysis we define L£iQ-invariant projections in L 2 (R), namely Pi to span{V>i}, P2 to span{V>#}, and Ps to the rest of L£
\dy{v2),
dtivc = Acwc — ^Pc(dy(vw)
— avw),
dtws = Asws - \Ps(dy{vw)
- avw),
dtv = Bv-
(5)
with wc = Pcw, Pc = Pi+ P2, ws = Psw and Ac = L€taPc, As = L£
sup ||u(r)|| ff n,
8w{t)=
0
sup
\\w(T)\\Hn.
0
Then there exist constants a > 0 and £0 > 0 such that for all T0 > 0 there exist C > 0, Ci > 0 such that the following holds for all e € (0, £o)- F°r a^ <^(0) > 0 and all 5W(0) > 0 satisfying e-lT0 (^-
+ Sv(0)\
we have solutions v,wc,ws
and
max [e^To,
(e^To)1'2} ^(0) < C
of (5) satisfying
5v(T0/e) < C25v(0),
5w{TQ/e) < C25w{0),
IKWIIff- < e-at\\ws(0)\\Hn+C25w(0)5v(0),
t G [0,To/e].
(6) (7)
Remark 5.1. The assumptions on 5V(0) and 5W(Q) are for instance satisfied if MO) < Ce
and
8W(0) < e6v(0) < Ce2 ,
for C > 0 sufficiently small, but independent of e. Remark 5.2. Theorem 5.1 is an attractivity result since ws which is initially of order O(Sw(0)) is attracted into a neighborhood of the pulse of size 6*2^(0)^(0) with an exponential rate. This happens on a time scale 0{\ ln(e)j). Theorem 5.1 is a stability result since the solutions then stay close to the pulse on the long time interval 0 ( l / e ) . Remark 5.3. It is the purpose of future research to prove the attractivity and local stability of the individual pulses ue also for bigger domains of attraction, namely Sv(0) = 0(1) and 5W(0) = 0(1) in H1, on the long time interval 0 ( l / e ) , by using the orbital stability of the KdV-pulses in Hl. In Hn spaces, for n > 2, we do not expect such a result and suppose that a result of the above form is optimal. By handling the ^-variable with a priori energy estimates [9], it is an easy exercise to establish such a result on an 0(l)-time interval where we have pure KdV-dynamics.
302
GUIDO SCHNEIDER, ROBERT L. PEGO, HANNES UECKER
P r o o f of T h e o r e m 5.1. The variation of constants formula for (5) yields v(t) = eBtv(0) - J wc(t) = e^wc(0)
e B ( ' - ) ( G ( » c + ws) +
+ ^J
eA^-^Pc[(avw
\dy{v2)){T)dT,
- dy(v(wc +
1 /•* ws(t) = eA»*u;s(0) + - / eA^-T^Ps[(avw * Jo /o
WS)))(T)]CIT,
- dy(v(wc + ws)))(r)]dT.
The linear semigroups satisfy \\em\\Hn^Hn
< CeCst,
Ue^Htfn^H- < C,
\\eA^\\Hn_H,
< Ce""*,
for constants C, a > 0 independent of e G (0,£o). Moreover, we have the smoothing properties
lleBtu
Hn
eA^u\\Hn
^ C e ^ ^ l + Buple-^^Jfc^HullHn-!
+
(st)-i)\\u\\Hn^
< Ce~at (l + sup | e - Cfe2 *fc|) ||u|| ff »-i < Ce~at (l + r *) H ^ - i , fceR
where we used that ReAa(fc) < — a — o\k2. Thus, for 0 < t < £ _ 1 T 0 we obtain Sv(t) < C6V(0) + Ce-lT0{5w{t)
+
5v{tf),
5w(t) < C8W{0) + Cmax {e^T,,, (e^T,,)* }
5v(t)5w(t).
Hence 8v{t) and 5w(t) are uniformly bounded by initial data for 0 < t < e _ 1 T 0 if e
"
l T
o ( j ® + M O ) ) >0
and
max{£-1T0,(£-1T0)5}<Jv(0)>0
are sufficiently small. Moreover, we obtain KWIIff" < < 7 e - C T t | | ^ ( 0 ) | | ^ + C /
e-^t-r\t-T)-i\\v(T)\\Hn\\iv(T)\\HndT
Jo
+C2Sv(0)5w(0),
which completes the proof of Theorem 5.1.
•
6. Conclusions: Interpretation in a>space and exponentially growing tails The splitting w = wc + ws cannot be transformed to the v variable uniformly in space due to the unboundedness of (j>#{y) = e —ay, 0#(y)- Hence we consider the solutions on the semi-infinite interval [—?/o,oo) for yo arbitrary but fixed and finite. On this interval we see a drift of the pulse and some multiple of the second eigenfunction. Since the tu s -part is of order Sv(0)Sw(0) + e~at\\ws(0)\\[fn, it is much smaller after a time ti = 0(\ hi(e)|) than the
Local in time and space nonlinear stability of pulses in an unstable medium
303
part W2 = P2WC. This means that, although slowly damped in time, W2e ay will emerge out of v, showing locally an exponentially growing tail after the pulse. In detail, we have v(y, t) = ai (t)(f>i (y) + a# {t)<j># (y) + ws (y,
t)Q-ay
with |ai(t)| + |a 2 (i)| < C6W(0). Using sup„ 6l _ OTi0o) |u(i/)e-°»| < e°lw>l||u||L- and K ( t ) I U - < K ( i ) | | « » < C8w(0)6v{0) +
Ce-at\\wl{0)\\Hn,
we find sup
\v(y,t) -
ai(t)Mv)
~ a#(t)^#(y)\
< CeaM(5U0)Sv(0)
+
e-at\\ws(0)\\Hn),
y€\-Vo,oo)
which is much smaller than the order O(6w(0)) of a\{t)(f)\{y) + a,2(t)(f)#(y) on [—yo,oo) for t>t\.
Acknowledgments The work of Hannes Uecker is partially supported by the Deutsche Forschungsgeminschaft DFG under the grant Ue60/1. The work of Robert Pego is partially supported by the National Science Foundation under grants DMS 00-72609 and DMS 03-05985.
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12.
J. Bricmont, A. Kupiainen, G. Lin, Commun. Pure Appl. Math. 47, 893-922 (1994). H.-C. Chang, E. A. Demekhin, Adv. Appl. Mech. 32, 1-58 (1996). H.-C. Chang, E. A. Demekhin, D. I. Kopelevich, Physica D 97, 353-375 (1996). H.-C. Chang, E. A. Demekhin, E. Kalaidin, SIAM J. Appl. Math. 58, 1246-1277 (1998). H.-C. Chang, E. A. Demekhin, Complex Wave Dynamics on Thin Films, Studies in Interface Science 14, Elsevier, 2002. D. Henry, Geometric Theory of Semilinear Parabolic Equations, Lecture Notes in Mathematics 840, Springer, 1981. T. Ogawa, Hiroshima Math. J. 24, 401-422 (1994). T. Ogawa, H. Susuki, SIAM J. Appl. Math. 57, 485-500 (1997). R. L. Pego, M. I. Weinstein, Comm. Math. Phys. 164, 305-349 (1994). G. Schneider, H. Uecker, in preparation. Y. Teramoto, J. Math. Kyoto Univ. 32, 593-619 (1992). J. Topper, T. Kawahara, J. Phys. Soc. Japan 44, 663-666 (1978).
Some mathematical problems of statistical hydrodynamics ARMEN SHIRIKYAN
(U. Paris-Sud XI)
We give a survey of recent advances in the problem of ergodicity for a class of dissipative PDEs perturbed by an external random force. One of the main results in this direction asserts that, if the external force is sufficiently non-degenerate, then there is a unique stationary measure, which is exponentially mixing. Moreover, a strong law of large numbers and a central limit theorem hold for solutions of the problem in question. The results obtained apply, for instance, to the 2D NavierStokes system and to the complex Ginzburg-Landau equation.
1. Introduction Let us consider the two-dimensional Navier-Stokes system in a bounded domain D e l with smooth boundary dD: u-\- (u, V)tt — v&u + Vp = r](t,x),
divu = 0,
x G D.
2
(1)
Here u = («i,W2) is the velocity field of the fluid, p = p(t,x) is the pressure, v > 0 is the viscosity, and r) is an external force. Equation (1) is supplemented with the Dirichlet boundary condition u\dD = 0. (2) It is well known that, under some mild regularity assumptions on the right-hand side -q, the Cauchy problem for (1), (2) has a unique solution defined on the half-line t > 0. We shall describe some recent results on asymptotic behaviour of solutions as t —> +oo in the case when 77 is a random process. Let us give a more precise formulation of the problems we are dealing with. Suppose that the right-hand side r](t,x) depends on an additional random parameter w. Thus, it can be considered as a stochastic process valued in a functional space, and we shall assume that r\ is stationary in time and smooth in x. In this case, solutions of (1), (2) are also random processes, and we can consider their distributions, which are probability measures on the (infinite-dimensional) phase space of the problem. We are interested in the following questions: (i) Existence of stationary solutions, i.e., solutions whose distribution does not depend on time. (ii) Uniqueness of distribution of stationary solutions. (iii) Large-time asymptotics of distributions of solutions, (iv) Large-time asymptotics of the time average of some observables. The aim of this paper is to give a survey of some recent results concerning the above questions and to formulate some open problems.
304
Some mathematical problems of statistical hydrodynamics
305
The paper is organised as follows. Section 2 is devoted to the initial value problem for (1), (2). In section 3, we recall the concept of stationary measure and exponential mixing and formulate a fundamental result on stationary measures for problem (1), (2). Section 4 is devoted to the law of large numbers (LLN) and central limit theorem (CLT) for solutions. Finally, in section 5, we describe two open problems. Most of the results of this paper are obtained in collaboration with Sergei Kuksin.
2. Cauchy problem We fix a bounded domain D c R 2 with smooth boundary dD and introduce the space H = {u € L2(D, R 2 ) : divu = 0, (u,
n)\gD=0},
where n is the outward normal to dD. Let II: L2(D, M2) —» H be the orthogonal projection onto H. Applying formally the operator II to (1), we obtain the following evolution equation in H: u + vLu + B(u) = r)(t), (3) where L = —IIA is the Stokes operator, B(u) = U(u, V)u is the bilinear form resulting from the nonlinear term in (1), and we retained the notation for the right-hand side. We shall assume that the right-hand side of (3) is a random process white in time and smooth in the space variables. Namely, let us denote by {e,} an orthonormal basis in H formed of the eigenvectors of the Stokes operator. We assume that
v(t) = g-tat), c(t) = X>&(*te,
(4)
where {Pj } is a sequence of independent standard Brownian motions denned on a probability space (fi, T, P) and bj > 0 are some constants such that OO
B:=2>2
(5)
j=i
Condition (5) ensures that a.e. realisation of C(t) is a continuous function of time with range inH. Consider the initial value problem for (3), (4): u(0) = uo,
(6)
where u§ is an H-valued random variable independent of £. Let us set V = H f) HQ, where H& = H&(D,R2) is the space of vector functions that belong to the Sobolev space of order 1 and vanish at 3D. Definition 2.1. A random process u(t) defined for t > 0 is called a solution of (3), (4), (6) if it possesses the following properties: (i) The process u(t) is progressively measurable with respect to the filtration Tt generated by Mo and £> and its almost every realisation belongs to the space C(R.+ ,H) ("1
306
ARMEN SHIRIKYAN
(ii) With probability one, for any t >0 we have u(t) -u0+
Jo
(vLu + B(u)) ds = C(t),
where the left- and right-hand sides of this relation are regarded as elements of the dual space of HQ. The following theorem, which gives, in particular, the existence and uniqueness of a solution for the problem (3), (4), (6), is established in [29] (see also [3,8]). Theorem 2.1. Suppose that condition (5) is satisfied, and let UQ be an H-valued random variable independent of £. Then the following assertions hold: (i) The problem (3), (4), (6) has a unique solution u(t,uo) in the sense of Definition 2.1. (ii) The set of solutions u(t, v) corresponding to all deterministic initial functions v G H form a Markov process in the space H. Let B(H) be the family of Borel subsets of H, let V(H) be the set of probability measures on (H,B(H)), and let Cb{H) be the space of bounded continuous functionals / : H —> K. We shall denote by Pt(v,T) the transition function of the Markov process constructed in Theorem 2.1:
pt(t>,r) = p{u(*,«)er},
VGH,
T&B(H).
Finally, let us recall the definition of Markov semi-groups tyt and *#*[ corresponding to the transition function Pt(v,T). Namely, for / G Cb(H) and \x G V(H), we set
Mv) = / pt(v,dz)f(z), qjt>(r) = / pt(v,r)^dv). JH
JH
3. Stationary measures: Existence, uniqueness, and exponential mixing Let us recall that a measure \x G V(H) is said to be stationary for (3), (4) if $t*M = fj. for all t > 0. We denote by C{H) the space of functionals / G Cb(H) such that l l / k := sup |/(u)| + sup 1 ^ " ) - ^ ) ! < oo, where || • || is the natural norm in L2. Definition 3.1. A stationary measure \x is said to be exponentially mixing if there is a constant 0 > 0 and an increasing function h : E + —> R + such that, for any v G H and / G C(H), we have \%tf(v)-{f,n)\
t>0,
(7)
307
Some mathematical problems of statistical hydrodynamics The following result is of fundamental importance for applications.
Theorem 3.1. Suppose that condition (5) is satisfied. Then the assertions below take place for any v > 0. 1. Existence and a priori estimate: There exists a stationary measure LL € V(H). Moreover, there are positive constants c and C not depending on v such that any stationary measure ii satisfies the inequality
L
ecv"u" Li(du) < C.
2. Uniqueness: such that, if
(8)
There is an integer N = iV„ > 1 depending on the constant B in (5) bj^O
for j = l,...,N,
(9)
then the stationary measure \x is unique. 3. Exponential mixing: If (9) holds, then the stationary measure LI is exponentially mixing in the sense of Definition 3.1 with h{r) = const(l +r). Theorem 3.1 implies two important corollaries. The first of them concerns the asymptotic behaviour of the distributions of solutions to the Cauchy problem. Corollary 3.1. Under the conditions of Theorem 3.1, for any H-valued random variable UQ that is independent of C, and has a finite mean value, we have T>(u(t,uo)) —^ n
as t —> +oo,
(10)
where £>(£) is the distribution of the random variable £, and the convergence in (10) is understood in the weak* topology ofV(H). To formulate the second corollary, we introduce a space of Holder continuous functionals with at most exponential growth at infinity. Namely, for a £ (0,1] and e > 0 we denote by Ca(H, s) the space of continuous functionals / : H —> E such that ii fII
ll/lk£
._ «„n
!/(")!
- ™£ exp( e ||«P)
,m
+
n
!/(")-/(")!
^_
T/v (exp( £ ||«|P) + exp(e|MP)) ||U - V ||«
<
Corollary 3.2. Under the conditions of Theorem 3.1, for any a £ (0,1] and sufficiently small e > 0 there are positive constants C and 7 depending on v such that |E/(«(t,uo)) - (/,M)| < C e - ^ E e x p ^ K I I 2 ) ,
t > 0,
(11)
where f £ Ca{H,e) is any functional with norm ||/||a, £ < 1 anduo is an arbitrary H-valued random variable independent ofXLet us make some comments on the history of the problem of ergodicity for the 2D NavierStokes system. The existence of a stationary measure is established in [29] with the help of the Bogolyubov-Krylov argument (see also [8]). A proof of the a priori estimate (8) can be found in [1,25], The problem of uniqueness and exponential mixing, which is much more delicate, was in the focus of attention during the last few years. The first result in this
308
ARMEN SHIRIKYAN
direction was obtained by Flandoli and Maslowski [9]. Under the condition that the righthand side rj of equation (3) is sufficiently irregular with respect to the space variables, they established the uniqueness of stationary measure and convergence to it in the variational norm. Their results were refined later by Ferrario [7]. We also mention Mattingly's paper [22] devoted to the case v ~S> 1. The first uniqueness result that allows the right-hand side to be infinitely smooth with respect to the space variables and applies to all v > 0 was established by Kuksin and the author [13] (see also [14,17]). We considered the case in which the right-hand side rj is a random process of the form oo
v(t) = Y,Vk(x)6(t-k),
(12)
fc=i
where {%} is a sequence of i.i.d. random variables in H with sufficiently non-degenerate distribution. The proof in [13] is based on a new approach involving a Lyapunov-Schmidt type reduction and a version of the Ruelle-Perron-Frobenius theorem. E, Mattingly, Sinai [4] and Bricmont, Kupiainen, Lefevere [2] studied later the case when the space variables x belong to the torus and the right-hand side r/ is white noise in time and trigonometric polynomial in x. They showed that there is at most one stationary measure. Moreover, it was established in [2] that, for /u-a.e. initial function (where fi is the stationary measure), the corresponding solution converges to /x in distribution exponentially fast. We also mention the paper [6] by Eckmann and Hairer in which an infinite-dimensional version of the Malliavin calculus is developed to study the problem of ergodicity for the real Ginzburg-Landau equation perturbed by a rough degenerate forcing. In the papers [11,15,16,18,26] another approach based on coupling of solutions was developed to establish uniqueness of stationary measure and exponential convergence to it for all initial functions. The idea of coupling was also used in the papers by Mattingly [23], Masmoudi and Young [21], and Hairer [10]. Some further properties of random dynamical systems generated by stochastic PDEs are studied in [19,20]. In particular, it is proved in [20] that the support of the Markov disintegration of a mixing stationary measure is a minimal random point attractor. Finally, as is shown in [12,27], Theorem 3.1 implies a strong law of large numbers and central limit theorem for solutions of the problem (3), (4). The corresponding results are discussed in the next subsection. The approach developed in the papers [11,15,16,18,26] applies not only to the 2D NavierStokes system, but also to a large class of dissipative PDEs perturbed by a random force of the form (4) or (12). For instance, an analogue of Theorem 3.1 is valid for the complex Ginzburg-Landau equation it- (v + ia)Au + iX\u\2u = rj(t).
(13)
We shall not dwell on the details in this paper.
4. Law of large numbers and central limit t h e o r e m To simplify the presentation, we shall confine ourselves to bounded uniformly Lipschitz functionals. The following result on strong law of large numbers for solutions of the problem (3), (4) is established in [12,27].
Some mathematical problems of statistical hydrodynamics
309
Theorem 4.1. Suppose that (5) holds and the non-degeneracy condition (9) is satisfied with sufficiently large N > 1. Then for any e € (0, g) there is a constant D > 0 such that, for an arbitrary initial function UQ € H and any functional f G C{H), the following statements hold: (i) There is a random variable T(to) > 1 depending on e, v, and f such that r
1
/ f(u(s,u0))ds-(f,ii) Jo
for
(ii) The random variable T is almost surely finite. Moreover, if0
t>T(u). then
+ \\u0\\2)\\ffc.
We note that Theorem 4.1 remains valid for functionals of the class Ca(H,e) denned in section 3 (see [27]). Moreover, if the problem is studied on the two-dimensional torus T 2 , and the right-hand side r/(t, x) is infinitely smooth with respect to the space variables, then in Theorem 4.1 we can take Holder continuous functionals on any Sobolev space Hs. For instance, one can consider m-point correlation tensors of the form f{u) = ^(a;1) • • -u(xm), where a; 1 ,..., xm £ T 2 are given points. We now discuss the CLT for the problem (3), (4). For any / e C(H) satisfying the condition (/, /x) = 0, we introduce the functional /•OC
g{u) = /
ysf(u)
ds,
u<= H.
Inequality (7) implies that g is well defined. Furthermore, we introduce a non-negative constant
J /(u(s))ds)
> 0,
where u(s) is a stationary solution with distribution //. Hence, the constant 07 > 0 is well denned by relation (14). For any a > 0, we denote by $ ff (r) the one-dimensional centred Gaussian distribution with variance a: *„(r) = - 7 = T
e-^l^ds.
Finally, for a = 0, we set ( 1, r > 0 , { 0, r < 0. The following result on central limit theorem for solutions of the problem (3), (4) is established in [12,27]. Theorem 4.2. Suppose that (5) holds and the non-degeneracy condition (9) is satisfied with sufficiently large N > 1. Then the following statements hold:
310
ARMEN SHIRIKYAN
(i) For any a > 0 there is a function h&(ri,r2) > 0 defined on R+ x R + and increasing in both arguments such that, for any f S C(H) satisfying the conditions af > a and (/, n) = 0, we have sup ' { t ~ * j [ f(u(s,u0))ds
where t > 1 and UQ € H. (ii) There is a function h(r\,r2) > 0 defined on R+ x E + ana! increasing in both arguments such that, for any f £ C(H) satisfying the conditions ay = 0 and (/, fj.) = 0, we have sup
(\z\ A l) 'It'1* j*f(u(s,u0))ds
J
w/iere £ > 1 anrf uo G -ff • As in the case of LLN, the above theorem remains valid for a class of Holder continuous functionals growing at infinity. Furthermore, for the problem with periodic boundary conditions and smooth (in x) right-hand side, we can consider continuous functionals on Sobolev spaces of arbitrarily large order.
5. Open problems Many questions remain unsolved in the theory of randomly forced 2D Navier-Stokes equations. Let us formulate two of them. 5.1. Uniform condition for ergodicity Let us consider the NS system (1) either in a bounded domain D C R 2 , with Dirichlet boundary condition, or on the torus T 2 = R 2 /Z 2 . Theorem 3.1 ensures that, for any v > 0, there is an integer iV„ > 1 such that, if condition (9) is satisfied with JV = Nv, then (1) has a unique stationary measure, which is exponentially mixing. In particular, if bj =/= 0 for all j > 1, then a stationary measure is unique for any v > 0. Problem 1. Prove the following assertion: there is a finite integer N > 1 not depending on v such that, if condition (9) is satisfied, then for any v > 0 there is a unique stationary measure for (I), which is exponentially mixing. 5.2. Problems with continuous spectrum As was mentioned in Section 3, Theorem 3.1 remains valid for a large class of type (3) dissipative PDEs perturbed by a sufficiently non-degenerate random force of the form (4) or (12), provided that the problem is considered in a bounded domain (and therefore the spectrum of the linear operator L is discrete). On the other hand, the Bogolyubov-Krylov argument enables one to construct stationary measures for some equations in unbounded domains (e.g., see [5,24]).
Some mathematical problems of statistical hydrodynamics
P r o b l e m 2. Find a sufficient condition that guarantees stationary measure for equations in unbounded domains.
uniqueness
and ergodicity
311
of a
References 1. J. Bricmont, A. Kupiainen, R. Lefevere, J. Stat. Phys. 100, 743 (2000). 2. J. Bricmont, A. Kupiainen, R. Lefevere, Comm. Math. Phys. 230, 87 (2002). 3. G. Da Prato J. Zabczyk, Ergodicity for Infinite-Dimensional Systems, Cambridge University Press, Cambridge, 1996. 4. Weinan E, J. C. Mattingly, Ya. Sinai, Comm. Math. Phys. 224, 83 (2001). 5. J.-P. Eckmann, M. Hairer, Nonlinearity 14, 133 (2001). 6. J.-P. Eckmann, M. Hairer, Comm. Math. Phys. 219, 523 (2001). 7. B. Ferrario, Stochastics Stochastics Rep. 60, 271 (1997). 8. F. Flandoli, Nonlinear Differential Equations Appl. 1, 403 (1994). 9. F. Flandoli, B. Maslowski, Comm. Math. Phys. 171, 119 (1995). 10. M. Hairer, Probab. Theory Related Fields 124, 345 (2002). 11. S. B. Kuksin, Partial Differential Equations. Mark Vishik's Seminar, edited by M. S. Agranovich and M. A. Shubin, American Mathematical Society, Providence, RI, 2002, p. 161. 12. S. B. Kuksin, Rev. Math. Phys. 14, 585 (2002). 13. S. B. Kuksin, A. Shirikyan, Comm. Math. Phys. 213, 291 (2000). 14. S. B. Kuksin, A. Shirikyan, Math. Phys. Anal. Geom. 4, 147 (2001). 15. S. B. Kuksin, A. Shirikyan, Comm. Math. Phys. 221, 351 (2001). 16. S. B. Kuksin, A. Piatnitski, A. Shirikyan, Comm. Math. Phys. 230, 81 (2002). 17. S. B. Kuksin, A. Shirikyan, Ergodic Theory Dynam. Systems 22, 1487 (2002). 18. S. B. Kuksin, A. Shirikyan, J. Math. Pures Appl. 8 1 , 567 (2002). 19. S. B. Kuksin, A. Shirikyan, Proc. Roy. Soc. Edinburgh Sect. 133A, 1 (2003). 20. S. B. Kuksin, A. Shirikyan, Fund. Anal. Appl. 37 (2003), to appear. 21. N. Masmoudi, L.-S. Young, Comm. Math. Phys. 227, 461 (2002). 22. J. C. Mattingly, Comm. Math. Phys. 206, 273 (1999). 23. J. C. Mattingly, Comm. Math. Phys. 230, 421 (2002). 24. J. Rougemont, Comm. Math. Phys. 225, 423 (2002). 25. A. Shirikyan, Russian Math. Surveys 57, 785 (2002). 26. A. Shirikyan, J. Math. Fluid Mech. 5, 1 (2003). 27. A. Shirikyan, Prepublication 2003-05, Orsay (2003); see h t t p : //www. math. u-psud. f r/"airmen. 28. R. Temam, Navier-Stokes Equations: Theory and Numerical Analysis, North-Holland, Amsterdam, New York, 1979. 29. M. I. Vishik, A. V. Fursikov, Mathematical Problems of Statistical Hydrodynamics, Dordrecht, Kluwer, 1988.
This page is intentionally left blank
General relativity Session organized by A.
RENDALL
(Potsdam) and R.
BARTNIK
(Canberra)
This page is intentionally left blank
Formation of singularities in Yang-Mills equations P I O T R BIZON (Jagellonian U., Krakow) In this talk we report on recent studies of singularity formation for solutions of spherically symmetric Yang-Mills equations in 5 + 1 dimensions. These equations exhibit interesting similarities with the Einstein equations in the physical dimension, in particular the dynamics at the threshold of singularity formation shares many features (such as universality, self-similarity, and scaling) with critical phenomena in gravitational collapse.
1. Introduction One of the most interesting features of many nonlinear evolution equations is the spontaneous onset of singularities in solutions starting from perfectly smooth initial data. Such a phenomenon, usually called "blowup", has been a subject of intensive studies in many fields ranging from fluid dynamics to general relativity. Whether or not the blowup can occur for a given nonlinear evolution equation is the central mathematical question which, from the physical point of view, has a direct bearing on our understanding of the limits of validity of the corresponding model. Unfortunately, this is often a difficult question and we do not know the answer for some major equations like the Navier-Stokes equations or the Einstein equations. Once the existence of blowup is established for a particular equation, many further questions come up, such as: When and where does the blowup occur? What is the character of blowup and is it universal? Can a solution be continued past the singularity? In this talk we consider these questions for the Yang-Mills (YM) equations in higher dimensions. In the physical 3+1 dimensions, where the YM equations are the basic equations of gauge theories describing the weak and strong interactions of elementary particles, it is known that no singularities can form. This was shown by Eardley and Moncrief [1] who proved that solutions starting from smooth initial data remain smooth for all future times. The study of the YM equations in unphysical N+l dimensions for N > 3 is motivated by the hope that by understanding the problem of singularity formation for the YM equations one might get insight into the analogous, but much more difficult, problem in general relativity. Prom this viewpoint — in which the YM equations are considered as a toy model for the Einstein equations — it is essential that these two equations belong to the same criticality class. Let us explain what this means. Consider a nonlinear wave equation which is scale invariant (most equations of mathematical physics have this symmetry), that is, if u(x) is a solution, so is u\(x) = Xau(x/\) for a certain constant a. Then, the energy of the solution E(u) scales as a homogeneous function of degree (3 (called the criticality) E(u\) = X^E(u). Depending on the sign of (3 one classifies equations into subcritical (/? < 0), critical (/3 = 0), and supercritical ((3 > 0). This classification is important because it serves as a guiding principle in making conjectures about the possibility of blowup: subcritical equations are expected to be globally regular, while supercritical equations may develop singularities for some (large) initial data [2]. This principle is based on the observation that in the subcritical case solutions cannot concentrate on arbitrarily small scales (large frequencies) because it
315
316
PlOTR BlZON
would cost an infinite amount of energy. In contrast, for supercritical equations the transfer of energy to large frequencies may be energetically favorable so singularities are anticipated. As we shall see below, the criticality for the TV+1 YM equations is PYM = TV—4, while for the Einstein equations PE — TV —2. Thus, the YM equations in TV = 5 have the same criticality, (3 = 1, as the Einstein equations in TV — 3. This gives a hope that by understanding the dynamics of YM equations in five space dimensions we can learn something about the Einstein equations in the physical dimension.
2. S e t u p We consider Yang-Mills fields in TV + 1 dimensional Minkowski spacetime (in the following Latin and Greek indices take the values 1,2,..., TV and 0,1, 2 , . . . , TV respectively). The gauge potential Aa is a one-form with values in the Lie algebra g of a compact Lie group G. In terms of the curvature Fap = daAp — d$Aa + [Aa, Ap) the action is S = ^ J Tv(Fa/3Fal3) dNx dt,
(1)
where e is the coupling constant. It is customary to set e = 1 and we shall also do so in the following. However, we point out in passing that the dimension of e depends on TV, namely [e2] = M~lLN~4 (in c = 1 units); in particular e2 has the same dimension in TV = 5 as the Newton constant G in TV = 3. This is another way of seeing the analogy between the 5 + 1 dimensional YM equations and the Einstein equations in the physical dimension. The YM equations derived from the action (1) are daFal3 + [Aa,Fa0}=O. These equations are scale invariant: if Aa(x) is a solution of (2), so is Aa(x) = The positive definite conserved energy E=
f
IV (Fgi + F*)dNx
(2) \~1Aa(x/\).
(3)
JRO
scales as £"(^4) = \N~iE(A), hence the YM equations are subcritical for TV < 3, critical for TV = 4, and supercritical for TV > 5. Since we want to use the YM equations as a toy model for the Einstein equations in the physical dimension, henceforth we restrict ourselves to the case TV = 5 (we note aside that, in agreement with the heuristic principle relating scaling and regularity, for TV = 3 the initial value problem for the YM equations is well posed in the energy norm [3]). For simplicity, we take here G = 50(5) so the elements of so(5) can be considered as skew-symmetric 5 x 5 matrices and the Lie bracket is the usual commutator. Assuming the spherically symmetric ansatz [4] A^(x)
= (5^-6i^)1-WJt'r\
(4)
the YM equations (2) for TV = 5 reduce to the scalar semilinear wave equation for the magnetic potential w(t,r) 2 3 —wtt + wrr + -wr + - 2 ^ ( 1 - w2) = 0.
(5)
Formation of singularities in Yang-Mills equations
317
The central question for equation (5) is: can solutions starting from smooth initial data w(0,r)=f(r),
wt(0,r)=g{r)
(6)
become singular in future? Before describing the numerical studies of this problem, we first need to learn about the structure of self-similar solutions of equation (5). As we shall see, these solutions play a key role in understanding the nature of blowup.
3. Self-similar solutions By definition, self-similar solutions are invariant under scaling w(t,r) —> w(t/\,r/\), they have the form w(t,r) = W(r]),
V=i^t,
hence (7)
where a positive constant T, clearly allowed by the time translation invariance, is introduced for later convenience. Substituting the ansatz (7) into equation (5) one obtains the ordinary differential equation W" + -W
+
2
,,3
9,W(1
- W2) = 0.
(8)
The requirement of regularity inside the past light cone of the singular point (t = T, r = 0), that is for 0 < r\ < 1, enforces the boundary conditions W{0) = ±1
and
W(l) = 0.
(9)
It was shown in [5] that this boundary value problem has a countable family of smooth solutions W„, where the integer index n = 0 , 1 , . . . denotes the number of zeros of Wn{rj) on the interval 0 < r/ < 1. The ground state solution of this family is known in closed form
WoiT))
= ITp-
(10)
The role of self-similar solutions in dynamics depends crucially on their stability with respect to small perturbations. It was shown in [5] that the solution Wn has exactly n unstable modes. Thus, the ground state solution Wo can appear as a generic attractor while the higher n solutions are not expected to participate in the dynamics of generic initial data.
4. Numerical simulations Having learned about self-similar solutions, we are now prepared to understand the results of numerical studies, first reported in [6], of the initial value problem for equation (5). The main goal of these studies was to determine the asymptotics of blowup. Our numerical simulations were based on finite difference methods combined with adaptive mesh refinement. The latter were instrumental in resolving the structure of singularities developing on vanishingly small scales. We stress that a priori analytical insight into the problem, in particular the knowledge of self-similar solutions was very helpful in interpreting the numerical results.
318
PlOTR BlZON
We solved equation (5) for a variety of initial conditions interpolating between small and large data. A typical example of such initial data is a Gaussian (ingoing or time-symmetric) of the form w(0, r) = 1 - Ar2 exp [-a{r - R)2] , (11) with adjustable amplitude A and fixed parameters a and R. The global behaviour of solutions is qualitatively the same for all families of initial data and depends critically on the amplitude A (or any other parameter which controls the "strength" of initial data).
10
100 1000 10e4 10e5 10e6 10e7
ln(r)
Figure 1. The left plot shows the late time evolution of time symmetric initial d a t a of the form (11) with a = 10, R = 2, and A = 0.2. As the blowup progresses, the inner solution gradually attains the form of the stable self-similar solution Wo(r/(T — t)). The outer solution appears frozen on this timescale. In the right plot the rescaled solutions w(t, (T — t))r are shown to collapse to the profile Wo(r) (solid line).
For small amplitudes the solutions disperse, that is the energy is radiated away to infinity and in any compact region the solution approaches the vacuum solution w = ± 1 . This is in agreement with general theorems on global existence for small initial data. Heuristically, this follows from the fact the for a small amplitude the nonlinearity is dominated by the dispersive effect of the linear wave operator. For large amplitudes we observe the development of two clearly separated regions: an outer region where the evolution is very slow and a rapidly evolving inner region where the solution attains a kink-like shape which shrinks in a selfsimilar manner to zero size in a finite time T. The kink is, of course, nothing else but the self-similar solution Wo(r/(T — t)) (see figure 1). We summarize these findings in the following: Conjecture. Solutions of equation (5) corresponding to sufficiently large initial data do blow up in finite time in the sense that wrr{t,0) diverges as t f T for some T > 0. The universal asymptotic profile of blowup is given by the stable self-similar solution: lim wit, (T • •t)r) tyr
=
W0(r).
(12)
We think that the basic mechanism which is responsible for the observed asymptotic selfsimilarity of blowup can be viewed as the convergence to the lowest "energy" configuration. To see this, note that by rewriting equation (5) in terms of the similarity variable n and the slow time r = — ln(T — t) one can convert the problem of asymptotics of blowup into the problem of asymptotic behaviour for r —> oo. In these variables the wave equation
F o r m a t i o n of singularities in Yang-Mills e q u a t i o n s
319
contains a damping term reflecting an outward flux of energy through the past light cone of the singularity, hence it is natural to expect that solutions will tend asymptotically to the ground state solution Wo. Since solutions with small data disperse and solutions with large data blow up, there arises a natural question what happens in between. Using bisection, we found that along each interpolating family of initial data there is a threshold value of the parameter, say the amplitude A*, below which the solutions disperse and above which a singularity is formed. The evolution of initial data near the threshold was found to go through a transient phase which is universal, i.e. the same for all families. This intermediate attractor was identified as the self-similar solution W\. Having gone through this transient phase, at the end the solutions leave the intermediate attractor towards dispersal or blowup. This behaviour is shown in figure 2.
Figure 2. The dynamics of time-symmetric initial data of the form (11) with amplitudes that are finetuned to the threshold of singularity formation. The rescaled solution w(t, (T — t)r) is plotted against ln(r) for a sequence of intermediate times. Shown (solid and dashed lines) is the pair of solutions starting with marginally critical amplitudes A = A" ± e, where A* = 0.144296087005405. Since e = 1 0 " 1 5 , the two solutions are indistinguishable on the first seven frames. The convergence to the self-similar solution W\ (dotted line) is clearly seen in the intermediate asymptotics. The last two frames show the solutions departing from the intermediate attractor towards blowup and dispersal, respectively.
The universality of the dynamics at the threshold of singularity formation can be understood heuristically as follows. As we mentioned above, the self-similar solution W\ has exactly one unstable mode — in other words the stable manifold of this solution has codimension one and therefore generic one-parameter families of initial data do intersect it. The points of intersection correspond to critical initial data that converge asymptotically to W\. The
320
PlOTR BlZON
marginally critical data, by continuity, initially remain close to the stable manifold and approach W\ for intermediate times but eventually are repelled from its vicinity along the one-dimensional unstable manifold. The universality of the nearly critical dynamics follows immediately from the fact that the same unstable mode dominates the evolution of all solutions. More precisely, the evolution of marginally critical solutions in the intermediate asymptotics can be approximated as w(t,r) = W^rf) + c(A)(T - t ) - 7 ^ ) + radiation,
(13)
where v is the single unstable mode with the positive eigenvalue 7. The small constant c(A), which is the only vestige of the initial data, quantifies an admixture of the unstable mode — for precisely critical data c(A*) = 0. The time of departure from the intermediate attractor is determined by the time t* in which the unstable mode grows to a finite size, i.e., c(A)(T -1*)-"1 ~ 0(1). Using c(A) « c'{A*){A - A*) we get T-f~\A*A\lh. Various scaling laws can be derived from this. For example, consider solutions with marginally sub-threshold amplitudes A = A* — e. For such solutions the energy density 2
e
w;
(^) = 7 f + 7 f +
•}(-, 3(1 v
_ „ „ 22 \ 22
2r4
-w )
'
(14)
initially grows at the center, attains a maximum at a certain time « t* and then drops to zero. Substituting (13) into (14) we get that e(f,0) ~ {T-t)~A, and hence e ( f ,0) ~ e " 4 ^ . For completeness we note a puzzling fact that the numerical computation gives an integer value 7 = 5 (with the accuracy of ten decimal places).
5. Connection with critical phenomena in gravitational collapse The behaviour of solutions near the threshold of singularity formation described above shares many features with critical phenomena at the threshold of black hole formation in gravitational collapse. To explain these similarities, we now briefly recall the phenomenology and heuristics of the critical gravitational collapse. Consider a spherical shell of matter and let it collapse under its own weight. The dynamics of this process, modelled by the Einstein equations, can be understood intuitively in terms of the competition between gravitational attraction and repulsive internal forces (due, for instance, to kinetic energy of matter or pressure). If the initial configuration is dilute, then the repulsive forces "win" and the collapsing matter will rebound or implode through the center, and eventually will disperse. On the other hand, if the density of matter is sufficiently large, some fraction of the initial mass will form a black hole. Critical gravitational collapse occurs when the attracting and repulsive forces governing the dynamics of this process are almost in balance, or in other words, the initial configuration is near the threshold of black hole formation. The systematic studies of critical gravitational collapse were launched in the early nineties by the seminal paper by Choptuik [7] in which he investigated numerically the collapse of a self-gravitating massless scalar field. Evolving initial data fine-tuned to the border between no-black-hole and black-hole spacetimes, Choptuik found the following unforeseen phenomena near the threshold: (i) universality: all initial data which are near the black hole threshold go through a universal transient period in their evolution during which they approach a certain intermediate
Formation of singularities in Yang-Mills equations
321
attractor, before eventually dispersing or forming a black hole. This universal intermediate attractor is usually referred to as the critical solution. (ii) discrete self-similarity: the critical solution is discretely self-similar, that is it is invariant under dilations by a certain fixed factor A called the echoing period. (iii) black-hole mass scaling: for initial data that do form black holes, the masses of black holes satisfy the power law Mbh ~ e 7 where e is the distance to the threshold and 7 is a universal (i.e., the same for all initial data) critical exponent. Thus, by fine tuning to the threshold one can make an arbitrarily tiny black hole. Put differently, there is no mass gap at the transition between black-hole and no-black-hole spacetimes. What Choptuik found for the scalar field, has been later observed in many other models of gravitational collapse, although the symmetry of the critical solution itself was found to depend on the model: in some cases the critical solution is self-similar (continuously or discretely), while in other cases the critical solution is static (or periodic). In the latter case black hole formation turns on with finite mass. These two kinds of critical behaviour are referred to as the the type II or type I criticality, respectively, to emphasize the formal analogy with second and first order phase transitions in statistical physics. We refer the interested reader to [8] for an excellent review of the literature on critical gravitational collapse. The heuristic understanding of critical behaviour in gravitational collapse is based on the dynamical systems picture described above, that is, it is associated with the existence of a critical solution with exactly one unstable mode. This picture leads to some quantitative predictions. In particular, in the case of type II critical collapse, an elementary dimensional analysis shows that the critical exponent 7 in the power law Mbh ~ e 7 is a reciprocal of the unstable eigenvalue of the critical solution. By now, the similarities between type II critical gravitational collapse and the dynamics at the threshold of singularity formation in the 5 + 1 YM equations should be evident. This analogy, together with similar results for wave maps in 3 + 1 dimensions [9], shows that the basic properties of critical collapse, such as universality, scaling, and self-similarity, first observed for the Einstein equations, actually have nothing to do with gravity and seem to be robust properties of supercritical nonlinear wave equations. An advantage of toy models, such as the one presented in this talk, is their simplicity which allowed to get a much better analytic grip on critical phenomena than in the case of the Einstein equations; in particular, it was possible to prove existence of the critical solution. The only characteristic property of type II critical collapse which so far has not found in simpler models (besides, of course, the absence of black holes which are replaced by singularities) is discrete self-similarity of the critical solution. It would be very interesting to design a toy model which exhibits discrete self-similarity at the threshold for singularity formation because this could give us insight into the origin of this mysterious symmetry.
6. Conclusions There are two main lessons that we wanted to convey in this talk. The first lesson is that there are striking analogies between major evolution equations. In particular, the mechanism of blowup to a large extent is determined by the criticality class of the model.
322
PlOTR BlZON
These analogies can be used to get insight into hard problems (such as singularity formation for the Einstein equations) by studying toy models which belong to the same criticality class. This approach is in the spirit of general philosophy expressed by David Hilbert in his famous lecture delivered before the International Congress of Mathematicians at Paris in 1900 [10]: "In dealing with mathematical problems, specialization plays, as I believe, a still more important part than generalization. Perhaps in most cases where we seek in vain the answer to a question, the cause of the failure lies in the fact that problems simpler and easier than the one in hand have been either not at all or incompletely solved. All depends, then, on finding out these easier problems, and on solving them by means of devices as perfect as possible and of concepts capable of generalization." The second lesson is concerned with the interplay between numerical and analytical techniques. Accurate and reliable numerical simulation of singular behaviour is difficult and hard to assess. In order to keep track of a singularity developing on exceedingly small spatiotemporal scales, one needs sophisticated techniques such as adaptive mesh refinement. For these techniques the convergence and error analysis are lacking so extreme care is needed to make sure that the computed singularities are not numerical artifacts. For this reason, in order to feel confident about numerics it is important to have some analytical information, like existence of self-similar solutions. Without a theory, simulations alone do not provide ample evidence for the existence of a singularity. We believe that the interaction between numerical and analytical techniques, illustrated here by the studies of blowup, will become more and more important in future as we begin to attack more difficult problems.
References 1. D. Eardley, V. Moncrief, Coram. Math. Phys. 83, 171 (1982). 2. S. Klainerman, in Progress in Nonlinear Differential Equations and Their Applications, vol. 29, Birkhauser, 1997. 3. S. Klainerman, M. Machedon, Ann. Math. 142, 39 (1995). 4. O. Dumitrascu, Stud. Cere. Mat. 34, 329 (1982). 5. P. Bizon, Acta Phys. Polon. B 33, 1893 (2002). 6. P. Bizon, Z. Tabor, Phys. Rev. D 64, 121701 (2001). 7. M. W. Choptuik, Phys. Rev. Lett. 70, 9 (1993). 8. C. Gundlach, Living Rev. Relativity 2 (1999). 9. P. Bizori, T. Chmaj, Z. Tabor, Nonlinearity 13, 1411 (2000). 10. D. Hilbert, reprinted in Bull. Amer. Math. Soc. 37, 407 (2000).
A family of quasi-local mass functionals with monotone flows HUBERT L. BRAY* (Columbia
U.)
We define a one parameter family of quasi-local mass functionals m c ( £ ) , 0 < c < oo, which are nondecreasing on surfaces in 3-manifolds with nonnegative scalar curvature with respect to a one parameter family of flows. In the case that c = 0, mo(E) equals the Hawking mass of S 2 and the corresponding flow is inverse mean curvature flow. Then, following the arguments of Geroch [8], Jang and Wald [12], and Huisken and Ilmanen [9], we note that the generalization of their results for inverse mean curvature flow would imply that if HIADM is the total mass of the complete, asymptotically flat 3-manifold with nonnegative scalar curvature, then ITIADM > m c ( 2 ) for all nonnegative c and all connected surfaces £ which are not enclosed by surfaces with less area.
1. Definitions Let (M3,g) be an asymptotically flat, time-symmetric (zero second fundamental form) space-like slice of a spacetime. Then the dominant energy condition implies that g has nonnegative scalar curvature, since the scalar curvature Rg at each point equals the local energy density of the spacetime (in the direction orthogonal to M 3 ) . (We note that maximal slices of spacetimes also have nonnegative scalar curvature. Prom this point on we do not need to make reference to the spacetime and only need to use the fact that Rg > 0.) Let £ be any smooth surface in (M3,g), and let y = f(x) be the implicit solution to 2 x = y + y 3 / 2 for x,y > 0. Then for each c € [0,oo), the quasi-local mass functional we define is
^'{w^-kLh^)-^
(1>
where A is the area of £ and H is the mean curvature of £ (the trace of the second fundamental form) in (M3,g). Note that since f(x) ~ x2 for small x, then defining mo by taking the limit as c goes to zero gives
•"•w-l/il^-iaFjM-
(2)
which is the famous Hawking mass of E. In this paper, we will consider flowing surfaces in the outward normal direction with speed 2c Vc
~ f'{cHY
[6)
where for now we are assuming that the mean curvature H of £ is positive. Again, note 'Research supported in part by NSF grant #DMS-0206483.
323
324
HUBERT L. BRAY
that since f'(x) « 2x for small x, then this flow reduces to inverse mean curvature flow
Vo = jj
(4)
in the limit as c goes to zero. For technical reasons which will be explained later, we will define the flows rjc only on surfaces which are not enclosed by any surface with less area. We will call such surfaces outer-minimizing. Note that an outer-minimizing surface has H > 0 since otherwise there would exist an outward variation which would decrease its area.
2. Motivation and intuition The key reason for considering the flow r\c on a surface E to produce a family of surfaces Ec(£) is that m c (E c (£)) can then be shown to be a nondecreasing function of t in 3-manifolds with nonnegative scalar curvature. As long as H > 0 and the flow is smooth, this fact can be verified by direct computation. When the flow is not smooth (and the surfaces "jump" as in Huisken and Ilmanen's inverse mean curvature flow [9]), we strongly conjecture that m c (E c (£)) is still nondecreasing. In fact, it would appear that the monotonicity of m c is true in a completely analogous way to the Hawking mass in the case of inverse mean curvature flow as shown by Huisken and Ilmanen. Furthermore, in the case that the 3-manifold is complete and asymptotically flat (with sufficient decay), we check in special cases and conjecture in general that the total mass rriADM = limt_*oo w c (E c (t)). Thus, these (very reasonable) conjectures imply that "IADM > m c (E)
(5)
for all nonnegative c and connected outer-minimizing surfaces E. The need for this condition on E will be explained in a moment. So what if the mean curvature H of E is not positive? Quite beautifully, Huisken and Ilmanen show that a generalized form of inverse mean curvature flow can always be defined as long as E is outer-minimizing. Huisken and Ilmanen's generalization of inverse mean curvature flow can be thought of geometrically as follows: If at any point in the flow of surfaces the surface is enclosed by a surface with equal or less area, immediately jump out to that surface and then continue flowing by inverse mean curvature. This nice idea guarantees enough positivity for H for the generalized flow to always exist. We suggest that this main idea can also be used to define the generalized flow for r\c = 2c//'(off) which we conjecture also always exists in the weak sense as defined by Huisken and Ilmanen. Another key point of this "jumping" idea is that m c (E) never decreases with a jump. The area of the surface does not change with a jump for two reasons: the area can not increase since the jumps only happen if the new surface has less or equal area, and the area can not decrease since otherwise the surface would have jumped out at an earlier time in the flow. (Note that this last argument requires that the initial surface of the flow is outerminimizing.) Furthermore, since the new surface after a jump is area minimizing where it does not touch the original surface, the new surface must have zero mean curvature at these points. Where the two surfaces touch, the new surface must have mean curvature less than or equal to the original surface. Hence, J s f(cH) must decrease or stay the same with a jump since f{x) is an increasing function of x. Thus, m c does not decrease during these jumps, so m c is nondecreasing under the generalized flow for rjc = 2c/f'(cH).
A family of quasi-local mass functionals with monotone flows
325
3. Monotonicity of the mass functionals We prove that the quasi-local mass functional mc is monotone increasing in the case that the flow 77c is smooth. Given an initial outer-minimizing surface S, let E c (t) be the smooth family of surfaces generated by flowing out in the normal direction with speed
%=
7W
(6)
Recall that y = f(x) is defined as the implicit solution to x2 = y + y 3 / 2 for x, y > 0. The key property of this function is that y = f(x) is a solution to the o.d.e.
^i. = dx
Axy
3x2-y'
(7\
( )
Note that y = -^f(cx) is also a solution to the above o.d.e. Also, y = x2 (which can be thought of as the c = 0 case) solves the o.d.e. As we will see, each of these solutions to the o.d.e. yield a corresponding monotone quasi-local mass functional. (There are even more solutions to the above o.d.e. which are of potential interest although so far it is not clear how to use these other solutions.) The most important step in finding these new monotone quasi-local mass functionals is answering the following question: "For which f(x)_ and g(x) can the rate of change of Js f(H) (when flowing £ in the outward normal direction with speed rj = g{H)) be bounded by a + b J s f(H) in manifolds with nonnegative scalar curvature?" At first glance, it seems unlikely that such a statement would be true for any f(x) and g(x). However, the miracle of the Hawking mass and inverse mean curvature flow is that f{x) = x2 and g(x) = l/x satisfy the conditions of the above question. The question then becomes, "Are there any other functions which work?" As it turns out, the answer is yes. To see this, we need to compute. Recall the standard formulas that if r\ is the speed of the surface S in the outward normal direction, then | i 7 = (-A-||II||2-Ric(^,i7))77
(8)
and —diJ, = Hridn,
(9)
where H is the trace of the second fundamental form II, Ric is the Ricci curvature of the manifold, v is the outward normal unit vector to S, and d/x is the area form of E. Also recall that the full trace of the Gauss equation in this context is mC{v,v)=l-R-K+l-H2-l-\\ll\\2,
(10)
where R is the scalar curvature of the manifold and K is the Gauss curvature of the surface £. Finally, note that if we define Ai and A2 to be the principal curvatures of E (that is, the eigenvalues of II), then H = Xi + A2 and ||II|| 2 = A2 + A2, so that = \H2 + \(\1-X2)2.
(11)
326
HUBERT L. BRAY
Then it follows that dt
£ f(H) dn = J^ f(H)HV + f(H) (-&-1R
+ K- \H2 -\(\I-
A2)2) V (12)
(note that when the area form dfi is not written it is implied). The above equation makes our choice of rj = g(H) easy. By the Gauss-Bonnet equation, we know that J s K < Air assuming that E is connected. Thus, we need f'(H)rj to be a constant on E since otherwise it is not clear how to bound the Gauss curvature term. Multiplying 77 by a constant simply amounts to speeding up or slowing down the flow, so without loss of generality we choose V=m-y
(13)
Then since
and R > 0, we have that
To answer our question, we choose / such that 2xf(x)
3 22 \x = kf(x).
(16)
Rescaling / by a constant factor allows us to choose k = —1/2 without loss of generality. This case is equivalent to y = f(x) satisfying equation (7). We then have
|/E/(H)<8.-i/E/(fl)
(17)
for any f(x) satisfying equation (7). Now choose y = f(x) to solve x2 = y + y3/2 for x, y > 0 (which we leave as an exercise to show satisfies equation (7)). Then since y = f(cx)/c2 also satisfies equation (7), it follows that
JtJJf(cH)<8«-\JJf(cH)
(18)
for the flow ™
t'icH)
(.9)
since dy/dx = f'(cx)/c. The last ingredient for proving that mc is nondecreasing under r]c flow is to note that since 0 < y < x2 and 4xy/(3x2 — y) is increasing as a function of y, then dy 4x • x2 _£. < = 2x dx ~~ 3x2 — x2
A family of quasi-local mass functional with monotone flows
327
by equation (7). Hence, 2C ,c
>i.
(20)
f'(cH) - H'
with equality only in the limit case c = 0. Hence, if we let A(t) be the area of E c (t) defined by rjc flow, then j
p
-A(t)
=
Hnc> A(t)
(21)
•J 2-,c\t)
with equality when c = 0. Equations (1), (18), and (21) then imply the following result. Theorem 3.1. When the outward normal flow nc = 2c/f'(cH) m c (E c (£ 0 )) > 0, then
§trnc(Xc(t))
is smooth and
> 0.
(22)
t=t0
4. Expansion of the flow and the mass Computing the first few terms of the Taylor series for y = f(x) which solves x2 = y + y3^2, we find that /(*) = x2 - x3 + ^ 4 + 0(x5) (23) and f(x)
= 2x-3x2+6x3 + 0{x4).
(24)
Similar calculations then yield that +j-^H2
V=^
+ 0(H2)
(25)
and
™<<E> - V i s
0 - i L "2 -cH3+lc2fl4+0
-c-
<26)
In fact, more careful estimates show that
f(x)<x2-x3
+ lx4,
(27)
sup m c (E),
(29)
from which it follows that
Hence, if we define m(E) =
0
then the quadratic optimization problem (with endpoints) in c gives us that 1 2
™V)>mH(Z)
IE] /
+
^
3 f- H ' i s " V .^V
^f
m maaxx U0
{161T)3/2
J]
| E[1 /2 /
'
where m # = mo is the usual Hawking mass functional from equation (2)
2
(»)
328
HUBERT L. BRAY
5. Convergence t o t h e total mass We conjecture that the family of surfaces £ c (£) denned by rjc flow converge to large round spheres in the asymptotically flat end of (M3,g) in the same sense and for the same reasons that this also occurs for inverse mean curvature flow. Furthermore, if we let A(t) be the area of T,c(t) and define A = iwr2, then for large t we have that the mean curvature H(x, t) of £ c (£) behaves like ^ = " + 0 ^ .
(31)
Thus, lim J——- / V 16TT I 16TT / S
t—*00
cHi\=c
(32)
whereas the analogous integrals of Hp for p > 3 go to zero. Hence, using equation (26), Urn m c (£ c (t)) = lim * / A A - _ L / F 2 ) + c - c, t->oo
t->oo V l07T \
1D7T JY,
= rriADM
(33)
J (34)
for the same reasons that inverse mean curvature flow gives this limit for the Hawking mass (with sufficient assumptions on the asymptotic flatness of (M3,g)). Hence, putting everything together, and assuming our very reasonable conjectures about the existence of the flow r\c (for all c > 0) and convergence of the flow to round spheres for large t, we conclude that mADM > m c (S) (35) for all smooth, outer-minimizing S. Then taking the supremum over all positive c as in the previous section, we further conclude that TOADM > m(E)
(36) IE 11/2
m a x
^m-(E)+(l6ij^
6JS^
(37)
where ran is the usual Hawking mass functional.
6. Conclusions Naturally it would be nice to find applications for the above estimate of the ADM mass of an asymptotically flat, time-symmetric (or maximal) slice M 3 of a spacetime in terms of the outerminimizing surfaces E in M 3 . It is intriguing that equation (37) sometimes gives an improvement on the Hawking mass as a lower bound. However, the Hawking mass is already the best estimate possible in the spherically symmetric case, so we know (and can verify through other means as well) that rh = m # on the spherically symmetric spheres in this case. Hence, m can only give an improved lower bound on the total ADM mass when (M3,g) is not spherically symmetric. In fact, one can prove that such examples do exist. Hence, it is possible that the extra term in equation (37) could be a useful estimate
A family of quasi-local mass functionals with monotone flows
329
in situations which are naturally asymmetrical, and might even be a kind of measure of asymmetry. This suggests looking for a connection with angular momentum, for example. As a final comment, we note t h a t this direction of deriving explicit quasi-local mass functionals like m c a n d t h e Hawking mass c a n b e viewed as b a b y steps towards t h e problem of understanding t h e Bartnik mass [1] of E in terms of t h e geometry of E. T h e estimates on t h e A D M mass obtained here apply t o t h e outer Bartnik mass [2] as well, for example. Bartnik has observed t h a t t h e geometry of E which defines t h e Bartnik mass of E is t h e metric on E a n d t h e mean curvature H of E (but not t h e rest of t h e second fundamental form). Since our estimates are functionals of t h e "Bartnik data" of E , it is possible t h a t there m a y be a connection between these two problems.
References 1. R. Bartnik, "New definition of quasi-local mass", Phys. Rev. Lett. 62, 2346 (1989). 2. H. L. Bray, "Proof of the Riemannian Penrose inequality using the positive mass theorem", Jour. Diff. Geom. 59, 177-267 (2001). 3. H. L. Bray, "Black holes, geometric flows, and the Penrose inequality in general relativity", Notices of the AMS 49, 1372-1381 (2002). 4. H. L. Bray, A. Neves, "Classification of prime 3-manifolds with Yamabe invariant greater than RP3", Annals of Mathematics 159 (2004). 5. H. L. Bray, R. M. Schoen, "Recent proofs of the Riemannian Penrose conjecture", in Current Developments in Mathematics 1999, edited by S.-T. Yau. 6. D. Christodoulou, S.-T. Yau, "Some remarks on the quasi-local mass", Contemporary Mathematics 7 1 , 9-14 (1988). 7. "Fifty years of the Cauchy problem in general relativity", Electronic Proceedings (with video, audio, slides, and lecture notes at h t t p : / / f a n f r e l u c h e . m a t h . v i n i v - t o u r s . f r / ) , organized by Piotr T. Chrusciel and Helmut Friedrich in Cargese, Corsica, summer 2002. 8. R. Geroch, "Energy extraction", Ann. New York Acad. Sci. 224, 108-117 (1973). 9. G. Huisken, T. Ilmanen, "The inverse mean curvature flow and the Riemannian Penrose inequality", J. Diff. Geom 59, 353-437 (2001). 10. G. Huisken, T. Ilmanen, "The Riemannian Penrose Inequality", Int. Math. Res. Not. 20, 10451058 (1997). 11. G. Huisken, T. Ilmanen, "A note on inverse mean curvature flow", in Proceedings of the Workshop on Nonlinear Partial Differential Equations (Saitama University, Sept. 1997), available from Saitama University. 12. P. S. Jang, R. M. Wald, "The positive energy conjecture and the cosmic censor hypothesis", J. Math. Phys. 18, 41-44 (1977).
On a wave map equation arising in general relativity HANS RINGSTROM (MPI,
Golm)
I consider a class of spacetimes for which the essential part of Einstein's equations can be written as a wave map equation. The domain is not the standard one, but the target is hyperbolic space. The objective is to describe the asymptotics of solutions in the time direction corresponding to expansion in the spacetime. The sort of question one wants to answer in the end concern e.g. future causal geodesic completeness.
1. Introduction In the study of the expanding direction of cosmological spacetimes, the results obtained so far can roughly be divided into small data results and results obtained for situations with symmetry. The small data results without symmetry are often very difficult to prove, but one does get conclusions for an open set of initial data. On the other hand, these results concern initial data close to known solutions, and for the method to work, the perturbed solutions typically has to decay to the known one. In studying situations with symmetry one in some sense considers an empty set of initial data, but on the other hand, one need not start with initial data close to something known. Thus, there is the possibility that one may observe some unexpected non-linear behaviour. The symmetry classes for which one can describe the asymptotics in detail consist mainly of spatially homogeneous solutions. However, even in this case, one gets quite interesting behaviour, especially if one also considers the direction toward the singularity. In fact, this case is far from completely understood at this time. Here, I want to discuss the so called Gowdy spacetimes. These admit a two dimensional group of symmetries acting on spatial Cauchy surfaces, so that the equations one ends up with are a system of non-linear wave equations in 1 + 1 dimensions. This class has received considerable attention, probably due to the fact that it is on the borderline; it is not trivial to analyze it, but the set of equations is manageable. The basis for this contribution to the proceedings is [5].
2. The Gowdy spacetimes The Gowdy vacuum spacetimes were first introduced in [3] (see also [2]), and in [4] the fundamental questions concerning global existence were answered. The following conditions can be used to define a member of this class: — It is a time orientable globally hyperbolic vacuum Lorentz manifold. — It has compact spatial Cauchy surfaces. — There is a smooth effective group action of U(l) x U(l) on the Cauchy surfaces under which the metric is invariant. — The twist constants vanish.
330
On a wave map equation arising in general relativity
331
Let me explain the terminology. A group action of a Lie group G on a manifold M is effective if gp = p for all p £ M implies g = e. Due to the existence of the symmetries there are two Killing fields. Let me call them X and Y. The twist constants are defined by Kx=ea0ySXaYf3W^Xs
and
KY =
eafhSXaY(>V1Ys.
The fact that they are constants is due to the field equations. By the existence of the effective group action, one can draw the conclusion that the spatial Cauchy surfaces have topology T 3 , S3, S2 x S1 or a Lens space. In all the cases except T 3 , the twist constants have to vanish. However, in the case of T 3 they need not vanish, and the condition that they vanish is the most unnatural of the ones on the list above. Since one only expects there to be a causally geodesically complete direction in the T 3 -case, and since the equations are much more complicated when the twist constants are not zero, I will however only consider the Gowdy T 3 -case. I refer the interested reader to [2] and [3] for a proof of these statements. In [2], the symmetries are imposed on initial data, which is perhaps somewhat more natural. I will take the Gowdy vacuum spacetimes on R+ x T 3 to be metrics of the form (1). This is in fact not quite true, see [2, pp. 116-117]; I have set some constants to zero. However, the mentioned class is a natural subclass, and the discrepancy should not cause any major difficulties. The subject of this work is the asymptotic behaviour of metrics of the form g
= t-l'2ex'2{~dt2
+ dO2) +t[epda2
+ 2epQdadS + (epQ2 + e~p)d52]
as t —> oo. Here, t G M+ = (0, oo), (6, a, 5) are coordinates on T functions of (t,0). The evolution equations become
3
(1)
and P, Q and A are
Ptt + \Pt-Pee-e2P(Q2-Q2e)=0, Qtt + \QI - Qee + 2{PtQt - PeQe) = 0,
(2) (3)
and the constraints Xt=t[P2
+ P2 + e2P(Q2 + Q2)}, 2P
Xe = 2t{PePt + e QeQt).
(4) (5)
Obviously, the constraints are decoupled from the evolution equations, excepting the condition on P and Q implied by (5). The procedure for constructing a metric is thus to choose initial data for P and Q and their time derivatives such that there exists a smooth A : S1 —> M satisfying (5). One then solves (2)-(3) after which (4) determines A up to a constant. Finally one can check that (5) holds for all time. Consequently, the equations of interest are the two non-linear coupled wave equations (2)-(3). In this parametrization, the expanding direction corresponds to t —> oo, and my main concern will be the asymptotics of solutions to (2)-(3) as t —> oo.
3. Wave map structure The equations (2)-(3) can be interpreted as a wave map equation. In fact, let go
= -dt2 + dO2 + t2d
(6)
332
HANS RINGSTROM
be a metric on R+ x T2 and let 91
= dP2 + e2PdQ2
(7)
be a metric on M2. Then (2)-(3) are the wavemap equations for a map from (R+ x T2, go) to (M.2,gi) which is independent of the <j> variable on T2. Note that ( K 2 , ^ ) is isometric to the upper half plane H = {(x,y) £ E2 : y > 0} with the metric ga = (dx2 + dy2)/y2 under the isometry (Q,P) H-> (Q,e~p). One important consequence of this is that isometries of the hyperbolic plane map solutions of (2)-(3) to solutions. Another important consequence is the existence of certain conserved quantities which I will write down in a moment. It will be convenient to carry out the analysis in the (P, Q)-variables, but the conclusions take their most natural form in the (x, y)-variables. For this reason I will use the different variables in parallel.
4. Energy decay The starting point of this work was the numerical studies carried out by Beverly Berger and Vincent Moncrief, see [1], One object they considered was
l(t) = J^P2
+ e2PQ2d0.
(8)
This is the length of the closed curve in hyperbolic space defined by P and Q for a fixed time t. Their studies indicated that it should decay as i - 1 / 2 . This statement can then be interpreted as saying that the solution becomes more and more spatially homogeneous. In fact, they observed that H
= \J lP" + pe + e2P& + $)1 &
decays as 1/t. Note that this implies that l(t) < Kt^1/2, prove the following. Theorem 4.1. Consider a solution to (2)-(3). T > 0 and a constant CH such that
(9)
where / is defined by (8). You can
Then if H is given by (9), there is a K, a
\tH(t)-cH\<j
(10)
for all t>T. Furthermore, if CH is zero, the solution is independent of 6, and in that case, t2H(t) is constant. Note that this proves the 1/t-decay for H and that it is optimal for spatially homogeneous. Note furthermore that (10) is also optimal, cannot obtain a better decay estimate that holds for all solutions to non-trivial spatially homogeneous solution, t2H(t) = CQ > 0, which have anything of the form o(t _ 1 ) on the right hand side of (10).
solutions that are not in the sense that one (2)-(3). In fact, for a makes it impossible to
On a wave map equation arising in general relativity
333
5. Conserved quantities and asymptotic curves In most cases studied numerically, the analysis suggested that given a solution to (2) and (3), one can find a spatially homogeneous solution to the equations such that the difference between the solution one started with and the spatially homogeneous solution decays to zero in the supremum norm. It turns out that this is not always true. Before discussing the asymptotics, let me introduce some terminology. Consider a solution to (2)-(3). Then there are the following constants A=
[ {2Q(tQt)e2P-2(tPt)}d9,
(11)
B=
[ e2p(tQt)d6,
(12)
C= I {(tQt)(l-e2PQ2) Js1
+ 2Q(tPt)}d6.
(13)
As has been mentioned, (2)-(3) can be given a Lagrangian formulation. Since the Lagrangian is invariant under the isometries of the hyperbolic space, one gets conserved quantities due to Noether's theorem. Thus, e.g. the fact that A is constant is a consequence of the fact that dilations are isometries of the upper half plane and the conservation of B follows from the fact that translations in Q are isometries. Of course, one can check that A, B and C are constants by differentiating with respect to time and using the equations. When one maps a solution to a solution by an isometry of the hyperbolic plane, the constants generally change. However, there is one combination, A2 + ABC, which is unchanged, and this object will play an important role in the analysis. I will also use the notation a = A/2TT, /? = B/2ir, 7 = C/27T and S=y^+Mm
(14)
In the spatially homogeneous case a2 + A(31 = At2(P2 + e2PQ2).
(15)
Thus, spatially homogeneous solutions to (2)-(3) satisfy A2 + ABC > 0 with equality if and only if the solution is trivial, i.e., P and Q are constants. However, in general, no matter what the value of A2 + ABC is, one can find non-trivial data that yield this value. It turns out that the asymptotics are very different depending on whether A2 + ABC is positive, zero or negative. If A2 + ABC > 0, then the solution qualitatively behaves like a spatially homogeneous solution, but not if the opposite inequality holds. In fact, one has the following. Theorem 5.1. Consider a solution to (2)-(3). Letx = {x,y) = (Q,e~p) and let du be the metric induced by the Riemannian metric gu- Then there is a K, a T > 0 and a curve T such that
d„(xM),r)
The possibilities for T are as follows.
— If all the constants A, B and C are zero, Y is a point.
334
HANS RINGSTROM
— If A2 + ABC = 0, but the constants are not all zero, V is either a horocycle (i.e., a circle touching the boundary) or a curve y = constant. — If A2 + ABC > 0, r is either a circle intersecting the boundary transversally or a straight line intersecting the boundary transversally. — If A2 + ABC < 0, T is a circle inside the upper half plane. Remark. I use the convention dH{x,T) = inf X o £ rd#(x,xo). The circle one obtains in the case A2 + ABC < 0 may be degenerate, i.e., a point. Note also that since l(t) defined by (8) decays as t~xl2, I have sup
dH[x{t,61),x(t,92)}
(16)
In a generalized sense, one can thus say that the solution converges to a circle. Since there are four different kinds of circles in the upper half plane (points, non-degenerate circles inside the upper half plane, circles touching the boundary and circles intersecting the boundary transversally) one gets the four cases above.
6. Movement along t h e asymptotic curve Concerning the case where the solution converges to a point, not that much more remains to be said, but in the other cases, it is of interest to know how the solution moves along the respective curves. Theorem 6.1. Consider a solution to (2)-(3) with A2+ABC > 0. Then there is an isometry of the hyperbolic plane, a K, a T > 0 and constants c\ and c2 such that if (Qi,Pi) = (x, — lny) is the transformed solution, X
—- c i y
< Hr1'2
(17)
1
^ S ,K)
and \\\ny + 8lnt + c2\\c(siM for all t>T,
(18)
where 5 is given by (14).
Equation (17) says that the distance from the solution to the straight line x = c^y decays to zero as t~1'2, and (18) shows that the solution is moving toward the boundary of hyperbolic space along this straight line. In the spatially homogeneous case and in the polarized case (Q = 0), the constant c\ is always zero. There are however initial data for which this is not the case. Thus unlike the polarized and spatially homogeneous solutions, the asymptotic curve need not be a geodesic of the hyperbolic plane. One is then led to ask the question: is there a natural characterization of solutions with c\ — 0? Equations (17) and (18) yield a measure of how fast the solution is moving toward the boundary. One consequence of these estimates is that if one fixes a point inside the hyperbolic plane, the distance from this point to the solution at time t is 5 m i up to an error which is bounded, irrespective of which ^-coordinate one chooses.
On a wave map equation arising in general relativity
335
Theorem 6.2. Consider a solution to (2)-(3) with A2 + ABC = 0, but for which not all the constants are zero. Then there is an isometry of the hyperbolic plane, a K, a T > 0 and constants C\ and c% such that if (Qi,P\) = (a;, — Iny) is the transformed solution, h - CIWC(SKR) < Kr1/2
(19)
and \\x - lni - c2||C(SMR) < Kt-1'2
(20)
for all t > T. Note that the conditions of the theorem are inconsistent with spatial homogeneity. Equation (19) says that the distance from the solution to the curve y = c\ decays to zero as i - 1 / 2 , and by (20), the solution is diverging to infinity along this curve. Furthermore, one can use (19) and (20) to conclude that if one fixes a point in the hyperbolic plane, the distance from this point to the solution at time t is 2 In In t up to a bounded error term. Let me now consider the case A2 + ABC < 0. The first question to ask is whether the limit circle is always a point. One can prove that this is not the case. Again, it would be of interest to characterize those solutions which have the degenerate behaviour. Concerning the movement along the circle, the following statement is true. Theorem 6.3. Consider a solution to (2)-(3) such that A2 + ABC < 0. / / the circle T obtained in Theorem 5.1 is not a point, there is a K and T > 0 and for every to > T a curve 7 to with the properties
7to(R+) = r, d„[x(M),7t0W] < Ktz1/2 for all t >to and /2TT
7to *I exp I —
= 7to(*i).
9H{it0{t),it0{i))
=
r
-^-,
where litr is the length of the circle T with respect to the hyperbolic metric. Remark. One can give an explicit expression for the curve 7 i o . Consequently, the solution oscillates forever along the circle, and is more or less periodic in a logarithmic time coordinate. Observe that the solutions in this case behave in a way unlike anything seen when studying spatially homogeneous solutions to the equations; spatially homogeneous solutions are either constant or go to the boundary along a geodesic. However, the solution becomes spatially homogeneous in the sense that (16) is satisfied. Thus solutions which become spatially homogeneous in the limit, in the sense (16), need not at all behave like spatially homogeneous solutions to the equations.
7. F u t u r e causal geodesic completeness In the end, I am interested in the metric (1) and thus in the behaviour of the functions P, Q and A. From the proofs of Theorems 5.1-6.3, one can deduce the behaviour of P and Q. The leading order behaviour of A can interestingly enough be deduced immediately from Theorem 4.1.
336
HANS RINGSTROM
Theorem 7.1. Consider a solution to (2)~(5). Then, if the solution is not independent of 6, there is a T > 0 and a K such that \\X(t,-)-cxt\\C(s\R)
2tH(t)
By (10), I also have
(At) - °f 7T
K
*7«
where CH is positive under the assumptions of the theorem. I get the conclusion of the theorem. • The above statements can be obtained without any control of the sup norm of the derivatives of P and Q. In some situations it is however of interest to have such control. Proposition 7.1. Consider a solution to (2)-(3). \\Pt\\c(s\K) + \\Pe\\c(S\R) + \ePQt\\c(s\R)
Then + ||e p Q fl || c( si,R) <
Kr1'2.
Using the above information, one finally obtains the following theorem. Theorem 7.2. Consider a metric given by (1), where X, P and Q are solutions to (2)-(5). Assume furthermore that the metric is not independent of 6. Let 7 : (s~,s+) —• M.+ x T 3 be an inextendible causal geodesic with respect to this metric and assume that (7',9t) < 0. Then 7 is future complete. For the case where the solution is independent oi 6,1 refer the reader to the literature on spatially homogeneous solutions.
References 1. 2. 3. 4. 5.
B. Berger, arXiv:gr-qc/0207035 (2002). P. T. Chrusciel, Ann. Phys. NY 202, 100 (1990). R. H. Gowdy, Ann. Phys. NY 83, 203 (1974). V. Moncrief, Ann. Phys. NY 132, 87 (1981). H. Ringstrom, to appear in CPAM (2003).
Integrable systems and random matrix theory Session organized by P . D E I F T (New York) and M. JiMBO (Tokyo)
This page is intentionally left blank
Limiting d i s t r i b u t i o n of last passage percolation m o d e l s J l N H O BAIK* (U.
Michigan)
We survey some results and applications of last percolation models of which the limiting distribution can be evaluated.
1. I n t r o d u c t i o n Consider a Poisson process of rate 1 in R+. If one is only interested in the points in a square (0, t) x (0, t), it is equivalent to think of picking M points at random in the square where TV itself is a random variable such that »(W" = N) =
N
\t2) N\
N =
0,1,2,...
(1)
Given a realization of the process, an up/right path from (0,0) to (t, t) is a piecewise linear curve starting from (0,0) to (t, t) joining Poisson points such that the slope, where denned, is positive. The length of an up/right path is defined by the number of points on it. Let L(t) be the length of the longest up/right path from (0,0) to (t,t), making it a random variable. See figure 1 for an example of a last longest up/right path. This is a type of last passage
(0,0)
Figure 1. Poisson points and a longest up/right path from (0,0) to (t,t).
In this example, L(t) — 4.
percolation model which is to find a path that maximizes the total weight (passage time) in a random environment. Various statistics of L(t) as t —> oo have been of interest. The basic law is [4]
,.
m/L(t)-2t
lira P t-»oo
Y
V
t1'3
S«)=FM
=—,(-/-<.-,),»<.),(.)
(2)
where q(x) is the (unique) solution [24] to the Painlee II equation q" = 2q3 + xq •Research supported in part by the National Science Foundation under grant DMS-0208577.
339
(3)
340
JlNHO BAIK
satisfying the condition q(x) ~ - Ai(x),
x —> +00,
(4)
where Ai denotes the Airy function. Convergence of all the moments is also proved in the same paper. The limiting distribution F(x) is called the Tracy-Widom distribution, which will be discussed further in section 3. This article intends to provide some motivations and applications of the above Poisson last passage percolation model, and discuss the result (2).
2. Motivations and applications Random permutation Given a realization of the Poisson points in the square (0, t) x (0, t), suppose one labels them as (xi, yn(i)), i = 1 , . . . ,Af, such that x\ < X2 < • • • < aw- Note that with probability 1 no two points have the same x- or ^-coordinate. Then the indices of the y-coordinates of the points generate a permutation IT. Moreover, an up/right path is mapped to an increasing subsequence of the corresponding permutation. In the example of figure 1, the associated permutation is 629473518 and the increasing subsequence corresponding to the indicated up/right path is 2358. Therefore denoting the length of the longest increasing subsequence of SM by Ljv and recalling the property of the Poisson process discussed earlier, we find that °°
F(L(t) = i)=Y,
2 2 N
P-t
(t \
L
HLN = i),
t£Z.
(5)
N=0
Using this formula and the so-called de-Poissonization lemma [25] one can extract from the result (2) that [4]
The problem of finding various statistics of the longest increasing subsequence of a random permutation in the large N limit has been known as Ulam's problem since early 1960s. The Poisson version of the model as in the last passage percolation model above is sometimes called the Hammersley's process. See e.g. [1,16] for more history of this combinatorial problem and its applications. Plancherel measure A partition of A'' is a sequence of integers A = {Xj}jL1 such that Aj > A2 > • • • > Ajv > 0 and such that the sum of Aj's is N. Given a partition A of N, let d\ denote the number of standard Young tableaux of shape A (see e.g. [46] for definition). It is a basic result of representation theory of the symmetric group that d\ is the dimension of the irreducible representation of SN parameterized by A. Hence the sum of d\ over all partitions A of AT is equal to N. In view of this identity, a natural probability on the set of partitions of N is the Plancherel measure defined by
P(A) = ff-
(7)
Limiting distribution of last passage percolation models
341
Now the famous Robinson-Schensted [44] algorithm states that one can uniquely associate a pair of Young tableaux of a partition A of N to each permutation •K 6 SN. Moreover LN(-K) is equal to the largest part Ai of the corresponding partition. This implies that the distribution of the largest part Ai of a random partition of N taken according to the Plancherel measure (7) is precisely equal to the distribution of L^ of a random permutation. Hence the result (6) yields the limiting law for Ai. The asymptotic statistics of other parts A2, A3,... have also been studied. See e.g. [5,6, 14,27,32,37,49]. Polynuclear growth ( P N G ) model Consider a one-dimensional flat substrate. Suppose that there occur random nucleation events, which is a Poisson process in the space-time plane. If a nucleation occurs at (xo, to), an island of height 1 with zero width is created at position XQ at time to- As time increases, the island grows laterally in both directions with speed 1 while keeping its height. Often two growing islands of same height collide. In that case they form one island and the edges of the amalgamated island keep growing with the same speed 1. Note that nucleations can occur on top of an existing island, and hence new islands may be created on an old island. Let h(x, t) be the height of PNG model at position x at time t. Thus h(x, t) is a piecewise constant function in x for fixed t. An example of the graph of h(x,t) is in figure 2.
Figure 2. A snapshot of the height function of a PNG model. The picture on the right is of droplet initial condition.
One could impose various initial conditions. For now, suppose that the nucleation events occur only for |x| < t. A different way of describing this condition is that at time t = 0, the substrate satisfies h(0,0) = 0 and h(x, 0) = —oo for all x ^ 0, and as time increases, the base substrate itself grows laterally of speed 1 and nucleations occur only on top of the base substrate (and of course on top of islands on the base substrate). This condition is called the droplet case. See the picture on the right in figure 2 for an example. An observation by Prahofer and Spohn [40] is that the PNG model with the droplet initial condition can be mapped to the above Poisson last passage percolation model. Imagine the space-time plane in which the Poisson points corresponding to nucleation events are marked by dots. Due to the droplet condition, the dots are in the forward-light-cone \x\ < t, and as the growth speed of islands is 1, the height at (xi,ti) depends only on the nucleation events in the backward-light-cone \x — x\\
342
JlNHO BAIK
By re-interpreting the result (2) in terms of the PNG model using this identification, Prahofer and Spohn [40] found the following result: for fixed \c\ < 1,
fe'Cfc-^^'W
(8)
Note that the super-diffusive scaling i 1 / 3 is due to the fact that the height at a position strongly depends on the heights at the neighboring positions. Actually it has been believed that the exponent 1/3 should be universal for a wide class of one-dimensional random growth models as long as the spatial correlation is not too weak (see e.g. [13]). Such models are said to be in the KPZ universality class. In the famous work [30], Kardar, Parisi and Zhang introduced a nonlinear stochastic differential equation for the surface height h(x, t) as a continuum model for this class of random growth models, and renormalization group analysis has suggested that the scaling exponent should be 1/3 for one-dimension case. The PNG model now plays the role of the unique growth model for which the 1/3 exponent can be rigorously proved. Moreover, one can even establish the limiting distribution as in (8). One might ask what happens if the initial condition is changed. It turned out that while the scaling exponent is the same, the limiting distribution may change. For instance, instead of the droplet condition, consider the flat initial condition: the initial substrate is R and nucleation events could occur at any position on it. Then for any fixed position x, one finds (see [10,40])
fiS,<^fe#
* *) —»(-5 jf «M*) • F(.)•".
(9)
As a second example, consider a PNG model on a half line {x : x > 0}. In this case, one could imagine the situation such that extra nucleation events occur at x = 0, which corresponds to a 1-dimensional Poisson process in time at x = 0. In other words, there is excessive creation of islands at the origin. Then depending on the rate a of the creation of islands at the origin, the height function could have different properties. Indeed if a is big, then the height at x = 0 is dominated by the creation of islands at the origin, while if a is small, then it is likely that the islands created at the origin have little effect for the height at x = 0. Thus one expects a transition of h(0, t) in a, which is actually proved in [10,42]. For further references in this direction, see e.g. [3,10-12,41-43]. We note that all these different initial conditions have combinatorial meanings on random permutations and Plancherel measure. Particle/anti-particle process A different description of the PNG model [43] is to regard the right edge of an island as a particle and the left edge of an island as an anti-particle. Hence there are right-moving particles and left-moving anti-particles on the real line. Creation of island corresponds to creation of particle and anti-particle pair, and the fact that two islands stick together when they meet implies that upon colliding, particle and anti-particle annihilate each other. In this dynamic picture, the height function is now equal to the total number of the particles and the anti-particles that have crossed the given position up to the given time.
Limiting distribution of last passage percolation models
343
Random vicious walks The combinatorics of the longest increasing subsequence and the Plancherel measure have an interpretation as non-intersecting paths, which is sometimes called vicious walks [20]. See e.g. [21-23,29] for reference.
3. Random matrix and universality The Tracy-Widom distribution F(x) in (2) also appears in a totally different subject; random matrix theory. The main interest in the random matrix theory is the limiting statistics of the eigenvalues as the size of matrix tends to infinity. Random matrix theory has very diverse applications in both mathematics and physics from the spectrum of heavy nuclei to the zeros of Riemann-zeta function (see e.g. [17,31,35]). Of special interest is the Gaussian unitary ensemble (GUE) which is the set oi N x N Hermitian matrices H equipped with the probability measure [35] 1 e-Ntr(H>)dH,
(10)
where dH is the Lebesgue measure and ZN is the normalization constant. In 1994 Tracy and Widom [47] considered the limiting distribution of the largest eigenvalue £ma.x(N) of N x N Hermitian matrix taken from GUE and found that
Jim P((£max(iV) - V2)V2iV2/3 < a;) = F(x), N—>oo
(11)
'
where F(x) is precisely the same function in (2). In other words, upon proper scaling, the largest eigenvalue of a random Hermitian matrix taken from GUE and the length of the longest up/right path in the Poisson last passage percolation model have the same limiting law. It should be mentioned that the analysis of obtaining (2) and (11) are independent. Especially it is not found whether there is a direct relation between L(t) and £max(N) for finite t and N. Only in the limit t —> oo and TV —> oo, two seemingly different quantities have the same limit after independent computations. A framework to understand this might be central limit theorem. In the classical central limit theorem, the sum of n independent identically distributed random variables converges, after proper scaling, to the Gaussian distribution as n —> oo, irrelevant to the detail of the random variable. The results (2) and (11) have the similar feature that the two different 'random variables' share the same limiting distribution. So one may expect that the Poisson percolation model and GUE are two instances of models to which a nonlinear version of central limit theorem is applied. It is however not known whether there is indeed such a natural nonlinear version of the central limit theorem with proper general setting that includes the Poisson percolation model and the GUE. But there are some 'patches' of universality results in random matrix theory, and in the remaining of this section, we discuss some of them.
344
JlNHO BAIK
Universality of random matrices A generalization of GUE is the set of Hermitian matrices with the density ^Le-NtrV(H)
d^
(12)
ZN
where V is an analytic function with sufficient decay properties. We remark that then the eigenvalues £1 > £2 > • • • > £N n a v e the density
w
n
\Zi-ti\*-i[vi*i)
" l
(13)
j=l
for some constant Z'N and w(x) = e~NV(-x\ This can be thought of as a Coulomb gas of particles in R with logarithmic repulsion and external potential w. For general V, the local statistics of the eigenvalues in the middle of the limiting density of states are found to be independent of V (see [15,18,39]). Moreover, by using the results of [18] and [34], one can also deduce that the limiting distribution of the largest eigenvalue of random Hermitian matrix taken according to the probability (12) is generically given by the Tracy-Widom distribution F(x). The special case of V(x) = x4 +tx2 was obtained by Bleher and Its [15]. The GUE has an alternative definition and a different generalization. Namely, GUE is the set of Hermitian matrices with independent, except for the Hermitian condition, complex Gaussian entries. It is direct to check from this definition that the density of the matrix is precisely (10). A natural generalization is then the random Hermitian matrices of independent entries which are not necessarily Gaussian. This is called the Wigner matrix [35]. For Wigner matrix, the largest eigenvalue is found to have the same limit F(x) again [45] (see also [28] for a result regarding the eigenvalues in the middle of the limiting density of states). Hence the limiting law (11) still holds true for two different generalizations of GUE, one of density function (12) and the other of independent entries. Growth models and discrete orthogonal polynomial ensembles In addition to the Poisson percolation model, there are a few more isolated examples of percolation models for which the scaling limit (2) can be obtained. Consider the lattice sites N 2 . Suppose that we assign a random variable X(i,j) at each site (i,j) £ N 2 . We further assume that X(i,j) are independent and identically distributed. Consider an up/right path 7r from the site (1,1) to (M,N), which is a collection of neighboring sites {(ik>jk)} such that (ffc+i, jfc+i) — (ik,jk) is either (1,0) or (0,1). Let TL(M,N) denote the set of up/right paths from (1,1) to (M,N), and define
k
(»,j)eir
J
If X(i,j) are positive random variables, an interpretation is that X(i,j) is the passage time at the site (i,j), and L(M, N) is the last passage time to travel from (1,1) to (M, N) along an admissible up/right path. The Poisson percolation model is a continuum version of this more general directed last passage percolation model.
Limiting distribution of last passage percolation models
345
For general random variables X(i,j), the scaling limit law (2) is an open problem, but when the random variable X(i,j) is either geometric or exponential, (2) is proved [26]. Also if the definition of the u p / r i g h t p a t h s is modified, there are a few more isolated cases for which (2) is obtained (see e.g. [2,27,48]). However all the cases such t h a t (2) is proved share the common feature t h a t they all have an interpretation as a version of t h e longest increasing subsequence of a random (generalized) permutation,, and all of t h e m have t h e same algebraic structure (see e.g. [9,38]). Hence it is an open question to prove the universality result (2) for a general random variable X(i,j) which does not have such a structure. T h e geometric percolation model above has an alternative representation. Consider the density function on the set of particles £i > £2 > r > — > £JV > 0, £,• £ N U {0}, given by (13). T h e only change is t h a t the 'particles' £,- lie on a discrete set instead of a continuous set. Sometimes this is called the 'discrete orthogonal polynomial ensemble', while (13) with continuous weight w is called t h e 'continuous orthogonal polynomial ensemble'. A result of Johansson [26] is t h a t for the geometric percolation model, L(M, N) has the same distribution (except for a minor translation change) as the 'largest particle' £i in the discrete orthogonal polynomial ensemble with the special choice w(x) = (^+Mx~N)qx (assuming t h a t
M>N). Hence one may wonder whether the discrete orthogonal polynomial ensemble with general weight w have universal properties just like its continuous weight counterpart (13). This is indeed proved to be the case in [7,8] for a wide class of discrete weight w. T h e analysis extends the Deift-Zhou steepest-descent method [19] of Riemann-Hilbert analysis to the discrete interpolation setting (see also [33,36]). For general discrete weights w, it is not clear if discrete orthogonal polynomial ensembles have any percolation-type interpretation, but the universality of (13) for b o t h continuous and discrete weights provides a linkage of the result (11) for G U E and the result (2) for the geometric percolation model.
Acknowledgments T h e author would like t o t h a n k Percy Deift and Michio Jimbo for kindly inviting him to the special session of integrable systems in the International Congress on Mathematical Physics.
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12.
D. Aldous, P. Diaconis, Bull. Amer. Math. Soc. (N.S.) 36, 413-432 (1999). J. Baik, Comm. Pure Appl. Math 53, 1385-1410 (2000). J. Baik, in Contemporary Mathematics 326, AMS, 2003, pp. 1-21. J. Baik, P. A. Deift, K. Johansson, J. Amer. Math. Soc. 12, 1119-1178 (1999). J. Baik, P. A. Deift, K. Johansson, Geom. Funct. Anal. 10, 702-731 (2000). J. Baik, P. A. Deift, E. Rains, Comm. Math. Phys. 223, 627-672 (2001). J. Baik, T. Kriecherbauer, K. T.-R. McLaughlin, P. Miller, Int. Math. Res. Not. 15, 821-858 (2003). J. Baik, T. Kriecherbauer, K. T.-R. McLaughlin, P. Miller, in preparation. J. Baik, E. Rains, Duke Math. J. 109, 1-65 (2001). J. Baik, E. Rains, Duke Math. J. 109, 205-281 (2001). J. Baik, E. Rains, in Math. Sci. Res. Inst. Publ. 40, Cambridge Univ. Press, Cambridge, 2001, pp. 1-19. J. Baik, E. Rains, J. Stat. Phys. 100, 523-541 (2000).
346
JlNHO BAIK
13. A. Barabasi, H. E. Stanley, Fractal concepts in surface growth, Cambridge University Press, Cambridge, 1995. 14. A. Borodin, A. Okounkov, G. Olshanski, J. Amer. Math. Soc. 13, 481-515 (1999). 15. P. Bleher, A. Its, Ann. of Math. (2) 150, 185-266 (1999). 16. P. A. Deift, Notices Amer. Math. Soc. 47, 631-640 (2000). 17. P. A. Deift, Orthogonal Polynomials and Random Matrices: a Riemann-Hilbert Approach, AMS, RI, 1999. 18. P. A. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, S. Venakides, X. Zhou, Comm. Pure Appl. Math. 52, 1335-1425 (1999). 19. P. A. Deift, X. Zhou, Ann. Math. (2) 137, 295-368 (1993). 20. M. Fisher, J. Statist. Phys. 34, 667-729 (1984). 21. P. Forrester, J. Phys. A 23, 1259-1273 (1990). 22. P. Forrester, J. Phys. A 34, L417-L423 (2001). 23. A. Guttmann, A. Owczarek, X. Viennot, J. Stat. Phys. 102, 1085-1132 (2001). 24. S. Hastings, J. McLeod, Arch. Rational Mech. Anal. 73, 31-51 (1980). 25. K. Johansson, Math. Res. Lett. 5, 63-82 (1998). 26. K. Johansson, Comm. Math. Phys. 209, 437-476 (2000). 27. K. Johansson, Ann. of Math. 153, 259-296 (2001). 28. K. Johansson, Comm. Math. Phys. 215, 683-705 (2001). 29. K. Johansson, Probab. Theory and Related Fields 123, 225-280 (2002). 30. M. Kardar, G. Parisi, Y. Z. Zhang, Phys. Rev. Lett. 56, 889-892 (1986). 31. N. Katz, P. Sarnak, Random matrices, Frobenius eigenvalues, and monodromy, AMS, RI, 1999. 32. S. Kerov, Fund. Anal. Appl. 27, 104-117 (1993). 33. T. Kriecherbauer, K. T.-R. McLaughlin, P. Miller, to appear in Ann. of Math. Stud. 34. A. B. J. Kuijlaars, K. T.-R. McLaughlin, Comm. Pure Appl. Math. 53, 736-785 (2000). 35. M. Mehta, Random matrices, Academic press, San Diego, 1991. 36. P. Miller, Int. Math. Res. Not. 8, 383-454 (2002). 37. A. Okounkov, Int. Math. Res. Not. 20, 1043-1095 (2000). 38. A. Okounkov, Selecta Math. (N.S.) 7, 57-81 (2001). 39. L. Pastur, M. Shcherbina, J. Stat. Phys. 86, 109-147 (1997). 40. M. Prahofer, H. Spohn, Physica A 279, 342-352 (2000). 41. M. Prahofer, H. Spohn, Phys. Rev. Lett. 84, 4882 (2000). 42. M. Prahofer, H. Spohn, in Progr. Probab. 51, Birkhauser, Boston, MA, 2002, pp. 185-204. 43. M. Prahofer, H. Spohn, arXiv:cond-mat/0212519 , preprint (2002). 44. C. Schensted, Canad. J. Math. 13, 179-191 (1961). 45. A. Soshnikov, Comm. Math. Phys. 207, 697-733 (1999). 46. R. P. Stanley, Enumerative Combinatorics, Cambridge University Press, Cambridge, United Kingdom, 1999. 47. C. Tracy, H. Widom, Comm. Math. Phys. 159, 151-174 (1994). 48. C. Tracy, H. Widom, Probab. Theory Related Fields 119, 350-380 (2001). 49. H. Widom, Int. Math. Res. Not. 9, 455-464 (2002).
Differential equations and the Bethe ansatz PATRICK DOREY (U. Durham), CLARE DUNNING ADAM MILLICAN-SLATER, ROBERTO TATEO (U.
(U. York), Durham)
We review some surprising links which have been discovered in the last few years between the theory of certain ordinary differential equations, and particular integrable lattice models and quantum field theories in two dimensions. An application of this correspondence to a problem in non-Hermitian ('PT-symmetric) quantum mechanics is also discussed.
1. Introduction In the last few years, detailed connections have begun to emerge between two previouslyseparated areas of mathematical physics: the spectral properties of ordinary differential equations such as the Schrodinger equation, and the study of integrable lattice models and integrable quantum field theories using techniques related to the Bethe ansatz. This is sometimes given the (perhaps over-grand) title of the 'ODE/IM correspondence'. While the correspondence remains at the level of a mathematical coincidence, albeit a detailed one, it has already provided some important new insights into a variety of problems. On the 'ODE' (ordinary differential equations) side, some of the key themes are Hermitian and non-Hermitian spectral problems, Schrodinger equations, the WKB method, analytic continuation and 'PT-symmetry. These have been developed by a variety of authors; some important names and references for the story that we want to tell are Sibuya [1], Voros [2], Bessis and Zinn-Justin [3], and Bender and Boettcher [4,5]. The 'IM' (integrable models) aspect has a similarly long history. The theories relevant for the first example of the correspondence to be discovered [6] are the six-vertex model, and the quantum field theory of the massless sine-Gordon model. The Bethe ansatz, TQ relations and fusion hierarchies all have a role to play, and the works on these topics by Baxter [7], Kliimper and Pearce [8], and Bazhanov, Lukyanov and Zamolodchikov [9-11] are particularly relevant. In a short survey such as this, it is impossible to do justice to all of this background. In the next two sections some of the basic vocabulary will be sketched. More detailed reviews have been given in [12,13], and an extended version is currently in preparation. Subsequent developments following the initial observation in [6] can be found in [14-28].
2. First prologue: the basics of integrable models Consider a rectangular two-dimensional lattice, of size N x M. On each link of the lattice, attach an arrow, or spin, a, pointing in one of the two possible directions along the link. An assignment of a spin to each link i of the lattice gives a configuration {<7i}. If we impose the additional constraint that the number of arrows pointing into each vertex is equal to the number pointing out, then we are on the way to defining the six-vertex model.
347
348
PATRICK DOREY, CLARE DUNNING, ADAM MILLICAN-SLATER, ROBERTO TATEO
The main task is to calculate averages over all possible configurations, weighted by relative probabilities which can be found as follows. First, the local Boltzmann weights W[a,b,c,d\ must be specified, one number for each possible set of values taken by the spins on the four links connected to a single site of the lattice. Then the relative probability of a given configuration is simply the product of these local weights over all sites. A key quantity is now the sum of these relative probabilities over all possible configurations, which gives the normalisation factor for the calculation of the actual probability of any quantity. Otherwise known as the partition function, it is given schematically as Z=
zZ
l[W[a,b,c,d\.
(1)
{cT{} sites
The local Boltzmann weights W, and hence also the partition function Z, may depend on physical parameters such as the temperature. For the six-vertex model there is just one physically-relevant quantity to specify, namely the anisotropy rj. In the related quantum field theories this becomes the sine-Gordon coupling /?, related to rj by /32 = l—2r)/n. For the solution technique we wish to describe below, however, a rather less physical parameter will be especially important, for reasons which will be sketched shortly: the spectral parameter £. To evaluate Z, we can first define the transfer matrix T, to be the sum over the spins on a line of horizontal links of the product of their Boltzmann weights, for a given set of spins on the adjacent vertical links above and below. If we assign one multi-index a to the vertical links above the line in question, and another a' to the links below, then T can be thought of as a 2N x 2N matrix T£,, and, for periodic boundary conditions, the partition function Z is Z = Trace [T M ] . (2) Clearly, if we can diagonalise T, then we shall be able to compute Z. Even better, since the main interest is in the so-called thermodynamic limit when both N and M tend to infinity, we only need find the lowest-lying eigenvalues. Note that the mathematical structures found in this limit can also be obtained directly in the context of the continuum quantum field theory of the massless sine-Gordon model on a cylinder, using the constructions of [9-11]. Now comes the first moment where the concept of integrability arises: the local Boltzmann weights for the six-vertex model are so defined that the transfer matrices taken at different values of the spectral parameter commute: [T(O,T(£')]=0.
(3)
(The deeper meaning of this property would take us too far afield, but from one point of view it is related to the existence of infinitely-many commuting conserved quantities.) If the matrices T(£) all commute, then they can be diagonalised simultaneously and we can study individual eigenvalues T(£) as functions of £. There are various methods for finding these functions, but the most relevant here was discovered by Baxter in the early 1970s, in connection with his work on the more-complicated eight-vertex model. Since the transfer matrix is an entire function of £ with ^-independent eigenvectors, the eigenvalues T(£) must also be entire. Baxter showed further that, for each eigenvalue T(£), there is an associated function Q(£), also entire, such that the following TQ relation holds: r ( O Q ( 0 = e- 2 ™ p Q(
e^Qtft),
(4)
Differential equations and the Bethe ansatz
349
where the phase q depends on f3 as q = el7TP , and the extra parameter p allows for the possibility of inserting a 'twist' into the periodic boundary conditions on the lattice. (Strictly speaking, for finite values of the lattice width N the TQ relation contains some extra factors, but these go in the thermodynamic limit. We are following the conventions of [10], appropriate for this limit, save that there Q is denoted A for p ^ 0, and both it and T are taken to be entire functions of the squares of their arguments.) To see why this equation provides a useful constraint, we can argue as follows. Since Q is entire, it is, subject to suitable growth conditions, determined by the location of its zeroes. Suppose these zeroes are at {eo,..., efc}; then
Q(O = Q(0)n( 1 -f)i=o
v
eiJ
Now T(£) is also entire, so the TQ relation (4) taken at £ = ej implies that e~2nipQ(q~2ej) e2mpQ(q2ej) = 0. Rearranging and using the product form (5),
n (ei~q22j)=-e-47rip-
(5) +
(6)
This gives us fe+1 equations in fc+1 unknowns, which determine the ej up to discrete ambiguities corresponding to the fact that the transfer matrix has not one but many eigenvalues. These equations coincide with the so-called Bethe ansatz equations (BAE) found in other, more direct, approaches to the problem. In this context, the et are sometimes called the Bethe roots. Note, once the Bethe roots are known, the eigenvalue T(£) is easily recovered by using (5) and then (4). Although the BAE (6) do not determine the Bethe roots uniquely, for the so-called ground state eigenvalue — the largest one, most important in the thermodynamic limit — this ambiguity is fixed by the fact that the Bethe roots for this eigenvalue are all located on the positive real axis. Over the last few decades, Bethe ansatz equations such as (6) have been studied extensively, and much is known about their solutions. For the moment the key thing to remember is the simple way that they follow from the TQ relation. If a similar functional equation can be found in another context, we can hope to establish a rather detailed connection with the already-existing body of knowledge on integrable models.
3. Second prologue: 7 ? T : -symmetric q u a n t u m mechanics We now make a change of tack, and turn our attention to some quantum-mechanical problems first investigated by Bessis and Zinn-Justin in the early 1990s [3], and then successively generalised in [4,5,17,24,25]. For the current discussion it will suffice to consider the following collection of non-Hermitian 'position space' Hamiltonians:
where M is a positive real number. The interest is in the spectrum of HMJ'- those values of E such that the ordinary differential equation HM,^ = Eip has a normalisable solution. By 'normalisable', for M < 2 we simply mean square-integrable on a contour running along the
350
PATRICK DOREY, CLARE DUNNING, ADAM MILLICAN-SLATER, ROBERTO TATEO
14-
la: I = -0.025
lb: I = -0.001
Figure 1. Eigenvalues of HM,^ = Ei/>, plotted as a function of M at I = —0.025 and I = —0.001. real axis, distorted just below the origin to avoid the singularity there when 1(1+1) ^ 0. For M > 2, the contour must be further distorted to ensure the correct analytic continuation, as explained in [4]. In figure 1 we show the spectrum as a function of M for two negative values of I. Note that M = 1 corresponds to the exactly-soluble simple harmonic oscillator with angular momentum, and that this point marks a profound change in the nature of the spectrum: for M > 1 it is entirely real, while for M < 1 infinitely-many eigenvalues pair off and become complex. The reality of the spectrum for M = 3/2, I = 0 was the subject of Bessis and Zinn-Justin's original conjecture [3], which was extended to all M > 1, I = 0 by Bender and Boettcher in [4]. However a complete proof was surprisingly elusive, and has only recently been given, making essential use of the ODE/IM correspondence [24]. The nature of the transition to complex eigenvalues as M falls below 1 looks rather simple on figure la, but less so once figure lb has been examined. In fact, as shown in figure 2, for I > 0 the connectivity pattern seen in figure l a is completely reversed. Some further discussion of this behaviour can be found in [17,29], while the even-richer structure which emerges when a suitably-chosen inhomogeneous term is added to the potential is discussed in [24,25]. As emphasised in [4,5], a key feature of the Hamiltonians (7) is that, while not Hermitian, they are unchanged by the combined action of parity V and time-reversal T — whence the name 'PT-symmetric quantum mechanics. We hope to have demonstrated that the subject contains many interesting phenomena for the mathematical physicist to explore; see [30-49] and references therein for some further developments in the area. However, from the point of view of this article, its main interest lies in its surprising links with the six-vertex model discussed in section 2 above, which we shall now describe.
Differential equations and the Bethe ansatz
2a: 1 = 0
351
2b: I = 0.001
Figure 2. Eigenvalues of HM,IIP = Etjj, plotted as a function of M at I = 0 and I = 0.001.
4. Spectral determinants and t h e correspondence Rather than worrying about individual eigenvalues of (7), it turns out to be useful to combine them into a single function — a spectral determinant — and then exploit the analytic properties of this function. This approach was particularly advocated in the differential equation context by Sibuya [1] and Voros [2]. Here, influenced by parallels with the theory of integrable models, we shall use spectral determinants to couple the non-Hermitian problem discussed so far with a related Hermitian problem, obtained from (7) by sending x —> x/i and E —» —E. The new problem is ax*
xz
J
$(z) = E${x),
(8)
and we choose as boundary conditions that the solution should vanish as x —> oo along the real axis, and behave as xl+1 as x —•> 0. For 3fte Z > —1/2, this problem is Hermitian. In the language of ordinary differential equations in the complex domain, it is sometimes called a 'radial' problem, while (7) is called a 'lateral' problem. (Note, the radial problem can also be considered for Stel < —1/2, but is best then defined by analytic continuation in I.) Let {Ej} be the set of eigenvalues of (7), and let {e,} be the eigenvalues of (8). Then define two spectral determinants, as follows: E T(E) = T(0) fl ,1 + E~
(9)
j=o
and
Q(E) = Q ( 0 ) n ( l i=0
E
(10)
352
PATRICK DOREY, CLARE DUNNING, ADAM MILLICAN-SLATER, ROBERTO TATEO
Both products are convergent for M > 1, and define entire functions of E, and their zeroes (or, for T(E), their negatives) coincide with the eigenvalues of the corresponding spectral problem. For M < 1 convergence factors must be added, and it is more efficient to define the spectral determinants indirectly, via certain special 'Sibuya' solutions to (8), which are anyway needed to prove the key identity (11) below; see [1,17] for more details. By considering the asymptotic behaviour of the Sibuya solutions in the complex plane it is not hard, following the arguments given in [17], to establish a Stokes relation, from which one can obtain the following functional equation T(E) Q(E) = u-V'+V/iQiv-iE)
+ ^2l+1^2Q(u2E),
(11)
where w = ein^M+1\ This shows that the spectral problems (7) and (8) are related by much more than a simple change of variables and boundary conditions; and furthermore, that this relation is encoded precisely in a TQ relation of the sort which had previously arisen in the context of the six-vertex and sine-Gordon models.
5. Consequences of t h e correspondence If we set 01 = 1/(M+1) and p = (2/+l)/(4M+4), then the two TQ relations (4) and (11) match perfectly. The remaining ambiguity in the solutions of the resulting Bethe ansatz equations is resolved once we recall that the problem (8) is Hermitian, and so all of its eigenvalues e^ are real. Given the remarks about the locations of Bethe roots at the end of section 2, this means that the spectral determinant Q(E) arising in the ordinary differential equation should be identified with the Q-function Q(£) for the ground state of the sine-Gordon model. (We must also match the normalisations of E and £, since these are left undetermined by the TQ relations. This is easily achieved by comparing the leading asymptotic of the field theory Q-function given in [10] with that obtained by a simple WKB analysis of the ordinary differential equation — see [6,17] for more details.) Once the Qfunctions have been identified, so can be the T-functions, and we have our main punchline: The spectral determinant T for the lateral (PT-symmetric) problem (7) is equal to the ground state, or vacuum, eigenvalue of the transfer matrix T of the massless sine-Gordon model on a cylinder. Together with the corresponding statement for Q, this provides a powerful tool to analyse the functions constructed in the context of integrable quantum field theory in [9-11], as the analytic properties of solutions of ordinary differential equations are relatively easy to control. (In this regard, we should mention that, for a complete proof of the correspondence, it currently seems best to argue via the so-called 'quantum Wronskian' relation, as in [6,14]. However the route we have adopted here is perhaps more intuitive, and it leads naturally on to the reality proof which we shall describe shortly.) There are also less direct applications of the correspondence — for example, it yields a proof [14] of a duality relation in quantum Brownian motion, first proposed in [50]. In finishing this contribution, we mention two more, both on the ordinary differential equations side of the correspondence. The first concerns the reality properties of the 'PT-symmetric problems (7). We have already seen, in section 2, the argument which shows that substituting E = ej into the TQ
Differential equations and the Bethe ansatz
353
relation (11) leads to Bethe ansatz equations, which in the light of the correspondence we can reinterpret as coupling the different eigenvalues of the Hermitian problem (8). However, if we instead set E = —Ej, the same reasoning leads, in the region M > 1 for which the factorised form (10) applies, to
n(££|)—-"-•
<«>
This equation couples the so-far mysterious eigenvalue Ej of the PT-symmetric problem (7) to the much better-controlled, and indeed real and positive, eigenvalues e; of the Hermitian problem (8). If we take the modulus 2 of both sides, it is soon seen that the only way that the resulting equality can be achieved is for Ej to be real, as had previously been conjectured. Notice, the proof breaks down for M < 1, as has to be the case given the numerical findings of [4,17], illustrated in figures 1 and 2 above. More details, in a slightly more general setting than that described here, can be found in [24]; for a further generalisation of the method, see [51]. The second application was also found in [24], but is a little more specialised. We consider the radial problem at M = 3, and add an inhomogeneous term ax2 to the potential. Although this was not done in the original paper, it will also be convenient to reparametrise the angular-momentum term by setting A = ^(21+1), so that the equation becomes
(-W
+ x + ax2 +
*
(13)
T ^ r ) *(JB) = mx)-
Via the Bethe ansatz approach, it turns out that this problem has a relationship with a third-oider ordinary differential equation: (D(g2 - 2) D(9l - 1) D(g0) + x3)4>=^-
E<j>,
(14)
where D(g) = (d/dx — g/x), and go = 1 + (a + V3A)/4,
$i = l + a / 2 ,
g2 = 1 + {a - \/3A)/4.
(15)
This third-order equation is associated with SU(3) Bethe ansatz equations, as discussed in [18,21]. Furthermore, the third-order equation is symmetrical in {go, 9\, 2}t a feature which is completely hidden in the original second-order equation. By playing with this symmetry, one can establish some novel spectral equivalences between different (second-order) radial problems, and also between these and certain lateral problems. More details and explicit formulae can be found in [24], and here we simply add to that paper the remark that, when expressed in terms of the variables (a, A), the mappings turn out to act as certain 2 x 2 matrices in the Weyl group of SU(3). Other aspects of the ODE/IM correspondence are still being developed as this contribution is being written. It would be nice to have a working correspondence also for finite lattice systems, before the thermodynamic limit is taken, and some small progress in this direction will be reported in [52]. Another important question that until recently remained open concerns the other states in the quantum field theories, besides the ground state, and whether they can also be matched with differential equations. In a very recent paper [53], Bazhanov,
354
PATRICK D O R E Y , CLARE DUNNING, ADAM MILLICAN-SLATER, ROBERTO TATEO
Lukyanov and Zamolodchikov have answered this in the affirmative. T h e 'Q-potentials' t h a t they construct are no longer real, even for the radial problems. This is not as surprising as it might seem — recall t h a t it is only for the ground state t h a t t h e Bethe roots, which correspond to the eigenvalues of t h e radial spectral problem, all lie on the real axis. For other states there are complex roots, and so even the radial problems can no longer be Hermitian. Finally, it would b e valuable t o have a more physical understanding of the relationship between integrable q u a n t u m field theories and ordinary differential equations. At the moment this remains mysterious, b u t sufficiently-many examples of t h e phenomenon have now been collected t h a t progress may not be too far off.
Acknowledgments P E D t h a n k s the organisers for the invitation t o speak at the conference; T C D , AM-S and R T t h a n k t h e UK E P S R C for a Research Fellowship, a Studentship and an Advanced Fellowship respectively. This work was partially supported by the E C network "EUCLID", contract number HPRN-CT-2002-00325.
References 1. Y. Sibuya, Global Theory of a second-order linear ordinary differential equation with a polynomial coefficient, Amsterdam: North-Holland, 1975. 2. A. Voros, J. Physique Lett. 43, LI (1982); Ann. Inst. Henri Poincare X X X I X , 211 (1983); J. Phys. A: Math. Gen. 32, 5993 (1999); Corrigendum, J. Phys. A: Math. Gen. 34, 5783 (2000). 3. D. Bessis, J. Zinn-Justin, unpublished, circa 1992. 4. C. M. Bender, S. Boettcher, Phys. Rev. Lett. 80, 4243 (1998). 5. C. M. Bender, S. Boettcher and P. N. Meissinger, J. Math. Phys. 40, 2201 (1999). 6. P. Dorey, R. Tateo, J. Phys. A: Math. Gen. 32, L419 (1999). 7. R. J. Baxter, Ann. Phys. 76, 1 (1973); Ann. Phys. 76, 25 (1973); Ann. Phys. 76, 48 (1973). 8. A. Kliimper, P. A. Pearce, J. Stat. Phys. 64, 13 (1991). 9. V. V. Bazhanov, S. L. Lukyanov, A. B. Zamolodchikov, Comm. Math. Phys. 177, 381 (1996). 10. V. V. Bazhanov, S. L. Lukyanov, A. B. Zamolodchikov, Comm. Math. Phys. 190, 247 (1997). 11. V. V. Bazhanov, S. L. Lukyanov, A. B. Zamolodchikov, Comm. Math. Phys. 200, 297 (1999). 12. P. Dorey, C. Dunning, R. Tateo, JHEP Proceedings PRHEP-tmr 2000/034, Nonperturbative Quantum Effects 2000, arXiv:hep-th/0010148 . 13. P. Dorey, C. Dunning, R. Tateo, Proceedings of the Johns Hopkins workshop on Current Problems in Particle Theory 24, Budapest, 2000, World Scientific, 2001. 14. V. V. Bazhanov, S. L. Lukyanov and A. B. Zamolodchikov, J. Stat. Phys. 102, 567 (2001). 15. J. Suzuki, J. Phys. A: Math. Gen. 32, L183 (1999). 16. P. Fendley, Lett. Math. Phys. 49, 229 (1999). 17. P. Dorey, R. Tateo, Nucl. Phys. B 563, 573 (1999). 18. P. Dorey, R. Tateo, Nucl. Phys. B 571, 583 (2000). 19. J. Suzuki, J. Phys. A: Math. Gen. 3 3 , 3507 (2000). 20. J. Suzuki, J. Stat. Phys. 102, 1029 (2001). 21. P. Dorey, C. Dunning, R. Tateo, J. Phys. A: Math. Gen. 33, 8427 (2000). 22. J. Suzuki, "Stokes multipliers, Spectral determinants and T-Q relations", a r X i v : n l i n - s i / 0009006. 23. K. Hikami, Nucl. Phys. B604, 580 (2001). 24. P. Dorey, C. Dunning, R. Tateo, J. Phys. A: Math. Gen. 34, 5679 (2001). 25. P. Dorey, C. Dunning, R. Tateo, J. Phys. A: Math. Gen. 34, L391 (2001).
Differential equations and the Bethe ansatz
355
26. V. V. Bazhanov, A. N. Hibberd, S. M. Khoroshkin, "Integrable structure of W3 conformal field theory, quantum Boussinesq theory and boundary affine Toda theory", arXiv:hep-th/0105177 . 27. V. V. Bazhanov, S. L. Lukyanov, A. M. Tsvelik, "Analytical results for the Coqblin-Schrieffer model with generalized magnetic fields", arXiv:cond-mat/0305237 . 28. S. L. Lukyanov, A. B. Zamolodchikov, "Integrable circular brane model and Coulomb charging at large conduction", arXiv:hep-th/0306188 . 29. P. Dorey, A. Millican-Slater, R. Tateo, in preparation. 30. E. Delabaere, F. Pham, Phys. Lett. A 250, 25 (1998). 31. M. Znojil, Phys. Lett. A 259, 220 (1999). 32. G. A. Mezincescu, J. Phys. A: Math. Gen. 33, 4911 (2000). 33. M. Znojil, J. Phys. A: Math. Gen. 33, 6825 (2000). 34. E. Delabaere, D. Trinh, J. Phys. A: Math. Gen. 33, 8771 (2000). 35. B. Bagchi, F. Cannata, C. Quesne, Phys. Lett. A 269, 79 (2000). 36. C. M. Bender, M. Berry, P. N. Meisinger, Van. M. Savage, M. Simsek, J. Phys. A: Math. Gen. 34, L31 (2001). 37. C. M. Bender, Q. Wang, J. Phys. A: Math. Gen. 34, 3325 (2001). 38. G. A. Mezincescu, J. Phys. A: Math. Gen. 34, 3329 (2001). 39. C. R. Handy, X. Q. Wang, J. Phys. A: Math. Gen. 34, 8297 (2001). 40. V. M. Tkachuk, T. V. Fityo, J. Phys. A: Math. Gen. 34, 8673 (2001). 41. C. M. Bender, S. Boettcher, H. F. Jones, P. Meisinger, M. Simsek, Phys. Lett. A 291, 197 (2001). 42. F. Cannata, M. Ioffe, R. Roychoudhury, P. Roy, Phys. Lett. A 281, 305 (2001). 43. G. S. Japaridze, J. Phys. A: Math. Gen. 35, 1709 (2002). 44. R. Kretschmer, L. Szymanowski, "The Interpretation of quantum-mechanical models with nonHermitian Hamiltonians and real spectra", arXiv:quant-ph/0105054. 45. A. Mostafazadeh, J. Math. Phys. 4 3 , 205 (2002). 46. C. M. Bender, D. C. Brody, H. F. Jones, Phys. Rev. Lett. 89, 270401 (2002). 47. A. Mostafazadeh, J. Math. Phys. 44, 974 (2003). 48. C. M. Bender, P. N. Meisinger, Q. Wang, J. Phys. A: Math. Gen. 36, 1973 (2003). 49. S. Weigert, "Completeness and orthonormality in 'PT-symmetric quantum systems", arXiv: quant-ph/0306040. 50. M. P. A. Fisher, W. Zwerger, Phys. Rev. B 32, 6190 (1985). 51. K. C. Shin, Comm. Math. Phys. 229, 543 (2002). 52. P. Dorey, J. Suzuki, R. Tateo, in preparation. 53. V. V. Bazhanov, S. L. Lukyanov, A. B. Zamolodchikov, "Higher-level eigenvalues of Q-operators and Schrodinger equation", arXiv:hep-th/0307108 .
Correlation functions of the XX Z spin-1/2 chain: recent progress JEAN-MICHEL MAILLET (ENS
Lyon)
We review recent progress in the computation of correlation functions of the XX Z spin-1/2 chain and their long distance asymptotic behavior. Our method is based on the resolution of the quantum inverse scattering problem in the framework of the algebraic Bethe ansatz. It leads to multiple integrals representations of the correlation functions in the context of which we discuss their long distance asymptotic behavior.
1. Introduction The main challenging problem in the field of quantum integrable models [1-3] is to compute exact and manageable expressions for their correlation functions. This issue is of great importance not only from a theoretical view point but also for applications to relevant physical situations. Although several important advances have been obtained over the years, we are still looking for a general method that could give a systematic approach to this problem, in particular concerning the long distance asymptotic behavior of the correlation functions. Besides obvious physical applications, such results would give "of shell" informations on the conformal and massive limits of the corresponding lattice integrable models. The purpose of this article is to give a review of an approach to this problem elaborated in [4,5] and of recent progress obtained in [6-9]. In our search for a general method to compute correlation functions of quantum integrable models our strategy is to consider a simple but representative model where it is possible to develop new tools to approach to this problem. Such an archetype of quantum integrable lattice models is provided by the XXZ spin-^ Heisenberg chain in a magnetic field with Hamiltonian M
H= £
( «
+ 1
+
+ 1
- 1)) - hSz.
(1)
m=l
Here A is the anisotropy parameter, h an external classical magnetic field, cr^ y ' z are spin operators (in the spin-| representation), associated with each site m of the chain, Sz is the third component of the total spin commuting with H and we assume periodic boundary conditions. In the thermodynamic limit, M —> oo and at zero magnetic field, the model exhibits three different regimes depending on the value of A [1]. For A < —1, the model is ferromagnetic, for —1 < A < 1, the model has a non degenerated anti ferromagnetic ground state, and no gap in the spectrum (massless regime), while for A > 1, the ground state is twice degenerated with a gap in the spectrum (massive regime). Although the method to compute eigenstates and energy levels goes back to H. Bethe in 1931 [10-13], the knowledge of its spin correlation functions was for a long time restricted to the free fermion point A = 0, a case for which nevertheless tremendous works have
356
Correlation functions of the XXZ spin-1/2 chain: recent progress
357
been necessary to obtain full answers [14-17]. For generic A, at zero temperature, and for zero magnetic field, multiple integral representation of elementary blocks of the correlation functions (see definition below) have been obtained from the g-vertex operator approach (also using corner transfer matrix technique) in the massive regime A > 1 in 1992 [18] and conjectured in 1996 [19] for the massless regime - 1 < A < 1 (see also [20]). A proof of these results together with their extension to non-zero magnetic field has been obtained in 1999 [4,5] for both regimes using algebraic Bethe ansatz [21,22] and the actual resolution of the so-called quantum inverse scattering problem [4,23]. Let us denote by \tpg) the normalized ground state in the massless regime and any one of two ground states constructed by the algebraic Bethe ansatz in the massive regime. Let E^'c,n be the elementary operators acting at site m as the 2 x 2 matrices E^ = S^e'Sk,e- Any n-point correlation function can be reconstructed as a sum of the following elementary blocks, m
Fm({Ci,^.}, h) = {i)g\\\E^\i)g).
(2)
To compute them, the following successive problems have to be addressed: (i) determination of the ground state (ipg\, (ii) evaluation of the action of the product of the local operators on it, and (Hi) computation of the scalar product of the resulting state with \ipg). For the XXZ spin-^ Heisenberg chain in a magnetic field, this problem has been solved in [4,5] in the framework of algebraic Bethe ansatz. A short review of our method will be given in section 2. In section 3 we give the main features of the programme for the asymptotic analysis of physical correlation functions (like spin-spin correlations) which we started in the papers [6,7]. Section 4 is devoted to a particular correlation function, the so-called emptiness formation probability for which both exact [8] and asymptotic [9] results have been derived recently. In the last section we give some conclusions and perspectives.
2. Correlation functions: elementary blocks The central object of the algebraic Bethe ansatz method [21] is the so called quantum monodromy matrix depending on a complex variable A (the spectral parameter). For the XXZ spin-^ chain it is a 2 x 2 matrix with operator valued entries acting in the quantum space of states of the chain,
The quadratic commutation relations between these four operators are given by the YangBaxter algebra for the T matrices governed by a trigonometric R matrix solving the YangBaxter equation. The Hamiltonian of the chain is then contained in the commutative family of operators generated by the transfer matrix t(X) = (A+D)(X) for arbitrary values of A. The algebraic Bethe ansatz leads to the simultaneous diagonalization of these transfer matrices. In particular, the ground state {ipg\ (resp. \ipg)) is given (up to suitable normalization) as the successive action of operators C(Xk) (resp. -B(Afc)) on the ferromagnetic reference state (0| (resp. |0)), the state with all spin up, for a particular set of spectral parameters {Afc} solving the Bethe equations.
358
JEAN-MICHEL MAILLET
To evaluate the action of local operators on this state, the strategy is to imbed them in the Yang-Baxter algebra of T matrices by solving the quantum inverse scattering problem (see [4,23] for details) as,
fc=i fc=i
where cosh 77 = A. Then, using the Yang-Baxter algebra, we reduce the computation of any elementary block to multiple sums of scalar products of some states with \tpg). Each of these scalar products can be computed as the ratio of two explicit determinants [4,24]. In the thermodynamic limit, the multiple sums lead to m fold integrals over contours Cj1 which are depending on Cj-'s, on the regime considered and also on the value of the magnetic field. Indeed in this limit, the set of parameters {A.,} building up the ground state can be described by a density functions ptotMi ^ being in the some interval [—A/,,A/i], and the Bethe equations reduce to an integral equation for /Otot(^)- For technical purposes, we introduce the inhomogeneous density p(X, £) as the solution of [13]
-2irip(\,£) + [ K(\-ti)p(ti,Odfi
= t(\,Z),
(5)
Jc where K(y. 1
sinh2?7 sinhry = l W {) ' ~sinh(A + 7 ? )sinh(A-7 ? )' ' sinh(A - n) sinh(A - n + ry)' It coincides with ptotW at £ = 77/2. For our goals it is enough to consider — £ < Im(£) < 0 for - 1 < A < 1 and r\ < Re(£) < 0 for A > 1. The integration contour C = [-Aft,A/,] depends on the regime considered. In the massless case — 1 < A < 1, the contour C is an interval of the real axis and the parameter 77 is imaginary: rj = —i£, £ > 0. In particular, at h —> 0, Ah —* 00, and the above integral equation can be solved explicitly. For A > 1 (77 < 0) the limits ±A^ are imaginary, which means that the integral in (5) is taken over an interval of the imaginary axis. At h — 0, A/j = —iir/2 and the solution of the integral equation is given in terms of theta-functions. Finally, any elementary block can be written generically as [5] =
)
Fmdej^'j},
h) = J ] /
h
d\j nm({\},
{ej, e;}) Sh({\}).
(7)
Here fim({A}, {e^,^}) is a purely algebraic quantity, in particular not depending on the regime or on the magnetic field. 5/i({A}) is depending on the density function p/j(A). It is remarkable that for zero magnetic field the above multiple integral representations found from the g-vertex operator approach [18,19] and from Bethe ansatz [4,5] are identical both in the massive and in the massless regimes (contours and integrands are the same). It would be very desirable to understand this intriguing fact directly at the operator level. Note that for non zero magnetic field, the quantum affine symmetry used in the (/-vertex operator approach is broken, and no result is known up to now from this method in this case.
Correlation functions of the XXZ spin-1/2 chain: recent progress
359
3. Physical correlation functions In principle, any n-point correlation function can be obtained from the elementary blocks described in the previous section. However, although these formulas are quite explicit (see [4, 5,18,19]), their actual analytic computation is essentially missing up to now despite some recent works [25,26]. Moreover, the evaluation of correlation functions of physical relevance, as for example the spin-spin correlation functions at distance m on the lattice like, (&tcrrn+i)> 1S Quite involved. Indeed the identity 771
+
2
(8)
shows that the corresponding spin-spin correlation function is actually given as the sum of 2 m _ 1 elementary blocks, making the problem of asymptotic behavior at large distance extremely difficult to solve in these settings. Using the solution of the quantum inverse scattering problem, this question amounts to compute the following average value,
{%\c(V){A + D)m-l(V)B(V)\*g).
(9)
To obtain manageable (re-summed) formulas for spin-spin correlation functions we have derived a compact formula for the multiple action of transfer matrix operators (A + D)(xa) for any set of spectral parameters xa and on arbitrary quantum states of the XXZ model in [7]. The above spin-spin correlation function at lattice distance m is then given as the sum of only m terms (instead of 2 m _ 1 ) . For example, K < + i ) =[2Dl-—- - *
-4D Wm— A + 1 )
(10)
(exp(/3Q1,ro))
with the first and the second lattice derivatives Dmf(m)
D2mf(m) = f(m + l) + f(m-l)-2f(m),
= f(m + l)-f(m),
(11)
and
(exp(/3Q, ( e x p ( W l , m ) >))•••• ^T
l
dZj
if\ cd fl\2nij
+ T1,2) Sinh(A v/2) f dn\ AFT (ShlHZa l_lU i n h ( , a - ? ? / 2 ) s i n h ( A °a +~? ? / 2 ) ; Y
x Wn({X},{z}) Here
• detn[Mjk({X}\{z})]
.detn^-.Zfc)].
wn({x},{z})=nnsinuxr?ttinh^6~XaV\> a=i b = i
smh
( ^ o - At + ri) sinh(z a - zb + 77)
(12)
~^
and A*,*({A}|{z}) = t(zk, A,) + e^(A,,, f c ) f[ S ^ ~ *' + "> ^ ~ *" + "\ . J ij- smh(Aj - Aa + 77) smh(z a - Xj + 77)
(14)
The contour C is as in the integral equation (5), it depends on the regime and on the value of the magnetic field, and V surrounds the point 77/2. The main property of this result is at
360
JEAN-MICHEL MAILLET
first an effective re-summation of the corresponding 2 m + 1 elementary blocks into only m + 1 multiple integrals; moreover the integrand of each terms in this sum is now containing the distance as the power of a simple function of the integration variables A and z. This feature will be important in future asymptotic analysis of such correlation functions. At the free fermion point A = 0, our formula gives the known answers in a very simple way. First of all, the matrix Mjk becomes proportional to the Cauchy matrix
(I5)
*»«*"<•»-J&Ar <-!•
and, hence, can be computed. Moreover, detnM,-fc is proportional to (eP — 1)": this means that, if one takes the first (respectively the second) derivative with respect to (3 and sets /? = 0 to get the az two point function, only the terms n < 1 (respectively n < 2) in the sum (12) do not vanish, namely, {Qi,m) a n d (Qi, m )- We have (Qi m) = — arctan(sinh 2A) =
,
(16)
where po is such that cosh 2A = (cospo) - 1 and p0 = arccos (*). Similarly, ,2
D2
1
--^.a-™82^).
m(Qlm) = 2(^)
(17)
We finally obtain K<+i> = (^ 7 - 0
--^-j(l-cos2mpo).
(18)
This result was obtained in [15]. For zero magnetic field, p0 — 7r/2, and the constant contribution to the correlation function disappears and we find \al°m+l
)= ^ ( ( -
1
)
m
-
1
) '
«** = <>.
(19)
4. The emptiness formation probability The emptiness formation probability T{rri) (the probability to find in the ground state a ferromagnetic string of length m) is defined as the following expectation value 171
Z
1
r(m) =
(20)
l
where \i/jg) denotes the normalized ground state. As explained in the previous sections, in the thermodynamic limit (M —> oo), this quantity can be expressed as a multiple integral with m integrations; for A = cos£, 0 < C < 7r> one has r(m)=
lim
T(m,{^}),
(21)
where
m
=
^f
*•«»•«»
det„ (
J
\^
(22)
Correlation functions of the XXZ spin-1/2 chain: recent progress
361
with m
Apt
m
r7 ^r\i sew _ TT TT s i n h ( A 0 ~ ft) sinh(Aa - & - i() smh(Aa - A, - zO
uet
I
— ^ sin C
)
"» ^sinh^-^Osinh^-^-iC),/ n ginh(eo _ 6 ) a>b
(23) This correlation function is particularly interesting from the methodological view point: although simpler than the spin-spin correlation functions (it involves only one multiple integral) and less relevant for physical applications, it exhibits one of the main problem we encounter in the asymptotic analysis of generic correlation functions at distance m —> oo, namely, the number of integrations growths like the distance m while the integrand contains obviously functions to the power m or even m 2 . It is therefore an interesting laboratory to test various asymptotic methods that we would like to use in more general (and physical) situations. For this quantity, we have obtained two explicit results. First, for A = \ (( = 7r/3), due to the symmetry properties of the integrand and the fact that the periods in the hyperbolic functions are commensurate (which is not the case for generic £), it is possible to actually separate the m integrations. Then, the result can be obtained as the determinant of size m of single integrals. Then for this particular value of C, this determinant can be computed exactly using a formula due to Kuperberg [27]. We have obtained,
Observe that the quantity Am = YlT^ofik + l ) ' / ( m + &)' i s t n e number of alternating sign matrices of size m [28]. This result was conjectured in [29]. The asymptotic behavior of T(m) for m —> oo can then be evaluated simply using the Stirling formula: 5
36 T(m) —> c( — ) mm~36, , - < $ )
— oo, mm— >>oo,
(25)
with ' f°° exp
h
(be-'
\ 36
sinh H sinh -fe \ dt sinh2! ) t
(26)
For general values of A, the above multiple integral representation for T(m) has a symmetric integrand with respect to the A variables which contains exponential m 2 factors; this makes possible to use the saddle point method to evaluate its leading asymptotic behavior. It gives an explicit formula in complete agreement with exact known answers for ( = | and C = § (respectively A = 0 and A = 1/2). We first reduce the integration domain to— o o < A i < A 2 < - " < A m < o o using the symmetry of the integrand. Following the standard arguments of the saddle point approach we estimate the integral (22) by the maximal value of the integrand. Let {A'} be the set of parameters corresponding to this maximum. They satisfy the saddle point equations and for large m we assume that their distribution can be described by a density function p(X'), p(X'j)=
lim ——
-r
(27)
362
JEAN-MICHEL MAILLET
Thus for large m, one can replace sums over the set{A'} by integrals. To express Zm({A},{£}) in a suitable form for the saddle point equations, we use the following relation valid for large m (77 = -i(), detmi(A^) - ( - 2 « r d ^ ( ^ - 1 ^ ^ )
deu(
2 C s
.
n h f
^_
a )
),
(28)
where K and t are defined in (6). The saddle point analysis can then be performed in a straightforward way (see [9] for details) leading to the result li m l o g ^ K ) . _ ! * 8 m->oo m 2 C
11 f 2 7
^ sinh |(TT - Q cosh2 ^ w sinh^sinh^coshwC'
where cos £ = A, 0 < £ < 7r. If £ is commensurate with 7r (in other words if e*^ is a root of unity), then the integral in (29) can be computed in terms of V'-function (logarithmic derivative of T-function). In particular for £ = 7r/2 and £ = 7r/3 (respectively A = 0 and A = 1/2) we obtain from (29) log Tim) 1, . \ ' =--log2, A = 0, log r(m) 3, 1 A hm ^ ' =-log3-31og2, A = -, m-»oo m J 2 2 which coincides with the exact known results obtained respectively in [7,30,31] and in [8,29]. And for the particular case of the XXX chain (A = 1, £ = 0), lim
lim 12I2M = log (EiffilT) „ log(0.5991),
m-oo
m2
6
V
T{\)
(30)
which is in good agreement with the numerical result log(0.598), obtained in [32,33].
5. Conclusions In this paper we have reviewed a new approach to the computation of correlation functions of lattice integrable models. Generic tools valid for a large class of models have been constructed (like the solution of the quantum inverse scattering method [4,23]) and explicit results obtained for the XXZ Heisenberg spin-1/2 chain. Although hard work is still ahead of us, our method opens new perspectives towards the computation of asymptotic behavior of correlation functions (under study) and of the dynamical correlation functions (also under study). We can now hope that applications to many different models (models with impurities, with boundaries, field theories, ...) could be addressed in this framework in the near future.
Acknowledgments I would like to thank N. Kitanine, N. Slavnov and V. Terras for their longstanding collaboration on these difficult but very exciting problems.
Correlation functions of the XXZ
spin-1/2 chain: recent progress
363
References 1. R. J. Baxter, Exactly solved models in statistical mechanics, Academic Press, London-NewYork, 1982. 2. M. Gaudin, La Fonction d'Onde de Bethe, Paris: Masson, 1983. 3. E. H. Lieb, D. C. Mattis (eds.) Mathematical physics in one dimension, New York: Academic Press, 1966. 4. N. Kitanine, J. M. Maillet, V. Terras, Nucl. Phys. B 554 [FS], 647 (1999); arXiv:math-ph/ 9807020. 5. N. Kitanine, J. M. Maillet, V. Terras, Nucl. Phys. B 567 [FS], 554 (2000); arXiv:math-ph/ 9907019. 6. N. Kitanine, J. M. Maillet, N. A. Slavnov, V. Terras, Nucl. Phys. B 641 [FS], 487 (2002); arXiv:hep-th/0201045. 7. N. Kitanine, J. M. Maillet, N. A. Slavnov, V. Terras, Nucl. Phys. B 642 [FS], 433 (2002); arXiv:hep-th/0203169. 8. N. Kitanine, J. M. Maillet, N. A. Slavnov, V. Terras, J. Phys. A: Math. Gen. 35, L753-L758 (2002); arXiv:hep-th/0201134. 9. N. Kitanine, J. M. Maillet, N. A. Slavnov, V. Terras, J. Phys. A: Math. Gen. 35, L385-L388 (2002); arXiv:hep-th/0210019. 10. H. Bethe, Zeitschrift fur Physik 71, 205 (1931). 11. R. Orbach, Phys. Rev. 112, 309 (1958). 12. L. R. Walker, Phys. Rev. 116, 1089 (1959). 13. C. N. Yang, C. P. Yang, Phys. Rev. 150, 327 (1966). 14. E. Lieb, T. Shultz, D. Mattis, Ann. Phys. 16, 407 (1961). 15. B. M. McCoy, Phys. Rev. 173, 531 (1968). 16. B. M. McCoy, C. A. Tracy, T. T. Wu, Phys. Rev. Lett. 38, 793 (1977). 17. M. Sato, T. Miwa, M. Jimbo, Pub RIMS 14, 223 (1978); 15, 201 (1979); 15, 871 (1979); 16, 531 (1980). 18. M. Jimbo, K. Miki, T. Miwa, A. Nakayashiki, Phys. Lett. A 168, 256 (1992). 19. M. Jimbo, T. Miwa, J. Phys. A 29, 2923 (1996). 20. M. Jimbo, T. Miwa, Algebraic analysis of solvable lattice models, AMS, 1995. 21. L. D. Faddeev, E. K. Sklyanin, L. A. Takhtajan, Theor. Math. Phys. 40, 688 (1980). 22. L. A. Takhtajan, L. D. Faddeev, Russ. Math. Surveys 34, 11 (1979). 23. J. M. Maillet, V. Terras, Nucl. Phys. B 575, 627 (2000); arXiv:hep-th/9911030. 24. N. A. Slavnov, Theor. Math. Phys. 79, 502 (1989). 25. H. E. Boos, V. E. Korepin, F. A. Smirnov, Nucl. Phys. B 658, 417 (2003). 26. K. Sakai, M. Shiroishi, Y. Nishiyama, M. Takahashi, Phys. Rev. E 67 (2003). 27. G. Kuperberg, Int. Math. Res. Notices 3, 139 (1996); arXiv:math. CO/9712207. 28. D. Zeilberger, arXiv:math.C0/9407211. 29. A. V. Razumov, Yu. G. Stroganov, J. Phys. A: Math. Gen. 34, 3185-3190 (2001); arXiv:cond-mat/0012141. 30. M. Shiroishi, M. Takahashi, Y. Nishiyama, J. Phys. Soc. Jap. 70, 3535 (2001); arXiv:cond-mat/0106062. 31. A. R. Its, A. G. Izergin, V. E. Korepin, N. A. Slavnov, Phys. Rev. Lett. 70, 1704 (1993). 32. H. E. Boos, V. E. Korepin, Y. Nishiyama, M. Shiroishi, J. Phys. A 35, 4443 (2002). 33. V. E. Korepin, S. Lukyanov, Y. Nishiyama, M. Shiroishi, arXiv:cond-mat/0210140.
Applications of a complete expansion for the partition function of random matrix theory KENNETH D. T.-R. MCLAUGHLIN (U. of North Carolina at Chapel Hill) We review some recent developments in random matrix theory, and establish a moderate deviation result for traces of powers of random matrices.
1. R a n d o m m a t r i x theory: u n i t a r y ensemble The unitary ensemble of random matrices refers to any probability distribution on the space of N x N Hermitian matrices of the form d/i = Z„l exp {-NTi
[V(M)]}dM,
(1)
where dM is Lebesgue measure on the matrix entries, i.e., N
dM=\{dMfkdM)kY[dMjj, where Mjj^ denotes the real part of the matrix entry Mjk, and M?fe denotes the imaginary component of the matrix entry Mj/.. In (1), there are many allowable choices for the function V. The simplest one is the quadratic V(M) = M 2 / 2 ; in this case the matrix entries (separately real and imaginary parts for off-diagonal entries) are independent Gaussians. But this is far from the only case of interest, and indeed the measure is equally well defined, for example, if V(x) is a lower semicontinuous function of x G R, bounded from below, and growing faster than (log (1 + x2))1+e as jarj —-> oo, for some e > 0. For any such function V, the measure (1) is invariant under arbitrary unitary transformations, and this is the origin of the term "unitary ensemble". (Note, however, that the matrix entries are no longer independent.) For more details, see [14]. In what follows we will assume the function V is a polynomial of degree u with v even: V(x) = Vt(x) = x2/2 + ] T ^ .
(2)
j=i
The vector of coefficients t will live in a cone defined as follows. For any given T > 0 and 7 > 0, set
T(T,7) = j t e l T : | t | < T , i*>7$2|tj|)-
(3)
It is a basic fact that the measure (1) induces a probability measure on the eigenvalues, with density
364
Applications of a complete expansion for the partition function ...
365
where Z^ = Zff(t) is the partition function:
ZN(t1,t2,...,tu) = ^.--j^xpl-N2
±JTvt(\j)-~Y/iog\\j-Xe\
\dN\. (5)
Most of the interest in random matrices from the unitary ensemble has concerned the statistical behavior of the eigenvalues, and their asymptotic behavior when TV, the size of the matrices in question, grows to oo. A great deal is known about the limiting statistical behavior of the eigenvalues, for a rather general class- of functions V. This is due to the explicit induced measure on eigenvalues (4), a remarkable connection between most (if not all) statistical questions concerning the eigenvalues, and orthogonal polynomials, due to Gaudin and Mehta [11], as well as recent developments from integrable systems which permit a complete asymptotic analysis of the polynomials [2,4-6]. Let {pj(x; N, t)}JL0 be the sequence of polynomials orthogonal with respect to the measure exp (—NVt(x))dx. That is, {pj(x; N, t)}^l 0 satisfies / ^ P j P u exp (-NV)dx = Sjk, and Pj(x; N, t) = j \ 'xi' -\ , 7 ^ ' > 0. (The leading coefficient 7^ is of course dependent on the parameters ti,...,t„; however, we suppress this dependence for notational convenience.) Investigations of the statistical properties of the eigenvalues {^j}jLi usually begin with the mean density of eigenvalues, defined as follows: p[N)(X) = (d/dX) E (AT-X# {j : A,- < A}),
(6)
where E(-) denotes integration with respect to the measure (4). The connection to orthogonal polynomials is expressed by the following formula: JV-l
p[N\\)
= TV"1 exp ( - W W ) ^ - ( A j i V . t ) 2 .
(7)
1=0
There are many other remarkable formulae expressing the explicit connection between eigenvalues statistics and the orthogonal polynomials, but for our purposes (7) will suffice. The partition function ZN is a generating function for moments of the eigenvalues, in the sense that by taking derivatives of ZN with respect to any of the i,- 's, one obtains moments of the fundamental random variables ¥>* = £ ^ -
(8)
71=1
Here are two examples that will prove extremely useful: dtklogZN alN
= -NE(
(9)
2
:= N~ dfk log ZN = Var (
(10)
Let us define normalized random variables <j>k as follows: <£fc = (
(11)
One may also use the partition function to express the moment generating function of
„
exp
(ifW
z„
(t _ _L_&)
/
z „ (t) ,
(12)
366
KENNETH D. T.-R. MCLAUGHLIN
where j k is the fcth unit vector in W, jk(i) = 6ik. Moreover, the partition function is directly related to p[ , through the following formula: {d/dtj) logZ N = -N2 [ Xjp[N)(X) dX.
(13)
2. Asymptotic behavior One of the results in [13] is a complete mathematical proof that the mean density of eigenvalues p\ ' possesses a weak limit. (While this result was known in some form previously, [13] contains a complete proof, in great generality.) The weak limit is a known object, namely the equilibrium measure, which arises in the consideration of the following variational problem: Q ( t ) : = s u p ( - /V t (A)d/i(A)+ [flog H€A L J
|A - V\ dfx(X) d^r,)}
JJ
,
(14)
)
where A is the set of all positive Borel measures on the real axis, with unit mass. This variational problem has been considered extensively in the approximation theory literature (see [16]). It is well known that the supremum is achieved at a unique measure \i* (a complete proof of this fact can be found in [16]). Moreover, for real analytic functions V, the equilibrium measure pL* is supported on finitely many disjoint intervals, and on the interior of each interval, it has analytic density [3]. We will denote the density of the equilibrium measure by \j), so that dfi* = ip(X) dX. The mean density p[ is an analytic function for each N, and so it is natural to ask if it converges to ip point-wise, or uniformly, and to determine the rate of convergence. This was carried out in [2], and in [4]. In [2], the authors considered the two parameter family of functions V = tX2 +gA4, and in [4], the authors considered the general case of real analytic functions V, with sufficient growth at oo. In [4], the following result was stated (the proof was given in [6], see also [15]). Theorem 2.1. [4] Suppose that V is a real analytic function, growing faster than (log(l + A 2 )) 1 + e for some e > 0. If A is a closed subset of the support of the equilibrium measure such that ^(X) is strictly positive on A, then there is a positive constant C such that for all N sufficiently large N) sup p[ (X)-^(X)
(15)
XeA
Ercolani and McLaughlin carried this analysis to higher order, under assumptions which guarantee, in particular, that the equilibrium measure is supported on a single interval. Their result is the following. Theorem 2.2. [9] Suppose that V is a polynomial of the form (2), with coefficients satisfying (3). Then the equilibrium measure is supported on a single interval (a,/?). On (a, j3), the density V> satisfies tp(X) = ^/(X — a)(/3 — A) h(X) where h is a polynomial which is strictly positive on K. If A is a closed subset of (a, /?), then there is a complete asymptotic expansion
Applications of a complete expansion for the partition function ...
367
/ of the form
^ w = *w ^(^-p-xh) 1 in which Hj(X) and Gj(\) terms V(\).
#2(A) + G2(A) sin
cos {N/X m ds (N
(16)
f V(s) ds
are locally analytic functions which are explicitly computable in
In [9] Ercolani and McLaughlin also obtained a complete asymptotic expansion for p\ ' valid in neighborhoods of endpoints of the support of ip. This was used, together with formula (13), to obtain the main result of that paper, which is a complete asymptotic expansion for logZjv- The result is the following: Theorem 2.3. [9] There is T > 0 and 7 > 0 so that for t G T(T, 7), one has the N -> 00 asymptotic expansion
log
(IS)) = iv2eo(t)+ei(t) + he*{t)+• • • •
(1?)
The meaning of this expansion is: if you keep terms up to order N~2k, the error term is bounded by CN~2k~2, where the constant C is independent oft for all t € T(T, 7). For each j , the function e.,(t) is an analytic function of the (complex) vector t, in a neighborhood of 0. Moreover, the asymptotic expansion of derivatives of log (Z^) may be calculated via term-by-term differentiation of the above series.
3. Application: Central limit theorem and moderate deviation result Another beautiful result in [13] is a proof that finite collections of the random variables 4>k defined in Section 1 converge to finite collections of independent Gaussian random variables. (It is worth mentioning that in [13] similar results are stated for other values of /3 as well.) In this Section we will present a new proof of this result, using the asymptotic expansion for ZM described in Section 2, and we will investigate moderate deviations of
(18)
as defined in (12). It is quite straightforward to verify that G\ := limAr_»oo o~\N = d2ke0(t) using Theorem 2.3. It is well known that eo(t) = Q(t), with Q defined in (14) (see, for example, [13] or [3]). Straightforward calculations show that cteo(t) = - 2 f
f
\og\x-y\{atkil))(x)(dthil>){y)dxdy=:Q{{dtkil>),{atkiP))>0.
(19)
368
KENNETH D. T.-R.
MCLAUGHLIN
The last inequality is because the quadratic form Q is positive definite on mean-zero functions. Similarly, one may verify that N~ldtk \ogZN = Ndtkeo(t) + ©(TV-1) as TV —> oo. Furthermore, the uniformity of the asymptotic expansion in Theorem 2.3, both with regards to the domain of allowable times, as well as the term-by-term differentiability of the series, allows one to establish s
3k No-k,N
ZN[t-
ZN(t)
= exP
IN2 e 0
t
SJk Nak,N
eo(t)
+
ei
t
Nak,N
ei(t)
+
0(N~i)
„2
= exp j - ^ e o ( t ) + ^|Sf f c eo(t) + O ^
1
)} .
Now combining our various asymptotic results, we find lim
2
FN(S)
/2
(20)
N—>oo
from which it follows that
-a2/2.
(21)
N—too
The proof of this Theorem was motivated by the proof of Cramer's Theorem as discussed in [12, pp. 5-7]. We emphasize that
exp<sz/2
(d?ke0 + N~2dfkei
+ N-4dfke2
+ •••)
W°k,N +
24iv^(< eo +
iV
"2^ei +
iV
" 4 ^ e2 + • • • ) + ••• } '
(22)
Applications of a complete expansion for the partition function ...
369
The above expansion holds true provided s/N tends to 0 when N —> oo. For example, the condition \s\ < N~< for any 0 < 7 < 1 will suffice. Now having control on the error allows us to estimate the tail of the distribution for (f>k, as follows: P (4>k > A) = P (e*^*-^
> l ) < FN{s)e~As,
(23)
provided s > 0. Now we may minimize F^(s)e~As. Analysis of the asymptotic expansion (22) shows that for 0 < A < AT?, 0 < 7 < 1, FN(s)e~A3 achieves its local minimum for some s* with |s* — A\ < JV - ' 1-2 " 1 '), and hence we can conclude that for any 0 < 7 < 1 and any fixed a > 0 lira AT"27 log [P (4>k > alT1)] < -a2/2. (24) To establish the lower bound, let the density function for
fN{x)dx
= e~aTNyFN{T)
Jx>aN~<
f
e~TXQN(x)
dx
Jx>0
> e-arJVTFw(r) /
e-TXQN{x)dx
> e~aTN^ FN(T) e-TCF(XN
€ (0,C)).
(25)
Now since X^ converges to a normalized Gaussian, we may chose C large enough (but fixed) so that F(XN e (0,C)) > 1/2. Now we may take logs, divide by AT27, and let N -> 00, to obtain lim AT 2 7 log [P (>fc > aN"1)} > -a2/2, (26) N—»oo
completing the proof of Theorem 3.1.
•
4. Discussion Were cfik a standardized sum of independent, identically distributed random variables, then the correct scaling for a large deviation result would be N1/2. In reality, Theorem 3.1 demonstrates that we can go much further into the tail of the distribution {N1 for any 0 < 7 < 1), and still observe Gaussian behavior. To really investigate the tail of the distribution, we should consider deviations of order N, rather than Ny for 0 < 7 < 1. Such considerations will be undertaken in a later publication, the result of which will be as follows. lim AT 2 log (P{fa > aN)) = 1(a),
(27)
N
where 7(a) =
min
e0 ( t
jk) - e 0 (t) + -dtke0(t)
- ax .
(28)
370
KENNETH D. T.-R.
MCLAUGHLIN
The function appearing in the square brackets above is certainly convex for x in its domain of definition. It can be shown that there is a > 0 sufficiently small so that if a G (0, a], then the minimum in (28) is achieved at a value of x such that t — (x/a)jk £ T(T, 7), and (27) holds for a € (0,5]. A rather different type of question has to do with corrections to the central limit theorem "in the bulk". In the case of sums of independent, identically distributed random variables, such results are known as Edgeworth expansions, or Berry-Eseen theorems (see, for example, [10]). As one can see from the asymptotic expansion (22), a better approximation to the moment generating function FN(S) is
Now FN(S) is the Laplace transform of a rather nice function: F^r(s) = where fN(x) = (27T)-1 J eXp f-z2/2 + * f f f * 3 + izx\ dz.
Lesxfjv(x)dx, (30)
This is a classical special function, it can be directly related to the Airy integral. It is expected that the asymptotic expansion for the moment generating function of 6k could yield a sequence of approximations to /JV(Z) for TV large, which improve on the leading order approximation (2ir)~1/2e~x I2 afforded by the central limit type result discussed above. However, obtaining fx from F^ requires knowledge of FN(S) (and its asymptotic expansion) along the imaginary s-axis. Thus another interesting direction of research involves asymptotic analysis of the partition function (and hence the orthogonal polynomials) for values of t in the complex domain.
Acknowledgments The research of K. D. T.-R. McLaughlin is supported in part by the National Science Foundation under grants DMS-9970328 and DMS-0200749. He thanks Sergio Volchan, Brian Rider, and Alexander Soshnikov for useful conversations, and Percy Deift and Michio Jimbo, for organizing the session "Integrable Systems and Random Matrix Theory" at the 2003 International Congress of Mathematical Physics, and also thanks the organizers of the International Congress for all their efforts. This work was completed while McLaughlin was visiting the Mathematics Department at the Pontiffcia Universidade do Rio de Janeiro, and he thanks the faculty and staff of that department for their kind hospitality.
References 1. 2. 3. 4.
D. Bessis, X. Itzykson, J. B. Zuber, Adv. Appl. Math. 1, 109-157 (1980). P. Bleher, A. Its, Ann. Math. 50, 185-266 (1999). P. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, J. Approx. Theory 95, 388-475 (1998). P. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, S. Venakides, X. Zhou, Int. Math. Res. Notices 16, 759-782 (1997). 5. P. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, S. Venakides, X. Zhou, Commun. Pure Appl. Math. 52, 1491-1552 (1999).
Applications of a complete expansion for the partition function . . .
371
6. P. Deift, T. Kriecherbauer, K. T.-R. McLaughlin, S. Venakides, X. Zhou. Commun. Pure Appl. Math. 52, 1335-1425 (1999). 7. P. Deift, X. Zhou, Ann. Math. 137, 295-368 (1993). 8. P. Deift, X. Zhou, Comm. Pure Appl. Math. 48, 277-337 (1995). 9. N. M. Ercolani, K. D. T.-R. McLaughlin, Int. Math. Res. Not. 2003, no. 14, 755-820 (2003). 10. W. Feller, An Introduction to Probability Theory and Its Applications, volume 2, Wiley, 1971. 11. M. L. Mehta, M. Gaudin, Nuclear Phys. 18, 420-427 (1960). 12. F. den Hollander, Large Deviations, Fields Institute Monographs, American Mathematical Society, 2000. 13. K. Johansson, Duke Math. J. 9 1 , 151-204 (1998). 14. M. L. Mehta, Random Matrices, 2nd Edition, Academic Press, San Diego, CA, 1991. 15. S. Albeverio, L. Pastur, M. Shcherbina, Comm. Math. Phys. 224, 271-305 (2001). 16. E. B. Saff, V. Totik, Logarithmic Potentials with External Fields, New York: Springer-Verlag, 1997.
Complete solutions to form factor equations of SU(2) invariant Thirring model ATSUSHI NAKAYASHIKI
(Kyushu U.)
We review the recent construction of solutions to the form factor equations of SU(2) invariant Thirring model.
1. Introduction The SU(2) invariant Thirring model (ITM) is the simplest massive integrable model in two dimensional quantum field theories such that their two particle 5-matrices are non-diagonal. It is also considered as a deformation of the su(2) Wess-Zumino-Witten (WZW) model at k = 1 in conformal field theory. By solving the form factor equations we find the subspace of local operators in SU(2)-ITM which has the same character as the chiral subspace of WZW model k = 1. Let V = Cv+ © Cv- be the vector representation of sl2, where v+ = *(1,0), v+ = *(0,1). The S-matrix of SU(2)-ITM is the operator S((3) : V®2 —• V®2 given by
S(J3) = So(J3)S(0), £(/?) = V ^ >
) 2" < )2"(,
W)= l-Kl
]
(1)
V 27T2
where P is the permutation operator, P(vtl ®vt2) = v€2 ® v€1. The form factors of a local operator in SU(2)-ITM give the set of meromorphic function (•F2n)£Lo> where F2n being the V®2n-valued function of the rapidities / 3 l r .. ,/32„. They satisfy the following system of equations: Pi,i+l Si,i+l(0i P2n-l,2n
- Pi+l) F2n(p1: ...,p2n)
P2n-l,2n-2
• • • Pl,2 F2„(0l
= F2n(. . . , & + ! , & , . . .),
~ 2m, ...,fon)
= ( - 1 ) " F2n{/32,
(2) . . .,02n,Pl),
(3)
2m res / 3 2 n = / 3 2 n _ 1 + ^F 2 „(/3i,..., /?2n) = [I - ( - ! ) n _ 5'2n-l,2n-2(/32n-l - Ihn-i)
F2n-2(l3i,...,(i2n_2)®(v+®v-
•'•^n-l.l^n-l ~
-v-®v+),
Pi))
(4)
where Sij ((3) acts on i-th and j - t h components of V®2n as S((3) etc. Conversely any solution (F2n)^=0 of (2), (3), (4) determines a local operator in such a way that its 2n-particle form factor is F2n [13]. Thus the space of local operators can be identified with the space of meromorphic solutions of (2), (3), (4). Solutions of n = 1 equations corresponding to several local operators were studied in [6]. Solutions (^2,1)^0 for special local operators such as the su(2)-current and the energy momentum operators were first derived in [5,12,13], Later F. Smirnov proposed a construction
372
Complete solutions to form factor equations of SU(2) invariant Thirring model
373
of a large family of solutions extending the previous constructions. On the other hand, using the vertex operator approach which was successful in solvable lattice models [1], S. Lukyanov proposed an alternative construction of a huge family of solutions [7]. In spite of these remarkable constructions it was quite difficult to determine whether their solutions are complete. One way to check this is to make a correspondence between local operators in SU(2)-ITM and WZW-model at k = 1. The latter space is described as the tensor product of the chiral and the anti-chiral subspaces. Thus it is a crucial problem to single out a subspace of solutions to the form factor equations which has the same character as the integrable highest weight representation of sl2 of k = 1. This problem is solved in [8,9]. We review it and give a conjecture on the full space of solutions.
2. Minimal form factors We consider the subspace of solutions of the form (0,... ,0,F2m,F2(m+i),...) which are called 2m-minimal. For a 2m-minimal solution F2m is called a minimal form factor. Equation (4) for a minimal form factor F2m becomes resf}2rn=02m_i+„i F2m(/3i,. • .,(32m) = 0.
(5)
We shall first determine solutions of equation (5) among solutions of equations (2), (3). The complete description of solutions of equations (2), (3) is known [10,15]. Solutions are parametrized by certain polynomials and relations of solutions are described by relations of polynomials. Thus let us concentrate on the polynomials. Let Xj = exp(/?j), Rn the ring of symmetric polynomials of Xj, 1 < j < n and
#(n) := ®nkZlRnXk, the space of polynomials of the variable X of degree at most n — 1 with the coefficients in Rn. We identify the ^-th exterior product space AeH^ with the space of anti-symmetric polynomials of X i , . . . , Xt with the coefficients in Rn by r
i
A - A l " H AsymtXJ 1 • • • * ? ) = £ sgn a X?{1) • • • Xfa *ese
.
To each P e AeH^2n^ the solution \I/p of equations (2), (3) taking the value in the s/2 highest weight vectors of V®2n with the weight 2n — 21 is constructed in the form of multidimensional g-hypergeometric integrals [10] (see the Appendix). In [8] it is proved that equation (5) for \I>p follows from the following equations for P: P±{P) : = P ( X 1 , . . . , ^ _ i , ± a ; " 1 | x1:...
,xn-2,x,
-x) = 0.
(6)
We conjecture that the converse is true [8] and in the following we assume it. Solutions of equation (6) are determined in the following way. Let e[ be the elementary symmetric polynomial,
n(i+M)=i>L B) * fc . j= l
fc=0
374
ATSUSHI NAKAYASHIKI
Define the symmetric polynomial Pr,s , 1 < r < n, s e Z, by the following recursion relations: p(2") _ _(2n) e
^1,8
~
2s-l>
p(2n) _ p(2n) _ (2n) p (2n) e ^r,* — ^V-l.s+l 2s ^r-1,1
,
I o r
„ ~ d. *•
We set
^J^ef0*2'',
^ 2 ")=^i 5 r (2 s " ) ^ 2(s - 1) , ^ 2 n ) =X^ 2 "), l < r < n ,
j=0
(7)
s= l
and, for 1 < j < n,
2€f°(*i,* a )=f^^^r^)+,r ) (^)^ ( 2 " ) (x 1 )) - v{t){X1)wfn\x,)
+ vfn){X2)wfn){X1).
(8)
We use the multi-index notations like vi — vtl A • • • A vie for I = (ii,..., Set Un,t = {Pe AeH^ | p+(P) = p_(P) = 0 } .
it).
Then Theorem 2.1. As an R2n-module, U2n,e is a free module with the following set of elements as a basis:
vfn)Aw{^A$n\ I = {i\,...,itl),
I
J = O'l, •••,3i2),
l<ji<---<3e2
K = (ku...,kh), l
£3,
+ l,
3. Form factors in chiral subspace In [14] a sufficient condition for (P2n)%Lo was found in order for (*p 2n )£L0 t o satisfy equation (4) (in the case of chargeless operators). It is extended to the charged operators in [11]. For a function f(xi,...,xn) we denote / the function obtained from / by specializing the last two variables to x, ~x:
if it has a sense. Then the condition is summarized in the following theorem. Theorem 3.1. [11] Let P2n(Xi,..., Xe2n | x i , . . . , x 2 n ) , n>m, be polynomials in X\, ..., Xe2n with the coefficients in the ring of symmetric Laurent polynomials of Xj 's such that deg Xa P < 2n for 1 < a < e2n- Set P 2 n — 0 for n < m. Suppose that (P 2 n)£L 0 satisfy the
Complete solutions to form factor equations of SU(2) invariant Thirring model
375
following conditions: there exists a set of polynomials (P2n) such that A s y m ( i ^ ) = Asym ( f[ (1 - x2X^n),
(9)
^ a=l
'
4 | x ( l n = ± » - = ±X^2n^d2nP2„-2,
(10)
d2n = ^—){-2ni)-r+^,
(11)
where Asym is the anti-symmetrization satisfies (2), (3), (4).
with respect to Xr+i,...
,Xg2n.
Then (^'p2„)^Lo
Smirnov's condition for P is actually sufficient for our present purpose. In fact, to each polynomial P2m from U2m^ we can construct P2n, m < n, satisfying (9) and (10). Let m and r be non-negative integers such that 0 < r < m. We set A = 2m — 2r and (•in =r + n — m. Define P2n = 0 for n < m. Let p\n = E ? = i x%j f° r * 7^ 0. The multiplication by p ^ - i ^s e Q u a l to the action of the local integral of motion of spin 2s — 1. We introduce the generating function of p2™-i:
^W=exP(^i2,_1pg^1 Let us define one more function n{±),
Q2n{)
s _ ilq 2 =r+l( 1 - ; l ( : ? 2 ^ 2 )
n5id-*f*) '
for n > m. For n — m the empty product in the numerator is understood as 1, that is,
Q2m{z)
-x\%{i-*t*)~hj
where hh m' := h- m'
is the complete symmetric function. They satisfy the equation P±(Q&(*))=Q£-2(Z),
(12)
for e = ± . The functions Q2n (z) were extracted from the integral formulae for the form factors of the Lukyanov's operators [7,10,11]. Take / , J, K such that they satisfy conditions in Theorem 2.1 with t\ +t2 + 2^3 = r. We set, for ra < n, m
P2n = C2Xlf(t)Y[Q(2t)(*i)v?n)*™?n)*S(Kn} t=l
fan
I I * 2 " +1+2r - 2a ,
(13)
a=r+l
where the constant c2n is given by C 2n = ( _ 1 ) l ( n - m ) ( n + m - 2 r - l ) ( ^ ( - T T J ) ) " * - " ( 2 7 r i ) ( m - " ) ( n + r ) .
(14)
376
ATSUSHI NAKAYASHIKI
„(2n) . ^,(2n) .
In (13), v\ ' A Wj
Aln
A£K
is understood as the polynomial of Xi,...,
Xr. We expand Pin
as ^2n — /
,, °2n,a,7 £ 2 j
a,7
where a = (... , a _ i , a i , a 3 , . . . ) , 7 = (7i.---.7m)Theorem 3.2. (1) The set of functions (P2n)%Lo satisfies (9), (10). (2) The following property holds for all n: *P2„,„, T 03l +6,-.-,fon
+ 0) = e x p ( ^ d e g 2 P 2 m,a, 7 ) * P 2 n , Q | 7 ( / ? l , • • • . A t a ) ,
where
deg2 P2m,a,7 = m 2 + Y^ iai + 5Z T* + de Si ( 4 2 m ) A ™ J2"0 A ^ m ) ) > and degx is the degree defined by degj Xa = — 1 and degx Xj; = 1. In [9] the following equation has been proved: D ,r> mr ( 2 m ) ( 2 m ) ( 2 m ) 1 I. (2m) #2m = ©0
>(2m) ' ' ' h2rm •
Thus any element of U2m,r can be expressed as a linear combination of P2m,a,-y's' t n a t is> ^2m,r
=
/ v ^-*2m,q,7, a,7
where the summation is taken for all 7 and a with a» = 0, i < 0. The statement (2) of the above theorem tells that the Lorentz spin of the operator corresponding to (P2n,a,7)£Lo i s de S2 P2 Consider the space generated by {P2n,a,-y)^=ot cxi = 0 (i < 0). There are polynomials P such that Vl/p vanishes identically and there are linear relations among (P2n,a,-y)^Lo'sWe consider the factor space of the space above by these relations. Introducing the degree by Lorentz spins it is possible to calculate the character of this space. It becomes the branching function of sl2 highest weight vectors in the level one integrable highest weight vacuum representation V(Ao) of sh (see [8,9] for precise statements). Non-highest weight solutions are obtained from the highest ones by the action of s^. As a whole we have the subspace of solutions which is isomorphic to ^(Ao).
4. Total space of local operators To define the chiral subspace of solutions equation (2), (3), (4) we restrict the coefficients of U2n,t to i?2n- To get the total space of solutions we have to extend coefficients from i?2n to Laurent polynomials R2n = C[xf \ . . . ,xf^]. To this end we first formulate a conjecture on the structure of symmetric Laurent polynomials. Let pM* = pf\x±% hf± = hf\x") x±x denotes (xf1,... ,xf*)- Then
and / § ? = C\p?n)± ,pfn)±,...,p%£]t
where
Complete solutions to form factor equationsof SU(2) invariant Thirring model
377
Conjecture 4.1. The following equation holds:
^ = E r r o E siTde)----e)-e+)1---e)fc=0 l e 2n )
ri,...,r„>0
We take / , J, K etc. as in the previous section and set Pln = C2nE^(t)J2ll^kIl9{2n\zi) fc=0 \€2n
f[ Q&'fc) i=fc+l
1 i=l
a=r+l
for m < n, where cin is the same as that in the previous section. If we set Pin = 0 for n <m, (Pin)n°=o satisfies equations (9) and (10). If we assume the conjecture, to each polynomial Pim from Rim ®Rim Uimie it is possible to construct Pin, m
5. Concluding remarks It is important to understand our results by representation theory. After [8,9], Jimbo et al. [2,3], extending Tarasov's results in [15], have shown that the space of solutions to (6) can be understood in terms of the representation theory of the quantum affine algebra U^j(sli) at level zero.
Appendix. Integral formula of tyP For a subset M C { 1 , 2 , . . . , n} we set V®n,
VM=VCX®---®VZH€
M = {j\
tj =
-}.
To P E A'#
IM{P):=
[et[daaUM^)WMP^r-Xe}"1,-x,X:^
JC
a=\
lla=l Hj^lV1
o=l
(15) ^aXj)
where, for M = ( m i , . . . , m e ) with mi < • • • < me, n
r(a~_ft
+ Tr<
)
and WM = Asym(#M)- The integration contour C is defined as follows. It goes from —oo to +oo, separating two sets {(3j - 2mk | 1 < j < n, k > 0} and {j3j + (2k - l)ni | 1 < j < n,k> 0}.
378
ATSUSHI NAKAYASHIKI
T h e function ^ p is given by
where ((ft) is a certain meromorphic function (see the appendix of [9]).
References 1. M. Jimbo, T. Miwa, Algebraic analysis of solvable lattice models, Conference Board of the Math. Sciences, Regional Conference Series in Math. 85, 1995. 2. M. Jimbo, T. Miwa, Y. Takeyama, "Counting minimal form factors of the restricted sine-Gordon model", arXiv:math-ph/0303059. 3. M. Jimbo, T. Miwa, E. Mukhin, Y. Takeyama, "Form factors and action of t/y^j^s/^) on oo-cycles", arXiv:math-ph/0305323. 4. A. Koubek, Nucl. Phys. B. 435, 703-734 (1995). 5. A. N. Kirillov, F. Smirnov, Phys. Lett. B 198, 506-510 (1987). 6. K. Karowski, P. Weisz, Nucl. Phys. B 139, 455-476 (1978). 7. S. Lukyanov, Comm. Math. Phys. 167, 183-226 (1995). 8. A. Nakayashiki, "Residues of g-hypergeometric integrals and characters of affine Lie algebras", arXiv:math.QA/0210168. 9. A. Nakayashiki, "The chiral space of local operators in SU(2) invariant Thirring model", arXiv: math.QA/0303192. 10. A. Nakayashiki, S. Pakuliak, V. Tarasov, Ann. Inst. Henri Poincare 71, 459-496 (1999). 11. A. Nakayashiki, Y. Takeyama, in Progr. in Math. Phys. 23, Birkhauser, 2002, pp. 357-390. 12. F. Smirnov, in Nankai Lectures on Mathematical Physics (Mo-Lin Ge and Bao-Heng Zhao, eds.), World Scentific, 1990, pp. 1-68. 13. F. Smirnov, Form factors in completely integrahle models of quantum field theories, World Scientific, Singapore, 1992. 14. F. Smirnov, Nucl. Phys. B 453, 807-824 (1995). 15. V. Tarasov, Amer. Math. Soc. Translations Ser. 2, 201, 309-321 (2000).
The uses of random partitions ANDREI OKOUNKOV
(Princeton)
These are extended notes for my talk at the ICMP 2003 in Lisbon. Our goal here is to demonstrate how natural and fundamental random partitions are from many different points of view. We discuss various natural measures on partitions, their correlation functions, limit shapes, and how they arise in applications, in particular, in the Gromov-Witten and Seiberg-Witten theory.
1. Recognizing random partitions 1.1. W h y partitions? 1.1.1. Random partitions occur in mathematics and physics in a wide variety of contexts. For example, a partition can record a state of some random growth process. More often it happens that a certain quantity of interest is expressed, explicitly or implicitly, as a sum over partitions. This can come as a result of a localization computation in geometry, or from a character expansion of a matrix integral, or from something as innocent as expanding a determinant. Typically, one can recognize in such a sum a discrete version of some random matrix integral and so one can ask whether the powerful and honed tools of the random matrix theory can be applied. The purpose of these notes is to argue that certain natural measures on partitions are not just discrete caricatures of random matrix ensembles, but are, in fact, objects of fundamental importance, with profound connections to many central themes of mathematics and physics, including, in particular, integrable systems. Therefore, I believe that it is very natural to present these views in this special session on Random Matrix Theory and Integrable Systems. 1.1.2. The wealth of applications and connections of random matrices is such that it is utterly impossible to argue that something is "just as good" in one short talk. So, instead of trying to paint the whole picture, I will give a few illustrative examples, selected according to my own limited expertise and taste. Much, much more can be found in the works cited in the bibliography. But even though I tried to make the bibliography rather extensive, it is still hopelessly far from being complete. Several topics that should be covered in any reasonable survey on random partitions are completely omitted here. These include, for example, the 2-dimensional Yang-Mills theory and character expansions in the random matrix theory. We also say nothing about the relation between random partitions and planar dimer models, even though the 2-dimensional point of view on random partitions if often illuminating, has several technical advantages, as well as some exciting connections to algebraic geometry [44].
379
380
ANDREI OKOUNKOV
1.2. Coordinates on partitions 1.2.1. Diagram of a partition By definition, a partition A is simply a monotone sequence A = (Ai > A2 > A3 > • • • > 0) of nonnegative integers such that Aj = 0 for i ^> 0. The size of A is, by definition, the number |A| = £]Aj. The standard geometric object associated to a partition is its diagram. There are several competing traditions of drawing the diagram. We will follow the one illustrated in figure 1, which portrays the partition A = (8, 5,4,2, 2,1). This way of drawing diagrams is sometimes
Figure 1. The partition (8,5,4, 2, 2,1).
referred to as the Russian one (as opposed to the older French and English traditions of drawing partitions). Its advantages are not just that the picture looks more balanced on the page and saves space by a factor of « \/2, but also that from it one can see more clearly several other useful ways of parameterizing the partitions. 1.2.2. Profile of a partition The upper boundary in figure 1 is a graph of a function f\{x) such that f'x{x) = ± 1 . This function f\ is known as the profile of the partition A. The map A H / A from partitions to functions with Lipschitz constant 1 allows one to talk about limit shape of partitions. Namely, given a sequence of probability measures on partitions, we say that it has a limit shape / if, after a suitable scaling, the corresponding measures on functions converge weakly to the 6-measure on / . 1.2.3. Partitions vs. particles Another useful way to parametrize partitions is via the map
The uses of random partitions
381
from partitions to subsets of Z + ~. The geometric meaning of the set 6(A) should be clear from figure 1 where the elements of 6(A) are shown by bold dots. The map 6 makes a random partition a random subset of Z + \ or, in other words, an ensemble of random particles on the lattice. It is these random particles that are the analogs of eigenvalues of a random matrix. A natural question here is to compute the correlation functions, that is, the probability to observe particles in specified locations. 1.3. Partitions and the fermionic Fock space 1.3.1. Subspaces instead of subsets Note that, for any partition A, the set 6(A) has as many positive elements as it has negative holes (that is, negative half-integers not in 6(A)). It can be viewed, therefore, as a finite excitation over the Dirac sea 6(0) = { - 1 / 2 , - 3 / 2 , - 5 / 2 , . . . } , in which all negative positions are filled by particles while all positive ones are vacant. Consider the vector space V with an orthonormal basis {e/t}, k e Z + ^, indexed by all possible positions of one particle. To a partition A one then associated the following vector vx = eXl_i
A e A 2 _ | A eXs_s
A•••
in the half-infinite exterior power fS^V of the 1-particle space V. In other words, one associates to a partition A the image of the subspace spanned by {e^}, k € 6(A), under the Pliicker embedding of the corresponding Grassmannian. Note that the vectors v\ are orthonormal with respect to the natural inner product, and, in particular, any vector v in their span defines a probability measure measure Ttv on partitions by
1.3.2. The action of
GL(oo)
The main advantage of trading sets for linear spaces like we just did is the following. The group GL(V) of invertible linear transformations of a finite-dimensional vector space V acts naturally in all exterior powers of V. The case of an infinite exterior power of an infinite-dimensional space requires more care (in particular, the need for normal ordering of operators arises), but it is still possible to define a projective action on /\~V of a suitable version of the group GL(V), see for example [42,60,81]. For our purposes, it suffices to define the action of the operators of the form ex, where the matrix X lies in the Lie algebra Qi(V). If X has zeros on the diagonal then the naive definition of its action on f\2 V works fine (note however, that a central extension appears, see e.g. (9)). For diagonal matrices X, we will use the formula (18) as our regularization recipe. It is the gigantic symmetry group GL(V) that makes certain computations in f\~V so pleasant (such as, for example, computations of correlation functions, see below). It also
382
ANDREI OKOUNKOV
opens up the connection to integrable systems, where, through the work of the Kyoto school and others, the space f\2 V has become one of the cornerstones of the theory. 1.4. Plancherel measure We conclude this introductory section with the discussion of the most basic measure on the set of partitions — the Plancherel measure. 1.4.1. Representation theory The general linear group and the symmetric group S(ri) are the two most important groups in mathematics and the representation theory of both groups is saturated with partitions. In particular, over a field of characteristic zero (we will not come across any other fields in these notes), irreducible representations of S(n) are indexed by partitions A of size n. Let dim A denote the dimension of the corresponding representation. It follows from a theorem of Burnside that the formula *t P l a n c h (A) = ^
^
(2)
defines a probability measure on the set of partitions of n. This measure is known as the Plancherel measure, because of its relation to the Fourier transform on the group S(n). Often, it is more convenient to consider a related measure on the set of all partitions defined by 9KPp(A) = e - ^ l ( ^ )
2
,
£>0.
(3)
It is known as the poissonized Plancherel measure, the number £ > 0 being the parameter of the poissonization. 1.4.2. Plancherel and G U E While representation theory provides an important motivation for the study of the Plancherel measure, the representation-theoretic definition of it may sound like something very distant from the "real world" until one recognizes, through other interpretations of the number dim A, that one is dealing here with a distinguished discretization of the GUE ensemble from the random matrix theory. S. Kerov (see, for example his book [50]) and K. Johansson [38] were among the first to recognize this connection. A pedestrian way to see the connection to the GUE ensemble is to use the following formula
*•••»-no,+l-oi n ^ - ^ + j - o . i<j
<«>
where k is any number such that Afc+i = 0. The first factor in (4) is roughly a multinomial coefficient and hence a discrete analog of the Gaussian weight, while the second factor looks like Vandermonde determinant in the variables ©(A). This makes (dim A)2 resemble the radial part of the GUE measure given, up to a constant factor, by the weight
e-i^
fa"**)2.
J] i<j
the particles 6(A) playing the role of the eigenvalues
{XJ}.
(5)
The uses of random partitions
383
1.4.3. Plancherel measure and random growth Another interpretation of dim A is the following: it is the number of ways to grow the diagram of A from the empty diagram 0 by adding a square at a time, while maintaining a partition at every step. For example, dim(2,2) = 2 corresponds to the two possible growth histories shown in figure 2. In representation theory, this interpretation of dim A is a consequence
Figure 2. The two ways to grow the partition (2,2).
of the branching rule for the restriction S(n) j S(n — 1). It links Plancherel measure with the Robinson-Schensted algorithm (see, for example, [12, 84]) and many related growth processes. In particular, by a theorem of Schensted, the distribution of Ai with respect to 9Jtpianch is precisely the distribution of the longest increasing subsequence in a uniformly random permutation of { 1 , . . . , n } . The understanding of this distribution was a major stimulus for the study of Plancherel measure, culminating in the work of J. Baik, P. Deift, and K. Johansson [3]. They proved that the scaled and centered distribution of Ai converges to the Tracy-Widom distribution [85] which describes the maximal eigenvalue of a large random Hermitian matrix. They also proved a similar statement for A2 and conjectured that, more generally, the joint distribution of {Ai, A2,...}, scaled and centered, converges to the Airy ensemble which describes the behavior of the 1st, 2nd, and so on eigenvalues of a large random matrix. This conjecture was established in [8,39,68], see more in Section 4.2.2 below. Plancherel measure also arises in more general growth processes, where both adding and removing a square is allowed, see below. This is a discrete analog of how one finds the GUE distribution in the context of non-intersecting Brownian motions, see for example [40]. 1.4.4. Operator form of partition growth Consider the following elements of the Lie algebra gl(V) an-ek
= ek-n-
From definitions, one finds that in the basis v\ the operator a_i acts as follows
(6)
384
ANDREI OKOUNKOV
where the summation is over all possibilities to add a square to the partition A. Exponentiating (7) and comparing it to (1), we conclude that Wlpp = 9Jlv, where v = exp ( v ^ a - i j • U0.
(8)
Similarly, the adjoint operator a\ = a'!_1 removes a square from partitions. Prom the basic commutation relation [an,am]=n5n+m (9) satisfied by the operators an in the (projective) representation /S^V, one sees that mixing adding squares with removing squares leads again to the Plancherel measure. The formula (8) leads to a simple computation of the correlation function of the Plancherel measure, see below. 2. R a n d o m p a r t i t i o n s in G r o m o v - W i t t e n t h e o r y 2.1. Random matrices and moduli of curves 2.1.1. Matrix integrals in 2D quantum gravity The Wick formula expansion of the following Gaussian integral over the space of N x N Hermitian matrices H [e-itrH2
J
f\tTHk*dH
i=i
= const /
e"* *>? T T f c - xjff]
J*"
?<j
IY
ti \U
xk> | dx
(10)
J
is well-known to enumerate different ways to glue an orientable surfaces of a given genus g from a fci-gon, fc2-gon, etc., see for example [94] for an elementary introduction. In other words, the integral (10) can be written as a certain sum over discretized surfaces, also known as maps on surfaces, of the kind shown in figure 3. This sum over surfaces can, in turn, be viewed as a Riemann sum for a certain vaguely defined "integral" over the infinitedimensional space of all metrics on a genus g surface. In 2-dimensional quantum gravity,
Figure 3. A cubist's view of a discretized torus.
The uses of random partitions
385
one wants to integrate over this space of metrics; this is the reason why integrals (10) are studied there, see, in particular, [11,20,21,31,32] and [15,16] for a survey. More precisely, the relevant N —» oo asymptotic regime is when the number n of pieces forming the surface grows, while the shapes fc; of individual pieces remain bounded. For example, one can look at surfaces composed out of a large number of squares and hexagons. The corresponding (hard) mathematical problem is the N —> oo asymptotics of the integral fe-itrH>+trP(H)dHj
( n )
where P{H) is a polynomial of H the coefficients of which depend on AT in a certain critical fashion. 2.1.2. Witten's conjecture It was conjectured by Witten [93] that this discretized "integration" over the space of metrics is equivalent to certain topologically defined integrals over the (finite-dimensional) space of just the conformal classes of smooth metrics. Conformal, or, equivalently, complex structures on a genus g surface form a finite-dimensional moduli space A4g of dimension 3g — 3 for g > 1. The integrals in Witten's conjecture are integrals of Chern classes of certain natural line bundles and, as a result, they have a topological interpretation as intersection numbers on a certain distinguished compactification M.g D M.g. This compactification Mg, constructed by Deligne and Mumford, is obtained by allowing nodal degenerations (of the kind seen on the left in figure 4) satisfying certain stability conditions, see, for example, [35]. Similarly, the moduli space M.g
6
H2(Mg,n).
Witten's conjecture (first proved in [52]) was that the natural generation function for the intersection numbers {n1---Tkn)=
f_
ci(£1)fel...ci(£n)S
T ) * i = 3 s - 3 + n,
(12)
JMg.n
is precisely the r-function of the KdV hierarchy that emerged from the study of the matrix model of the 2D quantum gravity. 2.1.3. Edge of the spectrum and moduli of curves The same r-function also arises in a mathematically much simpler asymptotic regime of the integral (10), namely the one in which the fcj's go to infinity simultaneously with N —> oo,
386
ANDREI OKOUNKOV
while their number n remains bounded. It is clear that the largest eigenvalues of H dominate this asymptotics. Since the distribution of eigenvalues of H near the edge of its spectrum is well known to converge to the Airy ensemble, this limit is very easy to control. Somewhat surprisingly, it is this simpler matrix model that has a natural geometric relation to Mgtn, as discussed in detail in [72] and also, e.g., in [70], leading, in particular, to another, and in several respects more satisfying, proof of Witten's conjecture. 2.2. Gromov-Witten theory of P 1 One explanation of why the two very different asymptotic regimes lead to the same result is the following. The intersection theory of Mg,n embeds, in fact in many different ways, into a certain richer geometric theory which, as it turns out, can be described by the Plancherel measure analog of (10) on a finite level, that is, without taking any limits. 2.2.1. Stable maps and G W invariants This richer theory is the Gromov-Witten theory of the Riemann sphere P 1 , which is defined using intersection theory on the moduli space J M fl , n (P 1 , d) of stable maps to P 1 , see e.g. [28]. By definition, a point in the space of stable maps is described by the data f:(C,plt...,pn)-+F1,
(13)
where / is a degree d holomorphic map whose domain C is a possibly nodal curve of genus g with smooth marked points pi £ C, see figure 4. Here, again, possible degenerations of
/
Figure 4. A schematic view of a boundary element in ^ 3 , 5 (P 1 ).
the domain C are limited by a certain stability condition. An open (but not dense) subset Mg,n(F1,d)cMg,n(F1,d) is formed by maps with smooth domains C. In this case / represents C as a Riemann surface of an algebraic function of degree d. Generically, / has only nondegenerate critical points, the number of which equals 2d + 2g — 2 by Riemann-Hurwitz. The corresponding critical values together with the n images f(pi) £ P 1 of the marked points give convenient
The uses of random partitions
387
local coordinates on a neighborhood of such map / . The number 2d + 2g — 2 + n is known as the (complex) expected dimension of Mg^^^d). The whole . M g ^ P 1 , ^ ) is not so nice, being reducible with components of different dimensions. One defines, however, a distinguished homology class {Mg,n(F\d)]viT
£H4M9,n(F\d))
(14)
of the expected dimension, known as the virtual fundamental class. Integration against (14) replaces in Gromov-Witten theory integration over fundamental class in (12). The most fundamental part of the Gromov-Witten theory of P 1 is its stationary sector, obtained by pinning down the images of the marked points by requiring /(pi) = qi, where qi € P 1 are arbitrary distinct points. The stationary GW invariants of P 1 are, by definition, the following numbers (rfcl(pt)...rfen(pt))d= / c1(C1)k*...c1(Cn)k", V Jp*9,n(P\d)} "r\{f(Pi) = <)ih=l...n
(15)
where the classes c\(Ci) are defined as before. Note that the genus g is uniquely determined by the dimension constraint
^2 ki = 2d + 2g - 2 and, therefore, is omitted in the left-hand side of (15). 2.2.2. Plancherel measure and G W invariants of P 1 In order to write down the Plancherel measure analog of the matrix integral (10) we need the partition analog of the function N
trHk = J2^i-
(16)
i=l
It is given by the following function
pfe(A) = £ [(A< - ' + £ ) - H + 1 ) J + C1 - 2 _ f e ) c(-fc)
(17)
i
"=" £ xk, xee(x) where the first line the C-regularization of the direct, but divergent, generalization of (16) written in second line. Equivalently, -z + £ P f e ( A ) z f e = J V < A ' - i + * > = (£(z) • vx,vx), fc
(18)
i
where £(z) is the following diagonal matrix in gl(V) £(z) • ek = ezk ek. We are now ready to state the following result from [73].
(19)
388
ANDREI OKOUNKOV
Theorem 2.1.
(rfel(pt) ... rfcn(pt))d = n(jfc.1+1), E ( ^ r ) I I P ^ + i W -
( 2 °)
As explained above, the right-hand side of (20) is a direct analog of the integral (10) for partitions of finite size d, where d is the degree of GW invariant on the left. In particular, this formula is highly nontrivial already for d = 0, reproducing a result of [27]. There is only one partition of 0, namely the empty partition 0 and Pfe(0) equals the £-term in (17), illustrating the naturality of the C-regularization. Theorem 2.1 continues to hold for stationary GW invariants of any smooth curve X, with the following modification: /dimA\
/dimA\ ~
V d\ )
\ d\
where g(X) is the genus of X. In particular, genus 1 targets lead to the uniform measure on partitions. 2.2.3. Operator form of the G W theory We reproduced here the formula (20) to emphasize the role of the Plancherel measure in the GW theory. It is, however, very much not the final answer in the theory. Using (18) and (8), one rewrites (20) as follows
E ( i K ^ n ^ 1 = j a p <«? n w^-x).
(2i)
where the angle brackets on the right denote the vacuum expectation (M> = (Mu0,w 0 ) ) of an operator M acting on /\2 V. Using the commutation relations between the operators an and £ (z), one evaluates (21) in closed form in terms of trigonometric functions [73]. For example, the 1-point function has an especially simple form
E<^-2Wpt)>^29 = ^ r « % ) M - \
S(z)=
sinh z/2 z/2
(22)
where the superscript ° denotes the connected GW invariant. In the case when the target X is an elliptic curve E, the vacuum matrix element is replaced by trace 1 Lo f 23
E ^ E C T K ^ I K ^ =tro9 n ^)'
d>0
ki
where LQ is the energy operator defined by Lo-v\
=
\X\vx,
( )
The uses of random partitions
389
and tr 0 denotes the trace in the zero charge subspace (spanned by the vectors v\). The sum (23) was computed in [7] in terms of genus 1 theta functions with modular parameter q, see also [69]. From the operator interpretation, one derives Toda equations. These equations were conjectured in [22,23], see also [29,30,76], and translate into effective recurrence relations for the GW invariants of P 1 . The integrable structure of the GW theory fully unfolds in the equivariant GW theory of P 1 , which is described by the 2-dimensional Toda hierarchy of Ueno and Takasaki [88]. The 2D Toda hierarchy is derived from the operator solution of the theory in [74]. The description of the nonstationary sector of the theory is completed in [75].
3. More random partitions from geometry 3.1. Hurwitz theory The Gromov-Witten theory of target curves is closely related to the much older and much more elementary Hurwitz theory [36] that concerns enumeration of degree d branched covers
of a smooth curve X with specified ramifications. What this means is: we require / to be unramified outside some fixed set of points {qi} £ X and for each point Q, we specify the conjugacy class in the symmetric group S(d) of the monodromy of the branched cover / around qi. In other words, for every qi we specify a partition 77W of the number d. By cutting the base X\{gj} (and hence the cover C) into simply-connected pieces, one can see many connections between the Hurwitz theory and enumeration of maps on C discussed in Section 2.1.1. A classical formula of Burnside [13,41] tells us that the number of such covers, automorphism-weighted and possibly disconnected, equals /,.
xX2-2g(X)
£ (^)
IK«>(A),
(24)
where g(X) is the genus of X and ^(A) is the central character of the representation A, that is, the unique eigenvalue of the matrix by which the conjugacy class 77 acts in the irreducible representation A. By a theorem of Kerov and Olshanski [51] ^eA*=0[pi,pa,p3>...],
(25)
which shows that from the random partitions point of view there is no real difference between (20) and (24). This is a manifestation of the Gromov- Witten/Hurwitz correspondence, established in [73]. 3.2. Uniform measure and ergodic theory Note from (24) that enumeration of degree d branched covering of the torus is related to the uniform measure on partitions of d. The large d asymptotics in this problem is interesting,
390
ANDREI OKOUNKOV
because, on the geometric side, it computes the volumes of moduli spaces of pairs (C,w) where C is a smooth curve and u> a holomorphic differential on C with given multiplicities of zeros [26]. This is because points of the form (C, f*(dz)), where / : C -» C / Z 2 is a branched covering of a standard torus, play the role of lattice points in this moduli space. The moduli spaces of holomorphic differentials and, in particular, their volumes are important in ergodic theory, for example, for the study of billiards in rational polygons [25]. The exact evaluation of the sum (23) and the modular properties of the answer were of great help for the asymptotic analysis performed in [26]. The modular transformation exchanges the q —> 1 limit with the q —> 0 limit in (23), which means relating large partitions to small partitions. This is an example of a mirror phenomenon, see for example the discussion in [18]. 3.3. Random partitions from localization A constant source of partition sums in geometry is equivariant localization [2]. Partitions index fixed points of the torus action on the Grassmann varieties (in the Pliicker embedding, these are precisely the vectors v\). They also index fixed points of the torus action on Hilb„(C 2 ), the Hilbert scheme of n points in the plane C 2 , see e.g. [33,63]. 3.3.1. Hilbert scheme of points in the plane By definition, a point in Hilb„(C 2 ) is an ideal / G C[x,y] of codimension n as a linear subspace, such as, for example, the space of polynomials vanishing at n given distinct points in the plane. The torus ( C x ) 2 acts on Hilb n (C 2 ) by dilating the coordinates x and y. Its fixed points are the monomial ideals, that is, ideals spanned by monomials x%yi. Monomial ideals I are naturally indexed by partitions of n, namely, the set {(i, j), xly:> £ 1} is, essentially, a diagram of a partition. In equivariant localization, the contribution of an isolated fixed point / appears with a weight which the reciprocal of the product of the weights of the torus action on the tangent space Ti at / . Let t\ and —e2 be the weights of the torus action on C[x,y]. Then, up to a sign, the natural measure on partitions that arises is the following Jack polynomials deformation 9Jtjack of the Plancherel measure. 3.3.2. Jack deformation of the Plancherel measure Recall the hook-length formula dim A = n
M0)-1'
(26)
where the product is over all squares • in the diagram of A and h(D) is the length of the hook of the square • , see figure 5.
The uses of random partitions
Figure 5. The hook, arm, and leg of a square.
We have h(D) = 1 + a(D) + /(D), where o(D) and /(D) are the arm- and leg-length of • , indicated by different shades of gray in figure 5. In the deformed measure SOT jack one takes arm- and leg-length with different weights. Concretely, one sets
9KJack(A) = I I -,
^T
T.
(27)
atx ((1 + o(D)) Cl + /(D) Ca ) (o(D)ci + (1 + J(D))e 2 ) Up to an overall factor, 9Jtjack clearly depends only on the ratio ei/e2- To make 93tjack a probability measure on partitions of d, one has to multiply it by d\ (ei€2)d. The measure 9Hjack should be viewed as the general (3 analog of the Plancherel measure with (3 = 2ei/e2. Recall that in random matrix theory by ensembles with general /3 one means the generalization of the measure (5) in which Vandermonde squared is replaced by Y\ \xi — Xj\@. Like in the random matrix theory, the measure 9ttjack shares some features of Plancherel measure and lacks others. Most importantly, the free fermion interpretation is lacking, making, for example, the computation of the correlation functions of 9Jljack a difficult open problem. We will see this measure again in Section 5. Also note that the symmetry /? — i » 4//J, of which there are some instances in the random matrix theory, is manifest for the discrete measure 9Jljack-
4. Schur measure 4.1. Definition and correlation functions 4.1.1. Schur functions A generalization of the Plancherel measure, which in many ways resembles placing a random matrix into an arbitrary potential, is defined as follows. Let £1,^2, • • • be parameters. The polynomials s\(t) = I exp I ] P tn a-n J • V0, v\ \
\n>0
/
/
indexed by partitions A, are known as the Schur functions. Upon the substitution tk =
-tvHk,
392
ANDREI OKOUNKOV
the polynomial s\ becomes the trace of a matrix H € GL{n) in the irreducible representation of GL(n) with highest weight A. As a generalization of (8), we introduce the Schur measure on partitions by the following formula
artschur(A) = •!«*(*)«*(*)•
(28)
Here t and i are two independent sets of variables. Choosing one set to be the complex conjugate of the other is sufficient to guarantee that 9Kschur(A) > 0. This positivity, however, is largely irrelevant for what follows. The normalizing factor Z in (28) is given by the Cauchy identity Z = ^sx(t)sx(t)=exp(j2kt^k)x Vk t o
Applying the operator X^fe>o*fc^*fc respect to the Schur measure equals
(^9)
w esee
(29) )
that the expected size (|A|) of A with
<|A|) = 5>2*fc*fc. k
4.1.2. Schur measure and relative G W theory Schur measure naturally arises, for example, in the relative Gromov-Witten theory of C x = P 1 \ {0, oo}, see [73]. This relative Gromov-Witten theory is certain hybrid of GW and Hurwitz theory, in which one insist that the map / : C —> P 1 has given ramification over 0, oo £ P 1 (but what precisely having given ramifications means when C is not smooth is too technical to describe here). Informally, these two ramifications can be thought as inand out-going states of an interaction described by the "worldsheet" C. The variables t and t are the generating function variables coupled to the cycles of the in- and out-going ramifications. 4.1.3. Correlation functions of the Schur measure A remarkable property of the Schur measures is that it is possible to compute their correlations in closed form. Like for random matrices, the n-point correlations are given by n x n determinants with a certain correlation kernel K. Unlike random matrices, the kernel K does not involve any delicate objects like polynomials orthogonal with an arbitrary weight [14,59]. On the contrary, the kernel K has a simple integral representation in terms of the parameters t and t, which is particularly suited for the steepest descent asymptotic analysis. Introduce the function T(z) by T(z) =
YltnZn-YltnZ-nfc>0 fc>0
Let us assume that it converges in some neighborhood of the unit circle (the case when T(z) is a polynomial is already interesting enough). The correlation function of the Schur measure were computed in [69] in the following form
The uses of random partitions
393
Theorem 4.1. For any X c Z + | we have Prob OTsohur {X C 6(A)} = det [ K f o . a ; , - ) ] ^ ^ ,
(30)
where the kernel K is given by
„,
,
1
T w) eeT(z)-T( M-TM
ff
dzdw dzdw
l\w\<\z\
The proof of this formula uses the algebra of the infinite wedge representation A 2 V. In the same spirit, one shows that the for any fixed set X the sequence correlation functions r*(M~) = Prob {X + nC 6(A)}, where X + n denotes the translation of the set X by n lattice spacings, is a sequence of r-function for the Ueno-Takasaki 2-dimensional Toda hierarchy with respect to the two sets of (higher) times t and t. 4.2. Asymptotics and limit shapes 4.2.1. Asymptotics of the correlation functions Our goal now is to explain how convenient is the representation (31) for the steepest descent analysis. For simplicity, let us assume that the variables t are complex conjugate of t. The interesting asymptotic regime is when all variables t grow at the same rate, that is, when T(z) = MP(z), where M —» oo and P(z) is fixed. As we will see, this implies that the typical partition is of length O(M) in both directions, and hence contains 0(M2) squares. We will investigate K {x 1,2:2) in this limit assuming that x~ —j—>x,
x\ - X2 —> Arc.
(32)
The number x describes our global position on the limit shape; the number Ax is the relative local displacement. The exponentially large term in the integral (31) is eM^s^~s^\ where S{z) =P(z)-x
log z.
(33)
By our hypothesis, S(z) is purely imaginary on the unit circle \z\ = 1 and hence the integrand is rapidly oscillating there. We want to shift the contour of integration in z (resp. w) off the unit circle in the direction of ^ grad %IS, so that to make the integrand exponentially small. This direction is given by the sign of g{4>)-x,
where
g(
z—P(z)
(34)
The function g is a real-valued analytic function on the circle, and the set of points where g > x is a finite union of intervals {
(35)
394
ANDREI OKOUNKOV
Pi(sy
1(2)
a2(z)
Figure 6. Construction of the intervals
ft(»)
2TT
[ai(x),/3i(x)].
see figure 6. The set (35) varies from whole circle to the empty set as x varies from the minimal to the maximal value of the function g. These extreme values mark the edges of the limit shape. The integration in (31) is along two nested circles. When we deform the z and w contours in the direction of =F5I(^) as in figure 7, we pick up the residue of the integrand at z = w whenever we push the ^-contour inside the the w-contour. At z = w, most of the factors in
Figure 7. Deformation of the integration contours
the integrand cancel out, so we are left with
(2ni) fti)^
(36) J J\w\<\z\
(27ri)^ J J deformed contour
By the basics of the steepest descent method, the first summand in the right-hand side of (36) goes to 0 as M —* 00. We obtain Theorem 4.2. As M —> 00, we have •y
K(xi,:E2)
te-ai(x)Ax
_
2irAx
e-0i(x)Ax\
(37)
The uses of random partitions
395
Note that (37) is a multi-frequency generalization of the discrete sine kernel (40). In particular, the 1-point function, which determines the density of particles and, hence, the limit shape, satisfies Prob {x G 6(A)} = K(x, x) ->
E
*
IA(
*J ~ " ^
.
(38)
This density decreases from 1 to 0 as x varies in [ming,ma,xg] and, hence, these numbers indeed mark the boundary of the limit shape. Note that the the density (38) is monotone and hence the limit shape is always convex. Also note that the random process defined by the correlation kernel (37) is translation invariant. 4.2.2. Example: Plancherel measure The Schur measure specializes to the measure fUtpp when *.*= ( v ^ , 0 , 0 , . . . ) . Plugging this into (31) leads to the discrete Bessel kernel defined by
KB—(*,„;*) = ^ =
—
JJM<M
zX+, w_y+i
Jx-iJy+i-Jx+>Jy^ x-y
where Jn — Jn (2-\/f) is the Bessel function of order n. This formula is a limit case of the main result of [10]. It appears as stated in [8] and [39]. We have M = y/£ and g{tj>) = 2cos(>). Therefore, 0i(x) = —ati(x) = arccos
(x/2).
Integrating the density ^ arccos | , one arrives at the at the limit shape for the Plancherel measure first obtained by Vershik and Kerov [90] and Logan and Shepp [54]. In the bulk x € (—2,2) of this limit shape, we have the (unsealed) convergence of 6(A) to the discrete sine-kernel ensemble with the correlation kernel „ , % sino(x — y) KsinK y, a) = — T ^ r-^ , ir(x — y)
,_ , , a = arccos (x/2). \ i
i
,,„. (40)
In the continuous situation, the parameter a in (40) can be scaled away, but on a lattice it remains a nontrivial parameter. In fact, a finer analysis performed in [8] shows that the we have the same convergence to the discrete sine kernel for the Plancherel measure partitions of fixed size n as n —• co. Near the edge of limit shape, the random process 6(A) converges, after a suitable scaling to the Airy ensemble. In our setup, this can be seen by analyzing the previously discarded first summand in the right-hand side of (36). On the edge of the limit shape, x is a critical value of g and, hence, the corresponding critical point of the action (33) is degenerate. The
396
ANDREI OKOUNKOV
Airy kernel appears effortlessly in the asymptotics (see, for example, [71] for a pedagogical exposition). The depoissonization analysis performed in [8] and [39] shows that the same Airy ensemble asymptotics remains valid for Plancherel measure on partition of fixed size n as n —> oo. This is precisely the statement of a conjecture of Baik, Deift, and Johansson, first established in [68] by different means.
5. Random partitions in Seiberg-Witten theory 5.1. Gauge theory partition function 5.1.1. Plancherel measure in a periodic potential Fix a period N and let u\,u3,...,uN_\ be such that Yluk = 0. Consider the period N periodic potential U on the lattice 1>+\ defined by U(x) = u xm odiV. Define the energy of the particle configuration ©(A) in the potential U by U(X)=
£ xee(A),
u(x).
(41)
X>-MN
This does not depend on the cut-off M as long as M is a sufficiently large integer. We now define the (poissonized) Plancherel measure in the periodic potential U by flMA) = *W exp ( 1 tf(A)) ( i x f r ) 2 -
(42)
The properties of this measure are in many ways parallel to the theory of periodically weighted planar dimers, developed in [44]. 5.1.2. Instantons and Seiberg-Witten prepotential In (42), we dropped the normalization factor e _ ? present in (3) because the understanding of the partition function for 9Jt(y is, anyway, the main problem in the theory. This is because this partition function, as shown in Section 5 of [67], is essentially the Fourier transform of the N — 2 pure supersymmetric SU(N)-gauge theory partition functions, as computed by Nekrasov in [65] via instanton calculus. For mathematicians, this gauge theory partition function is a generating function for certain integrals over the moduli spaces of instantons. These are topologically defined finitedimensional integrals, so there is a certain similarity in spirit with Witten's formulation of the 2D quantum gravity, discussed in Section 2.1.2. The actual geometry of the instanton moduli spaces seems somewhat more accessible than the geometry of Mg,n- The equivariant localization approach to these instanton integrals, initiated in [56,57] and completed in [65], is a far reaching generalization of what we saw in Section 3.3. In particular, partition sums appear as sums over fixed points. The main expected feature of this partition function is that its quasiclassical £ —» o o ,
h —> 0 ,
The uses of random partitions
397
asymptotics should be described by the Seiberg- Witten prepotential [82,83]. This was indeed demonstrated in [67] and this is where the large random partitions come in. I refer to [66] for more on the geometrical and physical side of this computation; here we will focus on purely the random partition aspect of it. An introduction to the Seiberg-Witten theory for a mathematical audience can be found in [19]. It also contains many further references. A different, non-asymptotic, approach to the analysis of the partition function can be found in [64]. The case of the pure gauge theory is just the beginning of the Seiberg-Witten theory. Various theories with matter lead to related measures on partitions. They are also considered in [67]. 5,2. Asymptotics 5.2.1. Quasiclassical scaling and measure concentration Let £ be very large. Then £lAl/|A|! has a sharp peak around |A| « £. For |A| « £, the weight dim A can be approximately computed as follows. Let / be the profile f\(x) scaled in both directions by y/\X\, so that to make the area of the scaled diagram equal to 1. By the results of [54,90,92]
kg*|A|(^)a~-|AW>.
(«)
where the functional E(f) is defined by E(f) = 2f[
(l + f'(s))(l-f(t))log2(t-s)dsdt.
(44)
J Js
A direct argument shows that, with the same scaling,
\U(\)~^
JMf'Wdt,
(45)
where o~u{x) is a convex continuous function on [—1,1], which is linear on the segments [ — 1 + ^ , —1 + JV ] , i = 0,...,N — 1, such that the set {uk}, sorted in the decreasing order, is the set of slopes of cry. The function o\j has the meaning of surface tension. An example of au can be seen in figure 8. Note that the surface tension ay is invariant under permutations of the Uj's, so in what follows we will assume that til > t l 3 2
2
> • • • > UN_1 "
. 2
We see that (43) and (45) have the same scale when £ ~ const ti~2 ~ |A|. In this asymptotic regime, the measure 9Jt[/ concentrates on the maximizer of the functional S{f) = -E(f)
+ const J
(46)
This functional is strictly concave and has a unique global maximizer /*. By construction, the value S(f*) of the functional (46) at its maximizer / * dominates the partition function and hence we expect to identify it with the Seiberg-Witten prepotential.
398
ANDREI OKOUNKOV
- 3 - 2 - 1 0 1 2 3 4 Figure 8. The surface tension for {u^} = {4,1, —2, —3}.
Note that the surface tension
Figure 9. The conformal map used in the construction of the maximizer (here N = 3).
map from the upper half-plane to the domain A, sending infinity to infinity. The map $ has a certain natural normalization at infinity, which fixes it uniquely. Note that K naturally decomposes into segments of two kinds: those (called bands) mapped by $ to the horizontal parts of dA, and those (called gaps) mapped to the vertical parts of dA. The slit-lengths of A are found from the Ui's by fixing certain period of the differential dS = zd$(z).
(47)
The uses of random partitions
399
Namely, the integral of dS along the fcth gap should equal uk_i — uk+1 up to a multiplicative combination of universal constants such as i and n. The differential (47) will turn out to be precisely the Seiberg- Witten differential. We now have the following result from [67] Theorem 5.1. The maximizer / * is obtained from the map $ by the following formula ^f*(x)=m(x
+ iO),
(48)
where $(x + iO) denotes the natural extension of the map $ to the boundary R of the upper half-plane. It is clear from (48) that on the gaps ^f*(x) is constant, which means that the gaps give the facets of the limit shape. An example of the limit shape is plotted in figure 10.
Figure 10. An N — 3 example of the limit shape.
5.2.3. The Seiberg-Witten family of curves The conformal map $ can be written down explicitly, namely it is a linear function of log w, where w = w(z) solves the equation + - = zN + 0 • zN~l + ••• . (49) w Here dots stand for a certain polynomial of degree N — 2 in z which is to be found from the slit-lengths and, ultimately, from the «;'s. The equation (49) defines an (N — l)-dimensional family of hyperelliptic curves of genus N — 1. These curves are known as the Seiberg- Witten curves. They also arise as spectral curves in the periodic Toda chain (there is, in fact, a natural connection between periodically weighted Plancherel measure and periodic Toda chain). The N — 1 gap-periods of the differential (47) can be taken as local coordinates on the family (49). The main feature of the Seiberg-Witten geometry is that the dual band-periods of (47) turn out to be the dual variables for the Legendre transform of the prepotential S(f*). This follows from Theorem 5.1 by a direct simple computation, see [67], and completes the derivation of the Seiberg-Witten geometry. w
400
ANDREI OKOUNKOV
Acknowledgments I am grateful to Percy Deift and Michio Jimbo for the invitation to Lisbon and to the NSF (grant DMS-0096246) and Packard foundation for financial support. References 1. M. Adler, P. van Moerbeke, "Integrals over classical groups, random permutations, Toda and Toeplitz lattices", Comm. Pure Appl. Math. 54, 153-205 (2001). 2. M. Atiyah, R. Bott, "The moment map and equivariant cohomology", Topology 23,1-28 (1984). 3. J. Baik, P. Deift, K. Johansson, "On the distribution of the length of the longest increasing subsequence of random permutations", Journal of A MS 12, 1119-1178 (1999). 4. J. Baik, P. Deift, E. Rains, "A Predholm determinant identity and the convergence of moments for random Young tableaux", Comm. Math. Phys. 223, 627-672 (2001). 5. J. Baik, E. Rains, "The asymptotics of monotone subsequences of involutions", Duke Math. J. 109, 205-281 (2001). 6. P. Biane, "Representations of symmetric groups and free probability", Adv. Math. 138, 126-181 (1998). 7. S. Bloch, A. Okounkov, "The character of the infinite wedge representation", Adv. Math. 149, 1-60 (2000). 8. A. Borodin, A. Okounkov, G. Olshanski, "Asymptotics of the Plancherel measures for symmetric groups", J. Amer. Math. Soc. 13, 481-515 (2000). 9. A. Borodin, G. Olshanski, "Point processes and the infinite symmetric group", Math. Res. Lett. 5, 799-816 (1998). 10. A. Borodin, G. Olshanski, "Distribution on partitions, point processes, and the hypergeometric kernel", Comm. Math. Phys. 211, 335-358 (2000). 11. E. Brezin, V. Kazakov, "Exactly solvable field theories of closed strings", Phys. Let. B 236, 144-150 (1990). 12. T. Britz, S. Fomin, "Finite posets and Ferrers shapes", Adv. Math. 158, 86-127 (2001). 13. W. Burnside, Theory of groups of finite order, 2nd edition, Cambridge University Press, 1911. 14. P. Deift, Orthogonal Polynomials and Random Matrices: A Riemann-Hilbert Approach, AMS, 2000. 15. P. Di Francesco, "2-d quantum gravities and topological gravities, matrix models, and integrable differential systems", in The Painleve property: one century later, (R. Conte, ed.), 229-286, Springer: New York, 1999. 16. P. Di Francesco, P. Ginzparg, J. Zinn-Justin, "2D Quantum gravity and random matrix models", Phys. Rep. 254, 1-131 (1995). 17. P. Di Francesco, C. Itzykson, J. B. Zuber, "Polynomial averages in the Kontsevich model", Comm. Math. Phys. 151, 193-219 (1993). 18. R. Dijkgraaf, "Mirror symmetry and elliptic curves", The Moduli Space of Curves, R. Dijkgraaf, C. Faber, G. van der Geer (editors), Progress in Mathematics 129, Birkhauser, 1995. 19. R. Donagi, "Seiberg-Witten integrable systems", Algebraic geometry—Santa Cruz 1995, 3-43; Proc. Sympos. Pure Math. 62, Part 2, Amer. Math. Soc, Providence, RI, 1997. 20. M. Douglas, "Strings in less than one dimension and generalized KP hierarchies", Phys. Let. B 238 (1990). 21. M. Douglas, S. Schenker, "Strings in less than one dimension", Nucl. Phys. B 335 (1990). 22. T. Eguchi, K. Hori, S.-K. Yang, "Topological a models and large-AT matrix integral", Internat. J. Modern Phys. A 10, 4203-4224 (1995). 23. T. Eguchi, S.-K. Yang, "The topological CP1 model and the large-AT matrix integral", Mod. Phys. Lett. A 9, 2893-2902 (1994). 24. T. Ekedahl, S. Lando, M. Shapiro, A. Vainshtein, "Hurwitz numbers and intersections on moduli spaces of curves", Invent. Math. 146, 297-327 (2001).
The uses of random partitions
401
25. A. Eskin, H. Masur, A. Zorich, "Moduli spaces of Abelian differentials: the principal boundary, counting problems and the Siegel-Veech constants", arXiv:math.DS/0202134 . 26. A. Eskin, A. Okounkov, "Asymptotics of numbers of branched coverings of a torus and volumes of moduli spaces of holomorphic differentials", Invent. Math. 145, 59-103 (2001). 27. C. Faber, R. Pandharipande, "Hodge integrals and Gromov-Witten theory", Invent. Math. 139, 173-199 (2000). 28. W. Fulton, R. Pandharipande, "Notes on stable maps and quantum cohomology", in Proceedings of symposia in pure mathematics: Algebraic Geometry Santa Cruz 1995, Volume 62, Part 2 (J. Kollar, R. Lazarsfeld, D. Morrison, eds.), AMS: Rhode Island, 1997, pp. 45-96. 29. E. Getzler, "The Toda conjecture", Symplectic geometry and mirror symmetry (Seoul, 2000), World Sci. Publishing, River Edge, NJ, 2001, pp. 51-79; arXiv:math. AG/0108108 . 30. E. Getzler, "The equivariant Toda conjecture", arXiv:math. AG/0207025 . 31. D. Gross, A. Migdal, "Nonperturbative two-dimensional quantum gravity", Phys. Rev. Lett. 64 (1990). 32. D. Gross, A. Migdal, "A nonperturbative treatment of two-dimensional quantum gravity", Nucl. Phys. B 340, 333-365 (1990). 33. M. Haiman, "Notes on Macdonald polynomials and the geometry of Hilbert schemes", in Proceedings of the NATO Advanced Study Institute held in Cambridge, June 25-July 6, 2001, Sergey Fomin, ed., Kluwer: Dordrecht, 2002, pp. 1-64. 34. J. Harer, D. Zagier, "The Euler characteristic of the moduli space of curves", Invent. Math. 85, 457-485 (1986). 35. J. Haris, I. Morrison, Moduli of curves, Springer-Verlag, 1998. 36. A. Hurwitz, "Uber die Anzahl der Riemann'schen Flachen mit gegebenen Verzweigungspunkten", Math. Ann. 55, 53-66 (1902). 37. C. Itzykson, J. B. Zuber, "Combinatorics of the modular group. 2. The Kontsevich integral", Int. J. Mod. Phys. A 7, 1-23 (1992). 38. K. Johansson, "The longest increasing subsequnce in a random permutation and a unitary random matrix model", Math. Res. Lett. 5, 63-82 (1998). 39. K. Johansson, "Discrete orthogonal polynomial ensembles and the Plancherel measure", Ann. of Math. (2) 153, 259-296 (2001). 40. K. Johansson, "Non-intersecting paths, random tilings and random matrices", Probab. Theory Related Fields 123, 225-280 (2002). 41. G. Jones, "Characters and surfaces: a survey", in The atlas of finite groups: ten years on (Birmingham, 1995), London Math. Soc. Lecture Note Ser. 249, Cambridge Univ. Press, Cambridge, 1998, pp. 90-118. 42. V. Kac, Infinite dimensional Lie algebras, Cambridge University Press. 43. V. Kac, A. Schwarz, "Geometric interpretation of the partition function of 2D gravity", Physics Letters B 257, 329-334 (1991). 44. R. Kenyon, A. Okounkov, S. Sheffield, in preparation. 45. S. Kerov, "Gaussian limit for the Plancherel measure of the symmetric group", C. R. Acad. Sci. Paris 316, Serie I, 303-308 (1993). 46. S. Kerov, "Transition probabilities of continual Young diagrams and the Markov moment problem", Func. Anal. Appl. 27, 104-117 (1993). 47. S. Kerov, "The asymptotics of interlacing roots of orthogonal polynomials", St. Petersburg Math. J. 5, 925-941 (1994). 48. S. Kerov, "A differential model of growth of Young diagrams", Proceedings of the St. Petersburg Math. Soc. 4, 167-194 (1996). 49. S. Kerov, "Interlacing measures", Amer. Math. Soc. Transl. 181, Series 2, 35-83 (1998). 50. S. Kerov, Asymptotic Representation Theory of the Symmetric Group and its Applications in Analysis, AMS, 2003. 51. S. Kerov, G. Olshanski, "Polynomial functions on the set of Young diagrams", C. R. Acad. Sci. Paris Ser. I Math. 319, no. 2, 121-126 (1994). 52. M. Kontsevich, "Intersection theory on the moduli space of curves and the matrix Airy func-
402
ANDREI OKOUNKOV
tion", Comm. Math. Phys. 147, 1-23 (1992). 53. A. Lascoux, J.-Y. Thibon, "Vertex operators and the class algebras of symmetric groups", Zap. Nauchn. Sem. S.-Peterburg. Otdel. Mat. Inst. Steklov. (POMI) 283 (2001), Teor. Predst. Din. Sist. Komb. i Algoritm. Metody 6, pp. 156-177. 54. B. F. Logan, L. A. Shepp, "A variational problem for random Young tableaux", Adv. Math. 26, 206-222 (1977). 55. E. Looijenga, "Intersection theory on Deligne-Mumford compactifications (after Witten and Kontsevich)", Seminaire Bourbaki, Vol. 1992/93. Asterisque No. 216 (1993), Exp. No. 768, 4, pp. 187-212. 56. A. Losev, N. Nekrasov, S. Shatashvili, "Issues in Topological Gauge Theory", Nucl. Phys. B 534, 549-611 (1998). 57. A. Losev, N. Nekrasov, S. Shatashvili, "Testing Seiberg-Witten Solution", a r X i v : h e p - t h / 9801061. 58. I. G. Macdonald, Symmetric functions and Hall polynomials, Clarendon Press, 1995. 59. M. Mehta, Random matrices, Academic Press, 1991. 60. T. Miwa, M. Jimbo, E. Date, Solitons. Differential equations, symmetries and infinitedimensional algebras, Cambridge Tracts in Mathematics 135, Cambridge University Press, Cambridge, 2000. 61. P. van Moerbeke, "Random matrices and permutations, matrix integrals and integrable systems", Seminaire Bourbaki Vol. 1999/2000, Asterisque 276, 411-433 (2002). 62. P. van Moerbeke, Integrable lattices: random matrices and random permutations. Random matrix models and their applications, Math. Sci. Res. Inst. Publ. 40, Cambridge Univ. Press, Cambridge, 2001, pp. 321-406. 63. H. Nakajima, Lectures on Hilbert schemes of points on surfaces, AMS 1999 64. H. Nakajima, K. Yoshioka, "Instanton counting on blowup, I", arXiv:math. AG/0306198 . 65. N. Nekrasov, "Seiberg-Witten prepotential from instanton counting", arXiv:hep-th/0206161. 66. N. Nekrasov, ICMP 2003 contribution, this volume. 67. N. Nekrasov, A. Okounkov, "Seiberg-Witten theory and random partitions", a r X i v : h e p - t h / 0306238. 68. A. Okounkov, "Random matrices and random permutations", Internal. Math. Res. Notices, no. 20, 1043-1095 (2000). 69. A. Okounkov, "Infinite wedge and random partitions", Selecta Math., New Ser. 7, 1-25 (2001). 70. A. Okounkov, "Generating functions for intersection numbers on moduli spaces of curves", Internat. Math. Res. Notices, no. 18, 933-957 (2002). 71. A. Okounkov, "Symmetric function and random partitions", in Proceedings of the NATO Advanced Study Institute held in Cambridge, June 25-July 6, 2001, Sergey Fomin, ed. Kluwer, Dordrecht (2002), pp. 1-64. 72. A. Okounkov, R. Pandharipande, "Gromov-Witten theory, Hurwitz numbers, and matrix models, I", arXiv:math.AG/0101147. 73. A. Okounkov, R. Pandharipande, "Gromov-Witten theory, Hurwitz theory, and completed cycles", arXiv: math. AG/0204305 . 74. A. Okounkov, R. Pandharipande, "The equivariant Gromov-Witten theory of P 1 ", arXiv: math.AG/0207233. 75. A. Okounkov, R. Pandharipande, "Virasoro constraints for target curves", arXiv:math.AG/ 0308097. 76. R. Pandharipande, "The Toda equations and the Gromov-Witten theory of the Riemann sphere", Lett. Math. Phys. 53, 59-74 (2000). 77. M. Prahofer, H. Spohn, "Universal distributions for growth processes in 1+1 dimensions and random matrices", Phys. Rev. Lett. 84, 4882 (2000). 78. M. Prahofer, H. Spohn, "Scale invariance of the PNG droplet and the Airy process", J. Statist. Phys. 108, 1071-1106 (2002). 79. E. M. Rains, "Increasing subsequences and the classical groups", Electr. J. of Combinatorics, 5 (1998).
The uses of random partitions
403
80. B. Sagan, The symmetric group. Representations, combinatorial algorithms, and symmetric functions, Springer-Verlag, 2001. 81. G. Segal, G. Wilson, "Loop groups and equations of KdV type", Inst. Hautes Etudes Sci. Publ. Math. no. 61, 5-65 (1985). 82. N. Seiberg, E. Witten, "Monopole condensation, and confinement in N = 2 supersymmetric Yang-Mills theory", Nucl. Phys. B 426, 19-52 (1994); Erratum ibid. 430, 485-486 (1994). 83. N. Seiberg, E. Witten, "Monopoles, duality and chiral symmetry breaking in N = 2 supersymmetric QCD", Nucl. Phys. B 431, 484-550 (1994). 84. T. Seppalainen, "A microscopic model for Burgers equation and longest increasing subsequences", Electron. J. Prob. 1 (1996). 85. C. A. Tracy, H. Widom, "Level-spacing distributions and the Airy kernel", Comm. Math. Phys. 159, 151-174 (1994). 86. C. Tracy, H. Widom "Random unitary matrices, permutations and Painleve", Comm. Math. Phys. 207, 665-685 (1999). 87. C. Tracy, H. Widom, "On the distributions of the lengths of the longest monotone subsequences in random words", Probab. Theory Related Fields 119, 350-380 (2001). 88. K. Ueno, K. Takasaki, "Toda lattice hierarchy", Adv. Studies in Pure Math. 4, Group Representations and Systems of Differential Equations, 1-95 (1984). 89. A. Vershik, "Statistical mechanics of combinatorial partitions and their limit configurations", Func. Anal. Appl. 30, 90-105 (1996). 90. A. Vershik, S. Kerov, "Asymptotics of the Plancherel measure of the symmetric group and the limit form of Young tableaux", Soviet Math. Dokl. 18, 527-531 (1977). 91. A. Vershik, S. Kerov, "Asymptotic theory of the characters of a symmetric group", Functional Anal. Appl. 15, 246-255 (1981). 92. A. Vershik, S. Kerov, "Asymptotics of the maximal and typical dimension of irreducible representations of symmetric group", Func. Anal. Appl. 19 (1985). 93. E. Witten, "Two dimensional gravity and intersection theory on moduli space", Surveys in Diff. Geom. 1, 243-310 (1991). 94. A. Zvonkin, "Matrix integrals and map enumeration: an accessible introduction", Math. Cornput. Modelling 26, 281-304 (1997).
Stochastic growth in one dimension and Gaussian multi-matrix models HERBERT SPOHN, PATRIK
L.
FERRARI, MICHAEL P R A H O F E R
(Tech. U., Munchen)
We discuss the space-time determinantal random field which arises for the PNG model in one dimension and resembles the one for Dyson's Brownian motion. T h e information of interest for growth processes is carried by the edge statistics of the random field and therefore their universal scaling is related to the edge properties of Gaussian multi-matrix models.
1. Introduction There is a huge variety of stochastic growth processes and we refer to the recent monographs by Barabasi, Stanley [1] and Meakin [2]. Amongst them the KPZ (Kardar, Parisi, Zhang) growth models enjoy a particular popularity in theoretical circles. Prototypes are the Eden growth, where each perimeter site of the current cluster is filled after an exponentially distributed waiting time, and ballistic deposition, where there is a low intensity random flux of incoming particles which then attach to the current surface profile. Thus in broad terms KPZ growth is characterised by being stochastic with a — local growth rule, — smoothening mechanism. The list of experiments well described in terms of KPZ is rather short, see Mylls et al. [3] for recent experiments on slow combustion fronts. But one obvious reason for its popularity is that KPZ growth is rather close to the stochastic dynamics commonly studied in Statistical Mechanics with the fine twist that it does not satisfy detailed balance (KPZ growth is not stochastically reversible). There is a second reason, however: through a Cole-Hopf type transformation KPZ growth maps to directed first passage percolation. Thereby techniques and insights from the theory of disordered systems come into play. In fact, there are some rather close analogies, one example being the rigorous discussion of the overlap for directed polymers [4]. Just before the previous ICMP congress it came as a total surprise that KPZ growth in one spatial dimension (the surface is the graph of a function over R) is linked to Gaussian random matrices. For example, the height, h(r), at time r above a given reference point grows at constant speed VQ with some random fluctuations, h(r) =
V0T
+r
1
^
(1)
for large r. If at the reference point the macroscopic profile has a non-zero curvature, then £ has the same distribution function as the largest eigenvalue of a GUE random matrix, which in the community is known as the Tracy-Widom distribution function F2, P(£ < x) = F2(x). F2 is related to the Hastings-McLeod solution of the Painleve II differential equation. The
404
Stochastic growth in one dimension and Gaussian multi-matrix models
405
scaling exponent 1/3 and the random amplitude £ in (1) are expected to be valid for all one-dimensional KPZ growth models. The purpose of our contribution is to explain how this connection arises. The rough indication can be given already now: For some very particular growth models in the KPZ class, there is a determinantal process in the background. Its structure has some similarity to the determinantal process appearing in the context of Gaussian multi-matrix models. Since the relevant information is linked to edge scaling, in the scaling limit the GUE edge distribution arises.
2. G U E , Dyson's Brownian motion, and t h e Airy process Dyson [5] considers an Ornstein-Uhlenbeck process, A(t), t e R, on the space of N x N Hermitian matrices. Its transition probability is given through a Mehler type formula as F(A(t) € dA' | A(0) = A) = I e x p [ - l T r ( y l ' -qA)2/{\ with q by
- q2) dA'
(2)
In particular, for t —• oo the process converges to the GUE ensemble given Z~x exp
dA. -v** For the joint distribution of the stationary process at times ordered as to < t\ <
(3)
one obtains therefore P({A(tj)£dAj,j 1 :exp
=
0,l,...,m})
If^Tr^-^^OVa-^+Tr^ N \j=i
Y[dAj
(4)
3=0
with qj = exp[— (tj — t,-i)]- (4) is the Gaussian multi-matrix model of the title. The process A(t) induces a process on the eigenvalues Ajy(£)<••• < Xi(t) of A(t). It is stationary by construction and happens to be again Markov. In fact, it satisfies the set of stochastic differential equations
d\j(t)=(-±\j(t)
+ (p/2) JT (XjW-Xiity-^dt
+ dbjit), j = l,...,N, (5)
with {bj(t), j = 1 , . . . , N} a collection of N independent standard Brownian motions. In case of Hermitian random matrices (3 — 2. The repulsive drift ensures that \j+i(t) < \j(t) for all t, any j . Clearly (5) has the unique stationary distribution N
N
exp
~AT^
A
J
2
n l
lAi-A/ipv
(6)
j=l
There is a way to rewrite (5) which turns out to be computationally very powerful. We define a random field 4>N over R 2 through N j= l
406
HERBERT SPOHN, PATRIK L. FERRARI, MICHAEL P R A H O F E R
Then 4>N is determinantal in the sense that its moments (correlation functions) have a determinantal structure of the form m
\
Y[
= det{RN(xi,U;xj,tj)}™j=1
(8)
for distinct times t\,..., tm. The defining kernel i?jv can be written in terms of the harmonic oscillator Hamiltonian H
" = -\% + ^2x*>
(9)
which has eigenvalues En = n/N, n = 1,2,... . Let KM be the Hermite kernel which is the spectral projection onto {iJjv < 1}- Then RN(x,t;x',t') = (e~tHN(KN - lQ(t-t'))^'"")^,^)
(10)
with Q(t) — 1 for t > 0 and Q(t) = 0 for t < 0. In particular, at t = 0 (or any other time by stationarity)
/ J I
,
(11)
as well known from the random matrix bible by Mehta [6]. A natural question is to study the statistics of lines close to the, say upper, edge. In our units the top line fluctuates at level V2N, as can be seen by equating the Fermi energy Ep, Ep = 1, with the energy of the confining potential of (9). Thus we shift our attention to x = ^/2N and linearise there the potential, a procedure which should be accurate for large N. Then the imaginary time Schrodinger equation for (9) goes over to d^=(-\d2x + ^V2x)^.
(12)
It becomes iV-independent under the scaling t ~» N2^H, x ~-* A/'1^3a;/-\/2, resulting in the Schrodinger equation with Airy Hamiltonian dti> = Hi>,
H = -d2x + x.
(13)
We have identified the edge scaling and conclude that, in the sense of convergence of moments, Km ^=Nllz4>N(V2N
+ -^N^x,
N2/3t)
=
(14)
The prefactor comes from the spatial volume element when integrating both sides in (14) over a compactly supported test function. cf> is called the Airy random field. Since 0jv is determinantal, so must be its limit. Hence, for distinct times t\,...,tm, m
\
H Hx^tj)) = detlRixut^xj,^)}™^ ,
(15)
compare with (8). The defining kernel is now given through R(x, t; x', t') = (e~tH (K - 119(t - t')) et>H^ (x, x') = sign(t' -t) I d\ Q(X(t
T
t')) e A(t '-* ) Ai(x - A) Ai(x' - A),
(16)
Stochastic growth in one dimension and Gaussian multi-matrix models
407
where sign(i) = 1 for t > 0 and sign(£) = - 1 for t < 0, Ai the Airy function, and K the spectral projection onto {H < 0}. K is known as Airy kernel. For fixed t the limit is studied by Forrester [7], and Tracy, Widom [8]. Some aspects of the multi-matrix model are discussed by Eynard [9]. The top line of the Airy random field is called the Airy process [10], denoted by A(t). Its joint distributions can be written in a concise way. Let h < •••
< £ m }) = P({0(x,^) = 0 for x > & , j = 1 , . . . ,m}) = det(l-i?(m)).
(17)
The Airy process has continuous sample paths and is stationary, by construction. Some more explicit expressions for joint distributions are given in [11-14]. In particular, (^4(0)2) = 0.81325... , ((.4(0) - A(t))2) = 2\t\ for small t, and (A(0)A{t)) - (A(0)}2 = t~2 + 0 ( r 4 ) for large t.
3. Polynuclear growth, its determinantal random field, and edge scaling For the polynuclear growth (PNG) model the height function at time r takes integer values, h(x, T) £ Z, x € R, r > 0. x — i » h(x, r ) has jumps of unit size only. Upward steps move, in the x direction, with velocity —1 and downward steps with velocity 1. They annihilate each other upon collision, which is the required smoothening mechanism. The surface grows through nucleation. At such an event, say (ari,ri), the height h(x,T\) is increased by one unit at xi, thereby creating an upward and downward step, which move apart immediately under the deterministic part of the dynamics. The nucleation events are Poisson in spacetime with intensity 2. We consider the droplet geometry which is enforced by allowing nucleations only in the space interval [—T, T]. Initially h(x, 0) = 0 and, by assumption, h(x,r) = 0 for \x\ > r. Obviously, according to our criteria of the Introduction, the PNG model is in the KPZ class. For PNG, as just described, there is no determinantal process in sight. The miracle happens through the RSK (Robinson, Schensted, Knuth) construction. The idea is to extend the model with additional lines which record the information lost in annihilation events. Thus we introduce the lines hj(x,r), j = 0, —1, — 2 , . . . , and set h0(x,r) = h(x, r ) . Initially hj(x,0) = j . /io evolves according to the PNG specified above. Given h0(x,T) all lower lying lines have a deterministic dynamics. The steps of line hj, j < —1, move with velocity ± 1 , as before. A nucleation, say at space-time point (X,T), takes place whenever in line j + 1 an upward and downward step annihilate each other at (x,r). Thus x H-> hj(x,r) has jumps of unit size only, hj(x, T) = j for |x| > r , and hj(x, T) > hj-i(x, r ) . Furthermore there is a random index jo such that, for j < jo, hj(x,r) = j for all x. Let us set, at fixed time r, [ 1 if there is a height line passing through (j, t), T)T(j,t) = < I 0 if there is no such line,
(18)
408
HERBERT SPOHN, PATRIK L. FERRARI, MICHAEL PRAHOFER
with j € Z, t € [—r, r]. ?7T (ji, i) is a determinantal random field over Z x [—r, r]. Its top line is the object of interest, since it coincides with PNG. To write down the defining kernel for rjT we set, as operators on I? = ^ ( Z ) , Hdi>{j) = ~1>U - 1) - 4>U + 1 ) , HMj)
(19)
= -i>ti - 1 ) - V-(i + 1 ) + i-4>U).
(20)
T
and BT the spectral projection on {HT < 0}. BT is known as discrete Bessel kernel. Then Rr(j,t;j',t')=(e-tH*(BT-KG(t-t'))et,HA
.
(21)
The moments of rjT(j,t) are given by the formula analogous to (8). For large r, ho(x,r) = 2 \ / T 2 — x2, \x\ < T. Thus nT(x,t) is not stationary in t, which is reflected in (21) by the fact that Hd ^ HT. The correct edge scaling can be guessed as for Dyson's Brownian motion, where in spirit T is equated with N. We focus our attention on a space-time window of width r 2 / 3 and height r 1 / 3 centred at t = 0 and j = 2r. Properly rescaled, compare with (12), HT becomes HMx)
= r 2 / 3 ( - $(x - r ~ 1 / 3 ) - ip(x + T~^3)
+ 2^{x) + T^3(X/T)IP(X))
,
(22)
which converges to the Airy operator H as r —> oo. This argument overlooks that even in rescaled coordinates the lines have still a systematic curvature. The correct limit is thus, [•] denoting integer part, lim r 1 / 3 r/ T ([2r +
T1'*{X
- t2)}, r 2 / 3 i ) =
(23)
r—>oo
with <j> the Airy random field. In particular, the top line converges to the Airy process A(t). We summarise our discussion as T h e o r e m 3 . 1 . Let h(x,r) be the PNG model in the droplet geometry. Then, in the sense of weak convergence of finite-dimensional distributions, lim T~1/3 (h(tr2/3 , T) - 2r) = A(t) - t2 . T—>0O
(24)
'
Proof. For t = 0 this is the celebrated result of Baik, Deift, Johansson [15] on the length of the longest increasing subsequence of a random permutation. They use orthogonal polynomials and Riemann-Hilbert techniques in their asymptotic analysis. Johansson [16] develops an approach through Fredholm determinants, which is also the basis of the proof by Prahofer, Spohn [10] for the time-extended case. • Theorem 3.1 is stated for the reference point x = 0. An analogous limit holds for any other reference point x = ar, \a\ < 1. For a space-time discrete version of the PNG droplet Johansson [17] proves the analogue of Theorem 3.1 including tightness. While the details are long, it is likely that Theorem 3.1 can be strengthened to weak convergence of path measures. As established by Johansson [18], for the Aztec diamond the border between the frozen and disordered zones is also governed by the Airy process.
Stochastic growth in one dimension and Gaussian multi-matrix models
409
4. Extensions What is so special about the droplet? From the point of view of growth processes other initial conditions look more natural, e.g. the initially flat surface h(x,0) = 0. The RSK dynamics can be implemented for any choice of initial data and set of nucleation events. However the determinantal property of the resulting line ensemble is fragile. We illustrate our point by three examples. (i) Half droplet. In a discrete space-time setting this problem is studied recently by Sasamoto and Imamura [19]. The rule is to simply suppress all nucleation events for x < 0. In addition there is a source at x = 0 with rate 7. For 0 < 7 < 1 the additional mass is incorporated in the bulk with no change in the macroscopic profile. The height fluctuations at x = 0 satisfy (1) with F 2 replaced by F4, i.e., the distribution function of the largest eigenvalue of a GSE random matrix in the limit of large JV [20]. 7 = 1 is critical and F% in (1) is replaced by F\ [20], i.e., the distribution function of the largest eigenvalue of a GOE random matrix [21]. For 7 > 1 the macroscopic profile develops a linear portion, starting at x = 0, at height (7 + 7 _ 1 ) T and joining tangentially the profile for 7 = 0. The height fluctuations are then of order y/r and Gaussian. The RSK dynamics results in a line ensemble, which for fixed r has the following weight. At x = r the boundary condition is hj(r, r) = j . The lines have jump size one and are not allowed to cross. Under these constraints the jump points are uniformly Lebesgue distributed. If this construction would be extended to x = — r with the boundary condition hj(—r, r) = j , the resulting line ensemble is the one of the PNG droplet. For the half-droplet the lines end at x = 0 and there is an extra weight from the source at the origin, which reads
exp
(log 7 ) £
(h2j(0,T)-h2j^(0,T)-l)
(25)
For 7 = 1 the extra weight (25) equals 1 and the weight for the line ensemble can be written as a product of determinants. However, the correlation functions are not determinantal because of the free boundary at x = 0. By an identity of de Bruijn, gap probabilities are square roots of determinants, thus yielding GOE edge statistics at x = 0 in the scaling limit. For 7 = 0 the lines at re = 0 are constrained as h,2j(0, T) — /i 2 j_i(0, T) — 1, which resembles Kramers degeneracy, thus GSE. An asymptotic analysis is available only for 7 = 0,1 [19] with the result that T 2 / 3 close to the origin the height statistics is governed by the largest eigenvalue of the transition ensemble governing the crossover from GOE, resp. GSE, at x — 0 to GUE in the bulk. For 7 > 1, at x — 0 the top line separates the distance (7 + 7 _ 1 — 2)T from /i_i(0, T) which remains located roughly at IT, as for 7 < 1. This explains the Gaussian fluctuations for 7 > 1. (ii) Stationary P N G . If initially the upward steps are Poisson distributed with density p+ and the downward steps with density p_, then PNG on the whole real line is a spacetime stationary growth process, provided p+p~ = 1. To determine / I ( 0 , T ) it suffices to know the nucleation events in the backward light cone {(x,i)|0
410
HERBERT SPOHN, PATRIK L. FERRARI, MICHAEL PRAHOFER
{(rr,i)|0 < t < r, |a;| < r } . The reason is that for stationary PNG model the intersection points of the diagonals {x = ±t} with the world lines for steps are again Poisson distributed. Thus we arrive at a set-up rather similar to the PNG droplet. The additional feature is that there are sources of nucleation events at the boundaries ± r . The sources are Poisson in time with rates a+, resp. a_, a+ > 0, a_ > 0, stationarity being equivalent to OJ+OJ_ = 1. As before, a line ensemble is generated through the RSK dynamics. At time T, the boundary conditions hj(±r, T) = j , j = —1, — 2 , . . . , hold. However, ho(±T, r ) can now take arbitrary positive integer values. They have geometric weight, i.e., the weight exp[(loga±)/i 0 (±T,T)].
(26)
The step points are Lebesgue distributed, constrained by non-crossing and the boundary conditions at ± T . Let £1 be the collection of all such lines. By running RSK backwards in time, one will end up with hj(0,r) = j and ho(0,r) = n for r > 0 but sufficiently small. £1 decomposes accordingly as fi = Un>o&n- ^ n has the total weight (a+ct-)nZo, ZQ — exp[t(a+ — a^ 1 ) + t(a_ — a l 1 ) ] . Stationary growth corresponds to the sector Q 0 carrying weight Wo and probability Po = Wo/Zo for the lines. However, only the line ensemble with weight W on fi is determinantal. Fortunately, one has the simple relationship PO({M0,T)
= *}) = Z0"1Wo({fco(0,T) = *}) = Zo"-1 (W({/IO(0,T) = *}) - a+a_W({/i 0 (0,r) = k - 1})) .
(27)
Stationary PNG is described by the spatial derivative of a determinantal process. (27) serves as a starting point for an asymptotic analysis, see Baik, Rains [22] and Prahofer, Spohn [23]. (iii) Flat initial conditions. One sets h(x, 0) = 0 and allows for nucleation events on the entire real line. For the RSK construction it is necessary to first restrict to the periodic box [—•£,•£]. Then hj(—i,r) — hj(£,r). As before, the lines are constrained on non-crossing and unit jumps. In addition they have to satisfy mm (hj{x,T) - hj-xfar)) = 1.
(28)
An asymptotic analysis of the line ensemble with these constraints is not available. By completely different methods Baik, Rains [20] prove that, first taking I —> oo, the height fluctuations at x = 0 are distributed as the largest GOE eigenvalue, i.e., Fi replaces F2 in (1). On this basis, a possible guess for the full height statistics, in the limit r —> oo, is Dyson's Brownian motion at (3 = 1. We will have to wait for the next ICMP congress for a confirmation (or a counter argument).
References 1. A. L. Barabasi, H. E. Stanley, Fractal Concepts in Surface Growth, Cambridge University Press, 1995. 2. P. Meakin, Fractals, scaling and growth far from equilibrium, Cambridge University Press, 1998. 3. M. Mylls, J. Maunuksela, J. Merikoski, J. Timonen, V. K. Horvath, M. Ha, M. den Nijs, "Effect of columnar defect on the shape of slow-combustion fronts", arXiv: cond-mat/0307231.
Stochastic growth in one dimension and Gaussian multi-matrix models
411
4. F. Comets, T. Shiga, N. Yoshida, "Probabilistic analysis of directed polymers in a random environment: a review", Advanced Studies in Pure Mathematics, 2003, to appear. 5. F. J. Dyson, "A Brownian motion model for the eigenvalues of a random matrix", J. Math. Phys. 3, 1199-1215 (1962). 6. M. L. Mehta, Random Matrices, Academic Press, 1990. 7. P. J. Forrester, "The spectrum edge of random matrix ensembles", Nucl. Phys. B 402, 709-728 (1993). 8. C. A. Tracy, H. Widom, "Level-spacing distributions and the Airy kernel", Comm. Math. Phys. 159, 151-174 (1994). 9. B. Eynard, "Correlation functions of eigenvalues of multi-matrix models, and the limit of a time dependent matrix", J. Phys. A 31, 8081-8098 (1998). 10. M. Prahofer, H. Spohn, "Scale invariance of the PNG droplet and the Airy process", J. Stat. Phys. 108, 1071-1106 (2002). 11. M. Adler, P. van Moerbecke, "A PDE for the joint distribution of the Airy process", arXiv: math.PR/0302329. 12. C. A. Tracy, H. Widom, "A system of differential equations for the Airy process", Elect. Comm. Prob. 8, 93-98 (2003). 13. C. A. Tracy, H. Widom, "Differential equations for Dyson processes", axXiv:math.PR/0309082 . 14. H. Widom, "On asymptotics for the Airy process", arXiv:math.PR/0308157 . 15. J. Baik, P. Deift, K. Johansson, "On the distribution of the length of the longest increasing subsequence in a random permutation", J. Amer. Soc. 12, 1189-1178 (1999). 16. K. Johansson, "Non-intersecting paths, random tilings and random matrices", Probab. Theory Relat. Fields 123, 225-280 (2002). 17. K. Johansson, "Discrete polynuclear growth and determinantal processes", arXiv:math.PR/ 0206208 . 18. K. Johansson, "The arctic circle boundary and the Airy process", arXiv:math.PR/0306216 . 19. T. Sasamoto, T. Imamura, "Fluctuations of a one-dimensional polynuclear growth model in a half space", arXiv:cond-mat/0307011. 20. J. Baik, E. Rains, "Symmetrized random perturbations", in Random Matrix Models and Their Applications, P. Bleher, A. Its, eds., MSRI Publications 40, 1-19, Cambridge University Press, 2001. 21. C. A. Tracy, H. Widom, "On orthogonal and symplectic matrix ensembles", Comm. Math. Phys. 177, 727-754 (1996). 22. J. Baik, E. Rains, "Limiting distributions for a polynuclear growth model with external sources", J. Stat. Phys. 100, 523-541, (2000). 23. M. Prahofer, H. Spohn, "Exact scaling functions for one-dimensional stationary KPZ growth", arXiv:cond-mat/0212519; J. Stat. Phys., to appear.
This page is intentionally left blank
Nonequilibrium statistical mechanics Session organized by G. GALLAVOTTI (Rome) and S. OLLA (Cergy-Pontoise)
This page is intentionally left blank
Poincare recurrence: old and new Luis
BARREIRA
(1ST, Lisbon)
The classical theorem of Poincare on recurrence only gives information of qualitative nature. On the other hand it is clearly a matter of intrinsic difficulty and not of lack of interest that less is known concerning the quantitative behavior of recurrence. Here we discuss recent developments that include the almost everywhere coincidence between the recurrence rate and the pointwise dimension in the case of hyperbolic dynamics. We also discuss the almost product structure of recurrence, which closely imitates the product structure provided by the families of stable and unstable manifolds as well as the almost product structure of hyperbolic measures.
1. I n t r o d u c t i o n The notion of nontrivial recurrence goes back to Poincare in his study of the three-body problem. He proved in his celebrated memoir [11] of 1890 that whenever a dynamical system preserves volume almost all trajectories return arbitrarily close to their initial position and that they do this an infinite number of times. More precisely, Poincare established the following. Theorem 1.1. If a flow preserves volume and has only bounded orbits then for each open set there exist orbits that intersect the set infinitely often. This is Poincare's recurrence theorem (the versions that we encounter today in the literature slightly differ from this original formulation). The memoir is the famous one that in its first version (printed in 1889, even having circulated shortly, and of which some copies still exist today) had the error that can be seen as the main cause for the study of chaotic behavior in the theory of dynamical systems. Incidentally, Poincare's recurrence theorem was already present in the first printed version of the memoir as then again in [11]. We now present a somewhat more modern formulation of the theorem. We consider a measurable transformation T : X —> X preserving a finite measure fi on X (this means that fx(T~1A) = fJ.(A) for every measurable set A C X). The following is an alternative version of Poincare's recurrence theorem. Theorem 1.2. For each measurable set Ad X we have fi({x € A : Tnx £ A for infinitely many positive n 's}) = n(A). In other words, the existence of a finite invariant measure guarantees that almost every orbit starting in a set A returns infinitely often to this set. When X is a metric space with distance d one can also establish the following version of the recurrence theorem. Theorem 1.3. For (x-almost every x £ X we have
liminfd(T"z,z)=0.
415
(I)
416
Luis
BARREIRA
The identity (1) tells us that the orbit of /x-almost every point returns arbitrarily close to the initial point. Poincare's recurrence theorem is a basic but also fundamental result in the theory of dynamical systems. In particular, it tells us that the existence of a finite invariant measure causes a nontrivial recurrence in each set of positive measure. Unfortunately it only provides information of qualitative nature (in any of its alternative versions, such as those in Theorem 1.2 and Theorem 1.3). In particular it tells us nothing about the following two natural problems: (1) with which frequency the orbit of a point visits a given set; (2) with which rate the orbit of a point returns to an arbitrarily small neighborhood of the initial point. Birkhoff's ergodic theorem gives a complete answer to the first problem. The second problem experienced a growing interest during the last decade, also in connection with other fields, including compression algorithms, and the numerical study of dynamical systems. Our main objective is to discuss several recent developments related to this problem, which pertains to the quantitative study of recurrence.
2. Quantitative recurrence We assume in this section that T: X —> X is a measurable transformation on a space X such that X C R m for some positive integer m. We define the return time of a point x £ X to the open ball B{x, r) of radius r centered at x by Tr{x) = inf{fc > 0 : Tkx £
B(x,r)}.
We also define the lower and upper recurrence rates of x by logr r (x) R(x) = liminf ——r->o - l o g r
and
— logr r (a;) R(x) = hmsup :——-. _>o -logr r
These quantities measure the rate with which the orbit of x returns to an arbitrarily small neighborhood of this point. The following result of Barreira and Saussol in [3] provides upper bounds for the lower and upper recurrence rates in terms of the lower and upper pointwise dimensions of \x at the point x. These are defined respectively by d„ (x); _/iV
= hm inf r-o
log/x(B(z,r)) ^— — and dJx) = hm sup MV logr ' r_>0
logfi(B{x,r)) 7—- —. logr
T h e o r e m 2 . 1 . IfT preserves a finite measure fi on X, then for ^-almost every x € X, R(x)
and ~R(x) ^d^x).
(2)
It follows from Whitney's embedding theorem that if X is a subset of a finite-dimensional smooth manifold, then it can be smoothly embedded into E m for any sufficiently large m > 0, and thus the inequalities in (2) also hold ^-almost everywhere in this situation. The following example illustrates that without further hypotheses the inequalities in (2) may in general be strict on a set of positive //-measure. On the other hand, Theorem 3.1
Poincare recurrence: old and new
417
below shows that under certain reasonable additional assumptions the inequalities in (2) become identities on a set of full /u-measure. These assumptions are related to the existence of some hyperbolic behavior for the dynamical system. Example 2.1. Consider a rotation of the circle by an irrational number u which is well approximated by rational numbers, in the sense that there exists u > 1 such that \u—p/q\ < l/qv+1 for infinitely many relatively prime integers p and q, say pn and qn for each positive integer n. Since \q„u> — pn\ < l/qnu for each n, we have T
i/gn,'(x)
= mf
{ f c > 0 : few (mod 1) < l/qn"} < qn
for every x in the circle, and thus R(x) < liminf —;—j—.
r < - < 1.
On the other hand, irrational rotations have as unique invariant measure the Lebesgue measure m, for which dm(x) = dm(x) = 1 for every x. In particular R(x) < dm(x) for every x in the circle (and thus the first inequality in (2) is strict everywhere). Boshernitzan proved earlier in [7] that if the a-dimensional Hausdorff measure ma is afinite on X (that is, if X can be written as a countable union of sets Xi for % = 1, 2, . . . such that ma{Xi) < oo for all i), and T preserves a finite measure fj, on X, then \\mmi[nl'ad{Tnx,x)\
< oo
for ^-almost every x S X. He also showed that if, in addition, ma(X) lim inf [n1/ad(Tnx,
= 0, then
x)} = 0.
(3)
n—*oo
for /z-almost every x G X. This statement must be compared with (1). While (1) only tells us that some subsequence of the orbit of x converges to x, the identity (3) provides some information about the speed with which that subsequence converges to the point x. The following statement uses the notion of Hausdorff dimension of a measure fi on X. This is defined by dim// M — inf {dim// Z : fx(Z) = fi{X)}, where dim// Z denotes the Hausdorff dimension of the set Z c X. It can be shown (see for example Proposition 3 in [6]) that dim// /i = esssup{dM(:r) : x G X}.
(4)
Boshernitzan's results in [7] can be reformulated in the following manner (see [3] for details). Theorem 2.2. IfT preserves a finite measure fi on X, then R(x) < dim // \x for fi-almost every x G X. One can also rephrase the first inequality in (2) in a form similar to (3) (see [3]). Theorem 2.3. IfT preserves a finite measure fi on X, then (3) holds for ^-almost every x £ X such that d^(x) < a.
418
Luis
BARREIRA
In view of the identity (4), Theorem 2.3 may in general provide a stronger statement than that in (3), and the first inequality in (2) may be sharper than that in Theorem 2.2. This possibility indeed occurs, as the following example illustrates. E x a m p l e 2.2. In [10], Pesin and Weiss presented an example of a Holder homeomorphism T: X —> X on a closed subset X of [0,1], preserving a probability measure [i such that: there exist disjoint sets A\, Ai c [0,1] with positive /i-measure, and there exist positive constants c\ and ci with c\ ^ C2 such that dJx) = d^x) = Ci for /x-almost every x G Ai and i = 1, 2. Clearly dim/j /x = max{ci,C2} and thus d^{x) < dim# \i on a set of positive /i-measure (on the set Ai with i such that Cj = min{ci,C2}). This example illustrates that in general Theorem 2.3 provides a stronger statement then that in (3). Therefore, one can see the first inequality in Theorem 2.1 as a nontrivial generalization of Boshernitzan's results in [7]. In addition, Theorem 2.1 provides also an upper bound for the upper recurrence rate. Theorem 3.1 below shows that for a certain class of maps and a certain class of measures the inequalities in (2) are in fact identities on a set of full measure.
3. Hyperbolic dynamics In the case of hyperbolic dynamics, the above results can be considerably strengthen. More precisely, we can obtain identities instead of inequalities in (2). We first recall the notions of hyperbolic set and of equilibrium measure. Let T: M —> M be a C 1 diffeomorphism on a smooth manifold. We say that a compact set X C M is a hyperbolic set for T if it is T-invariant (i.e., T~lX — X) and there exists a continuous splitting of the tangent bundle TxM = Es © Eu, and constants c > 0 and A G (0,1) such that for each x G X: (1) dxT(Es(x)) = Es(Tx) and dxT{Eu{x)) = Eu{Tx); (2) ||d x T"v|| < cA"|H| whenever v G E3(x) and n > 0; (3) ||d x T _ "v|| < cAn||u]| whenever v G Eu{x) and n > 0. We recall that a probability measure \i on X is called an equilibrium measure for the continuous function (p: X —> E if P(ip)=hli{T)+
f tpd/i, Jx
where P(v?) is the topological pressure of ip and /iM(T) is the Kolmogorov-Sinai entropy of T with respect to /i (see for example [1] for details). We recall that a measure \i is called ergodic if n(A) = 0 or ^(X \ A) = 0 whenever T _ 1 yl = A. The following statement was established by Barreira and Saussol in [3]. T h e o r e m 3.1. For a C1+a diffeomorphism with a compact hyperbolic set X, if [i is an ergodic equilibrium measure of a Holder continuous function, then lim
logrvOr)
r—>0 — l o g r
for ^-almost every point x G X.
=
Hm
logMBfor))
r—>0
log T
(5)
Poincare recurrence: old and new
419
T h e existence for almost every point of t h e limit in the right-hand side of (5) is due to Barreira, Pesin and Schmeling in [2]. We remark t h a t besides showing t h a t t h e inequalities in (2) are in fact identities almost everywhere under the hypotheses of Theorem 3.1, the identity (5) also says t h a t t h e lower and upper recurrence rates are equal almost everywhere. Thinking as if we could erase the limits in (5), we can say t h a t Theorem 3.1 shows t h a t inf{fc > 0 : fkx
G B(x,r)}
is approximately equal t o
l/jj,{B(x,r))
when r is sufficiently small, t h a t is, the time t h a t t h e orbit of x takes t o r e t u r n to the ball B(x,r) is approximately equal t o l//j,(B(x,r)). This should be compared to Kac's lemma: since n is ergodic we have /
Tr(y,x)dfi(y)
= l,
JB(x,r)
where rr(y,x)
= inf{k >0:Tky£
B(x,r)}.
Hence, the average value of the return time to B(x,r) is equal t o l/fx(B(x,r)). Therefore, Theorem 3.1 can be though of as a local version of Kac's lemma. It should also be noted t h a t (5) relates two quantities of very different nature. In particular, only the left-hand side depends on t h e diffeomorphism and only t h e right-hand side depends on t h e measure. T h e proof of Theorem 3.1 combines new ideas with the study of hyperbolic measures by Barreira, Pesin and Schmeling in [2] and results and ideas of Saussol, Troubetzkoy and Vaienti in [12] and of Schmeling and Troubetzkoy in [14] (see also [13]). In view of work of Barreira and Wolf in [5], in the case of surface diffeomorphisms it always exists an ergodic measure of "maximal recurrence", i.e., a measure at which t h e supremum sup M d i m # /x (over all finite invariant measures) is attained, and thus (in view of (4)) for which the left-hand side of (5) attains its maximal possible value almost everywhere. A related result in the case of repellers was obtained by Barreira and Saussol in [4]. Let T: M —> M be a differentiable m a p of a smooth manifold. Recall t h a t a compact T-invariant set X C M is called a repeller of T if there exist constants c > 0 and /3 > 1 such t h a t \\dxTnv\\
> c0n\\v\\
for each n > 0, x G X and v G TXM. It was established in [4] t h a t the statement in Theorem 3.1 remains valid when one replaces "diffeomorphism with a hyperbolic set" by "differentiable m a p with a repeller" (see [4] for details). We briefly present two applications to number theory t h a t can essentially be obtained from a direct application of this result. Let x = 0.:ri:r2 • • • be the base-m representation of t h e point x G [0,1] (this representation is unique except for a countable set of points in (0,1)). By considering [0,1] as a repeller of the transformation x i-> mx (mod 1) one can show (see [4]) t h a t inf{n > 0: | 0 . x n a ; n + i • • • — 0.£ia;2 • • -| < r } ~ - when r —» 0 r for Lebesgue-almost every x G [0,1], in the sense t h a t ,. loginf{n > 0: |0.a;„a; n + i lim : r—0 — log r
0 . x i x 2 • • !-I < r } = 1
420
Luis
BARREIRA
for Lebesgue-almost every x £ [0,1]. Another example is given by continued fractions. Writing each number a; € (0,1) as a continued fraction x = [mi,m 2 ,...] =
,
with rrii = rrii(x) > 0 for each i (again this representation is unique except for a countable set of points in (0,1)), one can show (see [4]) that for Lebesgue-almost every i £ (0,1), inf{n > 0 : |[ra„,m„+i,...] — [mi,m2,.. .]| < r } ~ - when r —> 0. r
4. Product structure and recurrence We discuss in this section the product structure of recurrence on hyperbolic sets. It turns out that the recurrence also possesses an almost product structure, which closely imitates the product structure provided by the families of stable and unstable manifolds as well as the almost product structure of hyperbolic measures. Let T: M —> M be a C 1 diffeomorphism with a compact hyperbolic set X C M. We assume in this section that X is locally maximal. This means that there exists an open neighborhood U D X such that X — f)n€zTnU. We also consider the local stable and unstable manifolds of (a sufficiently small) size £ of a point x € X, Ves(x) = {yeM: u
Ve (x) = {yeM:
d(Tnx,Tny) n
n
d(T x,T y)
< e for every n > 0}, < e for every n < 0},
and we denote by ds and du the distances induced by the distance d of M respectively on each stable and unstable manifold. We denote by Bs(x,r) C V*{x) and Bu(x,r) C V"(x) the corresponding open balls of radius r centered at x. Under the above assumptions, given e > 0 there exists 5 > 0 such that for any x, y £ X with d(x,y) < S the intersection VJ,(x)CiV^(y) contains exactly one point and we can define the map [•,•]: {(x,y) £XxX: d(x,y) < 5} -> M by[x,y} = Ves(x)nV^(y). For each p < 5 we define the stable and unstable return times of x € X respectively into the stable and unstable strips of radius r by r?(x,p) = inf{n > 0 : d{T~nx,x) n
r?(x,p) = inf{n > 0 : d{T x,x)
< p and ds{[x,T-nx],x) n
< p and du{[T x,x},x)
< r}, < r}.
We note that the functions p i-> T*(X, p) and p i-> r"(x, p) are nondecreasing. We define the lower and upper stable recurrence rates of the point x € X by Rs(x) = lim Rs(x, p)
and
p—»0
~R\x) = lim I s (x, p), p—>0
and the lower and upper stable recurrence rates of the point x e X by Ru{x) = lim Ru(x, p)
and
S"(s) = lim i f ( z , p),
421
Poincare recurrence: old and new where = luninfl°gT^p) r—o -logr r,u, \ ,- • ,logT"(a;,p) i?"(x,p) = hm mf p ^ R^x,p)
R°{x,p) = limsup l 0 g T f {x'p), -logr r^o , —u, . ,. logr"(a;,/o) ry and R (x,p) — hm sup & p .
and
Barreira and Saussol showed in [4] that for a C1+a diffeomorphism that is topologically mixing on a locally maximal compact hyperbolic set X, and an equilibrium measure p of a Holder continuous function, we have R°(x) = Rs(x) = lunl0g^BS{x>r))
and fl»(x) =T(x)
= lim ^ M
3
" ^
(6)
for /x-almost every a; G X, where /x* and p " are the conditional measures induced by the measurable partitions £ s and £ u defined by the local stable and unstable manifolds. The existence for almost every point of the limits in (6) is due to Ledrappier and Young in [8]. Barreira, Pesin and Schmeling showed in [2] that measures supported on hyperbolic sets possess an almost product structure (the statement is also valid in the much more general case of hyperbolic measures; see [2] for details). Theorem 4.1. For a C1+a diffeomorphism with a compact hyperbolic set X, if p, is a finite measure supported on X then given 5 > 0 there exists a set Y c X with p,(Y) > p{X) — 5 such that for each x € Y we have - p°x(B*(x,r))p%(B"(x,r))
~
for all sufficiently small r > 0. We now formulate a result of Barreira and Saussol in [4] showing that recurrence also possesses an almost product structure, which imitates the product structure provided by the families of stable and unstable manifolds as well as the almost product structure of measures supported on hyperbolic sets in Theorem 4.1. Theorem 4.2. Let X be a locally maximal compact hyperbolic set of a C1+a diffeomorphism that is topologically mixing on X, and p an equilibrium measure of a Holder continuous function. Then, for p-almost every point x £ X there exists p(x) > 0 such that for each p < p(x) and e > 0 there is r(x,p,e) > 0 such that if r < r(x,p,e) then r* < Tr^P)r^,p)
< r_e
(?)
The identity (7) shows that the return time to a given set is approximately equal to the product of the return times to the stable and unstable directions, as if they were independent.
5. Relation to entropy Ornstein and Weiss obtained related results in the special case of symbolic dynamics. Namely, they showed in [9] that if a+: S + —> S + is a one-sided subshift and p+ is an ergodic cr+-invariant probability measure on E + , then for ^ + -almost every {i\i<2 • • • ) € £ + , lim fc—too
togfaf{">Q
= (in+i-
= h
{(j)
(g)
422
Luis BARREIRA
T h e y also showed in [9] t h a t if a: E —> S is a two-sided subshift a n d fi is a n ergodic cr-invariant probability measure on E , then for /U-almost every (• • • i-iioi\ • • •) € E, ,.
loginf{n > 0 : (t n _ f c • • • i„+k) = (i-k • • • *fc)} , , v = /l>)
t^
2kT~i
-
. . (9)
Any two-sided shift a: E —> E has naturally associated two one-sided shifts a+: £ + —> E + and cr~: E ~ —> E ~ (related with future a n d past). Furthermore, any cr-invariant measure [i on E induces a cr + -invariant measure fi+ on E + a n d a a~ -invariant measure / i ~ on E ~ with hll+(a+) = h^-(a~) = h^(a). For each w = (•• -i-iioii •• •) £ E a n d A; > 0 we set r
fc ( w ) =
T^(UJ)
inf
{n > °
= inf{n > 0
(»n+l ' ••*n+fc) = (*1 •• •**;)}, (i-n-k
Tfc(w) = inf{n > 0 (in-k
• • • i - n - l ) = (i-fc • • •
i-l)},
• • • in+k) — (i-k • • • ik)}-
Let /i be an ergodic cr-invariant measure on E . It follows from (8) a n d (9) t h a t for /x-almost every w e E , given e > 0, if k > 0 is sufficiently large then e~ks < T£(W)T^"(W)/Tk(u>) < eke. Theorem 4.2 and t h e identities in (6) are versions of these statements.
References 1. L. Barreira, "Hyperbolicity and recurrence in dynamical systems: a survey of recent results", Resenhas IME-USP 5, 171-230 (2002). 2. L. Barreira, Ya. Pesin, J. Schmeling, "Dimension and product structure of hyperbolic measures", Ann. of Math. (2) 149, 755-783 (1999). 3. L. Barreira, B. Saussol, "Hausdorff dimension of measures via Poincare recurrence", Comm. Math. Phys. 219, 443-463 (2001). 4. L. Barreira, B. Saussol, "Product structure of Poincare recurrence", Ergodic Theory Dynam. Systems 22, 33-61 (2002). 5. L. Barreira, C. Wolf, "Measures of maximal dimension for hyperbolic diffeomorphisms", Comm. Math. Phys., to appear. 6. L. Barreira, C. Wolf, "Pointwise dimension and ergodic decompositions", preprint. 7. M. Boshernitzan, "Quantitative recurrence results", Invent. Math. 113, 617-631 (1993). 8. F. Ledrappier, L.-S. Young, "The metric entropy of diffeomorphisms. Part II: Relations between entropy, exponents and dimension", Ann. of Math. (2) 122, 540-574 (1985). 9. D. Ornstein, B. Weiss, "Entropy and data compression schemes", IEEE Trans. Inform. Theory 39, 78-83 (1993). 10. Ya. Pesin, H. Weiss, "On the dimension of deterministic and random Cantor-like sets, symbolic dynamics, and the Eckmann-Ruelle conjecture", Comm. Math. Phys. 182, 105-153 (1996). 11. H. Poincare, "Sur le probleme des trois corps et les equations de la dynamique", Acta Math. 13, 1-270 (1890). 12. B. Saussol, S. Troubetzkoy, S. Vaienti, "Recurrence, dimensions and Lyapunov exponents", J. Statist. Phys. 106, 623-634 (2002). 13. J. Schmeling, "Dimension theory of smooth dynamical systems", in Ergodic Theory, Analysis and Efficient Simulation of Dynamical Systems, B. Fiedler ed., Springer, 2001. 14. J. Schmeling, S. Troubetzkoy, "Scaling properties of hyperbolic measures", preprint.
Analysis of Lyapunov modes for hard-disk fluids CHRISTINA FORSTER, ROBIN HIRSCHL, HARALD
A. POSCH (U. Vienna)
For hard-particle systems the perturbations associated with specific Lyapunov exponents exhibit spatial structures to which we refer as Lyapunov modes. The perturbations generating the smallest-positive exponents are delocalized and have a wave-like appearance reminiscent of the modes of fluctuating hydrodynamics. The degeneracy of the associated exponents is determined by the orthonormality, the spatial symmetry, and the polarization (transverse or longitudinal) of the perturbations. In contrast to the stationary transverse modes the longitudinal modes propagate through the system with a velocity only weakly correlated to the velocity of sound. The amplitudes of the modes scale with the inverse square root of the particle number. Fourier transform techniques are applied to study the stability of the modes.
1. Lyapunov s p e c t r a The phase-space trajectory of a many-particle system is Lyapunov unstable due to the convex dispersive surfaces of the atoms. The exponential rate of change of a small phase volume is described by a set of Lyapunov exponents, {A;, I = 1,...,D}, the so-called Lyapunov spectrum [1,2]. Conventionally, the exponents are taken to be ordered, A; > A/ + i. There are D = 2dN exponents, where d and D are the dimensions of space and of phase space, respectively, and N denotes the number of particles. For ergodic systems we may define infinitesimal orthonormal initial perturbation vectors {5F/(0); I = l,...,2
A(=limilnJ|^M, t->oo t |ori(0)|
/ = !,.. .,W,
(1)
exist and are independent of the initial state [3-5]. In conservative systems the evolution changes only the shape but not the volume of a phase extension, and, hence, Yli=i ^i = 0For symplectic systems the conjugate pairing rule [6] assures that A/ + A£>+i_; = 0, and only the positive part of the spectrum needs to be calculated. In this communication we concentrate on the Lyapunov spectrum of hard-disk systems in two dimensions. For the computation we use the classical method of Benettin et al. [7] and Shimada et al. [8] with an extension to hard-sphere systems by Dellago et al. [2,9,10]. It requires following the time evolution of the reference trajectory and of an orthonormal set of tangent vectors {5Ti(t); I = I,... ,2dN}, where the latter is periodically reorthonormalized. In the following we use reduced units for which the disk diameter a, the kinetic energy per particle K/N, the particle mass m, and the Boltzmann's constant ks are 1
ley
equal to unity. Lyapunov exponents are given in units of reciprocal time, (ma2N/K) The shape of the Lyapunov spectrum differs qualitatively for hard and soft disks and varies with the density. It depends on the system size in a subtle way. In figure 1 we show the positive branch of a spectrum for 1024 hard disks in a square simulation box with periodic
423
424
CHRISTINA FORSTER, ROBIN HIRSCHL, HARALD A. POSCH
6 I
0 I
0
1
1
1
1
1
1
1
1
r
i
i
i
i
i
i
i
i
i
1
0.1
0.2
0.3
0.4
0.5 U2N
0.6
0.7
0.8
0.9
1
Figure 1. Lyapunov spectrum of a 1024-disk system in a square simulation box with periodic boundaries. The density p = 0.7. A reduced index 1/2N is used on the abscissa. In the inset the steps are magnified with a non-normalized index I on the abscissa.
boundaries. The particle density p = N/L2 = 0.7, where L is the box size. The index I on the abscissa enumerates the exponents. For 1/2N < 0.96 the spectrum looks smooth. We have previously shown that the perturbation belonging to the maximum exponent, Ai, is strongly localized in space: only a small fraction of all particles contributes actively to the growth of the perturbation norm at any instant of time. This localization persists in the thermodynamic limit [11-14]. For larger I this localization is gradually lost [15]. For 1/2N > 0.96 degenerate exponents exist which provide for a step-like appearance of the spectrum in that range. The normalized index lc/2N w 0.96 characterizing this transition and, hence, the fraction of exponents which contribute to this regime does not depend on the density of the fluid and on the box size L. Increasing L at constant p, or p at constant L, increases the total number of exponents and, hence, the number of discernible steps in the range 0.96 < 1/2N < 1. In the following we demonstrate that the perturbations associated with the degenerate exponents are characterized by periodic wave-like patterns in space reminiscent of the modes of fluctuating hydrodynamics. Therefore, we refer to them as Lyapunov modes [11].
2. Lyapunov modes Each disk, located at r = (x,y) and moving with momentum p — (px,Py), contributes the two-dimensional perturbations 5r and Sp to the 4iV-dimensional tangent vector SF belonging to A. The combined position perturbations of all N particles may be viewed as a two-dimensional vector field, Sr(x, y), over the domain of the simulation box. The analogous vector field for the momentum perturbations, Sp(x, y), is strictly parallel to 5r(x,y), if A is positive, and antiparallel, if A is negative. This is a consequence of the linearized
425
Analysis of L y a p u n o v m o d e s for hard-disk fluids
motion equations for the tangent vectors 5F(t). The vector field Sr(x,y) for the position perturbations is the most direct representation of a Lyapunov mode. Three examples are given in the first row of figure 2 for the modes indicated by the arrows in the enlarged 1024-disk spectrum at the top of this figure. If the components Sx(x,y) and 8y(x,y) are plotted separately as in the second row of figure 2, one observes that these scalar fields are linear combinations of sine- and cosine-like plane waves characterized by wave vectors lq of equal norm, |k;| = k. For a rectangular box with sides Lx and Ly and for periodic boundary conditions, the allowed wave numbers follow from k = 2TT
(2)
where the integers nx and ny denote the number of nodes in the x and y directions, respectively. The null modes, for which nx = ny = 0, exist as for any translational invariant system. They are connected with the vanishing Lyapunov exponents. 2017 2033 2045
/ =2045
(=2048 / =2033
mmmm KsPSSPStf
0Mmm£
0 0.2 0.4 0.6 0.8 1.0 k.
0 0.2 0.4 0.6 0.8 1.0 k.
0 0.2 0.4 0.6 0.8 1.0 k
Figure 2. Top: Relevant part of a Lyapunov spectrum for a 1024-disk system in a square simulation box with periodic boundaries. The density p = 0.7. Second row: Snapshot of the vector field Sr(x,y) for the position perturbations of the transverse modes indicated by the arrows in the spectrum above. Third row: Snapshot of the respective scalar fields 8x(x, y) and Sy(x, y) for the same modes. Fourth row: Contour plots of the Fourier transforms of the modes. They are time averages as defined in the main text.
426
CHRISTINA FORSTER, ROBIN HIRSCHL, HARALD A. POSCH
Numerically, the two wave vectors contributing to a mode are most-easily found from the spatial Fourier transforms of the components Sx,Sy, Spx, Spy, which will be denoted by 6x, Sy, Spx, Spy, respectively. In the bottom line of figure 2 contour plots of ({SxSx* +SySy* + SpxSpx + bPybPy))1^1 a r e shown for all three modes. Here, (• • •) denotes a time average. In all cases the involved wave vectors are readily recognized as points on a square lattice in reciprocal space with lattice constant 2-K/L. This kind of analysis is particularly useful for the more complicated modes with wave vectors not aligned with the simulation box and close to the maximum wave number kc at which modes cease to exist. One observes from figure 2 that the amplitudes (<5a;)o and {5y)o for the component fields Sx(x,y) and 5y(x,y) are not identical. They depend on the initial conditions. However, the mode amplitude defined by PQ = [(Sx)2 + (Sy)2 + (Spx)2 + (Spy)2]1^2 does not. This is a consequence of the Cartesian metric of our phase space used in Equation (1). Thus, the Fourier peaks contributed by different components in the contour plots of figure 2 will be different in general, which is obviously the case. If the size of a square system, Lx = Ly — L, is increased keeping the particle density constant, all small Lyapunov exponents associated with modes depend uniquely on k and are described by two "dispersion relations" which, for small k, are: AT = 2.48(3)fc + 0.23(7)fc2 +
3
XL = 3.13(7)fc + 1.2(2)fc + 0(k ).
(3) (4)
Depending on the particular relation the Lyapunov exponents belong to, the associated modes are classified either as transverse (T) or longitudinal (L). These names are chosen according to the polarization of the waves. For example, the mode on the left-hand side of figure 2 has a perturbation component Sx perpendicular to its wave vector parallel to y, and a component Sy with a wave vector parallel to x. It is a transverse mode, and so are the other modes depicted in this figure. For the longitudinal modes the perturbations with the smallest-possible k are parallel to their respective wave vectors. It is found that the transverse modes do not propagate and are stationary once formed. The longitudinal modes propagate with a density-dependent phase velocity as is shown below. The degeneracy of the Lyapunov exponents and, hence, the step-like structure of the spectrum are a consequence of the symmetry of the simulation box and of the orthonormality of the phase-space perturbation vectors. The sine- and cosine-like waves allow for different orthogonal phase-space perturbations with the same k as is demonstrated in figure 3 by the various reciprocal lattice points on a circle with radius k. For the non-propagating transverse modes the relevant wave vectors may be restricted to the half plane of the reciprocal lattice, figure 3 left. The full and open circles refer to sine- and cosine-like waves, which have to be counted separately. From this figure the multiplicity of the transverse exponents can be read off. For example, for a square system the multiplicity is 4 for the three smallest steps in the spectrum and increases to 8 for the fourth step with k = 2V5n/L corresponding to the circle. For the propagating longitudinal modes the full reciprocal lattice plane is needed, figure 3 right, and the degeneracy is always twice that of the transverse modes for a given k. If the symmetry of the box is reduced to a rectangle, the multiplicity is reduced by a factor of two. From the normalization of the tangent vectors it follows that the mode amplitudes are
427
Analysis of L y a p u n o v m o d e s for hard-disk fluids
Figure 3. Reciprocal lattice with point spacing equal to 2ix/L, for sine-like patterns (dots) and cosine-like patterns (open circles). Transverse modes do not require a sense of direction, and only one half the plane is needed (left), whereas for longitudinal modes the full reciprocal lattice is required (right). The circle connects allowed wave vectors with k = 2\/57r/L.
expected to decrease with N according to Pn =
/ 2 \V2 (5)
VjV,
For the transversal modes the mode amplitude is numerically obtained from the Fourier coefficients according to ((6x(k) 5x*(k) + Sy(k) Sy*(k) + 5px{k) J £ ( k ) + Spy(k) Sp*y(k)))
1/2
where k and k are the wave vectors of Sx(x, y) and 8y(x, y), respectively, and where a time average is involved. For the propagating longitudinal modes the situation is more subtle: the Fourier coefficients exhibit a strange periodic modulation which is synchronized with a similar modulation of the phase velocity as shown below. In this case an average over consecutive maxima is performed. With this restriction, we compare in figure 4 the mode amplitudes of quasi-onedimensional systems of hard disks (rectangular box of width Ly = 1.2 and particle density p = 0.7; the particle number N and, hence, Lx are varied) with the theoretical prediction in Equation (5). All modes belonging to the three lowest transverse and to the three lowest longitudinal steps in the Lyapunov spectrum are considered. The good agreement is an indication of the remarkable stability of the modes.
0.04
am
0.05
0.06
0.07
0.08
1/2
Figure 4. 1/VN dependence of the mode amplitude Pg for the first 3 transverse and first longitudinal modes. The smooth line is the prediction of Equation (5).
428
CHRISTINA FORSTER, ROBIN HIRSCHL, HARALD A. POSCH
3. Phase propagation of modes It was already noted that longitudinal Lyapunov modes move through the system while transverse modes remain stationary. As in the case for the amplitudes also the propagation of the modes is conveniently investigated by Fourier transformation, where the instantaneous position of a particular mode corresponds to the phase of the transform in the complex plane. For simplicity we consider in the following a quasi-onedimensional system with 200 disks in a very elongated periodic box, Lx = 116.7, Ly = 1.2, for which modes only parallel to the x axis develop. The relevant part of the Lyapunov spectrum is shown in the top right-hand part of figure 6. For the transverse modes only 5y{x, y) needs to be considered, for the longitudinal modes only Sx(x, y). On the left-hand side of figure 5 we follow the time evolution of 5yi in the complex plane for the two transversal modes I = 396 and 397 belonging to
10 5
3
° -5 -10 -10 -5
0 5 RE
10
-10 -5
0 5 RE
10
Figure 5. Time evolution of the Fourier transforms 8xi and Syi for various modes I of a quasi-onedimensional system containing 200 disks. The density p = 0.7, and the box width Lv = 1.2. Left: Transverse modes ( = 397 and 396 belonging to the smallest positive exponents. Right: Longitudinal mode I = 395. The relevant part of the Lyapunov spectrum is shown in the top-right part of figure 6. 2000 1500 1000 500 0
350
360 370
380 390
-500
16
1=395 —— P ,^ /
-1000
i \
/•*
_—i-—"* •
\400
-1500 -2000
-2500 Q
400
800
i
30Q
60Q
t
3Q0
35Q
40Q
45Q
5()0
12
4 0
t
Figure 6. Left: propagation of the longitudinal modes I = 395,396,307,398 of a quasi-onedimensional harddisk fluid with 200 particles in a rectangular periodic box (Lx = 238.1, Ly = 1.2, p = 0.7). T h e periodic boundaries are unfolded for the spatial coordinate x'. Top right: relevant part of the Lyapunov spectrum. Bottom right: amplitude and phase modulation for mode / = 395.
Analysis of Lyapunov modes for hard-disk fluids
429
the smallest positive step in the spectrum. As expected, the two modes are stationary and orthogonal to each other. However, on the right-hand side an analogous plot of the time evolution of 5xi for the longitudinal mode I = 395 reveals elliptic-like curves indicating a rather complicated mode propagation. In figure 6 the time dependence of a spatial coordinate x' for four degenerate longitudinal modes constituting the first longitudinal spectral step is shown, where the periodic boundaries have been unfolded. The step-like structure of this motion is a consequence of the tight interdependence of the modes due to their orthonormality constraints. The hight of a step corresponds to the wave length, identical to the box length Lx in this case. In addition, the mode amplitudes are synchronously modulated as mentioned before. This is demonstrated in the lower-right-hand part of figure 6. The phase velocity of the Lyapunov modes depends on the density of the fluid, but there is no obvious connection with the velocity of sound [10]. In the Table we compare the averaged mode velocities for the quasi-onedimensional 200-particle system of figure 6 with the sound velocity of a two-dimensional hard-disk gas of the same density. For the latter we use the Pade approximants P(3,4) to the dynamical pressure due to Ree and Hoover [16,17]. The comparison in the table reveals that the velocity of the Lyapunov modes is alway significantly lower than the sound velocity, and that this ratio varies considerably with density. We reluctantly conclude that there is a considerable difference between the properties of the longitudinal Lyapunov modes and that of ordinary phonon modes, in spite of the fact that the perturbation theories for solids look very similar in both cases [18]. Table 1. Density dependence of the mode velocity c for the longitudinal modes of the system discussed in figure 6. cs is the velocity of sound for two-dimensional hard disks with the same density p. See the main text for details.
p 0.2 0.4 0.6 0.7 0.8
cs 2.17 3.62 6.92 10.3 16.2
c 1.0 1.57 2.37 3.08 4.22
c/cs 0.46 0.41 0.34 0.30 0.26
4. Conclusion and theory We have given a phenomenological description of the Lyapunov modes in tangent space associated with the small positive Lyapunov exponents A of hard-disk systems, which turn out to be degenerate. The modes also exist for hard-sphere systems in three dimensions resembling real fluids [10,11,13]. The involved exponents constitute a measurable subset of the total Lyapunov spectrum in the thermodynamic limit. However, we could not find Lyapunov modes for particle systems interacting with a soft potential [13-15]. Two classes of modes are encountered, transverse modes which are stationary, and longitudinal modes which propagate but differ from the ordinary sound or phonon modes. From the dispersion relations Ar,L(&)> different for both classes, it is possible to construct the regime of small exponents for the Lyapunov spectrum of arbitrary-large systems [15].
430
CHRISTINA FORSTER, ROBIN HIRSCHL, HARALD A. POSCH
Unlike the numerical situation, the current status of the theory is still not fully satisfactory. Soon after our discovery of the Lyapunov modes, Eckmann and G a t [19] were the first to provide theoretical arguments for the existence of Lyapunov modes in transversal-invariant systems. Their model did not have a dynamics in phase space b u t only an evolution matrix in tangent space given as a product of independent random matrices. Most recently, this theory has been improved and made more realistic by Eckmann and Zabey [20]. In another approach, M c N a m a r a and Mareschal [21] isolate t h e six hydrodynamic fields related t o the invariants of the binary particle collisions and t h e vanishing exponents, and use a generalized Enskog theory t o derive hydrodynamic evolution equations for these fields. Their solutions are the Lyapunov modes. In a more detailed extension of this work restricted to small densities, a generalized Boltzmann equation is used for the derivation of the hydrodynamic evolution equations [22]. A related approach is taken by de Wijn and van Beijeren [23], who use the analogy t o the Goldstone mechanism t o construct the modes within the framework of kinetic theory. For low densities good agreement with the simulation results is achieved. Finally, Taniguchi, D e t t m a n n , and Morriss approached the problem from the point of view of periodic orbit theory [24] and master equations [25]. To our knowledge no theory exists for high-density fluids and solids. T h e obvious disappearance of the Lyapunov modes and of the accompanying step structure in the spectrum for soft particles (interacting, for example, with a Lennard-Jones potential) also needs clarification.
Acknowledgments We t h a n k Christoph Dellago, Bill Hoover, and Ljubo Milanovic for useful discussions. This work was supported by the Austrian Fonds zur Forderung der wissenschaftlichen Forschung, grant P15348-PHY, and by the Computer Center of the University of Vienna, for providing access t o the computer cluster "Schrodinger".
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10.
H. A. Posch, W. G. Hoover, Physical Review A 38, 473 (1988). Ch. Dellago, H. A. Posch, Physical Review E 52, 2401 (1995). V. I. Oseledec, Trans. Mosc. Math. Soc. 19, 197 (1968). J.-P. Eckmann, D. Ruelle, Reviews of Modern Physics 57, 617 (1985). H. A. Posch, W. G. Hoover, Physical Review A 38, 473 (1988). C. P. Dettmann, G. P. Morriss, Phys. Rev. E 53, 1 (1996). G. Benettin, L. Galgani, A. Giorgilli, J. M. Strelcyn, Meccanica 15, 21 (1980). I. Shimada, T. Nagasihma, Prog. Theor. Phys. 6 1 , 1605 (1979). Ch. Dellago, H. A. Posch, Pysical Review E 53, 1485 (1996). R. Hirschl, Computer simulation of hard-disk and hard-sphere systems: Lyapunov modes and stochastic color conductivity. Master's thesis, University of Vienna, 1999. See h t t p : / / c m s . m p i . univie. ac. at/robin/. 11. H. A. Posch, R. Hirschl, "Simulation of Billiards and of Hard-Body Fluids", in Hard Ball Systems and the Lorenz Gas, edited by D. Szasz, Encyclopedia of the mathematical sciences 101, Springer Verlag, Berlin (2000), p. 269. 12. Ch. Forster, Lyapunov Instability of Two-Dimensional Fluids, Master's thesis, University of Vienna (2002). 13. H. A. Posch, Ch. Forster, "Lyapunov instability and collective tangent space dynamics in
Analysis of Lyapunov modes for hard-disk fluids
14. 15. 16. 17. 18. 19. 20.
21. 22.
23.
24. 25.
431
fluids", in P. M. A. Sloot, C. J. K. Tan, J. J. Dongarra, A. G. Hoekstra, eds., Lecture Notes on Computer Science, Computational Science — ICCS 2002, Springer, Berlin, 2002, p. 2331. W. G. Hoover, H. A. Posch, Ch. Forster, Ch. Dellago, M. Zou, Journal of Statistical Physics 109, 765 (2002). Ch. Forster, R. Hirschl, H. A. Posch, W. G. Hoover, Physica D (2003). arXiv:nlin.CD/ 0211047. Wm. G. Hoover, B. J. Alder, Journal of Chemical Physics 46, 686 (1967). F. H. Ree, Wm. G. Hoover, Journal of Chemical Physics 46, 4181 (1967). H. A. Posch, Ch. Forster, Proceedings of the Heraeus Summer School Collective Dynamics of Nonlinear and Disordered Systems, in preparation, 2003. J.-P. Eckmann, O. Gat, J. Stat. Phys. 98, 775 (2000). J.-P. Eckmann, E. Zabey, private communication, and International Workshop and Seminar on Microscopic Chaos and Transport in Many-Particle Systems, Max Planck-Institut fur Physik komplexer Systeme, Dresden, 2002, unpublished. S. McNamara, M. Mareschal, Phys. Rev. E 64, 051103 (2001). M. Mareschal, S. McNamara, International Workshop and Seminar on Microscopic Chaos and Transport in Many-Particle Systems, Max Planck-Institut fur Physik komplexer Systeme, Dresden, 2002. A. de Wijn, H. van Beijeren, private communication, and International Workshop and Seminar on Microscopic Chaos and Transport in Many-Particle Systems, Max Planck-Institut fur Physik komplexer Systeme, Dresden, 2002. T. Taniguchi, C. P. Dettmann, G. P. Morriss, J. Stat. Phys. 109, 747 (2002). T. Taniguchi, G. P. Morriss, Phys. Rev. E 65, 056202 (2002).
The fractality of the hydrodynamic modes of diffusion P I E R R E GASPARD
(U. Libre de Bruxelles)
Transport by normal diffusion can be decomposed into hydrodynamic modes which relax exponentially toward the equilibrium state. In chaotic systems with two degrees of freedom, the fine scale structures of these modes are singular and fractal, characterized by a Hausdorff dimension given in terms of Ruelle's topological pressure. For long-wavelength modes, the Hausdorff dimension is related to the diffusion coefficient and the Lyapunov exponent. In the infinite-wavelength limit, the hydrodynamic modes lead to the nonequilibrium steady states, which also present a singular character. This singular character is a consequence of the mixing property of the dynamics. These results are illustrated with several systems such as the hard-disk and Yukawa-potential Lorentz gases.
1. Introduction A fundamental problem of statistical mechanics is to derive the macroscopic equations of a fluid such as the Navier-Stokes equations, the heat equation or the diffusion equation from the underlying Newtonian dynamics of the particles composing the fluid. On the way from the microscopic dynamics to the macroscopic phenomena, a central role is played by the so-called hydrodynamic modes. These modes describe the collective motions of fluids with periodic inhomogeneities in space relaxing as exponential (or oscillatory exponential) functions in time toward the state of thermodynamic equilibrium. The importance of the hydrodynamic modes holds in the fact that they are associated with the locally conserved quantities: the energy, the momentum, and the masses or numbers of the conserved particles. For the phenomenological diffusion equation dtn = P V 2 n ,
(1)
where n is the concentration or density of tracer particles in the fluid and V is the diffusion coefficient, the hydrodynamic modes of diffusion are the special solutions n(r, t) ~ e i k r e s t ,
with s = -Vk2,
(2)
where k is the wavenumber of the mode and the variable s = Re s + i Im s gives the damping rate — Re s and the frequency Im s of the mode. We notice that the rate variable s is given by an eigenvalue problem for the operator in the right-hand side of the evolution equation, here equation (1). The relation between the rate variable s and the wavenumber k is the socalled dispersion relation, which characterizes the physical processes of wave propagation and relaxation. The knowledge of the dispersion relations provides the values of the transport coefficients such as the diffusion coefficient. This is the way the transport coefficients are derived from kinetic equations such as the Boltzmann equation since the pioneering work by Hilbert [1], Chapman [2], and Enskog [3] (see references [4-6]).
432
The fractality of the hydrodynamic modes of diffusion
433
An example of such treatment is for the random Lorentz gas of independent particles in elastic collisions on hard disks of radius a. These scatterers are randomly located according to a Poisson distribution on the plane with the uniform density na- In the dilute gas limit, n&a2
ft/ + v - V / =
+7T
UddV
dip'
/
sin
[f(r,
(3)
-7T
This kinetic equation admits special solutions of the form: /(r,^)~eik-rest,
(4)
with wavenumber k. The dispersion relations of these modes are depicted in figure 1. At vanishing wavenumber, the rates are given by k = 0:
s
8j = ——-ndav
(j = 0 , 1 , 2 , . . . ) .
(5)
The branch such as s = 0 for k = 0 corresponds to the diffusive mode for which s —23k2 + 0 ( k 4 ) with the diffusion coefficient V =
3v 16arid
(6)
The other branches with larger damping rates correspond to the kinetic modes describing faster transients. Or—
-1
diffusive mode
-2 kinetic modes
Figure 1. Spectrum of the Boltzmann-Lorentz kinetic equation (3) for the random hard-disk Lorentz gas: rate variable s versus wavenumber k = ||k||. The disks are randomly distributed in the plane according to a Poisson distribution.
434
P I E R R E GASPARD
2. The hydrodynamic modes of diffusion in periodic Lorentz gases The Boltzmann-Lorentz kinetic equation is only valid in the dilute-gas limit and for a statistical ensemble of configurations of the scatterers. A fundamental problem is to construct the hydrodynamic modes directly from the underlying microscopic dynamics in order to go beyond the restrictions on the use of typical kinetic equations. Recently, it has been possible to carry out this construction for two-dimensional periodic Lorentz gases which conserve energy and preserve phase-space volumes according to Liouville's theorem. Two examples have been treated in detail: (1) The hard-disk Lorentz gas on a triangular lattice, which has a finite and positive diffusion coefficient and is fully chaotic in the finite-horizon regime, as proved by Bunimovich and Sinai [8]. (2) The Yukawa-potential Lorentz gas on a square lattice, which is ruled by the Hamiltonian: H_pl+Pl
H
V
e—'-'ll
--^r~^l^W
(7)
lez2 " " This system is fully chaotic and diffusive if the energy is larger than some value, as proved by Knauf [9]. The construction of the diffusive modes can be formulated by reducing the flow X t = 3>'(X0) with X = (r, tp) to the Birkhoff-Poincaxe map: x„+i = <£(x„), ' *n+l = tn + T ( X n ) ,
(8)
W l = In + a ( x „ ) ,
where x n are for instance the Birkhoff coordinates of the tracer particle at collision with a hard disk. T(x) is the first-return time function and l n € C is a vector of the lattice C, pointing to the lattice cell where the particle is located. The function a(x) is the lattice vector of the jump of the tracer particle from cell to cell or from collision to collision. Similarly, the Probenius-Perron operator of the flow, /t(X) = / o ( # - ' X ) ,
(9)
can be reduced to a Probenius-Perron operator for the map (8). After a spatial Fourier transform introducing the wavenumber k and a temporal Laplace transform introducing the rate variable s, the Probenius-Perron operator of the map is given by (£k, a «)(x) = e - s T ^ " l x ) - i k ' a ( * " l x ) u ( 0 - 1 x ) .
(10)
Formally, the diffusive mode V"k(x) is an eigenstate of the operator (10) corresponding to the unit eigenvalue: -Rk.skV'k = V'k • (11) This eigenvalue condition constrains the rate variable s to become a function s^ of the wavenumber k [10]. This rate is a Pollicott-Ruelle resonance which gives the dispersion relation of diffusion: sk = - P k 2 + 0 ( k 4 ) , (12)
The fractality of the hydrodynamic modes of diffusion
435
where the diffusion coefficient V is given by the Green-Kubo formula [11,12]. Higher-order Burnett and super-Burnett coefficients can also be obtained [11-13]. The method based on the Frobenius-Perron operator yields the dispersion relation of diffusion in general. A remarkable result is that the eigenstate *k(X) is not a function but a singular distribution with a cumulative function presenting fractal properties [14,15]. The cumulative function of the diffusive mode ^ k can be constructed by first noticing that the dispersion relation of diffusion is equivalently given by the Van Hove formula as su
lim - ln(e,ik-(r t -ro >>,
(13)
where r t is the position of the tracer particle issued from the initial position ro and (•} denotes the average over an ensemble of initial conditions. The choice of this ensemble is arbitrary as long as the mixing condition is satisfied. In this framework, the cumulative function is given by
/ ( fe ik -( r '- r °Wfl /
F k (0) = lim /
*
«
*
•
(
*
-
r
(14)
°Vd0'
where the integral is performed over an initial position and velocity having an angle 9 with the horizon axis around the center of a scatterer. The cumulative function forms a curve in the complex plane (Rei
Im F
o~
-0.4 0.6
ReF Figure 2. Cumulative function (14) of the hydrodynamic modes of diffusion in the periodic hard-disk Lorentz gas versus the wavenumber k = (fc, 0) of magnitude varying in the interval 0 < fc < 0.9. The disks form a triangular lattice, their centers are separated by the distance d = 2.3, and their radius is unity.
436
PIERRE GASPARD
In reference [14], it was shown that, in fully chaotic systems, the Hausdorff dimension DH of these curves in the complex plane is given by the root of the equation: P{DH)=DHResk,
(15)
where su is the dispersion relation of diffusion (12), while P{fi)=
limiln(|At|1-'3)
(16)
t—»oo t
is the Ruelle topological pressure function, i.e., the generating function of the stretching factors | At | > 1 associated with each trajectory of the system. We notice that the positive Lyapunov exponent is given by
while P ( l ) = 0. Equation (15) generalizes a formula obtained by Bowen for the Hausdorff dimension of fractal invariant sets such as the Julia sets of complex analytic maps [16]. Using equations (15), (12), and (17), the Hausdorff dimension is given at low wavenumbers by D H (k) = l + | j ; k 2 + 0 ( k 4 ) , (18) in terms of the diffusion coefficient V and the positive Lyapunov exponent A + . Reciprocally, the diffusion coefficient can be obtained from the Hausdorff dimension and the positive Lyapunov exponent as
= A+ lim D H ( k ] ~ 1 .
V
k2
k—0
(19)
3. T h e nonequilibrium steady states of diffusion in periodic Lorentz gases The nonequilibrium steady states corresponding to a gradient g of particle density between two reservoirs or chemiostats separated by an arbitrarily large distance can be derived from the diffusive modes vl/k according to vr,
•
d
**
(20) k=0
It has been shown [12] that, for Lorentz gases, this nonequilibrium steady state is given by P — OO
tfg(X)=g.
r(X) + / Jo
v(#*X) dt
(21)
Here again, this density is not a function but defines a singular distribution which can be represented by its cumulative function
Ts(6)= f dffVtiXe,),
(22)
Jo as depicted in figure 3. It has been shown in reference [11] that the average current of particles in the nonequilibrium steady state (21) obeys Fick's law. The reason is that
The fractality of the hydrodynamic modes of diffusion
437
the Green-Kubo formula is recovered after averaging the microscopic current given by the particle velocity v(X) over the steady state (21). Indeed, the second term in equation (21) is a part of the Green-Kubo formula while the first term has a vanishing contribution [11,12]. An important remark is that the density of a nonequilibrium steady state of diffusion remains a function as far as the two reservoirs, between which the gradient of tracer particles is established, are at finite distance from each other. This density is highly complicated and transforms itself from a function to a singular distribution in the limit of an arbitrarily large separation between the reservoirs while keeping the gradient constant. 0.3
0.2
0.1
•0.1
•0.2
""6.3
0.2
0.1
0
-0.1
-0.2
-0.3
I Figure 3. Cumulative functions (22) of the nonequilibrium steady states of diffusion in the periodic harddisk Lorentz gas in the plane of the quantities Tx(6) and Ty(6) — Ty(n/2) (where 9 is the angle of the initial position on a disk). The third dimension of the plot is the interdisk gap w. The scatterers are disks of unit radius forming a triangular lattice of finite horizon 0 < w < (4/\/3) — 2.
Finally, using the previous results and, especially, the singular steady state (21), it has been shown in reference [17] that, starting from a coarse-grained entropy, the leading term of the entropy production behaves as expected by the irreversible thermodynamics of diffusive processes. This result is an ab initio derivation of the entropy production from the underlying microscopic dynamics, which has been possible to carry out thanks to the explicit construction of the hydrodynamic modes (14) and the nonequilibrium steady state (21) in consistency with the Green-Kubo formula and Fick's law.
4. Conclusions In this paper, we have shown how the hydrodynamic modes of diffusion as well as the associated nonequilibrium steady states can be explicitly constructed from the underlying
438
P I E R R E GASPARD
microscopic dynamics in periodic Lorentz gases. This construction extends previous work on multibaker maps [12,18-20]. T h e construction is based on a formulation in terms of t h e Probenius-Perron operator, their eigenstates, and their corresponding Pollicott-Ruelle resonances. In the periodic Lorentz gases, the leading Pollicott-Ruelle resonance gives t h e dispersion relation of diffusion in consistency with the Green-Kubo and Van Hove formulae. T h e cumulative functions of the hydrodynamic modes form fractal curves in t h e complex plane. T h e Hausdorff dimension of these curves is given in terms of t h e Ruelle topological pressure, establishing the chaos-transport relationship (19) in the present context. T h e derivation of the basic phenomenology of the irreversible processes of diffusion t u r n s out t o be possible for the deterministic dynamics of t h e periodic Lorentz gases at the price t h a t the phase-space densities of t h e hydrodynamic modes (11) and of the nonequilibrium steady states (21) are given by singular distributions instead of functions.
Acknowledgments T h e author t h a n k s Professor G. Nicolis for support and encouragement in this research. He is financially supported by the F N R S Belgium.
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20.
D. Hilbert, Gotting. Nachr. 355 (1910); Math. Ann. 72, 562 (1912). S. Chapman, Trans. Roy. Soc. (London) A 216, 279 (1916); 217, 115 (1917). D. Enskog, Svensk. Vet. Akad. Arkiv. Mat, Ast. Fys. 16, 1 (1921). S. Chapman, T. G. Cowling, The Mathematical Theory of Non-Uniform Gases, Cambridge University Press, Cambridge UK, 1960. R. Balescu, Equilibrium and Nonequilibrium Statistical Mechanics, Wiley, New York, 1975. P. Resibois, M. De Leener, Classical Kinetic Theory of Fluids, Wiley, New York, 1977. C. Boldrighini, L. A. Bunimovich, Ya. G. Sinai, J. Stat. Phys. 32, 477 (1983). L. A. Bunimovich, Ya. G. Sinai, Comm. Math. Phys. 78, 247 (1980); 78, 479 (1981). A. Knauf, Comm. Math. Phys. 110, 89 (1987); Ann. Phys. (N. Y.) 191, 205 (1989). S. Tasaki, P. Gaspard, in preparation. P. Gaspard, Phys. Rev. E 53, 4379 (1996). P. Gaspard, Chaos, Scattering and Statistical Mechanics, Cambridge University Press, Cambridge UK, 1998. N. I. Chernov, C. P. Dettmann, Physica A 279, 37 (2000). P. Gaspard, I. Claus, T. Gilbert, J. R. Dorfman, Phys. Rev. Lett. 86, 1506 (2001). T. Gilbert, J. R. Dorfman, P. Gaspard, Nonlinearity 14, 339 (2001). R. Bowen, Publ. Math. IHES 50, 11 (1976). J. R. Dorfman, P. Gaspard, T. Gilbert, Phys. Rev. E 66, 026110 (2002). P. Gaspard, Chaos 3, 427 (1993). S. Tasaki, P. Gaspard, J. Stat. Phys. 8 1 , 935 (1995). S. Tasaki, P. Gaspard, 'Bussei Kenkyu' Research Report in Condensed-Matter Theory 66, 23 (1996).
On the statistics of free-path lengths for the periodic Lorentz gas FRANQOIS GOLSE (U. Paris 7) Consider the motion of a gas of point particles in a periodic array of spherical obstacles. Collisions involving two or more particles are neglected; only the collisions between the particles and the obstacles are taken into account. This talk reviews some results bearing on the distribution of free-path lengths for these particles, more precisely (1) upper and lower bounds for that distribution in any space dimension, and (2) the asymptotic evaluation of the tail of that distribution in the small obstacle limit, in space dimension two. Applications to kinetic theory are discussed.
1. Introduction Almost 100 years ago, Lorentz [13] proposed the following linear kinetic equation to describe the motion of electrons in a metal: (dt + v • Vx + ±F(t, x) • V„)/(t, x, v) = Natr2Jv\C(f(t,
x, •))(«)
(1)
where f(t, x, v) is the (phase space) density of electrons which, at time t, are located at x and have velocity v. In equation (1), F is the electric force field, m the mass of the electron, while Ngx and r a t designate respectively the number of metallic atoms per unit volume and the radius of each such atom. Finally C(f) is the collision integral: it acts on the velocity variable only, and is given, for all continuous (f> = <j>{v) by the formula C(
((j>(v-2(vw)u)-^(v))cos(v,u)du}.
(2)
l,voj>0
In the case where F = 0, Gallavotti [9,10] proved that equation (1) describes the BoltzmannGrad limit of a gas of point particles undergoing elastic collisions on a random (Poisson) configuration of spherical obstacles. His result was successively strengthened by Spohn [15], and by Boldrighini-Bunimovich-Sinai [3]. The case of a periodic configuration of obstacles, perhaps closer to Lorentz' original ideas, leads to completely different results. It is the purpose of this talk to discuss some of these differences.
2. The periodic Lorentz gas Let D e N, D > 2. For all r e (0, \), let Zr = {x £ RD \ dist(x,Z £l ) > r} (the "billiard table"). The "free path length" or "(forward) exit time" for a particle starting from x £ Zr in the direction v e S D _ 1 is denned as rr(x, v) = inf{t > 0 | x + tv e dZr}. The function r r is then extended by continuity to {(x,v) € dZr x S 0 - 1 1 v • nx ^ 0}, where nx is the inward unit normal field on dZr. Finally, rr(x + k,v) = rr(x,v) for each (x,v) € Zr x S D _ 1 and
439
440
FRANgOIS GOLSB
k G iP: hence Tr can be seen as a [0, +oo]-valued function defined on Yr x SD l (and a.e. on Yr x 3D-1), where Yr = Zr/ZD. If the components of v G S ^ - 1 are rationally independent — i.e., if k • v ^ 0 for each k G iP \ {0} — each orbit of the linear flow x — i > x + tv is dense on TD = RD/ZD,
rr(x,v)
and thus
< +oo for each x G Zr.
Figure 1. Zr and the punctured torus Yr-
There are two different, natural phase spaces on which to study the free path length r r . The first one is T+ = {(x, v) G dZr x SD~X \v-nx > 0} — or its quotient under the action of Z D -translations on space variables T+ = Tr/ZD — equipped with its Borel cr-algebra and the probability measure vr proportional to -yr, where djr(x, v) = (v • nx) dS(x)dv, with dS being the surface element on dZr. The second one is Yr x S 1 5 - 1 , equipped with its Borel a-algebra and the probability measure fir proportional to the Lebesgue measure o n F r x S^ 1-1 . On the first phase space fr, defining a notion of "mean free path" for the "Lorentz gas" — i.e., a gas of point particles undergoing elastic collisions with the periodic configuration of obstacles defined as the complement of Zr — and evaluating the corresponding quantity is an easy matter. It is found t h a t a b mean free path =
Wr(rr)
\Yr\ \§D~l\ 7r(f P )
(3)
where Md is the ^-dimensional unit ball. The explicit computation of Wr(rr) (i.e., the second equality above) is credited to Santalo (see reference [14], p. 42). Observe that, in the limit as r —> 0 + and in the case of space dimension D = 3, this evaluation of the mean free path coincides with the reciprocal of the factor
N*rlt a
J
cos(u, u) du ui>0
If P is a probability measure and X a random variable on Q, we denote by EP(A") the expectation — i.e., the mean — of X with respect to P. b If A is a measurable d-dimensional set in RD (d < D), \A\ designates its d-dimensional volume.
On the statistics of free-path lengths for the periodic Lorentz gas
441
appearing in equation (1). On the second phase space Yr x S D _ 1 — which is slightly more natural, at least for the kinetic equation (1) — the analogous definition of the mean free path fails because E M r (r r ) = +oo — see below. In fact, as noticed in Dumas-Dumas-Golse [8]. Lemma 2.1. Let f G C 1 (M + ) satisfy /(O) = 0. Then 7r(fr)
E""(/(T r )) = \Yr\ IS 15 - 1 ! E ^ ( / ' ( r r ) ) .
In the case where f(z) — z, this identity gives back Santalo's formula (3). In the case where f(z) = \z2, it shows that E' i r (T r ) = 2|y r fis^ > - 1 |^ r ( r r) • As one can imagine, rr(x,v) is a wildly oscillating function. For one thing, it depends upon arithmetic characteristics of v — such as which Diophantine class v belongs to — that are known to be very unstable as v runs through S0^1. Hence it is not very surprising that r r has infinite moments of order higher than one.
3. Bounds on the distribution of free path lengths Since E Mr (T r ) = +co, the next simple thing to compute is the distribution of r r under [ir. With applications to kinetic theory in mind, it is in fact more natural to consider the following, slightly more general object: $™(£) = m(v)dfir(x,v)-meas({(x,v)
€ Yr x S0'1
\rr(x,v)
> t})
where m € C ( § D _ 1 ) , m > 0 and E /Jr '(m) = 1. Theorem 3.1 below shows that, although r r is not an element of Ll{Yr x S £ > _ 1 ,^ r ), it does not miss by much: in particular r r G L1'00^ x S D _ 1 , / i r ) (Marcinkiewicz' weak L1 space [16]). Theorem 3.1. For each m e C(SD~1) such that m > 0 and E M r (m) = 1, there exist two positive constants Cm and C'm such that, for each r G (0, ^) and each t > l/rD~l, (~i
(II
m ^m < $m(t) < D D r [) r ~H ~ ~ r ~H '
A weaker variant of the upper bound was proved by Dumas-Dumas-Golse [7] for space dimension 2 (using an improvement by Dumas of his ergodization rate estimates in reference [6]). These investigations suggested that l / r D _ 1 was the right length scale for this problem. The upper bound for any D > 2 is proved in Bourgain-Golse-Wennberg [4] by a method based on Fourier series that is vaguely reminiscent of Siegel's proof of Minkowski's convex body theorem. In the case of space dimension D = 2, a proof of the lower bound is also to be found in reference [4]. It is based on an entirely different argument, more precisely on the construction of obstacle-free channels of rational direction and on a careful estimate of the width thereof. Later, an argument of this type was extended to arbitrary space dimension by Golse-Wennberg [12].
442
FRANQOIS GOLSE
Figure 2. Left: Log-log-plot of **(t) vs. t, Right: Plot of tr$l(t)
vs. t, for r = 0.01, 0.03, and 0.001.
The numerical computations above (taken from Golse-Wennberg [12]) suggest that $*(£/r) ~ C/t as t —> +oo and r —» 0 + , with 0.1 < C < 0.11 (inasmuch as the numerical evaluation of tr^(t) for r —> 0 + and t 3> 1/r can be trusted).
4. Asymptotic evaluation of t h e distribution of free p a t h lengths for D = 2 In the case of space dimension D = 2, one can consider sections of the linear flow on T 2 , which leads to studying iterates of a rotation on the unit circle. This suggests that the continued fraction expansion of the slope of the linear flow considered is the appropriate tool for evaluating r r . Theorem 4 . 1 . Let m € L°°(SD~1)
satisfy m > 0 a.e. and WT(m)
e"o+ |lne| Je
r
\r)
£ ^- + |lne|y £
r
W r
r
— 1. Then, as t -> +oo
irH+U{t2J ' TT^^^^V
This result was proved by Caglioti-Golse [5]. The proof uses essentially two different ideas, which are sketched below. 4.1. A partition of T 2 In 1989, R. Thom posed the following problem: "To find the longest orbit of a linear flow with irrational slope on a flat torus with a disk removed". This problem was essentially solved by Blank-Krikorian [1], by the following construction. For R G (0,1), let Y\R] be the flat torus with a vertical slit of length R removed: Y[R] = T 2 \ ({0} x [0,R] mod. 1). Let v = (cos6, sin6) with 6 G (0, f ) such that tan0 $ Q. Call [oi, 02,03, • • •] with an £ N, the continued fraction expansion of a = tan#, meaning that
On the statistics of free-path lengths for the periodic Lorentz gas
a =
[ai,a2,a3,...] ai +
a2 + Call pn/qn
443
1 a3 +
its sequence of convergents (the integers pn and qn being co-prime), i.e., Pn+l 1n+l
1 1
ai +
1 a2-\
and let dn be the sequence of errors defined as dn — \qnct — pn\. Consider t h e n the following nested partition of (0,1): (0,1) = ( J
(J
In,k , with 7„ife = [ s u p ^ , ^ - ! - kdn),dn-i
- (k -
l)dn).
n>ll
In reference [1], Blank and Krikorian proved t h e following P r o p o s i t i o n 4 . 1 . Assume that R £ In,k- Any orbit of the linear flow with slope t a n a on Y[R] has length either qn, or qn-\ + kqn, or else qn-i + (k + l)qn. Following Blank and Krikorian, the shortest orbits are said to be "of type A", the longest ones "of type C", and the remaining orbits "of type B " . In reference [5], t h e proposition above was used to construct a partition of Y[R] into three strips, each strip being the union of all orbits of type A, B, or C respectively. Call tpR(t,v) the distribution of free p a t h lengths in Y[R] for particles moving in the direction v from a uniformly distributed starting point x. By using the partition of Y[R] mentioned above, especially the width of each one of the three strips in t h a t partition which can be easily expressed in terms of the sequence of errors dn (see reference [5], p. 206), one arrives at an explicit formula for ipn(t,v), whose graph is represented in figure 3 below. Vr(t,v) If Ris in I ^
slope r-[dn_i-(k-l)dn]cose
%
%-1+kQn
(
]n-l+flc+1)cln
Figure 3. Graph of 11-> ipji(t,v) for R e IUtk-
tcos£
444
FRANQOIS GOLSE
For the problem that we consider, the only significant part in the graph below is the middle one — i.e., the contribution of orbits of type B only. More precisely Lemma 4.1. Let r G (0, ^), 9 G (0, f ) be such that tan0 ^ Q, and v = (cos9,sin9). Assume that R = 2r/cos6 G In
•, | U . ( \> - • d£n -'•*-' r ) R
4 < -l( t _ 2 l + 0 0 )(fc),
forallt>qnR.
This is Lemma 4.2 in reference [5], to which we refer the reader interested in a complete proof. 4.2. An ergodic lemma Given a G (0,1) \ Q and e > 0, defined N(a, e) = inf{n G N | dn(a) < e}. Define Aj(a,x) -x - \ndN{aye-x)_j(a) for j = 0,1,... .
=
Lemma 4.2. Let f be a bounded nonnegative measurable function on R 2 . For each i ' £ l and a.e. a £ (0,1), one has 1
I^[£
ne /
| l n e | / ( A o„ ( aA , x. )^, AA1.(,a^> w x )„) d r,12 ^l|^
f1md9 ^ 1+
as e —> 0 + , where ,|ln(e)|
F(0)=/
/(|ln(fl)|-i,,-y)dj/.
This result was proved by Caglioti-Golse [5], using that the Gauss map T : ( 0 , 1 ) 9 I H 1/a; — [l/x] G (0,1) is ergodic with invariant measure ^^fx- ^ e e re f erence [5], PP- 209-210 for a complete proof of this result. The key argument in the proof of Theorem 4.1 is to observe that, by Lemma 4.1, for each t>2,tpR (£,«) ~ (1 - eA^a'-x) - te-Ao<-a'-^)+, with x = -IriR, up to an error of order 4/fc as k > t — 2. Applying Lemma 4.2 to f(zi,z2) = (1 — eZ2 — te~Zl)+ leads to the asymptotic estimates stated in Theorem 4.1. Let us conclude this section with a few remarks on Theorem 4.1. As shown above, the proof is based upon comparing the radius r of the obstacles with the sequence of errors dn(a). In view of the elementary formula dn(a) = adn-i(Ta), one sees that the exit time problem for a linear flow with slope a and obstacle size r is mapped to the same problem with slope Ta and obstacle size r/a. Hence it is natural to consider averages for the Haar measure ^f on the multiplicative group R*\_ in the statement of Theorem 4.1. Following the result by Caglioti-Golse [5], Boca-Zaharescu [2] have recently proved that, for m = 1, <&l(t/r) converges to a limit as r —> 0 + for all t > 0. Their proof is based on the same partition as in reference [5], but uses Farey fractions and Kloosterman sums instead of the analysis presented above. c d
If i £ l , the notation x+ designates sup(x,0); the notation 1A designates the indicator function of A. Whenever necessary, we specify the dependence upon a of the continued fraction expansion of a, denoting by qn{a) the denominator of the nth convergent, by dn(a) the nth error, and so on.
On the statistics of free-path lengths for the periodic Lorentz gas
445
5. Applications to kinetic theory Set D = 2. Define Qc — {ez \z G Ze}, and consider the transport equation onft.xS1,
dt9i+vVxge=0 ge(t, x,v) = 0
9e\t=0 = fn\QtXS1
,
for x G dflt, v • nx > 0,
(4)
with unknown g€(t,x,v). Here, nx is the inward unit normal at the point x G dCle and / ' " is a given, nonnegative function of CC(R2 x S 1 ). Physically, this is the variant of the periodic Lorentz gas with scatterers replaced by holes (or traps) where impinging particles fall instead of bouncing back. Obviously, II&IIL*.,,, = | | / m | | t » „ - Reasoning as in reference [13] suggests that ge -* g in Lt?x,v weak-*, where g solves the uniformly damped transport equation dtg + v • Vxg + g = 0 on R*+ x E 2 x S 1 ,
g\t=0 = / i n ,
(5)
but this is ruled out by Theorem 2.1 of reference [12]. Theorem 4.1 suggests instead that the resulting damping rate should vanish in the limit as t —> +oo. The following result was proved by Caglioti-Golse [5]: Theorem 5.1. Let fm > 0 be a continuous bounded function on M2 x S 1 and let gc be, for each e € (0, -|), the solution of (4). Then, for each nonnegative, compactly supported, C1 test function \, one has Jl5o//(jhTeF /
9r{t,x,v)—
Jx(x,v)dxdv
= / / g(t,x,v) \(x, v) dxdv + o( -^ j
j !s //( Ii ijj/"V(^.)f)x(...)**-//*....)x(...)-.* + o(^ as t —> +oo, where = -^fin(x-tv,v).
g{t,x,v)
(6)
In particular, g satisfies, in the sense of distributions, the transport equation dtg + v-Vxg
+ -g = 0,
(t,x,v)
G (0,+oo) x R 2 x S 1 .
(7)
In fact, Theorem 3.1 also rules out the possibility that the original periodic Lorentz gas (with reflecting instead of absorbing obstacles) may be described by the kinetic model (1) in the Boltzmann-Grad limit — i.e., in the same scaling limit as above. Indeed, let fe(t,x,v) be the solution to dtfe
+ V-Vxfe
fe(t,x,v)
= 0
OnfieXS1,
/eL
= fe(t,x,v-2(v-nx)nx)
0
=
/in|neXS1,
for x G <9fi€.
(8)
Theorem 5.2. There exist initial data fm that are continuous on T 2 x S 1 and such that, for e of the form e„ = 1/n with n > 3, neither fCn nor any subsequence thereof converge in Lfx v weak- * to the solution of (dt + v Vx)f(t, x, v) = C(f(t, x, •))(«), with collision integral C defined in (2).
/|(=0 =
fn
446
FRANgOIS GOLSE
Pick fm independent of v. / ' " = pm(x). By the maximum principle, fen(t,x,v) > in geJt,x,v) = p (x-tv)l{t/eri!+oo)(Ten(x/en,-v)). If/ £ „ -> / in L%>x>v weak-*, Theorem 3.1 and the same arguments as in the proof of Theorem 5.1 (see reference [5], pp. 217-218) imply that „ f(t,x,v)>^pin(x-vt), t>l. (9) If / were the solution to (1) with F = 0 and initial data / ( 0 , x, v) = pm(x), JT 2
2
J
it would satisfy
< Ae-at\\pm\\LHT2)
(10)
i (T xS')
for some constants A > 0 and a > 0 independent of the choice of pm. (This result was proved by Ghidouche-Point-Ukai [11] in the case of the linearized Boltzmann equation — see Theorem 1 (hi), p. 207 of reference [11]; adapting it to equation (1) is obvious.) But (9) and (10) are clearly incompatible, since one can choose ||p in ||L 2 = 1 with / T2 pmdx arbitrarily small.
6. Conclusions Because of the presence of too many long collision-free trajectories with near rational slopes, the Boltzmann-Grad limit of the Lorentz gas is not described by the kinetic equation (1). Whether the precise asymptotic result in Theorem 4.1 could lead to a positive result on this limit, as it does in the case of absorbing obstacles (see Theorem 5.1) remains an interesting open problem. Also, it would be interesting to extend Theorem 4.1 to space dimensions higher than 2; however, this could be hard since the current proof is based on continued fractions (the same can be said of reference [2] which uses Farey fractions instead).
Acknowledgment s The collaboration of the author with E. Caglioti and B. Wennberg benefited from the EU financed network HyKE (contract no. HPRN-CT-2002-00282).
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16.
S. Blank, N. Krikorian, Internat. J. of Math. 4, 721 (1993). F. Boca, A. Zaharescu, preprint arXiv:math.NT/0301270 . C. Boldrighini, L. Bunimovich, Ya. G. Sinai, J. Statist. Phys. 32, 477 (1983). J. Bourgain, F. Golse, B. Wennberg, Comm. Math. Phys. 190, 491 (1998). E. Caglioti, F. Golse, Comm. Math. Phys. 236, 119 (2003). H. S. Dumas, J. Dynam. Differential Equations 3, 593 (1991). H. S. Dumas, L. Dumas, F. Golse, J. Statist. Phys. 82, 1385 (1996). H. S. Dumas, L. Dumas, F. Golse, J. Statist. Phys. 87, 943 (1997). G. Gallavotti, Phys. Rev. (2) 185, 308 (1969). G. Gallavotti, Nota int. no. 358, Istit. di Fisica, Universita di Roma (1972). H. Ghidouche, N. Point, S. Ukai, J. de Math. Pures et Appl. (9) 57, 203 (1978) F. Golse, B. Wennberg, M2AN Model. Math, et Anal. Numer. 34, 1151 (2000). H. Lorentz, Arch. Neerl. 10, 336 (1905). L. Santalo, Integral geometry and geometric probability, Addison-Wesley, 1976. H. Spohn, Comm. Math. Phys. 60, 277 (1978). E. Stein, Harmonic Analysis, Princeton University Press, 1993.
Nonequilibrium statistical mechanics of open classical systems Luc
REY-BELLET
(U. Massachusetts)
We describe the ergodic and thermodynamical properties of chains of anharmonic oscillators coupled, at the boundaries, to heat reservoirs at positive and different temperatures. We discuss existence and uniqueness of stationary states, rate of convergence to stationarity, heat flows and entropy production, Kubo formula and Gallavotti-Cohen fluctuation theorem.
1. Introduction We report on a series results obtained through various collaborations with J.-P. Eckmann, M. Hairer, C.-A. Pillet and L. E. Thomas [2-5,18-21]. We study the statistical mechanics of chains of anharmonic oscillators coupled at both ends to heat reservoirs at different temperatures. The reservoirs are modeled by linear wave equations and the model is completely Hamiltonian. Such oscillators chains (with various models of reservoirs) are widely used as simple models to test the validity of Fourier Law (see [1,15] for reviews and references) which is a fundamental open question in nonequilibrium statistical mechanics. We consider a chain of n (arbitrary but finite) rf-dimensional oscillators with coordinates q — ( g i , . . . ,qn) E Rnd, and momentap = {p\,... ,p„) S Rnd and with Hamiltonian
#cM = £ f + Et/(1)te) + Elf/(2)fe-^+i) »=i
where Uw and U^ are C°° confining potentials. Each of the reservoirs is described by a wave equation in R.d with Hamiltonian HB(
+ qi I
VipL(x)pL{x)
dx + Hc{p, q)
+ qn
V
(2)
JRd
The Hamiltonian (2) describes the system at finite energy, i.e., at temperature zero. The inverse temperatures of the reservoirs PL and 0R are introduced by assuming that the initial conditions of the reservoirs are distributed according to Gibbs measures p,pL and
447
448
Luc
REY-BELLET
fj,/3R. Formally these measures are given by "Z-1exp(-/?fcff(v>fc)7Tfc)) I ] d>Pk(x)d*k{x)» .
(3)
x€Kd
The equation (3) is merely a formal but suggestive expression. The measures /X/3fc are constructed as follows: The space of finite energy solutions of a wave equation is the real Hilbert space H = H\xL2 with a scalar product denoted by (•, •). We have then Hg(ip, TV) = 5 ((?, 7r), (ip, IT)). The Gibbs measure npk is, by definition, the Gaussian measure (supported on the space of tempered distributions S' x S') with mean 0 and covariance /3^"1(-, •). The existence of this measure follows from Bochner-Minlos Theorem. Almost surely, the initial conditions of the reservoir have infinite energy. For example in dimension d = 1 one recognizes this measure as the product of a Wiener measure (for the
2. Ergodic properties We describe and comment our technical assumptions.
Nonequilibrium statistical mechanics of open classical systems H I Polynomial Growth:
449
There exist constants Ai > 0, i = 1,2, such that lim \-kiU(i){\x)
= Ai\\x\\ki,
(4)
X—»oo
and similar conditions for the first and second derivatives of C/M. Moreover we have k2 > h > 2. H2 Non-degeneracy of t/( 2 >: denote the linear maps given by
(5)
For x e Rd and m = 1,2,..., let A^^x)
d
: Rd -> Rdm
flm+lr/(2)
We assume that for each x e t f there exists mo such that Ra,nk(A{1){x),...,A{mo\x))=d.
(7)
The first part of H I is a mild condition of polynomial growth on U^\x) and U^2\x). The condition H 2 is a local non-degeneracy condition which will ensures that energy is transmitted through the chain. In the special case d = 1, H 2 reduces to the fact that for any q, there exists m = m(q) > 2 such that d^5L () ^ 0, i.e., there are neither linear pieces in the potential nor infinitely degenerate points. The second part of H I , equation(5), ensures that the two-body potential U^ grows as fast or faster than the one-body potential U^'fa) at infinity. This assumption has a dynamical significance. Infinite chains of nonlinear oscillators are known to exhibit breathers, i.e., spatially exponentially localized, time periodic solutions (see e.g. [16]). The condition (5) ensures that these breathers get more and more delocalized as their energy increases. On the contrary, if k\ > k-2, a simple scaling argument shows that, at high energy, the oscillators in the chain behave essentially as uncoupled oscillators (the so-called anticontinuum limit). A crucial ingredient of our dynamical analysis is to show that initial conditions with energy localized far away from the boundaries spread sufficiently in order to interact with the reservoirs and dissipate their energies into them. In the case fci > k% we are unable to have good enough estimates to show even the existence of a stationary state. We expect, in any case, to have a much slower rate of convergence to the stationary state. H3 Rational coupling: tions pk have the form
The Fourier transforms pk(w), k € {L, R}, of the coupling func-
where Pk are polynomials. The assumption H 3 is, in effect a Markovian assumption. With a change of variables one can reduce the dynamics of the chain coupled to the reservoirs at positive temperature to a set of Markovian stochastic differential equations for the variables p, q, and a finite number
450
Luc REY-BELLET
of auxiliary variables. In the simplest case Pk (w2) <x w2 + 7^ the equations have the form dq\ = pi dt, dpi = {-VqiV(q)
-
XLrL)dt, (2f3Z1lL)1/2dBL,
drL = ( - 7 L r L + XLPl) dt + dqj=Pjdt,
j =
dpj = -V 9 j . V{q) dt, dqn
2,...,n-l, j =
2,...,n-l,
=pndt,
dpn = (-V g „ V(g) - XRTR)
drR = (-7T f l + \Rpn) dt +
dt,
(2p-1jR)1^dBR,
where r^, k G {L,R} are auxiliary variables, A^ are coupling constants given by A| = / |/3fc(a;)|2 dx, and Bk, k G {L, R} are Brownian motions. For more general polynomials one obtains similar equations [20]. One can state all our result in terms of the Markov process which solves equations (9), but we will choose here to state them in terms of the original variables. Consider the Hamiltonian equations of motion for the Hamiltonian (2). We concentrate on the variable of the chain and denote by Pt=pt(p,q,$),
qt = qt{p,q,$),
(10)
the solution with initial conditions (p, q) for the chain and initial conditions $ = ('PL^L, ^PR^R) for the reservoirs. Since the initial condition of the reservoirs 3> is distributed according the Gaussian Gibbs measure fipL x / z ^ on CI = (<S')4, we may view the solution equation(lO) as a stochastic process ( t , * ) e l x ! l H (pt, qt) G R2nd.
(11)
The measure fipL x \i$R induces naturally a probability distribution on path space which we denote by Ppfg and we denote by Epf, the corresponding expectation, where the subscript (p, q) indicates the initial conditions of the chain. Theorem 2.1 (Ergodic properties). If conditions HI, H2, and H3 are satisfied we have (a) Ergodicity: such that
There exists a measure n0Lt0R
on R2nd
with a positive smooth density
lim - / f{ps,q3)ds= f(p,q)d-K0L,0R{p,q), (12) - °° t Jo J for all initial condition x = (p, q) of the system, for \i0L x \i0R almost all initial conditions of the reservoirs, and for all observables f £ L1(TT0L^0R). (b) Exponential convergence: Let 9 < min{/3L,/3fi} and let f{p,q) be an observable such that \f(p, q)\ < Ce9H(-p'q\ then there exist positive constants C = Ce and a = a$ such that t ,
E^,/3R)
at eH [f(pt,qt)} - J fip, q) dn0L,0R(p,q) < Ce- \\f\\ee ^
where \\f\\e = supPiQ | / | e x p ( - 0 t f ) .
,
(13)
Nonequilibrium statistical mechanics of open classical systems
451
Part (a) of Theorem 2.1 tells us that, for almost all initial conditions of the reservoirs, the system will converge to a stationary state, while part (b) shows that this convergence occurs, in average, at a exponential rate. The proof of Theorem 2.1 can be found in [20] and is based on a detailed analysis of the Markov process given by equation (9). We prove hypoellipticity of the generator to obtains smooth transition probabilities. We use control-theoretic tools and the support theorem of Stroock-Varadhan to show the irreducibility of the Markov process. Finally the central part of the proof consist in establishing dissipation estimates on the dynamics and constructing a Liapunov function for the Markov semigroup. Altogether we show that the Markov semigroup is a compact irreducible semigroup on a suitable function space.
3. Entropy production and its fluctuations If the reservoirs have unequal temperatures one does expect that in the stationary state, energy is flowing from the hot reservoir through the chain into the cold reservoir (positivity of entropy production). Little is known about the general properties of systems in a nonequilibrium stationary state. The Kubo formula and Onsager reciprocity relations are such properties which are known to hold near equilibrium (i.e., if the temperatures of the reservoirs are close) and this is a result about small fluctuations around equilibrium (of central limit theorem type). In the last few years, a new general relation about nonequilibrium states has been discovered, the so-called Gallavotti-Cohen fluctuation Theorem. It asserts that the large fluctuations of the ergodic average of the entropy production have a certain symmetry. This symmetry can be seen as a generalization of Kubo formula and Onsager reciprocity relations to situations far from equilibrium. It has been discovered in numerical experiments in [6], proved as a theorem for Anosov maps [7,8] (modeling systems with deterministic thermostats), and extended to Markov processes in [12,14,17]. For our model this relation is proved in [21]. The large deviations aspects are nontrivial, due to the noncompactness of the phase space, the unboundedness of the observable of entropy production, and the degeneracy of the coupling (at the boundaries only). Note that the fluctuation theorem is derived entirely within Hamiltonian formalism without a-priori chaoticity or randomness assumptions on the dynamics (see also [9]). To define the heat flows and the entropy productions we define the energy of the j t h oscillators of the chain as Hj = |
+ C/ (1) fe) + \ (tf< 2 >( 9i -i -
qi)
+ C/ (2) fe - qi+i))
,
(14)
i.e., its kinetic energy, its potential energy plus half of its interaction energy with its neighbors. This choice is somewhat arbitrary, but other choices lead to exactly the same results. Differentiating along a trajectory we find that -jf(Pt,qt)
= Fj-i(puqt)
- Fj(pt,qt),
(15)
where *i(P.9) =
{Pj+ J+l)
£
VUW(qj-qj+1).
(16)
452
Luc
REY-BELLET
It is natural to interpret Fj as the heat flow from the j t b to the (j + l ) t h particle in the chain. We define corresponding entropy productions by aj = {f3R-PL)Fj.
(17)
Our results on the heat flow and entropy production are summarized in Theorem 3.1 (Entropy production). If conditions HI, H2, and H3 are satisfied we have (a) Positivity of entropy production: The average of the entropy production Gj in the stationary state TTj3L)j3R is independent of j and nonnegative, j ' ajdKpLfiR > 0 and it is positive away from equilibrium / < ajdlrPL,0R = °
(b) Large deviations
and fluctuation
if
and
onl
y if PL= PR-
theorem:
*i* = \f
(18)
The ergodic averages
°M*))
(19)
satisfy the large deviation principle: There exist a neighborhood O of the interval / VjdnpL^n
, /
(20)
^jdTT/3L^H
and a rate function e(w) (both independent of j) such that for all intervals [a, b] C O we have lim - - l o g P ^ J f o * € [a, b}} = inf e(w). (21) t-»oo
t
™
w€[a,b]
Moreover the rate function e(w) satisfy the relation e(w) - e(-w) = -w,
(22)
i.e., the odd part of e is linear with slope —1/2 (Gallavotti-Cohen fluctuation Theorem), (c) Central limit theorem and Kubo formula: Let us put (3 = (fih + PR)/2 and T) = (3R — f3i. The fluctuations of the heat flow at equilibrium satisfy the central limit theorem
^p™{a
Fiips qs)ds
>
} 7^L
-pf-^-
^
where K2 is finite and positive, independent of j , and given by the integrated autocorrelation function 2 K
= J™ ( / Fj(P, q) KT
\FJ(P*. it)} <*^(P> q)) dt.
(24)
Moreover we have Kubo formula d
-
(J
Fjdn^
= KZ . »7=0
(25)
Nonequilibrium statistical mechanics of open classical systems
453
T h e central limit theorem follows easily from the strong ergodic properties obtained in Theorem 2.1. T h e large deviations for CTJ are more difficult, in particular since ai is an unbounded observable, not even bounded by the energy. B u t we use the very intimate link of t h e entropy production with the dynamics t o show t h a t it satisfies a large deviation principle. Note also t h a t all our results on the fluctuations of Oj are independent of j , the fluctuations are the same wherever the flow is measured and this would remain true even if we would choose different potentials at each lattice site or bond.
References 1. F. Bonetto, J. L. Lebowitz, L. Rey-Bellet, "Fourier law: A challenge to theorists" in Mathematical Physics 2000, Imp. Coll. Press, London, 2000, pp. 128-150. 2. J.-P. Eckmann, M. Hairer, "Non-equilibrium statistical mechanics of strongly anharmonic chains of oscillators", Comm. Math. Phys. 212, 105-164 (2000). 3. J.-P. Eckmann, M. Hairer, "Spectral properties of hypoelliptic operators", Comm. Math. Phys. 235, 233-253 (2003). 4. J.-P. Eckmann, C.-A. Pillet, L. Rey-Bellet, "Non-equilibrium statistical mechanics of anharmonic chains coupled to two heat baths at different temperatures", Comm. Math. Phys. 201, 657-697 (1999). 5. J.-P. Eckmann, C.-A. Pillet, L. Rey-Bellet, "Entropy production in non-linear, thermally driven Hamiltonian systems", J. Stat. Phys. 95, 305-331 (1999). 6. D. J. Evans, E. G. D. Cohen, G. P. Morriss, "Probability of second law violation in shearing steady flows", Phys. Rev. Lett. 71, 2401-2404 (1993). 7. G. Gallavotti, "Chaotic hypothesis: Onsager reciprocity and fluctuation-dissipation theorem", J. Stat. Phys. 84, 899-925 (1996). 8. G. Gallavotti, E. G. D. Cohen, "Dynamical ensembles in stationary states", J. Stat. Phys. 80, 931-970 (1995). 9. C. Jarzynski, " Hamiltonian derivation of a detailed fluctuation theorem", J. Statist. Phys. 98, 77-102 (2000). 10. V. Jaksic, C. A. Pillet, "Ergodic properties of classical dissipative systems. I", Acta Math. 181, 245-282 (1998). 11. F. Herau, F. Nier, "Isotropic hypoellipticity and trend to equilibrium for Fokker-Planck equation with high degree potential", preprint (2002). 12. J. Kurchan, "Fluctuation theorem for stochastic dynamics", J. Phys. A 31, 3719-3729 (1998). 13. R. Lefevere, A. Schenkel, "Perturbative analysis of anharmonic chains of oscillators out of equilibrium", preprint (2003). 14. J. L. Lebowitz, H. Spohn, "A Gallavotti-Cohen-type symmetry in the large deviation functional for stochastic dynamics", J. Stat. Phys. 95, 333-365 (1999). 15. S. Lepri, R. Livi, A. Politi, "Thermal conduction in classical low-dimensional lattices.", Phys. Rep. 377, 1-80. 16. R. S. MacKay, S. Aubry, "Proof of existence of breathers for time-reversible or Hamiltonian networks of weakly coupled oscillators", Nonlinearity 7, 1623-1643 (1994). 17. C. Maes, "The fluctuation theorem as a Gibbs property", J. Stat. Phys. 95, 367-392 (1999). 18. L. Rey-Bellet, "Statistical mechanics of anharmonic lattices", in Advances in Differential Equations and Mathematical Physics, Contemporary Mathematics 327, Providence, R.I.: AMS, 2003. 19. L. Rey-Bellet, L. E. Thomas, "Asymptotic behavior of thermal non-equilibrium steady states for a driven chain of anharmonic oscillators", Comm. Math. Phys. 215, 1-24 (2000). 20. L. Rey-Bellet, L. E. Thomas, "Exponential convergence to non-equilibrium stationary states in classical statistical mechanics", Comm. Math. Phys. 225, 305-329 (2002). 21. L. Rey-Bellet, L. E. Thomas, "Fluctuations of the entropy production in anharmonic chains", Ann. H. Poinc. 3, 483-502 (2002).
454
Luc
REY-BELLET
22. Z. Rieder, J. L. Lebowitz, E. Lieb, "Properties of a harmonic crystal in a stationary nonequilibrium state", J. Math. Phys. 8, 1073-1085 (1967). 23. H. Spohn, J. L. Lebowitz, "Stationary non-equilibrium states of infinite harmonic systems", Comm. Math. Phys. 54, 97-120 (1977).
Operator algebras and quantum information Session organized by A.
OCNEANU
(Penn State) and D.
PETZ
(Budapest)
This page is intentionally left blank
Quantum dynamical entropies and quantum algorithmic complexities FABIO BENATTI (U.
Trieste)
The classical notions of dynamical entropy and algorithmic complexity are related to each other by a theorem of Brudno. In this contribution, we look at the quantum dynamical entropies of Connes-Narnhofer-Thirring and Alicki-Fannes-Lindblad in the light of recently proposed quantum algorithmic complexities.
1. Introduction The average degree of randomness of classical dynamical systems is measured by the Kolmogorov dynamical entropy [1,2], the absolute one by the Kolmogorov algorithmic complexity [3,4]: these two notions are related by a Theorem of Brudno [5,6]. Quantum information [7-9] provides an ideal testing ground for the many quantum dynamical entropies put forward so far [10-16]; particularly interesting appears the study of the relations between the latter and recently proposed quantum algorithmic complexities [17-20].
2. Kolmogorov entropy and algorithmic complexity Classical dynamical systems consist of a measurable phase-space X, a measurable dynamics T : X \—> X and an invariant probability measure fi. Any partition £ = {Ej}f=1 of X into finite, measurable disjoint atoms Ej encodes [21] trajectories {T]x}^ZQ, x € X, by the set i!^ of the strings i£ = inii • • • in-\, ij = 1, 2 , . . . , D, made of the labels of the subsequently visited atoms. Trajectories from x G E(i„) := PI^~QT~^(E^) have a same i£: by setting £ £ p{i n) := /i(£(i£)), the sets I have Shannon entropy H^S.T.n) := - £ \ g p(i£) log 2 p(i£) and contribute an entropy production [1,2] /^(T, £) := linin-^+oo ^H^S^T^n). Definition 2.1. [1,2] The Kolmogorov entropy is: /iM(T) :— sup/i^(T, S). e
Remark 2 . 1 . Stationary classical sources emitting strings i„ of bits with probabilities /i(i n ) are dynamical systems: X is the set of binary sequences i, T the left shift (
457
458
FABIO BENATTI
program p of length £(p) U can halt and output a binary string i„ = U(p) or else loop forever. In Probabilistic Turing machines (PTM) [22] connections among configurations are one-to-more, each one occurring with its assigned probability. Definition 2.2. [3,4] Let W b e a TM, the algorithmic complexity K(i„) of a binary string i n is : K{\n) •= min {£(p) : U(p) = i„}. Remark 2.2. Unlike the Kolmogorov entropy, the algorithmic complexity is an absolute, ^-independent, measure of randomness. Up to a constant independent of the string i„, K(\n) does not change changing U [3,4]. Since i„ can be copied, K{in) < n + C, where C accounts for, say, printing instructions. This bound is very loose: if i„ consist of all ones, K{\n) < log2 n + C. Regularity means compressibility and algorithmic randomness of i n is identified with literal transcription being the cheapest way to reproduce it. For infinitely long trajectories one has [21]: Definition 2.3. [21] Complexity of {Tix}j€N:
KX(T) := sup £ limsup„_ + 0 0
±K(\^{x)).
Kolmogorov entropy and Kolmogorov algorithmic complexity are related to each other by the following theorem [5,6]: Theorem 2.1. If(X,n,T)
is ergodic, KX(T) = h^T)
p,-a.e.
When the programs p are restricted to be self-delimiting, one speaks of prefix-algorithmic complexity [3]; then, K(\n) = — log2 m(i„) + 0(1), where m is a so-called universal probability, that is a universal enumerable discrete semimeasure on the set In of strings of length n [3]. Using m, one connects average complexity and Shannon entropy: Theorem 2.2. [3] With jjLn := {/i(i„)} any computable probability distribution over In and H(fin) its Shannon entropy: 0 < (J2i ^{in)K{in) — ^T(/i„)) < C.
3. Quantum systems Discrete-time quantum dynamical systems are triples (M, 0 , OJ) where [23] M is an algebra of operators, 6 : M H-> M an automorphism of M and to a ©-invariant state. Bit-streams: [24-27] Let &,-, j £ N, be operators such that e, = ej, e | = 1 and efe,- = ejei(—l)s(l*-Jl), with J : N H { 0 , 1 ] fixing the (anti)commutation relations. The operators wi = eia&ix •••eik_1, with io < i\ < • • • < ik-i ^ [0,n — 1] and wq> = 1 generate local algebras A^[o,n-i] embedded in the *-algebra .Mo of all wj. Let a(ei) = e^+j be the shift automorphism on Mo and OJ(WJ) = 1 if I = 0, zero otherwise, a cr-invariant state. By completing Mo within the GNS-construction, one obtains a von Neumann algebra M. We focus on three cases specified by the function g. 1 + e1 • go = 0 : M is generated by commuting projections pi = ——- with oj(pi) = - and one recovers the commutative Bernoulli shift. •
- , and the infinite temperature state u;(a?jak) — -^-.
Quantum dynamical entropies and quantum algorithmic complexities
459
• 9 ='• 92 such that, given / , for countably many p ^ q ap(wi) and o~q(wi) anti-commute (strong anti-commutativity [28]). Some of the statistical properties of bit-streams are captured by the entropy density [29] s(w) := lim -S(p[o,„-i]),
(1)
n—»oo n
where S(p^t„_i]) := — Tr(p[o,n-i] l°g2 Pfo.n-i]) ls the von Neumann entropy of the local density matrices obtained by restriction of the invariant state u> to the local sub-algebras. Remark 3.1. Classical sources are particular (commutative) quantum dynamical systems and their mean entropy density coincides with h^a). Quantum information [7-9] deals with the many facets of information being carried, stored and processed by quantum systems. This has spurred efforts to find analogs of Shannon's coding theorems [7,30-32] (see Remarks 2.1, 3.1). It is thus natural to study the role of quantum dynamical entropies in compressing the information coming from quantum sources; in the following, we will consider two of them [11,14] based on the notion of state decompositions, respectively of partitions of unit. Definition 3.1. Decomposition of u>: any convex combinations u> = J2j ^jwj ojj and weights 0 < A.,- < 1, 5Z • Xj = 1. Definition 3.2. Partition of unit: any set y = {yj}jLi,
w tn
i
states
Vj £ M. with ^ , = 1 y*jVj — 1.
Remark 3.2. In quantum information, partitions of unit are known as POVM [7]: they map the density matrix p into the density matrix V . Vjpy*j. Partitions of unit may also be used to decompose quantum states into linear convex combinations of other states without perturbing them, indeed p = J2j y/pyjVjy/P, *j = TrO%*Z/?)> <^j = (VPyjVjVp)/^3.1. CNT-entropy Given (M,&,us), the quantum dynamical (CNT-)entropy proposed by Connes, Narnhofer and Thirring [11,33] is constructed by means of decompositions of the state w. One starts by introducing the notion of entropy of a sub-algebra. With N C M. a finitedimensional sub-algebra and u> \ N the restriction of u> to N, consider all possible decompositions u> = £V Xjojj and set HU(N) := S(u \ N) - m i n ^ ^ . x.u. j £ \ ^j s(uj \ N ) | Remark 3.3. The entropy of a sub-algebra is related to the entanglement of formation [7] and can be used to better study its properties [34-36]. Then, one defines n-sub-algebra Junctionals [11,33] ffw(N, 9 ( N ) , . . . , 9™ - 1 (N)) by means of decompositions u = ^ s Xsu)s of w with s = soSi • • • s n - i a multi-index, and of their n marginals. It follows that / i g N T ( 9 , N ) := lim n _ + 0 O $HU(N, 9 ( N ) , . . . , 9"~ 1 (N)) is a welldefined entropy production leading to Definition 3.3. [11] CNT-entropy: h £ N T ( 0 ) := s u p / i S N T ( 9 , N ) . N
Bit-streams:
[27] g0 : / i 2
N
» = 1i
9i • / £ " » = \ J 92 • / £
N
» = 0.
460
FABIO BENATTI
3.2. ALF-entropy Given (M,co, 0 ) , let Mo C M be a 0-invariant sub-algebra. The dynamical (ALF-)entropy of Alicki and Fannes [14,23] is based on partitions of unit (see Definition 3.1). They are used to symbolically encode the quantum dynamics as a shift on quantum spin chains, that is on the C*-inductive limit M N of tensor products M[0)„] := <8>fc=o ( M / j ) of D-dimensional matrix algebras M Q . A state on M N comes from partitions of unit y = {Vj}f=i taken from a 0-invariant sub-algebra Mo C M as follows: one considers the partitions of unit Qk(y) := {0fc(j/j)}£=i and the time n refinements y^n_1} := {0" _ 1 (2/j„- 1 )0"" 2 (%„- 2 ) • • -©(j/ji)*/*,}- T h e ex1 pectations uj(y*joQ{y*h) • • • Q " " ^ . ^ . , , . ! ) • • • 0(j/ti)y»o) a r e matrix elements of Dn x Dn density matrices p[3^r®;T1_1i] with von Neumann entropies S(p[yfp „_ 1] ]). Definition 3.4. [14] ALF-entropy: h£%0(Q)
:= sup limsup - S ( p M „ . , J ) . yeM0 n-»+oo n '
Bit-streams: [27] g0 : h^F(a) = 1; 9l : /i£LF(
K
:= — log2 fi with fj. a universal, enumer-
• QK\ refers to descriptions of quantum states by means of classical programs that are input of QTM which approximate ip with accuracy taken care of by the logarithmic
Quantum dynamical entropies and quantum algorithmic complexities
461
correction to the length £(p). QKi is the cheapest bit-description of quantum states and is upper bounded by [17]: QK1{ip)<2n
+ C,
(2)
with a -^-independent C. Actually, it can be proved [19] that there are ip such that QK1{ip)>2n-2log2n
+ C.
(3)
• QK2 is the shortest qbit-description of quantum states with fidelities > 1 — 1/fc, Vk > 0. Since literal transcription of ip £ Hn by QTM is always possible, one gets QK2(il>)
(4)
where the constant C does not depend on ip. • QK3 does not refer to QTM, but hinges on the existence[19] of a universal (semi) density matrix /x . With p a computable density matrix on Hn, one gets [19] a result relating average quantum complexity and von Neumann entropy (see Theorem 2.2): Ci + S(p)
S(p) + C2 .
(5)
4. Quantum entropies and complexities Evidently, the CNT-entropy is sensitive to the algebraic properties of bit-streams, whereas the ALF-entropy is not. Brudno's theorem 2.1 links the complexity production (per symbol) to the entropy production, if anything of its kind holds in the quantum case, then, the CNTentropy of strongly anti-commutative bit-streams would not support any lasting complexity production, while the ALF-entropy would. What are the relations between the quantum complexities and the CNT and ALF-entropies? Consider the fermionic bit-streams with g = g\. The local algebra .M[o,2n-i] generated by operators wj, / C [0,2n — 1], is made of 2" x 2" matrices and its local state is the 2n-dimensional tracial state P[o,2n-i] = 2 _ n l 2 " , a computable density matrix with von Neumann entropy S(p^0^n~i]) = n- Then, from (5), the average QK3 per symbol, (Tr(p[ 0i 2n-i]<3^3)/2n, tends asymptotically to hCNT(a). Also, the Hilbert space of M[0,2n-i] 1S W„ = C 2 "; from (4) vectors ip £ Hn have QK2 complexity per symbol asymptotically bounded by 1/2, thus agreeing with hCNT(a). On the other hand, hALF(a) = 1 seems to bear relations to QK\ which, for some states ip £ Ttn can scale as 2n instead as n (see (2) and (3)). 5.
Conclusions
The arguments of above hint at the CNT-entropy being related to two of the existing quantum complexities, QK2,3, which are quantum descriptions of quantum objects, while QK\ is rather a classical description of them. Seemingly, QK2<3 as well as h%NT are able to capture the algebraic rigidity imposed upon the operators e; by anti-commutativity: in the case of strong anti-commutativity, bit-streams would have asymptotically zero quantum complexity per symbol.
462
FABIO BENATTI
T h e reason may be grasped as follows: the ALF-entropy, being based on partitions of unit, models the q u a n t u m dynamics by means of local q u a n t u m spin algebras whose dimension increases with the number of time-steps, 2n. It is the complexity of the symbolic model rather t h a n t h e one of the underlying bit-stream which is inspected by t h e ALF-entropy. On the contrary, t h e CNT-entropy, being based on decompositions of the global state, respects the local bit-streams algebras whose dimension, because of anti-commutativity, increases with half the number of time-steps, n. Actually, the CNT-entropy coincides with the mean-entropy (1) of the bit-stream, while the ALF-entropy with t h a t of its best spin-chain symbolic model. Of course, it may be the case t h a t QK\ or new sensible q u a n t u m complexities may t u r n to be related t o the ALF-entropy and reveal different, equally important aspects of the complexity of the q u a n t u m dynamics.
References 1. A. Katok, B. Hasselblatt, "Introduction to the modern theory of dynamical systems", Encyclopedia of Mathematics and its Applications, Cambridge University Press, 1999. 2. P. Billingsley, Ergodic Theory and Information, Wiley & Sons, 1965. 3. M. Li, P. Vitanyi, An introduction to Kolmogorov Complexity and its Application, Springer, 1997. 4. T. M. Cover, J. A. Thomas, Elements of Information Theory, Wiley & Sons, 1991. 5. A. A. Brudno, Trans. Moscow Math. Soc. 44, 127 (1983). 6. H. White, Erg. Th. Dyn. Sys. 13, 807 (1993). 7. M. A. Nielsen, I. L. Chuang, Quantum Computation and Quantum Information, Cambridge University Press, 2002. 8. R. F. Werner, Springer Tracts in Modern Physics 173, 14 (2001). 9. M. Horodecki, P, Horodocki, R. Horodecki, Springer Tracts in Modern Physics 173, 151 (2001). 10. A. Connes, E. St0rmer, Acta Mathematica 134, 289 (1975). 11. A. Connes, H. Narnhofer, W. Thirring, Comm. Math. Phys. 112, 691 (1987). 12. J.-L. Sauvageot, J.-P. Thgouvenot, Comm. Math. Phys. 145, 411 (1992). 13. W. S. Slomczynski, K. Zyczkowski, J. Math. Phys. 35, 5674 (1994). 14. R. Alicki, M. Fannes, Lett. Math. Phys. 32, 75 (1994). 15. D. Voiculescu, Comm. Math. Phys 170, 249 (1995). 16. L. Accardi, M. Ohya, N. Watanabe, Open Systems Infor. Dynamics 4, 71 (1997). 17. P. Vitanyi, IEEE Trans. Infor. Theory 47, 2464 (2001). 18. A. Berthiaume, W. Van Dam, S. Laplante, J. Comput, System Sci. 63, 201 (2001). 19. P. Gacs, J. Phys. A: Math. Gen. 34, 6859 (2001). 20. K. Tadaki, "Upper bound by Kolmogorov complexity for the probability in computable POVM measurement", quant-ph/0212071. 21. V. M. Alekseev, M. V. Yakobson, Phys. Rep. 75, 287 (1981). 22. J. Gruska, Quantum Computing, McGraw-Hill, London 1999. 23. R. Alicki, M. Fannes, Quantum Dynamical Systems, Oxford University Press, 2001. 24. G. L. Price, Canad. J. Math. 39, 492 (1987). 25. R. T. Powers, Canad. J. Math. 40, 86 (1988). 26. E. St0rmer, Invent. Math. 110, 63 (1992). 27. R. Alicki, H. Narnhofer, Lett. Math. Phys. 33, 241 (1995). 28. H. Narnhofer, W. Thirring, Lett. Math. Phys. 30, 307 (1994). 29. M. Ohya and D. Petz, Quantum entropy and its Use, Springer, Berlin, 1993. 30. R. Jozsa and B. Schumacher, J. Mod. Opt. 41, 2343 (1994). 31. D. Petz and M. Mosonyi, J. Math. Phys. 42, 4857 (2001).
Quantum dynamical entropies and quantum algorithmic complexities
463
32. I. Bjelakovic, T. Kriiger, R. Siegmund-Schultze, A. Szkoia, "The Shannon-McMillan theorem for ergodic quantum lattice systems", arXiv:math.DS/0207121. 33. F. Benatti, Deterministic Chaos in Infinite Quantum Systems, Springer, Berlin, 1993. 34. F. Benatti, J. Math. Phys. 37, 5244 (1997). 35. F. Benatti, H. Nanrhofer, Phys. Rev. A 63, 042306 (2002). 36. F. Benatti, H. Narnhofer, A. Uhlmann, J. Math. Phys. 44, 2402 (2003). 37. E. Bernstein, U. Vazirani, SIAM J. Comput. 26, 1411 (1997).
Modular invariant partition functions in statistical mechanics, conformal field theory and their realisation by subfactors DAVID E. EVANS (U. Wales) Dedicated to the memory of John T. Lewis
We review what the operator algebra approach has to offer in understanding modular invariant partition functions in integrable classical statistical mechanical models and in conformal field theory.
1. Ising model For pedagogical reasons it is useful to begin at the first non-trivial setting of the two dimensional Ising model, on a square lattice 1?. Here the classical model is set in the commutative C*-algebra C(P) = ® z 2 C 2 of all continuous functions on all configurations P of distributions of two variables labelled + and — on each lattice site of Z 2 . The transfer matrix method [1,17,23], [22, Chapter 6] allows us to transform the problem (e.g. of understanding equilibrium states) to the Pauli algebra A — ® z M^ in one dimension fewer. An equilibrium state ix then yields a linear map 0M on the quantum observable algebra (££)z M%, and there is a transformation F —> F@ taking a local classical observable to a quantum one such that we can recover the classical expectation values from the quantum ones: H(F) = MF0)-
(!)
Due to the inherent reflection positivity, ^M is in fact a state, indeed moreover a ground state for the quantum dynamics Tu(—)T~lt. Here T is the transfer matrix which captures the Ising Hamiltonian but in a new one dimensional form. Taking the Boltzmann weight for a fundamental region of unit size with two column configurations yields a matrix in M4 = M2 <8> M2 as each column has four possible configurations coming from the two values at each vertex. Proceeding in this way the Boltzmann weight for each strip is thought of as a matrix T which determines the quantum Hamiltonian H, T = e~H. The first step in the identification of equation (1) and crucial to what follows in our understanding of modular invariants is the realisation of the quantum partition function Z as a trace of a power of T. Iterating, for a M x iV lattice (M rows, N columns) we get Z = tiTN for periodic boundary conditions. The partition function Z = ^ ff exp(—/3iJ(cr)) with classical two dimensional Ising Hamiltonian H = J2a g Jva&0: where the variable cra takes values ± on each lattice site a, can be computed as Z = S CT Fledges we ight( e dg e )> where the weight of an edge joining a to /3 is exp(— $Joaop). Thus we can sum the partition function by first taking the partition function of a strip, the transfer matrix T, and then (for periodic boundary conditions) taking a sum over the product of strips to get the trace of a power of T.
464
Modular invariant partition functions ...
465
The transfer matrix T — TM is the symmetric 2 M x 2 M array T(a, a') = exp (-0{[S(v)
+ S(a')}/2 + I(a, a')})
(2)
if a,a' £ { ± 1 } M are column configurations. We use the Pauli matrices, cra,a = x,y,z, according to the convention, ax = (J ^>1) and
^M2®M2
where a, a are the dual, double dual actions respectively. In this way 0 M N
2
s - (((C 2 xi Z 2 ) xi Z 2 ) x Z 2 ) • • •
Now C 2 xi Z 2 is generated by unitaries u\, u2 satisfying u2 = 1, itiu 2 = — u2u\, and ® N M 2 is generated by self adjoint unitaries «i, u2, •. • satisfying muj = UjUt, UiUi+i = —ui+xUi for i,j > 1) K — j \ > 1- We concretely identify B with C*(u,- : j € N) by {CTx}^Li = {wi,WlU3,WlW3W5,WiU3U5U7,...},
(3)
{ ° i } j l i = {u2,Ui,ue,us,...}.
(4)
By the universal property of a crossed product, there exists a unique endomorphism u on A which sends Ui to ui+i. Then v\A+ = K|J4 + since {c^cr^ +1 }^l 1 = {u 3 , w5, u 7 , . . . } . Note that v [ax) = Yll=1 crlz, so that v is not graded (vd ^ 'du), in fact v(A) C A+. Moreover v2(ax) = ax<Jx+l so that v2 ^ a. To see better how we should think of v2, we should extend v to the Cuntz algebra as follows. First, let H = C 2 be a 2-dimensional Hilbert space, and let F(H) be the full Fock space 0 ^ = o ( ® m - f f ) and ®°H is a one-dimensional Hilbert space spanned by a unit vacuum vector 0,. Then define t(£) on F ( # ) by t(£)/ = £ & / , / €
466
DAVID E. EVANS
t± = t(e±) satisfy t+t*^ + t-t*_ + Q <8> fl = 1. It follows from this relation that T2 = C*(t+,t-) contains K(F(C2)), the compact operators on F(C2). The quotient map T2 —» T2/K(F(C2)), takes i± to isometries s± satisfying s+s*+ +ss*_ = 1. Thus T2/K{F(C2)) = C*(s+,s-) = the Cuntz algebra 02. If u is a unitary on C 2 , let Fu denote 0 ^ = o ( ® m « ) on F(H). Then au — AdFu, defines an action of T2 since Fut(£)F* = t(v£), leaving the ideal fC(F(C2)) invariant and so drops to an action (3 on the quotient, the Cuntz algebra, such that 0us(Q = s{u£). The fixed point algebra T = {x € 02 : (3t(x) = x, for all t s T } of 02 is generated by {s^sl : p., v £ {±}m}. The latter form a set of 2 m x 2 m matrix units in 02, generating M2m =
= SSrS*
+ S-aSSla,
(5)
where cr = ± , so that p 2 ^ ) = S+SaS*+ + S-S-aS*_.
(6)
This means that [p]2 = [id] © [a] as sectors on 02, where a is the automorphism of 02 by interchanging + with —, i.e., a = au where u = az. Let ip be the gauge invariant state on
2. M o d u l a r invariance In the continuum limit on the torus, one then obtains an expression for the corresponding partition function as Z{j)
-
t r ( e 27rML 0 -c/24) e~2-nif
(L 0 -c/24)\ _
/^
Here LQ, LQ are commuting Hamiltonians, related to the Virasoro or affine algebras as in the case of the SU(n) models at level k. The Ising model being the first non-trivial case SU(2)
Modular invariant partition functions ...
467
at level 2. Decomposing the Hilbert space with respect to the irreducible representations of such symmetries, we then have Z
W=EA,M^XA(T)X,(T)*.
(8)
Here XA(i")=Tr# A (e 27 " T(Lo ~ c/24) ) will be the character of the irreducible representation. The partition function being intrinsic to the torus should not depend on its parameterisation. Consequently we would expect [16] Z(T) = Z((a + rb)/(c + rd)) for fractional linear transformations. Typically, the characters are transformed linearly under the action of SL(2;Z), e.g. X a ( - l / r ) = J2 b
Sa b
' ^(r)'
Xa(r + 1) = J ] T»>>> ^(r) • b
Here 5 is a symmetric unitary matrix which diagonalises the fusion rules (see equation (10) below), with 5 0 ,A > 0, and T is diagonal. These matrices S and T define the modular data. For SU(2), and more generally by increasing the number of parameters to make the characters linearly independent, the problem of modular invariance of the partition function then reduces to question of whether the mass matrix {Z\^\ commutes with the S and T matrices. Thus a modular invariant corresponds to a matrix Z which satisfies — ZS = SZ, ZT = TZ, — 2^60,1,2,3,..., — Zoto = 1.
The second condition comes from the understanding of the coefficients as multiplicities of the decomposition of the underlying Hilbert space. The normalisation condition Zo,o = 1 coming from uniqueness of the vacuum, turns out to be something of a red herring. Relaxing this condition leads to a better understanding of even the structure of the normalized modular invariants. In particular the modular invariants, the physical ones which are related to a conformal field theory, form a fusion rule algebra in their own right [20,24-26]. To go further than the Ising model, one needs integrability in the form of the YBE, namely the Yang Baxter equation. Quantum groups and Hecke algebras provide a natural framework. The Boltzmann weights for the SU(n) models lie in the fixed point algebra of the product action of SU(n) on ® Mn or more precisely of the deformation, quantum SU(n). By Weyl duality, this identifies with a Hecke algebra, or at least the representations of the Hecke algebra which are in Weyl duality or can be identified with those of quantum SU(n). In particular, when q is a root of unity e2**/(n+k)f these are finite in number, with a truncation of Young tableaux with the number of rows and columns dictated by the size of n and k respectively. Thus for SU(2), these are the Dynkin diagrams A^+i, with the Ising model corresponding to SU(2)2 or fc = 2. Thus the labels of the statistical mechanical model, which describe the choice at each lattice site have a dual meaning as labels of the irreducible representations of quantum SU(n) or more precisely the irreducible positive energy representations of the loop group. We will approach the modular invariants in the subfactor/inclusion setting. Here we will have a type III factor N with a non-degenerately braided system of endomorphisms. That is we have a finite system NXN of endomorphisms of N, closed under the natural operations
468
DAVID E. EVANS
[A][/i] = ^N^ [v] for some fusion coefficients, where [A] denotes the sector of A where we identify A with Ad(u)A for unitaries u in N. The system will not only be commutative as sectors i.e., X/J, = Ad(u)/j,\, but there will be a choice of unitaries u = u(A,/i) satisfying the YBE, braiding fusion relation etc. and such that the only degenerate sector will be the trivial one. In this case there are natural S and T matrices represented by a Hopf link and a twist respectively which provide the modular data [54,57].
3. Fusion rule structure of modular invariants To construct modular invariants, we take a braided inclusion i : N —> M, where M is not necessarily a factor, but the inclusion is related to the original braided system in that It is in E(jvA'iv), i-e., the irreducible components of the dual canonical endomorphism 6 = u lie in our braided system. Then we can induce sectors of JV to those of M by a-induction using the construction of Longo-Rehren ([45], see also [60],[6-8]). This depends on the braiding, so there are indeed two paired inductions a±, since if there is one braiding, then there is another twin braiding where all crossings are reversed. Then a fundamental result of [12] (see also [21] for an alternative proof— algebraic rather than graphical) is that Z\,v = (a$,a~),
X,fi£NXN.
(9)
is a modular invariant. Here the RHS is interpreted as the common multiplicity of irreducible subsectors of a\~ and a~. Thus if MX^ and MX^ denote the systems generated by a+ and a "-inductions, then what really features are the neutral sectors X° = X+ C\X~. This system is non-degenerately braided whilst X ± may not even be commutative, let alone braided. They are commutative when the branching coefficients b^x = (r, a*) do not exceed one (equation (14)). We can understand this [9] as a Moore-Seiberg dilation [46]. We always have the trivial modular invariant Z = Yl\ |XA|2> where we now interpret \\ as a formal character tr<7 Lo_c / 24 even when the Hamiltonian LQ may not exist. More generally we may introduce twists or permutations i? of the fusion rules preserving S and T and the vacuum 0 so that we should also consider Z = ^2X XAXS(AV Representing a modular invariant by a subfactor puts that modular invariant in this form. The trivial or twisted modular invariants for the B-system when restricted to A, written formally in terms of characters becomes Xr = J2\br,\XXi with branching coefficients bTt\. Then restricting the diagonal modular invariant Z e x t = E r e s l X r l 2 , to the original system we have: E r e B IXr|2 = E T e B I H\eAbr\X\\2, with mass matrix Z\tli — J2T br,\bTt^. These invariants are called type I (or more precisely the inclusion N C M describing this modular invariant is type I) and are necessarily symmetric Zx^ = Znx. In the presence of non-trivial twist •& of the B system, we have type II invariants %\» = £T^T,A&)9(T),M' which have symmetric vacuum coupling Z0\ = Z\Q. A type III phenomena occurs, due to some underlying heterotic structure which results in needing different labellings B+ and B~ on left and right extended systems Ac B±. In the subfactor framework, this is found in two intermediate subfactors Af c M± C M, where M± carry systems of endomorphisms B±. Not only can this situation be found with modular invariants with non-symmetric vacuum coupling as in the orthogonal at loop groups low levels [9], but even in some symmetric S't/(n)-modular invariants [20]. The extensions N C M± should
Modular invariant partition functions ...
469
be thought of as subfactor versions of the left and right maximal extensions of the chiral algebra [9]. This theory and the relation between being type I and locality developed in [9] has been exploited by Kawahigashi and Longo [41,42] in the classification of local conformal nets. The celebrated ADE classification of SU{2) modular invariants was originally through identifying the diagonal part of the invariant with the exponents of a Dynkin diagram. Nowadays, this is understood in terms of a further algebraic structure, namely a nimrep, a representation {Gu : v G N%N} of the original fusion rules of ^XN by non-negative integer matrices where the spectrum of Gv is described by the diagonal part of the modular invariant with correct multiplicity {SV\/SVQ : Z\^\ ^ 0}. Recall that by the Verlinde formula, the fusion rules are diagonalised by the symmetric S-matrix, namely ^ A = £ , f ^ . A > -
(10)
Then in our inclusion setting, we can canonically associate [12,13] a nimrep to a braided subfactor (and hence though not canonically to a sufferable modular invariant) a nimrep {G\} through the action of N%N o n N%M, a system of endomorphisms obtained by taking irreducible components of NX^T. In coordinate form, the nimrep will be diagonalised by a family ip of eigenvectors G
a,6 = E f^-.W'bV
(11)
with some diagonalising vectors, which may stubbornly refuse to yield explicit closed expressions in a particular example. Even if a modular invariant can be realised by an inclusion, such an inclusion is by no means unique. It is still however a natural question to ask if there is a canonical way to construct a subfactor or inclusion. For the normalised SU(2) modular invariants, there is such a canonical subfactor, whose dual canonical endomorphism is given by one allencompassing formula, namely: [u] = ®xZXiX[\}. (12) where the summation is over all even sectors A in NXN [20]. This formula of course does not realise the subfactor uniquely. The dual canonical endomorphisms for D and A0dd coincide at levels 6,10,14,... . There is more to an inclusion than its dual canonical endomorphism. There is moreover, an analogous dual form for the canonical endomorphism {u}=®0[0}.
•
(13)
where the summation is over all even real sectors in M%M [20]. Here M%M = M^-M V M%M has a natural symmetry arising from the interchange of a+ with a~. (In general the symmetry does not exist on MXM, in that M%M and MX^ may not even be isomorphic although they always have the same global dimensions. For q-Ss, it can happen that M^-M and jv/^jw a r e isomorphic to S3 and S3 , as sectors so that one can be commutative and the other not [25].) Then the real sectors are those fixed under the symmetry. The possible noncommutative structure of MXj^ which we have alluded to previously is precisely described as follows [13]. The complexified algebras of MX^ can be identified
470
DAVID E. EVANS
with:
0
©
Mat(6±A).
(14)
The complexified algebra of the full system decomposes as
0
Mat(ZAl/1),
(15)
\,H€.NXN
and the action of N^N on NXM ( o u r nimrep) only sees the diagonal part of this representation on
0
Mat(ZA,A).
(16)
Some elementary computations show that — #M<*W =
X]A,M^A,M>
— #A"^M = ^ A ^ A . A ,
which can be read off as tr ZZ* and tr Z respectively. This raises the question, which we first noticed in 1999, of how to understand ZZ* as a modular invariant, albeit in general not a normalized one in terms of an inclusion. Moreover, by looking at the SU{2) examples and further one suspected that the decomposition of the modular invariant ZZ* into normalised modular invariants was related to the decomposition of N%N m^° its sheets [9,20]. This programme has been carried out (the details of the story is in [25]). Suppose N C Mc is an inclusion realising a modular invariant Zc for c = a,b. Then ZaZ^ is realised by an inclusion N c M, such that the central decomposition of M breaks up ZaZ£ into normalised modular invariants Zp realised by a subfactor N —> Mp for each minimal central projection in M'C\M. The system Ma^Mb is identified with the system N^M, and the central decomposition of the latter is identified with the sheets obtained from the N%N orbits. Note that even in the case Ma = Mb, the decomposition of ZaZ* and Z*Za can be very different (e.g. case Z25 for q-S3 [25, page 358]).
4. WZW and quantum doubles There are two particular classes of examples of braided factors which concern us here. The first as we have alluded to are related to the statistical mechanical IRF (interaction-rounda-face) models which began our discussion. They are the loop group nets of factors provided by the work of Wassermann [59] (e.g. for SU(n)). The second class are those of quantum doubles. Ocneanu's asymptotic subfactor can be regarded as the subfactor analogue of the Drinfel'd quantum double construction. This quantum double subfactor is basically the same as the Longo-Rehren inclusion [45] and is a way of yielding braided systems from not necessarily commutative systems. The modular data from a quantum double subfactor was first established by Ocneanu [22, Section 12.6] and the theory developed by Izumi [37]. The situation with WZW is well studied although not totally understood. See [21] for a review. Here we focus instead on quantum doubles, particularly of finite groups. We summarise the situation for some key examples as follows. Note first however that abelian groups of sectors can be braided in a number of ways. One could have simply an abelian
Modular invariant partition functions . . .
471
group of automorphisms where the braiding is totally degenerate. We will consider the double of these systems which are then non-degenerately braided as well as doubles of other finite non-abelian systems. (At the other extreme, an abelian group of sectors could still be represented by a system of automorphisms which is non-degenerately braided such as SU(n) at level 1 which has Z„ fusion rules). — q double of Z2. Here all six normalised modular invariants are realised, as first shown by [9] since this is also the case of 50(16^) at level one, which interestingly enough has some non-symmetric modular invariants. — q double of Z3. Here all 8 normalized modular invariants are realised, as shown in [25]. — q double of S3. There are a total of 48 normalised modular invariants, but only 28 can be realised by subfactors. Of the remaining 20, only 6 are nimble (i.e., they possess compatable nimreps) but 14 are nimless. This was analyzed in detail in [25], a key tool being the fusion rule algebra which the sufferable modular invariants must satisfy, and the previous analysis of the q-double of Z3. This is natural as Z3 is a normal subgroup and the structures of the q-doubles are inherently closely related. It would be natural to expect the quantum double of Z3 to often appear as the neutral system. — q double of D^1}. There are three subfactors whose principal graph is Dg . The modular data for one coincides with the modular data for the quantum double of S3. The others have conjugate modular data, with 9 modular invariants of which 8 are sufferable and one nimless. — q double of E6- The E6 system (coming from the E6 SU{2) modular invariant or the conformal embedding SU(2)\o C SO(5)x) is commutative but not braided. The quantum double has 10 primary fields with modular date computed in [38]. There are exactly 4 modular invariants all whom can be realised [24]. — q double of the Haagerup system. The first irreducible finite depth (i.e., rational) subfactor with index greater than four is that of Haagerup [2] A C B. The A-A system has 6 elements {id, a,a2,p, ap, a2p} with noncommutative fusion rules [a] 3 = [id],
[a}[p] = [p\[a]2,
[p]2 = [id] © [p] © [ap] © [a2p].
From the last relation, we see that d2p — 1 + 3dp where dp is the statistical dimension of p, so that the subfactor A C. B has index (5 + \/l3)/2. The B-B system {id, a, b, c} has the commutative fusion rules: [a]2 = [id] © [a] © [6] © [c], [a][b] = [a]®[c],
[b}2 = [id] © [c],
[a][c] = [a]®[b]®[c],
[c]2 = [id] © 2[a] © [b] © 2[c], [6][c] = [a] © [b] © [c].
The modular data of the quantum double of this system has been determined in [38] with 12 primary fields. Here there are 28 modular invariants, of which 7 are known to be sufferable and 20 to be insufferable [26]. These sufferable modular invariants can be identified with a fusion rule subalgebra of the sufferable q-53 modular invariants. (This is not too surprising, as the A-A system has Z3 as a subsystem and is a perturbation of the usual S3 multiplication table [a] 3 = [id], [a][p] = [p][a]2, [p]2 = [id]) and similarly the B-B system reduces to the fusion table for .S3 when ignoring [c]). For details see [26].
472
DAVID E. EVANS
5. Boundary conformal field theory and statistical mechanics What does all this mean for boundary conformal field theory and statistical mechanics? The system N%M labels the boundary fields and M%M labels the twist or defect lines. The basic torus Hilbert space of equation (7) decomposes as tf = 0 ^ # A ® # M -
(17)
This means that when we insert a twist or defect line x consistent with the underlying symmetry, Z(T) = t r ( : r e 2 7 r i r ( L o - c / 2 4 ) e - 2 7 r ^ ( i o - c / 2 4 ) ) , (18) we have a degree of freedom given by J^A Zj [52]. This of course has at least the same cardinality of M%MSimilarly, let us move from the torus to a situation with a boundary. The Hilbert space now for a boundary condition 6 decomposes as He = ($exHx.
(19)
Consider for simplicity the case where the Verlinde fusion rules are multiplicty free and where He is also multiplicity free in that 0\ is 1 for say A in some subset J of N%N and 0 otherwise. For each A there would be a boundary field operator ip\(x) with operator product expansion k
The consistency of the operator product through associativity is the sewing relation W W
l^lwjkwietel
ij lk~ eeJ
where F are the N%N 6j-symbols. This means that {wv\^} defines an element w in Hom(0,6 2 ) such that w2 = 0(w)w where 6 is the endomorphism ®\0\\. In this way the boundary field set up of equation (20) is just the condition that there is a Q-system (9, v, w) for this 9 where v is in Hom(id, 9). In other words, by Longo's Q-system characterisation these are precisely the conditions that there exists a subfactor N C M whose dual canonical endomorphism is 0. In this way Q-system corresponds to a boundary condition. Translating further, a N-M sector (3 corresponds [21, Lemma 3.1] to a module 1(3 for the Q-system (6, v, w) in the language of [49]. The latter is interpreted as boundary changing fields in [27, Section 4.4], One new feature which appears when going from 5(7(2) to 5t/(3), 5(7(4) etc., is the presence of the conjugate modular invariant adding to the natural constructions of orbifold and conformal embedding invariants. What we would like to do is realise the statistical mechanical model which gives rise to the modular invariant in the continuum limit. This has been studied in [3]. Nimrep graphs for the conjugate modular invariant were discussed in [33,53] but a modular invariant and nimrep does not usually completely describe a rational conformal field theory. The programme was begun in [3] to provide the additional data through the corresponding integrable lattice models. For a SU{n)k modular invariant Z and corresponding nimrep G, there may exist an integrable interaction-round-a-face lattice
Modular invariant partition functions . . .
473
model whose continuum limit provides a realisation of the SU(n)k_1 x SU(n)1/SU(n)k coset conformal field theory with torus partition function described by Z. In such a model the spin states are the labels of the rows and columns of the nimrep matrices G\, with adjacency condition for neighbouring spins for the nimrep graph Gf for the fundamental vector representation / of SU(n)k, and the Boltzmann weights at some value of the spectral parameter provide a representation of the Hecke-.algebra. T h e approach of [33,53] does not yield nonnegativity of the nimreps in general. However a-induction as we have described here provides construction of modular invariants and associated nimreps in which the nimrep matrices are guaranteed to have non-negative entries. If this were applied to the case of the SU(n)k conjugate modular invariant, t h e critical SU(n)k Boltzmann weights would be used together with intertwiner cells to construct braided subfactos, which through a-induction would be expected to lead t o the nimreps of [33,53].
Acknowledgement This work is supported by the E U QSNG network in Q u a n t u m Spaces—Noncommutative Geometry, and the E P S R C network A B C - K L M .
References 1. H. Araki, D. E. Evans, "On a C*-algebra approach to phase transition in the two dimensional Ising model", Comm. Math. Phys. 91, 489-503 (1983). 2. M. Asaeda, U. Haagerup, "Exotic subfactors of finite depth with Jones indices (5 + vT3)/2 and (5 + V?7)/2", Comm. Math. Phys. 202, 1-63, (1999). 3. R. E. Behrend, D. E. Evans, "Integrable lattice models for conjugate An ', J. Phys. A, to appear; arXiv:hep-th/0309068. 4. R. E. Behrend, P. A. Pearce, "Integrable and conformal boundary conditions for sl(2) A-D-E lattice models and unitary minimal conformal field theories", J. Stat. Physics 102, 577-640 (2001). 5. R. E. Behrend, P. A. Pearce, V. B. Petkova, J.-B. Zuber, "Boundary conditions in rational conformal field theories", Nucl. Phys. B 570, 525-589 (2000). 6. J. Bockenhauer, D. E. Evans, "Modular invariants, graphs and a-induction for nets of subfactors, I", Comm. Math. Phys. 197, 361-386 (1998). 7. J. Bockenhauer, D. E. Evans, "Modular invariants, graphs and a-induction for nets of subfactors II", Comm. Math. Phys. 200, 57-103 (1999). 8. J. Bockenhauer, D. E. Evans, "Modular invariants, graphs and a-induction for nets of subfactors, III", Comm. Math. Phys. 205, 183-228 (1999). 9. J. Bockenhauer, D. E. Evans, "Modular invariants from subfactors: Type I coupling matrices and intermediate subfactors" Comm. Math. Phys. 213, 267-289 (2000). 10. J. Bockenhauer, D. E. Evans, "Modular invariants from subfactors", in Quantum symmetries in theoretical physics and mathematics (Bariloche, 2000), Contemp. Math. 294, Amer. Math. Soc, Providence, RI, 2002, pp. 95-131. 11. J. Bockenhauer, D. E. Evans, "Modular invariants and subfactors", Mathematical physics in mathematics and physics (Siena, 2000), Fields Inst. Comm. 30, Amer. Math. Soc, 2001, pp. 1 1 37. 12. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, "On a-induction, chiral generators and modular invariants for subfactors", Comm. Math. Phys. 208, 429-487 (1999). 13. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, "Chiral structure of modular invariants for subfactors" Comm. Math. Phys. 210, 733-784 (2000).
474
DAVID E. EVANS
14. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, "Longo-Rehren subfactors arising from ainduction", Publ. RIMS, Kyoto Univ. 37, 1-35 (2001). 15. A. Cappelli, C. Itzykson, J.-B. Zuber, "The A-D-E classification of minimal and A[^ conformal invariant theories", Coram. Math. Phys. 113, 1-26 (1987). 16. J. Cardy, "Operator content of two dimensional conformally invariant theories", Nucl. Phys. B 270, 186-204 (1986). 17. A. L. Carey, D. E. Evans, "The operator algebras of the two dimensional Ising model", in Braids, Contemporary Mathematics 78, 117-165 (1988). 18. A. Coste, T. Gannon, R. Philippe, "Finite group modular data", Nucl. Phys. B 581, 679-717 (2000). 19. S. Doplicher, R. Haag, J. E. Roberts, "Fields, observables and gauge transformations. II", Comm. Math. Phys. 15, 173-200 (1969). 20. D. E. Evans, "Fusion rules of modular invariants", Rev. Math. Phys. 14, 709-732 (2002). 21. D. E. Evans, "Critical phenomena, modular invariants and operator algebras", Operator algebras and mathematical physics (Constanta 2001), J. Cuntz, G. A. Elliott, S. Stratila et al., eds., The Theta Foundation, Bucharest, 2003. 22. D. E. Evans, Y. Kawahigashi, Quantum symmetries on operator algebras, Oxford University Press, 1998. 23. D. E. Evans, J. T. Lewis, "On a C"*-algebra approach to phase transition in the two dimensional Ising model II", Comm. Math. Phys. 102, 521-535 (1986). 24. D. E. Evans, P. R. Pinto, "Modular invariants and their fusion rules", in Advances in Quantum Dynamics (Mount Holyoke, 2002), Contemp. Math., Amer. Math. Soc, Providence, RI, to appear. 25. D. E. Evans, P. R. Pinto, "Subfactor realisation of modular invariants", Comm. Math. Phys. 237, 309-363 (2003). 26. D. E. Evans, P. R. Pinto, "Modular invariants and the double of the Haagerup subfactor", in preparation. 27. J. Fuchs, I. Runkel, C. Schweigert, "TFT construction of RCFT correlators I: Partition functions", Nucl. Phys. B 646, 353-497 (2002); arXiv:hep-th/0204148. 28. J. Fuchs, I. Runkel, C. Schweigert, "TFT construction of RCFT correlators II: Unoriented world sheets", preprint arXiv:hep-th/0306164. 29. J. Fuchs, J. Frohlich, I. Runkel, C. Schweigert, "Correspondences of ribbon categories", preprint arXiv:hep-th/0309465. 30. J. Fuchs, C. Schweigert, "Category theory for conformal boundary conditions", Fields Inst. Communications 39, 25-71 (2003); preprint arXiv:math.CT/0106050. 31. J. Fuchs, C. Schweigert, "Solitonic sectors, a-induction and symmetry breaking boundaries", preprint arXiv:hep-th/0006181. 32. P. di Francesco, J.-B. Zuber, "SU(N) lattice integrable models associated with graphs", Nucl. Phys. B 338, 602-646 (1990). 33. M. R. Gaberdiel, T. Gannon, "Boundary States for WZW Models", Nucl. Phys. B 639, 471-501 (2002); arXiv:hep-th/0202067. 34. T. Gannon, "Modular data: the algebraic combinatorics of conformal field theory", preprint arXiv:hep-th/0103044. 35. T. Gannon, "Boundary Conformal field theory and fusion ring representations", Nucl. Phys. B 627, 506-564 (2002). 36. M. Izumi, "Subalgebras of infinite C*-algebras with finite Watatani indices, II. Cuntz-Krieger algebras", Duke Math. J. 91, 409-461 (1998). 37. M. Izumi, "The structure of sectors associated with Longo-Rehren inclusions, I. General theory", Comm. Math. Phys. 213, 127-179 (2000). 38. M. Izumi, "The structure of sectors associated with Longo-Rehren inclusions, II. Examples", Rev. Math. Phys. 13, 603-674 (2001). 39. M. Izumi, H. Kosaki, "On the subfactor analogue of the second cohomology", preprint, KyotoMath 2002-01.
Modular invariant partition functions . . .
475
40. V. F. R. Jones, "Index for subfactors", Invent. Math. 72, 1-25 (1983). 41. Y. Kawahigashi, R. Longo, "Classification of local conformal nets. Case c < 1." Annals of Mathematics, to appear; preprint arXiv:math-ph/0304022 . 42. Y. Kawahigashi, R. Longo, "Classification of two dimensional local conformal nets with c < 1 and 2-cohomolgy vanishing for tensor categories", Comm. Math. Phys. 244, 63-97 (2004). 43. R. Longo, "Index of subfactors and statistics of quantum fields, II", Comm. Math. Phys. 130, 285-309 (1990). 44. R. Longo, "Duality for Hopf algebras and for subfactors, I", Comm. Math. Phys. 159, 133-150 (1994). 45. R. Longo, K.-H. Rehren, "Nets of subfactors", Rev. Math. Phys. 7, 567-597 (1995). 46. G. Moore, N. Seiberg, "Naturality in conformal field theory", Nucl. Phys. B 313, 16-40 (1989). 47. A. Ocneanu, "Paths on Coxeter diagrams: From Platonic solids and singularities to minimal models and subfactors", in Lectures on operator theory, Rajarama Bhat, B. V. et al. (eds.) (Notes recorded by S. Goto), The Fields Institute Monographs, Providence, Rhode Island: AMS publications 2000, pp. 243-323. 48. A. Ocneanu, "The classification of subgroups of quantum SU(N)", in Quantum symmetries in theoretical physics and mathematics (Bariloche, 2000), Contemp. Math., 294, Amer. Math. Soc, Providence, RI, 2002, pp. 133-159. 49. V. Ostrik, "Module categories, weak Hopf algebras and modular invariants", Transform. Groups 8, 177-206 (2003); preprint arXiv:math.QA/0111139 . 50. V. Ostrik, "Module categories over the Drinfeld double of a finite group" Inter. Math. Research Notices 27, 1507-1520 (2003); preprint arXiv.math.QA/0202130. 51. V. B. Petkova, J.-B. Zuber, "The many faces of Ocneanu cells", Nucl. Phys. B 603, 449-496 (2001). 52. V. B. Petkova, J.-B. Zuber, "Conformal field theories, graphs and quantum algebras", preprint arXiv:hep-th/0108236. 53. V. B. Petkova, J.-B. Zuber, "Boundary conditions in charge conjugate sl(N) WZW theories", preprint arXiv:hep-th/0201239 . 54. K.-H. Rehren, "Braid group statistics and their superselection rules", in The algebraic theory of superselection sectors (Palermo, 1989), Singapore: World Scientific, 1990, pp. 333-355. 55. K.-H. Rehren, "Space-time fields and exchange fields", Comm. Math. Phys. 132, 461-483 (1990). 56. K.-H. Rehren, "Canonical tensor product subfactors", Comm. Math. Phys. 211, 395-406 (2000). 57. V. G. Turaev, "Quantum invariants of knots and 3-manifolds", in de Gruyter Studies in Mathematics, vol. 18, Walter de Gruyter, Berlin, 1994. 58. E. Verlinde, "Fusion rules and modular transformations in 2D conformal field theory", Nucl. Phys. B 300, 360-376 (1988). 59. A. Wassermann, "Operator algebras and conformal field theory III: fusion of positive representations of LSU(n) using bounded operators", Invent. Math. 133, 467-538 (1998). 60. F. Xu, "New braided endomorphisms from conformal inclusions", Comm. Math. Phys. 192, 349-403 (1998).
Classification of operator algebraic conformal field theories in dimensions one and two YASUYUKI KAWAHIGASHI
(U. Tokyo)
We formulate conformal field theory in the setting of algebraic quantum field theory as Haag-Kastler nets of local observable algebras with diffeomorphism covariance on the two-dimensional Minkowski space. We then obtain a decomposition of a twodimensional theory into two chiral theories. We give the first classification result of such chiral theories with representation theoretic invariants. That is, we use the central charge as the first invariant, and if it is less than 1, we obtain a complete classification. Our classification list contains a new net which does not seem to arise from the known constructions such as the coset or orbifold constructions. We also present a classification of full two-dimensional conformal theories. These are joint works with Roberto Longo.
1. Introduction Our main results, together with R. Longo, are classification results for conformal field theories, in the operator algebraic approach. We first briefly describe our basic framework for quantum field theory and its relation to a more conventional approach based on Wightman axioms using operator-valued distributions. Our framework is called algebraic quantum field theory or local quantum physics, and its standard textbook is [14] by R. Haag. We first explain our axiomatic setting on the 4-dimensional Minkowski space, although we will later work on lower dimensional spacetime. Recently, several attempts have been made on studies on curved spacetime or even noncommutative spacetime, but we will not deal with such topics in this review. In our setting, a physical system is described by a family of operator algebras -4(0) on a fixed Hilbert space H, where O is a bounded region in the Minkowski space. As such a region O, we consider only double cones, which are of the form (x + V+) n {y + VL), where x,y GM4 and V± = {z = (zo, zuz2,
z3) e K4 | z\ - z\ - z\ - z\ > 0, ±z0 > 0}.
We assume that we have a von Neumann algebra .4.(0) acting on H for each double cone O and the following properties hold. (An algebra of bounded linear operators on a Hilbert space is called a von Neumann algebra if it is closed under the ^-operation and weak-operator topology.) (1) (Isotony) For Oi C 0 2 , we have A(Oi) C A(02). (2) (Locality) If 0\ and 02 are spacelike separated, then elements in A{Oi) and A(02) commute. (3) (Poincare Covariance) There exists a unitary representation U of the universal covering of the restricted Poincare group satisfying A(gO) = UgA(0)U*.
476
Classification of operator algebraic conformal field theories ...
477
(4) (Vacuum) We have a unit vector 0 e H, unique up to phase, satisfying UgQ = Q for all elements g in the restricted Poincare group and \J0 A(0)Q is dense in H. (5) (Spectrum Condition) If we restrict the representation U to the translation subgroup, its spectrum is contained in the closure of V+. The isotony axioms simply states that we have more observables for a larger region. The locality axiom means that if we have two spacelike separated regions, then we have no interactions between them even at a speed of light, so the two operators taken from the two regions mutually commute. It is also called the Einstein causality. Covariance means that a "spacetime symmetry" acts as a symmetry of the family of operator algebras. We will later use a higher spacetime symmetry than restricted Poincare group. The vector Q is called a vacuum vector and it gives a vacuum state. The spectrum condition means stability. If we denote the set of the elements that are spacelike separated with all the elements of a region D by DL, then we have 0±J- — O for a double cone O. This is why we use only double cones. For a general region D, we could define A(D) as the von Neumann algebra generated by A(0) for all double cones O contained in D. Since the set of double cones is directed with respect to inclusions, we often say that the family A(0) is a net of von Neumann algebras. We also say a net of factors, if each von Neumann algebra .4(0) has a trivial center, which is often the case in the lower dimensional spacetime as below, since such a von Neumann algebra is called a factor. A basic idea is that all information about a certain physical system is contained in such a net .4.(0). In many cases, the local algebras .4(0) are all isomorphic, so each algebra A(0) itself does not contain physical information about the system. From a mathematical viewpoint, such a net of von Neumann algebras is simply a family of operator algebras subject to certain set of axioms, so we can study classification theory of such families of operator algebras up to an obvious notion of isomorphism. A useful and important tool for such a study is a representation theory of a net of von Neumann algebras. A basic tool to study a net of von Neumann algebras is its representation theory formulated by Doplicher-Haag-Roberts (DHR) [7]. Each operator algebra A(0) acts on a fixed Hilbert space from the beginning, but we can also consider a representation of a family of operator algebras on a different Hilbert space where we do not have a vacuum vector any more. A basic idea of Doplicher-Haag-Roberts is that if we assume a nice condition called the Haag duality and select a nice class of representations with their criterion, then each such representation is realized, up to unitary equivalence, as a certain endomorphism of (the norm closure of) \J0 A(0). Such an endomorphism is often called a DHR endomorphism. An important feature of endomorphisms is that they can be composed. This composition gives an operation in the set of DHR endomorphisms which plays a role of a tensor product. Through this operation (and others), mathematical structure of DHR endomorphisms becomes quite similar to that of unitary representations of a compact group, and it gives a C*-tensor category. We briefly mention a relation of the above approach to a more conventional one based on the Wightman axioms. In the setting of the Wightman axioms, one considers a family of operator-valued distributions {cpj(x)} on the Minkowski space. If we have such a family at the beginning, then, roughly speaking, we apply smooth functions supported in O to these distributions, apply bounded functional calculus to the resulting (unbounded) operators,
478
YASUYUKI KAWAHIGASHI
and let A(0) be the von Neumann algebra generated by these bounded operators. In this way, we should obtain a local net of von Neumann algebras. If we start with a local net A(0) of von Neumann algebras, we should obtain operator valued distributions {(f)j(x)} through a certain limiting procedure in which bounded regions 0 shrink to one point x. It is believed that the approach based on the Wightman axioms and the one based on local nets of von Neumann algebras are essentially equivalent, and there have been many works which study under what conditions we obtain one from the other, but the exact relations between the two approaches have not been fully understood yet.
2. Full and chiral algebraic conformal q u a n t u m field theories The above general framework in the previous section obviously works on a Minkowski space of any dimension. We now specialize on the 2-dimensional Minkowski space M and require higher symmetry than the general Poincare covariance. This is our approach to conformal field theory. Then through a chiral decomposition of a full algebraic conformal quantum field theory, we obtain a chiral algebraic conformal quantum field theory which is now described as a one-dimensional net of factors. After such a general description, we will briefly mention a relation to vertex (operator) algebras, which give another mathematical approach to chiral conformal field theories. We now work on a two-dimensional Minkowski space M. where we use t and x for the time and space coordinates, respectively. We have a von Neumann algebra A(0) on a fixed Hilbert space H for each double cone O in this Minkowski space M as above. We set L± = {t ± x = 0} and each double cone is a direct product I+ x I_, where I± are bounded intervals in L±, respectively. We consider the Mobius group PSL{2, R) which acts on R U {oo} as linear fractional transformations. In this way, we obtain a local action of the universal covering group PSL(2, R) on R. We impose the following axioms for our net of von Neumann algebras A(0) on H and call such a net a Mobius covariant net of von Neumann algebras. (See [18] for more details.) (1) (Isotony) For Oi C 0 2 , we have A{0{) C A(02). (2) (Locality) If 0\ and 0 2 are spacelike separated, then elements in A{0\) and .4(02) commute. (3) (Mobius Covariance) There exists a unitary representation U of ~PSL(2,R)
xP~SL(2,R)
on H such that for every double cone O, we have A(gO) where W is a connected neighbourhood of the identity satisfying gO C M for all g GW. (4) (Vacuum) We have a unit vector Cl e H, unique up to for all elements g and ( J 0 A(O)0, is dense in H. (5) (Positive energy) The one-parameter unitary subgroup time translation has a positive generator.
= UgA(0)U* when g &W in PSL(2,R) x PSL(2,R) phase, satisfying UgQ, = £1 of U corresponding to the
We now further strengthen the axiom of Mobius covariance as follows. Let G be the quotient of PSL(2,R) xPSL(2,R) modulo the relation (r 27r ,r_ 27r ) = (id,id). Then it turns
Classification of operator algebraic conformal field theories ...
479
out that our representation U as above gives a representation of this group G, due to the spacelike locality. We then find that our net A{0) extends to a local G-covariant net on the Einstein cylinder £ = Rx S1, which is the cover of the 2-torus obtained by lifting the time coordinate from S1 to R. We also have several consequences from the above set of axioms. See [18, Proposition 2.2], for example. Let Diff (R) be the group of the orientation preserving diffeomorphisms which are smooth at infinity. Then this group naturally acts on £ as a diffeomorphic action. Let Conf(£) be the group of global, orientation preserving conformal diffeomorphisms of £. This group is generated by Diff (R) x Diff (R) and G. If a Mobius covariant net A further satisfies the following axiom, we say that the net A is a local conformal net. This is the class we study. (Diffeomorphism covariance) The unitary representation U of G extends to a projective unitary representation of Conf {£) such that the extended net on £ is covariant. Furthermore, we have UgXU* = X for g e Diff(R) x Diff(R), it X € A(0) and g acts on O as identity. This gives our framework for conformal quantum field theory. We study a net A{0) as a family of von Neumann algebras satisfying the above set of axioms. Such a family here is also called a full algebraic conformal quantum field theory. The DHR theory works in this setting perfectly. Suppose we have a local conformal net A as above. Then for each bounded interval / C i + , we set A+(I) = f]jA(I x J ) . In this way, we have a family of von Neumann algebras A+ parameterized by bounded intervals J. We regard these von Neumann algebras as subalgebras of B(H+), where H+ is the closure of \Jj A+(I)£l. This family extends to a family A+(I), where I is any open, nondense, nonempty, and connected set of S 1 = R U {oo}. (Such J is simply called an interval in S1.) This family A+{I) satisfies the following conditions. We may take these as axioms for such a family. (See [13] for more details.) (1) (Isotony) For h C J 2 , we have A+(h) C A+(h)(2) (Locality) If I\ and I2 are disjoint, then elements in A+(h) and A+fa) commute. (3) (Diffeomorphism Covariance) There exists a projective unitary representation U of Diff(51) on H+ such that for every interval / , we have A+(gI) = UgA+(I)U*. Furthermore, we have UgXU* = X for g e Diff (S1), if X e. A(I) and g acts on / as identity. (4) (Vacuum) We have a unit vector Q £ H+, unique up to phase, satisfying UgQ, = fi for all elements g in the Mobius group PSX(2,R) and |J 7 A+(I)Cl is dense in H+. (5) (Positive energy) The one-parameter unitary subgroup of U corresponding to the rotation on 5 1 has a positive generator. A family A-+ satisfying the above set of axioms is called a chiral algebraic conformal quantum field theory. We can similarly define A- • We then have an embedding A+ (I) <8> A-(J) C A(I x J ) . The DHR theory works fine for a chiral algebraic conformal quantum field theory. Now, for two DHR endomorphisms p,
480
YASUYUKI KAWAHIGASHI
theories A±. In the second step, we classify the embedding A+(I) ® A-(J) which is a non-trivial subfactor, usually.
C A(I x J ) ,
3. Complete rationality and classification Our basic idea for classification is that if we have a certain nice condition, generally called "amenability", a simple set of invariants related to representation theory should give a complete classification. We have given a general idea along this line in [16], so here we only briefly explain the condition called complete rationality, which was introduced in [19] and plays a role of amenability in classification theory. Here we state complete rationality for a chiral algebraic conformal quantum field theory A(I), I C S1. We also have a version for a full algebraic conformal quantum field theory and we refer the reader to [18] for the definition in such a setting. Consider a chiral algebraic conformal quantum field theory A. Split S1 into In intervals, and label them I\, I2, • • •, hn in the counterclockwise order. Let /j.n be the Jones index of the subfactor A(h) V A{h) V • • • V A{hn-i) C {A(h) V A(h) V • • • V A(hn))'- (Note that we have this inclusion because of locality.) This number is independent of the way to split the circle. We remark that we automatically have /xi = 1, which is called the Haag duality. Complete rationality consists of the following three conditions. (1) (Strong additivity) Remove one point from an interval J and label the resulting two intervals as h,l2- Then we have A(I) = A{I{) V A(h). (2) (Split property) Consider two intervals I\,h with / i n / 2 = 0. Then the von Neumann algebra A(h) V A(h) is naturally isomorphic to A(h) ® .4(12 )• (3) (Finiteness of the ^t-index) We have fi2 < 00 • The main results in [19] give the following two conditions under complete rationality. (1) We have only finitely many equivalence classes of irreducible DHR endomorphisms of the net A. (2) The braiding naturally gives a unitary representations of SL(2, Z) whose dimension is the number of the equivalence classes in (1). This shows that the category of the DHR endomorphisms of the net A gives a modular tensor category, which plays an important role in theory of quantum invariants of 3-manifolds as in [31]. We next explain the first numerical invariant of a local conformal net, a central charge, of A. Let A be a local conformal net. (Here we do not need complete rationality.) Then we have a projective unitary representation of Diff (S1). Recall that the Virasoro algebra is the infinite dimensional Lie algebra generated by elements {Ln \ n £ Z} and c with relations [Lm, Ln] = (m - n)Lm+n
+ j^im3
- m)5 m ,_ n ,
and [L n ,c] = 0. This is unique, non-trivial one-dimensional central extension of the Lie algebra of Diff(5 1 ). We now obtain a representation of the Virasoro algebra and then the central element c is mapped to a scalar. This value is the central charge of the net A and is
Classification of operator algebraic conformal field theories ...
481
also denoted by c. It has been shown by Priedan-Qiu-Shenker [11] that this central charge value is in {1 - 6/m(m + 1) | m = 2,3,4,...} U [1, oo) and the values {1 - 6/m(m +1) | m = 2,3,4,...} have been realized by Goddard-Kent-Olive [12]. (The values in [1, oo) are easier to realize.) Jones has proved in his theory of index for subfactors [15] that the index value is in the set {4cos 2 n/m | m = 3,4,5...}U [4, oo] and all the values in this set can be realized. It is obvious that we have a formal similarity between the two cases. A relation between the Jones theory of subfactors and algebraic quantum field theory was found in [21], Our classification results give further deeper relations between the two. (See [8] for a general theory of subfactors and related topics.) In classification theory of subfactors, Ocneanu [26] has found a paragroup, which gives a combinatorial invariant for a subfactor through its representation theory. If the Jones index value is less than 4, the subfactor is of finite depth automatically, and this finite depth case is a special case of the amenable case which Popa's classification theorem [28] covers. We have shown in [17] that if the central charge value is less than 1, then the net of factors is automatically completely rational. Wassermann's construction of the SU{n)k nets based on loop group representations gives the first examples of chiral algebraic conformal quantum field theories and they are completely rational. The finiteness of the //-index for these nets was proved by Xu [34]. Xu also studied the coset and orbifold constructions in the setting of chiral algebraic conformal quantum field theory in [35,36]. They give completely rational nets by [23,36]. We briefly note that we have some formal similarity between our complete rationality and a condition in theory of vertex operator algebras, which is another mathematical approach to a chiral conformal field theory. They have a condition called C\-finiteness introduced by Zhu [38], which is formally analogous to the above finiteness of the //-index. See [10] for more details on vertex operator algebras. On a vertex operator algebra V, we have binary operations a^b, a,b e V, parameterized by integers n. The finiteness of the codimension dimVyV(_2)V is called the C2-finiteness condition. A vertex operator algebra V is said to be rational if every l/-module is completely reducible, and this condition implies that V has only finitely many inequivalent simple modules. Zhu has proved that if we have the C2-finiteness condition, then the modular group SL(2, Z) acts on the space of characters of all the mutually inequivalent simple V-modules. This finiteness of the codimension and the above finiteness of the //-index have some formal common similarity as follows. (1) The //-index is also a certain multiplicative codimension. (2) Both the codimension dim V/V(_k)V and the index //£ can be defined for any positive integer k. (3) The above codimension and the index are trivial for k = 1. (4) If the above codimension and the index are finite for k = 2, then they are also finite for all positive integers k, and we obtain a unitary actions of the modular group SL(2,Z) on certain natural finite dimensional spaces. However, the action of 51/(2, Z) in the setting of vertex operator algebras is on the space of characters while its action in the setting of representation categories of a net of factors is on the intertwiner spaces, and we do not have any direct relation between the two situations. It would be very interesting to clarify this formal analogy.
482
YASUYUKI KAWAHIGASHI
4. a-induction, modular invariants, and classification Here we explain our classification method for completely rational nets on S1 with central charge less than 1. Suppose we have a chiral algebraic conformal quantum field theory A(I), I C S1, with central charge c < 1. Then the projective representation U of the diffeomorphism group gives a subnet as follows. For an interval I C S1, we define B{I) be the von Neumann algebra generated by Ug, where g is a diffeomorphism which acts trivially outside of J. It is easy to see that this B(I) is a subalgebra of A(I) by the Haag duality and B gives a subnet in the sense of [23]. (Note that the vacuum vector is not cyclic for B.) We use a notation Vir c (7) for this subnet and call it a Virasoro subnet with central charge c. This net is among the coset constructions due to Xu [35], which relies on A. Wassermann's construction of 5?/(2)fc-nets [32]. (See [5] for more on the Virasoro nets.) We have shown in [17] that the subfactor Vir c (7) c A{I) has a trivial relative commutant and finite index, which is a quite nontrivial fact. For a net of subfactors Virc c A, we have a machinery of a-induction, which is analogous to a machinery of induction and restriction for representations of groups. For a DHR endomorphism A of the smaller net Vir c , we obtain an endomorphism a* of the larger net A. This is "almost" a DHR endomorphism, but not completely. This operation is regarded as an extension of an endomorphism and depends on a choice ± of the braiding on the system of DHR endomorphisms of the smaller net. This method was defined by Longo-Rehren [24] and many interesting properties and examples were studied by Xu [33] and Bockenhauer-Evans [1]. Ocneanu [27] had a graphical method based on a quite different motivation, and it was unified with the theory of a-induction by us in [2-4]. One of the .main results in [2] is that if we define a matrix Z by Zx^ = dimHom(a£, a~) for irreducible DHR endomorphisms A, fi of the smaller net, then this matrix is in the commutant of the unitary representation of SX(2,Z) arising from the braiding on the system of DHR endomorphisms of the smaller net. Obviously, each entry of Z is a nonnegative integer and we have Zoo — 1 for the vacuum representation denoted by 0. Such a matrix Z is called a modular invariant (of the unitary representation of SL(2, Z)). It is easy to see that for a given unitary representation of 5L(2,Z), we have only finitely many modular invariants, and this finite number is often quite small in concrete examples. For the Virasoro net Vir c , the corresponding modular invariants have been completely classified by Cappelli-Itzykson-Zuber [6] and they are labeled with pairs of A-D-E Dynkin diagrams with difference of the Coxeter numbers being 1. Also it is fairly easy to see in our current context that we have only so-called type I modular invariants in the classification of [6] where we have only An, Z?2n> E$, E% diagrams. In this way, starting with a chiral algebraic conformal quantum field theory A with central charge less than 1, we obtain a type I modular invariant matrix Z in the classification list of [6] labeled with pairs of the An-D2n-E6ts Dynkin diagrams with difference of the Coxeter numbers being 1. Our main result in [17] with R. Longo is that this correspondence gives a complete classification of chiral algebraic conformal quantum field theories. Note that we have no reason, a priori, to believe or expect that this correspondence from a conformal field theory to a matrix in a certain list is injective or surjetive, but we have proved both injectivity and surjectivity of this correspondence.
Classification of operator algebraic conformal field theories ...
483
Our classification list is as follows. (1) (2) (3) (4) (5) (6)
Virasoro nets with central charge c = 1 — 6/m(m + 1). Their simple current extensions of index 2. The exceptional net labeled with (Ee, A12). The exceptional net labeled with (Es, A 3 0 ). The exceptional net labeled with (Ai0,E6). The exceptional net labeled with (A^jEs).
The first two of the above exceptional ones are realized as the coset constructions for SU(2)n C SO(5)1®SU(2)1 and SU(2)29 C (G 2 )i ® SU(2)1. They were first considered by Bockenhauer-Evans [1, II, Subsection 5.2] as possible candidates realizing the corresponding modular invariants in the Cappelli-Itzykson-Zuber list, but they were unable to show that these coset constructions indeed produce the desired modular invariants. With our complete classification, it is easy to identify these cosets with the above two in our list. Recently, Koster [20] identified the third exceptional net in the above list, (A10, Ee), with the two cosets SU(9)2 C (^8)2 and (#8)3 C (E8)2®{Eg)i, assuming that the local conformal nets (E$)k have the expected WZW-fusion rules. The last one, (A28,E&), does not seem to be a coset nor an orbifold, and it appears to be a genuine new example. Carpi [5] and Xu [37] recently obtained certain classification results of chiral algebraic conformal quantum field theories with central charge equal to 1, independently.
5. Classification of 2-dimensional theories and 2-cohomology We now explain how to obtain a classification of full algebraic conformal quantum field theories with central charge less than 1, using the results in the previous section. This is our joint work with R. Longo [18]. As we mentioned before, our strategy is to study a subfactor A+(I) ® A-(J) C A(I x J ) , where A(I x J ) is a given full algebraic conformal quantum field theory with central charge 1. By the classification list in the previous section, we have a complete information on the chiral ones A±. We now assume the so-called parity symmetry condition for a full algebraic conformal quantum field theory A, which in particular implies that A+ and A- are isomorphic and they contain the same Vir c . Then the dual canonical endomorphism for the subfactor Virc(/)<S>Virc( J ) C A(IxJ) gives a decomposition ® A Zx^X®^, where A, /x are irreducible DHR endomorphisms of Vir c . In a more general setting, the following was conjectured by Rehren and proved by Miiger [25]. Theorem 5.1. Under the above conditions, the following are equivalent. (1) The net A has only the trivial representation theory. (2) The fi-index of the net A is 1. (3) The matrix Z above is a modular invariant. In this way, we obtain a modular invariant Z for Virc in the classification list of CappelliItzykson-Zuber [6] from a full algebraic conformal quantum field theory A with parity symmetry and trivial representation theory. Then we can prove as in [18] that a full algebraic conformal quantum field theory A with parity symmetry has only the trivial representation
484
YASUYUKI KAWAHIGASHI
theory if and only if it is maximal with respect to extensions. Thus we obtain a modular invariant in [6] from full algebraic conformal quantum field theory A with parity symmetry and maximality. Our main result in [18] with R. Longo shows that this gives a bijective correspondence. Note that the modular invariants in [6] are labeled with pairs of A-D-E Dynkin diagrams with difference of the Coxeter numbers being 1 as before, but we now do not have a restriction to so-called type I modular invariants, so the Dynkin diagrams D^n+i and E-j do appear. Surjectivity of this correspondence is not difficult by Rehren's result [30], together with our previous analysis in [3,17]. To prove injectivity of this correspondence, we need to study the subfactor A+{I) ® A-(J) C A{I x J ) , where we have a natural identification of A+(I) and A-(J) and the dual canonical endomorphism decomposes in the form of ® A
Acknowledgments We gratefully acknowledge the support of GNAMPA-INDAM and MIUR (Italy) and Grantsin-Aid for Scientific Research, JSPS (Japan). We thank R. Longo, A. Matsuo, and C. Schweigert for useful correspondences on the related materials.
References 1. J. Bockenhauer, D. E. Evans, Comm. Math. Phys. 197, 361 (1998); II 200, 57 (1999); III 205, 183 (1999). 2. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, Comm. Math. Phys. 208, 429 (1999). 3. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, Comm. Math. Phys. 210, 733 (2000). 4. J. Bockenhauer, D. E. Evans, Y. Kawahigashi, Publ. RIMS, Kyoto Univ. 37, 1 (2001). 5. S. Carpi, arXiv:math.0A/0306425, to appear in Comm. Math. Phys. 6. A. Cappelli, C. Itzykson, J.-B. Zuber, Comm. Math. Phys. 113, 1 (1987). 7. S. Doplicher, R. Haag, J. E. Roberts, I Comm. Math. Phys. 23 199 (1971); II 35, 49 (1974).
Classification of operator algebraic conformal field theories . . .
485
8. D. E. Evans, Y. Kawahigashi, Quantum symmetries on operator algebras, Oxford University Press, Oxford, 1998. 9. K. Fredenhagen, K.-H. Rehren, B. Schroer, I. Comm. Math. Phys. 125, 201 (1989), II. Rev. Math. Phys. Special issue, 113 (1992). 10. E. Frenkel, D. Ben-Zvi, Vertex algebras and algebraic curves, Math. Surv. Monog. 88, Amer. Math. Soc, 2001. 11. D. Friedan, Z. Qiu, S. Shenker, Comm. Math. Phys. 107, 535 (1986). 12. P. Goddard, A. Kent, D. Olive, Comm. Math. Phys. 103, 105 (1986). 13. D. Guido, R. Longo, Comm. Math. Phys. 181, 11 (1996). 14. R. Haag, Local Quantum Physics, Springer-Verlag, Berlin-Heidelberg-New York, 1996. 15. V. F. R. Jones, Invent. Math. 72, 1 (1983). 16. Y. Kawahigashi, arXiv:math.0A/0211141. 17. Y. Kawahigashi, R. Longo, arXiv:math-ph/0201015, to appear in Ann. Math. 18. Y. Kawahigashi, R. Longo, arXiv:math-ph/0304022, to appear in Comm. Math. Phys. 19. Y. Kawahigashi, R. Longo, M. Miiger, Comm. Math. Phys. 219, 631 (2001). 20. S. Koster, arXiv:math-ph/0303054. 21. R. Longo, I Comm. Math. Phys. 126, 217 (1989); II Comm. Math. Phys. 130, 285 (1990). 22. R. Longo, Comm. Math. Phys. 159, 133 (1994). 23. R. Longo, Comm. Math. Phys. 237, 7 (2003). 24. R. Longo, K.-H. Rehren, Rev. Math. Phys. 7, 567 (1995). 25. M. Miiger, "Extensions and modular invariants of rational conformal field theories", in preparation. 26. A. Ocneanu, "Quantized group, string algebras and Galois theory for algebras", in Operator algebras and applications, vol. 2, ed. D. E. Evans and M. Takesaki, Cambridge University Press, Cambridge, 1988, p. 119. 27. A. Ocneanu, "Operator algebras, topology and subgroups of quantum symmetry — construction of subgroups of quantum groups" (written by S. Goto and N. Sato), in Taniguchi Conference in Mathematics Nara '98 Adv. Stud. Pure Math. 31, Math. Soc. Japan, 2000, p. 235. 28. S. Popa, Acta Math. 172, 352 (1994). 29. S. Popa, Math. Res. Lett. 1, 409 (1994). 30. K.-H. Rehren, Comm. Math. Phys. 211, 395 (2000). 31. V. G. Turaev, Quantum Invariants of Knots and 3-manifolds, Walter de Gruyter, 1994. 32. A. Wassermann, Invent. Math. 133, 467 (1998). 33. F. Xu, Comm. Math. Phys. 192, 347 (1998). 34. F. Xu, Commun. Contemp. Math. 2, 307 (2000). 35. F. Xu, Comm. Math. Phys. 211, 1 (2000). 36. F. Xu, Proc. Nat. Acad. Sci. U.S.A. 97, 14069 (2000). 37. F. Xu, arXiv:math.QA/0303266. 38. Y. Zhu, J. Amer. Math. Soc. 9, 237 (1996).
An application of the Lieb-Thirring inequality in quantum information theory CHRISTOPHER K I N G
(Northeastern U., Boston)
Quantum information theory has generated several interesting conjectures involving products of completely positive maps on matrix algebras, also known as quantum channels. In particular it is conjectured that the minimal entropy output state from a product channel is always a product state. We show how the Lieb-Thirring inequality can be used to prove this conjecture for one special case, namely when one of the components of the product channel is of the type known as a diagonal channel.
1. Introduction The minimal output entropy of a quantum channel $ is defined by Sml„(*)=infS($(p)),
(1)
p
where S is the von Neumann entropy, and the inf runs over states in the domain of $. The following additivity property is conjectured. Conjecture 1.1. Let $ and ^ be any quantum channels, that is completely positive, tracepreserving maps on finite-dimensional matrix algebras. Then 5 m i n ( $ ® * ) = 5 m i n ($) + 5 m i n ( * ) .
(2)
Within the last year, Shor [11] proved that Conjecture 1.1 is equivalent to several other outstanding conjectures in quantum information theory, among them additivity of Holevo capacity of a quantum channel, and additivity of the entanglement of formation. Thus a proof of Conjecture 1.1 would settle quite a few outstanding problems in the field. The additivity conjecture has been proved for several special classes of channels [6,7,10]. The Lieb-Thirring inequality [9] was a key ingredient in several of those proofs. The purpose of this paper is to illustrate the application of the Lieb-Thirring inequality to this problem, by using it to demonstrate (2) for a particularly simple class known as the 'diagonal' channels. The problem is attacked by making use of the maximal output p-norm, also called the maximal output purity of a channel [1], which is defined for p > 1 by M*)
= s u p | | $ ( p ) | | p = sup(Tr($( / 0 )) p ) 1 / P . p
P
v
(3)
'
The derivative of vv{§) at p = 1 is the negative minimal output entropy, so Conjecture 1.1 is a consequence of the following stronger conjecture. Conjecture 1.2. There is some po > 1, such that for all quantum channels $ and $, and all 1 < p < po, l/ p ($® ¥ ) = !/„($) !/„(*). (4)
486
An application of the Lieb-Thirring inequality in quantum information theory
487
In this paper we consider a class of channels known as the diagonal channels, and show how the Lieb-Thirring inequality can be used to derive (4) under the assumption that at least one of the channels $ or ^ is in this class. It turns out that the multiplicativity result for diagonal channels holds for all p > 1, so this might lead one to hope that p0 = oo in Conjecture 1.2. However it is known that multiplicativity fails in general for p > 5 [12], and indeed this probably provides evidence that the strategy used in this paper cannot be directly extended to prove Conjecture 1.2 in the general case. Nevertheless it seems worthwhile to explain the approach used, as the techniques may be useful for other reasons, and the results may have other applications in quantum information theory. The method of proof is quite similar to the approach used by the author to prove Conjecture 1.2 for the class of entanglement-breaking channels [5].
2. Statement of results Since we will be concerned with p-norms from now on, the trace-preserving condition for quantum channels is unimportant, and so we deal just with completely positive (CP) maps. The diagonal class of channels was described by Landau and Streater [8]. We define the Hadamard product of two n x n matrices A and B by (A*B)ij=AijBij.
(5)
Definition 2.1. The CP map $ is called diagonal if there is a positive semidefinite matrix C such that
(7)
where Diag(V') is the diagonal nx n matrix with the components of \ip) along the diagonal. Using the spectral representation it follows that a map is diagonal if and only if it has a Kraus representation with all diagonal matrices. Our main result is stated below in Theorem 2.1. Theorem 2.1. Let $ be a diagonal map, and let ^ be any other CP map. Then for all P>1, i/ p ($ ® tf) = !/„($) !/„(*).
(8)
The main tool used in the proof is the Lieb-Thirring inequality [9], which we now state. Let K > 0 be a positive semidefinite n x n matrix, and let V be a m x n matrix, for some integers m,n. Then for all p > 1, Tr (VKV*Y
< Tr(V*V)p/2
Kp (V*V)p/2
= Ti{V*V)p Kp .
(9)
There are several proofs of this inequality [9], [2]. The original proof of Lieb and Thirring employs Epstein's concavity theorem [3], which is based on a combination of spectral theory and analytic continuation methods.
488
CHRISTOPHER KING
3. The factorization The goal of this section is to rewrite the output of the product channel $®\I> in the factorized form VKV* so that (9) can be applied. We assume that $ is a diagonal channel which acts by Hadamard product with the nxn matrix C. Let p be a state on Ckn, that is a positive semidefinite kn x kn matrix with trace 1, for some k > 1. Then p can be written as a n x n block matrix where the blocks (p)ij are k x k matrices. The diagonal blocks (p)u are positive semidefinite, and we define a 4 = Tr(p)u . (10) Define a new kn x kn matrix r with blocks (r)y = (aia,-)" 1 / 2 (Ph
(11) 1 2
and let A denote the n x n matrix with entries A^ = (atietj) / . Then p can be written as a Hadamard product of A with r, that is p=(A®Jk)*T,
(12)
where Jfc is the k x k matrix with all entries equal to 1:
n Jfc =
... A :
:
\1
...
.
(13)
1/
Notice that Jfc acts as the identity for the Hadamard product. Furthermore this decomposition commutes with the action of * on the second factor, that is (/®*)(p) = (A®Jfc)*((-f®*)W)-
(14)
The map $ ® 7 acts on (14) by a Hadamard product with the matrix C on the first factor. This Hadamard product acts just on the matrix A, and the result is ($ ® *)(p) = ($(A) ® J fc ) * ( ( / ® * ) ( r ) ) .
(15)
The next step is to factorize the matrix (I ® * ) ( r ) . To do this, let V j , . . . , Vn be the k x kn matrices which are the block-rows of its square root, that is 'V,'
((/®*)(r))
1/2
= [ : |.
(16)
KVU,
Then it follows that
/ViVi* ... ViV^N (/®*)(r)= | : | (VT ... KT)= : •.. : . \V„VY ... VnV*J Notice that the diagonal terms are ^((T)U) with Tr(T)ij = 1 it follows that
(17)
= ViV*, and since (T)U is positive semidefinite
IIWUP
(18)
An application of the Lieb-Thirring inequality in quantum information theory
489
Applying the factorization (17) to (15) gives /GMnVM
...
(*®¥)(p)=
:
9(A)lnV1V'\
•
.
\$(A)nlVnV{ ...
.
:
.
(19)
$(A)nnVnvJ
Now the right side of (19) can be rewritten as a product of three matrices: /Vi
0
...
0\
o v2 ... o \o
o ...
*(A)
'*(A)nI'
vj
K*(A)nii'
...
lnI"
*{A)nnr,
/V{
0
o v2* \0
^
0
...
(20)
V*)
where the I' in the middle term is the kn x kn identity matrix. This is the same as /Vi 0
0 V2
... ...
(V?
0\ 0
0
0
0
F2*
(21)
($(4)®/')
\o
o ... v j
\o
o
V*J
Therefore (19) has been written in the factorized form (22)
($®*)(p) = i / i f r , 2
where V is the kn x kn
matrix (Vi
0
...
0\
o v2 ... o
v =
\0
0
(23)
VJ
...
and (24)
X = ($(A) ® / ' ) . 4. A p p l y i n g t h e inequality
The last step is to apply the Lieb-Thirring inequality (9) to (22). It follows from (23) that (V*V)P is block diagonal, that is
/OW
K \
o
0 P
\
0
{V2*V2)
0
o
0
(v:vny)
(25)
Also Kp = ($(A)) P ®/', so the diagonal blocks of Kp are just the diagonal entries of ($(A))P multiplied by the identity matrix / ' . Hence n
T r ( V * v ) Kp = ^Tr(V*Vi)p
(($(A)) P )
.
(26)
The matrices V*Vi and ViV* share the same nonzero spectrum, and so (18) can be used to bound the terms Ti(V*Vi)p on the right side of (26). This gives
490
CHRISTOPHER KING
P
n
Tr (yv) K* < Y, ("PO*))"^^))"),.
(2?)
i=l
= (up(*))P
Tr(*{A))>
.
(28)
Furthermore since p is a state, it follows t h a t TiA = Trp=l, and hence T r ( $ ( j 4 ) ) p can be bounded using the definition (3). P u t t i n g it all together we deduce T r ( $ ® *)(Py
< (^P(*))P(^($))P.
(29)
From this it follows t h a t i> P (* ® # ) < i/ p ($) i/ p (*) •
(30)
T h e inequality in t h e other direction follows easily by restricting to product states, hence Theorem 2.1 is proved.
Acknowledgments This work was supported in part by National Science Foundation Grant DMS-0101205. T h e author is grateful to E. Lieb and M. B. Ruskai for first demonstrating how t h e LiebThirring inequality could be applied to the additivity problem, and for allowing their work to be included in the Appendix of t h e paper [4].
References 1. G. G. Amosov, A. S. Holevo, R. F. Werner, "On Some Additivity Problems in Quantum Information Theory", Problems in Information Transmission 36, 305-313 (2000). 2. H. Araki, "On an inequality of Lieb and Thirring", Letters in Mathematical Physics 19, 167-170 (1990). 3. H. Epstein, "Remarks on two theorems of E. Lieb", Communications in Mathematical Physics 31, 317-325 (1973). 4. C. King, "Maximization of capacity and p-norms for some product channels", Journal of Mathematical Physics 4 3 , 1247-1260 (2002). 5. C. King, "Maximal p-norms of entanglement breaking channels", Quantum Information and Computation 3, 186-190 (2003). 6. C. King, "Additivity for unital qubit channels", Journal of Mathematical Physics 4 3 , 4641-4653 (2002). 7. C. King, "The capacity of the quantum depolarizing channel", IEEE Transactions on Information Theory 49, 221-229 (2003). 8. L. J. Landau, R. F. Streater, "On Birkhoff's theorem for doubly stochastic completely positive maps of matrix algebras", Linear Algebra and its Applications 193, 107-127 (1993). 9. E. Lieb, W. Thirring, "Inequalities for the moments of the eigenvalues of the Schrodinger Hamiltonian and their relation to Sobolev inequalities", in Studies in Mathematical Physics, E. Lieb, B. Simon, A. Wightman eds., Princeton University Press, 1976, pp. 269-303. 10. P. W. Shor, "Additivity of the classical capacity of entanglement-breaking quantum channels", Journal of Mathematical Physics 4 3 , 4334-4340 (2002). 11. P. W. Shor, "Equivalence of additivity questions in quantum information theory", preprint arXiv:quant-ph/0305035; to appear in Communications in Mathematical Physics. 12. R. F. Werner, A. S. Holevo, "Counterexample to an additivity conjecture for output purity of quantum channels", Jour. Math. Phys. 43, 4353-4357 (2002).
P a t h integrals and stochastic analysis Session organized by S. ALBEVERIO (Bonn) and G. BEN AROUS (Lausanne)
This page is intentionally left blank
Connes-Hida Calculus in index theory R E M I LEANDRE (U. Nancy I)
We apply Hida calculus to understand the algebraic properties of the auxiliary bundle in index theory.
1. Introduction There are two ways to see the relations between the index theorem and the algebraic properties of the complex auxiliary bundle associated to a Dirac operator: — The first one uses Bismut-Chern character over the free loop space, associated to the equivariant cohomology of the free loop space. — The second one uses the cyclic complex associated to the algebra of complex valued functions on a manifold. It enters in the heart of non-commutative differential geometry of Connes. On the other hand, there are 3 types of infinite dimensional distribution theories: — The first one is Hida distribution theory, which uses the Fock space; it is Hilbertian and algebraic. — The second one is Watanabe's distribution theory, which uses the algebra of functionals which belong to all the Sobolev spaces over the Wiener space of the Malliavin Calculus, and which is probabilistic. — The last is Connes's non commutative differential geometry. It is based upon the cyclic complex associated to some Banach algebras. The goal of this communication is to apply tools of Hida Calculus in order to understand the role of the auxiliary bundle in the Index theory in these two aspects. For details, the reader can see [1] and [2] and the references therein.
2. The case of the Dirac operator and the Index theorem Let us suppose that M is a compact orientable spin manifold. We can consider the Dirac operator D over M, which acts over the section of spinors over M. The spin bundle Sp is divided in two bundles Sp+ and Sp-, and D interchanges the section of Sp+ and of Sp-. It is a symmetric operator. Moreover the Clifford bundle acts over Sp. We can consider the bundle of forms 0 . To a form u, there corresponds an element of the Clifford bundle called u)c\. We consider as Hilbert space HQ the space of L 2 forms L(fi) complexified, and the Hilbert space of this part is H = HQ © HQ. Over HQ, we can consider the differential operator dd*+d*d+l, and the corresponding Sobolev spaces Hp. The Ck norm of a form can be estimated in term of its Sobolev norms Hp for some p > k. We can consider the space DQQ of Hida test functions associated to H. Let us introduce P(M) the smooth path space and
493
494
REMI LEANDRE
L{M) the smooth free loop space, that is the space of smooth map from the circle Si into M. P(M) and L(M) are Frechet manifolds. To a = X > ! _ 1 a 4 ® - • •®oj™®n\~luj1'1 ®- • -®ul'n, we associate the Chen iterated integral *V) = E
/
H(d-YM,.)
+ w^dst)
A• • •
J0<si<---<sn
A « ( d 7 ( * „ ) , •) + wi' n ds n > = J2
F
»
'
^
where Fr(cr) is a form of degree r. In order to get the relation with the symmetric Fock space, we suppose that w%n is of odd degree and CJ^1 is of even degree. Namely, in the theory of iterated integral, we should integrate over the square [0,1]" and use the fact that even forms over the loop space commute, in order to integrate on the simplex. By considering a basis Wj of eigenforms associated to dd* + d*d + 1 and to the eigenvalues Xj counted in increasing order (let us recall that Xj ~ Cja for some a and remark that ||U;J||OO < C(Xj + l)k for some fc), we find Lemma 2 . 1 . If a € Soo, F^(a) is a measurable r-form over P(M) or over L(M). a = exp[w © a;1], we can consider, when h is a section of the spin bundle: K(:y(s)/4:ds)'^2
E expf-/ \
JO
J
fo,Sl(^c\(dj(si)) n
If
+ LJ^dsi)
J0<si< — <sn
fSl,s2(^c\(d^(s2))+^c\ds2)--i^c\(d;y{sn))
+
Ljlldsn)fSniih('y(l))
(2)
Let us explain this formula: K is the scalar curvature associated to the Levi-Civita connection on M. s —> 'y(s) is a Brownian motion associated to the semi-group exp[—£A]. If s > s', fS:S> is the stochastic parallel transport for the Levi-Civita connection on the tangent bundle, or the Clifford bundle or the spin bundle. uci(d;y(s)) can be seen as follows: iv(dj(s)) is formally a form with degree the degree of u minus 1 on T^sy. we identify this form to an element of the Clifford bundle in 7(s). If X 6 TX(M), X —> uc\(X) can be seen as a 1-form with values in the fiber of the Clifford bundle at x. It is called uic\(.). By considering the Levi-Civita connection V on Sp, we deduce that V + wci(.) = V w is another connection on the spin bundle. We consider the Laplacian A v " acting on the section of Sp and the Schroedinger operator L{w,ul) = A v " — u\x + K/A. The heat semi-group exp[—iL^a; 1 )] has a probabilistic representation. We consider f/j the parallel transport in the reversed sense along the trajectory on the Brownian paths for the connection V w s < t. We have: exp[-*L] h(x) = E exp (- J
K(j(s))/A
ds^j oV 0 V r
Hl(t))
where we consider the Brownian motion starting from x and where Ut is the solution of the differential equation:
dut = Utffcui^rr'dt issued of / .
Connes-Hida Calculus in index theory
495
Lemma 2.2. Let Q%'u the operator exp[aL(a;,w 1 )] where Re a < 0. It is a holomorphic family of bounded operators over the space of L2 sections of the spin bundle and the norm of Q%,lv is bounded by exp[K(a)(\\w\\p + H^Hp] for some p, where K(a) is bounded over each compact of R+* © iR. Moreover z —* Qa zu ,u is a holomorphic family of operators. This lemma can be improved: Lemma 2.3. Q%'u is trace class and G^u),^1) depends holomorphically of a.
= Tr s Q£' w
is a U-functional
which
Remark 2.1. The supertrace Tr s is the habitual supertrace of Index theory related to the Z/2Z grading of the spin bundle. Theorem 2.1. &£" is the S-transform of a Hida distribution $ a over S^, which depends holomorphically of a. Remark 2.2. Let us consider the Killing vector field s —> d/dsj(s) over L(M) X^^)). generates the natural circle action over L(M). We can consider the form
It
J ( 7 ) = / |d/ds7(s)| 2 ds + dX 0 0 ( 7 ) Jo over the smooth loop space. Atiyah and Bismut gave the following heuristic formula for $ Q : ($«,a) = Z~l [ JL(M)
F(a) A exp[l/4a/( 7 )] •
Let us introduce a complex auxiliary bundle £ over M. We can trivialize it, since M is compact, and £ is given by a projector p. We can consider the projection connection whose connection form is A = p dp and curvature is R = pdp A p dp. Bismut has introduced the Bismut-Chern character ch(£00) on the free loop space. Jones-Leandre have shown that ch(£00) = F(a^) whereCTJbelongs to S ^ . We can twist the Dirac operator by £: we get D^ We have Df- = AH + K/A + R\ by the Lichnerowicz formula where AH is the Laplacian over Sp <8> £ induced by the tensor product of the LeviCivita connection on Sp and the projection connection on £. We find, if a is a real strictly negative, ($a,cre) = I n d D e . But a —> ($a,(?) is Holomorphic for all (X in Soo- We deduce: Theorem 2.2. (^a, 0 '?) = Ind-D^ if the real part of a is strictly negative.
3. Cyclic h o m o l o g y in H i d a s e n s e Let us consider a compact Riemannian manifold M of dimension d endowed with the Riemannian measure dniM- Let A be the Laplace-Beltrami operator over it: it is self-adjoint, densely defined over the Hilbert space Ho = L2(M)®C. We consider the Sobolev space Hk (k € N) of complex functions cf> on M such that / M ( A + l)k(fxpdmM < oo. The LaplaceBeltrami operator A has eigenvalues A„ (numbered in increasing order) associated to the eigenfunctions
496
R E M I LEANDRE
*• =
$ > „ ,
n>0
where Fn belongs to Hk ® Hfn equipped with its natural Hilbert structure. We define over Sk,c the following Hilbert structure:
n>0
v
';
Sk,c is called a weighted Boltzmannian Fock space. Definition 3.1. The space of Hida test functionals is r\keN,c>oSk,c — 5oo- endowed with the projective topology. S ^ is a nuclear Frechet space, because the Sobolev imbedding theorem. We consider the Hochschild boundary: n-l 6(0O®01®"-®>n) = ^ ( - l ) i 0 O ® - - - ® ^ i + l ® - - - ® < / ' n + (-l) n 'n0O<8>01---®^n-lt=0
Theorem 3.1. The Hochschild boundary b is a continuous linear application from Soointo 5oo_. Theorem 3.2. The Connes boundary B is a linear continuous map from SQQ- into S^-. It is classical that
b2 = B2 = bB + Bb = 0
so that b + B is a complex. Definition 3.2. b + B operating on 5QO- is called the cyclic complex in Hida sense. Let us give some classical examples of elements belonging to the cyclic complex. We denote Hoc- = C\k>oHk .
By the Sobolev imbedding theorem, i?oo- is nothing else that the algebra of smooth functions on the manifold. We consider a complex bundle associated to a smooth projector p om M, and we consider the associated Chern Character Ch«(p): this gives an element of the cyclic complex in the sense of Hida.
4. The J.L.O. cocycle as an Hida distribution Let us consider a fibration IT : M —> B of compact Riemannian manifolds. The generic element of B is denoted by y and x denotes the generic element of the fiber Vy. We denote by A(B) the exterior bundle on B: we consider TT*(A(B)) the pullback bundle on B endowed with the trivial connection V. We suppose that the fiber Vy is a spin, and we consider the spin bundle Sy = S+tBSy over Vy. We suppose that the bundles Sy fit together in a complex bundle S on M. The Clifford bundle Cly acts over Sy, and the product by an element of the tangent bundle, considered as a subbundle of the Clifford bundle, is odd relatively to the natural graduation on Sy. We consider the Levi-Civita connection V y on Vy, which
Connes-Hida Calculus in index theory
497
passes to Sy and to Cly. We consider the family of Dirac operators Dy. In local coordinate, Dy = J2 ei.yV^ where eiyy is a local orthonormal basis of T(Vy). Let us define H°° = Hf © Hf the infinite dimensional bundle on B of smooth sections of Sy. We consider the bundle on B A{B)®H°°. If cr is a smooth form on B and ipv a section of Sy, we write: Dy(a®i;v) = (-l)dez°o-®Dyi>y We will use the Bismut superconnection, called by Bismut the Levi-Civita superconnection. Let us consider over the space of operators on A(TB)®H°° the natural Z/2Z graduation, such that V£° is an odd operator. We can introduce the curvature Rf of V£°. The semigroup exp[—t/2R^°y] acting on smooth sections rpy of Ey has a probabilistic representation. Let F belonging to Soo- Let us introduce the J.L.O. cocycle Ch*(V£°) by ( C h * ( V r ) , F ) = Y, n
f
Tr.^expt-aiiZfllVf.^lexphCaa-S!)^] JO<s1<-<sn
• • • [Vf, ft] exp[-(l - sn)Rf]
dsi...
dsn .
(3)
It takes its values in the form on B. Let us recall that the supertrace Tr s of an an operator transforming Ay(B)<§Sy~(x) into Ay(B)
lklli(B),fc = J <(AB + 1)*.^> dmB. Using tools of Malliavin Calculus, we can prove that: Theorem 4.1. The J.L.O. cocycle Ch*(V£°) is a Hida distribution (an element of the topological dual S-oo of Soo-) with values in A(B) c o _. Malliavin Calculus allowing to justify the algebraic computations, we get: Theorem 4.2. (b + B) Ch*(V~) = 0. Remark 4.1. This means that (Ch*(Vf), (b + B)F) = 0 for all F belonging to S ^ - . Malliavin Calculus allows to justify analytically the formal algebraic computations leading to the following: Theorem 4.3. (Ch*(V^°),Ch*(p)) = Ch(IndpD.p) where Ind(pD.p) is the Index bundle on B of the family of twisted Dirac operators y —> pDyp and Ch its Chern character in real phase.
References 1. R. Leandre, "Theory of distribution in the sense of Connes-Hida and Feynman path integral on a manifold", Inf. Dim. Ana., Quant. Probab., Rel. Top., to appear. 2. R. Leandre, H. Ouerdiane, "Connes-Hida calculus and Bismut Quillen superconnections", preprint.
Wilson loops and spin networks THIERRY LEVY (ENS
Paris)
If G is any finite product of orthogonal, unitary and symplectic matrix groups, then Wilson loops generate a dense subalgebra of continuous observables on the configuration space of lattice gauge theory with structure group G. If G is orthogonal, unitary or symplectic, then Wilson loops associated to the natural representation of G are enough. This extends a result of A. Sengupta [4]. In particular, our approach includes the case of even orthogonal groups.
1. Introduction On a compact Lie group, the Peter-Weyl theorem asserts that the characters of irreducible representations generate a dense subalgebra of continuous functions invariant by adjunction. In lattice gauge theory, configuration spaces are powers of a Lie group on which another power of the same group acts, according to the geometry of a given graph and in a way which extends the adjoint action of the group on itself. Peter-Weyl theorem can be adapted to this situation and the functions that play the role of the characters are called spin networks. Despite the fact that spin networks were introduced about forty years ago in a physical context a , their importance in lattice gauge theory has been recognized rather recently [1]. In the mean time, another set of functions, easier to define, has been used as the standard set of observables: Wilson loops. However, it is not clear at all a priori that this set is complete, that is, that Wilson loops generate a dense subalgebra of continuous invariant functions on the configuration space. A. Sengupta has proved in [4] that it is true when the group is a product of odd orthogonal, unitary (and symplectic) groups. In [2], an approach similar to that of Sengupta but with a little more classical invariant theory combined with the use of spin networks allows us to add even orthogonal groups to the list and, hopefully, to clarify the argument. This short text is a condensed version of [2]. In particular, no proof is given here, except for a graphical description of the idea behind the main argument.
2. The configuration space Let G be a compact connected Lie group. Let T — (E, V) be a graph with oriented edges. By this we mean that V is a finite set and E is a set of pairs of elements of V. Diagonal pairs are allowed and a pair can occur several times in E. If e = (v, w) £ E is an edge, we define the source and target of e respectively by s(e) = v and t(e) = w. We make the assumption that no vertex is isolated, that is, s(E) U t(E) = V. Define an action of Gv on GE, as follows. For (f> = (cf>v)vev S Gv and g = (ge)e£E £ GB, set
with (<j> • g)e = 4>~^}ge4>s{e)-
R. Penrose introduced them for the purposes of quantization of the geometry of space. See [5] for a historical account.
498
Wilson loops and spin networks
499
The configuration space for lattice gauge theory on T with structure group G is the topological quotient space C£? = GV\GE and it can be thought of as a finite-dimensional approximation of a space of connections modulo gauge transformations. Example 2.1. Choose an integer r > 1 and consider the graph L r with r edges depicted below.
Figure 1. The graphs Li and L r .
For this graph, GE = Gr on which Gv = G acts by diagonal conjugation, and we will call diagonal conjugacy classes of Gr the points of C^ . Wilson loops are continuous functions on Cj? or, equivalently, continuous functions on GE invariant under the action of Gv. We recall briefly how they are defined. Let E^ denote the set containing twice each edge of T, once with its natural orientation and once with the reversed one. Formally, set J5 ± = E x {+, —}, extend the functions s and t to E^ by s(e, +) = s(e), s(e, —) = t(e) and the two similar rules for t. A point of GE determines a point of GE by the rules 9(e,+) = 5e and 5( e ,-) = gjl- For the sake of clarity, we identify e with (e, +) and denote (e, —) by e~1. Moreover, we use the notation e to denote a generic element of E^. A path in T is a finite sequence p = ( e i , . . . ,e„) of elements of E± such that i(e,) = s(ei+i) for all i = 1 , . . . , n — 1. It is a loop based at v if t(en) = s(ei) = v. To a loop / = ( e i , . . . , e„) one associates a function hi : GE —• G defined by hi(g) = ge„ • • • <7ei- One checks easily that the action of
—• C
is well-defined. It is called a Wilson loop. Remark 2 . 1 . A wider class of functions can be defined on Cf. Instead of considering one loop, we can consider several loops l\,...,ln based at the same point. Then, for any function / : Gn —> C invariant by diagonal adjunction, that is, such that for all p i , . . . ,gn, h e G, one has f(gi,. • •,gn) = fihgih-1,...,hgnh"1), the function f°(hh,...,hln):C°
-^C
is well-defined. In words, the diagonal conjugacy class of (h^(c),... for every c in the configuration space.
,hin(c)) is well-defined
500
THIERRY LEVY
3. Statement of the result In this paper, 0{n) and SO(n) denote respectively the groups 0 „ R and 5 0 „ R . By the symplectic group Sp(n) we mean the subgroup b U(2ri) n Sp2nC of GL 2 n C. It is isomorphic to the quaternionic unitary group Uu(n). The main result is the following. Theorem 3.1. Let G be a finite product of groups among U(n), SU(n), 0(n), SO(n), Sp(n). Let r = (E, V) be a graph. Then the algebra generated by the Wilson loops is dense in the space of continuous functions on C£? = GE/Gv. Example 3.1. In the case of the graph Li, Theorem 3.1 is equivalent to Peter-Weyl theorem. Example 3.2. Consider the case of the graph L>. Loops in L r are in one-to-one correspondence with words in the letters of E± = {ef1,..., ef1}. For such a word w and given a point g = (gi,...,gr) of GE, let us denote by w(g) the corresponding product in reversed order of the g[s and their inverses. Observe that, if a loop I corresponds to a word w, then h(g) = w(g) for all g. An important step in the proof consists in rephrasing Theorem 3.1. Assume for a moment that it is proved for the graphs L r . Then the following proposition is easily seen to hold. Proposition 3.1. Let G be a group as in Theorem 3.1. If g and g' are two points of Gr such that for all word w in r letters and their inverses, the elements w(g) and w(g') of G are conjugate, then g and g' belong to the same diagonal conjugacy class. It turns out that Proposition 3.1 is almost equivalent to Theorem 3.1. The gap is filled by the following result, which is proved in a slightly different language in [3]. Proposition 3.2. Let G be a compact group. Let T = (E, V) be a graph. Let c and c' be two points ofCp. Assume that, for any vertexv ofV and any finite sequence li,... ,lr of loops in T based at v, the diagonal conjugacy classes of (/ijj(c),..., hir(c)) and {hi^c'),... ,hiT(c')) are equal. Then c = c'. The problem has been reduced as follows. Proposition 3.3. Theorem 3.1 is logically equivalent to its specialization to the graphs L r , r > 1, which is in turn equivalent to Proposition 3.1. This translation in algebraic language allows us to reduce the list of groups that we need to consider, thanks to the following elementary lemma. Lemma 3.1. If Proposition 3.1 holds for two groups G\ and G2, then it holds for their product G\ x G2. According to this lemma, it is enough to prove Theorem 3.1 when G is one of the groups 0(n), SO(n), U(n), SU{n), Sp(n). Sp2nC is the group of matrices which preserve the skew-symmetric form whose matrix in the canonical basis is f °
[\
Wilson loops and spin networks
501
Remark 3.1. One might expect that the property expressed by Proposition 3.1 is preserved by standard transformations of the group such as quotients or central extensions. Unfortunately, no such result seems easy to prove. For central extensions, A. Sengupta has stated and proved in [4] a partial result, namely that a property slightly stronger than that of Proposition 3.1 is preserved. I have not been able to improve this result.
4. Spin networks
From now on, we concentrate on the case where T is the graph L r for some r > 1 and G is one of the groups listed above. Instead of working on the configuration space, we prefer to work on GE = Gr and consider only objects which are invariant under the diagonal adjoint action of G. Spin networks provide us with a very natural dense subalgebra of the space of invariant continuous functions. They are denned as follows. Choose r finite-dimensional representations a i , . . . ,ar of G with spaces V\,..., Vr. Then G acts on V\
502
THIERRY LEVY
5. Natural representations The main problem we are going to encounter in handling with spin networks is that they involve invariant endomorphisms of spaces of representations of G, which are in general very difficult to describe. In the case where G is a group of complex matrices of some size n, that is, an orthogonal, unitary or symplectic group 0 , G acts by left multiplication on V = C™ and this is called the natural representation. The contragredient of this representation is the action on V* given by 9-
6. Unitary groups Let us concentrate on the case of unitary groups and try to give an idea of the main step of the proof. The orthogonal and symplectic groups are treated in a similar way. Let n > 1 be an integer and let G be either U(n) or SU(n). The group G acts on V = C n by multiplication on the left. For any integer d > 1, there is a corresponding diagonal action of G on V®d, that we denote by p : G —> GL{V®d). On the other hand, the c
Recall that the elements of Sp(n) are complex matrices of size 2n.
Wilson loops and spin networks
503
symmetric group &d acts by permutation of the factors on V®d. We denote this action by IT : &d —• GL(V®d). It is obvious that the actions p and n commute to each other. The following theorem is known as Schur-Weyl duality theorem. Theorem 6.1. The two subalgebras p(CU(n)) and Tr(C&d) of End(V®d) are each other's commutant. In other words, End[/(„)(F® d ) is generated as a vector space by the permutations of the factors. The case of SU(n) follows immediately, since EndSu(n)(V®d) — End(/(„)(V® d ). Consider the following isomorphisms of G-modules: End(V® p
(1)
where the second one is chosen in the simplest possible way, namely
...Vg.
If a belongs to &p+q, let us denote by I„ the element of End(V®p
Figure 2. Schematic representation of tensors.
The middle picture represents the tensor 7r((123)) e End(V® 3 ). The rightmost picture represents the same tensor, via the identification End(F® 3 ) ~ End(I^®2 ® V*).
504
THIERRY LEVY
In this representation, tensor product corresponds to juxtaposition of the boxes and a contraction is represented by joining an outcoming leg with an incoming one. Let us consider a particular case, for example r = 2, p\ = q% = 0, q\ = 1 and p2 = 2. We take the permutation a = (123). Choose (g, h) e G2. T h e picture corresponding to t r ( n v ( g )
,.
1
1
•
f
Figure 3. The spin network ip,n
,
i
n{h) n(h)
r^
,
i
1
t
I
2\ r
,,
''
ny{g)
''
on L2 as the Wilson loop W , -•
<=2,e2)'
If one remembers t h a t , through the identification End(V*) ~ E n d ( V ) , nv(g) corresponds to n(g~l), it becomes almost evident t h a t the trace we are computing is also a Wilson loop, namely tvn(g~lh2). T h e actual proof of Proposition 6.1 is nothing b u t a formalization of this observation.
References 1. 2. 3. 4.
John C. Baez, "Spin networks in gauge theory", Adv. Math. 117, 253-272 (1996). Thierry Levy, "Wilson loops in the light of spin networks", arXiv:math-ph/0306059 (2003). Ambar Sengupta, "The Yang-Mills measure for S2", J. Fund. Anal. 108, 231-273 (1992). Arnbar Sengupta, Gauge invariant functions of connections, Proc. Amer. Math. Soc. 121, 897905 (1994). 5. Lee Smolin, "The physics of spin networks", in The geometric universe (Oxford, 1996), Oxford Univ. Press, Oxford, 1998, pp. 291-304.
Some new developments in Feynman path integrals and applications S.
MAZZUCCHI
(U. Trento)
A survey of recent developments concerning rigorously defined infinite dimensional oscillatory integrals, "Feynman path integrals", is given. Both the theory and its applications in quantum theory are discussed. As for the theory, general results are discussed including the case of polynomially growing phase functions, which are handled by exploiting the connection with probabilistic functional integrals. Applications to continuous measurement theory and stochastic Schrodinger equation are also discussed.
In 1942 Richard Feynman, following a suggestion by Dirac, proposed a heuristic, but very suggestive, representation for the solution of the Schrodinger equation for a (non-relativistic quantum) particle moving in a potential V: i h
F t ^ - ^
^(0,x)
+ v
^
(i)
= ipo{x).
According to Feynman the state of the system, the "wave function", at time t evaluated at the point x E Rd is given as a "sum over all possible histories", that is an integral over the paths 7(s)se[o,t] with fixed end point: e* S t ^Vo(7(0)) £> 7 ",
rl>(t,x) = "const f
(2)
where St (7) = Sf(7) — f0 V(-y(s)) ds, S^{y) = m fQ -^-^- ds, is the classical action functional of the system, evaluated along the path 7. Feynman also extended this kind of representation to general quantum dynamical systems, including the case of quantum fields. Even if more than fifty years have passed since Feynman's original proposal, it is still fascinating. Indeed it creates a link between the classical Lagrangian description of the physical world and the quantum one. It provides a quantization method, allowing, at least heuristically, to associate to each classical Lagrangian a quantum evolution. Moreover in this setting the study of the "semiclassical limit" of quantum mechanics, that is the study of the detailed behavior of the wave function when the Planck constant h is regarded as a mathematical parameter allowed to approach zero, can be performed, at least heuristically, by means of a stationary phase argument. Indeed when h becomes small the integrand is strongly oscillating and the main contributions to the integral should come from those paths which make stationary the phase St, that is the action functional. These, by Hamilton's least action principle, are exactly the classical orbits of the system. Nevertheless a formula like (2), as it stands, has not a well denned mathematical meaning: indeed neither the normalization constant in front of the integral, nor the "infinite dimensional Lebesgue measure" Z>y on the space of paths are well defined. In 1947 Kac observed
505
506
S. MAZZUCCHI
that, by considering the heat equation (with m = h = 1 for simplicity) d 1A Vu dt 2 u(0,x) = UQ(X)
(3)
instead of Schrodinger equation and by replacing the oscillatory factor e^St^Dj with the non oscillatory e~^St^Dj, one can give mathematical meaning to Feynman's formula in terms of a well defined Gaussian integral on the space of continuous paths: an integral with respect to the well known Wiener measure, i.e., an expectation E with respect to the Brownian motion process w u(t,x) = " /e~ St{ul) u 0 {uj{i))Du"
= " E [ e - ^ V M s ) + : r ) d s ) u o ( w ( i ) +a;)'
(4)
Equation (4) is called Feynman-Kac formula. Such an interpretation is not possible for Feynman's heuristic complex measure e^St^D'y: indeed Cameron proved it cannot be cr-additive and of bounded variation, even on very nice subsets of paths' space. As a consequence mathematicians tried to realize it as a linear continuous functional on a suitable Banach algebra of functions. Nowadays we can find in the physical and in the mathematical literature several realizations of this program, that is several definitions of the "Feynman's functional", for instance by means of analytic continuation [18,20,22,26,27,30,33], or as an infinite dimensional distribution in the framework of Hida calculus [24], or via "complex Poisson measures" [29], or by means of non-standard analysis [4], or as a infinite dimensional oscillatory integral [2,9,23,25]. In the following we shall focus on the latter approach, which has its root in the work of Ito and was extensively developed by S. Albeverio and R. H0egh-Krohn, D. Elworthy and A. Truman, S. Albeverio and Z. Brzezniak. Such an approach allows the implementation of an infinite dimensional version of the stationary phase method and the corresponding study of the behavior of the integral in the limit h J. 0. Before coming to a systematic exposition of the latter approach let us briefly stress the strong connections between Feynman's inspiration and probabilistic ideas, as particularly stressed in yet another approach, the "Euclidean quantum mechanical approach" [21]. An oscillatory integral with quadratic phase function
f J n
{x Qx)
e^
'
f(x)dx
(5)
(where Q : D{Q) C ft —> 7i is a linear self-adjoint invertible operator), a so-called Fresnel integral, on a real separable Hilbert space (H, (•, •)) can be defined by a sequential approach. Following Hormander, if H is finite dimensional, the oscillatory integral is defined as the limit of a sequence of regularized absolutely convergent Lebesgue integrals. If 7i is infinite dimensional, according to [23], an infinite dimensional oscillatory integral (IDOI) is defined as the limit of a sequence of finite dimensional oscillatory integrals. The description of the largest class of functions / for which the integral (5) is well defined is still an open problem, but one can find some interesting subsets of it, as F(H), the Banach algebra of complex functions which are Fourier transforms of complex bounded variation measures on
Some new developments in Feynman path integrals and applications
507
H- Indeed, even if dim(W) = oo, one can prove that if / is the Fourier transform of an arbitrary complex bounded variation measure /x/ on TC:
f(x) = f Jn
e^vAda)
and if Q — I is any trace class operator, then the oscillatory integral (5) is well defined and it can be explicitly computed in terms of an absolutely convergent integral with respect to a cr-additive measure on H by means of a Parseval-type equality [2,9,23]: je^x'Qx)f{x)dx
= ( d e t Q ) - 1 / 2 J e-^{x'Q~lx)fif(dx),
(6)
detQ being the Fredholm determinant of the operator Q. In this setting [2,9,23] it is possible to give a rigorous mathematical realization of the Feynman heuristic formula (2) in terms of an IDOI on the Hilbert space of absolutely continuous paths with square integrable weak derivative Ht = {7 : [0, t] —> IR^, j(t) = 0 , / 0 \j(s)\2ds < 00} endowed with the inner product (71,72) = J0 71 (s) • 72(5) ds, the socalled Cameron-Martin space. Let us consider the Schrodinger equation (1) on L2(Rd), where the potential is the sum of an harmonic oscillator part plus a perturbation V{x) = ^xQ2x + Vi(x). Under the assumption that both the initial datum ^0 and the perturbation V\ are Fourier transforms of complex bounded variation measures on R d , it is possible to prove that the infinite dimensional oscillatory integral on Ti f
e A<7.7> e
- A / 0 '(7(,)+x)a 2 (7( S )+x)d S
e
- i /„' VlMs)+X)ds
^
( 7 ( 0 )
+
x ) d l
(?)
is a representation of the solution of equation (1) and a rigorous mathematical realization of Feynman's heuristic formula (2). In this setting an infinite dimensional version of the stationary phase method has been developed: a detailed asymptotic expansion of the above integrals in powers of H, with a good control on the remainder, has been computed and, under suitable assumptions on the potential, its Borel summability has been proved [2,10,31]. Infinite dimensional oscillatory integrals are a flexible tool and can be used to give a rigorous mathematical realization of a more general class of Feynman path integral representations. They have even been applied in Chern-Simon quantum field theory (see [8,15] for more details). A recent application in non relativistic quantum mechanics is the definition of the "phase space Feynman path integral" [5]. Let us recall that Feynman's original aim was to give a Lagrangian formulation of quantum mechanics. From many points of view, on the other hand, an Hamiltonian formulation could be preferable, so in the physical literature one can often find the following heuristic formula: i/>(t,x)="
[
e%St{q
(8)
J{(q,p)\q(t)=x)
where the integral is meant on the space of paths in the classical phase space of the system and S(q,p) = f0 \p(s)q(s) — H(q(s), p(s))} ds is the classical action functional in its Hamiltonian realization. By means of an infinite dimensional oscillatory integral on the Hilbert
508
S. MAZZUCCHI
space Tit x L2(M.d) it is possible to give to equation (8) a rigorous mathematical realization. Moreover, by considering the Schrodinger equation in which the potential V can depend both on position and on momentum in an additive way V(p,x) = V\{x) + V2(p), under suitable assumptions on Vi and V2 it is possible to prove that the IDOI is a representation of the solution. The definition of IDOIs can be enlarged in order to include in it a suitable class of complex valued phase functions. It can be applied to the representation of the solution of particular Schrodinger equations, with "complex Hamiltonians", including a stochastic Schrodinger equation describing a continuous quantum measurement: Belavkin equation. Let us recall that the continuous evolution of the state of a quantum particle described by the "traditional" Schrodinger equation is valid if the particle is "undisturbed", but if it is submitted to the measurement of one of its observables (for instance its position) and interacts with the measuring apparatus, then its time evolution is no longer continuous. In fact the state of the particle after the measurement is the result of a random and discontinuous change: the so-called "collapse of the wave function". In the physical literature it is possible to find several descriptions of the selective dynamics of a quantum particle in d-dimensional configuration space whose position is continuously observed. An heuristic "restricted path integral" representation for the state of the particle if the observed trajectory is the path w(s)s6[o,t] was proposed by Mensky: [ ets'We-^o'W')-»W)!j>(7(0))J)7". J{~f(t)=x}
il>(t,x,[a])="
(9)
One can see that, as an effect of the correction term e~xfo(~f(s)~"(s)) ds due to the measurement, the paths 7 giving the main contribution to the integral (9) are those closer to the observed trajectory w. Some years later V. P. Belavkina proposed a stochastic Schrodinger equation dip = -^Hipdt-^i;dt
+ VXxipdW(t),
(10)
where W is a (i-Brownian motion, A a coupling constant, proportional to the accuracy of the measurement. In this setting the theory of stochastic processes and the infinite dimensional oscillatory integrals meat each other. Indeed recently, by means of an IDOI with complex phase on the Cameron Martin space Ht, a Feynman path integral representation for the strong solution (in the probabilistic sense) of Belavkin equation (also discussed by Gatarek and Gisin) has been proven [6,11], providing a rigorous mathematical realization of Mensky formula: 1>(t,x)=
f
St
ei
^-xti^+^2dseti^W+x^w^ds<j>(j(0)+x)di.
(11)
An analogous result for Belavkin equation describing continuous momentum measurement has been obtained by means of phase space Feynman path integrals [7]. The examples above have shown the flexibility of the IDOIs. On the other hand in the above formulations there is an important restriction: in fact the class of potentials a
Related equations have been proposed also by other physicists, for instance in work by Ghirardi, Rimini and Weber, Mensky, Diosi, Gisin (see the references in [6,7]).
Some new developments in Peynman path integrals and applications
509
for which an infinite dimensional oscillatory integral representation for the solution of the corresponding Schrodinger equation can be defined are of the type "harmonic oscillator plus Fourier (or Laplace) transform of measure". This situation is rather unsatisfactory as this class does not include some physically interesting potentials, like those growing polynomially at infinity. Recently this problem has been solved [12,13]: the definition of infinite dimensional oscillatory integral has been extended to the case where the phase function is the sum of a quadratic and a quartic term:
JH
where B : D(B) Cg H —> H is a linear self-adjoint operator, with (I — B) > 0, P(x) = A(x, x,x,x), where A:H'x'HxHxH—*B.a, positive completely symmetric fourth order covariant tensor operator on H. The main tool is the the generalization of the Parseval type equality (6). Indeed, if H = R™, a rotation in the complex plane allows one to to compute the Fourier transform of the distribution e27i(x>(/-B):,:>+X'4(x>x'x>x) and to represent it as a Gaussian integral: i(k,x)
e
(27rift)™/2
eJK(x,(I-B)x)
+ iP(x) dnx
=
E
^e^^Vhik,*)
i(x,Bx)
e-ihP(x)
e%{x,tix)e-inr{x)\
Q2)
where E denotes the expectation with respect to the centered Gaussian measure N(0, J) on W1, with covariance the identity operator J. As a consequence it is possible to prove that if / is the Fourier transform of a complex bounded variation measure fif satisfying some restrictions on its growth at infinity, then the corresponding generalized Fresnel integral is well defined and can been computed by means of a absolutely convergent Gaussian integral:
I
e ^<x,(i-B)*>+£P(z) f^x)dx
= E J e i<«.B*> e -iW(*)/(e*»/4 > /fix)]
(13)
Equation (13) can be generalized to the case Ti is a real separable infinite dimensional Hilbert space. N(0,I) is the standard Gaussian measure associated with H, and the functions (•, B •), P and / on 7i are lifted to the abstract Wiener space built on H: f eJn{x,(I-B)x)
+ ±P(x)
f
^
d x
= EreJ
(14)
This formula has been applied to the Feynman path integral representation for the solution of the Schrodinger equation with an anharmonic potential of the type V(x) = ^xQ2x + Xx , A > 0. Indeed by considering two vectors <£,V>o £ L2(Rd) n Jr(Rd), it is possible to prove that the Feynman path integral representation for the inner product ((j),e~%Htipo) can be realized as the analytic continuation (in the parameter A) of a generalized IDOI on the Hilbert space Rd x Ht: JndxHt which, by Parseval-type equality (14), is equal to the Gaussian integral with respect to the
510
S. MAZZUCCHI
Wiener measure: ^rf/2
f
ei%
f*(Vhu,(s)+x)4ds
e
±
f*n?{Vhu{s)+x)2ds
d
Jm xct fa^^x)
ifo(j*f*y/hw(t)
+ e i 7 r / 4 z ) W(du) dx.
Work on a rigorous Borel summable asymptotic expansion of (14) in powers of h by a saddle point method is in preparation [14]. One expects t h a t the rigorous definition (14) can find application also in t h e study of q u a n t u m fields and t h a t , by rigorously defined Feynman p a t h integrals, asymptotic expansions can be justified (see [15]).
References 1. 2. 3. 4.
5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31.
S. Albeverio, Ph. Blanchard, R. H0egh-Krohn, Coram. Math. Phys. 8 3 , 49 (1982). S. Albeverio, Z. Brzezniak, J. Fund. Anal. 113, 177 (1993). S. Albeverio, Z. Brzezniak, Z. Haba, Potential Anal. 9, 65 (1998). S. Albeverio, J. E. Fenstad, R. H0egh-Krohn, T. Lindstr0m, Non Standard Methods in Stochastic Analysis and Mathematical Physics, Pure and Applied Mathematics 122, Academic Press, Inc., Orlando, FL, 1986. S. Albeverio, G. Guatteri, S. Mazzucchi, J. Math. Phys. 43, 2847 (2002) S. Albeverio, G. Guatteri, S. Mazzucchi, Probab. Theory Relat. Fields 125, 365 (2003). S. Albeverio, G. Guatteri, S. Mazzucchi, to appear in IDAQP. S. Albeverio, A. Hahn, A. Sengupta, SFB 611, preprint, Bonn, 58 (2003). S. Albeverio, R. H0egh-Krohn, Mathematical theory of Feynman path integrals, Springer-Verlag, Berlin, 1976, Lecture Notes in Mathematics vol. 523. S. Albeverio, R. H0egh-Krohn, Invent. Math. 40, 59 (1977). S. Albeverio, V. N. Kolokol'tsov, O. G. Smolyanov, Rev. Math. Phys. 9, 907 (1997). S. Albeverio, S. Mazzucchi, preprint of the University of Bonn (2003). S. Albeverio, S. Mazzucchi, preprint of the University of Trento (2003). S. Albeverio, S. Mazzucchi, in preparation. S. Albeverio, I. Mitoma, in preparation. R. Azencott, H. Doss, Stochastic aspects of classical and quantum systems (Marseille, 1983), Lecture Notes in Math. 1109, Springer, Berlin, 1985, pp. 1-1.7. G. Ben Arous, F. Castell, J. Fund. Anal. 137, 243 (1996). R. H. Cameron, J. Math, and Phys. 39, 126 (1960). P. Cartier, C. DeWitt-Morette, J. Math. Phys. 41, 4154 (2000). D. M. Chung, Proc. AMS 112, 479 (1991). K. L. Chung, J.-C. Zambrini, Introduction to Random Time and Quantum Randomness (New Edition), Monographs of the Portuguese Mathematical Society, World Scientific, 2003. H. Doss, Comm. Math. Phys. 73, 247 (1980). D. Elworthy, A. Truman, Ann. Inst. H. Poincare Phys. Theor. 4 1 , 115 (1984). T. Hida, H. H. Kuo, J. Potthoff, L. Streit, White Noise, Kluwer, Dordrecht, 1995. K. Ito, Proc. Fifth Berkeley Symposium on Mathematical Statistics and Probability vol. 2, part 1, California Univ. Press, Berkeley, 1967, pp. 145-161. G. W. Johnson, M. L. Lapidus, The Feynman integral and Feynman's operational calculus, Oxford University Press, New York, 2000. G. Kallianpur, D. Kannan, R. L. Karandikar, Ann. Inst. H. Poincare, Prob. Th. 21, 323 (1985). V. N. Kolokol'tsov, Semiclassical analysis for diffusions and stochastic processes, Lecture Notes in Mathematics 1724, Springer-Verlag, Berlin, 2000. V. P. Maslov, Methodes Operationelles, Mir, Moscou, 1987. E. Nelson, J. Math. Phys. 5, 332 (1964). J. Rezende, Comm. Math. Phys. 101, 187 (1985).
Some new developments in Feynman path integrals and applications
32. O. G. Smolyanov, E. T. Shavgulidze, Path integrals, Moskov. Gos. Univ., Moscow, 1990. 33. H. Thaler, to appear in Potential Analysis. 34. A. Truman, T. Zastawniak, J. Korean Math. Soc. 38, 469 (2001).
511
Super-diffusion and its collapse in a quenched multi scale passive transport model HOUMAN OWHADI (Caltech) We consider a general model of passive transport in a deterministic flow is characterized by an infinite number of spatial scales without separation between them. We observe that the transport is super-diffusive and does not depend on the particular geometry of the flow but only on the power law in the velocity field. However it appears that as the circulation increase in the eddies, this self-averaging property may brutally collapse towards a chaotic behavior where advection dominates averaging and the transport becomes highly sensitive to the geometry of the flow.
1. The model The understanding of passive transport in incompressible flows is an active field of research [12], important for the understanding of passive and active transports in turbulence [6]. In most models [10] the velocity v of the flow is considered as random and the transport is studied in an annealed regime by averaging on the randomness of the flow. We report on rigorous results (proven in [15]) obtained on a deterministic (quenched) model of passive tracer in which the flow is characterized by an infinite number of scales without the usual assumption of separation between scales [3]. We consider the passive transport equation in M.d for a scalar quantity T convected by a steady incompressible flow v (time independent and such that div(u) = 0) and diffused with a molecular diffusivity K dtT + vVT =
KAT.
(1)
We write T the stream matrix of v, thus T is a skew-symmetric matrix such that div(r) = v. We assume that T is given by the following sum ,
OO
r:
s
=E^(^j-
(2)
In the decomposition (2) we have three important ingredients: — The parameters pk are positive real numbers representing the different spatial scales and grows exponentially fast with k. We assume that p £ R and 2 < p < oo. — The parameters j k are positive real numbers associated to the amplitude of the circulation rate in eddies at scale k. We assume that 1 < 7 < p. — The parameters Ek are skew-symmetric matrices with periodic coefficients of period Td := M d /Z d the unit torus of Rd; they are the stream matrices associated to the eddies of scale k. We assume that Ek(0) = 0 and s u p f c 6 N s u p m i j e { l i dy \dmEk,ij\oo < 1. As a simple example, we have illustrated in figure 1 the stream lines of a two scale flow with stream function hl(x,y) = Y?k=o1kh{x/Pk,y/pk), with p = 2.9, 7 = 1.25 and
512
Super-diffusion and its collapse in a quenched multi scale passive transport model
513
Mawc» XN
.'
Figure 1. A simple example of the multiscale flow.
h(x,y) = cos(27nr) sin(27ry). The flow in this model has an infinite number of scales, of course real turbulent flows are characterized by an inertial range corresponding to a finite of scales. One can take the sum in (2) only over a finite number of scales, this would lead to an inertial range and the results given in this paper would remain unchanged but valid only between time and space scales corresponding to the smallest and largest scale in (2). Observe also that energy spectrum in the flow does not follow the Kolmogorov law (to be consistent with a Kolmogorov spectrum one should have 7 = ps), the purpose of this paper is only to analyze the effect of the multi-scale structure of the flow on the passive transport.
2.
Super-diffusion
We will now show that the transport in our model is supper-diffusive and controlled by a multi-scale homogenization process. For a, a symmetric elliptic constant dx d matrix and E a skew symmetric d x d matrix with periodic coefficients in C(Td), we write a(a, E) the symmetric elliptic constant dx d matrix defined by the following variational formulation: for £ G Rd,
iel'-i(«,B) = H I \C-divH + (a + E)Vf\Udx.
(3)
f
(6)
514
HOUMAN OWHADI
We consider the Lagrangian formulation of (1), i.e., the Ito's stochastic differential equation associated to the transport of one tagged particle in suspension in the flow: dyt = V2Kdut + v(yt) dt.
(7)
Let us write r{r) the exit time of the passive tracer yt (given by (7)) from the ball B(0, r). r{r) := inf{t > 0 : \yt\ > r } .
(8)
Let us remind that for a normal diffusion one should have r{r) ~ r2, we will show below that this is not the case and one has r(r) ~ r2~" with v > 0. In order to show that this phenomenon is a super-mixing phenomenon (the exit times are anomalously fast not because of convection) we will consider zt a second passive tracer convected by the same flow as yt but diffused with an independent thermal noise and show that the separation between yt and zt is anomalously fast. More precisely we the solution of the following stochastic differential equation dzt = v2Kdu>t + v(zt) dt,
(9)
where u>t is a standard Brownian Motion independent of u>t- We write B(0, r, I) the subset of Rd x R d such that \x — y\
We write H(r) and H(r, I) the events {r{r) < r2~s} and {r(r, I) < r2~5} with 5 = 0 . 9 ^ (observe that S > 0). We write n(r) := sup{p G N : pp < r}
(11)
the number of scales in our model that are smaller than the length r. We have the following rigorous result Theorem 2.1. If \~ > 0 then there exists a constant C(d, 1/A - , l / l n 7 ) < oo such that for p > Cf and r > p one has Emr[r(r)]=r2-^
(12)
and limEmri,[r(r,0]=r2-^
(13)
with »(r)=l^p(l
+ e(r))
(14)
Super-diffusion and its collapse in a quenched multi scale passive transport model
515
with |e(r)| < 0.5C^/p < 0.5. Moreover \imo¥mr[H(r)}=l,
(15)
lim i l i m P m r l [ f l - ( r , 0 ] = l .
(16)
Equation (12) shows the fast transport the passive tracer. There exists an important literature on the fast transport phenomenon in turbulence. We refer to [12] and [17] for a review. For the shear flow model we refer to [4], [7] and [1]. For rigorous analysis on non shear flow models we refer to [16] and [9]. In the latter papers, the velocity vector field v appearing in equation (1) is considered as random and characterized by long ranged spatial and temporal correlations. Then physical parameters such as the mean squared displacement are averaged in an annealed regime with respect to the law of the vector field, the thermal noise and time. In our model the velocity vector field v is deterministic (which corresponds to a quenched regime) and it is not a priori obvious that the dynamics of the passive tracer should be independent of the particular geometry of the eddies Ek • According to theorem 2.1 it appears that this is the case if A - > 0: the dynamics depends only on the power law In 7 /Inp. Equations (13) show the fast fixing or super-diffusive property of the passive transport. It can be generalized to any finite number of passive tracers. Equations (15) and (16) show that the flow is strongly averaging and super-diffusion is an almost sure event.
3. When A~ > 0? It is well known [5] that t
E(x)a~1E(x)dx.
a
(17)
Let us write A+ :=limsupA m a x (,4„)
(18)
n—>oo
and (i := limsup (A max (vl„)/A min (A„)).
(19)
It is easy to obtain by iteration from (17) and (5) that Cd
A+<-^
IF
, . Cd and M
-(F)
(20)
In particular if the flow is strongly anisotropic (/i = oo), we have A = 0 . Let us write for C > 0 the function
V(() := liminf ^ " K " ^ ' ^ ) )
(21)
From variational formulas associated to crsym(a, E) [13], [15] one obtains that V is decreasing and always greater than 1. Thus ^(0) := lim^o ^(C) is well defined. Moreover, we have obtained by induction that if fi < oo and 7 < V(0) then A~ > 0 and Ci < A~A+ < C 2 .
(22)
516
HOUMAN OWHADI
It is easy to build from previous result non trivial examples of flows such that A~ > 0. We say that An is self-similar and isotropic if and only if for all k, Ek = E and for all £ > 0, &sym((Id, E) is a multiple of the identity matrix. Observe that by the relation (5) if An is self-similar then it is a low order dynamical system and we have the following theorem showing that if V(0) < oo then as the circulation rate 7 increases in the flow, the behavior of An and thus the transport properties of the flow go through a phase transition that we have called viscosity implosion. T h e o r e m 3 . 1 . If An is self-similar and isotropic then (1) If 7 < V^(O) then X~ > 0, in particular limn_,oo An = ($14 where (0 is the unique solution of V(£o) = J(2) Ifj = V(0) and (V(0) — V{x))x~p admits a non null limit as x [ 0 with p > 0 then X~ = 0 in particular lim„_^oo " l n n = — £. (3) 7/7 > V(0) then X~ = 0, in particular l i m , , - ^ £ \nX(An) = In ( ^ p ) . It is easy to obtain that if An is self-similar and isotropic and div E is not identically null then V(0) > 00. Moreover if there exists S > 0 such that d i v £ is null on [0, l ] d \ [S, 1 - 5}d then V(0) < 00. In particular if d = 2 and Ei^(x,y) = cos(27r:r) sin(27ry) then V(0) = 00 and the stability condition A - > 0 is always satisfied.
4. Idea of the proof of theorem 2.1 The super-diffusion is created by a mixing phenomenon over all spatial scales (see [2] for sub-diffusions, we also refer to [14]). First let us write for p,n &N
r- , n :=E7*^(4). k=p
(23)
H
For simplicity of the presentation let us assume that A^ is self similar and isotropic (these points are not needed). Then r 0 , n is periodic and the effective conductivity a(K,Id,T°'n) is well defined. It appears that lim a(KId, r 0 ' " ) = jn+1An+1.
(24)
p—>oo
Now if the scales are not separated (p < 00) and A - > 0 one can obtain that a(KJd,r0'")~7"+1.
(25)
If we write v(r) (a(r)) the magnitude of the velocity field (the eddy viscosity) at the scale r. Equation (25) corresponds to the relation a(r) ~ rv{r) explicitly used in the theory of fully developed turbulence exposed in [11]. Now it appears that at the observation scale r, one can divide the infinite number of scales associated to V into three categories. The scales 0 , . . . , n(r) (given by (11)) have been homogenized and intervene in the dynamics of yt through their bulk property like a Brownian Motion with effective diffusivity a(Kld, r°>™(r)). The scales n(r) + 2 , . . . , 00 have no influence on the dynamics. The scale n(r) + 1 can not be considered as averaged, neither its influence is negligible: it influences the dynamics of
Super-diffusion and its collapse in a quenched multi scale passive transport model
517
yt through the particular geometry of its drift. Now let us observe that the exit time from B(0, r) for the effective diffusivity a{n,Id,T0,n^)
created by the smaller scales is
rD(r)~r2(7"M+1A(An(r)+1))"1,
(26)
where we have used the notation An = \{An)Id- The magnitude of the velocity associated to the scale n(r) + 1 is -y n ( r ) + 1 /p n ( r ) + 1 . It follows that the exit time from -B(0,r) for the convection associated to the drift of the scale n(r) + 1 is TC(r)
~ r X ( 7 n M + l / p n ( r ) + l ) - l „ r2^n(r)
+i y \
(2?)
Now one can define a local Peclet number Pe(r) by ~ (A(A„)) - 1 .
Pe(r) := TC(r)/rD(r)
(28)
It follows that if A - > 0 then the local Peclet number remains bounded away from infinity and the influence of the convective scale n(r) + 1 (compared to the influence of the diffusive scales 0 , . . . ,n(r)) on the exit time of yt from B(0,r) remains bounded and one can obtain that r(r) ~ Tjj(r). Now using n(r) ~ In r / h i p one obtains that r(r)~r27-]nr/ln^~r2-^.
(29)
5. What happens when A - = 0? When A~ = 0 the local Peclet number Pe(r) (28) diverges towards infinity. Which means that Tc(r) -C TD(T) even if p is large and the multi-scale averaging scenario mentioned above collapse. Let us now show on a simple example that if A - = 0, the particular geometry of the eddies can not be neglected even if the separation between scales p is large. Assume An to be self-similar and isotropic for instance. Let us write r 0,n-l ; = g 7 f c £(_M
(30)
k
fe=0
for y E Rd a(n,y) := a ^ r
0
'""
1
+7"f?(l/ + ^ - ) )
(31)
and a(n,y,p)
:=
lim
a(n,y).
Ri/R0,...,Rn-i/Rn-2-*°o
;Rn/Rn-i=p
One obtains that a(n, y, p) = 7 " - V ( A n _ 1 , E(px) + -yE(x + y)). -
(32)
It is then possible to show that if A > 0 then a(n, y, p) is not sensible to the relative translation between successive scales and for any I S (Rd)* and y e [0, l}d,
l „^oo i m s utlcr(n,0,p)l PM^4
+
C^A-)-i
518
HOUMAN OWHADI
However if A = 0 then for any p > 1 there exists E and y £ [0, l]d such that for any I G {Rd)*, t la(n,y,p)l lim ,, t v. ,y'H' = oo. n—oo la(n,0,p)l We believe that this collapse is not a mathematical artefact. If A - = 0 then the transport of yt is dominated by the convection of the scale n(r) + 1 and its particular geometry can no longer be neglected. Let us illustrate what we mean by such bifurcation. Assume that d = 3 and that An is self-similar and isotropic.
(a) Shape of E.
(b) 7 < V(0).
(c) 7 > V(0).
Figure 2. Viscosity implosion.
The geometry of the eddy E over a period has been illustrated in figure 2(a). Now imagine that one puts a drop of dye in such a flow at the origin at t = 0 and observe its density at a large time scale t ^> 1 at a spatial scale r » 1. The drop of dye has been transported by two competing phenomenons. The scales smaller than r act trough their bulk mixing properties and diffuse the drop of dye making its density distribution homogeneous. The eddies of scale of order r transport the drop of dye by convection and tends to create high density gradients in its distribution. For 7 < V(0) (figure 2(b)) we have A - > 0, the local Peclet number remains bounded and the diffusive property of the smaller scales compensate the convective power of the larger ones: the drop of dye is dispersed in an homogeneous way (like an enhanced diffusion). This picture undergoes a brutal transformation at 7 > V(0) (figure 2(c)); in this domain A - = 0 and the local Peclet number tends towards infinity and the drop of dye is mainly transported by the convective power of the larger scales. This convection is no longer compensated by the mixing property of the smaller scales and the density distribution of the drop of dye presents high density gradients. Remark. Consider a turbulent flow, write a(r) the turbulent viscosity of the flow at the scale r and v{r) the magnitude of its velocity at this scale. If the flow is isotropic, Kolmogorov [8] through a dimensional analysis has observed that one should have a(r) ~ rv(r).
(33)
Next, writing that the energy dissipation a{r)(v(r))2r~2 should be scale invariant, he obtained the so called Kolmogorov law. Thus the relation (33) is at the core [11] of this law.
Super-diffusion and its collapse in a quenched multi scale passive transport model
519
In this paper we have observed on a general passive t r a n s p o r t model t h a t t h e relation (33) is a consequence of t h e multi-scale structure and the isotropy of the flow. Observe t h a t the relation (33) does not depend on the particular geometry of the flow, we have also noticed t h a t as t h e circulation rate increases in the flow this self-averaging property may brutally collapse.
References 1. Gerard Ben Arous, Houman Owhadi. "Super-diffusivity in a shear flow model from perpetual homogenization", Comm. Math. Phys. 227, 281-302 (2002). 2. Gerard Ben Arous, Houman Owhadi, "Multi-scale homogenization with bounded ratios and anomalous slow diffusion", Communications in Pure and Applied Mathematics 56, 80-113 (2003). 3. M. Avellaneda, "Homogenization and renormalization, the mathematics of multi-scale random media and turbulent diffusion", in Lectures in Applied Mathematics, volume 31, 1996, pp. 251268. 4. M. Avellaneda, A. Majda, "Mathematical models with exact renormalization for turbulent transport", Comm. Math. Phys. 131, 381-429 (1990). 5. M. Avellaneda, A. J. Majda, "An integral representation and bounds on the effective diffusivity in passive advection by laminar and turbulent flows", Comm. Math. Phys. 138, 339-391 (1991). 6. U. Prisch, Turbulence. The legacy of A. N. Kolmogorov, Cambridge University Press, 1995. 7. J. Glimm, Q. Zhang, "Inertial range scaling of laminar shear flow as a model of turbulent transport", Comm. Math. Phys. 146, 217-229 (1992). 8. A. N. Kolmogorov, "The local structure of turbulence in incompressible viscous fluid for very large Reynolds numbers", Dokl. Akad. Nauk SSSR 4, 9-13 (1941). 9. T. Komorowski, S. 011a, "On the superdiffusive behavior of passive tracer with a gaussian drift", Journ. Stat. Phys. 108, 647-668 (2002). 10. R. Kraichnan, "Small-scale structure of a passive scalar convected by turbulence", Phys. Fluids 11, 945-963 (1968). 11. L. D. Landau, E. M. Lifshitz, Fluid Mechanics, 2nd ed., MIR, 1984. 12. A. J. Majda, P. R. Kramer, "Simplified models for turbulent diffusion: Theory, numerical modelling, and physical phenomena", Physics reports 314, 237-574 (1999). 13. J. R. Norris, "Long-time behaviour of heat flow: Global estimates and exact asymptotics", Arch. Rational Mech. Anal. 140, 161-195 (1997). 14. H. Owhadi, Anomalous diffusion and homogenization on an infinite number of scales, PhD thesis, EPFL — Swiss Federal Institute of Technology, 2001, available at http://www.acm. caltech.edu/~owhadi/. 15. Houman Owhadi, "Averaging vs chaos in turbulence?", preprint (2002). 16. L. Piterbarg, "Short-correlation approximation in models of turbulent diffusion", in Stochastic models in geosystems (Minneapolis, MN, 1994), volume 85 of IMA Vol. Math. Appi, Springer, New York, 1997, pp. 313-352, 17. W. A. Woyczynski, "Passive tracer transport in stochastic flows", in Stochastic Climate Models, Birkhauser-Boston, 2000, p. 16.
An analytic approach to Kolmogorov's equations in infinite dimensions and probabilistic consequences MICHAEL ROCKNER
(U. Bielefeld)
1. Introduction The purpose of this note and our talk at the ICMP is to present a mathematical programme designed to construct and analyze (weak) solutions to stochastic partial differential equations (SPDE) appearing in mathematical physics, e.g. in hydrodynamics or interacting infinite particle systems. Our main point is to directly solve the corresponding Kolmogorov equations to first obtain (and in fact "calculate" by finite dimensional approximations) the transition probabilities of the desired solution process. The latter is actually constructed only subsequently rendering the appropriate physical interpretation. In Section 3 we shall start with examples of typical applications giving recent references where the said programme has (at least partially) been implemented. The programme itself is described in the subsequent sections. Given the restrictions in length of this note, concerning work by other authors on these topics, we refer the reader to the discussions in our quoted papers as well as the references therein. At this point I would like to express my sincere gratitude to all colleagues and friends who have contributed as coauthors to many of the results presented here.
2. Applications We start with applications which are covered by our method. 2.1. SPDEs in hydrodynamics (a) Stochastic generalized Burgers equation dXt{£) = [ A € X t ( 0 + V € tf ( X t ( 0 ) + *{Xt(Q)]
dt +
VAdWtW
on the state space E:=L2((0,l),ds) (so Xt £ E, ££ (0,1)) with ds := Lebesgue measure on (0,1), $ : R -> R, $ : R x R -> R. The noise term dWt(£) and the nature of the diffusion coefficient yJ~A are explained in the next section. References: [22]. (b) Stochastic Navier-Stokes equation dXt(0
= [vAfXt(C)
+ ( X t ( 0 • V j ) X ( ( 0 ] dt +
520
VAdWt(0
An analytic approach to Kolmogorov's equations in infinite dimensions ...
521
on the state space E := {x G L2(D -> R2,dvo\) | diva; = 0} (so Xt G E, £ G D) with D := domain in R 2 , dvo\ := Lebesgue-measure on D, A | := Stokes operator, and v : = viscosity. References: [23]. (c) Stochastic porous media equations
dXt(0 = A € (aX t (0 + X?(£)) dt + yfAdWtH) on the state space E:=H~l{D)
(= dual of H^'2(D))
(so Xt G E, £ G D) with £> := domain in R d and a G [0, oo), m G N, m odd. References: [7, 15,16]. One motivation for studying SPDEs of type (a)-(c) above is to "put back randomness" into the classical deterministic macroscopic partial differential equations by adding a function of the noise, and in a first step only taking into account the first order contribution of the Taylor expansion of this function. 2.2. Infinite particle systems with singular interactions in R d (or a Riemannian manifold) oo
dx^ =
EvV^-xft
.(fc)
dt + dWr\
fcGN,
•3 = 1
3?k
on the state space E := all locally finite subsets of Kd, (so Xt := {x[k)\k e N } e £ ) with (Wt(fc))t>o, fc G N, independent Wiener processes in Rd, and e.g. V(x) := >(|z|), x G Rd, ("two body potential") where typically
3. General framework and strategy All examples in Subsection 2.1 belong to the following general class of stochastic differential equations on a separable Hilbert space (E, (, )), dXt = [HXt + F(Xt)} dt + VAdWt, XQ = x G E
(1)
(typically a function space).
Here H is a (in general unbounded) linear operator on E generating a strongly continuous semigroup on E, A is a positive definite symmetric bounded linear operator on H (often assumed to be of trace class), F : Dom(F) C E —> F is a non-linear map, and (Wt)t>o is a (cylindrical) Brownian motion on E.
522
MICHAEL ROCKNER
The associated infinitesimal generator, also called "Kolmogorov operator", (obtained by heuristically applying Ito's formula to (1)) is given as follows: L
+ J2(Hx
iyj = l
+
F{x),ei)^-{x),
i—X
x G E, ip = 5 ( ( e i , • ) , . . . , {eN, •)) with N G N, g G C%(RN), and {et \ i G N} is a (in the respective application properly chosen) orthonormal basis of E. Such functions ip : E —> M are called finitely based and the set they form is denoted by TC2. D, D2 denote first and second Frechet derivatives and d/dei stands for partial derivative in direction e*.. The corresponding Kolmogorov equation in infinitely many variables (= generalized heat equation on the Hilbert space E) is then given by —u(t,x)=Zu{t,x),
u(0, •) = / ,
(2)
where / : E —> R is the initial condition of this infinite dimensional deterministic PDE. We emphasize that in (2) we have to consider an appropriate extension L of L with domain TC\, since due to the highly non-diagonal nature of (1) caused by the non-linearity F, even if / is finitely based this will never be the case for the solution u(t, •), apart from trivial uninteresting cases. As in the classical finite dimensional case the connection between (1) and (2) is that for / := IA (= indicator function of the set) Ac E, we have u(t,x) = probability that the solution of (1) with initial condition x is in A at time t = Prob(X0=x,Xt€A),
(3)
i.e., the solution to (2) is the "transition semigroup" for the stochastic process solving (1). For many years the approach to solve the Kolmogorov equation (2) in infinite dimensions was to first solve (1) and use (3) as a definition for u(t,x) and check under what additional conditions it really solves (2). The reason for this was that in infinite dimensions there were many techniques known to solve (1), albeit under quite strong assumptions on the coefficients, but almost none was known to solve (2) directly. Our main point is FIRST solve (2) and then (1). This turns out to be possible under much weaker regularity conditions on the coefficients. In addition, in contrast to previous approaches this opens up ways to "calculate" the transition probabilities in (3) by finite dimensional approximations. Our strategy to solve (2) is as follows: Solve (2) via u(t,x) = (etLf)(x)
in
L*(E,LI)
(p € [l,oo])
for a suitably chosen probability measure /x on E. Construct from this strongly continuous semigroup on LP(E, //) a semigroup of probability kernels on E, and then using Kolmogorov's scheme a stochastic (Markov) process having these kernels as transition semigroup. Finally, show that this process solves (1) in the sense of a martingale problem uniquely.
An analytic approach to Kolmogorov's equations in infinite dimensions ...
523
4. Programme to implement the strategy Now we summarize six single steps to implement this strategy to solve (2). For each step we give at least one published reference, which is basic for the respective main underlying ideas. More recent references (> 2003) have been mentioned already in Section 2 and more will be given in the next section. Step 1: Reference measures. Find an appropriate reference measure on the Hilbert space E. It has turned out, that they can be obtained as solutions to the elliptic equation L > = 0,
(4)
that is, [x is a probability measure on E such that
Such measures are called "infinitesimally invariant". Often it is enough to find a probability measure fi on E such that for some A G [0, oo) L
tp dfi,
V ip G TCl,
We stress that such a measure fi essentially always exists if SDE (1) at all has a solution for sufficiently many starting points x € E. For techniques how to solve (4) we refer to [6] for the symmetric case and to [9], [10] for the general case. Step 2: ZAsolutions for PDE (2). Construct etL, t > 0, on LP(E,/J,), where L denotes the closure of L. Then t>0.
(5)
This is the solution of the Kolmogorov equation (2) in Lp(E,fi). reference here is [14] (see also [21]).
A possibly quite useful
(^(£,M)-)
Step 3:
^(e^/)=L(£V),
Regularity of Lp-solutions of PDE (2).
To find probability kernels pt, t > 0, on E such that for t > 0 " etLf(x)
= " J f(y)Pt(x,dy)
=:
Ptf(x)
and V (p E TC%> ptip(x) - ip(x) = / ps(L
V x G Efj, C E,
(6)
524
MICHAEL ROCKNER
with ^(Efj,) = 1. (6) means that we have a pointwise solution of the Kolmogorov equation on E^. Furthermore, try to prove pt (Ciw) (E)) C C{bw) (E)
"Feller-property"
or even pt(Bb(E))
C Cb{E)
"strong Feller-property",
where Cb(E), Cb(E) denote the set of real-valued bounded continuous, weakly sequentially continuous functions on E respectively. Bb (E) denotes the set of bounded Borel measurable functions on E. Reference: e.g. [14]. Step 4: Weak solutions of SDE (1). Construct a (strong Markov) process with continuous sample paths on E^ with transition semigroup (pt)t>o, which uniquely solves SDE (1) for all starting points x G E^. Reference: [27]. Step 5: Characterization of E^. Typically, we have that E^ is the topological support of /x, which often can be identified as all of £ or a well-described subset thereof, as e.g. a closed ball. In this case everything is independent of the chosen measure /x. We refer to [14] for examples. Step 6: Finite dimensional approximations of solutions. Construct a Galerkin-type approximation of PDE (2), to calculate pt on E as a limit for iV —> oo of solutions p^ for corresponding finite dimensional Kolmogorov equations (with Lipschitz coefficients) on EN '•= span{ei,..., e^v}-
5. Some results and consequences The above programme has been (a) completely implemented e.g. for — stochastic generalized Burgers equations (see [22]); — 2D-stochastic Navier-Stokes equation (see [23]); (b) partially implemented e.g. for — stochastic porous media equations (see [15], [16], [7]); — infinite particle systems in Rd (or in a Riemannian manifold) with singular interactions (see [5]); — 3D-stochastic Navier-Stokes equation (work in progress, e.g. existence of infinitesimally invariant measures has already been proved in [9]). Then: Analysis of solutions of (1) and (2) (We only give very recent references, i.e., > 2003) For example:
An analytic approach to Kolmogorov's equations in infinite dimensions ...
525
— large time asymptotics and invariant measures, e.g., existence and uniqueness of invariant measures for non-linearities F of gradient type (i.e., the Gibbsian case), see [3], [1], [2], also [11] with respect to the classical problem whether invariance => Gibbsian; — small time asymptotics and large deviations, [18], [26]; — small noise limits and large deviations, [12] (for paths), [13] (for the invariant measures); — scaling limits [17]; — explicitly time dependent coefficients [8]; — spectral properties of the Kolmogorov operator L, [24], [25]; — Yamada-Watanabe principle and existence of strong solutions, [19]; — non-local Kolmogorov operators, [20]; etc.
Acknowledgement Financial support of the BiBoS-Research Centre and the German Science Foundation (DFG) through the DFG-Research Group "Spectral Analysis, Asymptotic Distributions and Stochastic Dynamics" is gratefully acknowledged. We also thank the Faculty of Mathematics at Bielefeld University for the substantial contribution to the travel expenses for participating in the ICMP.
References 1. S. Albeverio, Y. G. Kondratiev, Y. Kozitsky, M. Rockner, BiBoS-Preprint No. 02-12-104, to appear: Comm. Math. Phys. (2003). 2. S. Albeverio, Y. G. Kondratiev, Y. Kozitsky, M. Rockner, Phys. Rev. Lett. 90, 170603 (2003). 3. S. Albeverio, Y. G. Kondratiev, T. Pasurek, M. Rockner, BiBoS-Preprint No. 02-05-086, to appear: Ann. Prob. (2003). 4. S. Albeverio, Y. G. Kondratiev, M. Rockner, J. Fund. Anal. 154, 444 (1998). 5. S. Albeverio, Y. G. Kondratiev, M. Rockner, J. Fund. Anal. 157, 242 (1998). 6. S. Albeverio, Y. G. Kondratiev, T. V. Tsikalenko, M. Rockner, J. Fund. Anal. 171, 366 (2000). 7. V. I. Bogachev, G. Da Prato, M. Rockner, preprint: "Invariant measures of stochastic porous medium type equations" (2003). 8. V. I. Bogachev, G. Da Prato, M. Rockner, BiBoS-Preprint No. 02-10-100, to appear: Proc. London Math. Soc. (2003). 9. V. I. Bogachev, M. Rockner, Prob. Th. Rel. Fields 120, 445 (2001). 10. V. I. Bogachev, M. Rockner, F.-Y. Wang, J. Math. Pures Appl. 80, 177 (2001). 11. V. I. Bogachev, M. R'ockner, F.-Y. Wang, BiBoS-Preprint No. 02-12-106 (2003). 12. S. Cerrai, M. Rockner, BiBoS-Preprint No. 02-02-074, to appear: Ann. Prob. (2003). 13. S. Cerrai, M. Rockner, FgWeb-Preprint No. 03-038 (2003). 14. G. Da Prato, M. Rockner, Prob. Th. Rel. Fields 124, 261 (2002). 15. G. Da Prato, M. Rockner, preprint "Invariant measures for a stochastic porous medium equation", to appear: Proc. of Conf. in Honour of K. ltd, Kyoto 2002 (2003). 16. G. Da Prato, M. Rockner, preprint "Weak solutions to stochastic porous media equations" (2003). 17. M. Grothaus, Y. G. Kondratiev, E. Lytvynov, M. Rockner, Ann. Prob. 31, 1494 (2003). 18. Y. G. Kondratiev, E. Lytvynov, M. Rockner, Publ. Res. Inst. Math. Sci. Kyoto University 39, 1 (2003).
526
MICHAEL ROCKNER
19. N. V. Krylov, M. Rockner, preprint "Strong solutions of stochastic equations with singular time dependent drift" (2003). 20. P. Lescot, M. Rockner, BiBoS-Preprint No. 02-03-076, to appear: Pot. Anal. (2003). 21. M. Rockner, Led. Notes Math. 1715, 65 (1999). 22. M. Rockner, Z. Sobol, preprint "Generalized stochastic Burgers equation: regularity theory for their Kolmogorov operators and applications" (2003). 23. M. Rockner, Z. Sobol, preprint "Regularity of solutions to the Kolmogorov equations corresponding to the stochastic 2D-Navier-Stokes equation" (2003). 24. M. Rockner, F.-Y. Wang, FgWeb-Preprint No. 01-019, to appear: Bull. London Math. Soc. (2003). 25. M. Rockner, F.-Y. Wang, BiBoS-Preprint No. 02-05-088, to appear: J. Fund Anal. (2003). 26. M. Rockner, T.-S. Zhang, BiBoS-Preprint No. 02-07-094, to appear: Publ. Res. Inst. Math. Set., Kyoto University (2003). 27. W. Stannat, Memoirs AMS 678, (1999). Preprints listed are available from the following web servers: http://www.mathematik.uni-bielefeld.de/fgweb/ and http://www.physik.uni-bielefeld.de/bibos/.
A functional integral applied to topology and algebra AMBAR
N.
SENGUPTA
(Louisiana State U.)
The purpose of this article is primarily to give an account of the ideas connecting the Yang-Mills functional integral to certain classical formulas of Frobenius and Schur in algebra and to the moduli space of flat connections.
1. Introduction Consider: — the task of counting the number of solutions of equations of the type 2 2
2 _
X j X 2 • • • xk — e
or of the type xiyiXi1Vi1
• • • XkykX^y^1
= e
for xi, j / i , . . . , x k , Vk in a given group; — the Yang-Mills functional integral
JA where SYM is the Yang-Mills action and the integration is over the space A of connections over a surface and t a positive parameter; — the moduli space of flat connections over a surface or the representation variety for the surface in a given group. Though not obvious at first glance, these ideas are all connected with each other. In the following sections we shall give an account of some of these ideas and indicate how everything ties together. The relation between the algebraic problem of counting solutions of equations in a group with two-dimensional Yang-Mills theory has been explored in several works by Mulase et al. [6,7]. (We shall not describe relations with Chern-Simons theory.) Ideas from quantum Yang-Mills may be useful in understanding representation varieties of finitely generated groups. For the case of surface groups, it still remains to carry out a complete study of the topological and symplectic structure of the various strata of the moduli space of flat connections from the quantum Yang-Mills point of view, though much progress has been made (initiated by Witten's works [11,12]).
2. Counting solutions of equations in a finite group Let G be a finite group. Is every element of G expressible as a commutator xyx~1y~1? Certainly not always. So one may well ask how many solutions there are of the equation
527
528
AMBAR N. SENGUPTA
xyx~1y~1 = w, for a given w £ G. Of course, one may also consider other equations; for example, solving x\---x\ = e for x\,..., Xk € G with e being the identity. Questions of this type were considered extensively by Frobenius [2] (see, for example, equation (13) in [3], and the paper with Schur [4]), and, later and in a different context, by Mednykh [5]. Here we shall consider the question discussed by Frobenius and Schur [4]. Let G be the set of all equivalence classes of complex irreducible representations of G. As noted in [4], the elements a of G fall into three different types, according to the value of the function c(a) = ±-J2x«(x2)
(1)
where \G\ is the number of elements of G and Xa is the character of the representation a. The following result is from equation (9) in [4]: Theorem 2.1. For a finite group G, the number of solutions (xi,... equation 2
,Xk) £ Gk of the
2
V'Zfc
is equal to \G\
k-l
J2
(dima)2-fc+
{a6G:c(a)=l}
^
(-dime*) 2 "*
(2)
{a€G:c(a)=-l}
This 1906 result of Frobenius and Schur is purely algebraic, but there is surface topology hidden in it. The following special case of a 1978 result of Mednykh [5] brings out the role of topology: Theorem 2.2. With notation and hypotheses as above, the number of solutions xg,yg) £ G2g of the equation xiyix^Vl1
• • • XgygX^y-1
=e
(x\,yi,...,
(3)
is equal to |G|i-x(S)^(dima)X(s)
(4)
aeG where x(^) = 2 — 2g is the Euler characteristic of a closed oriented surface S of genus g, this being the surface whose fundamental group is generated by elements <2i,&i,... ,ag,bg satisfying the relation ai&ia^&J"1 • • • agbga^bg1 = e. (5) The main technique used by Frobenius and Schur is the expansion of central functions on G in terms of characters. Write the "delta function" 8e for the group G as a "Fourier" series in terms of the characters Xa of G. Then the number of solutions is obtained by summing 8e{x\ ... x\) over all x i , . . . , Xk- Using the Schur identities for convolution of characters one gets the formula for the number of solutions. Let us see how this works for the commutator equation (3). Let V be the number of solutions of the equation (3). Then V=
£ {xu...,yg)€G^
Se([x1,y1]---[xg,yg])
(6)
A functional integral applied to topology and algebra
529
where 8e(x) = 1 if x = e and is 0 if x ^ e. Now expand the delta function as a sum over characters:
where da is the dimension dim(a) of the representation space for the representation a. For convenience we use integration over G with unit-mass Haar measure (of course, this is just averaging over G). Then, substituting the expansion of the delta function into (6), we have V=
Xa([xi,Vi]---[xg,yg])dx1...dyg\G\2B.
YsT^\
(8)
a€G
The integral in the summand works out to /
Xa([xi,yi]---[xg,yg])dxi...dyg
JG2n
=
-^da, da
by using (as was done by Witten [11]) the following consequences of the Schur-relations on orthogonality of matrix entries of representations: / Xa(aba~1c)da=—xa(b)x<x{c) JG
and
a
a
/ Xa(ab)xp{b~1c)db = JG
-j-da0Xa(ab).
a
a
(9)
Putting everything together gives Mednykh's formula:
v
= E w\ i ^ i G i 2 9 = iG'29_1 E <~29a
a
a£G
Mulase, Penkava and Yu [6,7] have generalized these ideas to the case of Lie groups G and studied the "volume" (in place of the number of elements) V of the the set of solutions of equations such as (3). They also extend these ideas to producing invariants for finitely generated groups (not necessarily surface groups).
3. Euclidean q u a n t u m Yang—Mills over a surface Let E be a two-dimensional Riemannian manifold, G a compact, connected, semisimple Lie group with Lie algebra LG equipped with an Ad-invariant metric. Consider the space A of smooth LG-valued 1-forms A over E, viewed as connections on a trivial bundle over E. Let FA = dA + ^[A,A] be the curvature two-form of A. Then the Yang-Mills action SYM(A) is \ \\FA|2, where we have the L 2 -norm of FA defined using the Riemannian area measure on £ and the metric on LG. Let C\,. • •, Ck be piecewise smooth loops on E, all based at the same point o (for convenience), and let /i(C,-; A) denoted the holonomy of A around C*. Let / be a bounded measurable function on Gk. In Euclidean quantum Yang-Mills over E the central objects are the Wilson loop expectation values
-j— /
f(h(C1;A),...,h(Ck;A))e-s™(AVtDA,
(10)
530
AMBAR N. SENGUPTA
where t > 0 is a parameter, DA is the (rigorously non-existent) "Lebesgue measure" on A and Z t (E) is the "partition function" Zt(T) = f
e-^W/tDA.
JA
Despite the presence of mathematically fictitious entities such as the infinite-dimensional "Lebesgue measure" DA, the loop expectation value can be given a rigorous meaning. We give a very brief outline here. In one approach [8], the Yang-Mills integral is constructed by means of a conditioned Gaussian measure and the holonomies are defined by re-interpreting the classical equation of parallel transport as a stochastic differential equation (this idea being due to L. Gross). Another approach, developed extensively by Thierry Levy [10], is to construct the measure rigorously from heuristically deduced values of the loop expectations. Let r be a triangulation of E such that the curves C\,..., Ck are made up of edges of T, and let Ar is the set of maps x from oriented edges of T to G such that x(b) = xQ})-1 for each oriented edge b. Then the Wilson loop expectation value works out to be j ^ j
A
f(x(Ci),...,x(Ck))dvt(x)
(11)
where ut is a certain measure on Ar defined in terms of the heat kernel on G, and Nt(E) is the finite-dimensional version of the partition function and is given by NtCE)=
f
JGT
Qt\s\(Wv(y1,...,yr))dy1...dyr.
(12)
Here Qs(z) is the heat kernel on G, |E| the Riemannian area of E, the integration is with respect to normalized Haar measure and the function WE is specified by the requirement that the fundamental group 7i"i(E) be generated by elements y[,..., y'r subject to the relation that Ws(y[, ...,y'r)be the identity. The numerator in (11) can also be viewed as a "partition function."
4. The representation variety and the moduli space of flat connections Consider again a surface E, a compact, connected, semisimple Lie group G with Lie algebra LG equipped with an Ad-invariant metric. Suppose first that E is oriented of genus g. Then its fundamental group 7Ti(E) is generated by elements A\,Bi,..., Ag, Bg subject to the relation that Wz(A\,..., Bg) is the identity, where WxiAu. ..,Bg) = B^A^BgAg • • • B^A^B^ . (13) If E is unorientable then ir\ (E) is generated by elements A\,..., that Ws(Ai,. • • ,Ak) is the identity, where now Wz(A1,...,Ak)
= Al---A2l.
Ak subject to the relation (14)
Let i?(E, G) be the set of all homomorphisms of 7Ti(E) into G. This is the representation variety for E in G. There is a natural conjugation action of G on i?(E, G), and the quotient Mx=R(Z,G)/G
A functional integral applied to topology and algebra
531
is identifiable with the moduli space of flat G-connections over £ (see [9] for details). A flat connection over £ is specified up to gauge equivalence by the holonomies around the loops generating 7ri(E), subject to the constraint imposed by the relation WE between these generators. The moduli space Ms is of great interest from many different points of view, and it has a natural volume measure. Witten's classic papers [11,12] offer many insights into the calculation of the volume of the space M-£. One idea used by Witten was to obtain this volume as a limit of the Yang-Mills partition function Zt(E) with t I 0. In the case of closed oriented E of genus > 2, this approach has been used rigorously in [9] to compute the volume of Ms. Here we simply state the result (which follows from (16) below): vol(.M s ) = \Z(G)\vol(G)2g-2J2(dima)2~23
( 15 )
a
where the sum is over all equivalence classes of irreducible representations a of G, and \Z(G)\ is the number of elements in the center Z(G) of G. Comparing the Witten volume formula (15) with Mednykh's formula (4) reveals an amazing similarity! Indeed, as explained by Mulase et al. in [6,7], this mysterious similarity is clarified once one realizes that using the character expansion of the delta function for the group G is exactly what is going on in the Yang-Mills partition-function approach to computing the volume of M%. In order to examine in slightly greater detail how the limiting form of the Yang-Mills partition function relates to the moduli space of flat connections we introduce some notation. Let Kg : G2g —• G be the product of commutators given by Kg(ai,bi,...,ag,bg)
= b~la~ 1bgag • •
-b^a^biai.
Then the subset K~1(e) is identifiable with the representation variety Hom(-7ri(E), G). The group G acts by conjugation on K~1(e) c G2g. The quotient K~x{e)/G is identifiable with the moduli space of flat connections over E: this arises from associating to each flat connection u its holonomies around the loops Ai,Bi,... ,Ag,Bg. The action of G on K~1(e) is in general not free; the subset Kg~1(e)° consisting of all points where the conjugation action has isotropy Z{G) (the center of G) yields a smooth manifold K~1(e)°/G as quotient. This quotient may, of course, be viewed as a subset . M 2 of the moduli space of flat connections. On this moduli space there is a smooth symplectic structure arising from a natural symplectic structure (due to Atiyah and Bott) on the space A of all G-connections over E. With these notions in place we have the following result from [9]: Theorem 4.1. Suppose g > 2. Let f be a continuous G-conjugation-invariant function on G2g, and f the function induced on M° = K~1(e)°/G. Then lim/
f(x)Qt(Kg(x))dx=VO]f]*~29
j
/dv<%,
(16)
where the integration on the left is with respect to unit mass Haar measure, the integration on the right is with respect to the symplectic volume measure, \Z(G)\ is the number of elements in the center Z(G) of G, and vol(G) is the volume of G with respect to the Riemannian structure on G given by the Ad-invariant metric on LG.
532
AMBAR N. SENGUPTA
T h e exact value (15) is obtained from (16) with / = 1 by expanding the heat kernel Qt in characters of G and carrying out t h e integrations using the Schur convolution formulas. Writing the limit l i m t | 0 Qt formally as t h e delta function 6e reveals t h a t this is exactly the extension to the Lie group case of the Probenius-Schur strategy (which was for t h e case where G is a finite group and £ unorientable). T h e r e are some substantial technical difficulties in proving t h e limit formula (16). Some of these stem from the quotienting in R(T,, G)/G, i.e., from the fact t h a t t h e action of G on i ? ( S , G) is not free. In the context of Mulase et al. one is simply working out /
Se(W^(ai,...,bg))dai
...dbg
and so the troubles with quotienting are absent.
Acknowledgments I am thankful to Jean-Claude Zambrini and the other organizers of I C M P 2003, especially Sergio Albeverio and Gerard Ben-Arous, for the opportunity to speak here. Financial support from the I C M P and the US National Science Foundation grant DMS 0201683 is gratefully acknowledged.
References 1. S. Albeverio, A. Hahn, A. N. Sengupta, "Rigorous Feynman path integrals, with applications in quantum theory", preprint (2003). 2. F. G. Frobenius, Gesammelte Abhandlungen Band III, Springer-Verlag, 1968. 3. F. G. Frobenius, "Uber Gruppencharaktere", Sitzung der Koniglich Preussischen Akademie der Wissenschaften zu Berlin 985 (1896). 4. F. G. Frobenius, I. Schur, Uber die reellen Darstellungen der endlichen Gruppen, Sitzung der Koniglich Preussischen Akademie der Wissenschaften zu Berlin 186 (1906). 5. A. D. Mednykh, Soviet Mathematics Doklady 19, 318 (1978). 6. M. Mulase, M. Penkava, Volume of Representation Varieties, 2002. 7. M. Mulase, J. Yu, "Non-commutative matrix integrals and representation varieties of surface groups in a finite group", preprint (2002). 8. A. Sengupta, "Gauge theory on compact surfaces", Memoirs of the Amer. Math. Soc. 126 (1997). 9. A. Sengupta, "The volume measure for fiat connections as limit of the Yang-Mills measure", to appear in J. Geometry and Physics (2003). 10. T. Levy, "Yang-Mills measure on compact surfaces", to appear in Memoirs of the Amer. Math. Soc. (2003). 11. E. Witten, "On quantum gauge theories in two dimensions", Comm. Math. Phys. 141, 153-209 (1991). 12. E. Witten:, "Two dimensional quantum gauge theory revisited", J. Geom. Phys. 9, 303-368 (1992).
Quantum field theory Session organized by D.
BUCHHOLZ
(Gottingen) and J.-B.
ZUBER
(Saclay)
This page is intentionally left blank
Conformal field theories in random domains and Stochastic Loewner evolutions DENIS BERNARD
(SPhT Saclay)
We review the recently developed relation between the traditional algebraic approach to conformal field theories and the more recent probabilistic approach based on stochastic Loewner evolutions. It is based on implementing random conformal maps in conformal field theories.
1. Introduction Fractal critical clusters are hallmarks of criticality, as it may be illustrated by considering the (J-state Potts models whose lattice partition functions are: Z=
^2 exp J ^ ^ ( r ) , s ( r ' } • {s(r)} *- rUr' -I
The sum is over all spin configurations and r U r' refers to neighbor sites r and r' on the lattice. The spin s(r) takes Q possible values. By expanding the exponential factor using exp[J(5 s ( r ) iS ( r ') — 1)] = (1 —p) +p5 s ( r ) )S ( r /) with p = 1 — e"J, these partition functions may be rewritten as sums over cluster configurations
Z=
eJLJ2pllCHl-P)L~llC]lQNc c
where L is the number of links of the lattice, Nc the number of clusters in the configuration C and ||C|| the number of links inside the Nc clusters, usually called FK-clusters. Criticality is then encoded in the fractal nature of these clusters. The stochastic Loewner evolutions (SLE) [2] are mathematically well-defined processes describing the growth of random sets, called the SLE hulls, and of random curves, called the SLE traces, embedded in the two-dimensional plane. The growths of these sets are encoded into families of random conformal maps satisfying specific evolution equations. Their distribution depends on a real parameter K. The connexion between critical systems and SLE growths may intuitively be understood as follows: imagine considering the Q-state Potts models on a lattice covering the upper half plane with boundary conditions on the real line such that all spins on the negative real axis are frozen to the same identical value while spins on the right of the origin are free with non assigned values. Then, in each configuration there exists a FK-cluster growing from the negative half real axis into the upper half plane whose boundary starts at the origin. In the continuum limit, this boundary curve is conjectured to be statistically equivalent to a SLE trace. The SLE parameter K is linked to the Potts parameter by Q = 4COS 2 (47T/K), with K > 4. See figure 1. The aim of this note is to described a precise connexion, which we have developed in refs. [5-7], between the traditional algebraic approach to conformal field theories and the
535
536
DENIS BERNARD
Figure 1. A FK-cluster configuration in the Potts model. The SLE trace is the boundary of the FK-cluster connected to the negative real line.
probabilistic approach based on stochastic Loewner evolutions. The main point consists in considering conformal field theories on random domains defined as the complements of the growing random SLE hulls. Although we illustrate this connexion using the chordal version of SLE, we shall also touch upon the generalization to the radial SLE.
2. (Chordal) stochastic Loewner evolutions Given a simply connected domain U in the complex plane, stochastic Loewner evolutions (SLE) describe the growth of random curves emerging from the boundary of U. There are two cases depending whether these curves connect two points on the boundary 3U (for the chordal SLE), or one point on the boundary and one in the bulk of U (for the radial SLE). We shall mainly deal with the chordal case, except in the last section. To be more precise, let a hull in the upper half plane I = {z € C, 9 m z > 0} be a bounded subset K c i such that M \ K is open, connected and simply connected. The local growth of a family of hulls K t parametrized by i £ [O^I w-ith Ko = 0 is related to complex analysis as follows. By the Riemann mapping theorem, Mt = H \ K t , the complement of Kt in H, which is simply connected by hypothesis, is conformally equivalent to M via a map ft. This map can be normalized to behave as ft(z) = z + 2t/z + 0(l/z2), using the PSL2(R) automorphism group of H. The crucial condition of local growth leads to the Loewner differential equation 2 dtft(z)
= -r-r-r
Mz) with £( a real function.
For fixed z, ft(z)
, _
ft=0(z)
= Z,
(1)
st is well-defined up to the time TZ for which
(Chordal) stochastic Loewner evolutions is obtained [2] by choosing £t = s/HBt with Bt a normalized Brownian motion and K a real positive parameter so that E[£ t £ s ] = «min(t, s). The SLE hull is reconstructed from ft by Kt = {z € H : TZ < t) and the SLE trace 7[o,t] by 7(f) = lime_>0+ / t _ 1 (£t + if)- Basic properties of the SLE hulls and SLE traces are described in [2-4]. In particular, 7[0)t] is almost surely a curve. It is non-self intersecting and it coincides with K t for 0 < K < 4, while for 4 < K < 8 it possesses double-points and it does not coincide with Kt. For establishing contact with conformal field theories (CFT), it is useful to define kt(z) =
Conformal field theories in random domainsiand Stochastic Loewner evolutions
537
ft{z) — £t which satisfies the stochastic differential equation 2 d t
AV
AC
The conditions at spatial infinity satisfied by kt imply that its germ there belongs to the group iV_ of germs of holomorphic functions at oo of the form z + J2m<-i fmZm+1- The group N- acts on itself by composition, 7/ • F = F'o / for / G iV_, and jgof = 7/ • 7 9 - In particular, to kt we can associate fkt G JV_, which satisfy, by Ito's formula: dlkt -F=
'2dt (7fct - F ' ) ( ^ -d&)
+|{lkt
-F").
Alternatively, this may be read as:
Ik! • d^t = dt{~zd* + ffi) - dttdz.
(2)
The operators /„ = — zn+1dz are represented in conformal field theories by operators Ln which satisfy the Virasoro algebra trit: [L„,Lm] = (n - m)Ln+m
+ j^(n3 - n)5n+mfi,
[c,Ln] = 0.
The representations of trit are not automatically representations of N_, one of the reasons being that the Lie algebra of iV_ contains infinite linear combinations of the Z„'s. However, as we shall explain in the next section, highest weight representations of trit can be extended in such a way that iV_ get embedded in a appropriate completion U(n-) of the enveloping algebra of some subalgebra n_ of trit. This will allows us to associate to any jf G Nan operator Gf acting on appropriate representations of Oit and satisfying Ggoj = GfGg. Implementing this construction to kt yields random operators Gkt G W(n_) which satisfy the stochastic Ito equation: G^dGkt
= dt(-2L-2
+ f i - i ) + dttL-u
(3)
Compare with equation (2). This may be viewed as defining a Markov process in W(n_). Since Gkt turn out to be the operators intertwining the conformal field theories in H and in the random domains H(, the basic observation which allows us to couple CFTs to SLEs is the following [5]: Let \uj) be the highest weight vector in the irreducible highest weight representation of tut of an central charge cK = d conformal weight hK = h\-2 = ^ f . 2 K ~ Then E[Gjtt \u) \{Gku}u<s} is time independent for t> s and: E [ G f c » l{G feu }„< s ] = G f c »
(4)
This result is a direct consequence of equation (3) and the null vector relation (—2L_2 + f L i i ) |w) = 0 so that dGkt \w) = GktL^x \<J) d£,t. This result has the following consequences. Consider CFT correlation functions in Mt. They can be computed by looking at the same theory in H modulo the insertion of an
538
DENIS BERNARD
operator representing the deformation from H to Hit, see the next section. This operator is Gfct. Suppose that the central charge is cK and the boundary conditions are such that there is a boundary changing primary operator of weight hK inserted at the tip of Kt. Then in average the correlation functions of the conformal field theory in the fluctuating geometry H t are time independent and equal to their value at t = 0. The state Gkt \u>) may be interpreted as follows. Imagine defining the conformal field theory in Btt via a radial quantization, so that the conformal Hilbert spaces are defined over curves topologically equivalent to half circles around the origin. Then, the SLE hulls manifest themselves as disturbances localized around the origin, and as such they generate states in the conformal Hilbert spaces. Since Gkt intertwines the CFT in H and in H t , these states are Gkt |w) with |w) keeping track of the boundary conditions. See figure 2. .MZL
a
B
Figure 2. A representation of the boundary hull state and of the map intertwining different formulations of the CFT.
3. Conformal transformations in CFT and applications The basic principles of conformal field theory state that correlation functions in a domain U are known once they are known in a domain Uo and an explicit conformal map / from U to Uo preserving boundary conditions is given. Primary fields have a very simple behavior under conformal transformations: for a bulk primary field
{..^{z,z)--^{x)---)v
= {...V{f{z),J{^)f\z)hJ^h-.^(f{x))\r{x)\5---)Vo.
(5)
Infinitesimal deformations of the underlying geometry are implemented in local field theories by insertions of the stress-tensor. In conformal field theories, the stress-tensor is traceless so that it has only two independent components, one of which, T(z), is holomorphic (except for possible singularities when its argument approaches the arguments of other inserted operators). The field T(z) itself is not a primary field but a projective connection, <• • • T(z) •••>„ = (••• T{f{z))f{zf
+ -^Sf(z)
••^^ ,
with c the CFT central charge and Sf{z) = (77M)' - \ (f77~l) the Schwartzian derivative of / at z. This applies to infinitesimal deformations of the upper half plane. Consider an infinitesimal hull Ke;M, whose boundary is the curve x —> e7r/j,(x), x real and e < 1, so that ^c;n = {z = x + iy £ W, 0 < y < enfi(x)}. Assume for simplicity that Ke;M is bounded
Conformal field theories in random domains and Stochastic Loewner evolutions
539
away from 0 and oo. Let He;|U = H \ K e ; / / . To first order in e, its uniformizing map onto H is
z + e[!M*y+oie). z-y
JR
To first order in e, correlation functions in Me;M are related to those in H by insertion of T: d_ de ((•••9(z,z)---i;(x)
•••))„
= ( dyix{y) {T{y)(;--
(6)
JR
With the basic CFT relation [1] between the stress tensor and the Virasoro generators, T(z) = ^2n Lnzn~2, this indicates that infinitesimal deformations of the domains are described by insertions of elements of the Virasoro algebra. Finite conformal transformations are implemented in conformal field theories by insertion of operators, representing some appropriate exponentiation of insertions of the stress tensor. Let U be conformally equivalent to the upper half plane H and / the corresponding uniformizing map. Then, following [6], the finite conformal deformations that lead from the conformal field theory on U to that on EI can be represented by an operator Gf. (• • •
.
This relates correlation functions in U to correlation functions in H where the field arguments are taken at the same point but conjugated by Gf. Radial quantization is implicitly assumed in equation (7). Compare with equation (5). The following is a summary, extracting the main steps, of a construction of Gf described in details in [6]. To be more precise, we need to distinguish cases depending whether / fixes the origin or the infinity. We also need a few simple definitions. We let trit be the Virasoro algebra generated by the Ln and c, and n_ (resp. n+) be the nilpotent Lie subalgebra of trit generated by the L n 's, n < 0 (resp. n > 0), and by b_ (resp. b+) the Borel Lie subalgebra of trit generated by the L n 's, n < 0 (resp. n > 0) and c. We denote by W(n_) (resp. U(n+)) appropriate completion of the enveloping algebra of n_ (resp. n+). We shall only consider highest weight vector representations of the Virasoro algebra. F i n i t e deformations fixing 0. Let ./V+ be the space of power series of the form z+^2m>1 fmzm+1 which have a non vanishing radius of convergence. With words, iV+ is the set of germs of holomorphic functions at the origin, fixing the origin and whose derivative at the origin is 1. In applications to the chordal SLE, we shall only need the case when the coefficients are real. But it is useful to consider the fm's as independent commuting indeterminates. •/V+ is a group for composition. Our aim is to construct a group (anti)-isomorphism from N+ with composition onto a subset Af+ C W(n+) with the associative algebra product. We let N+ act on Oo, the space of germs of holomorphic functions at the origin, by 7/ • F = F o f for / € N+ and F e Oo- This representation is faithful and 7 ff0 / = Iflg- We need to know how jf varies when / varies as / —> f+ev(f) for small e and an arbitrary vector field u. Taking g = z + ev(z) in the group law leads to jf+ev^F = ryfF + e'yf(v-F)+o(e), where v • F(z) = v(z)F'{z) is the standard action of vector fields on functions. Using the Lagrange inversion formula to determine the vector field v corresponding to the variation
(7)
540
DENIS BERNARD
of the indeterminate fm yields: 1 dlf__ y - (jdWwm+l d
f™
f'H
„tlV/o2i7r n>m
\ zn+ld
/H«+V
This system of first order partial differential equations makes sense in W(n+) if we replace ln = — zn+1dz by Ln. So, we define a connection Am in W(n+) by
^«-E^(i^»" x
n>m
"u
^
v
y
which by construction satisfies the zero curvature condition, dAi/dfk — dAk/dfi = [Ak, A{\. We may thus construct an element Gf € 7V+ C W(n + ) for each / e N+ by solving the system | f i = -GfArntf),
m > 1.
(8)
This system is guarantied to be compatible, because iV+ is convex and the representation of N+ on OQ is well defined for finite deformations / , faithful, and solves the analogous system. The existence and unicity of Gf, with the initial condition G/=z = 1, is clear and the group (anti)-homomorphism property, G;Gg = Ggof, is true because it is true infinitesimally and N+ is convex. To lowest orders: Gf = l - / i L i + ?±{L\ + 2L2) - f2L2
+ • • •.
The element Gf, acting on a highest weight representation of the Virasoro algebra, is the operator which implements the conformal map / in conformal field theory. It acts on the stress tensor by conjugation as: G~fl T(z) Gf = T(f(z))
f{zf
+ ^Sf(z),
(9)
a formula which makes sense as long as z is in the disk of convergence of f{z) and Sf(z), but which can be extended by analytic continuation if f(z) allows it. A similar formula would hold if we would have considered the action of Gf on local fields. In particular, by equation (9), Gf induces an homomorphism of the Virasoro algebra by Lm —> Lm(f) = GjlLmGf with GjlT(z)Gf = EmLm(f)z-m-2:
'
m
f
12 V/o 2i7T
M
V
^
n
\JQ2m
/(w)"+V
Equation (8), which specifies the variations of Gf, can be rewritten in a maybe more familiar way involving the stress tensor. Namely, if / is changed to / + 6f with 5f = ev(f), then: SGf = -eGf j^T{z)v{z). If v is not just a formal power series at the origin, but a convergent one in a neighborhood of the origin, we can freely deform contours in this formula, thus making contact with the infinitesimal deformations considered in equation (6).
Conformal field theories in random domains and Stochastic Loewner evolutions
541
Finite deformations fixing oo. All the previous considerations could be extended to the case in which the holomorphic functions fix oo instead of 0. Let JV_ be the space of power series of the form z + S m < - i fmzm+1 which have a non vanishing radius of convergence. We let it act on Ooo, the space of germs of holomorphic functions at infinity, by 7/ • F = F o / . The adaptation of the previous computations shows that
„-ijhf_- v
(I
dw m+1 w
f {w)
'
}zn+1a
We transfer this relation to W(n_) to define an (anti)-isomorphism from AL to N- C W(n_) mapping / to Gf such that
Dilatations and translations. We have been dealing with deformations around 0 and 00 that did not involve dilatation at the fixed point: /'(0) or /'(oo) was unity. To gain some flexibility we may also authorize dilatations, say at the origin. The operator associated to a pure dilatation z —> Xz is X~L°. One can view a general / fixing 0 as the composition f(z) = f'(0)(z + ^2m fmzm+l) of a deformation at 0 with derivative 1 at 0 followed by a dilatation, so that Gf = G///'(o) / ' ( 0 ) - L ° . We may also implement translations. Suppose that f(z) = /'(0)(z + ^ m fmzm+1) is a generic invertible germ of holomorphic function fixing the origin. If a is in the interior of the disk of convergence of the power series expansion of / and f'(a) / 0, we may define a new germ fa(z) = f(a + z) — f(a) with the same properties. The operators Gf and Gfa implementing / and fa are then related by Gfa = e~aL-1 Gf e-^ 0 ^- 1 . Finite deformations around 0 and 00. Consider now a domain HUUB of the type represented on figure 3 which is the complement of two disjoint hulls, the first one, say A, located around infinity but away from the origin, and the second one, say B, located around the origin and away from infinity. The uniformizing map /AUB of MAUB onto H then does not exist at 0 or at 00. Z
H
AuB
2
zl b
Figure 3. A typical two hull geometry.
However, in this situation, we may obtain the map ,/UUB by first removing B by / s , which is regular around 00 and such that /B(Z) = z + O(l) at infinity, and then A = JB(A) by f£ which is regular around 0 and fixes 0 (/~(0) ^ 1 is allowed). Of course, the roles of A and B could be interchanged, and we could first remove A by /A which is regular
542
DENIS BERNARD
around 0 and fixes 0 and then B = /A{B) by fg which is regular around oo and such that
fs(z)=z
+ 0(l).
Suppose that fA and fB are given. There is some freedom in the choice of fA and / j : namely we can replace f^ by ho o f^ where ho 6 PSL2(R) fixing 0, and fg by h^ o fg where hoo S PSL2(R) such that h0O(z) = z + 0(1) at infinity, i.e., a translation. A simple computation shows that generically there is a unique choice of fA and fg such that f§ofA = fA ° / s a n d b o t n e c l u a l t o /AUBFor sufficiently disjoint hulls A and B, as in figure 3, there exists an open set such that for z in this set, both Gj^(Gj^T(z)GfB)GfA and Gj}(GjlT(z)GfA)Gfs are well defined, given by absolutely convergent series, and are both equal to T(fAuB{z))f'AuB{z)2 + -^SfAuB(z). As the modes Ln oiT(z) generate all states in a highest weight representation, the operators GfBG/~ and GfAGf- have to be proportional: they differ at most by a factor involving the central charge c. We write G/BGf~ = Z(A,B) G/AG/-, or G-}lAGjB=Z{A,B)GfsG-l.
(10)
Formula (10) plays for the Virasoro algebra the role that Wick's theorem plays for collections of harmonic oscillators. Since GjA and Gfx belong to A/+ while G/B and Gj~ to A/1, equation (10) may also be viewed as defining a product between elements in A/+ and A/1. Note that G/~ Gjl is clearly well defined in highest weight vector representations of trit. As implicit in the notation, Z(A, B) depends only on A and B: a simple computation shows that it is invariant if JA is replaced by ho° JA and fg by h^ ofB. It may be evaluated as follows. Let As and Bt be two families of hulls that interpolate between the trivial hull and A or B respectively and / A , and fst be their uniformizing map. We arrange that JAS and JBt satisfy the genericity condition, so that unique /ASit and fst>s exist, which satisfy / B , , ° JA, = fAs.t ° /fir Define vector fields by vAs and vBt by dfAJds = vAs(fAa) and
dfBJdt = vBt{fBt). Then [6]:
log Z(Aa, BT) = ±£dsf^
VAa (w)SfBT,. (W)
= -Y2[dti^VBAz)SfA-Az)-
(U)
Z(A, B) may physically be interpreted as the interacting part of the CFT partition function inH\(iUfl). This two hull construction may be used to define operators which are analogues of what vertex operators of dual models are for the Heisenberg algebra. Indeed, consider a hull A whose closure does contain neither the origin nor the infinity. Let us pick the uniformizing map /A of its complement in H onto H which is regular both at the origin and at infinity and such that f'A{0) = f'A(oo). Since it is regular at the origin, we may implement fA in conformal field theory by GA+ /^(0) _ z , ° with GA+ in W + . Since it is also regular at infinity, we may alternatively implement it by GA- / A ( ° ° ) - L ° w^ith GA- £ A/L. The product VA
= GA- G^+
is well defined and non trivial in highest weight representation of oit. It may be thought of as the factorization of the identity since the conformal transformation it implements is the composition of two inverse conformal maps.
Conformal field theories in random domains and Stochastic Loewner evolutions
543
4. Virasoro representations The above formula may be used to define generalized coherent state representations of trit. The key point is to interpret the 'Virasoro Wick theorem', equation (10), as defining an action of trit on M-. This is a reformulation of a construction a la Borel-Weil presented in ref. [7]. R e p r e s e n t a t i o n s a r o u n d infinity. Consider a Verma module V(c, h) and take x ^ 0 its highest weight vector. Let / = z + J2m<-i fm.zm+l G AL and Gf be the corresponding element in N_. The space {Py[f} = (y,Gfx),y G V(c, h)*}, or {Qy[f} = {y,G~jlx),y G V(c,h)*}, is the space of all polynomials in the independent variables / _ i , / _ 2 , . . . . So we have two linear isomorphisms from V(c, h)* to C[/_i, f-2, • • •] and we can use them to transport the action of etc. We denote by Tln and Sn the differential operators such that (Lny,Gfx)
-TZn(y,Gfx),
(Lny,Gj1x)
—
Sn{y,G~jlx),
for y € V*(c, h). By construction the operators 1Zn and Sn are first order differential operators satisfying the Virasoro algebra with non vanishing central charge. To be more precise, let us first consider Py[f] = (y,Gfx) and PLnV[f] = HnPy[f}. We have (Lny,G/x} = (y,L-nG/x). If n > 0, £_„ G n_, and, by the group law, the product L^nGf corresponds to the infinitesimal variation of / generated by /„ = — z1~ndz, namely 5nf{z) = -z1~nf'(z).Iin<0,L —n G b+ so that we need to re-order the product L—nGf in such way that it corresponds to an action of b + associated to a variation of / . This may be done using the Virasoro Wick theorem, G^Gf — Z(cj),f)GfG~1, equation (10), which follows from the commutative diagram
= -z1-nf'(z)+y(f(z))
(12)
where y is fixed by demanding that Snf(z) = o(z) at infinity. Namely, y(w) = ^2k ykWk+1 with j/fc = ^ ^ f?z)k+2f'(z)2. Equation (12) are infinitesimal conformal transformations in the source space generated by £n — —z1~ndz preserving the normalization at infinity. For 4>{z) = z + ezl~n, we also have G^1 = 1 + eL-n and G-1 = 1 + e(Gj1L^nGf)t,+ with (Gj1LnGf)b+ = ^feJ/fe-^fc- The partition function is Z((j),f) = 1 + e£ with £ = zl S z As a Tl foo ^ ~" f( )consequence, {Lny,Gfx) = (C + hyo)(y,Gfx) + ^{y,Gjx)E=0. We thus get a representation of Dit with [6]: ^n = - ^ (
m
+ 1)/m^
Tlo = h- Y2
,
n>l,
C/m-n
m<0 m
fi
dfn
m< — 1
7l_i = -2/_i/i - J2 (mfm-i ~ 52 m<-l V
k+l=m-l
Ml + 2 / - i / - ) /
dfrr
544
DENIS BERNARD
7£_2 = -c/_2/2 - h(4/_a - 3/!x) + J2 (4/-2 ~ V-i)fm-£°fm
m<-l
- J ] ((m-l)/ r o _ 2 - ^ m<-l \
fjfkfl + Bf-1 £
j+fc+J=m-2
/k/Aaf--
fc+(=m-l
/
, m
'
All other 7£n are generated from these ones. A similar construction may be used to deal with Qy[f] — {y,Glxx) giving formulae for QL„y[f] BS a n r s t order differential operator «S„ acting on Qy[f}- Once again the key point is that equation (10) allows to induce an action of trit on A/1. The operators Sn and Hn are of course related as one goes from ones to the others by changing / into its inverse. As a consequence the variation / — > / = / + eSnf induced by Ln is now:
Snf(z) = f(z)1-n-y(z)f'(z),
(13)
where y(z) is fixed by demanding that Snf(z) = o(z), i.e., y(z) = (f(z)1~n/f'(z))+. Equation (13) are infinitesimal conformal transformations in the target space generated by ln = f1~ndf preserving the normalization at infinity. In particular, <Si corresponds to the variation Sif = 1 and S2 to £2/ = 1 / / : *
d
as- 1
* = _£.(£*"755?s0 01, m<-2
The other operators Sn may easily be found, and are explicitly given in ref. [7]. Representations around the origin. The presentation parallels quite closely the case of deformations around 00 so we shall not give all the details. Let / = z + ]T)m>i fmZm+1 be an element of A^+. Consider a Verma module V(c, h) and take x its highest weight vector. The space {(Gfy,x),y G V(c, h)*}, or {{Gj1y,x),y € V(c,h)*}, is the space of all polynomials in the independent variables / 1 , /2, So we again have two linear isomorphisms from V(c, h)* to C[/i, /2, •. •], and we can use them to transport the action of ok. This yields differential operators Vn and Qn in the indeterminates fm such that: (GfLny,x)
= Vn(Gfy,x),
{G~flLny,x)
=
Q^G^y.x),
for y £ V*(c,h). By construction the operators Vn and Q„ satisfy the Virasoro algebra with central charge c. Their expressions are given in [6]. It is interesting to notice the operators Qn, n > 0, coincide with those introduced in matrix models. However, the above construction provides a representation of the complete Virasoro algebra, with central charge, and not only of one of its Borel subalgebras. Applications t o SLE. We are now in position to rephrase the main result, equation (4), in this language. Let %n and Sn be the differential operators define above and consider f — kt the SLE map. Its coefficients / _ i , /_2, •.. are random (for instance / _ ! is simply a Brownian motion of
Conformal field theories in random domains and Stochastic Loewner evolutions
545
covariance K). Because Sn, n > 0, are the differential operators implementing the variation $nf = / 1_ ™, the stochastic Loewner evolution (1) may be written in terms of the Virasoro generators Sn acting on functions J-\f] of the fm: <&[f] = dt (2S2 + ^S\y\f]
- d&
S^[f}.
Consider now the Verma module V(cK,hK), with cK = ( 6 -"X3"- 8 ) a n c j ^ = §=2. Tt j s not irreducible, since ( - 2 L _ 2 + f L ^ x is a singular vector in V(cK,hK), annihilated by the Ln's, n > 1, so that it does not couple to any descendant of x*, the dual of x. The descendants of x* in V*(cK,hK) generate the irreducible highest weight representation of weight (cK, hK). If y is a descendant of x*, (y, Gj(—2L-2 + f L?_1)x) = 0, or equivalently, (2S2 +
^S?){y,Gfx)=0
since, as function of the fm, (y,GfL-nx) — —Sn(y,Gfx) for n > 1. As a consequence, all the polynomials in / _ i , /_2, • • • obtained by acting repeatedly on the polynomial 1 with the 7?.m's (they build the irreducible representation with highest weight (c K ,/i K )) are annihilated by 2S2 + f<5f. For generic K there is no other singular vector in V(cK,hK), and this leads to a satisfactory description of the irreducible representation of highest weight (cK,hK): the representation space is given by the kernel of an explicit differential operator acting on C [ / _ i , / _ 2 , • • •], and the states are build by repeated action of explicit differential operators (the Tim's) on the highest weight state 1. So the above computation can be interpreted as follows: the space of polynomials of the coefficients of the expansion of kt at 00 for SLEK can be endowed with a Virasoro module structure isomorphic to V*(cK, hK). Within that space, the subspace of (polynomial) martingales is a submodule isomorphic to the irreducible highest weight representation of weight {cK,hK).
5. Martingales and crossing probabilities Let us now go to other applications to SLEs. As already mentioned the basic point is equation (4) which says that Gkt |w) is a local martingale. The partition function martingale. The simplest application [6] consists in using results of the previous two hull constructions in the case when B is the growing SLE hull Kt and A is another disjoint hull away from K t and the infinity. Let JA be the uniformizing map of H \ A onto H fixing the origin. Since Gkt |w) is a local martingale, so is MA(£) = (<j|G^Gk t |w). To compute it, we start from / ^ and kt to build a commutative diagram as in the previous section, with maps denoted by / ^ and kt uniformizing respectively kt(A) and /U(K t ) and satisfying kt o fA = / ^ o kt. Then (w\Gj*Gkt\w) Z(A,Kt)
= Z(A,Kt)
(u\G~kt
may be computed using equation (11): logZ(A,Kt)
G^u). = —| J0 rfr5/v4r(0). We have
546
DENIS BERNARD
(w|C?£ Gfl \OJ) — f'-r (0)h"-. Thus the partition function martingale A/^(^) reads: MA{t) = 4 ( 0 ) h - e x p ( - ^ j f ' d T S / A ^ O ) ) . This local martingale was discovered without any recourse to representation theory in [4]. We hope to have convinced the reader that it is deeply rooted in CFT. From it, one may deduce [4] the probability that for K = 8/3 the SLE trace 7[o,oo] does not touch A: P[7[o,oc]nA = 0 ] = / ^ ( O ) 5 / 8 , where /A has been further normalized by ,/U(0) = 0 and JA{Z) = z + 0(l) at infinity. Recall that, for K = 8/3 < 4, the SLE hull kt coincides with the SLE trace 7[0)tj and that it almost surely avoids the real axis at any finite time. Crossing probabilities. Crossing probabilities are probabilities associated to some stopping time events. The approach we have been developing [5] related them to CFT correlations. It consists in projecting, in an appropriate way depending on the problem, the martingale equation, equation (4), which, as is well known, may be extended to stopping times. Given an event £ associated to a stopping time r, we shall identify a vector (vs\ such that {ve\GkT\u)
= le.
The martingale property of Gkt |w) then implies a simple formula for the probabilities: P[5] = E[l £ ] = (ve |w>. For most of the considered events £, the vectors (vs\ are constructed using conformal fields. The fact that these vectors satisfy the appropriate requirements, (vs\ GkT |w) = Is, is then linked to operator product expansion properties [1] of conformal fields. This leads to express the crossing probabilities in terms of correlation functions of conformal field theories defined over the upper half plane. Consider for instance Cardy's crossing probabilities [9]. The problem may be formulated as follows. Let a and b be two points at finite distance on the real axis with a < 0 < b and define stopping times r 0 and n as the first times at which the SLE trace 7[0,t] touches the interval (-co, a] and [b,+oo). The generalized Cardy's probability is the probability that the SLE trace hits first the interval (—oo,a], that is P[r 0 < r t ] . For this event, the vector (VE\ is constructed using the product of two boundary conformal field ipo{a) and Vo(&)i e a c n of conformal weight 0. This leads to the formula for 4 < K < 8: nra
o(a/fr)-<E0(oo) $o(Q)_$o(oo) ,
where $o is the CFT correlation function, which only depends on a/b: $o(a/&) = M Vo(o) ipo(b) H • More detailed examples have been described in [5].
Conformal field theories in random domains and Stochastic Loewner evolutions
547
Our approach and that of refs. [2,4] are linked but they are in a way reversed one from the other. Indeed, the latter evaluate the crossing probabilities using the differential equations they satisfy — because they are associated to martingales — while we compute them by identifying them with CFT correlation functions — because they are associated to martingales — and as such they satisfy the differential equations.
6. (Radial) SLEs We now briefly illustrate how previous results can be adapted to deal with the radial stochastic Loewner evolutions. For CFT convenience, we prefer to view them as describing hulls K( growing outside the unit disc centered at the origin, and not into as usual. Let 1DX be the disc of unit radius centered in x and ITJ>X = C \ U>x be its complement in the complex plane. The Loewner equation for the radial SLE conformal map gt is: dt9t(z) = -fft(z)—rr—ff, gt(z) - Ut
9t=o = z,
(14)
with Ut = e*£', a Brownian motion on the unit circle. As for the chordal case, the SLE hulls are the set of points which have been swallowed: Kt = {z e BO;T 2 < t} with r 2 the swallowing time such that gTz(z) = UTz. Since we view the hulls as growing toward infinity, gt is the uniformizing map of the complement of Kt in B 0 , and it is normalized by gt(z) = e~*z + O(l) at infinity. For making contact with CFT, it is useful to translate the disc by —1 so that the SLE hulls start to be created at the origin and grow into D_i. So we define ht by ht(z) + 1 = Uj~1gt(z +1). Both ht and gt are regular at infinity and, by the results of previous sections, we may associate to them operators Ggt and Ht = Ght in W(b_) which implement these conformal maps in CFT. They are linked by Ht =
e-L-1Ggtei^L°eL-1.
By Ito calculus, Ht satisfies the stochastic equation: H^dHt
= (-2u;_ 2 + ^w2.^
dt + w^ d£t
(15)
with W-i — i(Lo + L-i) and w-2 = — | ( L o + 3 L _ i + 2 L _ 2 ) . These generators have a simple interpretation: W-\ generates rotations of D_i around its center, and u>_2 is the vector generating infinitesimal slits at 0 and away from ID>_i. As in the chordal key remark is the following: Let \LV) be the highest weight vector in the irreducible highest weight representation of Dir of central charge cK = ~ 2« ~ an<^ conformal weight hK ~ hip = ^ T - Let dK = 2^o;i/2 = (6 td si* • Then e~ * Ht \u>) is a local martingale, in particular E[e~tdK Ht |w)] is time independent. This result follows from the fact that the evolution operators Ar = (—2w_2 + f w i i ) read -L22__!1 j) + + nL_ KL_xx{h (hKK - L LQ)-+ A- = ( 2L_ 2 - 'iL 0)•+ L0Q -
-, 20, -L
548
DENIS BERNARD
so that dHt |u>) = dKHt |w) dt + Htw-i \u) d£t with dK — hK — ^h2K. Notice that the radial evolution operators Ar have a triangular structure contrary to the chordal one. This may be used to construct the restriction martingale coding for the influence of deformations of domains on SLE. Let A be a hull in D_i and <J>A be one of its uniformizing maps onto B-i fixing the origin, >A(0) = 0. Given 4>A and ht, we may write in a unique way a commutative diagram
f dsS
By construction, e~td,K G^j Ht \u>) is a local martingale. It is convenient to project it on the state (0| created by the bulk conformal operator of dimension 2/io;i/2 located at infinity. (This is compatible with CFT fusion rules.) Computing (fl\ G7 1 Ht |w) using the commutative diagram yields the martingale MA{t) = e-td" |^(oo)|- 2 h °;v 2 ^ (
0
)^
Zt(A).
Alternatively, since /iJ(oo)>f~ (oo) = /ij(00)^(00) and dK = 2/i0-i/2) this reads: MrA(t) | ^ ( o o ) | - " « =
Zt(A).
It may be further generalized by considering states created by bulk operators of dimension 2 2/IQ ; I/2 + f s but of non trivial spin s. It may be used to evaluate the probability that the radial SLE hull at K — 8/3 does not touch A. More details on the radial SLE will be described elsewhere [10].
Acknowledgments All results described in this note were obtained in collaboration with Michel Bauer. Work supported in part by EC contract number HPRN-CT-2002-00325 of the EUCLID research training network.
References 1. A. Belavin, A. Polyakov, A. Zamolodchikov, "Infinite conformal symmetry in two-dimensional quantum field theory", Nucl. Phys. B 241, 333-380 (1984). 2. O. Schramm, Israel J. Math. 118, 221-288 (2000). 3. S. Rhode, O. Schramm, "Basic properties of SLE", arXiv:math.PR/0106036 , and references therein. 4. G. Lawler, O. Schramm, W. Werner, "Values of Brownian intersections exponents I: half-plane exponents", Acta Mathematica 187, 237-273 (2001); arXiv:math.PR/9911084; G. Lawler, O. Schramm, W. Werner, "Values of Brownian intersections exponents II: plane exponents", Acta Mathematica 187, 275-308 (2001); arXiv:math.PR/0003156; G. Lawler, O. Schramm, W. Werner, "Values of Brownian intersections exponents III: two-sided
Conformal field theories in random domains and Stochastic Loewner evolutions
5. 6. 7. 8. 9. 10.
549
exponents", Ann. Inst. Henri Poincare 38, 109-123 (2002); arXiv:math.PR/0005294; G. Lawler, 0 . Schramm, W. Werner, "Conformal restriction: the chordal case", arXiv:math. PR/0209343. M. Bauer, D. Bernard, "Conformal field theories of stochastic Loewner evolutions", arXiv: hep-th/0210015, to appear in Coram. Math. Phys. . M. Bauer, D. Bernard, "Conformal transformations and the SLE partition function martingale", arXiv:math-ph/0305061. M. Bauer, D. Bernard "SLE martingales and the Virasoro algebra", arXiv:hep-th/0301064, Phys. Lett. B 557, 309-316 (2003). S. Smirnov, "Critical percolation in the plane: conformal invariance, Cardy's formula, scaling limits", C. R. Acad. Sci. Paris 333, 239-244 (2001). J. Cardy, "Critical percolation in finite geometry", J. Phys. A 25, L201-206 (1992). M. Bauer, D. Bernard, "CFTs of SLEs: the radial case", in preparation.
Local string field theory J. DIMOCK (SUNY at Buffalo) We consider open bosonic strings. The non-interacting multi-string theory is described by certain free string field operators which we construct. These are shown to have local commutators with respect to a center of mass coordinate. The construction is carried out both in the light cone gauge and in a covariant formulation.
1. Overview A classical free open bosonic string in Rd is specified by a world sheet X : R x [0, n] which satisfies the wave equation
( d2 -2 \dr
d da2
X*(T,
(1)
with Neumann boundary conditions dX», „ , dX" . , n ^ ( r , 0 ) = —(r)7r)=0 and the constraint
(dX* , dX"\ ±
(dXv , dXv\ ± =
^{-9r -d^)[-w ^-) ^
(2)
n
(3)
where 77M„ is the Minkowski metric in R d with 7700 = — 1, r\kk = +1 for fc = 1 , . . . , d — 1. For a single quantized string in the Heisenberg picture one seeks an operator X satisfying canonical commutation relations as well as the field equation and the constraint. There are two standard ways to proceed. On the one hand one can use the constraint to eliminate extra variables and then proceed with canonical quantization. This only works well when carried out in light cone coordinates and is known as the light cone gauge. On the other hand one can quantize directly ignoring the constraint and then impose the constraint by insisting, that wave functions be annihilated by constraint operators. This is covariant quantization. We discuss both in more detail below. In each case we will be able to find a field equation satisfied by the wave functions \P = $(X). If string coordinate X is split into a center of mass coordinate x11 and internal coordinates X ** = X^ — x11, the wave function can be regarded as a function * = \P(a;, X1). The field equation has the form of a Klein-Gordon equation
( - • + M 2 )* = 0,
(4)
where • = rj^"{d/dx^)(d/dxu) is the d'Alembertian in the center of mass variable and M2 is a mass operator which acts on the internal variables X'. The string field theory describing many strings is obtained by introducing quantized field operators $ = $(x,X') obeying the field equation ( - • + M 2 ) $ — 0. This is second
550
Local string field theory
551
quantization. We carry out this construction in both the light cone gauge and the covariant theory. Our main interest is to show that the fields have a vanishing commutator when the center of mass coordinates are spacelike separated. Formally this is it(x-y)2>0
mx,X'),$(y,Y')]=0
(5)
and we will give a precise version. This report summarizes the results of two papers [1,2]. Earlier results in the physics literature can be found in [3-6]. Notation:
One can write the two-dimensional wave equation as a first order system
'
OT
dr
da2 •
{
'
Suppose we expand in a cosine series as dictated by the boundary conditions. The coefficients are the center of mass coordinates Xfi{r,a)da,
I"(T) = - /
P»{T) = - [
71" Jo
P»{T,a)da,
(7)
7T JO
and the internal coordinates for n = 1,2,... [ X»(T,a) T Jo
X%(T) = —
cosnada,
P£(T)
= —[ P"(T,
(8)
These satisfy dx"/dT=pfl,
dp»/dT = 0,
dx%/dr=rt,
dft/dT = -n2x£.
(9)
2. Light cone gauge Assuming X satisfies the wave equation, boundary conditions, and constraints, we still have many possibilities for parameterizing the solution. To take advantage of this we change to light cone coordinates defined by mapping x = (x°,..., xd~1) to (x + , x~, x) where x± = (x°±xd-1)/V2,
x = (x1,...,xd~2).
A solution is said to be in the light cone gauge if X+(r,a) this gauge we have the following: Lemma 2.1. Let X^{r,a) and
= p+r.
(10) As to the existence of
satisfy the wave equation, boundary conditions, and constraints dX+ P+ = ^ - > 0 .
(10)
OT
Then there exists a conformal diffeomorphism on (r, a) such that in the new coordinates all these conditions still hold and in addition X+ — p+T with p+ > 0.
552
J . DlMOCK
The condition (2.1) says that r is a forward moving parameter in a certain sense. For the proof see [1]. Suppose then we have a solution in the light cone gauge. This choice of gauge and as the constraints can be used to eliminate x+,p~,x^,Pn dynamical variables. If we take + + x = p r as the time parameter we find that the dynamical equations have become dx~ Idx+ = p~~ I'p+, dxk/dx+ k
dx Jdx
+
= pk/p+, k
dp+/dx+=0, dpk/dx+
+
k
= p Jp ,
+
dp Jdx
= 0,
(11) 2
=
k
(-n /p+)x n.
This is a Hamiltonian system with Hamiltonian
p~=^(^+D#+na*»>)-
(l2)
Now we quantize this system by imposing canonical commutation relations on the variables (p+,x~), (xk,pk), (x^jp^). These can be realized as operators on a Hilbert space of Fock valued functions £ 2 (R+ x Rd-2,P,dp+dp/2p+). (13) The operators p+,pk are multiplication operators. The operators x^pf^ space of transverse modes
act on the Fock
F = F(C2'x([0,Tr},Cd-2))-
(14)
Here £ 2 , x means the subspace of C2 orthogonal to the constants and xkn = (2n)-^2((akny
+ ak),
pk = i{n/2fl2{{akny
where (a£ )* is the creation operator for function ( 0 , . . . , fcth slot). The quantum Hamiltonian p~ then has the form P~ = ^(P2
\/2/TT
+ M2),
- akn),
cos na,...,
(15) 0) (entry in the
(16)
where oo
M2^^(:^:+n2:^:)-2a n=l
( (
oo d—2
oo d—2
v.
n=lfc=l
'
(17)
Here we have Wick ordered and allowed an adjustment by an arbitrary constant 2a. In the last line we have introduced the notation more common in string theory a£ = —iy/na^ and
Local string field theory
553
ottn = iy/n(a,n)*. The operator M 2 on J" is a sum of harmonic oscillators. It is self-adjoint and has spectrum —2a, 2 — 2o, 4 — 2a, . . . . The operator p~ is also self adjoint. In the Schrodinger picture our Fock-valued wave functions evolve according to *(x+,p+,p)
= e~ip~x+V{p+,p).
(18)
In configuration space this becomes y{x+,x-,x)
= (27r)- ( d - 1 ^ 2 f e-ip~x+-ip+x~+ipi
V(p+ ,p)dp+ dp/2p+,
(19)
which satisfies the Klein-Gordon equation (2d+d- - A + M 2 ) * = 0.
(20)
This shows that M 2 can be interpreted as a mass operator for the string. This will be our field equation. One can now ask whether the theory is Lorentz covariant. This is difficult because of the special choices that have been made. Nevertheless it is formally true provided d — 26 and a = 1. We do not pursue this, but instead turn to the covariant theory where Lorentz covariance comes naturally.
3. The covariant theory In the covariant theory we seek a quantization without making special choices of coordinates. Now X^ and P M = dX^/dr are quantized by solving the wave equation with the commutation relations at r = 0: [X»{a), Pu(a')} = in6{a - o')rfv.
(21)
We jump right to the solution which is
X"(T,
(22)
where the center of mass operators a;M,jD/t and the internal operators a% are required to satisfy the commutation relations [x»,if] = « T .
[ < , < ] = m6m+nrj^.
(23)
Operators satisfying these relations can be constructed on a Hilbert space of the form C2(Rd,T,dp)
(24)
with p** as a multiplication operator and the a% as creation and annihilation operators on the Fock space ^ = ^(£2'x([0,7r],Cd)). (25) These spaces have indefinite inner products, for example on £ 2 ([0, n],Cd) the inner product is (f,g) = r ^lHp)9vip) Jo
d*.
(26)
554
J. DIMOCK
Now consider the constraint equations which we want to impose as a condition on the wave functions. Passing to Fourier components and Wick ordering one finds that the conditions are Ln^! = 0 for integer n where: 1
°°
n=l
Lm=am-p
+
-
a
^
m-n
-Gin,
™ ¥" 0-
n#m,0
(27)
We allow a shift in Lo —* Lo—a, put aside the constraints for m < 0 (a standard compromise), and ask that ( L 0 - a ) t f = 0, L m $ = 0, m > 0 . (28) These constraints cannot be imposed naively since p2 has continuous spectrum and ^2n a_ n • an has discrete spectrum. To work around this we first decompose our Hilbert space as a direct integral C2(Rd,F,dp)
£2(Vr,f,dnr)dr,
=
(29)
where Vr = {p : p2 +r = 0} is the mass shell and /dr is the Lorentz invariant measure on Vr. Then Lo and L m are decomposable and we have Lo — a = f |(—r + M2) dr where M 2 on T is given by oo
M2 = 2 ^2 <*-n • OLn - 2a.
(30)
71=1
This again has spectrum —2a, 2 — 2a, To get a nontrivial null space for Lo — a we pick the values r = —2a, 2 — 2 a , . . . out of the direct integral and form the direct sum
H = 0 £ 2 ( ^ ( + ) ^ . W = ®Wr. r
(31)
r
Then Lo, Lm act on this space. On a vector ^ = {* r } the constraints are M2*r = r*r,
Lmyr
= 0,
(32)
which can be satisfied. Let
«'=©«;
(33)
r
be the subspace of H satisfying the constraints. We divide out isotropic elements W" = W ' n ( W / ) ± = 0 W r
(34)
r
which are null vectors, and form the physical Hilbert space ftphys = n,/n,i
=
Q ^Phys
(35)
Local string field theory
555
The famous "no-ghost" theorem (see for example [7]) asserts that the inner product is positive definite on Hphys provided d = 26 and a = 1. We make this choice, so that the sum is over r = —2,0,2,... The prize for all this is that one now has a natural unitary representation U(a, A) of the inhomogeneous Lorentz group. Let us exhibit some physical states. Iffiois the empty state in T and / e £ 2 (V_2,C, a!^_2) then *(P) = /(p)Oo (36) is an element of W_2- It satisfies M 2 * = - 2 * and LmVl> = 0 and so is an element of HL2 = Hp_%s- This is a scalar of mass —2 called the tachyon. If / € £ 2 ( V Q + , Cd, dfio) then *(P) = / M ( P ) < « O
(37)
is an element of HQ. It satisfies M 2 * = 0 and if p M / M (p) = 0 it satisfies Lm$! = 0 as well and hence is an element of H'0. Dividing by HQ removes longitudinal states with / M (p) = p^h{p) and we get elements of HQ y s . These are identified as photons and this is essentially the Gupta-Bleuler construction. Higher mass physical states can also be exhibited, see [2]. Finally consider a wave function ^ = {^ r } with at least the first constraint M2^r = r$?r satisfied. Then the Fourier transform *(*) = E
/
^^rip)
dUrip)
(38)
again satisfies the Klein-Gordon equation (-• + M2)$=0. 4. String
(39)
fields
Now we proceed to quantize the two field equations we have identified. Although we have arrived at them in quite different ways they both have the form of a Klein-Gordon equation (—• + M 2 ) $ = 0 for functions on $ : Rd —> J-. (We revert to standard coordinates for the light cone gauge). The difference is in the Fock space T. We have T =
f^(£ 2 - L ([0,7r],C d - 2 )) 2 L
d
\.F(£ - ([0,7r],C ))
light cone gauge, covariant theory.
In the second case there is an indefinite inner product and the remaining constraints Lm$ = 0 must still be satisfied, The mass operators M2 on T are different but have the same spectrum. We discuss the two cases in parallel. First we consider the "classical" equation, i.e., (—• + M2)U = 0 for functions U which are real valued with respect to some conjugation on T. These equations have advanced and retarded fundamental solutions E± which are defined on test functions F £ Co°(R
(^M-p^L^PTB*^)*
(41)
556
J . DlMOCK
where the p° contour T± is the real line shifted slightly above/below the real axis. We will also need the propagator function E = E+-E~.
(42)
Then U = EF solves (—D + M2)U — 0 and has CQ° Cauchy data on any spacelike hypersurface. We say it is a regular solution. Conversely any regular solution U has the form U = EF. If U, V are regular solutions then
a{u,v)=
LS(u{x)' ^{x)) ~ {^{x)'v{x)))ds
(43)
is independent of t by Green's identity. For definiteness take t — 0. Also by Green's identity any regular solution U regarded as a distribution can be expressed in terms of its Cauchy data by (U,F)=a(U,EF). (44) Now we turn to the quantized version. The phase space we want to quantize is the space of Cauchy data for (—0+M2)U = 0 on a spacelike hypersurface, or equivalently the space of regular solutions U itself. The form cr(U, V) is the natural symplectic form on the space and quantization consists of finding operators er($, U) indexed by regular solutions U : Rd —» T such that {cr(
(46)
Then $ satisfies the field equation in the sense of distributions, $ ( ( - D + M 2 ) F ) = 0,
(47)
and using a(EF, EG) = — (F, EG) it has the commutator [$(F),$(G)]
= -i(F,EG).
(48)
This is the structure we want, and in fact one can show that any operator valued distribution $ ( F ) satisfying (47) and (48) arises from a representation cr($,F) of the CCR in this manner. The locality result is now immediate. If F, G e Co°(Rd, T) have spacelike separated supports then (F, EG) = 0 and hence [*(F),$(G)]=0
(locality).
(49)
This completes the abstract discussion for the light cone gauge, but for the covariant theory there are still constraints to be satisfied and the interpretation of the constraints seems to depend on the representation. What representations of the CCR should we consider? What representations might have physical relevance? This is not clear.
Local string field theory
557
If we suppress tachyons there is a distinguished positive energy representation we can consider. The ad hoc suppression of tachyons is not really satisfactory, but it does give some insight and makes contact with point field theory. We give some details in the covariant case; the light cone gauge is similar. Excluding r — — 2 we define as before
H+= 0
£2(V+,JF,d/zr).
Hr= 0
r=0,2,...
(50)
r=0,2,...
For F G C§°(Rd, J") define 11+F <E H+ by (U+F)r(p)
= V^ PrF(p),
PeV+.
(51)
Here Pr is the projection onto the M2 = r subspace of T. Let a, a* be creation and annihilation operators on the Fock space K = J-(H+) and define $(F) = a{Il+F) + a*(Il+F).
(52)
This satisfies (47), (48) and generalizes the positive energy representation for point fields. Continuing with this positive energy representation we impose the constraint by asking for states annihilated by L m $ , actually just the annihilation part of this operator. This turns out to be KJ — T(Ti'+) and dividing out isotropic elements gives /CPhys
=
.F(WPhys)
w h i c h
has a positive definite inner product. The field operators $(F) act on K.phys if H+F e H'+. We call such fields observable fields. The observable fields still have the local commutator [$(F), $(£?)] = — i(F, EG). However now it is not clear whether this can be made to vanish, i.e., it is not clear whether one can satisfy HF 6 H'+ and still have some control over the support of F. Thus the existence of local observables is not settled in this covariant case; in the light cone gauge there is no problem.
5. Comments 1. There is an interesting extension of these results [1], originally due to Martinec [3]. Consider the light cone gauge and change from a Fock representation for the internal degrees of freedom to a Schrodinger representation. The x\ are now independent Gaussian random variables with mean zero and variance (2n) _ 1 and p% = —idjdx\ +inx^. The field equation ( - • + M2)U = 0 now takes the form
((^)2-E(^)2-EE((4)2--^)-2)^0. (53, Suppose we consider functions U = C/(xM, {x£}) which depend only on a finite number of these modes, and so restrict the sum over nton
-{dxy+x>fe)2 fc=l
+ f; f > £ ) 2 .
(54)
n=lfe=l
Test functions for field operators, formerly Fock valued, can now be regarded as functions F = F(x M , {x£}) of these variables. Then one can show that if F, G have spacelike separated
558
J . DlMOCK
supports with respect to the above metric then [ $ ( F ) , $ ( G ) ] = 0. One says t h a t the field is local with respect to the string light cone. This is a limitation on how fast the various modes can grow. 2. Interacting string field theory does not exist, although there are candidates. Is there any chance t h a t such a theory also has a locality property? For some speculation in this direction see [5].
Acknowledgments This research was supported by N S F Grant PHY0070905.
References 1. 2. 3. 4. 5.
J. Dimock, "Locality in free string field theory", J. Math. Phys. 4 1 , 40-61 (2000). J. Dimock, "Locality in free string field theory II", Annales Henri Poincare 3, 613-634 (2002). E. Martinec, "The light cone in string theory", Class. Quant. Grav. 10, L187-L192 (1993). D. Lowe, "Causal properties of free string field theory", Phys. Lett. B 326, 223-230 (1994). D. Lowe, L. Susskind, J. Uglum, "Information spreading in interacting string field theory", Phys. Lett. B 327, 226-233 (1994). 6. H. Hata, H. Oda, "Causality in covariant string field theory", Phys. Lett. B 394, 307-314 (1997). 7. I. B. Frenkel, H. Garland, G. J. Zuckerman, "Semi-infinite cohomology and string theory", Proc. Nat. Acad. Sci. 83, 8442-8446 (1986).
Energy inequalities in quantum field theory CHRISTOPHER J. FEWSTER (U. York) Quantum fields are known to violate all the pointwise energy conditions of classical general relativity. We review the subject of quantum energy inequalities: lower bounds satisfied by weighted averages of the stress-energy tensor, which may be regarded as the vestiges of the classical energy conditions after quantisation. Contact is also made with thermodynamics and related issues in quantum mechanics, where such inequalities find analogues in sharp Garding inequalities.
1. Introduction: energy conditions in general relativity In classical relativity, the energy-momentum current density seen by an observer with fourvelocity vb is defined to be I P = Tabvb, where Tab is the stress-energy tensor of surrounding matter. a The requirement that I P should be timelike and future-directed is known as the dominant e n e r g y condition (DEC) and is a natural expression of the fundamental relativistic principle that no influence may propagate faster than light. This interpretation is borne out by the fact that a conserved stress-energy tensor which obeys the DEC will vanish on the domain of dependence of any closed achronal set on which it vanishes (see section 4.3 in [1]), so the DEC prohibits acausal propagation of stress-energy. The DEC may, equivalently, be formulated as the requirement that Tabuavb > 0
(1)
for all timelike, future-directed ua, vb; it also contains (as the special case ua = vb) the weak energy condition (WEC), the assertion that all timelike observers measure positive energy density. By continuity, this implies the null energy condition (NEC), namely that Tabkakb > 0 for all null ka. The classical energy conditions are satisfied by most classical matter models and have several important consequences. Matter obeying the NEC tends to focus null geodesic congruences, a fact which plays a key role in the singularity theorems [1], and the WEC (respectively, DEC) is a sufficient condition for the positivity of the ADM (respectively, Bondi) mass [2, 3]. However, quantum fields have long been known to violate all such pointwise energy conditions [4] and, in many models, the energy density is in fact unbounded from below on the class of physically reasonable states. Moreover, the existence of negative energy densities draws indirect experimental support from the Casimir effect [5]. In this contribution we review these phenomena and the extent to which quantum fields satisfy weaker energy conditions, which may be called quantum energy inequalities (QEIs). We also describe connections between such inequalities and thermodynamics! stability, and some wider parallels in quantum mechanics. Finally, the physical picture of energy condition violation which emerges from these results is briefly discussed. a
Our metric signature is -I
; units with h = c = 1 will also be adopted.
559
560
CHRISTOPHER J. FEWSTER
2. The existence of negative energy densities in quantum field theory In 1965, Epstein, Glaser and Jaffe proved that the energy density in any Wightman field theory necessarily admits negative expectation values (unless it is trivial) [4]. Here, we give an elementary argument for this conclusion, the basis of which goes back at least to [6], and which applies quite generally. Consider a theory specified by a Hilbert space H, a dense domain D C Ti and a distinguished vector Q G H, which we call the vacuum. In this context, a field is an operator valued distribution on spacetime with the property that T{f)D C D for all test functions / . In addition, we assume only that T enjoys the Reeh-Schlieder property that no T(f) can annihilate the vacuum (for nontrivial / ) and, for simplicity, that T(f) has vanishing vacuum expectation values, which corresponds to adopting the vacuum as the zero of energy. This is what one would expect of the energy density in Minkowski space; one may easily adapt the argument to cope with nonvanishing vacuum expectation values. With these assumptions in place, let / be any nonnegative test function and define (for a £ E) V>a = cosafl + sina
T(f)Cl fl
.
(2)
Then an elementary calculation yields (il>a\T(f)i/,a) = C s i n 2 a + r ? ( l - c o s 2 a ) ,
(3)
where
By minimising over a, we therefore find M(rl>\T(f)1>)
+p ,
(5)
which is negative. Of course, this argument has very little to do with quantum field theory and almost nothing to do with energy density per se: the key ingredient is the linear structure of Hilbert space, and similar arguments also apply in quantum mechanics. We may pursue this line of reasoning a little further if we may assume that the vacuum admits a nontrivial scaling limit for T with positive canonical dimension (see section VII.3.2 of [7]). In this instance one may choose a sequence /„ of nonnegative test functions tending to a <5-function so that £n —> oo, while r]n/(n tends to a finite limit. It then follows from equation (5) that the expectation value of T at a point (if it exists) is unbounded from below as the state varies in D.
3. Quantum energy inequalities Although one cannot expect reasonable quantum field theories to satisfy any of the pointwise classical energy conditions, one may still hope that there would be some vestige of these conditions in quantum field theory: after all, they ought to emerge from the quantum field theory in the classical limit. This leads to the conjecture that smeared energy densities
Energy inequalities in quantum field theory
561
might satisfy state-independent bounds, which become progressively weaker as the support of the smearing function shrinks, and tighter as it grows. Bounds of this type, known as Quantum Weak Energy Inequalities1* (QWEIs) were first proved by Ford [8] who was initially guided by thermodynamic considerations (see section 4). The original bound actually concerned the energy flux, but was soon adapted to the energy density of the scalar and electromagnetic fields in Minkowski space [11,12]. In these bounds, the energy density is averaged along an inertial trajectory against a Lorentzian weight; for example, the massless scalar field in four-dimensions was shown to obey
r(T00(t,x)U
3_
/ for a large class of states tp. The parameter r sets the timescale over which the average is taken; as hoped, we find that the bound is tighter as r increases (leading to a proof of the averaged weak energy condition (AWEC) in the limit r —> oo). The fact that the bound diverges as r —» 0 is consistent with the unboundedness from below of the energy density at a point. Equation (6) is of course reminiscent of the time-energy uncertainty relation (although this is not an ingredient of the proof). Bounds of this type were generalised to ultrastatic spacetimes by Pfenning and Ford [15], for averages along static trajectories with the Lorentzian weight. In curved spacetimes (or even in compact flat spacetimes) it is of course possible to have a constant negative renormalised energy density, which could not satisfy a bound of the form above. The quantity appearing in the results of [15] is, instead, the difference between the renormalised energy density in state i/7 and that taken in the vacuum, which we might refer to as the normal ordered energy density. Thus these 'difference' QWEIs bound the extent to which the energy density can drop below the vacuum expectation value. A different approach to QWEIs was developed by Flanagan [13,14] for massless scalar fields in two dimensions. The resulting bound is not only valid for a large class of averaging weights, but is also sharp. Yet another approach was initiated in work with Eveson [16] for averages along inertial trajectories in Minkowski space of dimension d > 2 using a large class of weight functions. For example, a scalar field of mass m > 0 obeys J (Too)^(t, x) g{tfdt
> - J L j J°° du |s(u)| 2 u 4 Q 3 (u/ro)
(7)
in four dimensions, where Q3 : [1, 00) —> M+ is defined by Q 3 ( , ) = ( l - ^ )
1 / 2
( l - ^ ) - ^ l n ( x
+
V ^ T )
(8)
and obeys 0 < Qs(x) < 1 with Qa(x) —> 1 as x —> 00. In contrast to Flanagan's bound, equation (7) is not sharp, and differs from it by a factor of 3/2 in the d = 2, m = 0 case. Generalisations to static spacetimes [17], electromagnetism [18] and, on a slightly different tack, quantum optics [10] are known. b
The original terminology was simply "quantum inequality" (QI); the more specific term QWEI was introduced later [9], as there turn out to be many other situations in which similar bounds appear (see, e.g., section 5 and [10]).
562
CHRISTOPHER J. FEWSTER
The following general QEI is based on ref. [19] and essentially places the argument of [16] in a much more general setting. Consider a real, minimally coupled scalar field $ of mass m > 0 propagating on a globally hyperbolic spacetime (M,g). Each Hadamaxd state co of the quantum field determines a two-point function u>2{x,v) = (*(x)$(y))u
(9)
which, in particular, satisfies the following properties: — w2(F, F)>0 for all test functions F € V{M). — w2(F, G) - u2(G, F) = iA(F, G) for all F, G € V{M), where A is the advanced-minusretarded fundamental solution to the Klein-Gordon equation. The important point is that the right-hand side is state-independent. — The wave-front set [20] of LJ2 is constrained by WF(u>2) C N+ x Af~, where TV* is the bundle of null covectors on M directed to the future (+) or past (—). This is the microlocal spectrum condition, which encodes the Hadamard condition [21]. All Hadamard two-point functions are equal, modulo smooth terms. Given a second Hadamard state a / ° \ which we adopt as a reference state, the normal ordered two-point function ••u2:(x,y) = w2{x,y)-ujf\x,y)
(10)
is therefore smooth and symmetric and obeys :<J2:(F,F)>-40)(F,F).
(11)
The diagonal values :cj2'-(x,x) define the Wick square (:$2:)ul(x). Now let g be a smooth, real-valued function, compactly supported in a single coordinate patch of (M,g), and define an averaged Wick square by
A(g,w) := J'(:$2:)u>(x)g(x)2 .
(12)
Then, splitting the points in the definition of : $ 2 : by the introduction of a 5-function M9,u)
= / dvol(x) dvol(y) :uj2:(x,y) g(x) g(y) Sg(x,y),
(13)
where 6g is the 8-function on (M,g). Passing to the coordinate chart containing the support of g, and writing the (^-function as a Fourier integral, we find A(g,u;) = J - ^
Jd4xdiy:oJ2:(x,y)g(x)g(y)(p(x)P(y))l/2e-ik^-y\
where, in these coordinates, p(x) = |detg a b (x)| 1 / / 2 .
(14)
Energy inequalities in quantum field theory
563
Exploiting the symmetry of :w2:, the fc-integral may be restricted to the half-space with fco > 0 at the expense of a factor of 2. We then have d4k -rTZ-.u2:{gk,gk) Jk0>o (27r) f
A(g,u)=2 - ~
2
w 0) / 75^4 2 (5fc^fc) 27r Jk >o ( ) /feo>0 0
^-IP"-'' 4 1 '
(l5)
where gk(x) = eik-xg(x)/p(xy/2 and F(x,y) = g(x)g(y) {p{x)p{y))1f2uj{°\x,y). We may now invoke the microlocal spectrum condition and Prop. 8.1.3 in [20] to show that the righthand side of the inequality is finite because the Fourier transform of F decays rapidly in the integration region. (We are using a nonstandard convention for the Fourier transform in which f(k) = Jdx f(x)eik'x.) To convert this into a general quantum energy inequality, suppose fab is a tensor field for which, classically, N
T«fc/a6 = £ W )
2
,
(16)
j=i
where Pj are partial differential operators with smooth, real, compactly supported coefficients. Then exactly the same argument yields a (finite) lower bound on
/
dvol(x)(:Tab:)Ux)fab(x)
simply by replacing w2 by X)j=i (Pj ® Pj)wi m t n e definition of F. Since the scalar field obeys the DEC and WEC precisely because the appropriately contracted stress tensor may be written in the 'sum of squares' form (16), our QEI has, as special cases, the quantum dominant/weak energy inequalities (QDEI/QWEIs). Several remarks are appropriate here. First, the bound depends on the coordinate system chosen, so one has the freedom to sharpen the bound by modifying the coordinates. Second, it is remarkable that the bound remains finite if the support of g (or fab) is shrunk to a timelike curve.c The same is not true for averaging along null curves or within a spacelike slice, where one may show explicitly that the averaged quantity is unbounded from below [22, 23]. Third, the argument can be generalised to spin-one fields [24]. Fourth, restricted to static worldlines in static spacetimes, with the reference state chosen to be a static ground state, we find Jdt{:Tabuaub:)u(^(t))9(t)2>-J
duQ{u)\g{u)\2,
(17)
where ua is the four-velocity of the static worldline 7, and Q is monotone increasing and polynomially bounded. d As a special case, we recover equation (7); bounds of the form c
Indeed, the version of this argument in [19] considered only the case of averaging along a smooth timelike curve. d If wo is time-translationally invariant, but not a ground state, then Q(u) has a tail in the negative half-line which decays rapidly as u —> — oo.
564
CHRISTOPHER J. FEWSTER
equation (17) have also appeared in other contexts (see section 5). Finally, a different approach to scalar field QEIs, which also employs microlocal techniques, can be found in [25]. One of the key ideas underlying the argument just given was the positivity of the classical expression Tabfab. The situation is rather different in the case of a Dirac field, for which the classical [i.e., 'first quantised'] energy density is, like the Hamiltonian, unbounded from both above and below. Positivity of the total energy emerges for the first time after renormalisation. For some time, this frustrated attempts to obtain a QWEI for spin-| fields. The first success was due to Vollick [26], who adapted Flanagan's proof [13] to treat massless Dirac fields in two dimensions. Subsequently, Verch and the present author used microlocal techniques to establish the existence of Dirac and Majorana QWEIs in general four-dimensional globally hyperbolic spacetimes [9]. However, the first explicit QWEI bound for Dirac fields in four dimensions has only been obtained very recently [27]. This bound is also of the form (17). Finally, we should note that there are quantum field theories which do not satisfy QEIs. The simplest (and rather unphysical) example consists of an infinite number of fields with the same mass. More serious, perhaps, is the fact that the nonminimally coupled scalar field violates the energy conditions even at the classical level and is not expected to obey QEIs. In this regard, it is worth noting that the theory of Einstein gravity with a nonminimally coupled scalar field is mathematically equivalent6 (in the so-called 'Einstein frame') to the theory of a minimally coupled field plus gravity (see ref. [28] for a review). In the Einstein frame, of course, QEIs do hold. It is possible that one may require a full theory of quantum gravity to assess the significance of the failure of QWEIs in the usual 'Jordan frame'. Olum and Graham [29] have also argued that interacting quantum fields can violate worldline QWEIs. They consider two coupled scalar fields, one of which is in a domain wall configuration; away from the wall, the second field experiences a static negative energy density (as often occurs near mirrors). This suggests strongly that the existence of QEIs for worldline averages is a special feature of the free field. However, it is still plausible that QEIs exist for spacetime averages of the stress-energy tensor. Consider a family of smearings whose spacetime 'support radius' is determined by a parameter A. In the situation just described, sampling over longer timescales (say, by increasing A) would also involve sampling over larger spatial scales, eventually meeting the (large) positive energy in the domain wall. It is certainly conceivable that the averaged energy density could still satisfy a lower bound which tends to zero as A —» oo and diverges as 0(A~ 4 ) as A —> 0 + .
4. Connections with thermodynamics Quantum inequalities originate from a 1978 paper of Ford entitled "Quantum coherence effects and the second law of thermodynamics" [6]. Ford argued that unconstrained negative energy fluxes (e.g., a superposition of right-moving modes with a left-directed flux) could be used to violate the second law of thermodynamics, by directing such a beam at a hot body to lower both its temperature and entropy. However, macroscopic violations of the second Equivalence holds provided the scalar field does not take Planckian values, a regime in which the nonminimally coupled theory is, in any case, unstable.
Energy inequalities in quantum field theory
565
law cannot occur if the magnitude F and duration r of the negative energy density flux are constrained by \F\ < T ~ 2 because the absorbed energy would be less than the uncertainty of the energy of the body on the relevant timescale. This prompted Ford to seek mechanisms within quantum field theory which would limit negative energy fluxes and densities, and led ultimately to quantum inequalities of the type described in section 3. Recently, in work with Verch [30], a new twist has been added to the connection between quantum inequalities and thermodynamics: it turns out that there is a rigorous converse to Ford's original argument. We consider quantum systems in static spacetimes of the form R x E where the spatial section S is a compact Riemannian manifold. The algebra of observables, 21 is assumed to be a C*-algebra on which the time translations t1—> t + r are assumed to induce a strongly continuous one-parameter family of automorphisms aT, so that (21, ar) is a C*-dynamical system. We also assume that the system is endowed with an energy density p(t, x) whose spatial integral over any hypersurface {t} x E generates the time evolution in the sense that f dvo\s(x)e([p(t,x),A})
JT,
=
l
"r
(18)
-~^aT{A)) r= 0
for sufficiently large classes of observables A £ 21 and continuous linear functionals I e 21*. (Precise definitions are given in [30].) One may now investigate the consequences of assuming that p(t, x) satisfies various QWEI conditions, patterned on those obeyed by quantum fields. In particular, a state w of the system is said to obey a static quantum weak energy inequality with respect to a class of states <S if, for each real-valued g € C Q ° ( R ) there is a locally integrable non-negative function E s m q(g; x) such that /
dt g{t)2 [(p(t, x))v - (p(t, x))u] > -q(g; x)
(19)
for all ip € <S and x € E. The state u> is said to be quiescent if, in addition, each x has an open neighbourhood U such that A / dvol^x)
q{gx\ x) —> 0
as A -> 0+ ,
(20)
where g\(t) = g(Xt). (One may regard this as a spatially averaged version of a difference AWEC.) On the assumption that S is a sufficiently rich class of states, we proved, inter alia, the following result. Theorem 4.1. If a state ui £ S obeys a static QWEI then the C*-dynamical system admits a passive state. Moreover, if u is quiescent then it is passive. We recall that the defining property of a passive state of a C*-dynamical system is the impossibility of extracting net work from a system initially in such a state by a cyclical perturbation of the dynamics [31]. In this sense, the passivity criterion is an expression of the second law of thermodynamics; the force of the above result is that thermodynamic stability may be viewed as a consequence of QWEIs. The abstract results of ref. [30] are complemented by a detailed study of the free scalar field in static spacetimes with compact spatial sections. This does not immediately fit into our framework as the Weyl algebra describing the field theory is not a C*-dynamical system.
566
CHRISTOPHER J. FEWSTER
However, one may construct an auxiliary C*-dynamical system to which the structural assumptions do apply. (Such complications would be absent for the Dirac field.) These results lead to an interesting situation. As we have seen, QWEIs are consequences of the microlocal spectrum condition, while passivity is a consequence of QWEIs. Earlier work by Sahlmann and Verch [32] established that states of the scalar field obeying a certain passivity condition necessarily obey the microlocal spectrum condition. Thus the three conditions of passivity, QWEIs and the microlocal spectrum condition are mutually interconnected. And this is significant because these conditions may be interpreted as a stability conditions operating at different scales: microscopic [microlocal spectrum condition], mesoscopic [QWEIs] and macroscopic [passivity].
5. Quantum inequalities in quantum mechanics A nice analogy to quantum energy inequalities may be found in the context of Weyl quantisation. In this procedure, a classical observable (i.e., a function on phase space) F : M2n —> M is represented in quantum mechanics by the operator Fw on L2(Wn) with action
whose expectation values may be expressed in terms of the classical symbol F(x,p) by fdnxdnpF(x,p)W^(x,p),
{Fv,h=
(22)
where W^(x,p) is the Wigner function corresponding to ip: Wi,(x,p)=2
(dnye-iv<x-yVhi>{x
+ y)i>{x-y).
(23)
As is well known, the Wigner function need not be everywhere positive, so it is clear that the positivity of F in no way entails the positivity of Fw. This mirrors the situation with energy density: even fields which obey the energy conditions classically will violate them in quantum field theory. Given sufficient regularity of the classical symbol F, however, the quantised observable Fw satisfies a sharp Garding inequality [33] of the form (Fw)4, > -C{h),
W> € C0°°(]R"),
(24)
which, from our current standpoint, is precisely a quantum inequality. One may also investigate the specific example of energy densities in quantum mechanics. As in quantum field theory, the energy density at a point is unbounded from below, but time averages obey quantum inequalities of a form similar to equation (17) [34].
6. Physical interpretation Quantum energy inequalities demonstrate clearly that large negative energy densities and fluxes are associated with high frequencies (or short length-scales, as in the Casimir effect): averaging is required to obtain semibounded expectation values and it is crucial that
567
Energy inequalities in quantum field theory
the averaging function should decay sufficiently rapidly in the frequency domain in order that bounds of the form (17) are finite. Further insights have been provided by Ford and Roman [35], who discuss positive and negative energy densities in terms of the financial metaphor of credit and debt. Consider, for example, an energy density taking the form p{t) = A5(t) + A(l + e)S(t - T)
(25)
along some inertial worldline/ Here, one can interpret A as the magnitude of 'debt' incurred, T as the term of the 'loan' and e as the 'interest rate' due on repayment. Clearly a necessary condition for this to be the energy density of, say, a massless scalar field in two dimensions, is that it should satisfy
jdtp{t)g{t?>-±jdt\g'{t)\\
(26)
for all real-valued g e C Q ° ( R ) , which is Flanagan's QWEI [13]. Constraints on T and e may be obtained in terms of A by substituting particular test functions g [35]. Sharper bounds are yielded [36] by rephrasing equation (26) as the condition that the differential operator HP = -^5+fap{t) dt2
(27)
should be a positive quadratic form on Co°(K). In the example given, it turns out that T<JL 67T.4
and
e>-^f-. - 1 - 6TTAT
(28) V
;
The two striking features are, firstly, that there is a maximum loan term and, secondly, that the interest rate is always positive and diverges as the maximum loan term is approached. Thus quantum fields act so as to restore net energy density positivity locally (rather than globally); negative energy densities are obtained only at the expense of a nearby positive energy density of greater magnitude. For further results in this direction see [37,38]. One interesting consequence of the fleeting nature of negative energy densities is that it will be hard to observe them directly. Heifer [39] has argued, on the basis of various thought experiments, that quantum fields satisfy 'operational energy conditions': that is, the energy of any measurement device capable of resolving transient negative energy densities will necessarily be large enough that the net local energy density will be positive. Finally, we mention two important applications of quantum energy inequalities. First, they have been used to place constraints on various "designer spacetimes" including warp drive models [40] and traversable wormholes [41]. Second, as already mentioned, Marecki has adapted quantum inequality arguments to bound fluctuations of the electric field strength in quantum optics [10]. It is a tantalising prospect that these results may have direct relevance to experiments in the near future.
Acknowledgments Financial assistance under EPSRC grant GR/R25019/01 is gratefully acknowledged. f
This is to be regarded as a toy model for more realistic smooth energy distributions.
568
CHRISTOPHER J. F E W S T E R
References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30.
31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41.
S. W. Hawking, G. F. R. Ellis, The large scale structure of space-time, CUP, Cambridge, 1973. E. Witten, Coram. Math. Phys. 80, 381 (1981). M. Ludvigsen, J. A. G. Vickers, J. Phys. A: Math. Gen. 15, L67 (1982). H. Epstein, V. Glaser, A. Jaffe, Nuovo Cimento 36, 1016 (1965). H. B. G. Casimir, Proc. Ron. Ned. Akad. Wet. 51, 793 (1948). L. H. Ford, Proc. Roy. Soc. Lond. A 364, 227 (1978). R. Haag, Local quantum physics: Fields, particles, algebras, Springer Verlag, Berlin, 1992. L. H. Ford, Phys. Rev. D 43, 3972 (1980). C. J. Fewster, R. Verch, Coram. Math. Phys. 225, 331 (2002). P. Marecki, Phys. Rev. A 66, 053801 (2002). L. H. Ford, T. A. Roman, Phys. Rev. D 51, 4277 (1995). L. H. Ford, T. A. Roman, Phys. Rev. D 55, 2082 (1997). E. E. Flanagan, Phys. Rev. D 56, 4922 (1997). E. E. Flanagan, Phys. Rev. D 66, 104007 (2002). M. J. Pfenning, L. H. Ford, Phys. Rev. D 57, 3489 (1998). C. J. Fewster, S. P. Eveson, Phys. Rev. D 58, 084010 (1998). C. J. Fewster, E. Teo, Phys. Rev D 59, 104016 (1999). M. J. Pfenning, Phys. Rev. D 65, 024009 (2002). C. J. Fewster, Class. Quantum Grav. 17, 1897 (2000). L. Hormander, The analysis of linear partial differential operators I, Springer Verlag, Berlin, 1983. M. J. Radzikowski, Comm. Math. Phys. 179, 529 (1996). C. J. Fewster, T. A. Roman, Phys. Rev. D 67, 044003 (2003). L. H. Ford, A. Heifer, T. A. Roman, Phys. Rev. D 66, 124012 (2002). C. J. Fewster, M. J. Pfenning, "A Quantum Weak Energy Inequality for spin-one fields in curved spacetime", arXiv:gr-qc/0303106; to appear in J. Math. Phys. (2003). A. D. Heifer, "The Hamiltonians of linear quantum fields: II. Classically positive Hamiltonians", arXiv:hep-th/9908012. D. N. Vollick, Phys. Rev. D 6 1 , 084022 (2000). C. J. Fewster, B. Mistry, "Quantum Weak Energy Inequalities for the Dirac field in flat spacetime", arXiv:gr-qc/0307098. V. Faraoni, E. Gunzig, P. Nardone, Fund. Cosmic Phys. 20, 121 (1999). K. D. Olum, N. Graham, Phys. Lett. B 554, 175 (2003). C. J. Fewster, R. Verch, "Stability of quantum systems at three scales: Passivity, Quantum Weak Energy Inequalities and the Microlocal Spectrum Condition", arXiv:math-ph/0203010; to appear in Comm. Math. Phys. (2003). W. Pusz, S. L. Woronowicz, Comm. Math. Phys. 58, 273 (1978). H. Sahlmann, R. Verch, Comm. Math. Phys. 214, 705 (2000). C. Fefferman, D. H. Phong, Comm. Pure Appl. Math. 34, (1981) 285. S. P. Eveson, C. J. Fewster, R. Verch, in preparation. L. H. Ford, T. A. Roman, Phys. Rev. D 60, 104018 (1999). C. J. Fewster, E. Teo, Phys. Rev. D 6 1 , 084012 (2000). F. Pretorius, Phys. Rev. D 6 1 , 064005 (2000). E. Teo, K. F. Wong, Phys. Rev. D 66, 064007 (2002). A. D. Heifer, Class. Quantum Grav. 15, 1169 (1998). M. J. Pfenning, L. H. Ford, Class. Quantum Grav. 14, 1743 (1997). L. H. Ford, T. A. Roman, Phys. Rev. D 53, 5496 (1996).
Loop quantum gravity THOMAS THIEMANN
(Perimeter Inst. Theor. Phys. and U. Waterloo)
Loop Quantum Gravity (LQG) is an attempt to construct a four-dimensional, non-perturbative, background independent Quantum Field Theory of Lorentzian General Relativity including all known matter without assuming any experimentally unverified extra structure. We review the present status of Loop Quantum Gravity, from the mathematical foundations to active research.
1. Introduction It is well known that Quantum General Relativity and its supersymmetric generalization is a non-renormalizable theory [1], From this fact one may draw two conclusions: 1. Either General Relativity is only an effective theory, to be replaced by a more fundamental one or 2. perturbative renormalizability [2] is simply not a valid selection principle for the admissability of General Relativity as a fundamental theory. The first point of view is followed by string theory [3], the second one by Loop Quantum Gravity (LQG) [4]. Both points of view have proved useful in the history of physics: On the one hand, Fermi's theory of the weak interaction was replaced by the electroweak model which unified electromagnetism and the weak force and indeed string theory claims to be a unified theory of all interactions. On the other hand, ether theory was replaced by special relativity by abandoning Newton's principle of absolute time and space and replacing it by the relativity principle. Which principle is it then that in Loop Quantum Gravity (LQG) replaces the principle of perturbative renormalizability? It is the principle of Background Independence (BI). In fact, this principle is not new but is already an ingredient of the classical theory and arguably the most important lesson that we learnt from Einstein's theory: Nature does not distinguish, a priori, any particular background metric. One often says that Einstein's theory arises from general covariance, that is, no system of coordinates (or observer) is a priori distinguished but this is misleading in the sense that every equation of motion can be made generally covariant by writing it down in one system of coordinates and then transforming both sides of the equations into any other system of coordinates. BI means more than that: It means that the metric is a dynamical entity and cannot be prescribed by hand. It is here where BI clashes with perturbative quantum gravity schemes: In those schemes, including perturbative string theory, one expands the theory in terms of the deviation h^ — 9\iv—^\iv, usually called the graviton, where g is the full metric and 77 is a background metric around which one expands. It is obvious from this that perturbative schemes break BI right from the beginning and the question is whether this is acceptable or not. The following considerations motivate to seriously reconsider the validity of applying the technically and conceptually simple perturbative scheme to General Relativity:
569
570
THOMAS THIEMANN
1. Violation of Basic Principle One must feel at least worried about not taking a principle seriously that actually defines the most important new ingredient of a theory. 2. Violation of Local Gauge Invariance One should not forget that General Relativity coupled to all known matter is a diffeomorphism invariant theory. Like the group of local gauge transformations in Yang-Mills theory, the diffeomorphism group Diff (M) for a spacetime manifold M is a local gauge group. It is clear that the decomposition g = T) + h is not preserved under Diff(M) with r\ fixed, hence perturbative schemes break the local gauge invariance of General Relativity. Breaking of local gauge invariance in Yang-Mills theory leads to an inconsistent theory, hence one may wonder whether the perturbative non-renormalizability of General Relativity has its roots in this. 3. Limitations of the Perturbative Scheme Quantum Field Theory is more than computing scattering amplitudes, there is a whole realm of the theory which is inaccessible by perturbative methods, usually called non-perturbative effects, for the same reason that one cannot use Cartesian coordinates globally on a sphere. This limitation is especially serious in Quantum Gravity where one expects the most interesting deviations from the classical theory when gravity is strong, e.g. in black hole and cosmological situations. Perturbative schemes must break down in those regimes where the metric g is wildly fluctuating so that the deviation h = g — r) is huge and where it is no longer possible to speak about a background metric around which the deviation occurs. This is even true for string theory as currently defined. Researchers working in LQG are worried at least about this list of issues and therefore chose BI as a guiding principle for the construction of a quantum field theory of general relativity. One may object that such an approach seems to have little chance to unify geometry and matter in one theory or even to predict the matter content of the world or the dimensionality of spacetime such as string theory does. However, one should not forget that LQG is a young theory in which such consistency issues simply may have not shown up yet (if there are any). Moreover, one should remind oneself that string theory can be called a successful theory only when it is has shown to be compatible with experiment. For example, so far it has not been possible to show that any compactification of any of the five perturbative string theories yields a matter content corresponding to the standard model with supersymmetry breaking at sufficiently high (currently unobservable) energies. Also, it has not been possible to show that string theory is perturbatively finite beyond two loops [5] so that the often claimed fmiteness of the theory is actually not established. Finally, one may wonder whether six or seven unobserved extra dimensions and an infinite tower of unobserved Kaluza-Klein particles to make the perturbative scheme work are not too high a price to pay: Is there maybe a simpler, non-perturbative theory in four dimensions which starts with the currently observed matter content? To be sure, string theory has led to impressive mathematical results, but before it proves to be a successful physical theory of quantum geometry and matter, alternative ansatze must be pursued and LQG is just one of them. Once one accepts the BI principle, the question is whether it is compatible with the prin-
Loop quantum gravity
571
ciples of Quantum Theory (QT). Looking at the axiomatic definition of ordinary quantum field theory [6] it becomes obvious that joining BI and QT is a rather radical step: Looking for instance at the Haag-Kastler axioms for Minkowski space (and their microlocal generalizations for arbitrary background spacetimes) it is clear that the framework of ordinary QFT simply evaporates once one does not have access to a background metric. For instance, the causality axiom asks that field operators smeared over causally disconnected spacetimes regions (anti)commute. In a BI theory, such an axiom does not make any sense: If the lightcone structure itself is subject to quantization then we cannot tell whether field operators are supported on their causal complements, hence the causality axiom can have an at most semiclassical meaning in states with respect to which the gravitational fluctuations are small. The challenge therefore consists in harmonically joining the principles of BI and QT while recovering the well-established framework of matter QFT on background spacetimes (M, rj) (BST) in a regime where the gravitational field has low fluctuations around rj. What is needed is a generalization of QFT on a BST to QFT on a differential manifold M (or some even more radical generalization, e.g. giving up a fixed differential structure, topology, . . . ) . Loop Quantum Gravity (LQG) is a particular proposal for the construction of such a theory, no more and no less. More precisely, Loop Quantum Gravity (LQG) is an attempt to construct a four-dimensional, non-perturbative, background independent Quantum Field Theory of Lorentzian General Relativity including all known matter without assuming any experimentally unverified extra structure. In the remainder of this contribution we will try to sketch the status of this programme. Readers interested in more details are referred to the reviews [4,7] and references therein. For simplicity we will only consider the geometry part of the full Einstein-matter action in Euclidean signature, Lorentzian signature and matter can be treated by similar methods and we refer the interested reader to the literature. We will be rather sloppy with references for the sake of brevity of this article, the proper references can be found in [4,7].
2. Classical preliminaries It turns out that the canonical approach towards quantizing gravity is ideally suited in order to incorporate both the BI and QT principle. The classical canonical formulation has been derived already in the 60s by Arnowitt, Deser and Misner (ADM) [8] but real progress was only made with a reformulation due to Ashtekar [9] in the mid 80s. Theorem 2.1. Let M = R x a be a 4D differential manifold. Then canonical GR can be formulated as a constrained SU(2) gauge field theory: {Ai(x),Akb(y)}
= {E«(x),Ebk(y)}
{E;(x),A°b{y)}
= K5Z6]5{x,y)
=0 symplectic structure Gauss constraint
Cj = daE? + ejkiAaEt b
Ca = Tr(FabE ) C = Tr(Fab[Ea,Eb}) Here x,y,...
(1)
spatial diffeomorphism constraint 1
I det^)!" ^
Hamiltonian constraint
denote points in a, a,b,c,...... = 1,2,3 denote tensorial indices for a, j , k,l =
572
THOMAS THIEMANN
1,2,3 denote su(2) indices and K is proportional to Newton's constant. The relation between the SU(2) connection A and its conjugate electric field E on the one hand and the familiar ADM variables is given by Qab = e-^e^Sjk
spatial metric U
\
Kab = [A3, — r^ ]ej\Jjfc
)
extrinsic curvature
where all fields are pulled back to a under an arbitrary foliation (one parameter family of embeddings) Xt : a —> £ t ; x — i > Xt(x) = X(t,x). The Hamiltonian constraint for the Lorentzian theory has an additional term as compared to its Euclidean version displayed in (1) but otherwise there is no difference, so we will content ourselves with the Euclidean version for the purposes of this article. Motivated by global hyperbolicity (in Lorentzian signature) we have assumed M to be topologically R x a where a is a 3D differential manifold of arbitrary topology. The arbitrariness of the foliation expresses the invariance of GR under Diff (M) and leads to the spatial diffeomorphism and Hamiltonian constraint respectively. In particular, there is no true Hamiltonian because (local) time translations are equivalent to diffeomorphisms and therefore considered as gauge transformations. The Hamiltonian flow of the Hamiltonian constraint generates those gauge transformations. In particular, we can compute the transformations of the constraints among each other: Theorem 2.2. Consider the smeared constraints C{N) = f ofxNaCa,
C(N) = f afxNC.
(3)
The hypersurface deformation algebra 9) is given by {d(N),d(N')}
=
{C(N),C(N')}
= KC(C$N'),
{C(N),C(N')}
Kd{CffN'), l
(4)
= KC(q- [N'dN
-
NdN']).
Here C denotes the Lie derivative. We observe is that the constraint algebra is consistent (or first class in Dirac's terminology), that is, the gauge flow does not leave the constraint surface C{x) = Ca{x) = 0 Vx € a. However, the hypersurface deformation algebra is no Lie algebra due to the phase space dependent structure "constant" on the right hand side of the third line of (4). This implies that its representation theory is much more complicated than for Lie (Super) Algebras. One can show that for any function / on the infinite dimensional phase space (M, {.,.}) defined by the canonical pair (^4, E) we have on shell (that is, when the vacuum equations R^ = 0 hold) s
f},Nf
••= liCW
+ C(N),/}
= £x+NJ
= £Tf,
(5)
where n denotes the future oriented unit normal to the hypersurfaces E t and T = d/dt denotes the foliation vector field. This shows that we have an explicit representation of Diff (M) as symplectomorphisms on the phase space. In particular, C(N) and C(N) respectively generate diffeomorphisms tangential and normal to the foliation respectively.
Loop quantum gravity
573
2.1. Quantization strategy As we just showed, GR can be formulated as an infinite dimensional, constrained, dynamical system. No reference has been made to a background metric. The canonical quantization of such constrained systems also does not need any background metric and was introduced first by Dirac in the 40s. This quantization is now known by the name of refined algebraic quantization (RAQ) and was further developed by several authors. Its essential steps can be sketched as follows: 1. Select a suitable sub-*algebra V C C°°(M) of the Poisson algebra as the algebra of elementary kinematical observables. Here kinematical means that these observables are to separate the points of the unconstrained phase space M. The choice of V is critical for the success of the complete quantization programme and is usually guided by demanding to have a sufficiently rich representation theory thereof. If possible one will try to work with bounded functions which will usually turn into bounded operators, thus avoiding domain questions. 2. Define the corresponding abstract, kinematical "-algebra 21 by asking that Poisson brackets turn into commutators divided by iH. Then study the representation theory of 21. 3. Select a suitable kinematical, irreducible representation 7TKin of 21, for instance a cyclic representation induced by a GNS state o>Kin- Here suitable means that the representation space 7^Kin should support the constraint operators as self-adjoint operators, that is C(N), C(N) for the case of GR. A guiding principle for a BI theory is (spatial) diffeomorphism invariance (rather than Poincare invariance as in background dependent theories). 4. Let (Nf,Ni) be an orthonormal basis of Li{p, d3x)i denned by smooth functions of rapid decrease. Then consider the direct integral decomposition (if Wi
(6)
corresponding to the operator
»=-!
£(tff)2 + £<5W)2
(7)
In order to define (7) as a positive operator on Witim usually delicate BI regularization procedures have to be invoked. Then one defines the physical Hilbert space to be the induced Hilbert space Wphys = W®in(0). The physical observables are the self-adjoint operators on 7iphys5. Verify that Wphys is large enough, i.e., admits a sufficient number of semiclassical states and gauge invariant (Dirac) Observables. These are those physical observables which do have a well-defined classical limit as functions which have vanishing Poisson brackets with the constraints, that is, {0,C(N)} = {0,C(N)} = 0 for GR. These steps provide a guideline, but no algorithm for the canonical quantization of a given theory.
574
THOMAS THIEMANN
3. Quantum kinematics In this section we will carry out the first three steps of the quantization programme. The choice of V consists of the following kinematical variables A(e) — V expl / A I
( f
magnetic holonomy/string variable (8)
\
Eftt(S) — expl t / T r ( / * E) I electric flux/membrane variable where e is a curve, S a surface, / an su(2)-valued scalar test field, t a real parameter and *E is the background metric independent two-form dual of the vector density E. This choice is motivated by 1. Hamiltonian lattice gauge theory [10] (holonomies and electric fields transform simply under gauge transformations) and 2. spatial diffeomorphism covariance: We have a realization of the spatial diffeomorphism group Diff (<x) by automorphisms on the corresponding abstract algebra 21 a : Diff(a) - • Aut(2l) av(A(e))
(9)
= A(
=
E/(>v-itt{
The algebraic structure in 21 is generated by the non-Abelian Weyl-algebra relations [A(e)}* = [A(e-i)f, [A(e),A(e')} = 0,
EU(S)*
= Ef,_t(S), 1
Ef,t(S)A(e)Ef,t(S)-
(10) = exp(ihCXEft(s))
• A(e),
which basically say that both A(e),Efit are unitary operators and where Xf denotes the Hamiltonian vector field of / £ C°°(M) (we have abused notation by denoting operators and functions by the same symbol). Strictly speaking both functions in (8) are not in C°°{M), however, the smearing dimensions add up to dim(cr) = 3 and this is sufficient to make these commutators well-defined. The representation theory of 21 is now simplified by the following result [11]. Theorem 3.1. There exists a unique cyclic, spatially diffeomorphism invariant, weakly continuous (with respect to 11—• Eftt(S)) representation TTKin : 21 -> B(WKin) ,
(11)
where B denotes the bounded operators on a Hilbert space. There are many equivalent ways to define this representation, here is the shortest one [12]: Denote by A the space of smooth SU(2) connections on a. Consider the Abelian, unital C*-algebra Cyl generated by holonomies and completed in the sup-norm. A dense set in Cyl will consist of functions which depend on a finite number of holonomies only, that is, they depend on the connection supported on a finite graph 7 (the edges must be piecewise analytic for this to hold). For a graph 7 we define p 7 :-4-+[SU(2)]l £ WI,
A^{A(e)}eeEh),
(12)
where E(j) denotes the set of edges of 7 and |.E(7)| its cardinality. A cylindrical function will therefore be typically of the form F = p*F^ where F 7 : [SU(2)]I£(7)I -* C is a complex valued function.
575
Loop quantum gravity Theorem 3.2. i) UKin(F) := f y[su(2)]i«(-»>i
n
dfiH(he)
Fy({he}eeEM)
(13)
e e ^* 7 )
defines a constantly defined, positive, linear, spatially diffeomorphism invariant functional (iJKm °Ctv= WRin V3 € Diff (a)) OU Cyl. ii) Let S^Kin = [1] be the cyclic GNS vector corresponding to WKin- Then irKin(F)[F'}:=[FF'},
^.JEf^^^lE^FEf^S)-1},
(14)
defines a *-representation ofQl. One can extend u>Kin hi the obvious way to all of 21 and as a subalgebra of B(HKin) it becomes a C*-algebra in the uniform topology. The GNS Hilbert space Wxin can be characterized more explicitly: Theorem 3.3. Let A Gel'fand spectrum of Cyl = C(A). TiKm = L2{A,dnKin),
Then
^Kin=1>
(15)
where the probability measure //Kin is the a-additive extension of the cylindrical measure defined in (13). In summary, the representation 7TKin is the unique kinematical starting point when insisting that 21 be the algebra of elementary kinematical observables in terms of which all other operators have to be denned. We will now study to which extent this is indeed the case.
4. Quantum dynamics In this section we will complete part of the fourth step of the quantization programme. We will subdivide the discussion into two parts: The first part was already completed in [13], the second part is only partly completed and we will restrict ourselves to a rough description of the results obtained so far [14]. 4.1. I. Gauss and spatial diffeomorphism constraint Theorem 4.1. i) Let (tp,g) G Diff(
F,({A(
[U(g)F}(A):=F^{g(b(e))A(e)g(f(e))-%eEh)) defines the natural, unitary representation thereof on Wxin • ii) Since A is a compact Hausdorff space and /iKin is a probability measure thereon, the volume of the gauge group /XKin(Fun(a, SU(2))) is finite. Thus restriction to gauge invariant functions in Cyl solves Cj = 0 .
576
THOMAS THIEMANN
Hi) For any F € Cyl the Rigging
Map
mm : Cyl -» Cyl*;
F ^
mm{F)
:=
£
(F', .) Ki „,
F'€Orb(F)
Orb(F) := {U{
(18)
As expected, the solutions to the spatial diffeomorphism constraint are labelled by (generalized, since intersecting) knot invariants. Notice in particular that there are no anomalies among the spatial diffeomorphism operators which is in contrast to the representations of the diffeomorphism algebra (Virasoro algebra) of Diff(51) encountered in string theory which necessarily has a non-vanishing central charge and requires string theory to live in extra dimensions for consistency reasons. There is no sign to that effect in LQG and provides the first pay-off for having chosen a BI formulation. 4.2. Hamiltonian constraint, physical scalar product, Dirac observables The Hamiltonian constraint, sometimes called Wheeler-DeWitt constraint, displayed in (1), has been the toughest technical problem in the canonical programme since its derivation in the 60s. As is obvious from (1), since it involves non-polynomial, local functions of operator valued distributions, the UV divergences of this operator are expected to be much worse than for ordinary matter QFTs such as canonical Yang Mills theories. The surprise discovered in [14] is that this is not the case! The reason for this is ultimately again the fact that we were careful to use a BI quantization scheme and provides the second pay-off. Notice that we are talking here about non-perturbative finiteness rather than the much weaker notion of perturbative finiteness (there is no series to be summed). Before we go into more details, let us give a heuristic explanation for how that happens: Consider the massless Klein-Gordon-Hamiltonian on Minkowski space
HKG = \ I
d3x[7t2 + Sa%acf>ib}.
(19)
Quantizing it on Fock space leads to a divergent correction to the normal ordered piece HKG-
•• HKG :=h
d3x [y/-AxS(x,
y)]y=x ,
(20)
which has both an UV source (multiplying operator valued distributions at the same point) and an IR source (infinite number of harmonic oscillator ground state energies). The reason for the UV divergence is that the momentum •n conjugate to ^ is a scalar density of weight one as far as its transformation under Diff (M3) is concerned. This implies that the the piece 7T2 of the integrand of (19) has net density weight two. Hence, in any proper quantization
Loop quantum gravity
577
scheme the operator corresponding to 7r2 must remember its density weight of two and the only coordinate density of weight two at x is given by 5(x, u) 5(x, v) (or derivatives thereof) which is meaningless if u —» v, that is, when taking the point split regulator to zero. Let us now compare this with the gravity coupled situation (Einstein-Klein-Gordon theory): Now (19) is a term in addition to (1) in the full Hamiltonian constraint HE-KG(N)
= \ I d3xN
Vdet(
+ qa >, a
(21)
Notice that in (21) the background metric 6ab of (19) has been replaced by the dynamical metric qab as is required by a BI scheme. This has two effects: 1. a Fock space quantization is no longer available because a Fock space crucially depends on a background metric (e.g. through the notion of positive energy) and 2. the whole integrand of (21) now has net density weight one! In particular, the piece Tr2/^det(q) has density weight one. Now what happens when quantizing in BI representations for matter fields [14] is that the UV divergence disappears because, roughly speaking, 6(x,u)6(x,v) is replaced by 8(x,u) 5(x, v)/5(x,w) which now reduces to the well-defined distribution 6(x,w) when u —> v —> w in the limit of vanishing regulator. Moreover, the IR divergence disappears because the BI representations such as the ones that we displayed in section 3 automatically force us to work with states that are excited in finite volume only (and superpositions thereof). This implies that there is a finite vacuum energy operator (rather than a constant) [15]. Hence we can discuss, in a well-defined way the contribution of each matter type to the cosmological constant A (whose value is an expectation value and therefore actually state dependent), which may ultimately impose restrictions on the matter content of the world because A is indeed measurable nowadays! Let us now go into more details. The result of the previous section 4.1 was that both the Gauss and spatial diffeomorphism constraint can be explicitly solved and that we have at our disposal the spatially diffeomorphism invariant Hilbert space WDiff- We would now like to take this Hilbert space as our kinematical Hilbert space and solve the Hamiltonian constraint there along the lines sketched in section 2.1. However, there are two obstacles to that strategy which can be read off directly from (4): 1. Since the spatial diffeomorphism constraints generate a subalgebra of the hypersurface deformation algebra Sj but not an ideal, the Hamiltonian constraint must not preserve T^Dis- Hence C{N) cannot be defined as an operator on WDiff • Thus one is forced to define C(N) directly on Wi
578
THOMAS THIEMANN
It follows that the infinitesimal self-adjoint generator C(N) does not exist on TiKinBut then we are in trouble because in order to close the classical hypersurface deformation algebra (4) the infinitesimal generator C(N) would be needed when computmg [C(N),C(N')}. One may summarize this discussion as follows: The representation theory of the algebras 21 and f) are incompatible. Hence, there are two ways out: Either we change our starting point 21 or we change Sj. In [14] the latter strategy was adopted as follows: Definition 4.1. The Master Constraint corresponding to the Hamiltonian constraints C(x), x £ a is defined by M :=
I / tfx ^L 2./,
.
y/deT(q}(x)
(22) V
'
Advantages: a. The infinite number of constraints C(x) — 0, Vx e a, is replaced by the single Master Equation M = 0. b. The infinite number of conditions on weak Dirac Observables {O, C(X)}M=O = 0 Va; G a is replaced by the single Master Condition {O, {O, M}}M=O = 0. c. The constraint algebra simplifies tremendously: {C(N), M} = {M, M} = 0, hence the subalgebra formed by the spatial diffeomorphism constraint now defines an ideal so that M has a chance to preserve TCuis and moreover the infinitesimal generators of spatial diffeomorphisms are no longer needed. In summary, the representation theoretic problems associated with 21, Sj just mentioned are solved, provided we manage to define the Master Constraint Operator M. Theorem 4.2. i) M can be densely defined as a positive quadratic form QM on Horn- Thus, if QM is closable, then it defines a unique positive self-adjoint operator M. ii) In contrast to WKin> the Hilbert space Hum is separable. Thus, if M exists, the physical Hilbert space is given by Wphys := ^Diff(^) w^ere HuiE=
I
d/x(A)Wgiff(A)
(23)
JR+
denotes direct the integral decomposition o/Woiff subordinate to M. Hi) If M exists, define the 1-parameter unitary groups U{t) := exp(rtM).
(24)
Let O £ B(Hr>is) be a bounded s.a. operator. Under suitable regularity conditions the Ergodic Projection [0}:=lim^-[dtU(t)OU(trl
(25)
leads to a strong Dirac observable, [[0],M] = 0, and [O] induces a self-adjoint, bounded operator Ophys on Wphys-
Loop quantum gravity
579
Hence, once we have shown that QM is closable (which has not been done yet), we have shown that the fourth step of the quantization programme can be carried out without further obstacles on the way. The same holds with any matter coupling. One would then still be left with verifying that the theory constructed has GR as its classical limit.
5. Further results We conclude this article by listing further results. Some of these projects are robust and complete, others are active research. Our presentation will be very brief and we must refer the reader to the literature cited. 5.1. Planck scale discreteness Let Xs : X^iS) Functional
C E 2 —> a be a two-surface embedding and consider the associated Area Av[S):= f
d2yJdet(lX*sq}(y)).
(26)
Theorem 5.1. i) The operator Ar[S] exists as a positive, self-adjoint operator on Wxinii) Its spectrum Spec(Ar[S']) is pure point (discrete), its main series being given by 1
H{Je}eeE(-y)) = Pp ^2vOeO'e + 1),
O
j e = -,1,-,.--
(spin quantum numbers), (27)
where £P = HK is the Planck area. Hi) The spectrum has an area gap: Xmm = ^£P. iv) The spectrum Spec(Ar[S]) is sensitive to the topology of S. Remarks: — This is an astonishing and unexpected result, since the operator Ar[5] certainly is illdefined on Fock space. This is yet one more pay-off of incorporating the BI principle. — The spectrum is obtained on T^Kin and one can extend these results to WDiff- It is not clear whether they extend to Hphys but in case they do, one would conclude that LQG predicts a distributional, non-smooth Planck scale geometry! — The area operator was first quantized in [16]. Similar results hold for volume [17] and length operators [18]. 5.2. Black hole entropy Theorem 5.2. i) The classical Hamiltonian framework can be adapted to quasi-stationary (isolated horizon H) black hole boundary conditions, ii) Due to these boundary conditions, the area functional Ar[.ff] of the horizon itself is a full Dirac observable, hence the kinematical spectrum coincides with the physical one.
580
THOMAS THIEMANN
Hi) The quantum statistical entropy S = In (TV), where N is the number of states with area eigenvalue in the interval [A — £"p, A + £j^], is finite due to area gap and gives the celebrated Bekenstein-Hawking result S = A/{A(?P) for all non-rotating black holes such as the SS-RN family, black holes with dilatonic and Yang-Mills hair, etc. Notice that in contrast to string theory no SUSY and no extremal charge is necessary. iv) The entropy accounts approximately for one bit of information per puncture of a graph 7 with the Horizon H (Boolean d.o.f.). The rotating case is classically under complete control and its quantization is currently being considered. There are many more results on isolated horizons, see e.g. [19]. Notice that in contrast to the heuristic black hole spectrum due to Bekenstein [20] which is evenly spaced, the LQG spectrum reaches a continuum very fast for sufficiently large graphs and spin quantum numbers as is obvious from (27) which will presumably be important in order to get the correct black body spectrum of the Hawking radiation. 5.3. Path integral formulation A path integral formulation of quantum gravity is notoriously difficult for three reasons: 1. There is no Hamiltonian, just a Hamiltonian constraint and hence the path integral does not have the usual interpretation as a transition amplitude (or heat kernel) but rather as a projector on physical states. 2. Usually one starts with the Euclidean action in order to construct a measure. However the Euclidean action for GR is unbounded from above and below, hence even the regularized path integral is divergent. 3. Usually the Lorentzian and Euclidean theory are connected by a Wick rotation with respect to the Minkowski metric. In a theory where one integrates over all possible metrics, such a connection is absent, hence it is completely unclear what the Euclidean path integral means. Nevertheless there has been significant progress recently in terms of spin foam and state sum models directly for the Lorentzian theory. In particular, it was possible to demonstrate finiteness at least for the regularized path integral for a particular choice of measure whose physical significance is, however, not yet fully understood. We advise the reader to consult the excellent reviews [21] for further information. 5.4. Semiclassical limit and gravitons As became clear during the discussion of section 4.2, the most important unresolved problem is the proof, that the theory has GR as its classical limit. In order to show this, a strategy would be to construct semiclassical or, more specifically, coherent states with respect to which constraint operators have the correct expectation value and small fluctuations. In order to check whether M has the correct limit one must construct semiclassical states in WDiff- It would not make sense to construct semiclassical states within Wphys because M is exactly zero on such states by construction. As a starting point towards this aim semiclassical states for LQG have been defined at the level of Wnin [22] and the next step is to generalize the heat kernel methods employed there to HD\B-
Loop quantum gravity
581
Related to this is the explicit construction of graviton states with the BI methods of LQG at least for linearized gravity [23]. 5.5. Quantum cosmology and observation The BI quantization methods of LQG also have been successfully applied to minisuperspace truncations of the full theory, in particular to FRW models [24]. This is an important test for the full theory and has led to surprising results such as avoidance of the classical big bang singularity and the next step is to check whether these results extend to the full theory. More generally, LQG has recently seeked contact with observation and phenomenology [15,25], in particular one is interested in Poincare symmetry violating dispersion relations for photons which are plausible in LQG if the Planck scale structure of the theory is discrete as the area operator spectrum suggests.
6. Conclusions We hope to have convinced the reader that Loop Quantum Gravity is a lively, mathematically rigorous attempt to define a QFT for GR which has passed many non-trivial tests already. The attitude is that the theory a) either is mathematically consistent and is confirmed by experiments which test physics beyond the standard model or b) that the opposite happens and that one learns precisely which new mathematical and conceptual structures are needed rather than having to guess them. Hence, either one constructs a theory which can be viewed as fundamental or one finds out that the next "quantum leap" is necessary, similar to the passage from the Bohr-Sommerfeld heuristic description of the atom to the Heisenberg-Schrddinger formulation of quantum theory.
Acknowledgments We would like to thank the organizers of ICMP 2003, especially Prof. Detlev Buchholz and Prof. Jean-Bernard Zuber, for the invitation to the congress and for the opportunity to give this presentation about Loop Quantum Gravity.
References 1. S. Deser, "Two outcomes of two old (super)problems", arXiv:hep-th/9906178; "Infinities in quantum gravities", Annalen Phys. 9, 299 (2000) [arXiv.gr-qc/9911073]; "Nonrenormalizability of (last hope) D = 11 supergravity with a terse survey of divergences in quantum gravities", arXiv:hep-th/9905017. 2. S. Weinberg, The Quantum Theory of Fields, vol. 1-3, Cambridge University Press, Cambridge, 1995. 3. J. Polchinski, String Theory, vol. 1+2, Cambridge University Press, Cambridge, 1998. 4. T. Thiemann, "Introduction to modern quantum general relativity", arXiv:gr-qc/0110034, to appear in Living Reviews; "Introduction to loop quantum gravity", arXiv:gr-qc/0210094. 5. E. D'Hoker, D. H. Phong, "Lectures on two-loop superstrings", arXiv:hep-th/0211111; L. Smolin, "How far are we from a quantum theory of gravity?", arXiv:hep—th/0303185 . 6. R. F. Streater, A. S. Wightman, "PCT, spin and statistics, and all that", Benjamin, New York, 1964; R. Haag, "Local quantum physics", 2nd ed., Springer Verlag, Berlin, 1996.
582
THOMAS THIEMANN
7. C. Rovelli, "Loop quantum gravity", Living Rev. Rel. 1, 1 (1998); arXiv:gr-qc/9710008; "Strings, loops and others : a critical survey of the present approaches to quantum gravity", arXiv: gr-qc/9803024 ; "The century of incomplete revolution: searching for general relativistic quantum field theory", J. Math. Phys. 4 1 , 3776 (2000) [arXiv:hep-th/9910131]; M. Gaul, C. Rovelli, "Loop quantum gravity and the meaning of diffeomorphism invariance", Led. Notes Phys. 541, 277 (2000) [arXiv:gr-qc/9910079]. 8. R. M. Wald, General Relativity, The University of Chicago Press, Chicago, 1989, appendix E. 9. A. Ashtekar, Phys. Rev. Lett. 57, 2244 (1986); Phys. Rev. D, 36 1587 (1987). 10. R. Gambini, A. Trias, Phys. Rev. D 22, 1380 (1980); Nucl. Phys. B 278, 436 (1986); T. Jacobson, L. Smolin, "Nonperturbative quantum geometries", Nucl. Phys. B 299, 295 (1988); C. Rovelli, L. Smolin, "Loop space representation of quantum general relativity", Nucl. Phys. B 331, 80 (1990). 11. H. Sahlmann, T. Thiemann, "On the superselection theory of the Weyl algebra for diffeomorphism invariant quantum gauge theories", arXiv:gr-qc/0302090; "Irreducibility of the Ashtekar-Isham-Lewandowski representation", arXiv:gr-qc/0303074; A. Okolow, J. Lewandowski, "Diffeomorphism invariant representations of the holonomy flux *algebra", arXiv:gr-qc/0302059. 12. A. Ashtekar, C. J. Isham, "Representations of the holonomy algebras of gravity and nonAbelean gauge theories", Class. Quantum Grav. 9, 1433 (1992) [arXiv:hep-th/9202053]; A. Ashtekar, J. Lewandowski, "Representation theory of analytic Holonomy C* algebras", in Knots and Quantum Gravity, J. Baez (ed.), Oxford University Press, Oxford 1994; "Projective techniques and functional integration for gauge theories", J. Math. Phys. 36, 2170 (1995) [arXiv:gr-qc/9411046]; D. Marolf, J. M. Mourao, "On the support of the AshtekarLewandowski measure", Comm. Math. Phys. 170, 583 (1995). 13. A. Ashtekar, J. Lewandowski, D. Marolf, J. Mourao, T. Thiemann, "Quantization for diffeomorphism invariant theories of connections with local degrees of freedom", Journ. Math. Phys. 36, 6456 (1995) [arXiv:gr-qc/9504018]. 14. T. Thiemann, "Anomaly-free formulation of non-perturbative, four-dimensional Lorentzian quantum gravity", Physics Letters B 380, 257 (1996) [arXiv:gr-qc/9606088]; T. Thiemann, "Quantum spin dynamics (QSD)", Class. Quantum Grav. 15, 839 (1998) [arXiv: gr-qc/9606089]; "II. The kernel of the Wheeler-DeWitt constraint operator", Class. Quantum Grav. 15, 875 (1998) [arXiv:gr-qc/9606090]; "III. Quantum constraint algebra and physical scalar product in quantum general relativity", Class. Quantum Grav. 15, 1207 (1998) [arXiv:gr-qc/9705017]; "IV. 2+1 Euclidean quantum gravity as a model to test 3+1 Lorentzian quantum gravity", Class. Quantum Grav. 15, 1249 (1998) [arXiv:gr-qc/9705018]; "V. Quantum gravity as the natural regulator of the Hamiltonian constraint of matter quantum field theories", Class. Quantum Grav. 15, 1281 (1998) [arXiv:gr-qc/9705019]; "VI. Quantum Poincare algebra and a quantum positivity of energy theorem for canonical quantum gravity", Class. Quantum Grav. 15, 1463 (1998) [arXiv:gr-qc/9705020]; "Kinematical Hilbert spaces for fermionic and Higgs quantum field theories", Class. Quantum Grav. 15, 1487 (1998) [arXiv:gr-qc/9705021 ]. T. Thiemann, "The Phoenix Project: master constraint programme for loop quantum gravity", arXiv:gr-qc/0305080. 15. H. Sahlmann, T. Thiemann, "Towards the QFT on curved space-time limit of QGR. 1. A general scheme", arXiv:gr-qc/0207030 ; "2. A concrete implementation", arXiv:gr-qc/0207031. 16. C. Rovelli, L. Smolin, "Discreteness of volume and area in quantum gravity", Nucl. Phys. B 442, 593 (1995); Erratum: Nucl. Phys. B 456, 734 (1995); A. Ashtekar, J. Lewandowski, "Quantum theory of geometry I: Area operators", Class. Quantum Grav. 14, A55 (1997). 17. C. Rovelli, L. Smolin, "Discreteness of volume and area in quantum gravity", Nucl. Phys. B 442, 593 (1995); Erratum: Nucl. Phys. B 456, 734 (1995); A. Ashtekar, J. Lewandowski, "Quantum theory of geometry II: volume operators", Adv. Theo. Math. Phys. 1, 388 (1997). 18. T. Thiemann, "A length operator for canonical quantum gravity", Journ. Math. Phys. 39, 3372 (1998) [arXiv:gr-qc/9606092].
Loop quantum gravity
583
19. A. Ashtekar, C. Beetle, O. Dreyer, S. Fairhurst, B. Krishnan, J. Lewandowski, J. Wisniewski, "Isolated horizons and their applications", Phys. Rev. Lett. 85, 3564 (2000) [arXiv:gr-qc/ 0006006]; A. Ashtekar, "Classical and quantum physics of isolated horizons", Led. Notes Phys. 541, 50 (2000); "Interface of general relativity, quantum physics and statistical mechanics: some recent developments", Annalen Phys. 9, 178 (2000) [arXiv:gr-qc/9910101]. 20. J. D. Bekenstein, "Black holes and entropy", Phys. Rev. D 7, 2333 (1973). 21. J. C. Baez, "An introduction to spin foam models of quantum gravity and BF theory", Lect. Notes Phys. 543, 25 (2000) [arXiv:gr-qc/9905087]; A. Perez, "Spin foam models for quantum gravity", Class. Quantum Grav. 20, R43 (2003); arXiv:gr-qc/0301113. 22. T. Thiemann, "Gauge field theory coherent states (GCS): I. General properties", Class. Quantum Grav. 18, 2025 (2001) [arXiv:hep-th/0005233]; "Complexifier coherent states for canonical quantum general relativity", arXiv:gr-qc/0206037; T. Thiemann, O. Winkler, "Gauge field theory coherent states (GCS): II. Peakedness properties", Class. Quantum Grav. 18, 2561 (2001) [arXiv:hep-th/0005237]; "III. Ehrenfest theorems", Class. Quantum Grav. 18, 4629 (2001) [arXiv:hep-th/0005234]; "IV. Infinite tensor product and thermodynamic limit", Class. Quantum Grav. 18, 4997 (2001) [arXiv:hep-th/0005235]; H. Sahlmann, T. Thiemann, O. Winkler, "Coherent states for canonical quantum general relativity and the infinite tensor product extension", Nucl. Phys. B 606, 401 (2001) [arXiv:gr-qc/0102038]. 23. M. Varadarajan, "Fock representations from U(l) holonomy algebras", Phys. Rev. D 61,104001 (2000) [arXiv:gr-qc/0001050]; "Photons from quantized electric flux representations", Phys. Rev. D 64, 104003 (2001) [arXiv:gr-qc/0104051]; "Gravitons from a loop representation of linearized gravity", Phys. Rev. D 66, 024017 (2002) [arXiv:gr-qc/0204067]; J. Velhinho, "Invariance properties of induced Fock measures for U(l) holonomies", Comm. Math. Phys. 227, 541 (2002) [arXiv:math-ph/0107002]. 24. M. Bojowald, "Absence of Singularity in loop quantum cosmology", Phys. Rev. Lett. 86, 5227 (2001) [arXiv:gr-qc/0102069]; "Dynamical initial conditions in quantum cosmology", Phys. Rev. Lett. 87, 121301 (2001) [arXiv:gr-qc/0104072]. 25. R. Gambini, J. Pullin, "Nonstandard optics from quantum spacetime", Phys. Rev. D 59, 124021 (1999) [arXiv:gr-qc/9809038]; R. Gambini, J. Pullin, "Quantum gravity experimental physics?", Gen. Rel. Grav. 3 1 , 1631 (1999); J. Alfaro, H. A. Morales-Tecotl, L. F. Urrutia, "Quantum gravity corrections to neutrino propagation", Phys. Rev. Lett. 84, 2318 (2000) [arXiv:gr-qc/9909079]; "Loop quantum gravity and light propagation", Phys. Rev. D 65, 103509 (2002) [arXiv:hep-th/010806l].
This page is intentionally left blank
Quantum mechanics and spectral theory Session organized by C.
GERARD
(Orsay) and R.
WEDER
(Mexico)
This page is intentionally left blank
Long-time dynamics and localization lengths for the 3-d Anderson model at weak disorders THOMAS CHEN
(Courant
Institute)
We report on recent work [1] concerning lower bounds on the localization length of eigenfunctions in the three-dimensional Anderson model at weak disorders, that uses an extension of methods developed by L. Erdos and H.-T. Yau. Our results are similar to those obtained by C. Shubin, W. Schlag and T. Wolff [8] for dimensions one and two. Furthermore, we show that the macroscopic limit of the corresponding lattice random Schrodinger dynamics is governed by the linear Boltzmann equations.
1. Introduction Let AL = Z 3 /(Z,Z) 3 denote a of large box of side length I e N . We consider the discrete random Schrodinger operator
acting on ip S £2(AL), periodically continued over Z 3 , with impurity potential given by Vuix) = J2yeAL Uy8{x - y)- The coupling constant A > 0 is the small parameter of the theory, and LOV are bounded, independent, identically distributed random variables, with uniformly bounded moments, cf. Theorem 2.1 below. Hu is then a self-adjoint linear operator on £2(AL) for every realization of Vu. By techniques of harmonic analysis, C. Shubin, W. Schlag and T. Wolff recently established for the Anderson model at small disorders in dimensions d = 1,2 that as L —> oo, most eigenstates are, with probability one, in frequency space concentrated on shells of thickness < 0{\2) in d = 1, and < 0{\2~5) in d — 2 [8]. The eigenenergies are required to be bounded away from the edges of the spectrum of — |A Z d, and in d = 2, also away from its center. By the uncertainty principle, this implies the corresponding lower bounds on the localization lengths in position space. Closely related to their work and [1] are the papers [5,6] by Magnen, Poirot, Rivasseau, and [7] by Poirot. In [1], we prove that in the case d = 3, the localization lengths of most eigenfunctions of Hu are expected to be bounded from below by 0(X~2/\ log A|), without any restrictions on the eigenenergies. For the proof, we use an extension of the time-dependent methods of L. Erdos and H.-T. Yau in [2,3], developed for the derivation of the linear Boltzmann equations from a continuum random Schrodinger equation in dimensions d = 2 and 3. For macroscopic time and position coordinates (T, X) = \2{t,x), they established for all T > 0 that the dynamics in the macroscopic and small disorder limit A —> 0 is determined by the linear Boltzmann equations, and thus ballistic. The corresponding result for sufficiently small values of T was first proved by H. Spohn [9]. At larger time scales, the behaviour in the macroscopic limit is diffusive [4]. The link between the lower bound on the localization length of eigenfunctions and the Schrodinger dynamics generated by Hu, expressed in Lemma 2.1
587
588
THOMAS CHEN
below, is a joint result with L. Erdos and H.-T. Yau included in [1]. Furthermore, we prove that the macroscopic limit of the quantum dynamics in this system is governed by the linear Boltzmann equations. To this end, we extend the methods of [3] to the lattice case, and to random potentials with a non-Gaussian distribution. The additional contributions due to higher correlations turn out to have an insignificant effect, hence the present results do not differ qualitatively from the Gaussian case.
2. Localization lengths Let {ipa}aEA denote an orthonormal i/^-eigenbasis in £2(Ai), with eigenvalues ea € R, and \A\ — |Ai|. Let Box^(x) denote the translate of the cube Z 3 /(^Z) 3 that is centered at x, for 1
for £ > 0. For e small, {ipa \ a € -4£,<5,f} contains the class of exponentially localized states concentrated in balls of radius ~ 5£/log£ or smaller, where 5 is independent of (.. This observation and Lemma 2.1 below are both joint results with L. Erdos and H.-T. Yau. The following main theorem states that most eigenstates are expected to have localization lengths larger than 0(A~ 2 /|logA|). Theorem 2.1. Assume the moment conditions E[w^fe+1] = 0, E[o;^fc] =: C2k < cw < oo, C2 = 1, for all x € AL, k > 0, and cw independent of k. Then, for A 14 / 15 < S < 1, e(S) = 53/7 and L > A" 10 , E
l-4\4:(S),S,A -2 l
\A\
> 1 - ce{5f
with A > 0 sufficiently small, and a constant c < oo that is independent of L, A and 6. This theorem is a corollary of Lemmata 2.1 and 2.2 below. Lemma 2.1 links the dynamics generated by H^ to lower bounds on the localization lengths. Lemma 2.1. Let {ipa} denote an orthonormal H^-eigenbasis in some t > 0 and e > 0 small, E
Rx,s,ie-ttau6s
e2(AL)
> 1-e
£2(AL).
Suppose that for
(1)
is satisfied for all x S AL- Then, \A\Ae,s,t\
\A\
> 1 - 2e1'2
follows. To prove this result, we represent 6X on the left hand side of (1) in the basis {V'o}, and separate the contributions stemming from ASt s,e and its complement by a Schwarz inequality.
Long-time dynamics and localization lengths for the 3-d Anderson model . . .
589
Averaging over A/, (where |AL| = \A\), together with some standard manipulations then provides the result. Lemma 2.2 provides the condition (1). Lemma 2.2. Let t = e(5)2\~2,
0 < 5 < 1, and e(S) = 6$. Then, for H0 := - ± A ,
Rxs^e~itHo6x
>l-ce(S),
(2)
while for L > A - 1 0 , and A sufficiently small, E
R
x
^ (e-itH
|2
- e-itHo)6x\lHAJ
< c'e(5)2 + H
,
(3)
where c, d < oo are independent of L, x, A and 5. While (2) follows from a standard stationary phase estimate for the free time evolution in frequency space, the proof of (3) in [1] is based on an extension of methods in [3] to the lattice system and non-Gaussian distributed random potentials. The strategy comprises the following steps. Step 1. The small parameters are A and t 1 = 0(X2). We expand <j>t = e %tH"bx into a truncated Duhamel series with remainder term
/
ds0 • • • dsn s( J2 Sj - t)e-isoHo
V„e-islH"
• • • Vue-is«H°6x ,
j=o
and RN,t = -i f ds e-i(t-s)H" 4>N,S. Jo The number TV remains to be determined. Evidently, the left hand side of (3) is bounded by2En=iE[||^,t||i]+2E[||JRw,t||J2]. Step 2. For every fixed n with 1 < n < N, we determine the expectation E[||>„)t||^2,A J explicitly by taking all possible contractions among random potentials. This produces 0(n\) terms containing only pairing contractions, and < 0(2n2n) terms containing higher order contractions. To estimate the individual integrals, we classify them according to their contraction structure, which we represent as Feynman graphs. To this end, we draw two parallel, horizontal solid "particle lines" accounting for 4>n,t and 4>n,y> respectively. On each particle line, away from its endpoints, we insert n vertices, corresponding to n copies of Vu. The n + 1 edges on each particle line thus obtained correspond to free particle propagators. The particle lines are joined together at, say, both left ends, to account for the ^2-inner product. Furthermore, we draw dotted "interaction lines" interconnecting those vertices which are mutually contracted. Letting TUin denote the set of all such graphs o n n + n vertices, we have
E[||<M&(AJ < E
7ern,„
lAmPWI,
590
THOMAS CHEN
where Amp(7) is the integral (Feynman amplitude) corresponding to the graph 7. Let r„^„ denote the subset of graphs in r n > n that comprise only pairing contractions among the random potentials. The a priori bound I Amp( 7 ) I < P ( n , t )
(4)
holds for all 7 G T ^ i r ) , with P(n,t) := (logt)3(ct\2logt)n. Due to the factorially large number of pairings, this bound is insufficient (n\P(n,t) is not summable), and it is thus necessary to perform a finer classification of graphs. The set r„^„ is subdivided into: — The ladder graph {ln}, where the j - t h vertex on the upper particle line is contracted with the j - t h vertex on the lower particle line, for j = 1 , . . . , n (enumerated along the same direction on both lines). — Simple pairings, which correspond to decorated ladders. On each particle line, between the rungs of the ladder, there are possibly progressions of immediate recollisions, that is, pairings between neighboring copies of Vw. By definition, simple pairings include {In}. — Crossing and nested graphs, accounting for all non-simple pairing graphs. We prove |Am
P
({U)l<^,
(5)
(n!)2
I Amp( 7 e I t f ) \ {*n})| < HP(n,t).
(6)
The bound (5) on the ladder graph {ln} is summable in n, and by (6) all other pairings yield integrals that are, due to strong phase cancellations, at least 0(t~i) smaller than the a priori bound (4) on pairing contractions. Furthermore, it is shown that J2 -vpr
|Amp( 7 )| < Q(n,t) := C(ri{n\)
+t-2(2n)in)P(n,t)
x r C( pp aa iirr)) i,n \\lr n,n
holds for the sum of all non-pairing graphs. Thus, JV
Y,
Y,
\^P(l)\
+ CNQ(N,t)
n=i-yer n , n
follows. Step 3. We estimate E[||i?jv,t||^2/A J by splitting the time integration into K intervals of equal size, and by exploiting the fact that the event that a large number of quantum collisions take place in a small time interval is rare. The result is E[Piv,t||,VL)] <
(N2K2
+ 1-)
CNQ{AN,t).
Long-time dynamics and localization lengths for the 3-d Anderson model ... Step 4.
591
For a choice m
=
£&!rt
and
»w~o°gt) A .
with suitable positive constants /3i,/32 that are independent oit, we have 1 < N(t), n(t) < t, and the asserted estimate (3) follows. In other words, we prove that the sum of all graphs containing crossing, nested, and non-pairing contractions, only contributes to a small error of order at most 0{t~~s). The sum of contributions from ladder diagrams for n > 1 is bounded by t\2, up to a multiplicative constant that is independent of A and t. In comparison to the continuum model with a Gaussian distributed random potential [2,3] there are two main difficulties in the case at hand. The geometry of level surfaces of the kinetic energy function in momentum space is more complicated in the lattice case. For estimates on integrals related to crossing diagrams, one must study intersections of isoenergy surfaces with their translates, which now requires more effort. Furthermore, due to the non-Gaussian distribution of the random potentials, non-pairing contractions in the expectation of ||0n,tll|2(A \ require more complicated combinatorial considerations.
3. Linear Boltzmann equations The derivation of the macroscopic limit for the corresponding quantum dynamics is technically very closely related to the problem considered above. In this part of the discussion, we let KL —» Z 3 . Let
(7)
>e^2(z3), for a fixed realization of the random potential. Then, Wj,t : Z 3 x T 3 —> C,
w*t (*. *) = £ M* + v)Mx - v) ^iyv,
(8)
defines its Wigner transform. We introduce macroscopic variables T := et, X := ex, V := v, and consider the rescaled Wigner transform W|t(X,V):=£-3^t(X/£,y)
(9)
with X € (eZ) 3 , and V € T 3 . Theorem 3.1. Let e = A2, and let
,
Then, for any T > 0, E[W$.
(X,V)]^FT(X,V),
(10)
592
THOMAS CHEN
for X e R 3 , V € T3, weakly as e —> 0, where FT(X, V) solves the linear Boltzmann equation 3
dTFT(X,V)
.
+ 2 y 51x12^-VXjFT(X,V)
= /
dUa(U,V)
FT(X,U)-FT(X,V)]
, (11)
with collision kernel
e{VJ),
and initial condition F0 given by WJ. - \h{X)\H(V
- VS(X))
=: F0(X, V),
(12)
weakly as e —> 0. This result is established by extracting the main terms from the expectation of the Wigner distribution, consisting exclusively of simple pairings, which converge weakly to a solution of the linear Boltzmann equations as e —» 0, in analogy to the case in [3]. To prove that the errors stemming from the remaining classes of graphs tend to zero as e —* 0, one essentially uses the ^-estimates derived above.
Acknowledgements I am profoundly grateful to L. Erdos, and in particular H.-T. Yau, for their support and generosity. The author is supported by a Courant Instructorship, and in part by a grant from the NYU Research Challenge Fund Program.
References 1. T. Chen, "Localization lengths and Boltzmann limit for the Anderson model at small disorders in d = 3", submitted (2003). 2. L. Erdos, "Linear Boltzmann equation as the scaling limit of the Schrodinger evolution coupled to a phonon bath", J. Stat. Phys. 107, 1043-1127 (2002). 3. L. Erdos, H.-T. Yau, "Linear Boltzmann equation as the weak coupling limit of a random Schrodinger equation", Comm. Pure Appl. Math., Vol. LIII, 667-753 (2000). 4. L. Erdos, H.-T. Yau, M. Salmhofer, in preparation. 5. J. Magnen, G. Poirot, V. Rivasseau, "Renormalization group methods and applications: First results for the weakly coupled Anderson model", Phys. A 263, 131-140 (1999). 6. J. Magnen, G. Poirot, V. Rivasseau, "Ward-type identities for the two-dimensional Anderson model at weak disorder", J. Statist. Phys. 93, 331-358 (1998). 7. G. Poirot, "Mean Green's function of the Anderson model at weak disorder with an infra-red cut-off", Ann. Inst. H. Poincare Phys. Theor. 70, 101-146 (1999). 8. C. Shubin, W. Schlag, T. Wolff, "Frequency concentration and localization lengths for the Anderson model at small disorders", to appear in Journal d'Analyse Math. 9. H. Spohn, "Derivation of the transport equation for electrons moving through random impurities", J. Statist. Phys. 17, 385-412 (1977).
Leaky quantum wire and dots: a resonance model PAVEL E X N E R
(Academy of Sciences, Prague),
SYLWIA KONDEJ
(U. Zielona Gora)
We discuss a model of a leaky quantum wire and a family of quantum dots described by Laplacian in L 2 (R 2 ) with an attractive singular perturbation supported by a line and a finite number of points. The discrete spectrum is shown to be nonempty, and furthermore, the resonance problem can be explicitly solved in this setting; by Birman-Schwinger method it is reformulated into a Friedrichs-type model.
1. Introduction In this talk we are going to discuss a simple model with the Hamiltonian which is a generalized Schrodinger operator in L 2 (R 2 ). The interaction is supposed to be supported by a line and a finite family of points, i.e., formally we have n
-A-aSix-^
+ ^&Six-yU),
(1)
i=i
where a > 0, E := {(a;i,0); x\ G R}, and j/W G R 2 \ E; coupling constants of the twodimensional <5 potentials will be specified below. First one has to say a few words about a motivation of this problem. Operators of the type (1) or similar have been studied recently as models of nanostructures which are "leaky" in the sense that they do not neglect quantum tunneling. While various results about the discrete spectrum were derived [2-7,9-15] much less is known about scattering in this setting, in particular, about resonances. The simple form of the interaction support, E U II with II :— {y^}, will allow us to answer this question for the operator (1). We will achieve that by using the generalized Birman-Schwinger method which makes it possible to convert the original PDE problem into a simpler equation which in the present situation is in part integral, in part algebraic. What is important is that the method works not only for the discrete spectrum but it can be used also to find singularities of the analytically continued resolvent. The problem then boils down to a finite rank perturbation of eigenvalues embedded in the continuous spectrum, i.e., something which calls to mind the celebrated Friedrichs model. To fit into the prescribed volume limit we present here the main results with sketches of the proofs leaving detailed arguments and extensions to a forthcoming paper [8].
2. The Hamiltonian and its resolvent A proper way to define (1) as a self-adjoint operator is through boundary conditions [1]. Consider functions / G W^(R2 \ (E UII)) n L2(R2) which are continuous on E. For small enough r* > 0 the restriction / fri of / to the circle {x G R 2 : \x — j/W| = n } makes then sense. We say that such an / belongs to D(Ha^) iff the following limits Ei(f) := - lim - i - / fri, ri-*o Inrj
n , ( / ) := lim [/ \ri + S 4 ( / ) I n n ] , ri-»o
593
594
PAVEL EXNER, SYLWIA KONDEJ
for i = 1 , . . . ,n, and 3E(/)(II)
:= J ^ ( * i . 0 + ) - ^ ( ^ i ' 0 - ) > n s ( / ) ( « i ) : = /(*i.O),
are finite and satisfy the relations 27rJSi3i(/) = n 4 ( / ) ,
S2(/)(si) = - a n E ( / ) ( i i ) ;
(2)
we denote /3 := (/?i,... ,/?„). Then we define Ha^ : D{Hatp) —> L 2 (R 2 ) acting as Ha,fif(x)
= -&f(x)
for
i£l2\(Eun),
and i/a,/? as its closure. Modifying the argument of [1] to the present situation one can check that i?Qj/3 is self-adjoint; an alternative way is to use the method of [16]. We identify it with the formal operator (1). Notice that the /Vs do not coincide with the formal coupling constants in (1), for instance, absence of the point interaction at j/W means /% = oo. The key element in spectral analysis of i?a,/3 is finding an expression for its resolvent. Given z £ C \ [0, oo) we denote by R(z) := (—A — z)^1 the free resolvent, which is an integral operator in L2 = L 2 (R 2 ) with the kernel Gz(x,x') = ^Ko(y/—z\x — x'\), where K0(-) is the Macdonald function and z *—> y/z has a cut on the positive halfline. We also denote by R(z) the unitary operator denned as R(z) but acting from L2 to W 2 , 2 = W 2 ' 2 (R 2 ). To express the resolvent of Hag we need two auxiliary Hilbert spaces, Ho '•= L2(R) and Hi := C", and the corresponding trace maps ro : W2'2 —• W0 and T\ : W2'2 —• Hi which act as TO/ : = / t s >
nf
•= f\n=
(f\{yW},---,f\{y^)}),
respectively; as before the used symbols mean appropriate restrictions. These maps in turn allow us to define canonical embeddings of R(.z) to Hi by Ri,i(*) = TiR(z) : L2^Hi,
R L ,,(z) = [R;,L(Z)]* -Hi^L2,
(3)
and Rj,i(z) = TjRLii(z)
: Hi -* Hj .
(4)
We introduce the operator-valued matrix T(z) = [IV,-(z)] : Ho © Hi —> Ho © Hi with the "blocks" Tij(z) : Hj -> Hi given by r»j(z)s = -Ri,j(z)g T00(z)f
= [a'
1
for i =£ j and g
- Ro i0 (z)] / ,
eHj,
if / G H0 ,
Tu(z)
for tp € Hi,
where sp(z) — (3 + s(z) := /3 + ^ ( l n ^ — V'(i)) and the operator in the last row is written explicitly through the components of the corresponding n x n matrix. We will see that p(Ha>a) coincides with the set of z for which T(z) has a bounded inverse. The latter is contained in C \ [—|a 2 ,oo), hence we can define the "reduced determinant" D(z) := r n ( z ) - Tioiz^ooiz^Toiiz)
: Hi -> Hi,
Leaky quantum wire and dots: a resonance model
595
by means of which the "blocks" of [r(z)] _ 1 : H0 @ Hi —> H0 © Hi express as [T{z)]-i = TOW
D{z)~\ T1o(z)-1T11(Z)D(z)-1Tw(z)T0o(z)-1,
=
[r(z)]-1 = - r o o W - 1 ^ (z)D(z)-1, [r(z)]-1 = -DizyirMTooiz)-1; we use the natural notation which distinguishes them from the inverses of Tij(z). Now we can state the sought resolvent formula. Theorem 2.1. For z € p(Hat@) with Imz > 0 the resolvent of Ha^
is given by
l
1
Ra,0(z) = (Ha,0 - zy = R(z) + J2 Ri.iW [r(^)]--1 % W •
(5)
i,j=0
Proof. For simplicity we will assume n = 1 only, i.e., II = {?/}; extension to the general case is easy. We have to check that / € D(Ha^) holds if and only if / = Ra^(z)g for some g 6 L 2 , where Ra^(z) denotes the operator at the right-hand side of the last equation. Suppose that / is of this form. It belongs obviously to Wj2'c2(M2 \ ( E U II)) n L2 because all its components belong to this set. Combining the definitions of TUj, [T(z)}~j1, and functionals S ( / ) = £ i ( / ) , fii(/) = ^ i ( / ) introduced above with the asymptotic behaviour of Macdonald function, Ko(^/^zp) = — 2\np — Airs(z) + O(p) as p -* 0, we arrive at
2^{f) = Yj[T{Z)]-l?ni,L{z)g, i=0 1
fi(/) = RhL(z)g
1
- J2 TMiriz^R^g
- s(z) £
\T(z)]^Ri,L(z)g.
i=0
»=0
Let us consider separately the components of £ ( / ) , 01(f) coming from the behaviour of g at the point y and on E, i.e., S*(/) := ^\^(z)]i^i,L9 and
n°(/) := [-rjoWtr^ioo1 - ^[rcoiro 1 ] fiH/) := [1 -Tw(z)[T(z)}^
RO,L,
-s(z)[T(z)]^}
RliLg;
using the properties of [^-(z)] and its inverse it is straightforward to check that Cll(f) = 27r/3Ht(/) holds for i = 0,1. Similar calculations yield the relation S s ( / ) = —aQ^(f) which means that / belongs to D(Haip), and the converse statement, namely that any function from D(Ha^) admits a representation of the form / = Raip(z)g. To conclude the proof, observe that for such a function / € D(Hatp) which vanishes on SULT we have (—A—z)f = g. Consequently, Ra^(z) = Ra>p(z) is the resolvent of the Laplace operator in L 2 (R 2 ) with the boundary conditions (2). • In a similar way one can compare Ra^(z) to the resolvent Ra(z) of the operator Ha with the point interactions absent using the operators R<* ; I,.L(2), Ra-,L,i(z) mapping between L2 and H\, and R a ;i,i(z) = r a ; n ( z ) on Hi defined in analogy with (3) and (4); the latter is Taiii(z)
~ Gi o ) (l/ ( f c ) .W ( 0 )(l-fci))
for
596
PAVEL EXNER, SYLWIA KONDEJ
where sfl(z) := /3-lim^o {p^(y(k),y(k) + 77) + ^ In\r)\) and G{"] is the integral kernel of the operator Ra{z). Using the standard Krein-formula argument mimicking [1] we find that the two resolvents differ by Tta.iLti(z)[Ta;n(z)]~1'Ra;i,L(z). This can be simplified 1 further: we have Ra(z) = R{z) + RLfiiz^ooizy RQ,L(Z) for z e p(Ha) = C \ [-\a2,oo), and taking into account the asymptotic behaviour of Macdonald function we get 4?fcW = sp{z) - (Ri,o(^)r 00 (z) -1 Ro,i(^))fcfc • Taken together, these considerations mean that Ta.}\ti{z) = D(z), or in other words P r o p o s i t i o n 2 . 1 . For z G p{Hatp) with Imz > 0 the resolvent of Haip is given by = R«(z)
RaA*)
Ka.tL
+
3. Spectral properties Before addressing our main question about resonances in this model, let us describe spectral properties of HUtp. The spectrum of Ha is found easily by separation of variables; using Proposition 2.1 in combination with Weyl's theorem and [17, Thm. XIII.19] we find that cress(-flrQ)/3) = aac(Ha^)
=
- - a 2 , 00 J
Less trivial is the discrete spectrum. An efficient way to determine it is provided by the generalized Birman-Schwinger principle, which in view of Theorem 2.1 reads z e o-disc{Ha>f3) «• 0 e a d i s c ( r ( z ) ) ,
dimkerr(z) = d i m k e r ( i / a , / 3 - z ) ,
(6)
1
Ha,pz = zcj)z <$4>z = ^ R L , t ( z ) T 7 i , 2
ioi z € adisc(Ha>p),
(7)
t=0
where (r?o,z> Vi,z) G ker r(^) — cf. [16]. Moreover, it is clear from the explicit form of [r(z)] _ 1 that 0 £ 0disc(r(z)) o O e o-<nsc(D(z)); this reduces the task to an algebraic problem. Consider again the case n = 1 with the point interaction placed at (0, a) with a > 0. In absence of the line, the operator HQ^ has a single eigenvalue ep = — 4e 2 ( _ 2 7 r / 3 + ^ 1 ^; we will show thatCTdisc(#a,/3)is nonempty for any a > 0. More specifically, we claim that T h e o r e m 3 . 1 . For any a > 0 and (3 € R the operator Ha>p has one isolated eigenvalue —K2 with the eigenvector given in terms of the Fourier transform
C nSt
°
r
(Q-ipia
L y^r
1/2a
+
ae~(Pi+x-l)
\
eipx
(2(p?+ «2)V»-a) J WT* dp'
where p = {pi,P2)- The function a H-> — K2 is continuously increasing in (0,00) and satisfies lima_^oo(—K2) = min {e^, — ^ a 2 } , w/wZe the opposite limit —K2 := lim a ->o( — « 2 ) is finite. Proof. One has to find z for which kerD(-) is nontrivial. We put z = —K2 with K > 0 and introduce D(K) := D(—K2), and similarly for other quantities. By a straightforward calculation we find that D(K) acts as a multiplication by 7 a («) := sp(n) — 4>a{^), where
MK)
=
^ L
(2(P2 + K 2 ) 1 / 2 - «)(P2 + K 2 ) 1 / 2
dp
(8)
Leaky quantum wire and dots: a resonance model
597
and S0(K) = ^ [in ^ — ip(l)]. It is straightforward to check that K —> %(K) is continuous, strictly increasing, and tends to ±00 as K —> 00 and K —> \a+, respectively. Hence the equation %(K) = 0 has a unique solution na in (^a,00). Evaluating R L , I ( K ) we get the eigenfunction from (7). Moreover, using (8) we find that for a fixed K the function a H-» >a(«) is decreasing; combining this with the fact that sp(-) is increasing we conclude that a 1—> Ka is decreasing. Next we employ the relation lim a _ 0O (f>a(K) = 0, which is easily seen to be valid pointwise; in combination with Sp^—ep) = 0 it yields the sought limit for a —> 00. To finish the proof, recall that (8) is bounded from above by >O(K) and the equation S(J(K) — 4>O(K) = 0 has a unique finite solution Ko• If n > 1 the structure of the spectrum becomes more complicated. For instance, it is clear that Ha,0 can have embedded eigenvalues provided the sets II and f3 have a mirror symmetry w.r.t. E and o-disc(i70,/3) n (—\a 2 ,0) ^ 0. In this short paper we restrict ourselves to quoting the following general result, referring to [8] for proof and more details. Theorem 3.2. For any a > 0 and (3 — (/?i,... ,/3„) C R" the operator Ha^ has N isolated eigenvalues, where 1 < N < n. In particular, if all the point interactions are strong enough, i.e., the numbers —0i are sufficiently large, we have N = n.
4. Resonances 4.1. Poles of the continued resolvent For simplicity we consider again a single point interaction placed at y — (0, a) with a > 0. In addition we have to assume that if the tunneling between y and the line is neglected, the point interaction eigenvalue is embedded into the continuous spectrum of Ha, in other words, that e@ > — \o?. As usual, analyzing resonances means to investigate singularities in the analytical continuation of R(-) from the "physical sheet" across the cut [-\a2,oo). Our main insight is that the constituents of the operator at the right-hand side of (5) can be separately continued analytically. Consequently, one can extend the Birman-Schwinger principle to the complex region and to look for zeros in the analytic continuation of £>(•). A direct calculation shows that D(z) acts for z £ C \ [— \ot2,00) as a multiplication by M(M da{z) := sp(z) - 4>a{z) = sp{z) - r j dt, Jo t - z - \az
(9)
where
{a-2i{z-t)ll2)e2i
ia MM):
~167T
tl/2(2_t)l/2
We shall construct the continuation of da to a region £)_ of the other sheet which has the interval (—|a 2 ,0) as a part of its boundary at the real axis. To this aim we need more notation. Put fi°(\,t) := lim £ _oM(A+i£,t) and for A G (—\a2,Q) introduce the symbol
Jo
t-X-
\al
598
PAVEL EXNER, SYLWIA K O N D E J
with the integral understood as its corresponding principal value. Finally, we denote
'«-<*> := T ( , + - x ) v 2
for ze n u
- (-r 2 > °
Lemma 4 . 1 . The function z i-> 0 o (z) defined in (9) can be continued analytically across (— TQ! 2 ,0) to a region fi_ of the second sheet as follows, 02(A) = /(A) + ga,a(X)
for A e ( - ^ a 2 , o ) ,
&"(*) = - [°° t ^ ^ l 22dt-2ga,a(z) J0 t - z - \a
forzen-,lmz<0.
Proof. By a direct if tedious computation — cf. [8] — one can verify the relations -\a2<X<0,
l i r a l j i ( \ ± i e ) = 4>°a(\),
where >+ = >„; so the claim of the lemma follows from the edge-of-the-wedge theorem.
•
Notice that apart of fixing a part of its boundary, we have imposed no restrictions on the shape of fi_. The lemma allows us in turn to construct the analytic continuation of da{-) across the same segment of the real axis. It is given by the function T)a • M H-> C, where M = {z:lmz > 0 } U ( - | a 2 , 0 ) u Q _ acting as r,a(z) =
l (z) S/3(z)-4> a (z),
where l(z) = ± if ±Imz > 0 and l(z) = 0 if z € (—^a 2 ,0), respectively. The problem at hand is now to show that rja{-) has a second-sheet zero, i.e., for some z G f2_. To proceed further it is convenient to put <^ := yf—ep, and since we are interested here primarily in large distances a, to make the following reparametrization, b := e'^0
and fj(b, z) := r]a(z) : [ 0 , O O ) X M H . C ;
we look then for zeros of the function fj for small values of b. With this notation we have a (a + 2(t-Xy/i)bW-»1/2/v
o *
{A t}
'
~ Ifor"
tV2( t _ A )l/2
ia '
9a,alb)W - "J
*>'<> ( A+
1^)1/2 '
<10)
for A e (—\o?., 0), and similarly for the other constituents of fj. This yields our main result. Theorem 4.1. Assume e^ > —\a2. For any b small enough the function fj(-, •) has a zero at a point z(b) £ Q_ with the real and imaginary part, z(b) = fi(b) + iv(b), i/(b) < 0, which in the limit b —> 0, i.e., a —> oo, behave in the following way, r(b) = e(} + 0(b),
v{b) = 0{b).
(11)
Proof. By assumption we have C/3 € (0, \a). Using formulae (10) together with the similar expressions of n(z,t) and ga,a{z) in terms of b one can check that for a fixed b G [0,oo) the function fj(b, •) is analytic in M while with respect to both variables fj is just of the C 1
Leaky quantum wire and dots: a resonance model
599
class in a neighbourhood of the point ( O ^ ) . Moreover, it is easy to see that fy(0,e^) = 0 and dzrj(0, ep) 7^ 0. Thus by the implicit function theorem there exists a neighbourhood UQ of zero and a unique function z(b) : UQ \-* C such that fj(b, z(b)) = 0 holds for all b G UQSince Haip is self-adjoint, u(b) cannot be positive, while z(b) G (—|a 2 ,0) for b ^ 0 can be excluded by inspecting the explicit form of fj. Finally, by smoothness properties of f) both the real and imaginary part of z(b) are of the C 1 class which yields the behaviour (11). • Remark 4.1. Since fi_ can be arbitrarily extended to the lower complex halfplane and all the quantities involved depend analytically on a, it is natural to ask what happens with the pole for other values of a. Using Lemma 4.1 one can check that in the limit Im z —> —oo we have \4>~(z)\ —* 0 uniformly in a and \sff{z)\ —> oo. Thus the imaginary part of the solution z(a) to sp{z) —
Ra(z)+Va(z)~1(-,Vz)vz,
=
where vz := Ra;L,i(z). We apply this operator to w\+ie(x) := ei(^+^+^2/i)l/2x1 e-a\x2\/2 and take the limit e —> 0+ in the sense of distributions; then a straightforward if tedious calculation shows that Ha^ has a generalized eigenfunction which for large |xi| behaves as VA(*) » e^-*" 2 / 4 ) 1 ' 2 * 1 e-»l"l/ 2 + l- aVa(X)-1
e
^°
e i(A+aV4)^| X l | e _ a | I a | / 2
for each A G (—|a 2 ,0). This yields the sought quantities. Proposition 4.1. The reflection and transmission amplitudes are given by nX)=T(X)-l
=
-arja(Xr{x+1_a2)1/-;
they have the same pole in the analytical continuation to fi_ as the continued resolvent. 4.3. Resonances induced by broken symmetry If n > 2 the resonance structure may become more complicated. A new feature is the occurrence of resonances coming from a violation of mirror symmetry. We will illustrate it on the simplest example of a pair of point interactions placed at x\ = (0, a) and Xi = (0, —a) with a > 0 and coupling /% := (/?, /? + 6), where b is the symmetry-breaking parameter. We choose a, a, (3 in such a way that the Hamiltonian Ho,/30 with two identical point interactions spaced by 2a has two eigenvalues, the larger of which — called ti — exceeds —\o?. As we
600
PAVEL EXNER, SYLWIA K O N D E J
have pointed out, Ha,/}0 has then in view of antisymmetry the same eigenvalue e2 embedded in the negative part of its continuous spectrum. Modifying the argument which led us to Theorem 2.1 we have now to continue analytically t h e 2 x 2 m a t r i x D{z) and find zeros of its determinant. T h i s yields t h e equation + b) - ^ 0 ( 2 a v / = i ) 2 - (2Sf3(z)
sp{z){s0{z)
+ b)^z\z)
- 2K0(2aV^)
= 0,
(12)
where <j>a (•) is denned in Lemma 4.1 and the left-hand side can be understood as a function fj(b,z) : R \ {0} x M —> C. We denote also K2 = V~e2, §W •= —iga,aW a n d put s'p{-), K'0(-) for corresponding derivatives; t h e n we have the following result. T h e o r e m 4 . 2 . Suppose that e2 6 ( — ^ a 2 , 0 ) , then for all nonzero b small enough the equation (12) has a solution 22(b) € fi_ with the real and imaginary part, 22(6) = M2W + iv2(b), which are real-analytic functions with the following expansions, M2(6)
Mb)
+
s'/3(K2) +
2aK!,(2aK2)b+0(•b),
^(£2) 2(S'0(K2)
+ 2aK'0{2aK2))\sp{K2)
,
2
- >°(e2)|2
+ U{
°
3)
'
P r o o f . As in Theorem 4.1 we rely on the implicit function theorem, but fj is now jointly analytic, so is z2. Since s'p(K2) + 2aK'0{2an2) > 0 the leading t e r m of v2(b) is negative. •
Acknowledgments S. K. is grateful for the hospitality in Nuclear Physics Institute, AS CR, where a p a r t of this work was done. T h e research has been partially supported by the GAAS Grant A1048101.
References 1. S. Albeverio, F. Gesztesy, R. H0egh-Krohn, H. Holden, Solvable Models in Quantum Mechanics, Springer, Heidelberg, 1988. 2. P. Exner, Lett. Math. Phys. 57, 87 (2001). 3. P. Exner, in Proceedings of the NSF Summer Research Conference (Mt. Holyoke 2002), AMS Contemporary Mathematics Series, 2003. 4. P. Exner, T. Ichinose, J. Phys. A: Math. Gen. 34, 1439 (2001). 5. P. Exner, S. Kondej, Ann. H. Poincare 3, 967 (2002). 6. P. Exner, S. Kondej, J. Phys. A: Math. Gen. 36, 443 (2003). 7. P. Exner, S. Kondej, arXiv:math-ph/0303033. 8. P. Exner, S. Kondej, "Leaky quantum graphs: a solvable resonance model", in preparation. 9. P. Exner, K. Nemcova, J. Phys. A: Math. Gen. 34, 7783 (2001). 10. P. Exner, K. Nemcova, arXiv:math-ph/0306033. 11. P. Exner, M. Tater, arXiv:math-ph/0303006 . 12. P. Exner, K. Yoshitomi, J. Geom. Phys. 4 1 , 344 (2002). 13. P. Exner, K. Yoshitomi, Ann. H. Poincare 2, 1139 (2001). 14. P. Exner, K. Yoshitomi, J. Phys. A: Math. Gen. 35, 3479 (2002). 15. P. Exner, K. Yoshitomi, Lett. Math. Phys. (2003), to appear; arXiv:math-ph/0303072. 16. A. Posilicano, J. Funct. Anal. 183, 109 (2001), and Ann. Scuola Norm. Sup. Pisa, to appear. 17. M. Reed, B. Simon, Methods of Modern Mathematical Physics IV, Academic Press, N.Y., 1978.
Product formula for quantum Zeno dynamics TAKASHI ICHINOSE
(Kanazawa U.),
PAVEL E X N E R
(Academy of Sciences, Prague)
A product formula is proved which involves the unitary group generated by a semibounded self-adjoint operator and an orthogonal projection P. It gives a partial answer to the question about existence of t h e limit which describes quantum Zeno dynamics in the subspace Ran P.
1. Introduction and the main result The fact the decay can be slowed or even prevented by frequently repeated measurements, noticed first by Beskow and Nilsson [1], and called quantum Zeno effect by Misra and Sudarshan [2], attracted recently a lot of attention, in part because it seems to be nowadays experimentally accessible. On the other hand, mathematically the issue was addressed already long time ago by Friedman [3] and Chernoff [4,5], but the question about existence of the "Zeno limit" remained open. The aim of this note is to provide a partial answer. Let H and P be a nonnegative self-adjoint operator and an orthogonal projection acting in a separable Hilbert space H, respectively. Suppose that the operator HP :=
(H1/2Py(H^2P)
is densely defined, so that it is self-adjoint in "H. Then we claim that the said limit exists in the sense described below, and moreover, we are able to find its value. More specifically, the aim of this note is to announce the following theorem. Theorem 1.1. Let H be a nonnegative self-adjoint operator in a separable Hilbert space ft and P an orthogonal projection on H. Assume that HP := (Hll2P)*{Hll2P) is densely defined. Then there exists a sequence {m„} of increasing integers and a subset M C [0, oo) of Lebesgue measure zero such that the relation s-lim[Pe- i e t f f / m "P] m "
=e-ietHpP
holds for every t € [0, oo)\M for both e = ± l . In fact, the proof given in [6] yields a stronger result with P on the left-hand side replaced by values of a projection-valued family {P(t)} such that P(t) —> P strongly as t —> 0 and P(t)P = P(t) with some auxiliary condition, as well as non-symmetric versions of such formulae. The method we use employs a modification of a Kato's result [7] on the Trotter product formula in combination with analytic continuation. Before sketching the argument let us mention an example with a natural quantum mechanical interpretation. To this aim, consider H = —A in L2(Rd), and P projecting onto an open set fi C Md with a smooth boundary, i.e., P acts as multiplication by the indicator function Xfi of the ^- Then the Zeno limit exists for almost every t along some sequence of increasing integers and the corresponding operator Hp is easily checked to coincide with
601
602
TAKASHI ICHINOSE, PAVEL E X N E R
the Dirichlet Laplacian -AQ in L2(Q) with domain £>[-A n ] = WQ1(Q,) n W2(Q). While it does not prove fully the formal claim made recently in [8] using the method of stationary phase, this conclusion gives nevertheless a strong indication that the result is valid.
2. Outline of t h e proof We consider only the case e — + 1 . As mentioned above we modify Kato's method of proof of the Trotter product formula for the form sum of two non-negative self-adjoint operators. Let us describe the essential steps in the argument. Put F(C, T) := Pe~
Pe~^HP}.
-
In view of Chernoff's theorem [4,5] we need to show the following lemma. Lemma 2.1. For every sequence {r„} of positive numbers with rn —> 0 as n —> oo, there exists a subsequence {r^} of {r„} and a set M C [0, oo) of Lebesgue measure zero such that, for all 6 € [0, oo) \ M, (I + S{i6,T'n))-1—>
(I + i0HP)~1P
strongly as n -> oo .
Let us first show this implies the theorem. Let r n :— l/n. By the lemma, there exists a subsequence {m„} of {n} such that, for every t G [0, oo) \ M with t ^ 0, P(I + S(it,l/mn))~1P—y
(I + itHP)~1P
strongly as
mn->oo,
and hence on PH. we have ( ( / + S(it,l/mn)\pn)~
—> [(/ + itHp)\pn}~
strongly as mn -> oo;
then from the proof of Chernoff's theorem we have on the subspace PH [pe-itH/mnp}mnf
>e-itHPpf
strongly
^
mn ^ QQ
for any / G H for t £ M. On the other hand, it is straightforward to see that for / e (I-P)H the relations [ p e - l t / / / m " P ] m n / = 0 and e~ltHpPf — 0 are valid, proving thus the theorem. Proof of Lemma 2.1. Note that S(C,,T) is defined also in the closed halfplane Re£ > 0 except for (, = 0. Our aim is to show that the strong convergence of (/ + S{C,,r))_1 to the indicated limit persists for £ on the (positive) imaginary axis, except possibly for points of a "small" set M. This is achieved in three steps. First, we show it for C, on the positive real axis, next in the open first quarter, and finally we extend the result to the imaginary axis. Claim 1. For Q = t > 0 and S(t,r) {I + S^T))-1
= T'1^
- Pe~tTHP],
—ytf + tHpytp
we have
strongly as
T->0.
Proof is analogous to Kato's argument. Given an / £ W, put u(t,r) Then denoting Q := I - P and H(tr) := {tr)-1^ - e~tTH], we have
= (I + S(t, r ) ) - 1 / .
H/ll2 = ||w(*,7-)||2 + (l + 2^)x- 2 ||^^(t,^)|| 2 +2i||i?(*x) 1 / 2 ^w(i,x)|j 2 + £ 2 ||^i?(tT-)^^(£,x)|| 2 .
603
Product formula for quantum Zeno dynamics
Since all the terms on the right-hand side are bounded, for each fixed t there exists a subsequence {?"„(£)}, possibly dependent on i, such that r n (t) —> 0 as n —* oo, along which the involved vectors converge weakly in H, T~lQu(t,T)—>g0(t),
u(t,r)—>u(t), tPH(tT)Pu{t,T)
t1/2H(tr)1/2Pu(t,r)—>hi(t),
>h2(t),
for some vectors u(t), go(t), h\(t) and /i2(£) in H. We can check that these limits are hi(t) = tl'2Hl/2Pu{t),
h2(t) =
tHPu{t),
and / = u(t) + Qf + tHpu(t), or equivalently u(t) = (I + tHp)~lPf, and show that these vectors converge independently of a sequence {r„(i)} chosen, and that the convergence is in fact strong. This proves Claim 1. Using Claim 1 and mimicking Feldman's argument [9], cf. also [5, p. 90] and [3], in combination with Vitali theorem on analytic continuation we find that for Re C > 0 one has (/ + S{C,T))-1
—•+ (J + C,HP)-lP
strongly as
T -• 0
and therefore on the boundary halnine, Re Q = 0 or £ = it with t real, the convergence holds at least in the following weaker sense: Claim 2. For any pair of vectors f, g &H, the family (g, (I + 5 ( i £ , r ) ) - 1 / ) of functions of t in L°°(R) converges weakly* to (g, (I + itHP)-lPf) as r -> 0. The crucial part in the proof of Lemma 2.1 is the last step to the final assertion. Claim 3. For every sequence {r n } of positive numbers with rn —> 0 as n —> oo there exists a subsequence {r^} of {r„} and a set M C [0, oo) of Lebesgue measure zero such that for all t € [0, oo) \ M and all / G Tt we have (I + S{it, O )
- 1
/ —• (J + itHp)~lPf
strongly as
n -> oo.
Proof. The idea is to use an argument similar to that used to prove Claim 1 on W, however, this time not on the Hilbert ~H but on the Frechet space L2joc([0, oo); H) = L2]oc([0, oo))
+ t2\\P(B(tT) +
iA(tr))Pu(t,T)\\2.
Consequently, all the W-valued families of vectors on the right-hand side are bounded by ||/|| for all t > 0 and small r , and moreover, they are also locally bounded as W-valued
604
TAKASHI ICHINOSE, PAVEL EXNER
functions in L2ioc([0, oo); 7i). It follows that there exists a subsequence {r n } along which the vectors converge weakly in the space L2ioc([0, oo);W): u(t, T) —• u{t), tP(B(tr)
T~lQu{t, T)—»/o(<),
+ iA(tT))Pu(t,
t^2B(tT)^2Pu(t,
T)—>*(*),
T)—ny(t),
for some vectors u(t), fo(t), z(t) and y(t) in L2\oc([0,oo);H). Next we have to check that this weak-topology limit, u(t) = w-lim u(t,Tn) = w-lim(7 + S(it, r„)) _ 1 /> 71—>OC>
71—>0O
coincides with the one obtained in Claim 2, i.e., with the expression (I + itHp)~1Pf. To do so we show as before that these vectors converge, independently of a sequence {r„} chosen, and in fact strongly in I,2ioc([0, OO); Ti), SO that for a.e. t the following relations u(t) = Pu(t) e PH,
z(t) = 0,
fo(t)=Qf,
f = u(t)+Qf
+ iy(t)
are valid. It is at this stage when we encounter the exceptional set M of Lebesgue measure zero in t > 0. With this exception, i.e., for t $ M, we have y(t) = tHPu(t)
or
u(t) = w-lim(7 + S(it, r ) ) - 1 / = (I +
itHP)~lPf.
r—>0
To be able to choose M independently of / , we need to assume separability of H. Finally, one has to check that for every sequence {T„} of positive numbers with T„ —> 0 as n —> oo, there exists a set M C [0, oo) of Lebesgue measure zero and a subsequence {r^} of {r„} along which for all t & M, (I + 5(it,r^))' _ 1 converges strongly to (I + itHP)~1P in Ti. This finishes the outline of the proof; the details can be found in [6].
Acknowledgments The research of P. E. and T. I. respectively has been partially supported by GAAS and Czech Ministry of Education under the contracts 1048101 and ME482, and by the Grant-in-Aid for Scientific Research (B) No. 13440044, Japan Society for the Promotion of Science.
References 1. 2. 3. 4. 5. 6.
J. Beskow, J. Nilsson, Arkiv. Fys. 34, 561 (1967). B. Misra, E. C. G. Sudarshan, J. Math. Phys. 18, 756 (1977). C Friedman, Indiana Math. J. 21, 1001 (1971/2). P. R. Chernoff, J. Fund. Anal. 2, 238 (1968). P. R. Chernoff, Mem. Amer. Math. Soc. 140, Providence, R.I., 1974. P. Exner, T. Ichinose, "Product formula related to quantum Zeno dynamics", arXiv:math-ph/ 032060. 7. T. Kato, in Topics in Functional Analysis (I. Gohberg and Mark Kac, eds.), Academic Press, New York, 1978, pp. 185-195. 8. P. Facchi, S. Pascazio, A. Scardicchio, L. S. Schulman, Phys. Rev. A 65, 012108 (2002). 9. J. Feldman, Trans. Amer. Math. Soc. 108, 251 (1963).
The adiabatic theorem of quantum mechanics and the Riccati equation VADIM KOSTRYKIN KONSTANTIN
(Fraunhofer Inst, fur Lasertechnik), A. MAKAROV (U. Missouri)
We outline a new approach to the proof of the Adiabatic Theorem of Quantum Mechanics. The key idea of this approach is to parametrize the subspaces involved as graphs of some operators. This leads to a singular perturbation problem for the differential Riccati equation with operator valued coefficients.
1. Introduction For a family H(t), t G [0,1], of self-adjoint operators on the separable Hilbert space 7i denote by We(t,s) the propagator associated with this family, i.e., the solution of the Schrodinger equation iedtWe(t, s) = H(t)We(t,s) with WE(s,s) = IH. (1) Let P(t), t 6 [0,1], be a smooth family of orthogonal projections commuting with H(t), H(t)P{t) = P(t)H(t). Under the Schrodinger evolution the initial subspace RanP(O) (the range of P(0)) is transformed to RanP e (t) with Pe(t) = W£(t,0)P(0)W£(0,t). The statements about closeness of P£{t) to P(t) are called Adiabatic Theorems of Quantum Mechanics. If for alH G [0,1] the spectra of H(t)\RllnP(t) and H(t)\KerP(t) a r e separated by a gap of positive length (gap condition), then
\\Pe(t)-P(t)\\
= 0(e).
This result proven in [4] has a long history (see, e.g., [5,9,12]) which originates from the early age of Quantum Mechanics (see also [13] where the Adiabatic Theorem has been extended "to all orders" in e). The adiabatic theorem without gap condition has been proven in [1] (see also [7,14]). Assume in addition that P(t) is finite dimensional. If P(t) is the spectral projection of H(t) for almost all t G [0,1], then
\\P£(t)-P(t)\\=o(l) for all t G [0,1]. We mention also related results in [2,3,6-8]. In this short note we outline a new approach to the proof of the adiabatic theorem. The key idea of this approach is to parametrize the subspaces entering the adiabatic theorem as graphs of some operators. We show that the Schrodinger evolution of subspaces is determined by the differential Riccati equation with operator valued coefficients whereas the adiabatic evolution is governed by its algebraic version. This observation links the adiabatic limit for the Schrodinger equation to a singular perturbation problem for the differential Riccati equation. The analysis of this singular perturbation problem will be presented elsewhere [11].
605
606
VADIM KOSTRYKIN, KONSTANTIN A. MAKAROV
2. Adiabatic evolution of graph subspaces Let H(t), t £ [0,1], be a continuous family of bounded self-adjoint operators on the separable Hilbert space H. Assume that P(t) is a continuously differentiable family of orthogonal projections such that RanP(£) is invariant with respect to the operator H(t) for any t £ [0,1]. Consider the orthogonal decomposition H = Ho ® Hi, where Ho = RanP(O) and Hi = KerP(O). With respect to this decomposition the operator H{t) can be represented as a 2 x 2 operator matrix
(H0(t)
B(t)\
where Hi(t) := P(0)- L F(t)P(0)- L ,
H0(t) := P(0)H(t)P(0),
and B(t) = P ( 0 ) # ( i ) P ( 0 ) x .
Obviously, B(0) = 0. From the continuity of P(t) it follows that there is a ti £ (0,1) such that ||P(i)-P(0)||
(2)
for all t £ [0, t\]. The inequality (2) implies (see, e.g., [10]) that for all t € [0, ti] the subspace Ran P(t) is the graph of bounded operator X(t) : Ho —* Hi such that Ran P(t) = {x © X(t)x | x G H0}
for all t £ [0, t{\.
Moreover, by a result of [10] the family X(t) is continuously differentiable and P(t)
=
( (J«o + X*Xr' \X(IHo+X*Xr1
a«o + X*X)-iX* \ Xilnv+X'XriX*)'
where X is the shorthand for X(t). Since RanP(£) is invariant with respect to H(t), by Corollary 4.5 in [10] the operator X(t) satisfies the (algebraic) Riccati equation Hi(t)X(t)
- X(t)H0(t)
- X(t)B(t)X(t)
+ B{t)* = 0.
(4)
3. Schrodinger evolution of graph subspaces Let
= H{t)i>e{t)
with the initial condition 1JJE(0) =
(5)
The adiabatic theorem of quantum mechanics and the Riccati equation where ££(t) = P(0)tp£(t) and r)£(t) = P(0)xi/je(t). iedtMt)
607
Inserting this into (5) we get
= H0(t%(t)
+
isdtVeit) = B(t)*Ut)
B(t)r,£(t),
+ H!(t)rie(t),
(6)
with initial data ££(0) = ip, rj£(0) = 0. Obviously, the family P£(t) = We(t,0)P(Q)We(0,t) is continuously differentiable. As long as \\P£(t) - P(0)|| < 1 the subspace RanP £ (£) is a graph subspace associated with the decomposition H = Ho ® Hi, i.e., Ran P£(t) = {x® X£{t)x\ x € H0} with some bounded continuously differentiable X£(t) : Ho —> Hi. Since ipe(t) £ RanP e (£) we have
r,S) = X£(t)Ut)Inserting this in equation (6) we get iedt£(t) = (H0(t) + B{t)X£{t)) ^(t)
(7)
with initial data £(0) = ip(0) and the differential Riccati equation iedtX£(t)
= Hi{t)X£(t)
- X£(t)H0(t)
- X£{t)B(t)X£(t)
+ B(t)*
(8)
with X£(0)
= 0.
Conversely, if for some e > 0 the Riccati equation (8) has a bounded continuously differentiable solution X£(t) on some interval [0,T], then the inequality
HPe(t) - p(o)|| =
,
l|Xg(t)l1
holds for all t € [0,r]. Moreover, if in addition we may solve (7), then the solution of the initial value problem (5) is given by ip£(t) = ££ © X£(t)££(t). Although the family Ho(t)+B(t)X£{t) entering equation (7) is in general not self-adjoint, this equation can be reduced to an evolution equation with self-adjoint operators. Proposition 3.1. Assume that the Riccati equation (8) has a bounded continuously differentiable solution X£(t) on some nontrivial interval t £ [0,r]. Then the solution to equation (7) is given by
Ut) =
[i+x£{t)*x£{t)]-v2u£{t),
where ue(t) solves the initial value problem iedtu£(t) = Y£(t)u£(t),
u£(0) = ip,
with the self-adjoint family Ye(t) := [/ + Xe(t)*Xe(t)]1/2
(Ho(t) + B(t)Xs(t))
- ie[i + x£(tyx£(t))V2dt[i
+
In particular, ||£ e (*)|| 2 < ||v>||2||[/ + X e (t)*JC e (t)]~ 1/2 ||.
[I +
X£(t)*X£(t))-V2
x£{tyx£{t)}-1'2.
608
VADIM KOSTRYKIN, KONSTANTIN A. MAKAROV
Proof. Using the differential Riccati equation (8) it is easy to show that ([/ + Xe(tyXe(t)\
(H0(t) +
B(t)Xs(t)))*
= [I + XE(t)*XE(t)} (H0(t) + B(t)Xe(t))
+
iedt(XE(tyXE(t)).
Therefore,
([/ + xe(tyxe(t)}1/2 (H0(t) + B(t)xe(t)) [i + xe{tyxe{t)}-112)*
= ([i+xsyxE{t)rl/2[i+xsyxs)]
(#ow+s(wo) [i+xsyxs)]-1'2)*
= [i + xE{tyx£{t)}1'2 (H0(t) + B(t)xe(t)) [i + xsyxsT1'2 + ie[i + xE{tyxe{t)]-v2dt{xs{tyxe{t)) [i + xe{tyxE{t)\-1'2. Hence,
YE(ty - Y&) = ie[i + x^tyxs)}-1'2 dt(xe(tyxe(t)) [i + xE{tyxE{t)\-1'2 + iedt[i + xe{tyxE{t)}-1'2 • [i + xE{tyxE{t)Y'2 + ie[I + XE{tyXE{t)}1'2
dt[I + Xe{tyXE{t)]-1/2.
(9)
Noting that dt[i +
xE{tyxE{t)\-1/2
= -[i + xe{tyxE{t))-1/2 • dt\i + xe(tyxE(t)}-^
• [i + xE(tyxe(t)}-^2
we obtain
at[/+x£(t)*x£(i)]1/2 = -[/+x£(i)*xe(t)]1/2.at[/+x£(t)*x£(i)]-1/2.[/+x£(o*xe(t)]1/2. Therefore,
dt (x£(tyxE(t)) = dt[i + xE(tyx£(t))V2 • \i + x£{tyxE{t)Y'2 + [i + xsyxs)}1'2 •ft[/ + xsyxs)]1'2 = -[i + xE{tyxE{t)]1'2 • dt[i + xsyxs)}-1'2 • \i + x£(tyxE(t)\ 12 -[i + xE(tyxE(t)} • dt[i + x^tyxs))- ' • [/ + xE{tyxE{t)]1'2. Inserting this relation in equation (9) we obtain Y£{t)* =Y£{t).
•
4. Comparison of evolutions The arguments presented in the previous two sections show that the norm \\P£(t) - P(t)\\ is controlled by deviation of the solution X£(t) of the differential Riccati equation (8) from the solution X(t) to its algebraic version (4). The latter is formally obtained by setting e = 0 in (8). Equation (3) and a similar representation for P£(t) in terms of X£(t) immediately yield the following result.
The adiabatic theorem of quantum mechanics and the Riccati equation
609
Proposition 4.1. Assume that the Riccati equation (8) has a bounded continuously differentiable solution X£(t) on some nontrivial interval t G [0, r] C [0, t\}. Then there is a constant C > 0 such that \\Pe(t)-P(t)\\
PM = P(tn)
/P{h) P(0) = P(*o) Figure 1. Partition of the path.
Proposition 4.2. Assume that there is 5(e) > 0 such that for any k £ { 0 , . . . ,n — 1} the inequality \\W£(t,tk)P(tk)We(tk,t) holds for all t £ [tk,tk+i].
- P(t)\\ < 6(e)
(10)
Then \\P£(t)-P(t)\\
for all t£ [0,1]. Proof. We claim that for any k G { 0 , . . . , n — 1} the inequality \\Ps(t)-P(t)\\<(k
+ l)6(e)
(11)
holds for all t G [io,*A:+i]- For k = 0 this inequality is implied by (10). Assume that (11) holds for some k G {l,...,n — 2}. Now for all t G [£*+!>**+2]
we
610
VADIM KOSTRYKIN, KONSTANTIN A. MAKAROV
estimate \\P£(t) - P(t)\\
<
\\P£(t)-Ws(t,tk+1)P(tk+1)W£(tk+1,t)\\ + \\W£(t,tk+1)P(tk+1)W£(tk+1,t)
= \\W£(t,tk+1)(P£(tk+1)
-
-
P(t)\\
-
P(t)\\
P(tk+l))W£(tk+l,t)\\
+ \\W£{t,tk+1)P(tk+1)W£(tk+1,t) = ||P e (t f c + i) - P ( t f c + i ) | | + \\W£(t,tk+l)P{tk+1)W£(tk+1,t)
-
P(t)\\
<(k + l)S(e) + S(e) = {k + 2)8(e), using hypothesis (10) on the last step.
•
References 1. J. E. Avron, A. Elgart, "Adiabatic theorem without a gap condition", Comm. Math. Phys. 203, 445-463 (1999). 2. J. E. Avron, A. Elgart, "Smooth adiabatic evolutions with leaky power tails", J. Phys. A: Math. Gen. 32, L537-L546 (1999). 3. J. E. Avron, J. Howland, B. Simon, "Adiabatic theorems for dense point spectra", Comm. Math. Phys. 128, 497-507 (1990). 4. J. E. Avron, R. Seiler, L. G. Yaffe, "Adiabatic theorems and applications to the Quantum Hall Effect", Comm. Math. Phys. 110, 33-49 (1987); Erratum: ibid. 156, 649-650 (1993). 5. M. Born, V. Fock, "Beweis des Adiabatensatzes", Z. Phys. 51, 165-180 (1928). 6. F. Bornemann, Homogenization in Time of Singularly Perturbed Mechanical Systems, Lecture Notes in Mathematics vol. 1687, Springer-Verlag, Heidelberg, 1998. 7. A. Elgart, H. Schenker, "A strong operator topology adiabatic theorem", Rev. Math. Phys. 14, 569-584 (2002). 8. A. Joye, F. Monti, S. Guerin, H. R. Jauslin, "Adiabatic evolution for systems with infinitely many eigenvalue crossings", J. Math. Phys. 40, 5456-5472 (1999). 9. T. Kato, "On the adiabatic theorem of quantum mechanics", J. Phys. Soc. Japan 5, 435-439 (1950). 10. V. Kostrykin, K. A. Makarov, A. K. Motovilov, "Existence and uniqueness of solutions to the operator Riccati equation. A geometric approach", Contemporary Mathematics 327, 181-198 (2003). 11. V. Kostrykin, K. A. Makarov, "Another look at the adiabatic theorem of quantum mechanics", in preparation. 12. S. G. Krein, Linear Differential Equations in Banach Space, Translations of Math. Monographs vol. 21, Amer. Math. Soc, Providence, R.I., 1972. 13. G. Nenciu, "Linear adiabatic theory. Exponential estimates", Comm. Math. Phys. 152, 479-496 (1993). 14. S. Teufel, A note on the adiabatic theorem without gap condition, Lett. Math. Phys. 58, 2 6 1 266 (2001).
Zero energy asymptotics of the resolvent in the long range case E.
SKIBSTED
(U. Aarhus), S.
FOURNAIS
(U. Paris-Sud)
We present a limiting absorption principle at zero energy for two-body Schrodinger operators with long-range potentials having a positive virial at infinity. Furthermore, we prove existence of limits (in weighted spaces), as the spectral parameter tends t o zero, of all powers of the resolvent. The principal tools of proof are absence of eigenvalue at zero, singular Mourre theory and microlocal estimates. Some elements of the proof will be explained.
1. Statement of main results We give an account of some recent results on asymptotic expansion at zero of the resolvent R(() = (H - C) _ 1 of a two-body Schrodinger operator H = - A + V on L 2 (E d ); see [5] for details. It is well-known, see [12,18] and the more recent work [13] in which further references can be found, that if V(x) = 0(|:r|~( 2 + e ') with e > 0 then such an asymptotic expansion exists. For the 'long-range' case, V(x) = 0 ( | x | _ / i ) with fi < 2, much less is known. To our knowledge, the only results on limiting absorption principles for such potentials are [22] and [17] (and [2] for the purely Coulombic case). In [22] only radially symmetric potentials are treated, and though radial symmetry is not imposed in [17] some of the assumptions of that paper appear unnecessarily restrictive. Here we present a complete asymptotic expansion of the resolvent at zero energy, for a much wider class of potentials. Our basic assumption is a sign condition at infinity, V(x) < -c\x\-,i,
\x\ > R,
(1)
and a similar positive virial condition. For such potentials we prove complete asymptotic expansions (in weighted spaces) oo
Rp~)Xj
R(X + (-)tO) x ^
for A — 0+;
(2)
j=o
here R$ ^ RQ . We also show that zero is not an eigenvalue. (This is implicit in (2).) We notice that there is no explicit dimension-dependence or fractional/inverse powers in A. It is well-known that for 'long-range' potentials that are positive at infinity, zero can indeed be an eigenvalue. This explains one aspect of the condition (1). Probably the best intuitive explanation of the result (2) is given in terms of the WKB-ansatz for stationary solutions to the Schrodinger equation — tp" + Vip = Eip in dimension d = 1 ^ » C + ( £ - Vyieif(E-v)idx
+ C_(E-
y)-ie-i/
dx
.
(3)
Under the condition (1) the oscillatory behaviour survives for E « 0, with E > 0 (since J(—V)?dx ~ |a;| 1_ 2 —» oo). Moreover, (3) suggests that zero is not an eigenvalue, and
611
612
E. SKIBSTED, S. FOURNAIS
also indicates which weights one needs in (2). We remark that indeed (2) can be proved for d = 1 by WKB-methods, see [22], which can also be used to prove optimality of the weights in the results below. Let us state our main results precisely. Let 0 < 9 < -K and define Te = {z G C \ {0} | \z\ < 1, arg z S (0,0)}.
(4)
We have the following limiting absorption principle at zero energy for a Schrodinger operator H = -A + V onH = L2(Rd) recalling the notation R(0 = (H - £ ) _ 1 . Although we shall not elaborate here, it is enough to impose the conditions (1.1) and (1.1) near infinity. Theorem 1.1. Let V(x) = V\(x) + V^x), x € Rd, be a real-valued potential. Suppose there exists 0 < \x < 2 such that V satisfies the conditions (l.l)-(l.l) below. (1) There exists e\ > 0 such that V\(x) < —ei(x) ^; (a;) = (2) For all a g ( N U {0})d there exists Ca > 0 such that
yjl+x2.
{xY+^\dPVx{x)\
-1-H/2-S
for \x\ > R. (6) V satisfies unique continuation at infinity (see Assumption 2.1 in section 2). Then for all s £ (1/2 + /x/4,1/2 + /x/4 + 5) and all 0 < 0 < n the family of operators B(0 — (X)~SR(0(X)~S *s uniformly Holder continuous in I V In particular there exists Csfi > 0 such that mip\\(x)-'R(0{x)-'\\
lim
(x)->R{Q(x)-,
C—»u,cei e
- iO) (x)-° = ^ J i m
3
r$(x)-
R
(C) (^)~ S
B(L2(Rd)).
Next, we have existence of limits for powers of the resolvent. The asymptotic expansion (2) is an easy consequence of Theorem 1.2 below. Notice also that Theorem 1.1 is a particular case of Theorem 1.2. Theorem 1.2. Let V — V\ + V2 satisfy the conditions in Theorem 1.1 with (5) replaced by: For some mo S N (5') V2 = 0(fc- m °- £ ); k = k(x) =
{x)1+^2.
613
Zero energy asymptotics of the resolvent in the long range case Let m < mo, 9 £ (0, n) and e > 0. Then there exists C > 0 such that ||jfc_(m_1/2)_ei2^mA-(m-l/2)-e|| <
(g)
C)
for all £ € T$. Furthermore, the function £ ,_»
k~(m-l/2)-eR^mk-(m-l/2)-c
is uniformly Holder continuous in Tg. Using Theorem 1.1 one may define (with Hi =
fc-1/2_eL2(R,i),
JS'(+0) = {2ni)-1 {R(0 + tO) - R{0 - *0)} e
H2 =
fc1/2+eL2(Ed)):
B(Hi,Hi).
One can prove that indeed E'(+0) ± 0.
(7)
Let F{\x\ < C) denote the multiplication operator by the characteristic function of {a; | |x| < C} and let K = (1 + / V 2 ) - 1 Corollary 1.1. Under the conditions of Theorem 1.2 with (5') valid for all mo G N: (1) For all s > §(1 + f ) and f e Cg°(R)
J (x)- s (e-«^(/ l[0lOO))(H) + «-V(0)B'(+0)) (x)- s J = 0(t"2). f5) For a?/ 0 < e' < e < 1 there exists s > 1 (depending on e) such that, for all I e Cg°(R), |F(|X|
< t f 1 - ^ " ) e - ^ a l p . o o j ) ^ ) ^ ) - ' ! = 0(t~(1+^).
(8)
Remark 1.1. (1) By time reversal invariance there are similar bounds for t —> — oo. (2) Due to (7) and Corollary 1.1 (1.1) the best one could hope for to the right in (8) would be the bound 0 ( £ - 1 ) (for /(0) ^ 0). Moreover we would expect that tK is indeed the borderline for this kind of low energy, minimal velocity estimate. In fact there is a sharp analogous bound in classical mechanics, cf. [6] and [20]. (3) If V2 € Cg°(R d ) one may take / = 1 in Corollary 1.1 (1.1) and (1.1). This follows readily from the given statements and well-established high energy estimates, see [15, Theorem 1.1], [3, Theorem 1] or [11, Theorem 1.2 (ii)]; in fact some local singularities may be included. We shall outline the proof of Theorems 1.1 and 1.2 in the following sections. Apart from the notation {x) = y/1 + x2, used above, we will also need the notation p = —iV and A — (x -p + p- x)/2. The virial W of the potential V is defined by W = — 2V — x • VV. We recall the (formal) identity i[H, A] = 2H + W. By design of the splitting of V, the assumptions (1.1) and (1.1) of Theorem 1.1 yield Wx{x) = -2Vi(a:) - x • W i ( i ) > eie 2 (x)-^, so in particular, the virial W is positive in the case V2 = 0.
614
E. SKIBSTED, S. FOURNAIS
2. Absence of bound states for Schrodinger operators In this section we present a basic result of independent interest, namely the absence of zeroenergy bound states for long range potentials negative at infinity. The proof is a variation of the technique applied in the proof of [19, Theorem XIII.58] of which the present result is a generalization. The conditions which exclude zero-energy eigenfunctions are given in Assumptions 2.1 and 2.2 below. Notice that the assumptions in Theorem 1.1 are stronger than Assumption 2.2: Take h = e r - ^ / 2 for a small e > 0 and s close to 1. Let us specify the notation x = ruj £M.d, with u e S d _ 1 . Assumption 2.1. The function V : Rd —+ R is measurable, and if u G H2(Rd), u = 0 in a neighbourhood of oo, the product Vip G L2(Md) and u is a distributional solution to - A w + Vu = 0, then u — 0. We remark that for d > 3 the condition V G LIo'c (Rd) suffices, see [14]. Assumption 2.2. The function V can be written as V = V\ + V2, such that: For some s G [0,1), some R, C > 0 and a positive differentiable function h = h(r) defined on [R, 00) we have (1) (2) (3) (4)
Vi and V2 are bounded on \x\ > R, and Vi is negative on \x\ > R. sup w 6 5 ( i -i f (r^V^ru)) < ~rsh2{r) when r > R. r"1 + r s\ipueSd-i \V2(ruj)\ = o(h) as r —> 00. h'(r) < Ch2{r) on \x\ > R.
With the above assumptions we can prove the absence of zero-energy eigenstates. Theorem 2.1. Suppose V = Vx+Vz satisfies Assumptions 2.1 and 2.2. Suppose furthermore that ip e Hlc(Rd) satisfies (2.1)-(2.1) below. (V J\x\>RhHrM(x)\2dx < 00 and ^^Viix^ix)]2 dx < 00. 2 d (2)Pj1>eL (R );j = l,...,d. (3) The product Vip G L2oc(Rd), and ( - A + V)ip = 0 in the sense of distributions. Then tp = 0.
3. Extended limiting absorption principles We introduce that following symbols: a0(x,£) = fE(x)-2e, with
f = fE = yJ^E
6(X,0 = A - 7 - T - T ,
(x) fs{x) + (1 - M / 2 ) - 1 { x ) - " , KO,E>0,
and where the parameters will be specified below.
(9) (10)
Zero energy asymptotics of the resolvent in the long range case
615
Let us denote by Opw(a) the Weyl quantization of a symbol a. Explicitly Opw(a) acts as follows (Op»4>)(:r) = (2n)-d JJ ei(*-yKa((x + y)/2,Z)<j>(y)dydt. Theorem 3.1. Let V(x) satisfy the conditions of Theorem 1.1 with V2 = 0. We reformulate the assumption (1.1) as: For some KQ > 0 and 2 > /x > 0, W{x) = ~2V(x) - x • VV(s) > 2Kg(i)-".
(11)
Let 9 G (0,7r) and Fg be as defined in (4). Let ao and b be as defined in (9) with E — \(\. Define k = k(x) = (:r) 1 +' i / 2 . Then the following conclusions, (12)~(12), hold for H = p2+V with all bounds being uniform in £ G I V (1) Let m G N and let e > 0 be arbitrary. Then there exists C > 0 such that ||fc_(m_l/2)_eij^mjfe-(m_l/2)-e|| <
(12a)
c
(2) There exists CQ > 0, depending only on V, such that if supp(F + ) C (Co, 00) and F'+ G C Q ° ( E ) , then for all m G N and all e,t > 0 there exists C > 0 such that ll^.t-i/2-e Op w (F + (flo))i?(C) m ^" t _ m + 1 / 2 ~ £ || < C,
(12b)
w
(12c)
t 1/2 £
l| jfe _t_ Tn+ i/2-« i j^jm Op ( J F + (a 0 ))fc (3) Let F+,F-
- || < C.
satisfy (with K 0 from (11)j inf supp(F + ) > —Ko,
supsupp(F_) < K 0 I
F!_,F[ GCg°(R). Let F_ G
CQ°(R).
T/ien /or u l I m e N and all e,t>0
there exists C > 0 swc/i tficrf
|| fc t-i/2-« Op w (F_(a 0 )F_(6))^(C) m fc- t - m + 1 / 2 - £ || < C, t 1
e
|| f c _ t _ m + i/2_ e / j^j m Of(F_(a 0 )F + (b))k - ^- \\
< C.
(12d) (12e)
(^j Suppose F+ and F_ satisfy the assumptions from (12), F}_,F2 G Co°(R) and dist(supp(F + ),supp(F_)) > 0. Then for allm G N and allt > 0 there exists C > 0
SMC/I
i/iat
||fc*Op w (Fl(a 0 )F_(&))i?(Cr Op w (^(ao)F + (6))fc'|| < C.
(12f)
Suppose F+ is given as in (12), some functions F+,F-,Fare given as in (12) and suppose dist(supp(F_), supp(F + )) > 0. Then for allm G N and allt > 0 there exists C > 0 SMC/I i/iai
p * Op w (F + (a 0 )) R(()m Opw(F_(a0)F+(b)) w
m
w
k*\\ < C, 4
||fc* Op (F_(a 0 )F_(6)) i?(C) Op (F+(a 0 )) fc || < C.
(12g) (12h)
616
E. SKIBSTED, S. FOURNAIS
The proof of Theorem 3.1 reduces by elementary algebra to the case of bounds with only one resolvent i.e., m = 1, cf. [10] or [9]. The partition of unity needed for this reduction is indicated in (19). In the case m = 1, (12a) follows by a singular Mourre theory while (12d) and (12e) follow by a certain modification of a method of [7]. The bounds (12b) and (12c) may be thought of as energy localizations. Certain energy-dependent positivity bounds, given by the Fefferman-Phong inequality in a certain Hormander-Weyl calculus, play an important role in the proof of the bounds in (12), (12) and (12) for m = 1.
4. Perturbative argument Using Theorems 2.1 and 3.1 one may easily prove Theorems 1.1 and 1.2 by perturbative arguments. In this section we will show Theorem 1.1. Let us write for £ € Fe i?(0 = ( t f - C ) - \
JRi(C)
= (i?i-C)-1;
H1=P2
+ v1.
(13)
We shall proceed perturbatively using R(Q(I + V2R1{0) = Ri(0.
(14)
First we notice that Ri(() is uniformly Holder continuous in Tg. If s > 3/2(1 + /J./2) this follows from (12a) with m = 2 (showing in fact Lipschitz continuity in this case). If s < 3/2(1 + /u/2) we may interpolate the bounds of (12a) with m = 1 and m = 2. In particular R+ = R1(0 + iO)= and
R7=R1(0-i0)=
lim
i?i«)
lim RAO C-»o,<ere
are well-defined (in weighted spaces). To show (5) (in the general case) it suffices to show that (x)a (I + V2R'i){x)~s is invertible as an operator on L2(Rd). This follows from (14), the standard limiting absorption principle for positive energies and absence of positive eigenvalues, cf. [16], [21] and [4, section 6.5]. Since (x^V^-Rf (x)~3 is compact it suffices to show that the equation «£ = -V2R+
Using that
(15) Then we have in (16)
iD-
' 2 i ' > 0 we obtain from the calculation
0 = 3 < V , W = -3
(17)
$ e L2(Rd).
(18)
We claim that
Zero energy asymptotics of the resolvent in the long range case
617
We shall prove (18) using Theorem 3.1 in a b o o t s t r a p argument. (For a similar problem for the free Laplacian see the proof of [1, Theorem 3.3].) We pick a real-valued function F+ as in Theorem 3.1 (12) such t h a t F+(x) = 1 for \x\ > 2C 0 . Let F _ = 1 - F+. Pick real-valued functions F _ and F+ as in Theorem 3.1 (12) such t h a t F _ + F+ — 1. Then we decompose with the symbols a 0 and b being defined as in (9) with E = 0 in the expression (10) for / V* = Opw(F+(a0))
1> + O p w ( F _ ( a 0 ) F _ ( 6 ) ) if, + Op w (F_(a 0 )F+(6)) V-
(19)
By (12b) and (12d) the first two terms on the right hand side of (19) belong to (x)s> L2 where (assuming here (j> £ (X)~SL2)
We notice t h a t t h e bound (12e) for m — 1 is equivalent t o | | f c t - 1 / 2 - e O p w ( J F _ ( a 0 ) F + ( 6 ) ) f i 1 ( C ) * f c - t - 1 / 2 - £ | |
(21)
Taking C, —> 0 in the sector Te, (21) leads to ||fct-1/2-£Opw(F_(a0)F+(6))i?r^t"1/2"e|l < C ,
(22)
with the same convention for ao and b as above. We use the representation ip — R^
Acknowledgments S. Fournais was supported by a grant from the Carlsberg Foundation (before 31.12.02) and by a Marie Curie Fellowship of the European Community P r o g r a m m e 'Improving the H u m a n Research Potential and the Socio-Economic Knowledge Base' under contract number HPMF-CT-2002-01822 (from 01.01.03). E. Skibsted is (partially) supported by M a P h y S t o — A Network in Mathematical Physics and Stochastics, funded by T h e Danish National Research Foundation.
References 1. S. Agmon, "Spectral properties of Schrodinger operators and scattering theory", Ann. Scula Norm. Sup. Pisa 2, 152-218 (1975). 2. D. Bolle, F. Gesztesy, W. Schweiger, "Scattering theory for long-range systems at threshold", J. Math. Phys. 26, 1661-1674 (1985).
618
E. SKIBSTED, S. FOURNAIS
3. H. L. Cycon, P. A. Perry, "Local time-decay of high energy scattering states for the Schrodinger equation", Math. Z. 188, 125-142 (1984). 4. J. Dereziriski, C Gerard, Scattering theory of classical and quantum N-particle systems, Texts and Monographs in Physics, Springer-Verlag, Berlin, 1997. 5. S. Fournais, E. Skibsted, "Zero energy asymptotics of the resolvent for a class of slowly decaying potentials", MaPhySto Preprint no. 2003-14 (2003). 6. C. Gerard, "Asymptotic completeness for 3-particle systems", Invent. Math. 114, 333-397 (1993). 7. C. Gerard, H. Isozaki, E. Skibsted, "N-body resolvent estimates", J. Math. Soc. Japan 48, 135-160 (1996). 8. L. Hormander, The analysis of linear partial differential operators. HI, Springer-Verlag, Berlin, 1985. 9. H. Isozaki, "Differentiability of generalized Fourier transforms associated with Schrodinger operators", J. Math. Kyoto Univ. 25, 789-806 (1985). 10. A. Jensen, "Propagation estimates for Schrodinger-type operators", Trans. Amer. Math. Soc. 291, 129-144 (1985). 11. A. Jensen, "High energy resolvent estimates for generalized many-body Schrodinger operators", Publ. RIMS, Kyoto Univ. 25, 155-167 (1989). 12. A. Jensen, T. Kato, "Spectral properties of Schrodinger operators and time-decay of the wave functions", Duke Math. J. 46, 583-611 (1979). 13. A. Jensen, G. Nenciu, "A unified approach to resolvent expansions at thresholds", Rev. Math. Physics 13, 717-754 (2001). 14. D. Jerison, C. E. Kenig, "Unique continuation and absence of positive eigenvalues for Schrodinger operators", Ann. of Math. (2) 121, 463-494 (1985); with an appendix by E. M. Stein. 15. K. Kitada, "Time-decay of the high energy part of the solution for a Schrodinger equation", J. Fac. Sci. Univ. Tokyo Sect. IA Math. 3 1 , 109-146 (1984). 16. E. Mourre, "Absence of singular continuous spectrum for certain selfadjoint operators", Comm. Math. Phys. 78, 391-408 (1980/81). 17. S. Nakamura, "Low energy asymptotics for Schrodinger operators with slowly decreasing potentials", Comm. Math. Phys. 161, 63-76 (1994). 18. J. Rauch, "Local decay of scattering solutions to Schrodinger's equation", Comm. Math. Phys. 61, 149-168 (1978). 19. M. Reed, B. Simon, Methods of modern mathematical physics I-IV, Academic Press, 1972-78. 20. E. Skibsted, "Long-range scattering of three-body quantum systems: Asymptotic completeness", Invent. Math. 151, 65-99 (2003). 21. H. Tamura, "Principle of limiting absorption for Af-body Schrodinger operators", Letters in Math. Phys. 17, 31-36 (1989). 22. D. R. Yafaev, "The low energy scattering for slowly decreasing potentials", Comm. Math. Phys. 85, 177-196 (1982).
Fractional moment methods for Anderson localization in the continuum (U. Alabama at Birmingham), MICHAEL AIZENMAN (Princeton), ELGART (Stanford), SERGEY NABOKO (St. Petersburg State U.),
G U N T E R STOLZ ALEXANDER
J E F F R E Y H. SCHENKER (ETH
Ziirich)
The fractional moment method, which was initially developed in the discrete context for the analysis of the localization properties of lattice random operators, is extended to apply to random Schrodinger operators in the continuum. One of the new results for continuum operators are exponentially decaying bounds for the mean value of transition amplitudes, for energies throughout the localization regime. An obstacle which up to now prevented an extension of this method to the continuum is the lack of a uniform bound on the Lifshitz-Krein spectral shift associated with the local potential terms. This difficulty is resolved through an analysis of the resonance-diffusing effects of the disorder.
1. Introduction The addition of disorder through a random potential may have a drastic effect on the spectral and dynamical properties of a Schrodinger operator. In certain energy regimes the spectrum of the operator may turn from absolutely continuous into dense pure point with localized eigenstates. This phenomenon, known as Anderson localization, also manifests itself in the form of dynamical localization, that is the non-spreading of wave packets supported in the corresponding energy regimes. There are two known approaches to the mathematical analysis of localization properties for multidimensional random Schrodinger operators. Both were initially developed in the discrete context, i.e., for random lattice operators. The method of multiscale analysis goes back to the ground breaking work of Frohlich and Spencer [5] from 1983 and, by now, has lead to a multitude of results on spectral and dynamical localization for a wide range of models. In 1993, Aizenman and Molchanov [2] introduced the fractional moment method into the study of Anderson localization. For discrete systems this method has provided a simple perspective on localization and has enabled exponentially decaying bounds on expectation values of various propagation kernels. Multiscale analysis has meanwhile been extended to continuum Anderson-type models, see e.g. [4,7] and, for a state of the art account, [6]. For an introduction to multiscale analysis (which is not used in our work) and many references, see also the recent book [10]. Our goal here is to outline a continuum version of the fractional moment method and its consequences, with results roughly corresponding to those obtained for the lattice case in [3]. We focus on a continuum Anderson-type model in L2(Rd) of the form Hu := H0 + XVU .
(1)
In section 2 we will introduce a prototypical set of assumptions on the background operator Ho and random potential Vu. We then state three results: First, in section 3, we discuss
619
620
G. Stolz, M. Aizenman, A. Elgart, S. Naboko, J. H. Schenker
a crucial boundedness result for fractional moments of "smeared" Green functions, i.e., operator norms of spatially localized resolvents. Exponentially decaying bounds on this quantity will then be shown to imply pure point spectrum with exponentially decaying eigenfunctions as well as exponential decay for the mean values of transition amplitudes (section 4). A finite volume criterion for exponential Green function bounds is provided in section 5. Finally, as sketched in section 6, applications of the fractional moment method are found by verifying the finite volume criterion in the usual large disorder, Lifshitz tail or band edge regimes. A full account of this work with detailed proofs of all the results stated below is provided in [1]. For a discussion of other consequences of the localization results established here, e.g. Kubo conductance and quantum Hall effect, see the contribution of A. Elgart to this volume.
2. A prototypical model The operator Ho may incorporate deterministic magnetic and electric potentials, i.e., have the form H0 = (tV - A(q))2 + V0(q) .
(2)
Simple and for our considerations suitable assumptions are local boundedness of the vector potential A, its derivatives d{A and the positive part Vo,+ of the electric potential. We also assume that Vo is bounded from below. Thus HQ is bounded below and we let EQ := mia(Ho). Disorder is introduced into (1) through a parameter A and an Anderson-type random potential
Vu)(q) = Y,Tl«;Mq).
(3)
For simplicity we will assume here that X = Zrf and that Ua(q) = U(q — a), a £l, for a non-negative, bounded and compactly supported single site potential U, say suppC/ C BQ, where BTX = {q : \q — x\ < r}. For technical reasons we also assume that |9(suppf/)| — 0 (dA denoting the boundary of a set A and | • | d-dimensional Lebesgue measure) and that the Ua cover space in the sense that 0 < &_ < ^2 uoc(q)
(4)
uniformly in q £ Rd. Finally, we assume that the coupling parameters i) a , a e I , are independent, identically distributed random variables with absolutely continuous distribution rf/x(rj) = p{rf)dr]. The density p is bounded and supported in [0,1]. A number of these assumptions can be weakened, in particular those on I, Ua and rja. The coefficients of the background operator HQ may include certain L p -type singularities. For more discussion on what is technically necessary see [1].
Fractional moment methods for Anderson localization in the continuum
621
3. Finiteness of fractional moments A central object in the fractional moment approach to localization for lattice operators is given by the fractional moments E(\GE+io{x,y)\s), where Gz(x,y) = (SX\(H - z)~x\5y) is the Green function, 0 < s < 1, and E denotes averaging over the disorder. Finiteness of the fractional moments is seen relatively easily for suitable distribution of the random parameters as the singularities of the Green function become integrable through the exponent s
1 Xy W-E-ie H
Xx
' ) < C.,x{\ + 1/A)S(1 + \E-
£0|r(d+2>
(5)
for any open £1 c R d , x,y e fi and E s R . One can choose (1 + A) s ( d + 2 ' Cs A < const -i —+ . 1—s
(6)
As in the discrete case, the proof of this result proceeds by showing that the independent variation of some of the random parameters r]a resolves singularities which are due to the proximity of the given energy to an eigenvalue whose eigenvector has significant support nearby. However, a change in a parameter can also have the opposite effect, through the creation of a resonance. In the discrete setup the latter possibility occurs at not more than a single value of the random parameter, since each coefficient affects a rank-one term and the number of energy levels which can be moved past E is bounded by the rank of the perturbation. Aside from the fact that the rank-one analysis is not applicable, the source of the difficulty in the extension of the previous analysis can be traced to the fact that, in the continuum setup, there is no uniform bound on the corresponding "spectral shift". To circumvent these difficulties we employ the Birman-Schwinger principle in place of rankone analysis, and control the Lebesgue measure of the nearly-singular values of a coupling parameter by means of the following "weak 1-1" type bound \{T,:\\T(V + A + iO)-1T\\HS>t}\
< j\\TfHS,
(7)
valid for any maximally dissipative operator A and Hilbert-Schmidt operator T. This result was proven in [8].
622
G. Stolz, M. Aizenman, A. Elgart, S. Naboko, J. H. Schenker
As a consequence of this analysis we find that it suffices to average over "local environments" of x and y. Rather than taking the full expectation one merely averages over the na with a in suitable neighborhoods of x and y. This yields a bound as in (5) with constants which are uniform in the values of the remaining random parameters, an improvement of Lemma 3.1 which is important in the proof of Theorem 5.1 below.
4. Localization properties The uniform fractional moment bound (5) holds for all energies in the class of continuum Anderson models considered here. In the following we will identify the existence of exponentially decaying bounds (in \x — y\) for the left hand side of (5) as a characteristic of the localization regime. We first show that such bounds for finite volume operators (but uniformly in the volume) imply spectral and dynamical localization. Let Q c Rd be open and A„ c Q, n € N, a sequence of bounded open domains such that (J A n = fi and #(A™) converges to H^ in strong resolvent sense. We also define a modified distance by distn(:r, y) := min{|a; - y\, dist(x, f2c) + dist(y, f2c)} . (8) Pj(H)
denotes the spectral projection onto J for H and || • || tr the trace norm.
T h e o r e m 4 . 1 . Let H, Q. and An, n G N, be as above. Suppose that for some 0 < s < 1 and an open bounded interval J there are constants A < oo and fi > 0 such that
IA
i :
#(An)
dE < ^e-MdistAn(x,y)
-E Xy
(g)
for alln £ N and x, y £ A„. Then for every r < 1/(2 — s) there exists Ar < oo such that < Are-^dist"^^
E ( sup \\xxg(HM)Pj(HM)xv\L)
(10)
for every x,y £ Q. Here the supremum is taken over all Borel measurable functions g which satisfy \g\ < 1 pointwise. In the case Q. = Rd it further holds that the spectrum of H in J is almost surely pure point, with eigenfunctions ip which for every v £ (0,2/(2 — s)) satisfy limsup
, ,
|z|-oo
R
n
<-v.
(11)
The bound (10) with g(H) — eUH implies dynamical localization with exponential decay of the transition amplitudes for wave packets with energies restricted to J. This is stronger than the dynamical bounds which can be obtained through the multiscale analysis approach, e.g. [6] for the best known result. Theorem 4.1 as well as Theorem 5.1 below are applicable even when the operator exhibits extended boundary states in certain geometries, provided there is "localization in the bulk". This is the relevance of the domain adapted metric distn- Note that distKd(x,y) = \x — y\. While typical applications of Theorem 4.1 (see section 5) will work with exponential bounds for E ( | | x x ( # ( A n ) - Ey-XyW8) which are uniform in E 6 J, it is interesting to note that Theorem 4.1 only requires the energy-averaged bound (9).
Fractional moment methods for Anderson localization in the continuum
623
The proof of Theorem 4.1 in [1] proceeds by first verifying the bound (10) for the finite volume operators H^An\ with constants uniform in n. In finite volume the norm of Xxg(H)Pj(H)xy may be estimated in terms of sums of bounds on rank-one operators. The latter have equal operator and trace norms, which ultimately allows to state (10) as a trace norm bound.
5. A finite volume criterion In applications of Theorem 4.1 it is necessary to find energy regimes in which the exponential resolvent bound (9) can be verified. In this section we provide a finite volume sufficiency criterion for the desired exponential decay. We define the boundary layer of a set 5A:={q:r<
distfa, Ac) < 23r} ,
(12)
where the choice of the depth is somewhat arbitrary, but convenient for the technical implementation of the proof of the following result. Theorem 5.1. Let H be as above. Then for each s S (0,1/3) and A > 0 there exists Ms,\ < oo, such that if for some E € R and L > 24r, e-T := M,tX{l + 1/A) 2s (l + \E- E0\)5s{d+2) x lim sup sup E e->0
aGZ
(1 +
L)2{d~^
1
'H^k)-E-ie
SBk
(13)
<1, then for any open fi C M.d and any x, y 6 Q lim sup E £—0
Xx (Q) H
1 -E-ie Xy
< e>A(a,X,E)e
-•ydistn(x,y)/2L
(14)
with A(s, A, E) the right hand side of (5). One may choose (1 + A)5s(d+4> Ms>\ = const • 3s
(15)
The proof of Theorem 5.1 proceeds essentially by an iterative argument where the distance from x to y is covered by balls of radius L. The bound (13) serves as an initial decay estimate for the resolvent between the center and boundary of a ball of radius L, reflected in the positive exponent 7. An iterative geometric resolvent expansion is used to show that the decay adds up (or better: multiplies up) to exponential decay with rate proportional to 7 and distn(x, y). As this resolvent expansion does not work near the boundary of fi, one uses the modified distn. The factors which appear in each step of the resolvent expansion are not all independent. Therefore a (triple) Holder bound is used to factorize their expectations. This is the reason for having to work with s < 1/3. In order to not having to divide the exponent s by three in each step of the iteration (which would cause it to collapse into 0), the random parameters
624
G. Stolz, M. Aizenman, A. Elgart, S. Naboko, J. H. Schenker
r]a near the boundaries of domains used in the expansion are re-sampled in each step of the iteration. This means that they are replaced with parameters fja which are independent of the rja, but have the same distribution, a procedure which also appears, for example, in the spectral averaging argument of [9]. This allows to avoid the use of various versions of "decoupling lemmas" which have entered the fractional moment method for lattice models and seem to be harder to verify in the continuum. As opposed to the use of an iteratively increasing sequence of length scales in the multiscale analysis approach, only one length scale L is used by the fractional moment method to go from finite to infinite volume. In this iteration process, Lemma 3.1 plays a role similar to Wegner estimates in multiscale analysis. It provides a worst case bound on the growth of the resolvent over distances less than L, where (13) can not yet be used. The exponential decay bound (14) on resolvents is not only a necessary consequence of the finite volume criterion (13), but for Q = Rd is also sufficient for it, as shown by the following result: T h e o r e m 5.2. Let H be as above and suppose that for some A < oo, /i > 0 and B e l lim sup IE elo
1 H-E-ieXp
Xa
< Ae'^-ft
(16)
for all a, (3 G Z d . Then, for sufficiently large L, (13) is satisfied uniformly for all E' in an open neighborhood of E. A particular consequence of this is that Theorem 4.1 could be stated under assuming the infinite volume exponential decay bound (16) for the resolvent, as (16) implies (13) and (13) implies (14) (which allows for finite volume) and thus (9) for a neighborhood J of E and all A n . It is interesting to note that Theorem 5.2 allows to conclude localization on an open interval from a bound for a single energy. This is due to the fact that for finite volume A the fractional moments E ( | | x x ( # ^ — 2)~ 1 Xy|| s ) a r e Holder continuous in z G C Thus the set of energies where a bound like (13) is valid must be open.
6. Applications Applications of our method consist in verifying the bound (13) in concrete energy regimes. In this sense (13) is a fractional moment version of the initial length bounds used to start a multiscale analysis. Here are examples of regimes where (13) can be verified (for detailed statements see [1]): — The band edge/Lifshitz tail regime: Here (13) follows from smallness of the density of states in a suitable energy interval. One may work with smallness of the expected number of eigenvalues of finite volume operators H^ or directly with smallness of the integrated density of states in infinite volume, e.g. Lifshitz tails. — The large disorder regime: Under somewhat stronger assumptions on the distribution of the r]a, for example in the case of uniform distribution on [0,1], one can improve the bounds (6) and (15) and obtain CSt\ and Ms<\ which are bounded as A —> oo. This in turn may be used to prove localization in the large disorder regime: For every E' G R
Fractional moment methods for Anderson localization in the continuum
625
there exists A' sufficiently large such that for A > A' the energy interval (—oo,E') is localized (i.e., (13) can be verified for all E € (—oo,E')). — The multiscale analysis regime: One may also use the typical output of a multiscale analysis to verify (13) at sufficiently large L. Thus one gets the stronger dynamical localization bounds provided by the fractional moment method in all regimes where a multiscale analysis can be carried through and our general setup from section 2 holds.
Acknowledgements In t h e course of this project, t h e authors' work was supported in part by N S F grants P H Y 9971149 (MA, and AE), D M S 0070343 and 0245210 (GS), INT-0204308 ( J S ) , N S F postdoctoral research fellowship (JS), and a N A T O Collaborative Linkage Grant PST.CLG.976441 (GS a n d SN). T h e authors also t h a n k Caltech, Universite Paris 7 a n d t h e Institute MittagLeffler for hospitality which has facilitated this collaboration.
References 1. M. Aizenman, A. Elgart, S. Naboko, J. Schenker, G. Stolz, "Moment analysis for localization in random Schrodinger operators", preprint (2003), available at arXiv:math-ph/0308023 , also at mp_arc 03-377. 2. M. Aizenman, S. Molchanov, "Localization at large disorder and at extreme energies: an elementary derivation", Comm. Math. Phys. 157, 245-278 (1993). 3. M. Aizenman, J. H. Schenker, R. M. Friedrich, D. Hundertmark, "Finite-volume criteria for Anderson localization", Comm. Math. Phys. 224, 219-253 (2001). 4. J.-M. Combes, P. D. Hislop, "Localization for some continuous, random Hamiltonians in ddimension", J. Fund. Anal. 124, 149-180 (1994). 5. J. Frohlich, T. Spencer, "Absence of diffusion in the Anderson tight binding model for large disorder or low energy", Comm. Math. Phys. 88, 151-184 (1983). 6. F. Germinet, A. Klein, "Bootstrap multiscale analysis and localization in random media", Comm. Math. Phys. 222, 415-448 (2001). 7. S. Kotani, B. Simon, "Localization in general one-dimensional random systems", Comm. Math. Phys. 112, 103-119 (1987). 8. S. N. Naboko, "The structure of singularities of operator functions with a positive imaginary part", Funktsional. Anal, i Prilozhen. 25, 1-13 (1991). 9. B. Simon and T. Wolff, "Singular continuous spectrum under rank one perturbations and localization for random Hamiltonians", Comm. Pure Appl. Math. 39, 75-90 (1986). 10. P. Stollmann, Caught by disorder: bound states in random media, Progress in Mathematical Physics, vol. 20, Birkhauser, Boston, 2001.
A particle in the Bio-Savart-Laplace magnetic field: explicit solutions D. R. YAFAEV (U.
Rennes-1)
We consider the Schrodinger operator H = (iV + A)2 in the space L2(K 3 ) with a magnetic potential A created by an infinite straight current. We perform a spectral analysis of the operator H almost explicitly. In particular, we show that the operator H is absolutely continuous, its spectrum has infinite multiplicity and coincides with the positive half-axis. Then we find the large-time behavior of solutions exp(—iHt)f of the time dependent Schrodinger equation. Equations of classical mechanics are also integrated. Our main observation is that both quantum and classical particles have always a preferable (depending on its charge) direction of propagation along the current and both of them are confined in the plane orthogonal to the current.
1. Introduction They are very few examples of explicit solutions of the Schrodinger equation with a magnetic potential. Probably the only ones are a constant magnetic field, B(x,y, z) = BQ, (see, e.g., [3]) and, in the two dimensional case, a magnetic field localized at the origin, B{x,y) = B06(x,y) where 8(x,y) is the Dirac function (see [1]). The solution is expressed in terms of Hermite functions in the first case and in terms of Bessel functions in the second case. Here we suggest a third example of an explicitly solvable Schrodinger equation. Actually, we consider the magnetic field B(x,y,z) created by an infinite straight current. Physically, this case is opposite to the case of an (infinitely) thin straight solenoid considered in [1] by Aharonov and Bohm where the field is concentrated inside the solenoid. Suppose that the current coincides with the axis z and that the axes x, y and z are positively oriented. According to the Biot-Savart-Laplace law (see, e.g., [5]) B(x, y, z) = a(-r~2y,
r~2x, 0),
r = {x2 +
y2)1/2,
where |a| is proportional to the current strength and a > 0 (a < 0) if the current streams in the positive (negative) direction. The magnetic potential is defined by the equation B(x, y, z) = curl ^(a;, y, z) and can be chosen as A(x,y,z)
= -a(0,0,]nr).
(1) 3
Thus, the corresponding Schrodinger operator in the space L,2(M. ) has the form H = U1 = -d2x-d2
+ (idz-1\nr)2,
j = ea,
where e is the charge of a quantum particle of the mass m = 1/2 and the speed of the light c=l.
626
A particle in the Bio-Savart-Laplace magnetic field: explicit solutions
627
Since magnetic potential (1) grows as r —> oo, the Hamiltonian H does not fit to the well elaborated framework of spectral and scattering theory. Nevertheless, we perform in Section 2 its spectral analysis almost explicitly. To be more precise, we reduce the problem to an ordinary differential equation with the potential 7 2 ln r (let us call it logarithmic oscillator). We show that the operator H is absolutely continuous, its spectrum has infinite multiplicity and coincides with the positive half-axis. Then we find in Section 3 the large-time behavior of solutions exp(—iHt)f of the time dependent Schrodinger equation. In Section 4 we integrate equations of classical mechanics. Our main observation is that positively (negatively) charged quantum and classical particles always move in the direction of the current (in the opposite direction) and are localized in the orthogonal plane. A detailed presentation of the results of this note can be found in [6].
2. Spectral analysis of the operator H Let us first consider a more general magnetic potential A(x,y,z)
= (0,0, A(x,y))
(2)
with an arbitrary (we disregard here domain questions) real function A(x, y) which tends to infinity (either +oo or —oo) as r = (a;2 + y2)1^2 —» oo. The corresponding Schrodinger operator is H=-A + (idz + eA(x,y))2, where A is always the Laplacian in the variables (x,y). Since A does not depend on z, we make the Fourier transform $ = $ z in the variable z. Then the operator H = $ H $ * acts in the space Z2QR2 x K) as (Hu)(x,y,p)
=
(h(p)u)(x,y,p),
where h(p) = -A + (p-eA(x,y))2.
(3)
Here p € R (the momentum in the direction of the 2-axis) is the variable dual to z and the operator h(p) acts in the space L^M2). Since A(x, y) —> oo or A(x, y) —* —oo as r —> oo, the spectrum of each operator h(p) is positive and discrete. Let Xn(p), n = 1,2,... , be its eigenvalues, numerated in such a way that \n(p) are analytic functions of p. The spectrum of the operator H, and hence of H, consists of the sets (branches) covered by the functions Xn(p), n = 1,2,... , as p runs from —oo to oo. This is similar both to the cases of the constant field where Xn(p) = |eBo|(2n + 1) + p2 and to the periodic problem where the role of the momentum p in the direction of the z-axis is played by the quasimomentum (see, e.g. [4]). Thus, the general Floquet theory implies that the spectrum of H is absolutely continuous, up eventually to some eigenvalues of infinite multiplicity. Such eigenvalues appear if at least one of the functions \n{p) is a constant on some interval. Then this function is a constant for all p £ R . On the other hand, if, say, the function — eA(x, y) is semibounded from below, then inf
(x,3/)€K 2
(p-eA(x,y))2
=
(4)
628
D. R. YAFAEV
as p —> oo, and hence limp—xx, An (p) = oo for all n. Thus, we have the following simple result. Theorem 2.1. Suppose that A(x,y) is a semi-bounded function which tends either to +00 or to - c o as r —> co. Then the operator H is absolutely continuous. Note that the Thomas arguments (see, e.g., [4]) relying on the study of the operatorfunction h(p) for complex p are not necessary here. The problem may be further simplified if A(x, y) = A(r). Then we can separate variables in the polar coordinates (r, 6). Denote by Hm the space of functions u(r)elm6 where u € L^{R\ rdr) and m = 0, ± 1 , ± 2 , . . . is the orbital quantum number. Then 00
L2(R2) = 0
Hm.
(5)
m= — oo
Every subspace Hm is invariant with respect to the operator h(p). The spectra of their restrictions hm(p) on Jim consist of positive simple eigenvalues Xm,i{p) < ^m,2(p) < • • • > which are analytic functions of p. We denote by ipm,i('i",p),ipm,2(r'P)' • • • * n e corresponding eigenfunctions which are supposed to be normalized and real. Let us return to the operator H with the potential A(r) = —alnr. In this case operator (3) equals /i(p) = - A + ln 2 (e p r 7 ). Since H 7 u = H _ 7 u , it suffices to consider the case 7 > 0. It is convenient to transfer the dependence on the momentum p into the kinetic energy and to introduce the parameter a = ePl1 e (0,00) instead of p. Let us set K(a) = -a2 A + 7 2 In2 r, and let w(a), (w(a)f(x,y) L 2 (R 2 ). Then
= af(ax,ay),
(6)
be the unitary operator of dilations in the space
w*(a)h(p)w(a)=K(a),
a = eph.
(7)
We denote by fim,n(a) and 0 m , n (r, a) eigenvalues and eigenfunctions of the restrictions of the operators K(a) on the subspaces Hm. It follows from (7) that /xm,„(a) = Am,„(p) and
= 2a I
\V
(8)
This expression is obviously positive since otherwise <j)n(x,y,a) = const. The next assertion realizes an obvious idea that the spectrum of K{a) converges as a —• 0 (in the quasiclassical limit) to that of the multiplication operator by 7 2 In2 r, which is continuous and starts from zero.
A particle in the Bio-Savart-Laplace magnetic field: explicit solutions
629
Lemma 2.2. For every n, we have that lima_>oMn(a) = 0. Since the function Inr is not semibounded, relation (4) is not true in our case. Nevertheless, taking into account the kinetic energy, we obtain the following result. Lemma 2.3. For every n, we have that linia-^oo p.n(a) = oo. In terms of eigenvalues An(p) of the operators h(p), Lemmas 2.1-2.3 mean that X'n(p) > 0 for all p e R and limp__oo An(p) = 0, linip-Kx, \n(p) = oo (for 7 > 0). Let A n be multiplication operator by the function An(p) in the space L^iM). It follows from the results on the function Xn(p) that the spectrum of A„ is absolutely continuous, simple and coincides with the positive half axis. Let us introduce a unitary mapping 00
* : L 2 (R+ x R;r dr dp) -> 0 L 2 ( M ) n=l
by the formula (Vf)n{p)
= /0°° f(r,p)tpn(r,p)rdr.
Then 00
MH$*f*=0A„
(9)
n=l
(of course H = H m and A„ = A„)Tn), and we obtain the following T h e o r e m 2.2. The spectra of all operators H m and H are absolutely continuous, have infinite multiplicity and coincide with the positive half axis. As a by-product of our considerations, we have constructed a complete set of eigenfunctions of the operator H. They are parametrized by the orbital quantum number m, the momentum p in the direction of the z-axis and the number n of an eigenvalue Xm,n(p) of the operator hm(p) defined by formula (3) on the subspace 7im- Thus, if we set = e^e i m e Vm,„(r,p),
um,n,p(r,z,6) then xiu m>rii p = Am^n{jp)xim,n,p3. T i m e e v o l u t i o n
Explicit formulas obtained in the previous section allow us to find the asymptotics for large t of solutions u(t) = exp(—iHt)uo of the time dependent Schrodinger equation. On every subspace with a fixed orbital quantum number m, the problem reduces to the asymptotics of the function u(t) = exp(— iHmt)uo. Below we fix m and suppose that 7 > 0. Assume that (*«o)(r,p)=Vn(r,p)/(p),
(10)
where / G C£°(R). Then it follows from formula (9) that OO
/
eiPz-iX^Hn(r,P)f(p)dp. •00
(11)
630
D. R. YAFAEV
The stationary points of this integral are determined by the equation z = X'n(p)t.
(12)
Since X'n(p) > 0, the equation (12) has a solution only if zt > 0. We need the following information on the eigenvalues ^n(a) of the operator (6). Lemma 3.1. For every n, we have that lima-,o a,/J,'n(a) = 0. Indeed, it follows from equation (8) that a/j,'n(a) < 2/xn(o). Therefore it remains to use Lemma 2.2. Lemma 3.1 means that linip-^oo \'n(p) = 0. The following conjecture is physically quite plausible and is used mainly to formulate Theorem 3.1 below in a simpler form. Conjecture 3.1. For every n, we have that X'n(p) > 0 for allp £ R and lim^oo X'n(p) = oo. Therefore equation X'n(p) = v has a unique solution pn =
= l-
(13)
Let $ n (u) =
f{ipn{z/t))
{it)-"2
0(z/t) + Uoo(r, z, t), (14)
where lim ||«oo(-,*)ll = 0.
(15)
I—^±00
Note that the norm in the space L2(M.+ x R) of the first term in the right-hand side of (14) equals ||«o||- The asymptotics (14) extends of course to all / e L2(M.) and to linear combinations of functions (10) over different n. Thus, we have proven Theorem 3.1. Assume that Conjecture 3.1 is fulfilled. Suppose that 7 > 0. Let u(t) — exp(—iH m i)uo where UQ satisfies (10). Then the asymptotics as t —> ±00 of this function is given by relations (14), (15). Moreover, if f e CQ°(R) and =pz > 0, then the function u(r, z, t) tends to zero faster than any power of i:\t\~1 ast—> ±00. Conversely, for any g £ L2(R + ) define the function UQ by the equation
($u0)(r,p) = Mr, K<J>)) K(P)1/2 9(K(P))Then u(t) = exp(—iHmt)uo
has the asymptotics as t —» ±00
u(r, z, t) = e'*"^/*)Vn(r, z/t) g(z/t) (it)-^29(z/t) where Woo satisfies (15).
+ Uoo(r, z, t),
A particle in the Bio-Savart-Laplace magnetic field: explicit solutions
631
4. Classical mechanics Let us consider the motion of a classical particle of mass m = 1/2 and charge e in a magnetic field created by potential (2) where A(x, y) = A(r), r = (x2 +y2)1/2. We suppose that A(r) is an arbitrary C2-function such that A(r) = o(r~1) as r —» 0 and |-4(r)| —> oo as r —» oo. The solution given below is, to a large extent, similar to the Kepler solution of equations of motion for a particle in a spherically symmetric electric field. However, in the electric case the motion is always restricted to a plane, whereas in the magnetic case it is confined in the plane z = 0 but the propagation of a particle in the z-direction has Hamiltonian formulation. An approach based on the Newton equations can be found in [6]. Let r be a position of a particle and p be its momentum. Let us write down the Hamiltonian ff(r,p) = ( p 2 - e , 4 ( r ) ) 2 in the cylindrical coordinates (r,
= A(r), we have that
eA(r))2,
where pr, pv and pz are momenta conjugated to the coordinates r,
( m
where V(r)=M2r-2
+ (P-eA(r))2,
(17)
and
(18)
z'{t) = 2(P - eA(r{t))).
(19)
It suffices to solve the system (16) since, given r(t), the solutions of equations (18) and (19) are constructed by the formulas tp(t) = tp(0) +2M
[ r(sy2
ds
(20)
and z(t) - z(0) = 2 / (P - eA(r(s))) ds. (21) Jo The solution of the system (16) is quite similar to the solution of the Kepler problem although in our case the effective potential energy (17) depends additionally on the momentum P in the z-direction. In the solutions of the quantum problems, it is reflected by the fact that, for electric spherically symmetric potentials, the variables can be separated (in the spherical coordinates), whereas in our case the operators h(p) depend on p.
632
D. R. YAFAEV
Thus, to solve (16), we remark that A-lr'{tf
+ V(r(t)) = K,
(22)
where K = 4 ~ V ( 0 ) 2 + M 2 r(0)~ 2 + 4 _ 1 z'(0) 2 is a constant kinetic energy of a particle. Clearly, (22) is the equation of one-dimensional motion (see, e.g., [2]) with the effective potential energy V(r) and the total energy K. It admits the separation of variables and can be integrated by the formula 1 2
t = ±4 f(K-V{r))
dr.
(23)
Note that V(r) —> oo as r —• 0 and r —» oo. Let rmm and r m a x be the roots of the equation V(r) = K (r m j n and r m a x are the nearest to r(0) roots such that r m [ n < r(0) < r m a x ) . It follows from (23) that the function r(t) is periodic with period /•rmax
T=8/
,
.
-1/2
( # - V(r)J
(24)
rfr
and r m i n < r(i) < r m a x . One can imagine, for example, that on the period the function r(t) increases monotonically from r m ; n to r m a x and then decreases from r m a x to r m j n . Thus, we have integrated the system (16) and (18), (19). Theorem 4.1. In the variable r a classical particle moves periodically according to equation (23) with period (24). The angular variable is determined by equation (20) so that
(which is a
eA(r(t))).
Using equation (19), we see that 2ez'(t) = (r"(t) - 4M 2 r- 3 (£)).4'(r(i))- 1 . Integrating this equation and taking into account periodicity of the function r(t), we see that, for all t, fT
2e(z(t + T)-z(t))=
Jo
r"(s)A'(r(s))-1ds-AM2
,r Jo
= [ r'{t)2A'(r(t))-2A"{r{t))dt-AM2 Jo Let us formulate the results obtained.
r{s)-3A'{r(s))-1ds f r ( s ) - M ' ( r ( s ) ) - 1 d s . (26) Jo
A particle in the Bio-Savart-Laplace magnetic field: explicit solutions
633
Theorem 4.2. The increment of the variable z on every period is determined by equation (26). In particular, if ±eA'(r) < 0 and ±eA"(r) > 0 for all r, then inequality (25) holds. In this case z(t) = z0t + 0(1) with z0 = T-x(z(T) •*- z(0)), ±z0 > 0, as \t\ -> oo. In particular, for potentials A(r) = —alnr and A(r) = —ara where a £ (0,1), inequality (25) holds if ±ea > 0. Note that in these cases the fields B(x,y,z) = A'[r)r~1(y, — x,0) tend to 0 as r —> oo. It follows from equation (19) that if, say, eA!(r) < 0 and the point r c r is determined by the equation p = eA(rCI), then z(t) increasea for r(t) € ( r c r , r m a x ) and decreases for r(t) € (r m j n ,r c r ). Of course, it is possible that r c r < r m i n . In this case, z(t) always increases. Let us discuss this phenomena in more details on our leading example A(r) = —alnr. Then r c r = e~pl'1 where 7 = ea. The points r m ; n and r m a x , are determined from the equation V(r) = M2r~2 + ]n2(ffr'1)
= K.
The function z(t) is increasing for all t if r c r < r m i n or, equivalently, V(rCT) > K and rCr < r(0). The first of these conditions is equivalent to M2e2P^ > K or, since in view of (19) e2ph = r(0)- 2 e z '(°)/T, to M2r(0)-2ez'Wf
> 4-V'(0) 2 + M2r(0)~2
+ 4~V(0)2.
Thus, ^'(0) should be a sufficiently large positive number (^'(0) < 0 is definitely excluded). In this case the condition r c r < r(0) which is equivalent to z'(0) > 0 is automatically satisfied. Note finally that always r c r < r m a x , that is the function z(i) cannot be everywhere decreasing (this is of course also a consequence of Theorem 4.2). Indeed, inequality r c r > r m a x is equivalent to V(rcx) > K and r c r > r(0). The first of them require that z'(0) > 0 while the second require that z'(0) < 0. Thus, positively (negatively) charged classical and quantum particles always move asymptotically in the direction of the current (in the opposite direction). In the plane orthogonal to the direction of the current classical and quantum particles are essentially localized.
References 1. 2. 3. 4. 5. 6.
Y. Aharonov, D. Bohm, Phys. Rev. 115, 485 (1959). L. D. Landau, E. M. Lifshitz, Classical Mechanics, Pergamon Press, 1960. L. D. Landau, E. M. Lifshitz, Quantum Mechanics Pergamon Press, 1965. M. Reed, B. Simon, Methods of Modern Mathematical Physics IV (Academic Press, 1978). Y. Rocard, Electricite, Masson et Cle, 1956. D. Yafaev, Math. Phys. Anal. Geom. 6, 219 (2003).
Trotter-Kato product formula: some recent results V. A.
ZAGREBNOV
(U. de la Mediterranee (Aix-Marseille II) and CNRS,
Luminy)
Recently new results about the operator-norm and the trace-norm convergences of the Trotter-Kato product formulae with ultimate optimal error-bound estimates have been established. They concern the self-adjoint strongly continuous and Gibbs semigroups on Hilbert spaces. Explicit dependence of the rate of convergence on the generators domains is found.
1. Operator-norm convergence and optimal error bound 1.1. Operator-norm convergence Consider real-valued, Borel measurable functions / on [0, oo) satisfying 0(«)
/(0) = 1,
/'(0) = - l .
(1)
Some examples of functions satisfying (1) are f(s)=e~s,
f(s) = (l + k-1syk,
k>0.
(2)
We are interested in those functions / which satisfy not only (1) but also that for every small e > 0 there exists a positive constant 5 = 5(e) < 1 such that /(*) < 1 - 5(e),
s > e,
(3)
and that for some fixed constant K with 1 < K < 2, [ / ] . : = sup
I / ( S )
-1
+ S
' < oo.
(4)
SK
s>0
A function f(s) satisfying (1) has property (3), if it is non-increasing. Of course, the functions in (2) have properties (3) and (4). Condition (3) is necessary. For this account and some further remarks on conditions (3) and (4) we refer to [1] and [2]. The following theorem is proved by Ichinose-Tamura-Tamura-Zagrebnov in [2]. Theorem 1.1. Let f and g be functions having properties (3) and (4) with K = 2 as well as (1). If A and B are non-negative self-adjoint operators in a Hilbert space H with domains dom(A) and dom(B) such that the operator sum C := A + B is self-adjoint on domain dom(C) = dom(A) fldom(B), then the Trotter-Kato product formulae converge in operator norm with the error bound Ofa-1): \\[g(tB/2n) f(tA/n)g(tB/2n)]n n
tC
1
\\[f(tA/n)g(tB/n)\ -e- \\=0(n- ),
-e~tc\\
= 0 ^ ) , n -> oo.
(5)
The convergence is uniform on each compact t-interval in the half-line [0, oo). If the operator C is strictly positive, i.e., C >rjl for some constant n > 0, the convergence is uniform on the whole half-line [0, oo).
634
Trotter-Kato product formula: some recent results
635
Taking in (2) the exponential functions f(s) — g(s) = e s, one gets the following operatornorm converging symmetric and nonsymmetric Trotter formulae: Corollary 1.1. For non-negative self-adjoint operators A and B whose operator sum C := A + B is self-adjoint on dom(C) = dom(A) n dom(JB), we have || (e -tB/2„ e -M/n e -tB/2„ ) n _ e~tC|| = ||(e-M/ne-tB/n)n _ g-tCy
= 0(n-l)>
0(„-l)) n
^ QO,
(6)
uniformly on each compact t-interval in [0, oo), and uniformly on [0, oo) , if C is strictly positive. In [2] it is also shown that Theorem 1.1 allows a generalization to the case, when C :— Ai -\ h Am is self-adjoint on dom(C) = dom(yl1) n • • • l~l dom(Am). 1.2. Optimality of the error bound Notice that the results of the paper [2] are wonderfully optimal in many respects. Remark 1.1. One can not relax the self-adjointness of the sum C := A + B to essential self-adjointness, since in this case there is an example of a couple of non-negative self-adjoint operators A and B such that l i m i n f | | ( e - M / " e - t s / " ) " - e-tC\\
> D(t),
(7)
where D(t) is positive and continuous function of t > 0. This example is due to [3]. Notice that in this case we still have the strong convergence of the Trotter product formula (see, e.g. [4]): lim (e-tA'ne-tB(n)nu = e~t6u, (8) for any u € 7i. Here C is the closure of operator C := A + B with domain dom(C) = dom(A) ndom(B). Remark 1.2. Let operators A and B verify the conditions of Theorem 1.1. If they do not commute, then it is easy to check that in the nonsymmetric case (see (6)) the error bound can not be improved. Hence, in this case the estimate Ofa^1) is ultimate optimal. It is much less evident for case of symmetric Trotter product formula (see (6)). One of the important results of the paper [2] is the construction of a couple of unbounded operators A and B verifying conditions of Corollary 1.1 such that ^e-tB/2ne-tA/ne-tB/2^n
_ e _tC|| >
L{t)n~\
(9)
where L(t) is positive and continuous function of t > 0. This proves optimality of the error bound Ofo-1) in Theorem 1.1 (and Corollary 1.1) in the symmetric case. Remark 1.3. We would like to warn the reader against the conclusion that for any two noncommutative operators A and B verifying conditions of Theorem 1.1 the error bound estimate Oin"1) is the best possible for the symmetric Trotter-Kato product formula. Whereas for nonsymmetric case it is true.
636
V. A. ZAGREBNOV
Remark 1.4. The paper [2] covers the most of the known results concerning the operatornorm convergence of the Trotter-Kato product formula for self-adjoint semigroups. Moreover, this paper proves that Ofo*1) is optimal error bound estimate in the above sense. See discussion in [2].
2. O p e r a t o r - n o r m convergence and fractional power of self-adjoint generators 2.1. Fractional power and convergence rate The results of [2] show how decisive is the relation between operators A and B for the convergence of the product formula. A recent Ichinose-Neidhardt-Zagrebnov theorem [5] extends [6], where the problem of relation between domains of self-adjoint non-negative operators A and B and the bound on the convergence rate in the Trotter-Kato product formula was considered for the first time. Let A and B be two non-negative self-adjoint operators in a Hilbert space H such that dom(A1^2) n dom(5 1 / 2 ) is dense in Ti. By H we denote the form-sum of A and B, i.e., H = A + B, which is in turn a non-negative self-adjoint operator in H. dom(^4 1/2 ) n d o m ( 5 1 / 2 ) . In [5] we assume that
By definition dom(iJ 1//2 ) =
dom^E") C (dom(A a ) n d o m ( £ a ) ) ,
(10)
for some a € (1/2,1). This assumption is stronger than dom(fl' 1 / 2 ) C Aom.(A1/2) n Aova.{B1/2), but weaker than the assumption dom(ff) C dom(A) D dom(B) used in [1] and [2]. Notice that comparing to our paper [6], in [5] we do not demand smallness of the operator Ba with respect to Aa. We assume that the Kato functions further fulfill the conditions similar to those of [2]: [f}2a := sup s>0
\f( ) u ws
- l + s|[ <+oo,
[g]2a := sup | g ( s ) ~ I s>0
a 6(0,1],
(11)
a 6(0,1].
(12)
S + S|
< +oo,
S
It is easily seen that (11) and (12) are in fact conditions at the neighborhood of zero. We set mf(x) := sup f(y), x > 0. y€[x,oo)
Notice that the examples (2) satisfy condition (11) and mj{x) main statement of the [5] is the following
< 1 for x > 0. Then the
Theorem 2.1. Let A and B be non-negative self-adjoint operators and let H := A + B be their form sum. Assume that for some a 6 (1/2,1) the condition (10) is satisfied. Further, let f and g be Kato functions which obey conditions (11) and (12). If in addition one has
Trotter-Kato product formula: some recent results
637
dom(A1^2) C dom(5 1 / 2 ) andm,f(x) < 1 for x > 0, then for any finite interval [0,T] there is a constant Cr,2a-i > 0 such that \\(f(tA/2n)g(tB/n)f(tA/2n))n forte
[0,T] and n= 1,2,...
- e-tH\\ < CT,2a-i-^
(13)
.
The estimate (13) gives the ultimate optimal error bound for the convergence rate, which can be seen from the example given in [3]. Theorem 2.1 improves our previous result [6], where the same optimal convergence rate 0(n~^2a~1^) was obtained but under stronger conditions on the operators A and B and on the Kato functions / and g. Notice also that Theorem 2.1 treats a case which is not covered by [1], [2]. Nevertheless, setting formally a = 1 in Theorem 2.1 the assumptions and results of that theorem turn into those ones of [1], [2] except that assumption dom(A 1 / 2 ) C dom(B 1 / 2 ) becomes superfluous. In [7] it is conjectured that this condition is superfluous not only for a = 1 but also for a e (1/2,1). However, the proof of this conjecture remains still an open problem. Notice that condition dom(A 1 / 2 ) C dom(B 1 / 2 ) implies dom(if 1 / 2 ) = dom(A1^2) which guarantees the density of domiH1/2) in H. The proof of Theorem 2.1 relies on some ideas of Chernoff [8], [9] and of the Neidhardt-Zagrebnov paper [10]. In many aspects it follows the line of reasoning of [1] and [2]. Remark 2.1. (i) By example given in [3] the error bound estimate 0 ( n ~ ( 2 a _ 1 ) ) in Theorem 2.1 cannot be improved, i.e., it is the ultimate optimal one. (ii) This optimal error bound was already found in [6]. In [6] it was assumed that Ba is small with respect to Aa, i.e., dom(^4a) C dom(Ba) and \\Bau\\ < a | | ^ Q w | | + 6 | | u | | ,
(iii) (iv) (v)
(vi)
uedoxn(Aa),
(14)
for some a £ (1/2,1) and a G (0,1), b > 0. This condition is relaxed in [5] to the mild subordination condition dom( J 4 1 / 2 ) C dom(B1^2), which is obviously implied by (14). The yet open problem is to eliminate completely this subordination condition. The conditions on the Kato functions / and g in [5] are essentially relaxed compared to [6]. Theorem 2.1 shows that only relations between certain domains related to A, B and H are decisive for the convergence rate of the Trotter-Kato product formula, Theorem 2.1 holds for a G (1/2,1). The method of the proof does not allow one to include the case a = 1. We considered it separately in [2]. It is remarkable that one does not need any subordination condition in this case. For a = 1/2 we cannot expect operator-norm convergence in general, see [3]. However, if there is a subordination such as, for example, operator B is relatively compact with respect to A, then operator-norm convergence holds, see [10].
2.2. Example Let H := L2(ty where Q, is a bounded domain in R 1 , / = 1,2,... , with boundary d£l of C°°-class. By A we denote the negative half Laplacian with Dirichlet boundary conditions
638
V. A. ZAGREBNOV
in L 2 (fi), i.e A = -\AD. The domain is given by dom(A) := iff (O) n i ^ O ) , where H$(Q.) is the closure of CQ°(Q) in the Lebesgue space H^(ft). Using the space H%Bn(Q), HIBD{V)
•= & G H$(Sl) : BDu\dn 2
= 0},
we
where BD is given by BDU := u\aa for u G # ( 0 ) , obtain dom(A) := H%BD(Q). Further, let B be the negative half Laplacian with Neumann boundary conditions, i.e., B := — 5Aw- One has dom(-B) := H\ B (CI), where HIBA9)
:
= iu
e H
:B
^)
Nu\aa = 0}.
The boundary operator BN is given by (B^u)(x) = g^,,u(x), x G dCl, for w G ZZf(fi), where v(x) is the outer unit normal to the boundary 9 0 at the point x G 9 0 . Since d o m ^ 1 / 2 ) = H\(Cl) and dom(B 1 /2) = ffi(O), one gets dom(All2) g d o m ( B i / 2 ) H e n c e dom(H^2)
= dom(A1/2)
n dom^1/2) = a
ff|(n), a
and if := A + B = 2A = - A ^ , . Therefore dom(if ) = d o m ( ^ ) for arbitrary a G [0,1]. Now we calculate the domains dom(A a ) and dom(5 Q ) for a G (1/2,1). By [11] and [12] one finds: d o m ( ^ a ) = H$°BD(CI), a G (1/2,1), and
ii/2«Bw(0),
a G (3/4,1).
Since H$%D(Cl) C # f a ( 0 ) , one gets dom(ii" a ) = dom(A a ) C dom(B a ) for a G (1/2,3/4). Applying now Theorem 2.1 we find that fe-t(-AD)/2ne-t(-AN)/2n\n
_ e~t(-AD)
=
Qfn~K\
for any « := 2a — 1 < «o : = \ (a = \) uniformly in t G [0, T] a s n - > oo. If a = 3/4, then it does not hold that dom(A 3 / 4 ) C dom(B 3 / 4 ). Hence dom(,4 3 / 4 ) n dom(B 3//4 ) is a proper subset of dom(# 3 / 4 ) which does not allow one to apply Theorem 2.1. If a G (3/4,1), then dom(A a ) n d o m ( £ a ) = Hla{BDtBN)(Cl)
:= {u G H2a(Cl) : BDu\dn
= 0,BNu\an
= 0} C H%%D(Cl).
This yields that dom(A a ) n dom(Ba) is a proper subset of dom(iJ a ) which does not allow one to apply Theorem 2.1, either. If a = 1, then one gets that dom(A) n dom(JB) is a proper subset of dom(H) too which yields H ^ A + B. Therefore the results of [2] are not applicable. Notice that in contrast to [6] and to examples given there, here we have an example, when the operator Ba is not small with respect to Aa for a G (1/2,3/4).
3. T r o t t e r p r o d u c t formula for Gibbs semigroups 3.1. Trace-norm convergence rate Let H be a separable Hilbert space with B(7i) the algebra of all bounded operators on H. Let C00(7i) denote the ideal of compact operators in B(H). Then the eigenvalues {^k(\C\)}'^L1 of
Trotter-Kato product formula: some recent results
639
the compact self-adjoint operator \C\ = yC*C are known as the singular values {sk(C)}^=1 of the operator C. We denote by CP(H) (1 < p < oo) the ideals in B(H) consisting of all compact operators C such that oo
5>fc(C)p
Then each ideal Cp(H) is a Banach space with the norm: 1/p
\\c\\p = [Y.sk(CY V.k=l
In particular the ideal C\(TC) is called the trace-class, with the trace-norm HCHj = Tr \C\. Definition 3.1. A strongly continuous one-parameter semigroup {Ut}t>o is called a Gibbs semigroup if Ut £ C\(TL) for any t > 0 (see e.g. [13]). There are several results on the trace-norm convergence of the Trotter product formula for self-adjoint Gibbs semigroups, see e.g. [14]—[17]. A recent result by Zagrebnov [18] is related to the above optimal operator-norm convergence of the Trotter-Kato product formula. Theorem 3.1. Let A and B be non-negative self-adjoint operators such that: (i) the operator sum C := A + B is self-adjoint on dom(C) = dom(A) fl dom(B); (ii) either e~rA £ Ci{H), or e~TB € C^H), for r > 0. Then (a) the Trotter product formula converges in the trace-norm with error bound 0(n~l): -tc 00, (15) = 0{n n ) " uniformly on each compact t-interval in [0,oo). Further, if C is strictly positive, then (15) holds uniformly on [0,oo); (b) the error bound Ofo^1) is ultimate optimal. r
e-tA/ne-tB/n
- ! • *
The proof of the theorem is based on Lemma 3 . 1 . Let A be a self-adjoint operator such that e~tA £ Ci(W) for t > 0. Let B be generator of a contraction semigroup and let {e~tH}t>o be a Gibbs semigroup with -tH\ < 1. Let e(s,t):-for s £ N, s > 1 and t>0.
sup
-TA/.e-TB/.
TH
')'-
(16)
Then for each to > 0 there exist l(t0) and /'(to) such that:
ytA/ne-tB/ny _( for alln>
r
-tH
< l(to) e([n/2], t/2) +1'{to) e([(n + l)/2], t/2)
(17)
1 and t > to,
see [17] and [19]. Now conditions of the Theorem 3.1 and the mini-max principle imply that the operator C is generator of the contracting Gibbs semigroup with ||e~* c || < 1 for t > 0. On the other hand, by Theorem 1.1 one gets that e(s,t) = 0(1/s) in (16). Then the conditions of Lemma 3.1 are verified for C = H and e(kn,t/2) = 0(l/n), e(m„,t/2) = 0(l/n), where kn = [n/2] and mn = \(n + l)/2]. This proves the assertion (a) of the Theorem 3.1.
640
V. A. ZAGREBNOV
3.2. Optimality of the error bound Similar to section 1.2 it is enough to produce two operators A and B such that one can find a positive L\{t) ensuring in (15) for t > 0 the lower estimate: | | ( c - M / n e - t B /njn _ e~tC ^ >
Ll
(t)„-1.
(1 8 )
To this end let W be finite-dimensional. Then by the Baker-Campbell-Hausdorff formula we have for small \t\ e-tAe-tB
= exp[-t(A
+ B) + £ [A, B] - £[A - B, ±[A, B)] + Op(\t\4)],
with [A, B] = AB — BA, where and below Op(\t\k), for fc > 0, means some bounded operator with norm of order 0(\t\k). Then N(t) := ( e - ^ e - * 3 ) 1 / ' = exp[-(A + B) + |[A,B] + O p (|i| 2 )]. We understand N(Q) = e~( A+B ) and have A
i
°°
/•
E1(A; B) := — JV(t)| t=0 = - £
^
fc=i
i \fc—l
k
~
"[[(A + By~l[A,
'
j=i
B](A + B)k~\
(19)
of which the right-hand side is norm-convergent and can be a non-zero operator with bound 2 _ 1 1| [A, B] || e|lj4+BH, if A and B do not commute with each other. It follows that N(t) = e-( A+B > + tE^A; B) + Op(t2), so that with t = 1/n {e-A/ne-B'n)n
= N(l/n)n
= e~{A+B^ + rClEx{A; B) + Op(n-2).
Since for finite-dimensional Hilbert space H one has H-^ = ||-||, the error bound 0(n~l) optimal for nonsymmetric Trotter product formula for Gibbs semigroups.
(20) is
Remark 3.1. The proof that the symmetric Trotter product formula converges in the trace-norm with error bound 0(n~l): ^e-tB/2ne-tA/ne-tB/2n)n
_ e~tC^
= 0
(„-l))
„ _ ^
( 2 1)
follows along the same line of reasoning as in the proof of assertion (a) of Theorem 3.1. The proof of existence of the couple of unbounded operators A and B such that one can find a positive L2(t) ensuring in (21) for t > 0 the lower estimate: \\(e-tB/2ne-tA/ne-tB/2n)n
_ &-tC^
> ^ ( ^ " l ,
„ _>
00|
(22)
needs a more subtle reasoning based on the example [3]. Notice that Remark 1.3 about optimality is also valid in the case of symmetric Trotter product formula for Gibbs semigroups.
Trotter-Kato product formula: some recent results
641
Acknowledgement s I would like to t h a n k Christian Gerard and Ricardo Weder for inviting me t o give a talk in the Session on Q u a n t u m Mechanics and Spectral Theory at I C M P 2003. T h e present report is an extended version of this talk.
References 1. T. Ichinose, Hideo Tamura, Comm. Math. Phys. 217, 489 (2001). 2. T. Ichinose, Hideo Tamura, Hiroshi Tamura, V. A. Zagrebnov, Comm. Math. Phys. 221, 499 (2001). 3. Hiroshi Tamura, Integr. Equ. Oper. Theory 37, 350 (2000). 4. M. Reed, B. Simon, Methods of Modern Mathematical Physics I. Functional Analysis, Academic Press, New York, 1972. 5. V. T. Ichinose, H. Neidhardt, V. A. Zagrebnov, J. Fund. Anal. 204, 475 (2003). 6. H. Neidhardt, V. A. Zagrebnov, Intger. Equ. Oper. Theory. 35, 209 (1999). 7. T. Ichinose, H. Neidhardt, V. A. Zagrebnov, "Operator norm convergence of the Trotter-Kato product formula", in Proceedings of the International Conference on Functional Analysis (Kiev, August 22-26, 2001), Ukr. Acad. Press, Kiev 2003, pp. 100-106. 8. P. R. Chernoff, J. Fund. Anal. 2, 238 (1968). 9. P. R. Chernoff, Bull. Amer. Math. Soc. 76, 395 (1970). 10. H. Neidhardt and V. A. Zagrebnov, Comm. Math. Phys. 205, 129 (1999). 11. D. Fujiwara, Proc. Japan Acad. 43, 82 (1967). 12. P. Grisvard, Arch. Rational Mech. Anal. 25, 40 (1967). 13. V. A. Zagrebnov, Gibbs Semigroups, (Vol.10, Series A: Mathematical Physics), Leuven University Press, Leuven, 2003. 14. V. A. Zagrebnov, J. Math. Phys. 29, 888 (1988). 15. H. Neidhardt, V. A. Zagrebnov, Comm. Math. Phys. 131, 333 (1990). 16. T. Ichinose, Hideo Tamura, Asymptotic Analysis 17, 239 (1998). 17. H. Neidhardt, V. A. Zagrebnov, J. Fund. Anal. 167, 113 (1999). 18. V. A. Zagrebnov, "Ultimate optimal error bound for the Trotter product formula: Gibbs semigroups", in Proceedings of the International Conference on Functional Analysis (Kiev, August 22-26, 2001), Ukr. Acad. Press, Kiev 2003, pp. 94-99. 19. V. Cachia, V. A. Zagrebnov, J. London Math. Soc. 64, 436 (2001).
This page is intentionally left blank
String and M theory Session organized by D. MORRISON (Durham) and H. OoGURl (Berkeley & Caltech)
This page is intentionally left blank
Localizing gauge theories NlKlTA A.
NEKRASOV*
(Inst, des Hautes Etudes Scientifiques, Bures-sur- Yvette)
We study N = 2 supersymmetric gauge theories on toric four dimensional manifolds. We recall the notion of fi-background and calculate the partition function of gauge theory in this background. As a result, we get generalizations of the formulae of the author for the theories on R 4 and Nakajima-Yoshioka for the theories on R 4 blown up at one point. In the case of compact toric space our results give an alternative derivation of Donaldson invariants, and generalize the results of Gottche-Zagier.
1. Introduction In the past years some progress has been achieved in the quantum field theory calculations of the low-energy effective actions including non-perturbative effects. N = 2 supersymmetric Yang-Mills theory has simple perturbation theory and non-trivial instanton dynamics. By utilizing supersymmetry one is able to calculate the instanton corrections to the low-energy effective action [23]. This is done by placing M — 2 gauge theory in a non-trivial geometric background, involving i?-symmetry currents. In this paper we shall generalize the results of [23]. Namely, we study J\f = 2 gauge theory on a toric four-manifold X. The motivations for this work are manifold. First, we hope to learn about the gauge theory applications of the holomorphic anomaly, which is well-known in the context of topological strings [1,3]. Secondly, the similar formulae are expected to hold in the higher dimensional generalizations of the supersymmetric gauge theories. There, the applications to the black hole counting [30] are expected. Also, the topological vertex and Donaldson-Thomas theory also behave in the similar way. The crucial ingredient missing in all these problems is the detailed understanding of the stability conditions. Since in the case of Donaldson theory lots of information is already available, e.g. in the work [8], we may hope to advance along this route.
2. Af = 2 gauge theory We study pure M = 2 gauge theory. Generalizations involving matter are straightforward. It is well-known that the theory admits the so-called twisting [29], which permits supersymmetric compactifications on arbitrary four-manifold X. The field content of the twisted theory (which turns out to be the integral representation of Donaldson theory) is the following: the gauge field A, the complex adjoint scalar
645
646
NIKITA A. NEKRASOV
X admits isometries, then there are extra conserved supercharges G of opposite chirality. Mathematically, the theory on X studies equivariant cohomology of the space of gauge fields. In addition to the gauge group, the equivariance with respect to isometries of X is quite useful in analyzing the theory. The bosonic part of the N = 2 theory action is given by: 5 b o s = | T r (||F|| 2 + \\DA
•
(1)
3. O-background, local theory: R 4 We start with the theory on R 4 . Suppose Q, Q are two commuting infinitesimal rotations of Euclidean space R 4 , [Q,£l] = 0, fi = ||^2^t/||. These are generated by the vector fields V,V, V^ = fi^", and the indices are lifted by the Euclidean metric 5^. We skip the details and the motivation of the construction [23]. The bosonic part of the action of the theory in the $7-background is given by: Shos = JTr(\\F\\2
+ \\DA> - ivFf
+ \\[4>,4>] - t9DAcf> + oyD^f).
(2)
4. I n s t a n t o n p a r t i t i o n function o n R 4 In [23] we calculated the partition function of the gauge theory in the fi-background. It depends on the skew eigenvalues e\, t2 of the matrix fi and the vev a of the Higgsfield
(3)
^H^y (only N of chp are independent). We expand: 1 = y ^ TpCrip + y ^ Tpip2ctip1ctip2 p
+ ••• .
P1P2
The parameters f = (r p |r P l p 2 |...) are called times. All but T2 should be viewed as formal parameters. 4.2. Partition function The calculation in [10,22,23] gives: Z(a,f,ei,e 2 ) = ^ / x ? ( a , e i , e 2 ) expf f (m p (a,«,ei,e 2 )), t
(4)
Localizing gauge theories
647
m p (a,6,ei,e 2 ) = £ a f + £ ( a j + £i(i - l ) ) p - (a; + t1i)p l
l,i
- (ai + ei(i - 1) + €2ku)p + (a; + exi + e2ku)p , where t = ( 6 1 , . . . , 6jv) is the JV-tuple of partitions, li - {kn >ki2>ki3>--->
kik'n > 0).
Finally /x^(o, £1,62) is the measure on partitions, derived in [23]: fit(a, ei, e2) = exp / dx dy fi'(x) f~(y) -y£u£x (x - y)
(5)
and the function / j (the profile of the colored partition I), and the kernel 7 are defined in [22]. 4.3. Expansion of the partition function on M4 In order to proceed with more general four-manifolds we need the partition function on R 4 to be evaluated for the most general parameters, both toric £1,62 and the couplings r^. Expansion in €i,£2 has the following form: Z(a,e1,e2;7Vi)=exp^ + ^ ± ^ i + ^ ClE2
6162
2
1
+
( e i + £ 2 )
g1 + 0 ( e i , £ 2 )
(6)
6162
where ^ 0 , Fi, Til, Q\ are functions of a, r^.
5. ^ - b a c k g r o u n d , global theory The standard Donaldson theory in four dimensions can be viewed as the partially twisted six dimensional super-Yang-Mills theory, compactified on T 2 x X, where X is our four-fold, and the torus T 2 has vanishing size. For X with isometries we modify this construction. Consider the flat X-bundle over T 2 , which can be viewed as an orbifold of E 2 x X by the action of Z 2 , which acts by shifts in E 2 direction, and by two commuting isometries gi,g2 in the X direction: (z, z, x)
H->
(z + n + mr, z + n + mf, g " ^ ' x) •
(7)
By compactifying the gauge theory on this background, with the partial twist along X, in the limit of vanishing volume of T 2 we end up with the theory in the O-background. The bosonic part of its action coincides with equation (2). 5.1. Toric preliminaries In this section we collect all the necessary information about toric varieties. We use slightly unconventional language, which is more familiar to physicists. Let M,d be non-negative integers, and TM,Td,TM+d the standard tori jj{\)MAM+d of the corresponding dimensions. In what follows a runs from a to M + d, a runs from 1 to M,
648
NlKITA A . NEKRASOV
/i runs from 1 to d, and I runs from 1 to N. The torus TM+d acts in the standard fashion on the space CM+d: Z={Za)^(Zaexpi6a). Any integral matrix Q : Z M -> ZM+d defines a homomorphism of T M into TM+d, and the corresponding action of T M on CM+d:
Z = (Za)^ ( V e x p z ^ Q > « Y This action preserves the norms: pa = | | Z a | 2 and the symplectic form u = Y^ dpa A dtia,
Za = v ^ P a e w " .
i
Now, fix a collection r of M real numbers r\,..., TM — Fayet-Illiopoulos terms in the physical language — and consider the Hamiltonian reduction of C M + d with respect to T M at the level of the moment map m = (mi,...,mM),
ma = Y^QlPa,
given by f. Explicitly, we consider the space X =
m7l(r)/TM.
This is our toric variety. By construction, it comes endowed with the map p:X-*R™+d which factors through the projection X ->X/Td
= Ax,
where Td = T M + d / T M . We neglect possible torsion part (it is absent for generic Q). The image of A ^ under p i s the convex d-dimensional polyhedron (perhaps non-compact), which fits some d-dimensional affine subspace bLd in R++d. Another canonical structure on X is a set of complex line bundles La and Ca. More precisely, any character \ °f the torus T M defines a line bundle, which is associated with the help of \ to the principal T M bundle: &-\T)-*X.
(8)
All of the above is said under the assumption of genericity of r, so that the equation (8) indeed defines a principal bundle. The line bundles £a are associated to the characters x]f = expiQa<pa, while La are associated to \% = expiy>Q. Hence, topologically, CaK®aL®Ql.
(9)
The torus TM+d acts on X. Of course, only Td acts faithfully. The full torus TM+d becomes visible when we start looking at the line bundles over X. Then Ca become distinct
Localizing gauge theories
649
line bundles as TM+d equivariant bundles. They correspond to the characters x¥+d — elS° • In what follows we shall denote XM+d simply by \The fixed points v of T d action on X are the vertices of Ax- Generically, these vertices have d edges emanating from them. These edges correspond to fixed complex projective lines in X (if X is non-compact the edges may go all the way to infinity, in this case they correspond to the fixed complex lines in X). The tangent space TVX to X at any fixed point is a representation of Td, and, consequently, a representation of TM+d. The representation splits as a sum of d one-dimensional irreducibles: d lvsi
= ^JJ XV,/J. >
where we took the liberty of identifying the characters with the one dimensional representations. The characters \v,n define the weights of the TM+d action on TVX: Xv,n = exp a ^ wv^-a0a
(10)
which clearly obey Y\wVtliiaQ%
= 0,
VQ.
Each fixed point v defines a subset Iv of the set of indices { 1 , . . . , M + d} of cardinality d, such that for any Va G A,, Za(v) = 0. It is natural to label the elements of Iv by the index fi = 1,... ,d. In other words, Iv coincides with the set of weight subspaces of TVX. The remaining M coordinates Z% are completely fixed by the moment map equations, up to the action of TM. Let Zf C (ZM+d)t denote the sub-lattice { ( m , . . . ,nM+d) | na = 0,a e Iv}, and R ^ = Z™ ® R. Then d(v) G R™. Let Qv denote the restriction of Q* onto lJ^. It must be invertible over Z in order for v to be isolated fixed point. Then: Wv,»;a - ( < W - Q^Qv'Ta)
•
(11)
5.1.1. Second cohomology The line bundles Ca generate the if-group of X. Their first Chern classes generate H2(X, Z). In the T M+d -equivariant cohomology they are represented by the equivariantly closed forms: <Pa + ci(La). In the localized cohomology, which is spanned by the fixed points v G XT, each point contributing a copy of C ( ( £ i , . . . ,SM+d)), the Chern classes localized to
the 2-observables, say for single trace operators: Oj . We evaluate integrated 2-observables using localization formula: \ Of =Y/H^)f(chp(8v))
(12)
650
NlKITA A . NEKRASOV
where: £ is the 2-homology class, and HY^ + P.D. [£] is the equivariant version of its Poincare dual, iJs is a function on X, linear in pi, and H% its value at the fixed point v. Finally, £v is the localized universal sheaf, i.e., t*£, where C : SDT x {v} °-> DJl x X is the inclusion, where UJl is the moduli space of sheaves (compactified instanton moduli space). 5.2. Equivariant bundles In gauge theory with gauge group G we usually fix the principal G-bundle P, integrate over the space of all connections on P and then sum over topological classes of P. When we formulate the theory on some curved manifold -X" we have to fix the metric on X. If the manifold X is a toric variety the metric is uniquely specified by the matrix of charges Q and the set of Fayet-Iliopoulos terms f. If we want to turn on the fi-background an extra data is needed, namely the lift of the action of the torus TM+d on P. For the gauge group G = SU(N) this is the same as the choice of iV-dimensional representation E of TM+d. The latter is specified by the matrix ki,a,
l = l,...,N,
a = l,...,M
+ d,
(13)
of weights. The rank N vector bundle splits as a sum of N line bundles £j, of Chern classes:
a
5.3. Examples (1) Let us consider X = CP d , which corresponds to M = 1. In this case A x is just a simplex, a
(if r > 0, of course). The fixed points v are in one-to-one correspondence with the vertices of the simplex, i.e., we can set v = 1 , . . . , M + 1, pv = (0,.. V, r,..., 0). (2) The next example is non-compact. X is the total space of the vector bundle 0(qi) © O(qd-i) over P 1 . It corresponds to M = 1, the vector of charges: Q = (9l.92,---,9d-i,l,l)tThere are two fixed points v — they belong to the zero section of the vector bundle, and are the North and South poles of the sphere. 5.4. Stability and holomorphic anomaly The partition function of the gauge theory in fl-background is analytic in the vev of the Higgs field. However the expansion in the parameters ei,62 in the limit e —> 0 leads to holomorphic functions in a, FQ(a; A),T\(a; A), etc., which are actually not single-valued. They can be made single valued at the expense of adding some non-holomorphic terms (the example of the second Eisenstein series E
Localizing gauge theories
651
Another, more profound, reason to expect them is to recall the Gromov-Witten interpretation of the A/" = 2 partition functions. At least in the case t\ = —ti the expansion coefficients Tg{a\ A) are (the limits of) the generating functions of the genus g GromovWitten invariants of certain (local) Calabi-Yau manifolds, see [28]. These are well-known to be the large a limits of non-holomorphic automorphic forms on the moduli spaces of Kahler structures of these Calabi-Yaus, see [1,3].
6. Master formula We are now almost ready to state our main formula. We first fix the vacuum expectation value of the Higgs field $, ($) = a = diag(ai,...,ajv).
(14)
In the case of compact X we should take the integral over a, to take into account the tunneling between different vacua. The second cohomology group of X is isomorphic to Zd, generically. The U(N) gauge bundle is reduced to the U(1)N bundle, in the presence of the Higgs vev. We should sum over all equivalence classes of the U(1)N bundles. They are classified by the vectors ka = (fcaij), a = 1,... ,d. In the SU(N)/ZN gauge theory one fixes the traces: {ka} = wa = 52i=i ka,iActually, in the absence of the charged matter, everything depends only on wa mod N. Equivariantly, as we discussed above, topologically equivalent line bundles may differ. So in fact we might sum over equivalence classes of equivariant line bundles. These are labelled by vectors ke E ZN, for each edge e of the toric diagram Ax- Equivalent information is contained in the vectors ka = (kaj) from equation (13). However, the choice of equivariant bundle does not correspond to the fluctuating parameter. So the real sum goes over ka only. Moreover, the actual set of vectors ka is even smaller, because of the so-called stability conditions, to be mentioned below. The partition function of the N = 2 theory in ^-background on the toric manifold X, in the presence of 2-observables,
is given by
kaezN,{ka}=wa where e • kv is given by
v
J2Wia£aPaa
6.1. Non-equivariant limit In the case of compact X the partition function (15) has a finite non-equivariant limit, i.e., the limit where e —» 0. Indeed, the singular terms in the logarithm of partition function
652
NIKITA A. NEKRASOV
vanish thanks to the identity:
£ ^ - = / i = o, £^=/c l ( L a ) = o, ^
WviWv2
Jx
„ WviWv2
ywvl+wv2==
Jx
f Ci(x) = Q:
(16)
and the finite part reduces to: Zw(S;rA,tfi)
=
£
exp / Foia + X
fc„ez" ,{ka}=Wa
y^kgCiiLa)) a
+ J c1(X)nJa
+ Y,kaci(La)\+x(X)^1(a)+a(X)g1(a).
(17)
If we now integrate over a then the resulting function of f, t (for f = (logA, 0,0,...)) is, in fact, the generating function of Donaldson invariants of X, for some specific choice of the metric on X. This choice is correlated with the choice of contour for the a integral. P u t another way, the sum over ka's goes over some subset of all possible first Chern classes. This subset is singled out by some stability condition, which involves explicitly the Kahler class of X (i.e., the choice of f). 6.2. Non-holomorphic speculations Looking at the non-equivariant limit of the partition function we recognize the holomorphic approach formulae of [13], see also [8]. There, the low-energy effective theory of Af = 2 super Yang-Mills theory was exploited in order to write down a contribution of the Coulomb branch to the Donaldson invariants of manifolds with b% = 1 (all toric manifolds have this property). More systematic approach, taking into account full multiplets of N = 2 susy leads to the non-holomorphic integrand, see [13,19], whose integral can be converted to the contour integral of a holomorphic form, similar to what we have above. We hope to infer from this the non-holomorphic modular completion of our master formula. 6.3. Relation to the results of Nakajima-Yoshioka This section is devoted to the simplest non-trivial example of the application of the formula (15): X = C 2 , the blowup of a point on C 2 . It is a toric manifold, M = l,d — 2. In this case the formula (15) reduces to Zx,w(a,ei,e2,e3;Tn,tji)
= keZN
£
Z(a + fcei,ei + e 3 ,e 2 - ex;r s + t^(ei + e3)) ,{k}=w
x Z(a + ke2, ei - e2, e2 + e3; rn + tn(e2 + e 3 )), (18) which, for e3 = 0, coincides with the result of [21].
Localizing gauge theories
653
Acknowledgments I would like to t h a n k the organizers of t h e Congress for the beautiful conference, and for inviting me to give a lecture. I also wish to thank A. Losev and A. Okounkov for useful discussions. Research was partially supported by 01-01-00549 grant from R F F I .
References 1. I. Antoniadis, E. Gava, K. Narain, T. Taylor, Nucl. Phys. B 413, 162 (1994). 2. M. Atiyah, V. Drinfeld, N. Hitchin, Yu. Manin, Phys. Lett. A 65, 185 (1978). 3. M. Bershadsky, S. Cecotti, H. Ooguri, C. Vafa, "Kodaira-Sprencer theory of gravity and exact results for quantum string amplitudes", Comm. Math. Phys. 165, 311-428 (1994); arXiv: hep-th/9309140. 4. J. J. Duistermaat, G. J. Heckman, Invent. Math. 69, 259 (1982); M. Atiyah, R. Bott, Topology 23, 1 (1984); M. Atiyah, R. Bott, Phil. Trans. Roy. Soc. London A 308, 524-615 (1982); E. Witten, arXiv:hep-th/9204083 ; S. Cordes, G. Moore, S. Rangoolam, arXiv:hep-th/9411210; R. Bott, J. Diff. Geom. 4, 311 (1967); G. Ellingsrud, S. A. Stromme, Invent. Math. 87, 343-352 (1987); L. Gottche, Math. A. 286, 193-207 (1990). 5. A. Gorsky, A. Marshakov, A. Mironov, A. Morozov, Nucl. Phys. B 527, 690-716 (1998), arXiv:hep-th/9802007. 6. G. Ellingsrud, L. Gottsche, arXiv:alg-geom/9506019 , arXiv:alg-geom/9410005 ; L. Gottsche, arXiv:alg-geom/9506018. 7. A. Gorsky, I. Krichever, A. Marshakov, A. Mironov, A. Morozov, Phys. Lett. B 355, 466 (1995); arXiv:hep-th/9505035. 8. L. Gottsche, D. Zagier, arXiv:alg-geom/9612020. 9. A. Iqbal, N. Nekrasov, A. Okounkov, C. Vafa, "Quantum foam and topological strings", ITEPTH-60/03, IHES/P/03/65, arXiv:hep-th/0312022 . 10. A. Losev, A. Marshakov, N. Nekrasov, "Small Instantons, Little Strings and Free Fermions", ITEP-TH-18/03, MPIM-2003-26, FIAN/TD-05/03, IHES-P/03/09, arXiv:hep-th/0302191. 11. A. Losev, G. Moore, S. Shatashvili, arXiv:hep-th/9707250; N. Seiberg, a r X i v : h e p - t h / 9705221. 12. A. Losev, G. Moore, N. Nekrasov, S. Shatashvili, arXiv:hep-th/9509151. 13. A. Losev, N. Nekrasov, S. Shatashvili, arXiv:hep-th/9711108, arXiv:hep-th/980106i. 14. A. Losev, N. Nekrasov, S. Shatashvili, arXiv:hep-th/9908204, arXiv:hep-th/9911099. 15. I. Macdonald, Symmetric functions and Hall polynomials, Oxford University Press, 1998. 16. D. Maulik, N. Nekrasov, A. Okounkov, R. Pandharipande, "Gromov-Witten theory and Donaldson-Thomas theory", ITEP-TH-61/03 , IHES/M/03/67, arXiv:math.AG/0312059 . 17. G. Moore, N. Nekrasov, S. Shatashvili, "D-particle bound states and generalized instantons", Comm. Math. Phys. 209, 77-95 (2000), arXiv:hep-th/9803265 . 18. G. Moore, N. Nekrasov, S. Shatashvili, arXiv:hep-th/9712241, arXiv:hep-th/9803265. 19. G. Moore, E. Witten, arXiv:hep-th/9709193. 20. H. Nakajima, Lectures on Hilbert Schemes of Points on Surfaces, AMS University Lecture Series, 1999. 21. H. Nakajima, K. Yoshioka, arXiv:math. AG/0306198 , arXiv:math. AG/0311058 . 22. N. Nekrasov, A. Okounkov, "Seiberg-Witten theory and random partitions", ITEP-TH-36/03, IHES-P/03/43, arXiv:hep-th/0306238. 23. N. Nekrasov, "Seiberg-Witten prepotential from instanton counting", arXiv:hep-th/0206161. 24. N. Seiberg, E. Witten, arXiv: hep-th/9407087 , arXiv:hep-th/9408099 .
654
NIKITA A. NEKRASOV
25. N. Seiberg, E. Witten, arXiv:hep-th/9908142, JHEP 9909, 032 (1999). 26. K. Ueno, K. Takasaki, Adv. Studies in Pure Math. 4, 1 (1984). 27. A. M. Vershik, "Hook formulae and related identities", Zapiski sem. LOMI 172, 3-20 (1989) (in Russian); S. V. Kerov, A. M. Vershik, "Asymptotics of the Plancherel measure of the symmetric group and the limiting shape of the Young diagrams", DAN SSSR 233, 1024-1027 (1977) (in Russian); S. V. Kerov, "Random Young tableaux", Teor. veroyat. i ee primeneniya 3v, 627-628 (1986) (in Russian). 28. S. Katz, A. Klemm, C. Vafa, arXiv:hep-th/9609239. 29. E. Witten, Comm. Math. Phys. 117, 353 (1988). 30. H. Ooguri, A. Strominger, C. Vafa, "Black hole attractors and the topological string", arXiv: hep-th/0405146.
Cosmic acceleration and M-theory PAUL K. TOWNSEND (Cambridge)
The status of accelerating four-dimensional universes obtained by time-dependent compactifications of 10 or 11 dimensional supergravity is reviewed, as is the 'no-go' theorem that they evade. All flat cosmologies for a simple exponential potential are found explicitly. It is noted that transient acceleration is generic, and unavoidable for 'flux' compactifications. Included is an eternally accelerating flat cosmology without a future event horizon.
The current consensus that the Universe is undergoing accelerated expansion presents a challenge to the other current consensus that cosmology should be derivable from String/Mtheory, which has 10 or 11-dimensional supergravity as its low-energy effective field theory. There are two aspects to this challenge. One arises from the fact that there is no known formulation of String/M-theory in a spacetime with a future cosmological event horizon [1,2], whereas this is a typical feature of universes undergoing late time eternal acceleration. Of course, we don't know whether the accelerated expansion of the Universe will continue forever, so there is no real conflict with observations here. Still, it remains to find compactifications of String/M-theory for which the effective 4-dimensional theory admits a homogeneous and isotropic (FLRW) spacetime undergoing accelerated expansion, and the difficulty of finding such compactifications is the other aspect of the challenge posed by cosmic acceleration. Note that two periods of acceleration are required to explain both inflation in the early universe and acceleration in the current cosmological epoch. To see what cosmic acceleration entails, consider, a 4-dimensional FLRW spacetime in standard coordinates. The metric is ds2 = -dt2 + S2(t) [(1 - kr2yl
dr2 + r^n2] ,
(1)
where S(t) is the scale factor and k = —1,0,1 depending on whether the universe is open, flat or closed, and d&2 ls * n e -50(3) invariant metric on the unit 2-sphere. A computation of the Ricci tensor shows that R00 = -d2S. (2) It follows that an accelerating universe requires Roo < 0. However, the Einstein field equations imply that Roo ^ (Too+ 9^^) (3) and the Strong Energy Condition (SEC) on the matter stress tensor requires the right hand side to be non-negative. Thus, accelerated expansion is possible in a universe governed by Einstein's equations only if the matter in it violates the SEC. From a purely 4-dimensional perspective the fundamental condition on the matter stress tensor is the Dominant Energy Condition (DEC), which requires p > —p for a perfect fluid of pressure p and energy density p. In contrast, the SEC requires only that p > — | p , and is
655
656
PAUL K. TOWNSEND
typically violated in theories with a positive scalar potential V. For example, given a single scalar field <j> and the Lagrangian density
L=^feU)^R-±(d
(4)
the Einstein equations imply that R00 = 2[{dt4>f - V}.
(5)
If V > 0 then i?oo wiU be negative whenever dt(j) = 0, implying an accelerating universe. There will always be some cosmological solutions for which dt<$> passes through zero. In fact, as will become clear in due course, such solutions are the rule rather than the exception! Thus, all one needs to get an accelerating universe from String/M-theory is a compactification for which the effective 4-dimensional theory has a positive scalar potential V, or at least one that is positive in some region of the space of scalar fields. Although the SEC is in no way fundamental, in the sense that its violation would not imply a violation of fundamental physical principles, it is satisfied by the stress tensor of the D = 10 and D = 11 supergravities that serve as the low energy effective theories of String/M-theory a . This fact has consequences for the potential V in the effective 4dimensional theories that result from String/M-theory compactifications, as first pointed out in a 1985 article of Gibbons [3], and more recently by Maldacena and Nunez [4]. I will now summarize this 'no-go theorem'. Consider a /^-dimensional spacetime with metric ds% = Q2 (y) ds\ (x) + ds2n (y)
(n = D-4),
(6)
where ds\ is the metric of some 4-dimensional spacetime (with coordinates x), and ds\ is the metric of some compact non-singular n-manifold M (with coordinates y). The non-vanishing function O(y) is a 'warp factor'. A calculation shows that
R^\x,y)
= R00(x) - Ul-\y)VlU.\y).
(7)
Multiplying by fl2 and integrating over M we deduce that
f n2] R00 = f n2R^, JM
J
(8)
JM
and hence that i?oo > 0 if RQQ > 0. This result might appear to rule out the possibility of an accelerating universe arising from compactification of 10 or 11-dimensional supergravity. In view of our earlier remark that acceleration is always possible when the 4-dimensional scalar potential V is positive, this would be equivalent to the statement that the potential arising from such compactifications is never positive, a statement that is false, and was known to be false well before the no-go theorem was formulated. In fact b , all that can be inferred about the potential V is that it a
T h e 'massive' IIA supergravity is an exception but, for a reason to be explained later, this exception makes no difference. b This point was made in [5], in a note added to the published version; it has also been made, independently, in [6].
Cosmic acceleration a n d M - t h e o r y
657
has no stationary points with V > 0. To see that a positive potential without a stationary point is not excluded, it suffices to note that the field equations imply, under the given circumstances, that at least one scalar field is time-dependent. This scalar field could be one that arises from the mode expansion on the compact manifold M, in which case the metric on M will be time-dependent. But the theorem assumed time-independent M, and is therefore not applicable. One can see from this that the no-go theorem is actually a very weak constraint on the positive potentials that might, in principle, arise from compactification of String/M-theory! There are many potentials that it would allow but which, nevertheless, do not seem to be obtainable. For example, for many compactifications there is a consistent truncation to a single scalar field
(A > 0)
(9)
for (dilaton coupling) constant a (which we may assume to be positive). Any value of a would be permitted by the no-go theorem but only a > 1 arises in practice. This might be expected on the grounds that a < 1 allows an eternally accelerating cosmology with a future event horizon. This suggests the conjecture that such (Einstein conformal frame) spacetimes cannot arise from classical compactification of String/M-theory; if true (there is no known counterexample) this would impose much stronger constraints on the potential V than the no-go theorem. In particular, it would exclude a < 1 (but not a = 1, as will be shown later). Let us now turn to the question of how one gets positive potentials from (classical) compactification of higher-dimensional theories satisfying the SEC. These arise in one of two ways: — Flux compactifications: in this case a positive potential is generated by non-zero flux of antisymmetric tensor fields. The prototype is the T7 compactification of 11dimensional supergravity with non-vanishing 4-form field strength [7], which yields an exponential potential of the form (9) with c a — \fl, the scalar field arising from the 'breathing mode' of T7. The 4-form is dual to a 7-form proportional to the volume form of T7: more generally, some fc-form field strength will be set equal to a closed but not exact fc-form on the compact space M. Flux compactifications seem only to yield 'steep' exponential potentials with a > y/3. — Hyperbolic compactifications: in this case the compact space is a space of constant negative curvature. The fact that hyperbolic compactifications produce a positive potential was observed by Bremer et al. [9], and they were investigated by Kehagias and Russo [10] in the context of String/M-theory. Several attractive features (for example, the absence of moduli other than the volume) were noted and exploited in a cosmological context by Kaloper et al. [11], and their possible relevance to cosmic acceleration was noted by Wohlfarth and the author [5]. One could consider the prototype to be the compactification of 11-dimensional supergravity on a 7-dimensional compact hyperbolic space. In this case one finds a potential for the breathing-mode scalar <j> of the form (9) with a = 3/V7. In general, hyperbolic compactifications seem always lead to 'gentle' exponential potentials with 1 < a < \/3c
This value, which corresponds to A = 4 in the notation of [8] was given incorrectly in that paper. The correct value is the same as the one found from T6 compactification of the massive IIA supergravity, which is why the existence of this SEC violating theory does not alter our conclusions.
658
PAUL K.
TOWNSEND
In general, a positive multi-scalar potential can be generated by a combination of both mechanisms, in which case it takes the form of a sum of products of exponentials of canonically normalized scalar fields. However, the simple case of a single exponential for a single scalar field is sufficient for an understanding of the physics and here we shall consider only this case. The qualitative features of cosmologies derived from (4) with a potential of the form (9) were analysed in a 1987 paper of Halliwell [12], although the fact that there is typically a period of transient acceleration in the a > 1 cases was not noticed there. In 2002 Cornalba and Costa [13] noted the existence of a period of acceleration in a k = — 1 cosmology arising from a flux compactificationd. More recently, an explicit time-dependent hyperbolic compactification of the vacuum Einstein equations was shown to yield an Einstein-frame k = 0 universe in which a decelerating epoch with S-t1/3,
e*-*-1^5,
(10)
is followed by a period of transient acceleration [5]. This solution was subsequently shown to be the vanishing flux limit of a rather general class of solutions of Einstein's equations known as S-branes, and the phenomenon of transient acceleration was found to be a generic feature of these solutions [14]. As observed by Emparan and Garriga [15], this is an immediate corollary of the positive potentials generated by flux and hyperbolic compactifications. Consider the simple case of an exponential potential of the form (9) with a > 1. The initial conditions implied by (10) are i ^ > 1 with 0 < 0. Any such cosmological solution can be viewed as a ball rolling, with friction, up the potential. Clearly, it must reach a maximum at which (j> = 0 and at this point the expansion of the universe is accelerating, for the reason explained previously. Subsequently, the ball starts to roll back down the hill; the late-time behaviour will depend on the value of a and also on k, but in all cases the universe will be decelerating. For example, for a? < 3 and k = 0 the late-time behaviour will be given by the power-law k=0 attractor solution S~t1/a\
e^^t1^.
(11)
Nearby trajectories with k ^ 0 will eventually approach a Milne universe attractor or collapse to a big crunch singularity. Note that the compact Kaluza-Klein space M starts at infinite volume and ends at infinite volume; the acceleration of the 4-dimensional cosmology is associated to a 'bounce' of the compact space off its minimal volume. The above explanation of the period of transient acceleration relies only on the positivity of the potential V and makes no distinction between flux compactifications and hyperbolic compactifications. However, Halliwell's analysis [12] shows that just as the 'critical' value a = 1 separates qualitatively different behaviours of cosmological trajectories in the class of models under discussion, so does the 'hyper-critical' value a = V3, which also separates hyperbolic from flux compactifications. To see this, we introduce a new time parameter r such that dr = e-^dt, (12) d
T h e acceleration was claimed to occur in the neighbourhood of a resolved cosmological singularity. However, for reasons t h a t will hopefully become clear below, the accelerating epoch is necessarily far from the cosmological singularity.
Cosmic acceleration and M-theory
659
and set S = ea^T\ Letting an overdot indicate differentiation with respect to r, we find that the
(13)
while the Friedmann equation is .o l - 2 2 he2"* - 3 * = 3 - ^ -
,
a
,
(14)
For k — 0, the above two equations are equivalent to 4> = 3d fad -
(15)
and 3d 2 - ft = 2, (16) which is a hyperbola separating the the k = — 1 and k = +1 trajectories. The d > 0 branch of the hyperbola corresponds to an expanding universe. We can parametrize this branch by writing
d=
7!(c+rl)'
^= 7 i ( e " r l ) '
(c>0)
'
(17)
Equation (15) then becomes C = - ^ [ ( a + v / 3) + ( a - v / 3 ) e 2 ] -
(18)
We now see why a = y/3 is special. For a < \/3 there is a fixed point solution
which is just the power-law attractor solution (11). The fixed point separates two other k = 0 trajectories: (i) £ = £ocoth 7 r
(ii) £ = £ 0 tanh 7 T,
7
/3 _ a 2 \ I = (_ _ J ,
(20)
where r > 0. Only case (ii) includes £ = 1, and hence 4> = 0; this solution undergoes a period of acceleration whereas the other does not. In either case one can integrate (17). In case (i) one has S^
oc (cosh7r) A + (sinh7-r) A - ,
e* oc (cosh7r) _ A + (sinh 7 r) A - ,
(21)
where
A±
" TTTa
(22)
There is a big-bang singularity at r = 0, near which S~£3,
e
*~£73,
(23)
660
PAUL K. TOWNSEND
so the volume of the compact Kaluza-Klein space M is initially zero. Subsequently the solution approaches the attractor (11). In case (ii) we have S ^ o c (cosh7r) A - ( s i n h 7 r ) A + ,
e* oc (cosh7r) A " (sinh7T)" A+ .
(24)
This behaves initially as in (10) but then passes through a period of acceleration before approaching the attractor (11). Both the above solutions were found in [5], for particular values of a in the range 1 < a < y 3 , as solutions of the D > 6 vacuum Einstein equations with a compact hyperbolic D — 4 dimensional manifold of time-dependent volume. As solutions of the 4-dimensional effective theory with Lagrangian density (4), they are actually valid for 0 < a < y/3, in particular for o = l. The power-law attractor solution in this case has S ~ t and hence zero acceleration, so the case (ii) solution that approaches it asymptotically must be eternally accelerating6. In fact, the late time behaviour is e^-t
+^
+ Oir2),
S~t
+^
+ 0(r2),
(25)
from which one sees that d2S > 0. One might suspect from this fact that there would be a future cosmological event horizon, in which case a = 1 would be excluded by the (admittedly conjectural) stronger form of the no-go theorem proposed earlier in this article. However, it has been shown by Boya et al. [17] that if the acceleration tends to zero asymptotically, as it does in this case, then there is no cosmological event horizon. Halliwell's qualitative analysis of all cosmological trajectories can similarly be made quantitative for k = 0 when a > 3. Consider first the a > \/3 case. The solution of (18) is
t=[7^)
tana;r
'
w=
{—) '
(26)
where 0 < r < TT/2. The equations (17) can now be integrated to yield S ^ a (coswr) A -(sinwT) A + ,
e^ oc (COSU>T)A~ (sinwT) _A+ .
(27)
The asymptotic behaviour as \ogt —> ±oo is S~ti,
e*~t±1/V3.
(28)
In between there is a period of acceleration. For a = \/3 we have simply £ = VQT and hence S3~rhiT\
e^ocr-M1"2.
(29)
The late time behaviour now involves logarithmic corrections to the power law behaviour of the a > y/3 case. Despite the differences between flux compactifications and hyperbolic compactifications, all the above cases of accelerating k = 0 universes are qualitatively similar. There is always e
A similar observation was made in [16] in the a > 1 case for trajectories that approach the k = - 1 Milne attractor.
Cosmic acceleration a n d M - t h e o r y
661
a big-bang singularity near which the scale factor behaves as in (10). Inspection of the phase portraits in [12] shows that this is also true for the k ^ 0 cosmologies. This is not surprising because the singularity theorems that guarantee a cosmological singularity rely on the SEC which is violated by the 4-dimensional effective theory only in a later epoch. In addition, the acceleration leads to only a few e-foldings, insufficient for any application to inflation in the early universe; this disappointing conclusion is confirmed by systematic analyses of the possibilities of hyperbolic-flux compactifications involving many scalar fields [16,18]. However, the mechanism may be relevant to acceleration in the current cosmological epoch [19]. One of its attractive features is that any time-dependence of four-dimensional 'constants' due to the time-dependence of the compact space is absent precisely during the period of acceleration, i.e., now! Implicit in everything discussed so far in this article is the assumption that String/Mtheory is adequately described for cosmological purposes by the classical effective 10 or 11 dimensional supergravity theories. It seemed worthwhile to fully explore the implications of this assumption, but it now also seems that it must be discarded because although it has been possible to find models with transient acceleration of possible relevance to the 'observed' acceleration of the current cosmological epoch, it has not been possible to find models that allow a sufficient period of early universe inflation. This is not a disaster; String/M-theory includes orientifolds that violate (at least locally) the SEC [20], and branes which can produce brane-instanton quantum corrections to the potential V. It has been shown [21] that when these effects are taken into account it is possible for the potential V to have a local minimum that could lead to inflation1. Nevertheless, although string and brane effects might yield potentials V that are quite different from those obtainable from the classical compactifications of effective supergravity theories considered here, some of the lessons learned from the latter case may prove valuable. If one re-interprets a in (15) and (16) to be shorthand for V/2V then these equations describe the k — 0 cosmologies for any positive potential V. As long as a > y/3, there will be no fixed point on the k = 0 hyperbola and hence a single k = 0 trajectory, which will necessarily include a period of acceleration. Thus, for a large class of potentials, including all those that arise classically from 'product' flux compactifications, an accelerating epoch is not only possible, but (assuming a flat universe) unavoidable! If a < y/3 at some point then fixed points may occur, and acceleration can be avoided. But note that we are now discussing how to avoid acceleration rather than how to achieve it! When it was noted in [5] that hyperbolic compactification can yield both a flat 4-dimensional universe that undergoes a period of acceleration and a flat 4-dimensional universe that is always decelerating, it was the former case that appeared remarkable, but actually it is the latter case, an eternally decelerating universe, that is exceptional. Of course, the acceleration is transient, so even if it is generic one could ask why we happen to be around to observe it. But is it really any more surprising that we live at an atypical time in a typical universe than that we live at a typical time in a atypical universe?
f
Such a potential would allow classical solutions with a future cosmological event horizon, but since quantum mechanics has now been invoked one can presumably invoke it again to argue that the universe will eventually tunnel out of the metastable vacuum.
662
PAUL K.
TOWNSEND
Acknowledgments T h e author t h a n k s Michael Douglas, Gary Gibbons, J a u m e Garriga, J a u m e Gomis, Shamit Kachru and M a t t i a s Wohlfarth for helpful discussions.
References 1. T. Banks, "Some thoughts on the quantum theory of de Sitter space", a r X i v : a s t r o - p h / 0305037 . 2. S. Hellerman, N. Kaloper, L. Susskind, "String theory and quintessence", JHEP 0106:003 (2001); W. Fischler, A. Kashani-Poor, R. McNees, S. Pabhan, "The acceleration of the universe, a challenge for string theory", JHEP 0107:003 (2001). 3. G. W. Gibbons, "Aspects of supergravity theories", in Supergravity and related topics, eds. F. de Aguila, J. A. de Azcaarraga, L. Ibariez, World Scientific, 1985. 4. J. Maldacena, C. Nunez, "Supergravity description of field theories on curved manifolds, and a no-go theorem", J. Mod. Phys. A 16, 822 (2001). 5. P. K. Townsend, M. N. R. Wohlfarth, "Accelerating cosmologies from compactification", Phys. Rev. Lett. 9 1 , 061302 (2003). 6. S. de Alwis, "On potentials from fluxes", arXiv:hep-th/0307084. 7. A. Aurilia, H. Nicolai, P. K. Townsend, "Hidden constants: The theta parameter of QCD and the cosmological constant of N = 8 supergravity", Nucl. Phys. B 176, 509 (1980). 8. P. K. Townsend, "Quintessence from M-theory", JHEP 0111:042 (2001). 9. M. S. Bremer, M. J. Duff, H. Lii, C. N. Pope, K. S. Stelle, "Instanton cosmology and domain walls from M-theory and string theory", Nucl. Phys. B 543, 321 (1999). 10. A. Kehagias, J. G. Russo, "Hyperbolic spaces in String and M Theory", JHEP 0007:027 (2000). 11. N. Kaloper, J. March-Russell, G. D. Starkman, M. Trodden, "Compact hyperbolic extra dimensions: Branes, Kaluza-Klein modes and cosmology", Phys. Rev. Lett. 85, 928 (2000). 12. J. J. Halliwell, "Scalar fields in cosmology with an exponential potential", Phys. Lett. B 185, 341 (1987). 13. L. Cornalba, M. S. Costa, "A new cosmological scenario in String Theory", Phys. Rev. D 66, 066001 (2002). 14. N. Ohta, "Accelerating cosmologies from S-branes", Phys. Rev. Lett. 9 1 , 061303 (2003); S. Roy, "Accelerating cosmologies from M/String-theory compactifications", Phys. Lett. B 567, 322 (2003); M. N. R. Wohlfarth, "Accelerating cosmologies and a phase transition in M-theory", Phys. Lett. B 563, 1 (2003); N. Ohta, "A study of accelerating cosmologies from Superstring/Mtheory", Prog. Theor. Phys. 110, 269 (2003); C.-M. Chen, P.-M. Ho, I. P. Neupane, J. E. Wang, "A note on acceleration from product space compactification", JHEP 0307:017 (2003). 15. R. Emparan, J. Garriga, "A note on accelerating cosmologies from compactifications and Sbranes", JHEP 0305:028 (2003). 16. C.-M. Chen, P.-M. Ho, I. P. Neupane, N. Ohta, J. E. Wang, "Hyperbolic space cosmologies", JHEP 0310:058 (2003). 17. L. J. Boya, M. A. Per, A. J. Segui, "Graphical and kinematical approach to cosmological horizons", Phys. Rev. D 66, 064009 (2002). 18. M. N. R. Wohlfarth, "Inflationary cosmologies from compactification", Phys. Rev. D 69, 066002 (2004). 19. M. Gutperle, R. Kallosh, A. Linde, "M/String theory, S-branes and accelerating universe", JCAP 0307:001 (2003). 20. S. B. Giddings, S. Kachru, J. Polchinski, "Heirarchies from fluxes in string compactifications", Phys. Rev. D 66, 106006 (2002). 21. S. Kachru, R. Kallosh, A. Linde, S. P. Trivedi, "De Sitter vacua in string theory", Phys. Rev. D 68, 046005 (2003); S. Kachru, R. Kallosh, A. Linde, J. Maldacena, L. McAllister, S. P. Trivedi, "Towards inflation in string theory", JCAP 0310:013 (2003).
Contributed talks Condensed matter physics J. FELDMAN (Univ. British Columbia, Vancouver, Canada) Construction of a 2-d Fermi liquid: the results M. SALMHOFER, W. DE SIQUEIRA P E D R A (Univ. Leipzig,Germany) Fermi systems in two dimensions and Fermi surface flows Dynamical systems J. BOCHI (IMPA, Rio de Janeiro, Brazil) The Lyapunov exponents of generic volume preserving and symplectic maps D. DOLGOPYAT (Univ. Maryland, USA) Dynamics of particles in dispersive billiards J. MATHER (Princeton University, USA) Arnold diffusion Equilibrium statistical mechanics L. CHAYES (UCLA,
USA)
Thresholds for surface formation in equilibrium: mesoscopic and macroscopic phenomena F. GUERRA (Univ. of Rome "La Sapienza", Italy) Rigorous results for mean field spin glass models General relativity P. CHRUSCIEL (Univ. Tours, France) Initial data engineering I. RODNIANSKI (Princeton University, USA) On the Cauchy problem in general relativity E. SEIDEL (Max Planck Inst, Germany and NCSA, USA) Computational problems in general relativity Nonequilibrium statistical mechanics L. BERTINI (Univ. of Rome "La Sapienza", Raly) Large deviations for boundary driven lattice gases J. LEBOWITZ (Rutgers University, USA) Large deviations and fluctuations in some model stationary nonequilibrium systems D. UELTSCHI (University of Arizona, USA) On non-equilibrium stationary states between reservoirs of quantum particles H.-T. YAU (Courant Institute, N.Y., USA) Diffusion of random Schrodinger equation in scaling limits
663
664
Contributed talks
Operator algebras and quantum information I. BJELAKOVIC (Technische Univ., Berlin, Germany) Limit theorems for quantum entropies A. WASSERMANN (Univ. Aix-Marseille 2, France) Operator algebras and positive energy representations Quantum field theory R. LONGO (Rome, Italy) A dichotomy in conformal field theory J. TESCHNER (Freie Univ., Berlin, Germany) Geometric aspects of the Liouville quantum field theory String and M theory M. DOUGLAS (Rutgers Univ., USA) Counting critical points and supersymmetric vacua J. GOMIS (Caltech,
USA)
String interactions from gauge fields B. JULIA (ENS, Paris, France) A duality between M-theory and algebraic surfaces: BPS superalgebras M. MARINO (CERN, Geneva, Switzerland) The topological vertex
Poster sessions Condensed matter physics P. S. GOLDBAUM {Princeton) Lower bound for the segregation energy in the Falicov-Kimball model E. LANGMANN {KTH, Stockholm) A class of exactly solvable models of 2d correlated fermions S. T E U F E L , G. PANATI, H. SPOHN {Tech. U. Miinchen) Effective dynamics for Bloch electrons: Peierls substitution and beyond V. GEYLER {Mordovian State U.) Fractal properties of the spectrum of the three-dimensional periodic Landau operators P . ZEINER {Wien)
Coincidence site lattices for cubic lattices J. LAGES, P . D. SACRAMENTO {1ST, Lisbon), Z. TESANOVIC {Johns Hopkins U.) Disorder and vortices in unconventional superconductors V. ZAGREBNOV {CPT, Marseille) Do Bosons condense in a homogeneous magnetic field? A. SUTO {Res. ISSPO, Budapest) Bose condensation of trapped bosons K. WATANABE, K. NAKAMURA, H. EZAWA {Tokyo) A condition for Bose-Einstein condensation in a trap C. BROUDER {Paris VI and VII)
Quantum field theory of degenerate systems M. ANGELOVA {Northumbria U., Newcastle upon Tyne) Algebraic thermodynamic properties of anharmonic molecules and linear chains J. RiESS {CRTBT, CNRS Grenoble) On the theory of electric DC-conductivity: linear and non-linear microsopic eveolution vs macroscopic behaviour Dynamical systems V. M. IVANOV {Ural State U.) The control of asymptotic of discrete time dynamical system D. A. MENDES {ISCTE, Lisbon), J. S. RAMOS {1ST, Lisbon) Kneading theory for two-dimensional skew-product maps P. KASPERKOVITZ {IPT, Wien), C. TUTSCHKA {U. Carlos III, Madrid) Ergodic properties of a 2D Hamiltonian system with step potential
665
666
Poster sessions
C. N E T O DE CARVALHO (17. Lisbon) Fractal implications in spatial and evolutionary paleoecology of the daedalus producer (ichnofabric analysis) C. C.
RAMOS
(U. Evora), N.
MARTINS
(1ST, Lisbon),
A. N. SHARKOVSKY (NAS, Kiev), J. S. RAMOS (1ST, Lisbon)
C* algebras associated to the Feigenbaum attractor C. GRACIO (U. Evora), J. S. RAMOS (1ST, Lisbon) Geodesic length spectrum, systoles and eigenvalues of the Laplacian C. JANUARIO (ISEL, Lisbon), J. S. RAMOS (1ST, Lisbon) Symbolic dynamics for Piece-Wise linear area-preserving maps A. HERNANDEZ-GARDUNO (UNA, Mexico City)
Symmetries in simple mechanics systems J. D U A R T E (1ST, Lisbon), L. SILVA (U. Evora), J. S. Discrete dynamics of the Fitzhugh-Nagumo systems
RAMOS
(1ST, Lisbon)
J. L. ROCHA (ISEL, Lisbon), J. S. RAMOS (1ST, Lisbon)
Iterated maps of the interval with holes and overlaps M. LENCI (Stevens Inst. Tech.) Infinite-measure billiards J. F O N T , C. SIMO (U. Barcelona), A. NUNES (U. Lisbon)
Consecutive quasi-collisions in the RTBP A. PRYKARPATSKY (Krakow, Lviv), N. PRYKARPATSKA (Lviv)
Ergodic measure and homology properties of time-dependent periodic Hamiltonian flows on symplectic manifolds S. VINAGRE (U. Evora), R. SEVERINO (U. Minho), J. S. RAMOS (1ST, Lisbon) Symbolic dynamics for nonlinear difference operator of Schrodinger S. FERNANDES (U. Evora), J. S. RAMOS (1ST, Lisbon) Second largest eigenvalue of a Markov shift as a topological invariant O. MUL (ICIT, Ukraine) Analysis of vibrations in the dynamical systems of complex machine assemblies R. BERLANGA (UNA, Mexico City) Homotopy equivalence and groups of measure-preserving homeomorphisms A. J. MACIEJEWSKI (U Zielona), M. PRZYBYLSKA (INRIA, Project CAFE) Application of Morales-Ramis differential Galois theory to study the integrability for certain dynamical systems R. INOUE (U. Tokyo)
The matrix realization of afhne Jacobi varieties and the extended Lotka-Volterra lattice A. SACCHETTI (U. Modena) Chaotic behaviour of driven two-level systems
Equilibrium statistical mechanics
667
D. ROMERO, F . ZERTUCHE (UNA, Mexico City) On the asymptotic dynamics in the random map model B. M E S T E L (Exter, UK) Renormalisation in quasiperiodically drewn quantum systems Equilibrium statistical mechanics A. C. D. VAN E N T E R , S. B. SHLOSMAN (RUG, Netherlands) First-order transitions in vector models and lattice gauge models with continuous symmetries P . MARTIN (SFIT, Lausanne), J. PIASECKI (U. Warsaw) Quantum Mayer graphs and self-consistent equation for an interacting Bose gas P . L. FERARRI, H. SPOHN (ZM, TU, Munchen)
Shape fluctuations of a faceted crystal W. L. SPITZER (UC at Davis), S. STARR (Princeton) Improved bounds on the spectral gap above frustration free ground states of quantum spin chains V. A. ZAGREBNOV (Marseille)
Bose-Einstein condensation for homogeneous interacting systems with a one-particle spectral gap S. L. STARR
(Princeton)
Ferromagnetic ordering of energy levels K. R. ITO (Setsunen U.) Absence of phase transitions in two-dimensional O(N) spin models and Anderson localization J-N. AQUA, M. E. FISCHER (Maryland) Critical phenomena in ionic fluids: the spherical model Fluid dynamics and nonlinear P D E s T. KOBAYASHI (Tsukuba College of Technology) Zero-energy flows and vortex patterns in quantum mechanics G. VAN BAALEN (Geneve)
Phase turbulence in the complex Ginzburg-Landau equation via Kuramoto-Sivashinsky phase dynamics A. LUDU (Northwestern State U., LA) Generalized nonlinear equation and solutions for fluid contour/surface deformations L. F . GUIDI, D. H. U. MARCHETTI (USP, Brasil) A comparison analysis of Sivashinsky's type evolution equations describing flame propagation in channels S. GUSEINOV (Latvia)
One method for solution of some classes of mathematical physics inverse problems
668
Poster sessions
R. D. BENGURIA, M. C. DEPASSIER (U. Pontificia, Chile) Bounds on the bifurcation branches of an equation that models long-wave instabilities L. T s .
ADZHEMYAN
(St. Petersburg), J.
HONKONEN
M. V. KOMPANIETS, A. N. VASIL'EV (St.
(Helsinki), T . L.
KIM,
Petersburg)
Two-loop calculation of e expansion improved by summation of nearest dimensional singularities for randomly forced incompressible fluid E. Yu. ROMANENKO, A. N. SHARKOVSKY (NAS, Ukraine), J. S. RAMOS (1ST, Lisbon), S. VINAGRE (U. Evora)
Ideal turbulence and symbolic dynamics T. KOBAYASHI (Tsukuba College of Technology) Zero-energy flows and vortex patterns in quantum mechanics J. J. P E N A , A. R U B I O - P O N C E , J . MORALES (U. Autonoma Metropolitana, Azcapotzalco) Generalized solutions for the one-dimensional Fokker-Planck equation: Partner potentials S. NIKITENKOVA (Nizhny Novgorod State Technical U.) Korteweg-De Vries and Boussinesq equations and nonlinear transformation solitary waves at a bottom step G e n e r a l relativity I. AVRAMIDI (New Mexico Inst, of Mining and Tech.) Matrix Riemannian geometry and general relativity C. KLEIN (MPI, Miinchen)
Binary black-holes in Ernst picture J. LAGES (1ST, Lisbon), A. BERARD, Y. GRANDATI (IP, Metz), H. MOHRBACH (ICS, France) Classical spinning particle assuming a covariant Hamiltonian F. L O B O , P . CRAWFORD (U. Lisbon) Linearized stability analysis of thin-shell wormholes with a cosmological constant M. NARITA (MPI, Golm)
On the global existence problem in T 3 -Gowdy symmetric IIB superstring cosmology M. E. ROSADO (U. Autonoma Madrid) Structure of -X"(M)-invariant functions on V LM S. SZYBKA (Jagellonian U., Krakow) Chaotic dynamics of wave maps coupled to gravity in 3 + 1 dimensions S. VACARU (1ST, Lisbon) Horizons, geodesies and stability of black ellipsoids A. BERNAL, M. SANCHEZ (Granada) Fundamental geometric structures of spacetime K. SAIFULLAH, M. ZIAD, H. KHAN (Quaid-i-Azam
An algorithm for the Lie symmetries of tensors
U.)
Integrable systems and random matrix theory
669
Integrable s y s t e m s and random matrix theory A. SOSHNIKOV (UC at Davis) Janossy densities in determinantal and Pfafnan ensembles of random matrices K. TAKASAKI (Kyoto) Landau-Lifshitz equation, elliptic akns hierarchy and Sato Grassmannian C. KLEIN (MPI, Miinchen), J. FRAUENDIENER (17. Tubingen) Hyperelliptic theta functions and spectral methods I. CALVO, F . FALCETO (U. Zaragoza) Topological Poisson-Lie sigma models and the Heisenberg double I. V. KRASOVSKY
(Berlin)
Some computable Wiener-Hopf determinants and related polynomials orthogonal on an arc of the unit circle V. SHRAMCHENKO (Concordia U.) New Forbenius structures on Hurwitz spaces in terms of Schiffer and Bergmann kernels B. BAMBAH (Hyderabad)
Coherent states of superintegrable systems based on polynomial algebras V. GOLUBEVA (RIS and TI, Moscow) On universal r-matrices for two-parametric models with reflections in quantum field theory Nonequilibrium statistical mechanics E. CELEGHINI (Firenze) Nonequilibrium quantum statistics E. CARLEN (Georgia Tech) On the relation between the Master equation and the Boltzmann Equation in Kinetic Theory Y. ELSKENS (CNRS, Marseille), M. K-H KIESSLING (Rutgers), V. Ricci (Roma I) The Vlasov limit for a relativistic particle system interacting with a scalar wave field Y. ELSKENS (CNRS, Marseille) The analogue of Van Kampen-case modes for a finite number of degrees of freedom Y. ELSKENS, D. F . ESCANDE (CNRS, Marseille) Derivation of the Fokker-Planck equation from strongly chaotic Hamiltonian dynamics F. DUNLOP (U.
Cergy-Pontoise)
Stationary states of one-dimensional interfaces S. GROSSKINSKY, G. M. SCHUTZ, H. SPOHN
(Munich)
Condensation in the zero range process: Stationary and dynamical properties P . KOCIAN, K. LENDI (Zurich)
PCI, impact of Von Neumann conditions for density operators on kernel and inhomogeneity in non-Markovian quantum master equations
670
Poster sessions
J . - P . ECKMANN, E. ZABEY
(Geneve)
Strange heat flux in (an)harmonic networks J. P . ECKMANN, E. MOSES, D. SERGI (U. of Geneva and Weizmann Specialists and generalists in mathematical physics
Institute)
Operator algebras and quantum information S. FURUICHI, K. YAMANE (Tokyo U. of Science) A numerical example both of the optimal decision rule in the quantum channel coding and the concavity of the function appearing in quantum reliability function M. K E Y L (IMP,
TU-Braunschweig)
Quantum spin chains as entanglement resources P. LEROUX (U. De Rennes I and CNRS) From weighted directed graphs to coassociative manifolds F. LLEDO (IPAM, Aachen), H. BAUMGARTEL (U. Potsdam)
Hilbert C*-systems for C*-algebras with a nontrivial center and duality of compact groups A. MAJEWSKI (U. Gdansk) On quantum dynamical systems — some recent developments A. PRYKARPATSKY (UST, Poland) Quantum computing mathematics, holonomic algorithms and their applications D. SCHLINGEMANN (U. Constantin Brancusi) Algebraic approach to quantum cellular automata M. SVOBODOVA (Prague)
Fine gradings on non-simple Lie algebras: example of o(4;C) F. BENATTI (Trieste) Quantum dynamical entropies and quantum algorithmic complexities Path integrals and stochastic analysis F. SOBIECZKY (Berlin) Estimating the number of connected components of a random planar graph on Z 2 by isoperimetrical spectral estimates of the generator of a random walk H. HUEFFEL, G. KELNHOFER (ITP, Vienna) QED Revisited: proving equivalence between path integral and stochastic quantization E. LYTVYNOV (IAM,
Bonn)
Glauber dynamics of continuous particle systems A. HAHN (IAM,
Bonn)
A new approach for the path integral quantization of Chern-Simons models on S2 x S1
Quantum field theory
671
P . GOSSELIN (V. Grenoble I), H. MOHRBACH (Inst. Fourier), A. BERARD (Inst. Charles Sadron), Y. GRANDATI (LPMC, Metz) Statistical system with a field dependent wave function A. L. REBENKO (CNRS, Luminy), V. A. ZAGREBNOV (Aix-Marseille II) Gibbs state uniqueness for anharmonic quantum crystal with nonpolynomial double-well potential J. HONKONEN (NDC, Helsinki) Force-level fluctuations in Lanchester-Osipov combat models O. O R O N (Tel Aviv), L. P . HORWITZ (Tel Aviv and Bar Ilan V.) Relativistic Brownian motion and a relativistic Schrodinger equation on a manifold S. I. MUSLIH (ICTP, Trieste) Integrability and action function in multi-hamiltonian systems P . LEROUX, D. P E T R I T I S (IRM, Rennes I) Context-dependent grammars driven by oriented graphs and the analysis of the corresponding language through cluster expansions M. J. OLIVEIRA (U. Lisbon and V. Aberta, Lisbon), Yu. G. KONDRATIEV (Bielefeld V.) Invariant measures for Glauber dynamics of continuous systems Q u a n t u m field t h e o r y B. BAMBAH (V. Hyderabad), C. MUKKU (Panjab U.) Charged vs. neutral particle creation in expanding universes: a quantum field theoretic treatment J. M. BAPTISTA (Cambridge)
Some special Kahler metrics on SL(2,C) and their holomorphic quantization H. GROSSE (IPT, Vienna), R. WULKENHAAR (MPI, Leipzig)
Renormalization group approach to noncommutative scalar models N. HARSHMAN (American V., Washington DC) Poincare semigroup as emergent property of unstable particles M. H E N K E L (VHP Nancy I and V. Lisbon) Local scale invariance and applications to nonequilibrium criticality W. P . JOYCE (U. Canterbury, New Zealand) All Z„-graded Bose-Fermi recouplings for n > 2 lead to state confinement W. JUNKER (Muhlenberg), F . LLEDO (IPAM, Aachen) Quantum field theory of the electromagnetic vector potential on curved spacetimes F. KLEEFELD (1ST, Lisbon) On the (anti)casual analytical continuation of Hermitean quantum (field) theories B. KUCKERT (IPT Hamburg)
Covariant thermodynamics of quantum systems: passivity, semipassivity and the Unruh effect
672
Poster sessions
EDWIN LANGMANN
(Stockholm)
Exact solution of noncommutative field theory in a background magnetic field A. G. MARTINS, A. M. MARQUES, A. R. QUEIROZ,
P. TEOTONIO-SOBRINHO (USP, Brasil) Topological field theories of vortices and geons A. MIKOVIC (U. Lusofona, Lisbon) Quantum field theory of spin networks and quantization of gravity and matter C. D ' A N T O N I (U. Rome "Tor Vergata"), G. MORSELLA (U. Rome "La Sapienza"), R. VERCH (MPI-MIS, Leipzig) Scaling algebras for charge carrying quantum fields and superselection structure at short distances D. H. T . F R A N C O (CEFT, Belo Horizonte), C. M. M. Supersymmetric field theoretic models
POLITO
(CBPF, Rio de Janeiro)
O. RlCHTER (U. Leipzig) Spectral actions for leaf spaces of foliations N. M. ROMAO (MPI, Leipzig)
Dynamics of CP 1 lumps on a cylinder J. KIJOWSKI (Warsaw), G. RUDOLPH (U. Leipzig) Observable algebras and charge superselection structures of gauge theories on the lattice N. SZPAK (J. W. Goethe U.)
On the problem of "spontaneous pair creation" in strong electric fields P . A. FARIA DA VEIGA (USP,
Brasil)
Existence of baryons, baryon spectrum and mass splitting in lattice QCD Quantum mechanics and spectral theory F. BRAU (U.
Mons-Hainaut)
Upper and lower limits for the number of bound states in an attractive central potential N. CATARINO, R. MACKAY (U.
Warwick)
Spectrum of quasiperiodic discrete Schrodinger operator T. CHEN (Courant Institute) Localization of length and Boltzmann limit of the Anderson model at small disorders in d = 3 D. CHRUSCINSKI (Nicholaus Copernicus U.) Spectral theory and quantum damped systems R. FLOREANINI (Trieste)
Complete positivity from factorized dynamics G. GENTILE (Roma III)
Quasi-periodic solutions and pure point spectrum for two-level systems
Quantum mechanics and spectral theory D. HASLER (Copenhagen), F. BERNASCONI, G. G R A F (ETH Zurich) The heat kernel expansion for the elctromagnetic field in a cavity and its application to the Casimir energy P. EXNER (Kanazawa U.) Product formula for quantum Zeno dynamics A. IANTCHENKO (Malmo)
On the positivity of the Jansen-Hess operators for arbitrary mass T . ICHINOSE (Kazanawa) Quantum formula for quantum zero dynamics D. KREJCIRIK (1ST, Lisbon) The nature of the essential spectrum in curved quantum waveguides B. KUCKERT (Hamburg) Large spin and statistics in non-relativistic quantum mechanics J. DITTRICH, J. KIRZY (NPI,
Reiz)
Quantum waveguides with combined boundary conditions R. DE LA MADRID (U. Valladolid)
The role of the rigged Hilbert space in quantum mechanics D. MARCHETTI (Sao Paulo) Spectral transitions in a class of off-diagonal Jacobi matrices: A probabilistic approach H. MAKINO (Tokai U.)
Energy level statistics of integrable quantum systems based on the Mehta-Berry-Robink approach O. P O S T , F . LLEDO (IPAM, Aachen)
Spectral gaps on manifolds with noncommutative group actions N. BOGOLIUBOV (Steklov, Moscow), U. TANER (EMU, Famagusta),
A. PRYKARPATSKY (UMM, Poland) On modeling of the Maxwell-Bloch quantum optical super-radiance proccess and its application to quantum computing N. ROEHRL (Stuttgart) A new numerical method to solve the inverse Sturm-Liouville problem M. ROULEUX (CPT, Marseille) Tunneling effects between tori in double wells R. SCHUBERT (Bristol) Semiclassical time evolution for large times on manifolds of negative curvature N. STOILOVA (Ghent) Wigner quantum systems: non-commutative sl(l|3) oscillator N. UEKI (Kyoto)
The integrated density of states of random Pauli Hamiltonians
673
674
Poster sessions
D. YAFAEV (Rennes I) A particle in a magnetic field of an infinite rectilinear current V. ZAGREBNOV {CPT, Marseille) Trotter-Kato product formula: Recent results G. GARCI'A-CALDERON {UNAM, Mexico City) Solution of the time-dependent Schrodinger equation for decay in terms of resonant states String a n d M t h e o r y E. ALDROVANDI {Florida State U.) Hermitean-holomorphic classes and tame symbols related to uniformization, the dilogarithm and the Liouville action P . BORDALO {Paris) Quantum dielectric branes B. CERCHIAI (U. of North Carolina) The Seiberg-Witten map for a time-dependent background S. CHERKIS {Princeton)
Explicit hyperkahler metrics from string theory J. EDELSTEIN {1ST, Lisbon and La Plata) Wrapped D-branes and special holonomy manifolds I. GlANNAKis {Rockefeller) Spacetime supersymmetry and the super Higgs mechanism in string theory A. KHOLODENKO {Clemson)
"New" Veneziano amplitudes from "old" Fermat (hyper) surfaces M. MARCIC {Maribor, Slovenia) A tachyon-graviton scattering amplitude within the framework of string theory A. MIKOVIC {U. Lusofona, Lisbon) Spin foam models of string theory A. MISRA {Humboldt U., Berlin) {"Barely") G2-manifolds, (orientifold of) a compact Calabi-Yau and nonperturbative N = 1 superpotentials N. REIS {ENS, Lyon)
WZW branes and gerbes L. SNOBL {Prague) On modular spaces of Drinfeld doubles S. VACARU {1ST, Lisbon) (Non) commutative Finsler geometry from string/M-theory I. ZOIS {Cardiff) Holography and the Deligne conjecture
Satellite meetings
SWAGP03: School and Workshop on Algebraic Geometry and Physics, 2003 Salamanca, June 18-23 and 24~27 Advisory committee: E. Arbarello, R. Dijkgraaf, R. Donagi, M. Douglas, B. Dubrovin, M. S. Narasimhan, C. Viallet. Scientific Committee: D. Hernandez Ruiperez, J. M. Mufioz Porras, J. Mateos Guilarte (Salamanca), C. Bartocci (Genova), U. Bruzzo (SISSA). URL: http: //mukai. u s a l . es/wagp03/ Recent Trends in Dynamics 2003 Oporto, July 7-11 Scientific committee: M. Lyubich, J. Palis, Y. Sinai', S. van Strien, M. Viana, J.-C. Yoccoz. Organizing committee: J. F. Alves, V. Araujo, M. Carvalho, F. J. Moreira, J. Rocha. URL: http: //www. f c. up. pt/cmup/rtd/ Progress in Supersymmetric Quantum Mechanics (PSQM'03) Valladolid, July 15-19 International organizing committee: D. J. Fernandez C. (CINVESTAV, Mexico DF, Mexico), V. Hussin (CRM, Montreal, Canada), J. Negro (Universidad de Valladolid, Spain), L. M. Nieto (Universidad de Valladolid, Spain), B. Samsonov (Tomsk State University, Russia). Local organizing committee: J. Negro (Universidad de Valladolid, Spain), L. M. Nieto (Universidad de Valladolid, Spain). URL: http://metodos.fam.cie.uva.es/~susy_qm_03/ Madeira Math Encounters X X V Madeira, July 15-October 15 URL: http://www.uma.pt/Investigacao/Ccm/events.html X l l t h Oporto Meeting on Geometry, Topology and Physics Oporto, July 17-20 Organizing committee: Jose Mourao (Lisbon, 1ST), Roger Picken (Lisbon, 1ST), Joao Nuno Tavares (Oporto, FCUP). URL: http://www.math.ist.utl.pt/~jmourao/om/omxii/ Mathematical Problems in Quantum Mechanics Lisbon, July 21-24 Organizing committee: Pavel Exner (Prague), Pedro Freitas (Lisbon), Jan Philip Solovej (Copenhagen). URL: http://www.math.ist.utl.pt/~pfreitas/mpqm/mpqm.html
675
676
Satellite meetings
Infinite Dimensional Algebras and Quantum Integrable Systems Faro, July 21-25 Organizing committee: C. Burdik, P. P. Kulish, N. Manojlovic, H. Samtleben, A. Stolin, J.-C. Zambrini. URL: http://www.ualg.pt/cma/idaquis/announcement.html Workshop on Categorification and Higher-Order Geometry Lisbon, July 23-24 URL: http://www.math.ist.utl.pt/~rpicken/CH0G2003 CFIF Workshop on Time Asymmetric Quantum Theory: the Theory of Resonances Lisbon, July 23-26 Organizing committee: Arno Bohm (Texas), Lidia S. Ferreira (IST, Lisbon), M. Gadella (Valladolid), Nathan Harshman (Texas). URL: h t t p : / / c f if . i s t . u t l . p t / ~ r e s 2 0 0 3 / Joint MaPhySto and Q U A N T O P Workshop on Quantum Measurements and Quantum Stochastics Aarhus, August 7-12 Organizing committee: Ole E. Barndorff-Nielsen (MaPhySto, University of Aarhus), Richard D. Gill (University of Utrecht), Elena R. Loubenets (MaPhySto, University of Aarhus), Klaus M0lmer (QUANTOP, University of Aarhus), Eugene Polzik (QUANTOP, University of Copenhagen). URL: http://www.maphysto.dk/events2/QPFA2003/ XII Fall Workshop on Geometry and Physics Goimbra, September 8-10 Organizing committee: J. da Costa, H. Albuquerque, R. Caseiro, J. Correia, J. Gallardo. Scientific committee: J. de Azcarraga (U. Valencia), P. Perez (U. Salamanca), M. del Olmo (U. Valladolid), A. Romero (U. Granada), J. Carifiena (U. Zaragoza), M. Lecanda (U. P. C. Barcelona), A. Ibort (U. Carlos III de Madrid), M. de Leon (G S. I. C. Madrid), A. Lopez (U. C. Madrid), J. Marrero (U. La Laguna), R. Picken (Universidade Tecnica de Lisboa). URL: http://www.mat.uc.pt/~geomfis/
List of participants
ABRAMOV, V I K T O R
ANTOINE, J E A N - P I E R R E
I n s t i t u t e of P u r e M a t h e m a t i c s , University of T a r t u , Liivi 2 - 502, T a r t u 50409, Estonia abramovQut.ee
I n s t i t u t de Physique Theorique, Universite Catholique de Louvain, 2, chemin du Cyclotron, B-1348 Lduvain-la-Neuve, Belgium antodneflfyma.ucl.ac.be
AFCHAIN, STEPHANE
Ecole Polytechnique, C P H T , F-91128 Palaiscau cedex, France stephane.afchainGcpht.polytechnique.fr AIZENMAN, MICHAEL
Jadwin Hall, Princeton University, Princeton, NJ 08540-0708, USA aizenmanQPrinceton.EDU ALARCON, G U Y
17 Rue Pierre Curie, 94110 Arcueil, France galarconUfree.fr ALBEVERIO, SERGIO
Inst. Probability & M a t h . Statistics, Univ. Bonn, Wegelerstrasse 6, D 53115 Bonn, G e r m a n y albeverioUUni-Bonn.DE ALDROVANDI, E T T O R E
D e p a r t m e n t of M a t h e m a t i c s , 208 Love Building, Florida S t a t e University, Tallahassee, F L 32306-4510, USA aldrovandiQmath.fsu.edu ALICKI, ROBERT
I n s t i t u t e of Theoretical Physics and Astrophysics, University of Gdansk, W i t a Stwosza 57, PL-80-952 Gdansk, Poland f izraQuniv.gda.pi A L - R A S H E D , MARYAM
Flat 2B, J e r o m e House, Glendower Place, London S W 7 3DU, UK mha2Qic.ac.uk AMORIM, PAULO
E s t r a d a da Luz, 228, 6D, PT-1600-165 Lisboa, Portugal pamorimfflptmat.fc.ul.pt ANDREY, LADISLAV
UI AV CR, Pod vodarenskou vezi 2, 182 07 P r a g u e 8, Czech Republic andreQcs.cas.cz ANGELINI, LEONARDO
Sezione INFN, Via Amendola 173, 70126 Bari, Italia leonardo.angelinifiba.infn.it ANGELOVA, MAIA
School of C o m p u t i n g , Engineering and Information Sciences, N o r t h u m b r i a University, P a n d o n Building, C a m d e n Street, Newcastle upon T y n e , UK, G B - N E 2 1XE
AQUA, J E A N - N O E L
Ecole Normale Superieure de Lyon, 46 allee d'ltalie, 69364 Lyon cedex 07, France jnaquafflglue.umd.edu ARAKI, HUZIHIRO
Research I n s t i t u t e for M a t h e m a t i c a l Sciences, Kyoto University, Kitashirakawa-Oiwakecho, Sakyoku, Kyoto 606-8502, J a p a n arakiflkurims.kyoto-u.ac.jp AREDE, MARIA TERESA
Faculdade de Engenharia — D E M E G I , Edifi'cio M, sala 305, R. do Dr. R o b e r t o Frias, PT-4200-465 P o r t o , Portugal t aredeQf e . u p . p t ARGUIN, L O U I S - P I E R R E
Princeton University, D e p a r t m e n t of M a t h e m a t i c s , Fine Hall, Princeton NJ 08544-1000 USA larguinQmath.princeton.edu ATTAL, ROMAIN
11 rue de la Sabliere (Les Monceaux), F-91150 Morigny-Champigny, France attalQlpthe.jussieu.fr AVRAMIDI, IVAN
D e p a r t m e n t of M a t h e m a t i c s , New Mexico Tech, Socorro, NM 87801, USA iavramidQnmt.edu AVRON, Yosi Dept. of Physics, Technion, Haifa 32000, Israel avronQphysics.technion.ac.il BACH, VOLKER
I n s t i t u t fiir M a t h e m a t i k , F B Physik, M a t h e m a t i k and Informatik (08), J o h a n n e s Gutenberg-Universitat, Staudinger Weg 9, D-55128 Mainz, G e r m a n y vbachdmathematik.uni-mainz.de B A E Z , JOHN
D e p a r t m e n t of M a t h e m a t i c s , University of California, Riverside CA 92521, USA baezGmath.ucr.edu BAIK, JINHO
D e p a r t m e n t of M a t h e m a t i c s , University of Michigan, 2074 East Hall, 530 Church Street, Ann Arbor, MI 48109-1043, USA baikQumich.edu BAMBAH, BINDU
School of Physics, University of Hyderabad, H y d e r a b a d 500-046, India binduflpu.ac.in
maia.angelovaflunn.ac.uk ANICETO, INES
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal
BAMBUSI, DARIO
D i p a r t i m e n t o di M a t e m a t i c a , Unversita degli studi di Milano, Via saldini 50, 20133 Mllano, Italy bambusiQmat.unimi.it
anicetoOmath.ist.utl.pt
677
678
Participants: BAPTISTA - CABRAL
BAPTISTA, J O A O
BlZON, PlOTR
R. Coronel Apan'cio 2, PT-2300-558, Tomar, Portugal JMBat istaQdamtp. cam. a c . uk
Smoluchowski Inst, of Physics, Jagiellonian University, ul. R e y m o n t a 4, 30-059 Krakow, Poland bizontSth. i f . u j . edu. p i
BARREIRA, LUIS
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal barreiraQmath.ist.utl.pt BARTNIK, ROBERT
School of M a t h e m a t i c a l Sciences, Monash University, Clayton, Victoria 3800, Australia robert.bartnikOsc i.monash.edu.au BASTOS, M. AMELIA
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal abastos3math.ist.utl.pt BAUMGARTEL, HELLMUT
BJELAKOVIC, IGOR
Technische Universitat Berlin, Fakultat II, I n s t i t u t F u r M a t h e m a t i k , Sekr. MA 7-2, Strafie des 17 Juni 136, 10623 Berlin, G e r m a n y igor<9math. t u - b e r l i n . de BLANCHARD, P H I L I P P E
Fakultat fur Physik, Universitat Bielefeld, D-33615 Bielefeld, G e r m a n y blanchardQphysik.uni-bielefeld.de BLUE, P I E T E R
D e p a r t m e n t of M a t h e m a t i c s , University of Toronto, 100 St. George Street, Toronto, Ontario, M5S 3G3, Canada pblue0math.rutgers.edu
M a t h e m a t i c a l I n s t i t u t e , University of P o t s d a m , A m Neuen Palais 10, P F 601553, D-14451, P o t s d a m , Germany
BORDALO, P E D R O
baumgQrz.uni-potsdam.de
bordaloQlpthe.jussieu.fr
BAUMGARTNER, BERNHARD
Bozi, DANIEL M A R C O PHILIP
I n s t i t u t e for Theoretical Physics, Boltzmanngass 5, A-1090 Wien, A u s t r i a Bernhard.Baumgatnerfflap.univie.ac.at
Dep. Fisica, Universidade D o Minho, C a m p u s G u a l t a r , PT-4710-057 Braga, Portugal boziflfisica.uminho.pt
BEIGLBOCK, W O L F
BRAU, FABIAN
I n s t i t u t fur Angewandte M a t h e m a t i k , Universitat Heidelberg, Im Neuenheimerfeld 294, D-69120 Heidelberg, G e r m a n y wbeiglfflmath.uni-heidelberg.de
University of Mons-Hainaut, Pentagone, Place du P a r e 20, B-7000 Mons, Belgium
B E N AROUS, GERARD
Duke University, M a t h e m a t i c s Dept., Box 90320, D u r h a m , N C 27708-0320, USA
Ecole Polytechnique Federale de Lausanne, Section de M a t h e m a t i q u e s , CH 1015 Lausanne, Switzerland gerard.benarousOepfl.cn BENATTI, FABIO
D i p a r t i m e n t o di Fisica Teorica, Universita di Trieste, S t r a d a Costiera 11, 1-34014 Grignano-Trieste, Italy benattiSts.infn.it BENGURIA, RAFAEL
Facultad de Fisica, P. Universidad Catolica de Chile, Casilla 306, Santiago 22, Chile rbenguriQfis.puc.cl BERARD, ALAIN
Laboratoire de Physique Moleculaire et des Collisions, I n s t i t u t de Physique electronique et de Chimie, Universite de Metz, 1, Boulevard F . Arago, 57078 Metz cedex 3, France aberardOOlfflnoos. f r BERLANGA, RICARDO
Circuito Escolar, Ciudad Universitaria, 04510, Mexico D. F . rberlangaOleibniz.iimas.unam.mx BERNARD, DENIS
Service de Physique Theorique d e Saclay, C E A / S a c l a y , F-91191 Gif-sur-Yvette cedex, France [email protected] BERTINI, LORENZO
M a t h e m a t i c s D e p a r t m e n t , Universita di R o m a "La Sapienza", Piazzale Aldo Moro, 2, 1-00185 Roma, Italy bertiniQmat.uniromai.it
R u a Filipe Magalhaes, 4, 7° d t o , PT-1700-194 Lisboa, Portugal
fabian.brauQumh.ac.be BRAY, HUBERT
brayGmath.duke.edu BRICMONT, JEAN
U C L - F Y M A , 2, Chemin du Cyclotron, B-1348 Louvain la Neuve, Belgium bricmontOfyma.ucl.ac.be BROUDER, CHRISTIAN
Institut de Mineralogie et de Physique des Milieux Condenses, 140 rue de Lourmel, 75015 Paris, France christian.brouderOlmcp.jussieu.fr BRU, J E A N BERNARD
J o h a n n e s Gutenberg-Universitat, F B 08 - Institut fur M a t h e m a t i k , Staudinger Weg 9, G e b . 2413, D-55128 Mainz, G e r m a n y j eanbernard.bruQfree.fr BRUNETTI, R O M E O
II. I n s t i t u t fur Theoretische Physik, Universitat Hamburg, Luruper Chaussee 149, D-22761 Hamburg, Germany romeo.brunettiQdesy.de BRYDGES, DAVID
University of British Columbia, 121-1984 M a t h e m a t i c s Road, Vancouver, B.C., C a n a d a V 6 T 1Z2 db5d®math.ubc.ca BUCHHOLZ, DETLEV
Universitat Gottingen, I n s t i t u t fiir Theoretische Physik, Friedrich-Hund-Platz 1, 37077 Gottingen, Germany buchholzfltheorie.physik.uni-goettingen.de
BIVAR, ANTONIO
CABRAL, BENEDITO
C M A F , Av. Prof. G a m a P i n t o 2, PT-1649-003 Lisboa, Portugal
G F M U L , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal ben(9adonis.cii . f c . u l . p t
abivarQptmat.fc.ul.pt
Participants:
CALVO - D ' A N T O N I
679
CALVO, IVAN
C H E N , THOMAS
D e p a r t a m e n t o de Fisica Teorica, Facultad de Ciencias, Universidad de Zaragoza, Calle P e d r o Cerbuna, 12, E-50009 Zaragoza, Spain icalvofiunizar.es
D e p a r t m e n t of M a t h e m a t i c s , P r i n c e t o n University, Fine Hall, Washington Road, P r i n c e t o n , N J 08544, USA chenthomQcims.nyu.edu
CAM ASS A, PAOLO
CHENCINER, ALAIN
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "Tor Vergata", Via deila Ricerca Scientifica, 00133 Roma, Italy
I M C C E , 77, avenue Denfert-Rochereau, 75014 Paris, France
camassaQmat.uniroma2.it CANNAS, ANA
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , A v. Rovisco Pais, PT-1049-001 Lisboa, Portugal acannasflmath.ist.utl.pt CARLEN, E R I C A.
Georgia Tech., M a t h e m a t i c s , 211 Skiles Building, A t l a n t a , GA 30332, USA CarlenlSmath. getech. edu CARVALHO, CoNCEigAo
C M A F , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, P o r t u g a l mcarvalhoflcii.fc.ul.pt CATARINO, NUNO
13 (2F2 - buzzer 5), Bruntsfield Place, Edinburgh EH10 4HN, UK catarinoGmaths.Warwick.ac.uk CATTO,
ISABELLE
C E R E M A D E ( U M R 7534), Universite de P ar i s-Dauphine , Place de L a t t r e de Tassigny, F-75775 P a r i s Cedex 16, France cattoQceremade.dauphine.fr CEGLA, W O J C I E C H
I n s t i t u t e of Theoretical Physics, University of Wroclaw, 50-204 pl.Maksa Borna 9, Wroclaw, Poland ceglaOift.uni.wroc.pi CELEGHINI, ENRICO
D i p a r t i m e n t o di Fisica, Universita di Firenze, Via Sansone 1, 150019 Sesto Fiorentino (Firenze) Italy celeghiniQfi.infn.it CERCHIAI, BIANCA LETIZIA
Lawrence Berkeley National Laboratory, Theory G r o u p , Bldg 50A5104, 1 Cyclotron Road, Berkeley, California 94720, USA BLCerchiaiSlbl.gov CHALLIPOUR, JOHN
chencineQimcce.fr CHERKIS, SERGEY
Trinity College, University of Dublin, College Green, Dublin 2, Ireland cherkisGmaths.ted.ie CHRUSCIEL, P I O T R
D e p a r t e m e n t de M a t h e m a t i q u e s , Faculte des Sciences, P a r e de G r a n d m o n t , F-37200 Tours, France chruscielQuniv-tours.fr CHRUSCINSKI, DARIUSZ
Inst. Of Physics, Nicolaus Copernicus University, Grudzia-Ced. 5 / 7 , 87-100 Torun, Poland darchQphys.uni.torun.pi ClOLLI, FABIO D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "Tor Vergata", Via della Ricerca Scientifica 1, 00133 Roma, Italy ciolliGdipmat.unime.it CIPRIANI, FABIO
Politecnico de Milano, piazza Leonardo d a Vinci 32, 20133 Milano, Italy fabio.ciprianiUmate.polimi.it CIPRIANO, FERNANDA
G F M U L , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, P o r t u g a l ciprianoSgfm.cii.fc.ul.pt CIRILLO, EMILIO
D i p a r t i m e n t o M e M o M a t , via A. Scarpa 16, 00161 Roma, Italy cirillofldmmm. uniromal. i t CLARK, JEREMY
University California, Davis, Dep. Of M a t h e m a t i c s , O n e Shield Av., Davis CA95616, USA Jeremyfflmath.ucdavis.edu COSTA SANTOS, RUBEN
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal
Dept. of Physics, I n d i a n a University, Bloomington, IN 47405, USA
costaflphys.uu.nl
challifoQindiana.edu
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal rcoutinQmath.ist.utl.pt
CHAVES, ROGERIO
Av. Lauricio P. Rasmussem, 1901, St. Morais, G o i a n i a - G O , C E P 74620-030, Brazil rogerio.chavesQwolfson.ox.ac.uk CHAYES, LINCOLN
M a t h e m a t i c s D e p a r t m e n t , University of California, Los Angeles, CA 90095-1555, USA lchayesGmath.ucla.edu
COUTINHO, RICARDO
CRUZEIRO, ANA BELA
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal abcruzQmath.ist.utl.pt
CHEN, YIH-YUH
D ' A N T O N I , CLAUDIO
Physics D e p a r t m e n t , National Taiwan University, Taipei, Taiwan 106, Republic of C h i n a yychenfflphys.ntu.edu.tw
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "Tor Vergata", Via della Ricerca Scientifica, 00133 R o m a , Italy dantoniQmat.uniroma2.it
680
Participants: DE FALCO - FARIS
DE FALCO, DIEGO
DUNLOP, FRANCOIS
D i p a r t i m e n t o di Scienze deirinformazione, Universita di Milano, Via Comelico 39, 1-20100 Milano, Italy defalcoOdsi.unimi.it
Laboratoire de Physique Theorique et Modelisation, Universite de Cergy-Pontoise, 2 rue Adolphe Chauvin, B.P. 222 Pontoise, 95302 Cergy-Pontoise, France dunlopOptm.u-cergy.fr
DE LA M A D R I D , RAFAEL
D e p a r t a m e n t o de Fi'sica Teorica, Universidad del Pais Vasco, 48080 Bilbao, Spain wtbdemorOlg.ehu.es DE MELO, WELINGTON
DUNNING, TANIA CLARE
University of Queensland, M a t h e m a t i c s , Brisbane Qld 4072, Australia tcdlOyork.ac.uk
I n s t i t u t o de M a t e m a t i c a P u r a e Aplicada, E s t r a d a D o n a Castorina, 110, J a r d i m Botanico, Rio de Janeiro, 22460-320, Brazil demeloQimpa.br
n j e20phy. duke. edu
DE SANCTIS, LUCA
ECKMANN, J E A N - P I E R R E
via Boccaccio, 2700013 Fonte Nuova, Roma, Italy
D e p a r t e m e n t de Physique Theorique a n d Section de M a t h e m a t i q u e s , 32, bid. D'Yvoy, Universite de Geneve, 1211 Geneve 4, Switzerland
IdeOmath.princeton.edu DEIFT, PERCY
EARLY, NICHOLAS
Duke University, Physics, Science Dr., Box 90305, D u r h a m , NC 27708, USA
C o u r a n t I n s t i t u t e , 251 Mercer Street, New York NY 10012, USA
j ean-pierre. eckmannOphysics. unige. ch
deiftQcims.nyu.edu
D e p a r t m e n t of Particle Physics, University of Santiago de Compostela, E-15782 Santiago de Compostela, Spain
D E M U T H , MICHAEL
Technical University of Clausthal, I n s t i t u t e of M a t h e m a t i c s , Erzstrasse 1, 38678 Clausthal-Zellerfeld, Germany demuthOmath.tu-clausthal.de DEREZINSKI, JAN
EDELSTEIN, J O S E D.
jedelsOmath.ist.utl.pt ELGART, ALEXANDER
D e p a r t m e n t of M a t h e m a t i c s , Bldg. 380, Stanford University, Stanford, CA, 94305, USA
Dep. M a t h . Meth. in P h y s . , Warsaw University, Hoza 74, 00-682 Warszawa, P o l a n d
elgartOmath.Stanford.edu
j an. derezinskiOf uw. edu. pi
University of Paris 7, Institut de M a t h e m a t i q u e s , Case 7012, 2, place Jussieu, 75251 Paris cedex 05, France hakaneOmath.jussieu.fr
DlAS, GONQALO
Av. Brasil 155 7°D, PT-1700-067 Lisboa, P o r t u g a l cmflaminioOnetc.pt DIENG, M O M A R
D e p a r t m e n t of M a t h e m a t i c s , T h e University of Arizona, 617 N. S a n t a R i t a Ave., P.O. Box 210089, Tucson, AZ 85721-0089, USA momarflmath.arizona.edu DIMOCK, JONATHAN
D e p a r t m e n t of M a t h e m a t i c s , SUNY at Buffalo, Buffalo, NY, 14260, USA dimockOacsu.buffalo.edu DISERTORI, MARGHERITA
Viale Trieste 13, 38100 Trento, Italy disertorQphys.ethz.ch DITTRICH, JAROSLAV
Nuclear Physics I n s t i t u t e ASCR, CZ-250 68 Rez, Czech Republic dittrichQujf.cas.cz DOLGOPYAT, DMITRY
University of Maryland, Dept. of M a t h e m a t i c s , College P a r k M D 20742-4015, USA dmitryOmath.umd.edu DOREY, PATRICK
D e p a r t m e n t of M a t h e m a t i c a l Sciences, Science Site, S o u t h Road, D u r h a m D H 1 3LE, UK p. e . doreyOdur .ac.uk DOUGLAS, MICHAEL R.
Dept. of Physics, Rutgers University, Piscataway NJ 08855, USA mrdOphysics.rutgers.edu
ELIASSON, HA KAN
ELSKENS, Y V E S
Eq. turbulence plasma, case 321, Laboratoire de physique des interactions ioniques et moleculaires, U M R 6633 CNRS-univ. Provence, campus Saint-Jerome, F-13397 Marseille cedex 13, France elskensOup. univ-mrs. f r ERDOS, LASZLO
Mathematisches I n s t i t u t , LMU, Theresienstr. 39, D-80333 Munich, G e r m a n y lerdosOmath.gatech.edu ESTEBAN, MARIA J.
Universite de Paris-Dauphine, Ceremade, PI. Marechal de Tassigny, F-75775 Paris cedex 16, France estebanOceremade.dauphine.fr EVANS, DAVID
School of M a t h e m a t i c s , Senghennydd Road, Cardiff University, Cardiff, C F 2 4 4AG Wales, UK EvansDEOcf.ac.uk EXNER, PAVEL
D e p a r t m e n t of Theoretical Physics, N P I , Czech Academy of Sciences, 25068 R e z - P r a g u e , Czechia exnerOuj f . c a s . c z FALCO, PIERLUIGI
D i p a r t i m e n t o di M a t e m a t i c a , Piazzale Aldo Moro, 00185 Roma, Italy falcoOmat.uniromal.it FARIA DA VEIGA, PAULO AFONSO
I C M C - U S P , C.P. 668, Av. do T r a b a l h a d o r Sancarlense 400, 13560-970 Sao Carlos, SP, Brasil veigaOicmc. usp. br
DOYON, BENJAMIN
University of Oxford, Dept. of Physics, P a r k s Road, Oxford O X 1 3 P U , UK b. doyonlOphysics. ox. ac. uk
FARIS, WILLIAM
D e p a r t m e n t of M a t h e m a t i c s , University of Arizona, Tucson, Arizona 85721, USA farisOmath.arizona.edu
Participants:
FELDER
-
GOLDBAUM
681
FELDER, GIOVANNI
GALLAVOTTI, GIOVANNI
D e p a r t m e n t of M a t h e m a t i c s , E T H - Z e n t r u m , 8092 Zurich, Switzerland
D i p a r t i m e n t o di Fisica, INFN, Universita di R o m a "La Sapienza", Piazzale Aldo Moro, 2, 1-00185 Roma, Italy giovanni.gallavottiQromai.infn.it
Giovanni.felderUmath.ethz.ch FELDMAN, J O E L
D e p a r t m e n t of M a t h e m a t i c s , University of British Columbia, 1984 M a t h e m a t i c s Road, Vancouver, B C C a n a d a V 6 T 1Z2 f eldmamSmath. ubc. ca
GANNON, T E R R Y
D e p a r t m e n t of M a t h e m a t i c a l Sciences, University of Alberta, E d m o n t o n , A l b e r t a , C a n a d a T 6 G 2G1 tgannonGlafleur.math.ualberta.ca
FERNANDES, RUI LOJA
GARCIA CALDERON, GASTON
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal rfernQmath.ist.utl.pt
UNAM, Inst. Fisica, Mexico City, Mexico gastonQfenix.if isicacu.unam.mx
FERNANDES, SARA
D e p a r t a m e n t o de M a t e m a t i c a , Universidade de Evora, R. R o m a o R a m a l h o , 59, PT-7000-671 Evora, P o r t u g a l saffluevora.pt FERREIRA, FERNANDA
E S E I G - I n s t i t u t o Politecnico do P o r t o , R u a D. Sancho I, 981, PT-4480-876 Vila do Conde, P o r t u g a l znf. f e r r e i r a Q n e t c a b o . p t F E W S T E R , CHRISTOPHER
D e p a r t m e n t of M a t h e m a t i c s , University of York, Heslington, York, YO10 5DD, UK cjf3
I n s t i t u t fur Angewandte M a t h e m a t i k Universitat Bonn, Wegelerstr. 6, 53115 Bonn, G e r m a n y f incoQwiener. iam. uni-bonn. de
GASPARD, P I E R R E
Center for Nonlinear P h e n o m e n a and Complex Systems, C a m p u s Plaine, Code Postal 231, Universite Libre de Bruxelles, B-1050 Brussels, Belgium gaspardQulb.ac.be GAWEDZKI, KRZYSZTOF
ENS-Lyon, Laboratoire de Physique, 46, Allee d'ltalie, 69364 Lyon cedex 07, France kgawedzkHens-lyon.fr GENTILE, GUIDO
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a Tre, Largo San Leonardo Murialdo 1, 00146 R o m a , Italy gentileQmat.uniroma3.it G E R A R D , CHRISTIAN
Laboratoire de M a t h e m a t i q u e s , B a t i m e n t 425, Universite de Paris Sud, 91405 Orsay cedex, France c h r i s t i a n . gerardQznath. u-psud. f r GESUALDO, DELFINO
F I O R E , GAETANO
SISSA, Via Beirut 2-4, 34014 Trieste, Italy
D i p a r t i m e n t o di M a t e m a t i c a e Applicazioni, Universita di Napoli "Federico II", Via Claudio 2 1 , 1-80125 Napoli, Italy
delfinoQsissa.it
gaetano.f ioreflna.infn.it FLOREANINI, ROBERTO
INFN, c / o D i p a r t i m e n t o di Fisica Teorica, S t r a d a Costiera, 11, 34014 Trieste, Italy floreanflts.infn.it FLORENTINO, CARLOS
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal cflorenQmath.ist.utl.pt FONSECA, CARLOS
University of Coimbra, M a t h e m a t i c s , C M U C , A p a r t a d o 3008, PT-3001-454 Coimbra, P o r t u g a l erfonsecaQnetcabo.pt FORSTER, CHRISTINA
I n s t i t u t fiir Experimentalphysik, Universitat Wien, Strudelhofgasse 4, A-1090 Wien, Austria Christina.forsterQexp.univie.ac.at FRANgoiSE, J E A N - P I E R R E
Universite P.-M. Curie, Paris VI, Laboratoire J.-L. Lions, U M R 7598 C N R S , 175 Rue de Chevaleret, BC 187, 75252 Paris, France jpfficcr.jussieu.fr FREDENHAGEN, KLAUS
II. I n s t i t u t fiir Theoretische Physik, L u r u p e r Chaussee 149, D-22761 H a m b u r g , G e r m a n y klaus. f redenhagenQdesy. de FREITAS, P E D R O
D e p a r t a m e n t o de M a t e m a t i c a , I n s t i t u t o Superior Tecnico, PT-1049-001 Lisboa, P o r t u g a l pf reitasHmath. ist. utl. pt
GEYLER, VLADIMIR
c/o Prof. J.Bruening, I n s t i t u t fiir M a t h e m a t i k , H u m b o l d t Universitat, Rudower Chaussee 25, 12489 Berlin, G e r m a n y geylerQmrsu. ru GIANNAKIS, IOANNIS
Khuri Lab, 1230 York Avenue, Rockefeller University, New York, NY 10021, USA giannakflsummit.rockefeller.edu G I E R E , ECKHARD
T U Clausthal, I n s t i t u t fiir M a t h e m a t i k , Erzstrasse 1, 38678 Clausthal-Zellerfeld, G e r m a n y gierefBmath. t u - c l a u s t h a l . de GIORGILLI, ANTONIO
D i p a r t i m e n t o di M a t e m a t i c a e Applicazioni, Via R. Cozzi 53, 1-20125 Milano, Italy antonioSmatapp.unimib.it GIULIANI, ALESSANDRO
Via Ivanoe Bonomi 92, 00139, R o m a , Italy alessandro.giulianiQromal.infn.it GODINHO, SOFIA
G F M U L , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, P o r t u g a l sgodinhoQcii.fc.ul.pt GOLDBAUM, PEDRO
Princeton University, Physics, Princeton NJ 08544, USA goldbaumtDprinceton. edu
682
Participants:
GOLSE - H U B F F E L
GOLSE, FRANQOIS
HAINZL, CHRISTIAN
Laboratoire J.-L. Lions, Universite Paris 6, Boite courrier 187, 4 place Jussieu, 75252 Paris cedex 05, France golse(3math.jussieu.fr
D e p a r t m e n t of M a t h e m a t i c s , University of Copenhagen, Universitetsparken 5, 2100 Copenhagen, Denmark
GOLUBEVA,
HAIRER, MARTIN
VALENTINA
Viniti, 20 Usievicha Str. Moscow, 125190, Russia golubflviniti.ru GOMIS, JAUME
Perimeter I n s t i t u t e , Theor. Physics, 31 Caroline Str., Waterloo, O n t a r i o , C a n a d a gomisQtheory.caltech.edu GRACIO, CLARA
Universidade de Evora, Colegio Luis Antonio Verney, D e p a r t a m e n t o de M a t e m a t i c a , R u a R o m a o R a m a l h o , 59, PT-7000-671 Evora, P o r t u g a l mgracioOuevora.pt G R A F , GIAN MICHELE
I n s t i t u t fur Theoretische Physik, Schafmattstr. 32, E T H Honggerberg, H P Z G 19.2, 8093 Zurich, Switzerland gmgrafOitp.phys.ethz.ch
hainzlQmath.ku.dk
M a t h e m a t i c s D e p a r t m e n t , T h e University of Warwick, CV4 7AL Coventry, UK hairerflmaths.Warwick.ac.uk HARSHMAN, NATHAN
American University, C o m p u t e r Sciences, 4400, Massachusetts Av., N W , Washington DC 20016-8058, USA harshmanQamerican.edu HASLER, DAVID
P.O. Box 400137, University of Virginia, Charlottesville, VA 22904-4137, USA haslerfflmath.ku.dk H E L F F E R , BERNARD
D e p a r t e m e n t de M a t h e m a t i q u e s , B a t i m e n t 425, Universite Paris-Sud, 91405 Orsay cedex, France Bernard.HelfferOmath.u-psud.fr
GRANDATI, Y V E S
HENKEL, MALTE
Universite de Metz, U F R Sci. F.A. C a m p u s Bridoux, 57070 Metz, France grandati<3sciences. univ-metz. f r
Laboratoire de Physique des M a t e r i a u x , Universite Henri Poincare - Nancy I, B.P. 239, F-54506 Vandoeuvre les Nancy cedex, France henkelQlpm.u-nancy.fr
GRIESEMER, MARCEL
Univ. S t u t t g a r t , M a t h e m a t i c s , Pfaffenwaldrina 57, 70569 S t u t t g a r t , G e r m a n y Marcel.GriesemerQmathematik.uni-stuttgart.de GROSSE, HARALD
University of Vienna, I n s t i t u t e of Theoretical Physics, Boltzmanngasse 5, A-1090 Vienna, Austria harald.grosseOmailbox.univie.ac.at GROSSKINSKY, STEFAN
HERDEGEN, ANDRZEJ
I n s t i t u t e of Physics, Jagiellonian University, 30-059 Cracow, Poland herdegeniBth. i f . uj . edu. p i HERNANDEZ-GARDUNO, ANTONIO
D p t o . MMyN, IIMAS-UNAM, A p d o . Postal 20-726, Mexico City 01000, Mexico ahernandez<91eibniz. i imas. unam. mx
Zentrum M a t h e m a t i k , Bereich M5, Technische Universitat Miinchen, D-85747 Garching bei Miinchen, Germany stefartgSma.tum.de
Czech Techn. Univ., Physics, Brehova 7, 11519 P r a g u e 1, Czech Republic hlavatyfflfjfi.cvut.cz
GROTHAUS, MARTIN
HOLLANDS, STEFAN
M a t h e m a t i c s D e p a r t m e n t , University of Kaiserslautern, P.O.Box 3049, 67653 Kaiserslautern, Germany grothausQmathematik.uni-kl.de
Georg-August Universitat G o t t i n g e n , Physics, Friedrich-Hund P l a t z 1, 37077 G o t t i n g e n , G e r m a n y hollandsQtheorie.physik.uni-goettingen.de
GRUBER, MICHAEL J.
Klingenberger Strasse 24, 01187 Dresden, G e r m a n y holsteinQmpipks-dresden. mpg. de
Universitat Augsburg, I n s t i t u t fur Physik, T P II, 86135 Augsburg, G e r m a n y Michael. GruberCPhys ik. Uni-Augsburg. DE GUERRA, FRANCESCO
D i p a r t i m e n t o di Fisica, Universita di R o m a "La Sapienza", Piazzale Aldo Moro, 2, 1-00185 Roma, Italy francesco.guerraQromal.infn.it
HLAVATY, LADISLAV
HOLSTEIN, D E T L E F
HONG, DOOJIN
Univ. of Wisconsin, Fond du Lac, M a t h e m a t i c s , 400 Univ. Drive, W I 54935-2950, USA dhongffluwc. edu HONKONEN,
JUHA
Shaulu Street 1-98, Riga LV-1055, Latvia sharifQone.lv
Theory Division, D e p a r t m e n t of Physical Sciences, P.O. Box 64 ( G u s t a v Hallstromin katu 2), FI-00014 University of Helsinki, Finland j uha.honkonenShelsinki.f i
HAHN, ATLE
HORWITZ, LAURENCE P .
I n s t i t u t fur Angewandte M a t h e m a t i k , Abteilung Wahrscheinlichkeitstheorie und Mathematische Statistik, Universitat Bonn, Poppelsdorfer Allee 82, 53115 Bonn, G e r m a n y
Univ. Tel Aviv, Physics, R a m a t Aviv 69978 Israel
GUSEINOV, SHARIF
hahnQuni-bonn.de
larryHpost.tau.ac.il HUEFFEL, HELMUTH
Univ. Wien, Theoretical Physics, Boltzmanngasse 5, A-1090 Wien, Austria helmuth.hueffelQunivie.ac.at
Participants: IANTCHENKO - KOVARIK
IANTCHENKO, ALEXEI
K I N G , CHRISTOPHER
Videgatan 6, SE-213 62, Malmo, Sweden
D e p a r t m e n t of M a t h e m a t i c s , 567 Lake Hall, Northeastern University, Boston MA 02115, USA kingOneu.edu
aiOts.mah.se ICHINOSE, TAKASHI
Dept of M a t h e m a t i c s , Faculty of Science, Kanazawa University, Kanazawa, 920-1192, J a p a n ichinoseOkenroku.kanazawa-u.ac.jp IMBRIE, JOHN
21 Pembroke Dr, Lake Forest, IL 60045, USA ji2k
Research I n s t i t u t e for M a t h e m a t i c a l Sciences, Kyoto University, Kyoto, 606-8502 J a p a n reiOgokutan.c.u-tokyo.ac.jp IOFFE, DMITRY
William Davidson Faculty of Industrial Engineering and Management, Technion City, Haifa 32000, Israel ieioffeOie.technion.ac.il ITO, KEIICHI R. D e p a r t m e n t of M a t h . & Phys., S e t s u n a n University, Ikeda-Naka-Machi 17 - 8 , 572-8508 Osaka, J a p a n
683
KIRSCH, W E R N E R
R u h r Universitat Bochum, A r b e i t s g r u p p e Mathematische Physik, F a k u l t a t fur M a t h e m a t i k NA 3 / 2 9 , Universitiitsstrafie 150, D-44780 Bochum, Germany Werner.kirschOruhr-uni-bochum.de KITANINE, NIKOLAI
L P T M , Universite de Cergy-Pontoise, 2 av. Adolphe Chauvin, 95302 Cergy-Pontoise cedex, France nikolai,kitanineOptm.u-cergy.fr KLEEFELD, FRIEDER
Centro de Fi'sica d a s Interaccoes F u n d a m e n t a l s ( C F I F ) , I n s t i t u t e Superior Tecnico, Ediffcio Ciencia, Piso 3, Av. Rovisco Pais 1, PT-1049-001 Lisboa, Portugal kleefeldOcfif.ist.utl.pt KLEIN, ABEL
itoOmpg.setsunan.ac.jp
D e p a r t m e n t of M a t h e m a t i c s , University of California, Irvine, CA 92697-3875, USA
J A F F E , ARTHUR
akleinfflmath.uci.edu
Harvard University, Lyman L a b o r a t o r y of Physics, 17 Oxford Street, Cambridge MA 02138, USA
KLEIN, CHRISTIAN
jaffeflphysics.harvard.edu
MPI for M a t h e m a t i c s in t h e Sciences, Inselstr. 22, 04103 Leipzig, Germany
JANUARIO, CRISTINA
kle inOmppmu. mpg. de
Alameda dos Oceanos, 4.28.OIF, 6A, PT-1990-237 Lisboa, P o r t u g a l
KNORRER, H O R S T
cj anuarioOdeq.isel.ipl.pt
JlMBO, MlCHIO University of Tokyo, M a t h e m a t i c s , Meguro, Tokyo 153, J a p a n j imbomicOms.u-tokyo.ac.jp J O N E S , VAUGHAN F. R.
UC Berkeley, M a t h e m a t i c s , Berkeley, C A 94720, USA vfrOmath.berkeley.edu J O Y C E , WILLIAM
D e p a r t m e n t of Physics & Astronomy, University of C a n t e r b u r y , P r i v a t e B a g 4800, Christchurch, New Zealand w.j oyceOphys.canterbury.ac.nz JULIA, BERNARD
L P T E N S , 24 rue Lhomond, 75005 Paris, France bernard.j uliaOens.fr JUNKER, WOLFGANG
Universitat Hannover, I n s t i t u t fur M a t h e m a t i k , Welfengarten 1, D-30167 Hannover, G e r m a n y j unkerOae i.mpg.de KALOSHIN, VADIM
M a t h e m a t i c s 253-37, Caltech, P a s a d e n a , C A , 91106, USA kaloshinOits.caltech.edu KAWAHIGASHI, YASUYUKI
D e p a r t m e n t of M a t h e m a t i c a l Sciences, University of Tokyo, K o m a b a , Tokyo, 153-8914, J a p a n yasuyukiOms.u-tokyo.ac.jp K E Y L , MICHAEL
Trollblumenweg 33, 12357 Berlin, G e r m a n y m.keylOtu-bs.de KHOLODENKO, ARKADY L.
375 H. L. Hunter Laboratories, Clemson University, Clemson, SC 29634-0973, USA stringOclemson.edu
M a t h e m a t i k , E T H - Z e n t r u m , C H 8092 Zurich, Switzerland knoerrerOmath.ethz.ch KOBAYASHI, TSUNEHIRO
Research Center on Higher E d u c a t i o n for t h e Hearing and Visually Impaired, A m a k u b o 4-3-15, Tsukuba-shi, Ibarki 305-0005, J a p a n kobayashOa.tsukuba-tech.ac.jp KOCIAN, PAUL
Friedrichstrasse 12, CH-8051 Zurich pkocianOpci. unizh. ch KONDRATIEV, YURI
Forsch. BiBos, Bielefeld Univ., D 33615, Bielefeld, Germany kondratOmathemat ik.uni-bielefeld.de K O N O T O P , VLADIMIR
C F T C , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal konotopOcii.f c .ul.pt KOSTRYKIN, VADIM
Fraunhofer-Institut fuer Lasertechnik, Steinbachstrasse 15, D-52074 Aachen, Germany k o s t r y k i n O i l t . f raunhof e r . de KOTECKY, ROMAN
Charles University, Theoretical Physics, M F F , P r a h a 1, 11000, Czech Republic koteckyOcucc. ruk. c u n i . cz KOUKIOU, FLORA
Universite de Cergy-Pontoise, L P T M 2 , av. Adolphe Chauvin, 95302 Cergy-Pontoise cedex, France flora.koukiouOu-cergy.Ir KOVARIK, HYNEK
Faculty of M a t h e m a t i c s a n d Physics, S t u t t g a r t University, Pfaffenwaldring 57, D-70569 S t u t t g a r t , Germany kovarikOmathematik.uni-stuttgart.de
684
Participants:
KRASOVSKY
- Loss
KRASOVSKY, IGOR
LEBOWITZ, J O E L
Max P l a n k I n s t i t u t e , Physics, Dresden D-01187, Germany
Center for M a t h e m a t i c a l Sciences Research, R u t g e r s University, 110 Frelinghuysen Road, Piscataway, N J 08854-8019 lebovitzQmath.rutgers.edu
ivktfmath.tu-berlin.de KREJCIRIK, DAVID
D e p a r t m e n t of Theoretical Physics, Nuclear Physics I n s t i t u t e , AS C R , 25068 Rez, Czech Republic dkrejQmath.ist.utl.pt KRICHEVER, IGOR
Columbia University, M a t h e m a t i c s , 2990 Broadway, New York 10027, USA krichevQmath. Columbia. edu KRIZ, JAN
Univ. of Hradec, Physics, Kralove, Czech Republic kriztDujf . c a s . c z KuCKERT, B E R N D
II. I n s t i t u t fiir Theoretische Physik, L u r u p e r Chaussee 149, D-22761 H a m b u r g , G e r m a n y bernd.kuckertCdesy.de KUKSIN, SERGEI
H e r i o t - W a t t University, M a t h e m a t i c s , Edinburgh EH14 4AS, Scotland, UK S.B.KuksinQma.hw.ac.uk KUNZLE, H A N S - P E T E R
Dept. of M a t h e m a t i c a l and Statistical Sciences, University of Alberta, E d m o n t o n , C a n a d a T 6 G 2G1 hp.kunzleQualberta.ca KURASOV, PAVEL
D e p t . of M a t h e m a t i c s , LTH, Box 118, Lund University, SE-221 00 Lund, Sweden kurasovQmaths.1th.se KURENNAYA, KRISTINA
Donetsk University, M a t h e m a t i c s , 83055, Donetsk, Ukraine kurennayaQmatfak.dongu.donetsk.ua KUZNETSOV, EVGENII
L a n d a u I n s t i t u t e for Theoretical Physics, Russian Acad, of Sciences, Kosygina str., 2, 119334, Moscow, Russia kuznetso(3itp .ac.ru LAGES, J O S E
Universite P a u l Sabatier, Laboratoire de Physique Theorique, U M R 5152, Toulouse, France lagssUcfif.ist.utl.pt LANG MANN, EDWIN
M a t h e m a t i c a l Physics, Dept. of Physics, K T H , AlbaNova University C e n t e r , SE-106 91 Stockholm, Sweden langmanntatheophys . k t h . s e LAPTEV, A R I
D e p a r t m e n t of M a t h e m a t i c s , K T H , SE-100 44 Stockholm, Sweden laptevQmath. kth. se LAWLER, GREGORY
D e p a r t m e n t of M a t h e m a t i c s , Malott Hall, Cornell University, Ithaca, NY 14853-4201, USA lawlerftaath.Cornell.edu LEANDRE, REMI
I n s t i t u t de M a t h e m a t i q u e s de Bourgogne, Universite de Bourgogne, B.P. 47870, 21078 Dijon cedex, Prance remi.leandreQu-bourgogne.fr
LENCI, M A R C O
Stevens I n s t i t u t e of Technology, M a t h e m a t i c s , Hoboken, N J 0703, USA mlenciOaath.Stevens.edu LENDI, KARL
I n s t i t u t e of Physical Chemistry, University of Zurich, W i n t e r t h u r e r s t r . 190, CH-8057 Zurich, Switzerland carlenQpc i . u n i z h . c h LEROUX, P H I L I P P E
Univ. Rennes 1, U F R M a t h e m a t i q u e s , I R M A R , C a m p u s Beaulieu, 35042 Rennes, France p h i l i p p e . lerouxGuniv-rennesl. f r LESCHKE, H A J O
I n s t i t u t fiir Theoretische Physik I, Universitat Erlangen-Niirnberg, StaudtstraBe 7, D-91058 Erlangen, G e r m a n y hajo. leschkeGphysik. uni-erlangen. de LEV, ONDREJ
Czech Techn. Univ., F N S P E , Brehova 7, 11519, P r a h a , Czech Republic levQkmlinux.fjfi.cvut.cz LEVY, THIERRY
Ecole Normale Superieure, Dept. M a t h e m a t i q u e s , Equ. Probabilite, 45 rue d'Ulm, F-75230, Paris cedex 05, France thierry.levyGens.fr LIEB, ELLIOTT
J a d w i n Hall, Physics Dept., P r i n c e t o n University, P . O . Box 708, P r i n c e t o n , N J 08544-0708, USA liebQmath.princeton.edu LLEDO, FERNANDO
I n s t i t u t fiir Reine und A n g e w a n d t e M a t h e m a t i k , R W T H Aachen, Templergraben 55, D-52056 Aachen, Germany lledoQiram.rwth-aachen.de L O B O , FRANCISCO
Centro de A s t r o n o m i a e Astrofi'sica d a Universidade de Lisboa (CAAUL), C a m p o G r a n d e , Ed. C8, PT-1749-016 Lisboa, P o r t u g a l f loboQcosmo. f is. f c. ul. pt LOEFFEL, JEAN-JACQUES
Univ. Lausanne, Theoretical Physics, Dorigny, CH-1015 L a u s a n n e , Switzerland j ean-j acques.loeffelQipt.unil,ch LONGO, ROBERTO
Univ. di R o m a 1, Via Ricerca Scientifica 1, 1-00133, R o m a , Italy dopliche$mat.romal.it LOPES, PEDRO
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001, Lisboa, Portugal pelopesQmath.ist.utl.pt Loss, MICHAEL
Georgia Tech, M a t h e m a t i c s , A t l a n t a , Georgia 30332, USA lossSmath.gatech.edu
Participants:
LUDU -
MOODY
685
LUDU, ANDREI
MASCARI, GIOVANNI FRANCESCO
Northwestern S t a t e University, D e p a r t m e n t of Chemistry and Physics, Natchitoches, LA 71497, USA luduaflnsula.edu
IAC CNR, Viale del Policlinico 137, 00161 Rome 4, Italy mascarifliac.rm.cnr.it
LYTVYNOV, EUGENE
MATHER, JOHN
Univ. of Wales, Swansea, M a t h e m a t i c s , Singleton P a r k , SA2 8 P P , U.K
Princeton University, D e p a r t a m e n t of M a t h e m a t i c s , Fine Hall, Washington Road, Princeton NJ 08544 1000 USA
lytvynovflwiener.iam.uni-bonn.de MACKAAY, M A R C O
Universidade do Algarve, D e p a r t a m e n t o de M a t e m a t i c a , C a m p u s de G a m b e l a s , PT-8005-139, Faro, Portugal
jnmflmath.princeton.edu MATIAS, P E D R O
mmackaayflualg. pt
P r a c e t a Natalia Correia, Lote 9, 3 B, PT-2660-314 Santo Antonio dos Cavaleiros, Portugal pmatiasflfisica.ist.utl.pt
MAGALHAES, LUIS T.
MAZZUCCHI, SONIA
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais PT-1049-001, Lisboa, Portugal
Dip. M a t e m a t i c a , Universita di Trento, via Sommarive, 38050 Povo - Trento, Italy mazzucchflscience. u n i t n . i t
lmagalflmath.ist.utl.pt MAILLET, J E A N M I C H E L
Laboratoire de Physique, ENS Lyon, 46 allee d'ltalie, 69364 Lyon cedex, France mailletflens-lyon.fr MAJEWSKI, WLADYSLAW ADAM
I n s t i t u t e of Theoretical Physics and Astrophysics, Gdansk University, W i t a Stwosza 57, 80-952 Gdansk, Poland fizwamfluniv.gda.pl MAKINO, HIRONORI
D e p a r t m e n t of H u m a n &; Information Science at Tokai University, K i t a k a n a m e 1117, Hiratsuka-shi, K a n a g a w a 259-1292, J a p a n makinofltokai.ac.jp MANOJLOVIC, NENAD
D e p a r t a m e n t o de M a t e m a t i c a , Faculdade de Ciencias e Tecnologia, Universidade do Algarve, C a m p u s de G a m b e l a s , PT-8005-139 Faro, Portugal nmano jflualg. pt MARCHETTI, DOMINGOS
I n s t i t u t o de Fisica, Universidade de Sao Paulo, R u a do M a t a o , Trav. R 187, 05508-120 Sao Paulo, SP, Brasil marchettQif.usp.br MARCIC, MILAN
University of Maribor, Faculty of Mechanical Engineering, Smetanova 17, 2000 Maribor, Slovenia milan.marcicffluni-mb.si MARECKI, P I O T R
Wyzsza Szkola Inform., 43-300 Bielsko-Biala, Poland mareckiflmail.desy.de MARINO, MARCOS
T h e o r y Division, D e p a r t m e n t of Physics, C E R N , Geneva 23, CH-1211 Switzerland marcosflmail.cern.ch MARQUES, ALESSANDRO
R. Sao Carlos do Pinhal, 508, apto. 133, Bela Vista, C E P 01333-905 Sao Paulo - SP - Brasil amarquesflfma.if.usp.br MARTIN, P H I L I P P E
E P F L . Physique Theorique, 1015 Lausanne, Switzerland philippe-andre.martinflepf1.ch MARTINS, N U N O
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , A v. Rovisco Pais, PT-1049-001 Lisboa, Portugal nmartinsflmath.ist.utl.pt
MCLAUGHLIN, KENNETH
2427 E 1st Street, Tucson, AZ 85719, USA mclflamath.unc.edu MENDES, DIANA A.
I S C T E , D p t . Metodos Q u a n t i t a t i v o s , Av. For$as A r m a d a s PT-1649 Lisboa, Portugal diana.mendesfliscte.pt MESTEL, BENJAMIN
D e p a r t m e n t of C o m p u t i n g Science & M a t h e m a t i c s , University of Stirling, Stirling F K 9 4LA, Scotland, UK b.d.mestelflex.ac.uk MEUSBURGER, CATHERINE
Max Planck Inst., Am Muhlenberg 1, D 14476 Golm, Germany c.meusburgerAma.hw.ac.uk MICHOEL, T O M
Bioinformatics & Evolutionary Genomics, D e p a r t m e n t of P l a n t Systems Biology, V I B / G h e n t University Technologiepark, 927 B-9052 Gent, Belgium torn.michoelApsb.ugent.be MlCKELSSON, JOUKO
K T H , Roslagstuilsbacken 11, 10691 Stockholm, Sweden jouko.mickelssonflhelsinki.fi MIHAILOVICH, IVANOV VLADIMIR
ul. Kostromskaja, 21, Ekaterinburg, 620033, Russia ivmflusue. ru MIKOVIC, ALEKSANDAR
Dept. Matematica, Universidade Lusofona de Humanidades e Tecnologias, Av. do C a m p o G r a n d e , 376, PT-1749-024 Lisboa, P o r t u g a l amikovicflulusofona.pt MIRACLE-SOLE, SALVADOR
Centre de Physique Theorique, C N R S Luminy, Case 907, 13288 Marseille cedex 9, France miracleflcpt.univ-mrs.fr MISRA, AALOK
D e p a r t m e n t of Physics, Indian I n s t i t u t e of Technology Roorkee, Roorkee - 247 667 U t t a r a n c h a l , India aalokfphfliitr.ernet.in MlTTER, PRONOB
Universite Montpellier 2, L P T A , France mitterfllpm.univ-montp2.fr MOODY, ROBERT V.
1675 Prospect Place, Victoria, B C , V8R 5X7, C a n a d a rmoodyflualberta.ca
686
Participants: MORALES-GUZMAN - OLLA
MORALES-GUZMAN, J. DIONISIO
NATARIO, J O S E
Universidad A u t o n o m a M e t r o p o l i t a n a - A z c , Ciencias Basicas, Av. San Pablo 180, H-335, Col. Reynosa Tamaulipas, Azcapotzalco 02200, Mexico D F jdmgOcorreo. a z c . uam. mx
D e p a r t a m e n t o de M a t e m a t i c a , I n s t i t u t o Superior Tecnico, Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal jnatarOmath.ist.utl.pt
MORCHIO, GIOVANNI
NEKRASOV, NIKITA
D i p a r t i m e n t o di Fisica dell'Universita, Largo B. Pontecorvo, 56100 Pisa, Italia
Institut des H a u t e s E t u d e s Scientifiques, Le Bois-Marie, 35 r o u t e de C h a r t r e s , 91440 Bures-sur-Yvette, France nikita0ihes.fr
morchioOdf.unipi.it M O R O , ANTONIO
D i p a r t i m e n t o di Fisica dell'Universita di Lecce, via per Arnesano, c.a.p. 73100, Lecce, Italy antonio. moroOle. inf n. it MORRISON, DAVID
N E T O , ORLANDO
C M A F , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal orlandoOptmat.fc.ul.pt
Duke University, M a t h e m a t i c s , 213 Physics Bldg, Box
N E T O DE CARVALHO, CARLOS
drmQmath. duke. edu
Centro Cultural Raiano, Avenida Zona Nova de E x p a n s a o , PT-6060-101 Idanha-a-Nova, P o r t u g a l praedichniaOhotmail.com
MORSELLA, GERARDO
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "La Sapienza", Piazzale Aldo Moro, 2, 1-00185 R o m a , Italy morsellaOmat,uniromal.it MOURAO, J O S E
I n s t i t u t e Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal jmouraoOmath.ist.utl.pt MUELLER, P E T E R
NIEMI, ANTTI
D p t . Theoretical Physics, Uppsala University, Box 803 5 - 75108 Uppsala, Sweden A n t t i . NiemiOteorfys.uu.se NlKITENKOVA, SVETLANA
Nizhny Novgorod S t a t e Techn. Univ., 603000, Minis. Str. 24, Russia spnOwaise.nntu.sci-nnov.ru
Univ. Gottingen, Theor. Physik, Friedrich - Hund P l a t z 1, D37077 Gottingen, Germany peterl760web. de
Hauchsvej 15, St., 1825 Frederiksberg C, D e n m a r k maniss0ruc.dk
M U L , OLENA
NUNES, ANA
Univ. Aveiro, D e p t . M a t h e m a t i c s , PT-3810-193, Aveiro, Portugal olenaOmat.ua.pt
Centro de Fisica Teorica e C o m p u t a t i o n a l , Av. Prof. G a m a P i n t o 2, PT-1649-003 Lisboa, Portugal anunesOlmc.fc.ul.pt
M U S U H , SAMI
NUNES, JOAO PlMENTEL
Physics D e p a r t m e n t , Al-Azhar University, P.O. Box 1277 Gaza, Palestine
Niss, MARTIN
smuslihOictp.trieste.it
Dep. M a t e m a t i c a , I n s t i t u t o Superior Tecnico, Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal jpnunesQmath.ist.utl.pt
NABER, GREGORY
OJIMA, IZUMI
1010 Race Street, Unit # P H L , Philadelphia PA 19107, USA gln220drexel. edu
Research I n s t i t u t e for M a t h e m a t i c a l Sciences, Kyoto University, Kyoto 606-8502, J a p a n
NACHTERGAELE, BRUNO
OKOUNKOV, ANDREI
D e p a r t m e n t of M a t h e m a t i c s , University of California, One Shields Avenue, Davis, CA 95616-8633, USA bxnOmath.ucdavis.edu
D e p a r t m e n t of M a t h e m a t i c s , Princeton University, Fine Hall, Washington Road, Princeton NJ 08544-1000, USA
NAGAMACIII, SHTGEAKI
Univ. Tokushima, Applied Physics a n d m a t h e m a t i c s , 2-1 Minamijosanjima, Tokushima 7708506 J a p a n shigeakiOpm.tokushima-u.ac-jp NAHM, W E R N E R
oj imaOkurims.kyoto-u-ac.jp
okounkovOmath.princeton.edu OLIVEIRA, ISABEL
Escola Superior Tecnologia, D e p a r t a m e n t o de M a t e m a t i c a , I P S , R u a Vale de Chaves, Estefanilha, PT-2914-761 Setubal, P o r t u g a l
Dublin IAS, Theoretical Physics, 10 Burlington Road, Dublin 4, Ireland
ioliveiraOest.ips.pt
wernerOth.physik.uni-bonn.de
C M A F , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal
NAKAYASHIKI, ATSUSHI
Kyushu University, M a t h e m a t i c s , Hahozaki, Fukuoka, 812 - 8581, J a p a n 6vertex0math. kyushu-u . a c . j p NARITA, M A K O T O
Center for Relativity a n d Geometric Physics Studies, D e p a r t m e n t of Physics, National Central University, Jhongli 320, Taiwan maknarOaei-potsdam.mpg.de
OLIVEIRA, MARIA J O A O
oliveiraOcii.fc.ul.pt OLIVIERI, ENZO
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "Tor Vergata", V i a della Ricerca Sciehtifica, 00133 R o m a , Italy olivieriOmat.uniroma2.it OLLA, STEPANO
Ceremade, U M R - C N R S 7534, Universite Paris Dauphine, Place du Marechal de L a t t r e de Tassigny, F-75775 Paris cedex 16, France ollaOceremade. dauphine. f r
Participants:
OOGURI - REIS
687
OOGURI, HIROSI
POLIN, M A R C O
California I n s t i t u t e of Technology, Mail Code 452-48, P a s a d e n a , CA 91125, USA oogurifltheory.caltech.edu
Room 731, D e p a r t m e n t of Physics, New York University, 4 Washington Place, New York, N Y 10003 mp756<8nyu. edu
OWHADI, HOUMAN
POLITO, CAIO
Applied a n d C o m p u t a t i o n a l M a t h , 217-50, California I n s t i t u t e of Technology, 1200 E. California Blvd., P a s a d e n a , CA 91125, USA owhadiQcmi. univ-mrs. f r
Q R S W 0 7 Bloco A-l Apt. 201 - Setor Sudoeste Bsb-DF - Brazil ZIP Code: 70675-701
PACCIANI, PAMELA
caioQfis.unb.br PORRMANN, MARTIN
II. I n s t i t u t fur Theoretische Physik, Universitat H a m b u r g , L u r u p e r Chaussee 149, D-22761 H a m b u r g , Germany
Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal paccianiOcii.fc.ul.pt
martin.porrmannSdesy.de
PANATI, GIANLUCA
POSILICANO, ANDREA
Z e n t r u m M a t h e m a t i k , G r o u p M5, Technische Universitat Miinchen, B o l t z m a n n s t r a s s e 3, D-85747 Garching bei Miinchen, G e r m a n y panat iQma.turn.de
D i p a r t i m e n t o di Scienze Fisiche e M a t e m a t i c h e , Universita dell'Insubria, Via Valleggio 11, 1-22100 Como, Italy posilicanoQuninsubria.it
PATERA, J I R I
POST, OLAF
C e n t r e de Recherches M a t h e m a t i q u e s , Universite de Montreal, Case postale 6128, Succursale Centre-ville, Montreal (Quebec) H3C 3 J 7 , C a n a d a pateraUCRM.Umontreal.CA
IRAM (Institute for P u r e and Applied M a t h e m a t i c s ) , R W T H Aachen, Templergraben 55, D-52062 Aachen, Germany posttOiram. rwth-aachen. de
PATUREL, E R I C
PRYKARPATSKA, NATALIA
Universite Nantes, L a b o r a t o i r e J e a n Leray, 2, rue de la Houssiniere, F-44322 N a n t e s cedex 3, France Eric.PaturelQmath.univ-nantes.fr
2 Subbotiwska Str, apt N14, Lwiw 79052, U k r a i n a prykanatGcybergal.com
PERDIGAO DIAS DA SILVA, J O S E A.
perdigaoQhermite.cii.fc.ul.pt
D e p a r t m e n t of Applied M a t h e m a t i c s , University of Mining and Metallurgy, 30 Mickiewicz Al. Bl. A4, 30059 Krakow, Poland prikaQmat.agh.edu.pi
PEREIRA, ISABEL R U T E
PRZYBYLSKA, MARIA
R u a das Dalias, n ° 8 , 3 ° d t o , PT-2900-044 Setubal, Portugal
Inst. Fourier, U M R 5582 C N R S , 100, r. des M a t h s , BP74, 38402 St Martin d'Heres, France mprzybQf o u r i e r . u j f - g r e n o b l e . f r
C E L C , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal
rutepeartreeOQmail.pt PERELMAN, GALINA
Ecole Polytechnique, U M R 7640, F-91128 Palaiseau cedex, France galina.perelmanQmath.polytechnique.fr PESTANA DA SILVA, T I A G O
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal tsilvaQmath.ist.utl.pt P E T R I T I S , DIMITRI
Universite Rennes 1, Inst. M a t h e m a t i q u e s , 35042 R e n n e s cedex France dimitri.petritisQuniv.rennesl.fr PIACITELLI, GHERARDO
Via Giacinto Gallina 3, 34122 Trieste - Italy piaciteK&mat.uniromal.it PICKEN, ROGER
D e p a r t a m e n t o de M a t e m a t i c a , I n s t i t u t o Superior Tecnico, Avenida Rovisco Pais, PT-1049-001 Lisboa, Portugal rpickenQmath.ist.utl.pt
PlNHEIRO, DlOGO R u a Dr. Abflio Torres, 1008, PT-4815-552 Vizela, Portugal diogoftmaths.Warwick.ac.uk P I N T O , PAULO
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal ppiBtoffimath.ist.utl.pt
PRYKARPATSKY, ANATOLIY
PUSZ, WlESLAW Dept. M a t h . Methods in Physics, Faculty of Physics, University of Warsaw, Hoza 74, 00-682 Warszawa, Poland Wieslaw.PuszQfuw.edu.pi RADNOVIC, MILENA
M a t h e m a t i c a l I n s t i t u t e SANU, K n e z a Mihaila 35, 11001 Beograd, p.p. 367, Serbia a n d Montenegro milena.radnovicQweizmann.ac.il milenaQmi.sanu.ac.yu RAYKOV, GUEORGUI
D e p a r t a m e n t o de M a t e m a t i c a s , Facultad de Ciencias, Universidad de Chile, Las P a l m e r a s 3425, Casilla 653, Santiago, Chile graykovffluchile. cl REBENKO, ALEXEI
Inst, of M a t h e m a t i c s , NASU, Kiev, Ukraine rebenkoiOimath. k i e v . ua REHREN, KARL-HENNING
I n s t i t u t fur Theoretische Physik, Universitat G o t t i n g e n , Friedrich-Hund-Platz 1, D-37077 Gottingen, Germany rehrenQtheorie.physik.uni-goe.de R E I S , NUNO
26 Glenhill Close, Finchley Central, London N3 2JS, United Kingdom nunobragareisGyahoo.co.uk
688
Participants: R.ENDALL - SANTOS
RENDALL, ALAN
ROMAO, NUNO
Max-Planck-Institut fur Gravitationsphysik, Albert-Einstein-Institut, A m Muhlenberg 1, D-14476 Potsdam, Germany
School of P u r e M a t h e m a t i c s , T h e University of Adelaide, N o r t h Terrace, Adelaide SA 5005, Australia nromaoQmaths.adelaide.edu.au
rendallQaei.mpg.de RESENDE, P E D R O
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal pmrflmath.ist.utl.pt REY-BELLET, LUC
Lederle G r a d u a t e Research Tower, D e p a r t m e n t of M a t h e m a t i c s a n d Statistics, University of Massachusetts, A m h e r s t , MA 01003, USA Ir7q9math. umass. edu REZENDE, J O R G E
G F M U L , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, P o r t u g a l rezendeflcii.fc.ul.pt RIESS, JURG
C R T B T , C N R S , 25 r u e des Martyrs, B P 166, 38042 Grenoble cedex 9, France jurg.riessfflgrenoble.cnrs.fr RINCON, Luis ANTONIO
D e p a r t a m e n t o de M a t e m a t i c a s , Facultad d e Ciencias UNAM, Circuito Exterior de C U , 04510 Mexico D F , Mexico
ROMEIRAS, FILIPE
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , A v . Rovisco Pais, PT-1049-001 Lisboa, Portugal filipe.romeirasQmath.ist.utl.pt ROSADO MARIA, MARIA EUGENIA
D e p a r t a m e n t o de Analisis Economico: Econonu'a C u a n t i t a t i v a (Mod. E-III-206a), Facultad de Ciencias Economicas y Empresariales, c / Francisco Tomas y Valiente, 5, Universidad A u t o n o m a de Madrid, 28049 Madrid, Spain eugenia. rosado<9uam. es ROULEUX, MICHEL
Centre de Physique Theorique, C N R S Luminy, Case 907, 13288 Marseille cedex 9, France rouleuxQcpt.univ-mrs.fr RUDOLPH, G E R D
Universitat Leipzig, F a k u l t a t fur Physik u n d Geowissenschaften n s t i t u t fur Theoretische Physik, A u g u s t u s P l a t z 1 0 / 1 1 , 04109 Leipzig, G e r m a n y Gerd.RudolphQitp.uni-leipzig.de RUSKAI, MARY B E T H
46 Lansdowne R d . , Arlington, MA 02474, USA
larsflfciencias.unam.mx
marybeth.ruskaifltufts.edu
RINGSTROM, HANS
Ruzzi, G I U S E P P E
D e p a r t m e n t of M a t h e m a t i c s , K T H , 100 44 Stockholm, Sweden hansrQae i,mpg.de
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a "Tor Vergata", Via della Ricerca Scientifica, 00133 Roma, Italy ruzziQmat.uniroma2.it
RIVASSEAU, VINCENT
L a b o r a t o i r e de Physique Theorique, Batiment 210, Unversite Paris XI, 91405 Orsay cedex, France rivassOth.u-psud.fr ROCHA, J O R G E
767 Cypress Walk, # F , Goleta, CA 93117, USA jmrochaflmath.ist.utl.pt ROCHA, LEONEL
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , A v . Rovisco Pais, PT-1049-001, Lisboa, Portugal jrochaQmath.ist.utl.pt ROCHA, TANIA
SACCHETTI, ANDREA
L a b o r a t o r y for Solid S t a t e Physics, E T H Zurich H P F - F 3 , CH-8093 Zurich, Switzerland saccbettiffiphys.ethz.ch SADUN, LORENZO
D e p a r t m e n t of M a t h e m a t i c s , University of Texas, 1 University Station C1200, Austin, T X 78712, USA sadunfflmath.utexas.edu SAIFULLAH, KHALID
D e p a r t m e n t of M a t h e m a t i c s , Quaid-i-Azam University, Islamabad, P a k i s t a n saifullahQqau.edu.pk
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal tarochafflmath.ist.utl.pt
SALMHOFER, MANFRED
RODNIANSKI, IGOR
SAMOILENKO, ANATOLIY
Princeton University, D p t . of M a t h e m a t i c s , Fine Hall, Washington Road, Princeton NJ 08544-1000 USA irodfflmath.princeton.edu
I n s t i t u t e of M a t h e m a t i c s of Ukrainian National Academy of Sciences, 3 , Tereshchenkivska St., 01601, Kiev-4, Ukraine samQinath.kiev.ua
ROCKNER, MICHAEL G.
D e p a r t m e n t of M a t h e m a t i c s , P u r d u e University 150 N. University Street, West Lafayette, IN 47907, USA roecknerfflmath.purdue.edu ROHRL, NORBERT
Universitat S t u t t g a r t , Fachbereich M a t h e m a t i k , I n s t i t u t fur Analysis, Dynamik u n d Modellierung, Lehrstuhl fur Analysis u n d Mathematische Physik, Pfaffenwaldring 57, D-70569 S t u t t g a r t , G e r m a n y Norbert.Roehrlfflmathematik.uni-stuttgart.de
I n s t i t u t fur Theoretische Physik, Universitat Leipzig, Postfach 100920, 04009 Leipzig, G e r m a n y salmhof erflitp. uni-leipzig. de
SANCHEZ, MIGUEL
Universidad de G r a n a d a , D e p a r t a m e n t o de Geometn'a y Topologia, Facultad de Ciencias, Avda. Fuentenueva s/n., E-18071 G r a n a d a , Spain sanchezmQugr. es SANTOS, FILIPE DUARTE
Observatorio Astronomico de Lisboa, Edificio Leste, Piso 2, T a p a d a d a Ajuda, PT-1349-018 Lisboa, Portugal fdsantosQoal.ul.pt
Participants: SCHENKER - SOBIECZKY
689
SCHENKER, J E F F R E Y
SERIE, EMMANUEL
Theoretische Physik, E T H Zurich, CH-8093 Zurich, Switzerland j schenkerflitp.phys.ethz.ch
Univ. Paris 11, Laboratoire de Physique Theorique, B a t i m e n t 210, 91405 Orsay cedex, Prance emmanuel.serieQth.u-psud.fr
SCHIAPPA, RICARDO
SEWELL, G E O F F R E Y
Av. do Brasil, 113, 2-A, PT-2750-309 Cascais, Portugal ricardofflfe.ucp.pt
D e p a r t m e n t of Physics, Queen Mary, University of London, Mile E n d Road, London E l 4NS, UK g.1.sevellfiqmul.ac.uk
SCHLAG, WlLHELM
SHEPPEARD, MARNI
Caltech, D e p a r t m e n t of M a t h e m a t i c s , 253-37, P a s a d e n a CA 91125, USA schlagQcaltech.edu
University of Canterbury, D e p a r t m e n t of Physics a n d Astronomy, Private Bag 4800, Christchurch 8020, New Zeeland m.sheppeardfiphys.canterbury.ac.nz
SCHLINGEMANN, DlRK
I n s t i t u t fiir Mathematische Physik, T U Braunschweig, Mendelssohnstrasse 3, 38106 Braunschweig, Germany d. 5chlingemann<9tu-bs. de SCHNEIDER, G U I D O
Mathematisches I n s t i t u t I, Universitat Karlsruhe, Englerstr. 2, 76131 Karlsruhe, G e r m a n y guido.schneiderfimath.uni-karlsruhe.de SCHOMERUS, VOLKER
D E S Y T h e o r y G r o u p , Notkestrasse 85, D 22603 H a m b u r g , Germany vschomerQspht.saclay. c e a . f r SCHRAMM, O D E D
Microsoft Corporation, O n e Microsoft Way, Redmond, WA 98052-6399, USA schrammfimicrosoft.com SCHUBERT, ROMAN
D e p a r t m e n t of M a t h e m a t i c s , University of Bristol, University Walk, Bristol, BS8 1 T W , UK Roman.Schubertflbristol.ac.uk SCRIVEN, N E I L
I O P Publishing, Dirac House, Temple Back, Bristol BS1 6 B E , U.K neil.scrivenfliop.org SEIDEL, EDWARD
Louisiana S t a t e University, Center for C o m p u t a t i o n and Technology, J o h n s t o n Hall, B a t o n Rouge, LA 70803, USA eseidelfflcct. lsu. edu SEILER, RUDOLF
I n s t i t u t fiir M a t h e m a t i k , MA 7-2, Technische Universitat Berlin, StraBe des 17 J u n i 136, D-10623 Berlin, Germany seilerOmath.tu-berlin.de
SHIRIKYAN, ARMEN
Laboratoire de M a t h e m a t i q u e s , Universite de Paris-Sud XI, B a t i m e n t 425, 91405 Orsay cedex, France armen. shirikyan(8math. u-psud. f r SHRAMCHENKO, VASILISA
Max Planck Institut fiir M a t h e m a t i k , Vivatgasse 7, D-53111 Bonn Germany vasilisaflalcor.concordia.ca SHUB, MICHAEL
University of Toronto, D e p a r t m e n t of M a t h e m a t i c s , Toronto, Ontario, M5S2E4, C a n a d a michael.shubfflutoronto.ca SIMRING, E R I C
Columbia University, 2990 Broadway, Mailcode 4406, New York, NY 10027, USA simringOmath.Columbia.edu SIMS, ROBERT
U.C. Davis, D e p a r t m e n t of M a t h e m a t i c s , # 5 7 7 Kerr Hall, One Shields Ave., Davis, CA 95616-8633, USA r j simsGmath.ucdavis.edu SINAI, YASHA G.
Princeton University, D e p a r t m e n t of M a t h e m a t i c s , Fine Hall, P r i n c e t o n N J 08544-1000 USA sinaiQmath.Princeton.EDU SIRUGUE-COLLIN, MADELEINE
Centre de Physique Theorique, C N R S Luminy, Case 907, 13288 Marseille cedex 9, France sirugue
I n s t i t u t for Matematiske Fag, Aarhus Universitet, Ny Munkegade, DK-8000 Aarhus C, Denmark skibstedfiimf.au.dk
SEIRINGER, R O B E R T
SLINGERLAND, J O O S T K.
D e p a r t m e n t of Physics, Jadwin Hall, Princeton University, P.O. Box 708, Princeton, NJ 08544, USA rseiringfimath.princeton.edu
K I P T , U C S B , Kohn Hall, S a n t a b a r b a r a CA 93106, USA
SEKIGUCHI, MASAYOSHI
SMIRNOV, STANISLAV
Kisarazu National College of Technology, Kiyomidai-Higashi 2-11-1, Chiba 292-0041, J a p a n masaQkisarazu.ac.jp
K T H , D e p a r t m e n t of M a t h e m a t i c s , S-10044 Stockholm, Sweden
SENGUPTA, AMBAR
SNOBL, LIBOR
D e p a r t m e n t of M a t h e m a t i c s , Louisiana S t a t e University, B a t o n Rouge, LA 70803, USA senguptaQmath.lsu.edu
C R M , Universite d e Montreal, P . O . Box 6128, Centre-ville Station, Montreal (Quebec) H 3 C 3 J 7 , Canada libor.snobKDfjfi.cvut.cz
SERGI, DANILO
Universite de Geneve, D e p a r t e m e n t de Physique Theorique, 24 quai E. Ansermet, 1211 Geneve 4, Switzerland danilo.sergifflphysics.unige. ch
j oostfimicrosoft.com
stasfimath.kth.se
SOBIECZKY, FLORIAN
Technische Universitat Graz, I n s t i t u t fiir M a t h e m a t i k C, Steyrergasse 30, Sekr. 5030, A-8010 Graz, Austria sobieczkyQtugraz.at
690
Participants:
SOKAL - TAVARES
SOKAL, ALAN
STOROZHEV, VALERY
New York University, D e p a r t m e n t of Physics, 4 Washington Place, New York, N Y 10003, USA sokalQnyu.edu
D e p a r t m e n t of M a t h e m a t i c s , Donetsk National University, 83055 Donetsk, Ukraine storozhevQmatfak.dongu.donetsk.ua
SOLOVEJ, J A N P H I L I P
STOSIC, M A R K O
D e p a r t m e n t of M a t h e m a t i c s , University of Copenhagen, Universitetsparken 5, DK-2100 Copenhagen, Denmark
D e p a r t a m e n t o de M a t e m a t i c a , I n s t i t u t o Superior Tecnico, Av. Rovisco Pais 1, PT-1049-001 Lisboa, Portugal
solovejQmath.ku.dk
mstosicHmath.ist.utl.pt
Soos, ANNA Babe s-Bolyai University, Faculty of M a t h e m a t i c s a n d C o m p u t e r Science, Cluj-Napoca, R o m a n i a asoosQmath. ubbcluj . ro
STREIT, LUDWIG
Universidade d a Madeira, C C M , C a m p u s P e n t e a d a , PT-9000-390 Funchal, P o r t u g a l streit<&uma.pt
SOSHNIKOV, ALEXANDER
SUIDAN, T O U F I C
University of California a t Davis, D e p a r t m e n t of M a t h e m a t i c s , O n e Shields Avenue, Davis, C A 95616-8633, USA
708 Nobel Drive, A p a r t . C, S a n t a Cruz, C A , 95060, USA
soshnikoOmath.ucdavis.edu SOUSA, NELSON
I n s t i t u t o Superior Tecnico, D e p a r t m e n t o de M a t e m a t i c a , Av. Rovisco Pais 1, PT-1049-001, Lisboa, P o r t u g a l nsousaQmath.ist.utl.pt SOUSA, NUNO
R. Arlindo Vicente, lote 23, 1C, PT-3030-298 Coimbra, P o r t u g a l nsousaQfc.up.pt
suidanQcims • nyu. edu SiJTO, ANDRAS
Research I n s t i t u t e for Solid S t a t e Physics a n d Optics, P.O.B. 49, H-1525 B u d a p e s t , Hungary sutoQszfki.hu SVOBODOVA, MlLENA
Czech Technical University, Faculty of Nuclear Sciences a n d Physical Engineering, D e p a r t m e n t of M a t h e m a t i c s , Trojanova 13, P r a h a 2, 120 00 Czech Republic svobodovOkml.fjfi.cvut.cz
SOUSA RAMOS, J O S E
SZMIGIELSKI, JACEK
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001 Lisboa, Portugal sramosQmath.ist.utl.pt
D e p a r t m e n t of M a t h e m a t i c s a n d Statistics, Room 204 McLean Hall, University of Saskatchewan, 106 Wiggins Road, Saskatoon, SK, S7N 5E6, C a n a d a szmigielQmath.usask.ca
SPITZER, WOLFGANG
SZPAK, NlKODEM
Alte T r a u n s t r a s s e 78, A-4802 Ebensee, Austria spitzerQmath.ucdavis.edu
I n s t i t u t e for Theoretical Physics, J . W . G o e t h e University, Max-von-Laue-Straf3e 1, 60438 Frankfurt/Main, Germany szpakQth.physik.uni-frankfurt.de
SPOHN, HERBERT
Z e n t r u m M a t h e m a t i k , Technische Universitat Miinchen, B o l t z m a n n s t r . 3, D-85748 Garching bei Miinchen, G e r m a n y spohnQma. turn. de STARR, SHANNON
SZYBKA, SEBASTIAN
D e p a r t m e n t of General Relativity a n d Astrophysics (rm. 430), I n s t i t u t e of Physics, Jagellonian University, 4 R e y m o n t a st, 30-059 Cracow, Poland
UCLA, M a t h e m a t i c s D e p a r t m e n t , Box 951555, Los Angeles, CA 90095-1555
szybkaOif.uj.edu.pi
sstarrGmath.ucla.edu
G r a d u a t e School of H u m a n a n d E n v i r o n m e n t a l Studies, Kyoto University, Yoshida, Kyoto 606-8501, Japan takasakitflmath. h. kyoto-u . a c . j p
STEINACKER, HAROLD
Ludwig-Maximilians-University Munich, I n s t i t u t fur M a t h e m a t i s c h e Physik, Theresienstr. 37, D-80333 Munich, G e r m a n y Harold.SteinackerGphysik.uni-muenchen.de STERNHEIMER, DANIEL
20 rue de Tournon, F-75006 PARIS, France daniel.sternheimerQu-bourgogne.fr STEVENSON, DANIEL
Univ. California, Riverside, M a t h e m a t i c s , 900 Big Springs Dr., 92521 C a , USA stevendeQuwec. edu STOILOVA, NEDIALKA ILIEVA
I n s t i t u t e for Nuclear Research a n d Nuclear Energy, boul. Tsarigradsko Chaussee 72, 1784 Sofia, Bulgaria toilovaQinrne.bas.bg STOLZ, G U N T E R
D e p a r t m e n t of M a t h e m a t i c s , University of A l a b a m a a t Birmingham, 452 Campbell Hall, Birmingham, AL 35294-1170, USA stolzQmath.uab.edu
TAKASAKI, KANEHISA
TAKEBE, TAKASHI
D e p a r t m e n t of M a t h e m a t i c s , Ochanomizu Univ., Otsuka 2-1-1, Bunkyo-ku, Tokyo, 112-8610, J a p a n takebeQmath.ocha.ac.jp TARLINI, M A R C O
INFN Sezione di Firenze, D i p a r t i m e n t o di Fisica, Via G. Sansone, 1, 50019 Sesto Fiorentino ( F I ) , Italy Marco.TarliniQf i . i n f n . i t TATER, MILOS
T h e o r y D e p a r t m e n t , I n s t i t u t e of Nuclear Physics, CZ 250 68 Rez, Czech Republic tater<9ujf . c a s . c z TAVARES, J O A O NUNO
F C U P , M a t e m a t i c a P u r a , R. C a m p o Alegre 687, PT-4169-007 P o r t o , Portugal j nt avar
Participants: TENUTA - WASSERMANN
691
TENUTA, LUCATTILIO
VANPETEGHEM, DIMITRI
c / o SISSA, Via Beirut 2-4, 34014 G r i g n a n o (Trieste), Italy tenutafflsissa.it
I n s t i t u u t voor Theoretische Fysica, Celestijnenlaan 200D, B-3001 Leuven, Belgium d i m i t r i . vanpeteghemfflf y s . kuleuven. a c . be
TESCHNER, JORG
VARELAS DA ROCHA, J O R G E MIGUEL
I n s t i t u t fiir theoretische Physik, Freie Universitat Berlin, Arnimallee 14, D-14195 Berlin, G e r m a n y joerg.teschnerfflphysik.fu-berlin.de
I n s t i t u t o Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais 1, PT-1049-001, Lisboa, Portugal jmrochafflmath.ist.utl.pt
TEUFEL, STEFAN
M a t h . Inst. Auf der Morgenstelle 10, 72076 T u b i n g e n , Germany Stefan.teufelSuni-tuebingen.de THIEMANN, THOMAS
Albert Einstein I n s t i t u t , M a x - P l a n c k - I n s t i t u t fiir Gravitationsphysik, Am Muhlenberg 1, 14476 Golm, Germany tthiemannfflperimeterinstitute.ca TIEDRA DE ALDECOA, RAFAEL
D e p a r t e m e n t d e physique theorique de 1'Universite de Geneve, 24, quai E. Ansermet, 1211 Geneve, Switzerland
VASSELLI, EZIO
D i p a r t i m e n t o di M a t e m a t i c a Universita di R o m a "La Sapienza", Piazzale Aldo Moro, 2, 1-00185 R o m a , Italy vassellifflmat.uniroma2.it VELHINHO, J O S E
D e p a r t a m e n t o de Fisica, Universidade da Beira Interior, Av. Marques d'Avila e Bolama, PT-6200-001 Covilha, P o r t u g a l jvelhifldfisica.ubi.pt VIANA, MARCELO
rafael.tiedrafflphysics.unige.ch
IMPA, E s t r a d a D o n a C a s t o r i n a 110, 22460-320 Rio de Janeiro, Brazil vianaQimpa.br
T O R R E S VALDERRAMA, ALDEMAR
VICNES TOURNERET, FABIEN
Minnaertgebouw, Leuvenlaan 4, P o s t b u s 80.195, 3508 T D Utrecht, The N e t h e r l a n d s A.TorresValderramafflphys.uu.nl
Lab. Phys. Theor., Bat. 210, Univ. Paris XI, F-91405 Orsay cedex, France fabien.vignesfflth.u-psud.fr
TOWNSEND, PAUL
VILELA MENDES, RUI
D e p a r t m e n t of Applied M a t h e m a t i c s and Theoretical Physics, Centre for M a t h e m a t i c a l Sciences, University of Cambridge, Wilberforce R o a d , Cambridge C B 3 0WA, United Kingdom p.k.townsendffldamtp.cam.ac.uk
C M A F , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal
T s o u , SHEUNG TSUN
University of Oxford, M a t h e m a t i c a l I n s t i t u t e , 24-29 St. Giles', Oxford O X 1 3 L B , United Kingdom tsoufflmaths.ox.ac.uk TUTSCHKA, CHRISTIAN
T h e Erwin Schrodinger I n t e r n a t i o n a l I n s t i t u t e for M a t h e m a t i c a l Physics, Boltzmanngasse 9, A-1090 Wien, A u s t r i a tutschkaffltph.tuwien.ac.at U E K I , NAOMASA
G r a d u a t e School of H u m a n and E n v i r o n m e n t a l Studies, K y o t o University, K y o t o 606-8501 J a p a n uekiQmath.h.kyoto-u.ac.jp UELTSCHI, DANIEL
Dept of M a t h e m a t i c s , Univ. of Arizona, 617 N. S a n t a R i t a , Tucson, AZ 85721, USA ueltschiQmath.arizona.edu VACARU, SERGIU
I n s t i t u t e Superior Tecnico, D e p a r t a m e n t o de M a t e m a t i c a , Av. Rovisco Pais, PT-1049-001, Lisboa, Portugal svacarufflmath.ist.utl.pt VAN BAALEN, GUILLAUME
Univ. Geneve, D p t . Physique Theorique, 24 quai Ansermet, CH-1211 Geneve 4, Switzerland vanbaal4<5kalyntnos.unige. ch VAN E N T E R , AERNOUT
Center for Theoretical Physics, R.U. Groningen, Nijenborgh 4, 9747 AG, Groningen, T h e Netherlands aenterflphys . r u g . n l
vilelafflcii.fc.ul.pt VlLLANI, CEDRIC
Ecole Normale Superieure de Lyon, 46, allee dTtalie, F-69364 Lyon cedex 07, France cvillanifflumpa.ens-lyon.fr VlLLANI, MATTEO
Universita di Bari, D i p a r t i m e n t o di Fisica, Via Amendola 173, 1-70126 Bari, Italy matteo.villanifflba.infn.it VINAGRE, SANDRA
D e p a r t a m e n t o de M a t e m a t i c a , Universidade de Evora, R u a R o m a o R a m a l h o , 59, PT-7000-671 Evora, Portugal smvffluevora.pt VINET,
Luc
University of Montreal, Dept. Physics, C P 6128, Montreal H3C 3 J 7 , C a n a d a luc.vinetOmcgill.ca VOICULESCU, DAN VIRGIL
D e p a r t m e n t of M a t h e m a t i c s , University of California, Berkeley, Berkeley C A 94720, USA dwflmath. berkeley. edu WALD, ROBERT
University of Chicago, D e p a r t m e n t of Physics, 5640 South Ellis Avenue, Chicago, IL 60637, USA rmwafflmidway.uchicago.edu WARZEL, SIMONE
P r i n c e t o n University, M a t h e m a t i c s a n d Physics, Princeton N J 08544, USA swarzelflprinceton.edu WASSERMANN, ANTONY
Univ. Mediterranee, I n s t i t u t de M a t h e m a t i q u e s de Luminy, C N R S , 163 Av. Luminy, 13288 Marseille cedex 9, France wassermffliml.univ-mrs.fr
692
Participants:
WASTAVINO - ZULETA
ESTRUGO
WASTAVINO, LETIZZIA
YNGVASON, J A K O B
Nueva de Valdes 951, Santiago, Chile l e t izziaQmacul. c i e n c i a s . u c h i l e . c l
I n s t i t u t fiir Theoretische Physik der Universitat Wien, Boltzmanngasse 5, A-1090 Wien, Austria yngvasonflthor.univie.ac.at
WATANABE, HISAO
6-19-10 Hinosato M u n a k a t a city, J a p a n 811-3425
ZABEY, EMMANUEL
hisaowflmuse.ocn.ne. jp
Universite de Geneve, D e p a r t m e n t de Physique Theorique, 24 quai Ansermet, 1211 Geneve 4, Switzerland
WATANABE, K E I J I
Meisei University, D p t . of Physics, Hino, Tokyo 191, Japan watanabeQnda.ac.jp W E D E R , RIOARDO
IIMAS-UNAM, A p a r t a d o Postal 20-726, Mexico D F 01000, Mexico
emmanuel.zabeyOphysics.unige.ch ZAGREBNOV, VALENTIN
Centre de Physique Theorique, Luminy, Case 907, 13288 Marseille cedex 9, France zagrebnovfflcpt.univ-mrs.fr
wederOservidor.unam.mx
ZAMBRINI, JEAN-CLAUDE
WEISFELD, MORRIS
G F M U L , Av. Prof. G a m a P i n t o , 2, PT-1649-003 Lisboa, Portugal
Box 61331, D u r h a m N C , 27715-1331, USA mwQmath.duke.edu W E S T O N , ROBERT
D e p a r t m e n t of M a t h e m a t i c s , Heriot-Watt University, Edinburgh EH14 4AS, Scotland, United Kingdom r.a.westonQma.hw.ac.uk WOLOWSKI, LECH
School of M a t h e m a t i c s , University of Bristol, Bristol BS8 1 T W , United Kingdom L.WolowskiQbristol.ac.uk WORONOWICZ, STANISLAW
Dept. of M a t h e m a t i c a l M e t h o d s in Physics, Faculty of Physics, Warsaw University, Hoza 69, 02-744 Warsaw, Poland stanislaw.woronowiczQf uw.edu.pi W Y S S , WALTER
University of Colorado a t Boulder, C a m p u s Box 390, Boulder, Co 80309-0390, USA W2ch<3hotmail.com YAFAEV, DMITRI
Dep. M a t h . , Univ. Rennes-1, C a m p u s de Beaulieu, 35042 Rennes cedex, France yaf aev<8univ-rennes 1. f r YAU, H O R N G - T Z E R
Harvard University, D e p a r t m e n t of M a t h e m a t i c s , One Oxford Street, Cambridge, MA 02138, USA htyauQmath.harvard.edu
zambriniQcii.fc.ul.pt ZEINER, P E T E R
I n s t i t u t fuer Theoretische Physik, Technische Universitaet Wien, Wiedner H a u p t s t r a s s e 8-10/136, A-1040 Wien, A u s t r i a zeinerQtph. tuwien. ac. at ZERTUCHE, FEDERICO
I n s t i t u t o de M a t e m a t i c a s (Unidad Cuernavaca), Universidad Nacional A u t o n o m a de Mexico, A.P. 273, A d m o n . d e correos # 3 , C.P. 62251 Cuernavaca, Morelos, Mexico zertucheOmatcuer.unam.mx ZlTO, PASQUALE
D i p a r t i m e n t o di M a t e m a t i c a , Universita di R o m a II "Tor Vergata", Via della Ricerca Scientifica, 00133 Roma, Italy zitoQmat.uniroma2.it Zois, IOANNIS
64 Biskini Street, GR-15771 Athens, Greece zoisipiBcf.ac.uk ZUBER, JEAN-BERNARD
L P T H E , Tour 24-25 5eme etage, Universite Paris VI, Boite 126, 4 Place Jussieu, F-75252 Paris cedex 5, France zuberQspht.saclay.cea.fr ZULETA ESTRUGO, J O S E LUIS
E P F L , SB IGAT, B a t i m e n t de Chimie, CH-1015 Lausanne, Switzerland j oseluis.zuletaestrugoUima.unil.ch
In 2003 the XIV International Congress on Mathematical Physics (ICMP) was held in Lisbon with more than 500 participants. Twelve plenary talks were given in various fields of Mathematical Physics: E Carlen «On the relation between the Master equation and the Boltzmann Equation in Kinetic Theory»: A Chenciner «Symmetries and "simple" solutions of the classical n-body problems; M J Esteban «Relativistic models in atomic and molecular physics»; K Fredenhagen «Locally covariant quantum field theory»;KGawedzki«Simple models of turbulent transport»; I Krichever «Algebraic versus Liouville integrability of the soliton systems»; R V Moody «Long-range order and diffraction in mathematical quasicrystals»; S Smirnov «Critical percolation and conformal invariance»; J P Solovej «The energy of charged matter»; V Schomerus «Strings through the microscope»; C Villani «Entropy production and convergence to equilibrium for the Boltzmann equations D Voiculescu «Aspects of free probability*. The book collects as well carefully selected invited Session Talks in: Dynamical Systems, Integrable Systems and Random Matrix Theory, Condensed Matter Physics, Equilibrium Statistical Mechanics, Quantum Field Theory, Operator Algebras and Quantum Information, String and M Theory, Fluid Dynamics and Nonlinear PDE, General Relativity, Nonequilibrium Statistical Mechanics, Quantum Mechanics and Spectral Theory, Path Integrals and Stochastic Analysis.
MATHEMATICAL PHYSICS
tfc^
orld Scientific
www worldsclBnlKic.com