Communications in Mathematical Physics - Volume 259

Commun. Math. Phys. 259, 1–44 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1387-5 Communications in Mathe...

Author: M. Aizenman (Chief Editor)

33 downloads 730 Views 8MB Size Report

This content was uploaded by our users and we assume good faith they have the permission to share this book. If you own the copyright to this book and it is wrongfully on our website, we offer a simple DMCA procedure to remove your content from our site. Start by pressing the button below!

Report copyright / DMCA form

DOWNLOAD PDF

Commun. Math. Phys. 259, 1–44 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1387-5

Communications in

Mathematical Physics

Integrable Structure of the Dirichlet Boundary Problem in Multiply-Connected Domains I. Krichever1,2,3 , A. Marshakov3,4,5 , A. Zabrodin3,6 1 2 3 4 5 6

Department of Mathematics, Columbia University, New York, USA. E-mail: [email protected] Landau Institute, Moscow, Russia ITEP, Moscow, Russia Max Planck Institute of Mathematics, Bonn, Germany Lebedev Physics Institute, Moscow, Russia Institute of Biochemical Physics, Moscow, Russia

Received: 21 September 2003 / Accepted: 3 February 2004 Published online: 8 July 2005 – © Springer-Verlag 2005

Abstract: We study the integrable structure of the Dirichlet boundary problem in two dimensions and extend the approach to the case of planar multiply-connected domains. The solution to the Dirichlet boundary problem in the multiply-connected case is given through a quasiclassical tau-function, which generalizes the tau-function of the dispersionless Toda hierarchy. It is shown to obey an infinite hierarchy of Hirota-like equations which directly follow from properties of the Dirichlet Green function and from the Fay identities. The relation to multi-support solutions of matrix models is briefly discussed. 1. Introduction The Dirichlet boundary problem [1] is to reconstruct a harmonic function in a bounded domain from its values on the boundary. Remarkably, this standard problem of complex analysis, related however to string theory and matrix models, possesses a hidden integrable structure [2], which we clarify further in this paper. It turns out that variation of a solution to the Dirichlet problem under variation of the domain is described by an infinite hierarchy of non-linear partial differential equations known (in the simply-connected case) as dispersionless Toda hierarchy. It is a particular example of the universal hierarchy of Whitham equations introduced in [3, 4]. The quasiclassical tau-function or, more precisely, its logarithm F , is the main new object associated with a family of domains in the plane. Any domain in the complex plane with sufficiently smooth boundary can be parameterized by its moments with respect to a basis of harmonic functions. The F -function is a function of the full infinite set of the moments. The first order derivatives of F are then moments of the complementary domain. This gives a formal solution to the inverse potential problem, considered for the simply-connected case in [5, 6]. The second order derivatives are coefficients of the Taylor expansion of the Dirichlet Green function and therefore they solve the Dirichlet boundary problem. These coefficients are constrained by an infinite number of universal (i.e. domain-independent) relations which, unified in a generating form, just constitute

2

I. Krichever, A. Marshakov, A. Zabrodin

the dispersionless Hirota equations. For the third order derivatives (their role in problems of complex analysis is not yet quite clear) there is a nice “residue formula” which allows one to prove [7] that F obeys the WDVV equations. Below we are going to demonstrate that for planar multiply-connected domains the solution to the Dirichlet boundary problem can be performed in a similar way. Specifically, we consider domains which are obtained by cutting several “holes” in the complex plane. Boundaries of the holes are assumed to be smooth simple non-intersecting curves. In this case, the complete set of independent variables can be again identified with the set of harmonic moments. However, a choice of the proper basis of harmonic functions in a multiply-connected domain becomes crucial for our approach. It turns out that the Laurent polynomials which were used in the simply-connected case should be replaced by the basis analogous to the one introduced in [8] – a “global” generalization of the Laurent basis for algebraic curves of arbitrary genus. The basis has to be also enlarged to include harmonic functions with multi-valued analytic part. This results in an additional finite set of extra variables. We construct the F -function and prove that its second derivatives satisfy non-linear relations, which generalize the Hirota equations of the dispersionless Toda hierarchy. These relations are derived from the Fay identities [9] for the Riemann theta functions on the Jacobian of Riemann surface obtained as the Schottky double of the plane with given holes. We note that extra variables, specific for the multiply-connected case, can be chosen in different ways and possess different geometric interpretations, depending on the choice of basis of homologically non-trivial cycles on the Schottky double. The corresponding F -functions are shown to be connected by a duality transformation – a (partial) Legendre transform, with the generalized Hirota relations being the same. Now let us give a bit more expanded description of the Dirichlet problem in planar domains. Let Dc be a domain in the complex plane bounded by one or several non-intersecting smooth curves. It will be convenient to realize Dc as a complement to another domain D, having one or more connected components, and to consider the Dirichlet problem in Dc : to find a harmonic function u(z) in Dc such that it is continuous up to the boundary, ∂Dc , and equals a given function u0 (ξ ) on the boundary. The problem has a unique solution written in terms of the Dirichlet Green function G(z, ξ ): 1 u(z) = − u0 (ξ )∂n G(z, ξ )|dξ | , (1.1) 2π ∂Dc where ∂n is the normal derivative on the boundary with respect to the second variable, the normal vector n is directed inward Dc , and |dξ | := dl(ξ ) is an infinitesimal element of the length of the boundary ∂Dc . The main object to study is, therefore, the Dirichlet Green function. It is uniquely determined by the following properties [1]: (G1) The function G(z, z ) is symmetric and harmonic everywhere in Dc (including ∞ if Dc ∞) in both arguments except z = z , where G(z, z ) = log |z − z | + · · · as z → z ; (G2) G(z, z ) = 0 if any one of the variables z, z belongs to the boundary ∂Dc . Note that the definition implies that G(z, z ) < 0 inside Dc . In particular, ∂n G(z, ξ ) is strictly negative for all ξ ∈ ∂Dc . If Dc is simply-connected (note that we assume ∞ ∈ Dc ), i.e., the boundary has only one component, the Dirichlet problem is equivalent to finding a bijective conformal map from Dc onto the complement to unit disk or any other reference domain for which the

Integrable Structure of the Dirichlet Boundary Problem

3

Green function is known explicitly. Such a bijective conformal map w(z) exists by virtue of the Riemann mapping theorem, then w(z) − w(z ) , G(z, z ) = log (1.2) w(z)w(z ) − 1 where bar means complex conjugation. It connects the Green function at two points with the conformal map normalized at some third point (say at z = ∞: w(∞) = ∞). It is this formula which allows one to derive the Hirota equations for the tau-function of the Dirichlet problem in the most economic and transparent way [2] (see also Sect. 2 below). For multiply-connected domains, formulas of this type based on conformal maps do not really exist. In general, there is no canonical choice of the reference domain, moreover, the shape of a reference domain depends on Dc itself. In fact, as we demonstrate in the paper, the correct extension of (1.2) needed for derivation of the generalized Hirota equations follows from a different direction which is no longer explicitly related to bijective conformal maps. Namely, logarithm of the conformal map log w(z) should be replaced now by the Abel map from the Schottky double of Dc to the Jacobi variety of this Riemann surface, and the rational function under the logarithm in (1.2) is substituted by ratio of the prime forms or Riemann theta-functions. We show that the Green function of multiply-connected domains admits a representation through the logarithm of the tau-function of the form 1 1 1 (1.3) G(z, z ) = log − + ∇(z)∇(z )F . z z 2 Here ∇(z) is a certain vector field on the moduli space of boundary curves, therefore it can be represented as a (first-order) differential operator w.r.t. harmonic moments with constant (in moduli) coefficients depending, however, on the point z as a parameter. In this paper we also obtain similar formulas for the harmonic measures of the boundary components and for the Abel map. A combination of these formulas with the Fay identities yields the generalized Hirota-like equations for the tau-function F . Our main tool is the Hadamard variational formula [10] which gives the variation of the Dirichlet Green function under small deformations of the domain in terms of the Green function itself: 1 δG(z, z ) = ∂n G(z, ξ )∂n G(z , ξ )δn(ξ )|dξ |. (1.4) 2π ∂Dc Here δn(ξ ) is the normal displacement (with sign!) of the boundary under the deformation, counted along the normal vector at the boundary point ξ . It was shown in [2] that this remarkable formula is a key to all integrable properties of the Dirichlet problem. An extremely simple “pictorial” derivation of the formula (1.4) is presented in Fig. 1. We start with a brief recollection of the results for the simply-connected case in Sect. 2. However, instead of “bump” deformations used in [2] we work here with their rigorously defined versions – a family of infinitesimal deformations which we call elementary ones. This approach is basically motivated by the theory of interface dynamics in viscous fluids, which is known to be closely connected with the formalism developed in [2] and in the present paper (see [5] for details). In Sect. 3 we introduce local coordinates in the space of planar multiply-connected domains and express the elementary deformations in these coordinates. Using the Hadamard formula, we then observe remarkable symmetry or “zero-curvature” relations

4

I. Krichever, A. Marshakov, A. Zabrodin z δn ξ

Fig. 1. A “pictorial” derivation of the Hadamard formula. We consider a small deformation of the domain, with the new boundary being depicted by the dashed line. According to (G2) the Dirichlet Green function vanishes, G(z, ξ ) = 0, if ξ belongs to the old boundary. Then the variation δG(z, ξ ) is simply equal to the new value, i.e. in the leading order δG(z, ξ ) = −δn(ξ )∂n G(z, ξ ). Now notice that δG(z, ξ ) is a harmonic function (the logarithmic singularity cancels since it is the same for both old and new functions) with the boundary value −δn(ξ )∂n G(z, ξ ). Applying (1.1) one obtains (1.4). The argument is the same for both simply-connected and multiply-connected domains

which connect elementary deformations of the Green function and harmonic measures. The existence of the tau-function and the formula (1.3) for the Green function directly follow from these relations. In Sect. 4 we make a Legendre transform to another set of local coordinates in the space of algebraic multiply-connected domains, which is in a sense dual to the original one. In these coordinates, Eq. (1.3) gives another version of the Green function which solves the so-called modified Dirichlet problem. We also discuss the relation to multi-support solutions of matrix models in the planar large N limit. In Sect. 5 we combine the results outlined above with the representation of the Green function in terms of the prime form on the Schottky double. This allows us to obtain an infinite system of partial differential equations on the tau-function which generalize the dispersionless Hirota equations. 2. The Dirichlet Problem for Simply-Connected Domains and Dispersionless Hirota Equations In this section we rederive the results from [2] for the simply-connected case in a slightly different manner, more suitable for further generalizations. At the same time we show that the results of [2] obtained for analytic curves can be easily extended to the smooth case. Let D be a connected domain in the complex plane bounded by a simple smooth curve. We consider the exterior Dirichlet problem in Dc = C \ D which is the complement of D in the whole (extended) complex plane. Without loss of generality, we assume that D is compact and contains the point z = 0. Then Dc is a simply-connected domain on the Riemann sphere containing ∞. 2.1. Harmonic moments and deformations of the boundary. Let tk be moments of the domain Dc = C \ D defined with respect to the harmonic functions {z−k /k}, k > 0: 1 tk = − z−k d 2 z , k = 1, 2, . . . , (2.1) πk Dc

Integrable Structure of the Dirichlet Boundary Problem

5

1 and {t¯k } be the complex conjugate moments, i.e. t¯k = − πk formula represents the harmonic moments as contour integrals

tk =

1 2πik

Dc

z−k z¯ dz

d 2 z¯z−k . The Stokes

(2.2)

∂D

providing, in particular, a regularization of possibly divergent integrals (2.1). Besides, we denote by t0 the area (divided by π) of the domain D: 1 t0 = π

d 2z .

(2.3)

D

The harmonic moments of Dc are coefficients of the Taylor expansion of the potential (z) = −

2 π

log |z − z |d 2 z

(2.4)

D

induced by the domain D filled by two-dimensional Coulomb charges with the uniform density ρ = −1. Clearly, ∂z ∂z¯ (z) = −1 if z ∈ D and vanishes otherwise, so around the origin (recall that D 0) the potential equals to −|z|2 plus a harmonic function, i.e. (z) − (0) = −|z|2 +

tk zk + t¯k z¯ k ,

(2.5)

k≥1

and one can verify that tk are just given by (2.1). For analytic boundary curves, one may introduce the Schwarz function associated with the curve. The function ∂z (z) = −

1 π

D

d 2 z z − z

is continuous across the boundary and holomorphic for z ∈ Dc while for z ∈ D the function ∂z + z¯ is holomorphic. If the boundary is an analytic curve, both these functions can be analytically continued outside the regions where they were originally defined, and, therefore, there exists a function, S(z), analytic in some strip-like neighborhood of the boundary contour, such that S(z) = z¯ on the contour. In other words, S(z) is the analytic continuation of z¯ away from the boundary contour, this function completely determines the shape of the boundary and is called the Schwarz function [11]. In general we are going to work with smooth curves, not necessarily analytic, when the Schwarz function does not exist as an analytic function. Nevertheless, it appears to be useful below to define the class of boundary contours with nice algebro-geometric properties. The basic fact of the theory of deformations of closed smooth curves is that the (in general complex) moments {tk , t¯k } ≡ {t±k } supplemented by the real variable t0 form a set of local coordinates in the “moduli space” of smooth closed curves [12] (see also [13]).

6

I. Krichever, A. Marshakov, A. Zabrodin

Important remark . This means that: (a) under any small deformation of the domain the set t = {t0 , t±k } is subject to a small change; (b) on the space of smooth closed curves there exist vector fields ∂tk such that ∂tk tn = δkn , which are represented in terms of infinitesimal normal displacements of the boundary that change xk = Re tk or yk = Im tk keeping all the other moments fixed; (c) the corresponding infinitesimal displacements can be locally integrated. The latter means that for each domain Dc with moments {t0 , t±k } and for an arbitrary integer N there exist constants m , |m| ≤ N , such that for } with |t − t | < , m ≤ N, t = t , |m| > N , in the neighborhood any set {t0 , t±k m m m m m }. We adopt this restricted c of D there is a unique domain with the moments {t0 , t±k notion of the local coordinates throughout the paper. It would be very interesting to find conditions on the infinite sets k for the corresponding rectangles to form an open set in an infinite-dimensional variety of smooth curves. We plan to address this problem elsewhere. Let us present a proof of this statement which later will be easily adjusted to the case of multiply-connected domains. At the same time this proof allows one to derive a deformation of the domain with respect to the variables tk . Suppose there is a one-parametric deformation D(t) (with some real parameter t) of D = D(0) such that all tk are preserved: ∂t tk = 0, k ≥ 0. Let us prove that such a deformation is trivial. The proof is based on two key observations: • The difference of the boundary values ∂t C ± (ζ )dζ of the derivative of the Cauchy integral C(z)dz =

dz 2πi

∂D

ζ¯ dζ ζ −z

(2.6)

is a purely imaginary differential on the boundary of D. Indeed, let ζ (σ, t) be a parameterization of the curve ∂D(t). Denote the value of the differential (2.6) by C − (z)dz for z ∈ Dc and by C + (z)dz for z ∈ D. Taking the t-derivative of (2.6) and integrating by parts one gets

ζ¯t ζσ + ζ¯ ζt, σ ζ¯ ζσ ζt − ζ −z (ζ − z)2 ∂D ¯ dz ζt ζσ − ζ¯σ ζt = dσ . 2πi ∂D ζ −z

dz ∂t C(z)dz = 2πi

dσ (2.7)

Hence,

∂t C + (ζ ) − ∂t C − (ζ ) dζ = ∂t ζ¯ dζ − ∂t ζ d ζ¯ = 2iIm ∂t ζ¯ dζ

is indeed purely imaginary. • If a t-deformation preserves all the moments tk , k ≥ 0, the differential ∂t ζ¯ dζ − ∂t ζ d ζ¯ extends to a holomorphic differential in Dc . If |z| < |ζ | for all ζ ∈ ∂D, then we can expand: ∂ ∂t C (z)dz = ∂t +

∞ dz k z ζ −k−1 ζ¯ dζ 2πi ∂D k=0

=

∞ k=1

k (∂t tk ) zk−1 dz = 0

(2.8)

Integrable Structure of the Dirichlet Boundary Problem

7

and, since C + is analytic in D, we conclude that ∂t C + ≡ 0. The expression ∂t ζ¯ dζ − ∂t ζ d ζ¯ is the boundary value of the differential −∂t C − (z)dz which has at most simple pole at the infinity and holomorphic everywhere else in Dc . The equality 1 (∂t ζ¯ dζ − ∂t ζ d ζ¯ ) = 0 ∂t t 0 = 2πi ∂D then implies that the residue at z = ∞ vanishes, therefore ∂t C − (z)dz is holomorphic. Any holomorphic differential which is purely imaginary along the boundary of a simply-connected domain must be zero in this domain. Indeed, the real part of the harmonic continuation of the integral of this differential is a harmonic function with a constant boundary value. Such a function must be constant by virtue of the uniqueness of the solution to the Dirichlet problem. Another proof relies on the Schwarz symmetry principle and the standard Schottky double construction (see the next section for details). Consider the compact Riemann surface obtained by attaching to Dc its complex conjugated copy along the boundary. Since ∂t C − dz is imaginary along the boundary, we conclude, from the Schwarz symmetry principle, that ∂t C − dz extends to a globally defined holomorphic differential on this compact Riemann surface, which has genus zero. Therefore, such a differential is equal to zero. Hence we conclude that ∂t ζ¯ dζ − ∂t ζ d ζ¯ = 0. This means that the vector ∂t ζ is tangent to the boundary. Without loss of generality we can always assume that a parameterization of ∂D(t) is chosen so that ∂t ζ (σ, t) is normal to the boundary. Thus, the t-deformation of the boundary preserving all harmonic moments is trivial. The fact that the set of harmonic moments is not overcomplete follows from the explicit construction of vector fields in the space of domains that changes any harmonic moment keeping all the others fixed (see below). 2.2. Elementary deformations and the operator ∇(z). Fix a point z ∈ Dc and consider a special infinitesimal deformation of the domain such that the normal displacement of the boundary is proportional to the gradient of the Green function G(z, ξ ) at the boundary point (Fig. 2): δn(ξ ) = − ∂n G(z, ξ ) . 2

(2.9)

For any sufficiently smooth initial boundary this deformation is well-defined as → 0. We call infinitesimal deformations from this family, parametrized by z ∈ Dc , the elementary deformations. The point z is referred to as the base point of the deformation. Note that since ∂n G < 0 (see the remark after the definition of the Green function in the Introduction), δn for the elementary deformations is either strictly positive or strictly negative depending of the sign of . Let δz be a variation of any quantity under the elementary deformation with the base point z. It is easy to see that δz t0 = , δz tk = z−k /k. Indeed, 1 δn(ξ )|dξ | = − ∂n G(z, ξ )|dξ | = , δz t0 = π 2π (2.10) 1 −k −k −k δz tk = ξ δn(ξ )|dξ | = − ξ ∂n G(z, ξ )|dξ | = z πk 2πk k

8

I. Krichever, A. Marshakov, A. Zabrodin

z

Fig. 2. The elementary deformation with the base point z

by virtue of the Dirichlet formula (1.1). Note that the elementary deformation with the base point at ∞ keeps all moments except t0 fixed. Therefore, the deformation which changes only t0 is given by δn(ξ ) = − 2 ∂n G(∞, ξ ). Now we can explicitly define the deformations that change only either xk = Re tk or yk = Im tk keeping all other moments fixed. As is clear from (2.10), the corresponding δn(ξ ) is given by the real or imaginary part of normal derivative of the function 1 Hk (ξ ) = zk ∂z G(z, ξ )dz (2.11) 2πi ∞ at the boundary. Here the contour integral goes around infinity. Namely, the normal displacements δn(ξ ) = Re (∂n Hk (ξ )) and δn(ξ ) = Im (∂n Hk (ξ )) change the real and imaginary part of tk by ± respectively keeping all other moments fixed. These deformations allow one to introduce the vector fields ∂ , ∂t0

∂ , ∂xk

∂ ∂yk

in the space of domains which are locally well-defined. Existence of such vector fields means that the variables tk are independent. For k > 0 it is more convenient to use their linear combinations ∂ ∂ ∂ ∂ 1 ∂ 1 ∂ , = −i = +i ∂tk 2 ∂xk ∂yk ∂ t¯k 2 ∂xk ∂yk which span the complexified tangent space to the space of simply-connected domains (with fixed area t0 ). If X is any functional of our domain locally representable as a function of harmonic moments, X = X(t), the vector fields ∂t0 , ∂tk , ∂t¯k can be understood as partial derivatives acting to the function X(t). Consider the variation δz X of a functional X = X(t) under the elementary deformation with the base point z. In the leading order in we have: δz X =

∂X k

∂tk

δz tk = ∇(z)X

(2.12)

Integrable Structure of the Dirichlet Boundary Problem

where the differential operator ∇(z) is given by z−k z¯ −k ∇(z) = ∂t0 + ∂tk + ∂t¯k . k k

9

(2.13)

k≥1

The right-hand side suggests that for functionals X such that the series ∇(z)X converges everywhere in Dc up to the boundary, δz X is a harmonic function of the base point z. Note that in [2] we have used the “bump” deformation and continued it harmonically into Dc . In fact, it was the elementary deformation (2.10) δz ∝ |dξ |∂n G(z, ξ )δ bump (ξ ) that was really used. The “bump” deformation should be understood as a (carefully taken) limit of δz when the point z tends to the boundary ∂Dc . 2.3. The Hadamard formula as integrability condition. Variation of the Green function under small deformations of the domain is known due to Hadamard, see Eq. (1.4). To find how the Green function changes under small variations of the harmonic moments, we fix three points a, b, c ∈ C \ D and compute δc G(a, b) by means of the Hadamard formula (1.4). Using (2.12), one can identify the result with the action of the vector field ∇(c) on the Green function: 1 ∇(c)G(a, b) = − ∂n G(a, ξ )∂n G(b, ξ )∂n G(c, ξ )|dξ | . (2.14) 4π ∂D Remarkably, the r.h.s. of (2.14) is symmetric in all three arguments, i.e. ∇(a)G(b, c) = ∇(b)G(c, a) = ∇(c)G(a, b) .

(2.15)

This is the key relation which allows one to represent the Dirichlet problem as an integrable hierarchy of non-linear differential equations [2], (2.15) being the integrability condition of the hierarchy. It follows from (2.15) (see [2] for details) that there exists a function F = F (t) such that 1 1 1 G(z, z ) = log − + ∇(z)∇(z )F . (2.16) z z 2 We note that existence of such a representation of the Green function was first conjectured by Takhtajan. For the simply-connected case, this formula was obtained in [14] (see also [13] for a detailed proof and discussion). The function F is (logarithm of) the tau-function of the integrable hierarchy. In [14] it was called the tau-function of the (real analytic) curves – the boundary contours ∂D or ∂Dc . 2.4. Dispersionless Hirota equations. Combining (2.16) and (1.2), we obtain the relation w(z) − w(z ) 2 1 1 2 log = log − + ∇(z)∇(z )F (2.17) z z w(z)w(z ) − 1 which implies an infinite hierarchy of differential equations on the function F . It is convenient to normalize the conformal map w(z) by the conditions that w(∞) = ∞ and ∂z w(∞) is real, so that z (2.18) w(z) = + O(1) as z → ∞ , r

10

I. Krichever, A. Marshakov, A. Zabrodin

where the real number r = limz→∞ dz/dw(z) is called the (external) conformal radius of the domain D (equivalently, it can be defined through the Green function as log r = limz→∞ (G(z, ∞) + log |z|), see [15]). Then, tending z → ∞ in (2.17), one gets log |w(z)|2 = log |z|2 − ∂t0 ∇(z)F .

(2.19)

The limit z → ∞ of this equality yields a simple formula for the conformal radius: log r 2 = ∂t20 F .

(2.20)

Let us now separate holomorphic and antiholomorphic parts of these equations, introducing the holomorphic and antiholomorphic parts of the operator ∇(z) (2.13): D(z) =

z−k k≥1

k

∂tk ,

¯ z) = D(¯

z¯ −k k≥1

k

∂t¯k .

(2.21)

Rewrite (2.17) in the form w(z) − w(z ) 1 1 1 log − − ∂t + D(z) ∇(z )F − log z z 2 0 w(z)w(z ) − 1

1 w(z) − w(z ) 1 1 ¯ = − log + log − + ∂t + D(¯z) ∇(z )F . z¯ z¯ 2 0 w(z )w(z) − 1 The l.h.s. is a holomorphic function of z while the r.h.s. is antiholomorphic. Therefore, both are equal to a z-independent term which can be found from the limit z → ∞. As a result, we obtain the equation w(z) − w(z ) z log = log 1 − (2.22) + D(z)∇(z )F z w(z) − (w(z ))−1 which, as z → ∞, turns into the formula for the conformal map w(z): 1 log w(z) = log z − ∂t20 F − ∂t0 D(z)F 2

(2.23)

(here we also used (2.20)). Proceeding in a similar way, one can rearrange (2.22) in order to write it separately for holomorphic and antiholomorphic parts in z : log

w(z) − w(z ) 1 = − ∂t20 F + D(z)D(z )F , z − z 2

− log 1 −

1 w(z)w(z )

¯ z )F . = D(z)D(¯

(2.24)

(2.25)

Writing down Eqs. (2.24) for the pairs of points (a, b), (b, c) and (c, a) and summing up the exponentials of both sides of each equation one arrives at the relation (a − b)eD(a)D(b)F + (b − c)eD(b)D(c)F + (c − a)eD(c)D(a)F = 0

(2.26)

which is the dispersionless Hirota equation (for the KP part of the two-dimensional Toda lattice hierarchy) written in the symmetric form. This equation can be regarded as

Integrable Structure of the Dirichlet Boundary Problem

11

a very degenerate case of the trisecant Fay identity [9]. It encodes the algebraic relations between the second order derivatives of the function F . As c → ∞, we get these relations in a more explicit but less symmetric form: 1 − eD(a)D(b)F =

D(a) − D(b) ∂t1 F a−b

(2.27)

which makes it clear that the totality of second derivatives Fij := ∂ti ∂tj F are expressed through the derivatives with one of the indices put equal to unity. More general equations of the dispersionless Toda hierarchy obtained in a similar way by combining Eqs. (2.23), (2.24) and (2.25) include derivatives w.r.t. t0 and t¯k : (a − b)eD(a)D(b)F = ae−∂t0 D(a)F − be−∂t0 D(b)F , ¯

1 − e−D(z)D(¯z)F =

1 ∂t ∇(z)F . e 0 z¯z

(2.28) (2.29)

These equations allow one to express the second derivatives ∂tm ∂tn F , ∂tm ∂t¯n F with m, n ≥ 1 through the derivatives ∂t0 ∂tk F , ∂t0 ∂t¯k F . In particular, the dispersionless Toda equation, 2

∂t1 ∂t¯1 F = e∂t0 F

(2.30)

which follows from (2.29) as z → ∞, expresses ∂t1 ∂t¯1 F through ∂t20 F . For a comprehensive exposition of Hirota equations for dispersionless KP and Toda hierarchies we refer the reader to [16, 17]. 2.5. Integral representation of the tau-function. Equation (2.16) allows one to obtain a ˜ representation of the tau-function as a double integral over the domain D. Set (z) := ∇(z)F . One is able to determine this function via its variation under the elementary deformation: ˜ δa (z) (2.31) = −2 log a −1 − z−1 + 2G(a, z) ˜ with the which is read from Eq. (2.16) by virtue of (2.12). This allows one to identify ˜ “modified potential” (z) = (z) − (0) + t0 log |z|2 , where is given by (2.4). Thus we can write vk 2 ˜ ∇(z)F = (z) = − z−k . (2.32) log |z−1 − ζ −1 |d 2 ζ = v0 + 2Re π D k k>0

The last equality is to be understood as the Taylor expansion around infinity. The coefficients vk are moments of the interior domain (the “dual” harmonic moments) defined as 1 2 vk = zk d 2 z (k > 0) , v0 = −(0) = log |z|d 2 z . (2.33) π D π D From (2.32) it is clear that vk = ∂tk F ,

k ≥ 0,

(2.34)

12

I. Krichever, A. Marshakov, A. Zabrodin

i.e., the moments of the complementary domain D (the “dual” moments) are completely determined by the function F of harmonic moments of Dc . In a similar manner, one arrives at the integral representation of the tau-function. Comparing (2.32) with (2.31) one can easily notice that the meaning of the elementary deformation δξ or the operator ∇(ξ ) formally applied at the boundary point ξ ∈ ∂D (where G(z, ξ ) = 0) is attaching a “small piece” to the integral over the domain D (the “bump” operator from [2]). Using this fact and interpreting (2.32) as a variation δz F we arrive at the double-integral representation of the tau-function 1 F =− 2 log |z−1 − ζ −1 |d 2 zd 2 ζ (2.35) π D D or F =

1 2π

D

2 ˜ (z)d z=

1 2π

((z) − 2(0)) d 2 z .

(2.36)

D

As we see below, the main formulas from this paragraph remain intact in the multiplyconnected case. 3. The Dirichlet Problem and the Tau-Function in the Multiply-Connected Case Let now Dα , α = 0, 1, . . . , g, be a collection of g + 1 non-intersecting bounded cong nected domains in the complex plane with smooth boundaries ∂Dα . Set D = ∪α=0 Dα , c so that the complement D = C \ D becomes a multiply-connected unbounded domain in the complex plane (see Fig. 3). Let bα be the boundary curves. They are assumed to g be positively oriented as boundaries of Dc , so that ∪α=0 bα = ∂Dc while bα = −∂Dα has the clockwise orientation. Comparing to the simply-connected case, nothing is changed in posing the standard Dirichlet problem. The definition of the Green function and the formula (1.1) for the solution of the Dirichlet problem through the Green function remain to be the same. A difference is in the nature of harmonic functions. Any harmonic function is still the real part of an analytic function but in the multiply-connected case these analytic functions are not necessarily single-valued (only their real parts have to be single-valued). In other words, the harmonic functions may have non-zero “periods” over non-trivial cycles1 . In our case, the non-trivial cycles are the boundary curves bα . In general, the Green function has non-zero “periods” over all boundary contours. Hence it is natural to introduce new objects, specific to the multiply-connected case, which are defined as “periods” of the Green function. First, the harmonic measure ωα (z) of the boundary component bα is the harmonic function in Dc such that it is equal to unity on bα and vanishes on the other boundary curves. Thus the harmonic measure is the solution to the particular Dirichlet problem. From the general formula (1.1) we conclude that 1 ωα (z) = − ∂n G(z, ζ )|dζ |, α = 1, . . . , g (3.1) 2π bα so the harmonic measure is the period of the Green function w.r.t. one of its arguments. From the maximum principle for harmonic functions it follows that 0 < ωα (z) < 1 1

Here and below by “periods” of a harmonic function f we mean the integrals trivial cycles.

∂n f dl over non-

Integrable Structure of the Dirichlet Boundary Problem

13

D2 b2

z2

D0 z1 b1

z =0 0

b0

D1

z

3

C\D b3

D3

Fig. 3. A multiply-connected domain Dc = C \ D for g = 3. The domain D = 3α=0 Dα consists of g + 1 = 4 disconnected parts Dα with the boundaries bα . To define the complete set of harmonic moments, we also need the auxiliary points zα ∈ Dα which should be always located inside the corresponding domains

g in internal points. Obviously, α=0 ωα (z) = 1. In what follows we consider the linear independent functions ωα (z) with α = 1, . . . , g. Further, taking “periods” in the remaining variable, we define 1 αβ = − ∂n ωα (ζ )|dζ |, α, β = 1, . . . , g . (3.2) 2π bβ The matrix αβ is known to be symmetric, non-degenerate and positively-definite. It will be clear below that the matrix Tαβ = iπ αβ can be identified with the matrix of periods of holomorphic differentials on the Schottky double of the domain Dc (see formula (3.5)). For brevity, we refer to both Tαβ and αβ as period matrices. For the harmonic measure and the period matrix there are variational formulas similar to the Hadamard formula (1.4). They can be derived either by a direct variation of (3.1) and (3.2) using the Hadamard formula or (much easier) by a “pictorial” argument like in Fig. 1. The formulas are: 1 δωα (z) = ∂n G(z, ξ ) ∂n ωα (ξ ) δn(ξ ) |dξ | , (3.3) 2π ∂D δ αβ

1 = 2π

∂n ωα (ξ ) ∂n ωβ (ξ ) δn(ξ ) |dξ | .

(3.4)

∂D

3.1. The Schottky double. It is customary to associate with a planar multiply-connected domain its Schottky double (see, e.g., [18], Ch. 2.2), a compact Riemann surface without

14

I. Krichever, A. Marshakov, A. Zabrodin

D0

ξ0 aα ξ

b

α

α

c

D =C\D

Dα

Fig. 4. The domain Dc with the aα -cycle, going one way along the “upper sheet” and back along the “lower sheet” of the Schottky double of Dc . For such a choice one clearly gets the intersection form aα ◦ bβ = δαβ for α, β = 1, . . . , g.

boundary endowed with antiholomorpic involution, the boundary of the initial domain being the set of the fixed points of the involution. The Schottky double of the multiply-connected domain Dc can be thought of as two copies of Dc (“upper” and “lower” g sheets of the double) glued along the boundaries ∪α=0 bα = ∂Dc , with points at infin¯ In this set-up the holomorphic coordinate on the upper sheet is ity added (∞ and ∞). z inherited from Dc , while the holomorphic coordinate2 on the other sheet is z¯ . The Schottky double of Dc with two infinities added is a compact Riemann surface of genus g = #{Dα } − 1. A meromorphic function on the double is a pair of meromorphic functions f, f˜ on Dc such that f (z) = f˜(¯z) on the boundary. Similarly, a meromorphic differential on the double is a pair of meromorphic differentials f (z)dz and f˜(¯z)d z¯ such that f (z)dz = f˜(¯z)d z¯ along the boundary curves. On the double, one may choose a canonical basis of cycles. We fix the b-cycles to be just the boundaries of the holes bα for α = 1, . . . , g. Note that regarded as the oriented boundaries of Dc (not D) they have the clockwise orientation. The aα -cycle connects the α th hole with the 0th one. To be more precise, fix points ξα on the boundaries, then the aα cycle starts from ξ0 , goes to ξα on the “upper” (holomorphic) sheet of the double and goes back the same way on the “lower” sheet, where the holomorphic coordinate is z¯ , see Fig. 4. Being harmonic, ωα can be represented as the real part of a holomorphic function: ωα (z) = Wα (z) + Wα (z) , where Wα (z) are holomorphic multivalued functions in Dc . The differentials dWα are holomorphic in Dc and purely imaginary on all boundary contours. So they can be 2 More precisely, the proper coordinates should be 1/z (and 1/¯z), which have first order zeros instead ¯ of poles at z = ∞ (and z¯ = ∞).

Integrable Structure of the Dirichlet Boundary Problem

15

extended holomorphically to the lower sheet as −dWα (z). In fact this is the canonically normalized basis of holomorphic differentials on the double: according to the definitions,

aα

ξα

dWβ =

dWβ (z) +

ξ0

−dWβ (z) = 2Re

ξ0 ξα

ξα

dWβ (z)

ξ0

= ωβ (ξα )−ωβ (ξ0 ) = δαβ . Then the matrix of b-periods of these differentials reads

Tαβ

i = dWβ = − 2 bα

∂n ωβ dl = iπ αβ ,

(3.5)

bα

i.e. the period matrix Tαβ of the Schottky double is a purely imaginary non-degenerate matrix with positively definite imaginary part π αβ (3.2).

3.2. Harmonic moments of multiply-connected domains. One may still use harmonic moments to characterize the shape of a multiply-connected domain. However, the set of harmonic functions should be extended by adding functions with poles in any hole (not only in D0 as before) and functions whose holomorphic parts are not single-valued. To specify them, let us mark points zα ∈ Dα , one in each hole (see Fig. 3). Without loss of generality, it is convenient to put z0 = 0. Then one may consider single-valued 2 analytic functions in Dc of the form (z − zα )−k and harmonic functions log 1 − zzα with multi-valued analytic part. The arguments almost identical to the ones used in the simply-connected case show that the parameters t0 , Mn,α , φα , where as in (2.3) t0 = Area(D)/π , Mn, α = −

1 π

Dc

(z − zα )−n d 2 z,

α = 0, 1, . . . , g, n ≥ 1

(3.6)

together with their complex conjugate, and 1 φα = − π

zα 2 2 log 1 − d z, z Dc

α = 1, . . . , g

(3.7)

uniquely define Dc , i.e. any deformation preserving these parameters is trivial. Note that the extra moments φα are essentially the values of the potential (2.4) at the points zα , φα = (0) − (zα ) − |zα |2 .

(3.8)

A crucial step for what follows is the change of variables from Mn,α to the variables τk which are finite linear combinations of the Mn,α ’s. They can be directly defined as moments with respect to new basis of functions: τ0 = t0 , τk =

1 2πi

Ak (z)¯zdz = − ∂D

1 π

Dc

d 2 zAk (z), k > 0 .

(3.9)

16

I. Krichever, A. Marshakov, A. Zabrodin

The functions Ak (z) are analogous to the Laurent-Fourier type basis on Riemann surfaces introduced in [8]. They are explicitly defined by the following formulas (here the indices α and β are understood modulo g + 1): Am(g+1)+α = R −m (z)

α−1

(z − zβ )−1 ,

β=0

R(z) =

g

(z − zβ ) .

(3.10)

β=0

In a neighbourhood of infinity Ak (z) = z−k + O(z−k−1 ). Any analytic function in Dc vanishing at infinity can be represented as a linear combination of Ak which is convergent in domains such that |R(z)| > const. In the case of one hole (g = 0) the formulas (3.10) give the basis used in the previous section: Ak = z−k . Note that A0 = 1, A1 = 1/z, therefore τ0 = t0 and τ1 = M1,0 = t1 . 3.3. Local coordinates in the space of multiply-connected domains. Now we are going to prove that the parameters τk , φα can be treated as local coordinates in the space of multiply-connected domains. (Here we use the same restricted notion of the local coordinates, as in the simply connected case (see the remark in Sect.2)). It is instructive and simpler first to prove this for another choice of parameters. Instead of φα one may use the areas of the holes 1 Area(Dα ) 1 = d 2z = z¯ dz , α = 1, . . . , g . (3.11) sα = π π Dα 2πi ∂Dα In order to prove that any deformation that preserves τk and sα is trivial, we introduce the basis of differentials dBk which satisfy the defining “orthonormality” relations 1 Ak dBk = δk,k (3.12) 2πi ∂D for all integer k, k . It is easy to see that explicitly they are given by: dBm(g+1)+α =

α−1 dzR m (z) (z − zβ−1 ) , z − zg

(3.13)

β=0

where we identify z−1 ≡ zg . The existence of a well-defined “dual” basis of differentials obeying the orthonormality relation is the key feature of the basis functions Ak , which makes τk good local coordinates comparing to the Mn,α . For the functions (z − zα )−n one cannot define the dual basis. The summation formulas ∞

dzdζ = dζ An (ζ )dBn (z), |R(z)| < |R(ζ )| , ζ −z n=1 ∞

dzdζ dζ A−n (ζ )dB−n (z), |R(z)| > |R(ζ )| , =− ζ −z n=0

(3.14)

Integrable Structure of the Dirichlet Boundary Problem

17

which can be checked directly, allow us to repeat arguments of Sect. 2. Indeed, the Cauchy integral (2.6), dz C(z)dz = 2πi

ζ¯ dζ , ζ −z

∂D

(3.15)

where the integration now goes along all boundary components, defines in each of the holes Dα analytic differentials C α (z)dz (analogs of C + (z)dz in the simply-connected case). In the complementary domain Dc the Cauchy integral still defines the differential C − (z)dz holomorphic everywhere in Dc except for infinity where it has a simple pole. The difference of the boundary values of the Cauchy integral is equal to z¯ : C α (z) − C − (z) = z¯ ,

z ∈ ∂Dα .

From Eq. (2.7), which can be written separately for each contour, it follows that • The difference of the boundary values

∂t C α (ζ ) − ∂t C − (ζ ) dζ of the derivative of the Cauchy integral (3.15) is, for all α, a purely imaginary differential on the boundary bα . The expansion (3.14) of the Cauchy kernel implies that • If a t-deformation preserves all the moments τk , k ≥ 0, then ∂t ζ¯ dζ − ∂t ζ d ζ¯ extends to a holomorphic differential in Dc . Indeed, since |R(z)| is small for z close enough to any of the points zα , one can expand ∂t C α (z) for any α as ∂t C α (z)dz =

∞ ∞ 1 dBk ∂t Ak (ζ )ζ¯ dζ = ∂t τk dBk (z) , 2πi ∂D k=1

(3.16)

k=1

and conclude that it is identically zero provided ∂t τk = 0. Hence −∂t C − (z)dz is the desired extension of ∂t ζ¯ dζ −∂t ζ d ζ¯ . It has no pole at infinity due to the equation ∂t τ0 = 0. Using the Schwarz symmetry principle we obtain that ∂t C − (z)dz extends to a holomorphic differential on the Schottky double. If the variables sα are also preserved under the t-deformation, then this holomorphic differential has zero periods along all the cycles bα . Therefore, it is identically zero. This completes the proof of the statement that any deformation of the domain preserving τk and sα is trivial. In this proof the variables sα were used only at the last moment in order to show that the extension of ∂t C − (z)dz as a holomorphic differential on the Schottky double is trivial. The variables φα can be used in a similar way. Namely, let us show that if they are preserved under t-deformation then aα -periods of the extension of ∂t C − (z)dz are trivial, and therefore this extension is identically zero. Indeed, the variable φα (3.7) can be represented in the form 2 φα = − Re π

zα

dz 0

Dc

d 2ζ . z−ζ

(3.17)

18

I. Krichever, A. Marshakov, A. Zabrodin

d2ζ α The differential dz z + C − (z))dz for π Dc z−ζ is equal to C (z)dz for z ∈ Dα and (¯ c z ∈ D . Let ξ0 , ξα be the points where the integration path from 0 to zα intersects the boundary contours b0 , bα . Then ξ0 zα ξα C 0 (z)dz + C α (z)dz + (¯z + C − (z))dz . (3.18) φα = −2 Re 0

ξα

ξ0

It is shown above that if a t-deformation preserves the variables τk then all ∂t C α (z)dz = 0. Thus vanishing of the t-derivative ∂t φα = 0 implies ξα ∂t C − (z)dz . (3.19) 0 = −∂t φα = 2 Re ξ0

The r.h.s. of this equation is just the aα -period of the holomorphic extension of the differential ∂t C − (z)dz. φ φ Let us construct the deformations ∂xk and ∂yk of the boundary that change the real or imaginary parts of the variable τk = xk + iyk , k ≥ 1, keeping all the other moments φ φ φ and the variables φα fixed. It is convenient to set ∂τk = 21 (∂xk − i∂yk ). The argument is similar to the proof of the fact that any deformation that preserves all the variables is trivial. φ φ φ φ • Suppose that the deformations ∂xk and ∂yk exist. Then the differential ∂τk ζ¯ dζ −∂τk ζ d ζ¯ c extends from ∂D to the Schottky double . Its extension is a meromorphic differential φ d k with the only pole at the infinity point ∞ on the upper sheet. In a neighborhood of ∞ it has the form

d k (z) = dBk (z) + O(z−2 )dz . φ

(3.20)

φ

The a-periods of d k are equal to φ d k = aα

zα

(3.21)

dBk .

0 φ

First of all, it is clear that the meromorphic differential d k on is uniquely defined by its asymptotics at ∞ and by the normalization (3.21) of its a-periods. To deduce these properties, we notice, using (3.16), that ∂xk C α (z)dz = dBk (z). Therefore, the φ φ differential ∂xk ζ¯ dζ − ∂xk ζ d ζ¯ extends to Dc as d φxk = −∂xk C − (z)dz + dBk .

(3.22)

Using the Schwarz symmetry principle we conclude that it extends to the Schottky douφ ble as a meromorphic differential. Around the two infinities it has the form d xk =

z→∞

dBk + O(z−2 )dz and d xk φ

−d B¯ k + O(¯z−2 )d z¯ . In the same way one gets that

=

¯ z¯ →∞

φ φ the differential ∂yn ζ¯ dζ − ∂yn ζ d ζ¯ extends to the double as a meromorphic differenφ φ tial d yk , which at the two infinities has the form d yk = idBk + O(z−2 )dz and φ d yk

=

¯ z¯ →∞

id B¯ k + O(¯z−2 )d z¯ respectively. Since φ

2d k = d φxk − id φyk ,

z→∞

Integrable Structure of the Dirichlet Boundary Problem

19

φ

the first statement is proven. From ∂xk φα = 0 and (3.17), (3.22) it follows that ξ0 zα ξα

φ 0 = Re −d xk + dBk + dBk + dBk . 0

Hence

ξ0

aα

In the same way one gets

ξα

d φxk = 2Re

aα

zα

dBk . 0

d φyk

= −2Im

zα

dBk . 0

The last two equations are equivalent to (3.21). Normal displacement of the boundary that accomplishes the deformations can be explicitly found using the following elementary proposition: • Let D(t) be a deformation with real parameter t such that the differential d = ∂t ζ¯ dζ − ∂t ζ d ζ¯ extends to a meromorphic differential d globally defined on the Schottky double . Then the corresponding z normal displacement of the boundary is proportional to normal derivative of Re d at the boundary point ξ : ξ 1 δn(ξ ) = δt ∂n Re d . (3.23) 2 Conversely, if δn(ξ ) = 21 δt ∂n H (ξ ), where H is a real-valued function such that dH = 0 along the boundary contours and ∂z H is meromorphic in Dc then the differential ∂t ζ¯ dζ − ∂t ζ d ζ¯ is meromorphically extendable to the Schottky double as 2∂z H dz on the upper sheet and −2∂z¯ H d z¯ on the lower sheet. In our case normal displacements of the boundary that change xk or yk keeping all the other moments and the variables φα fixed are thus given by ξ ξ 1 1 δn(ξ ) = δxk ∂n Re (3.24) d φxk , δn(ξ ) = δyk ∂n Re d φyk 2 2 respectively. Note that since the differentials d xk (z), d yk (z) (but not d k (z)!) are purely imaginary on the boundaries, d Re xk (z) = d Re yk (z) = 0 along each component of the boundary. With formulas (3.24) at hand, one can directly verify that these deformations indeed change xk or yk only and keep fixed all other moments. We leave this to the reader. φ In terms of the differential d k formulas (3.24) acquire the form ξ ξ φ φ δn(ξ ) = δxk ∂n Re (3.25) d k , δn(ξ ) = −δyk ∂n Im d k (cf. (2.11) for the simply-connected case). Indeed, taking the real part of 2 k (ξ ) = xk (ξ ) − i yk (ξ ), we get 2 Re k (ξ ) = Re xk (ξ ) + Im yk (ξ ). But the normal derivative of Im yk (ξ ) vanishes since, by virtue of the Cauchy-Riemann identities, it is equal

20

I. Krichever, A. Marshakov, A. Zabrodin

to the tangential derivative of the conjugate harmonic function Re yk (ξ ). This proves the first formula in (3.25). The second one is proven in a similar way by taking imaginary part of 2 k (ξ ). φ φ The construction of the vector fields ∂τ0 (which changes τ0 only) and ∂α (which changes φα only) is quite similar and even simpler since the derivative (3.16) vanishes. So, we present the results without going into details. φ

• The deformation ∂τ0 corresponds to the normal displacement 1 δn(ξ ) = − δτ0 ∂n G(∞, ξ ) . 2 φ φ The differential −(∂τ0 ζ¯ dζ − ∂τ0 ζ d ζ¯ ) extends from ∂Dc to the Schottky double . Its extension is a meromorphic third-kind Abelian differential d 0 which has simple poles at the infinities on the two sheets of the Schottky double (with residues ±1) and vanishing a-periods. φ • The deformation ∂α corresponds to the normal displacement

δn(ξ ) =

1 δφα ∂n ωα (ξ ) , 4

where ωα is the harmonic measure of the boundary component bα (see (3.1)). The φ φ differential ∂α ζ¯ dζ − ∂α ζ d ζ¯ holomorphically extends from ∂Dc to the Schottky double . Its extension is the canonically normalized holomorphic differential dWα = ∂z ωα (z)dz on the upper sheet (and dWα = −∂z¯ ωα (z)d z¯ on the lower sheet). φ

φ

φ

φ

So we see that ∂xk , ∂yk , ∂τ0 and ∂α are well-defined vector fields on the space of multiply-connected domains. This fact allows us to treat φα , τk as local coordinates on this space. At this stage it becomes clear why we prefer to use the moments τk rather than Mk,α . Although the latter are finite linear combinations of the former, they can not be treated as local coordinates because the vector fields ∂/∂Mk,α , being in general infinite φ linear combinations of the ∂τk , are not well-defined. 3.4. -variables. Up to now the roles of the variables sα and φα have been in some sense dual to each other. It is necessary to emphasize that this duality does not go beyond the framework of our proof of the statement that the first or the second sets together with the variables τk are local coordinates in the space of multiply-connected domains. For analytic boundary curves one can define the Schwarz function, which is a unique function analytic in some strip-like neighborhoods of all boundaries such that S(z) = z¯

on the boundary curves .

(3.26)

Then the variables sα are b-periods of the differential S(z)dz. At the same time, the variables φα in general can not be identified with periods of this differential (or its extension) over any cycles on the Schottky double. Now we are going to introduce new variables, α , which can be called virtual a-periods of the differential S(z)dz on the Schottky double, since in all the cases when the Schwarz function has a meromorphic extension to the double they indeed coincide with the a-periods of the corresponding differential (see below in this section).

Integrable Structure of the Dirichlet Boundary Problem

Let us consider the differential φ

d k = d k −

21

Bk (zα )dWα

(k ≥ 1) ,

(3.27)

α

where

z

Bk (z) =

dBk

(3.28)

0

is a polynomial of degree k. It is a meromorphic differential on with the only pole at ∞ on the upper sheet, where it has the form d k (z) = dBk (z) + O(z−2 )dz. From (3.21) it is clear that the differential d k has vanishing a-periods d k = 0 ,

(3.29)

(3.30)

aα

i.e. it is a canonically normalized meromorphic differential. The normal displacements of the boundary given by real and imaginary parts of the normal derivative ∂n k define a complex tangent vector field Bk (zα )∂αφ (3.31) ∂τk = ∂τφk − α

to the space of multiply-connected domains. These vector fields keep fixed the formal variable Bk (zα )τk . (3.32) α = φα + 2Re k

In a general situation this variable is only a formal one because the sum generally does not converge. Thus, we call α the virtual a-period of the Schwarz differential S(z)dz, since in the case when the Schwarz function has a meromorphic extension to the double , the sum does converge and the corresponding quantity does coincide with the aα -period of the extension of the Schwarz differential. 3.5. Elementary deformations and the operator ∇(z). Like in Sect. 2, we introduce the elementary deformations δa δ (α)

with δn(ξ ) = − ∂n G(a, ξ ) , a ∈ Dc , 2 with δn(ξ ) = − ∂n ωα (ξ ) α = 1, . . . , g , 2

(3.33)

where ωα (z) is the harmonic measure of the boundary component bα (see (3.1)). The deformations δ (α) were considered in [19] in connection with the so-called quadrature domains [19, 20]. In complete analogy with Sect. 2 one can derive the following formulas for variations of the local coordinates under elementary deformations: zα 2 δa τk = Ak (a), δa φα = log 1− , δ (α) τk = 0, δ (α) φβ = −2δαβ . (3.34) a

22

I. Krichever, A. Marshakov, A. Zabrodin

The first two formulas are particular cases of 2 h(ζ )d ζ = h(ζ )∂n G(a, ζ )|dζ | = −π h(a) δa 2 ∂Dc Dc which is valid for any harmonic function h in Dc (the last equality is just the formula for solution of the Dirichlet problem). Similarly, (α) 2 δ h(ζ )d ζ = h∂n ωα |dζ | = ∂n h |dζ | = −i ∂ζ h dζ 2 ∂Dc 2 bα Dc ∂Dα (the Green formula was used), and the last two formulas in (3.34) correspond to the particular choices of h(z). Variations of the variables α (in the case when they are well-defined) then read: δz α = 0,

δ (α) β = −2δαβ .

(3.35)

Therefore, for any functional X on the space of the multiply-connected domains the following equations hold: δz X = ∇(z)X ,

(3.36)

δ (α) X = −2∂αφ X = −2∂α X .

(3.37)

The differential operator ∇(z) in the multiply-connected case is defined by the formula Ak (z)∂τk + Ak (z)∂τ (3.38) ∇(z) = ∂τ0 + ¯k . k≥1

The functional X can be regarded as a function X = Xφ (φα , τk ) on the space of the local coordinates φα , τk , or as a function X = X (α , τk ) on the space of the local coordinates α , τk . We would like to stress once again, that although in the latter case the variables α are formal their variations under elementary deformations and the vector-fields ∂τk , which keep them fixed, are well-defined. For completeness, let us characterize elementary deformations δa in terms of meromorphic differentials on the Schottky double (as we have already seen, deformations δ (α) correspond to holomorphic differentials). • Let ∂t (a) be the vector field in the space of multiply-connected domains corresponding to the elementary deformation δa . Then the differential −(∂t (a) ζ¯ dζ −∂t (a) ζ d ζ¯ ) extends from ∂Dc to the Schottky double . Its extension is a meromorphic third-kind Abelian ¯ which has simple poles at the points a and a differential d (a,a) ¯ on the two sheets of the Schottky double (with residues ±1) and vanishing a-periods. In terms of the Green function we have: 2∂z G(a, z)dz on the upper sheet (a,a) ¯ d = −2∂z¯ G(a, z)dz on the lower sheet (cf. (3.23) and (3.33)). Note that the differential d 0 introduced before coincides with ¯ . d (∞,∞)

Integrable Structure of the Dirichlet Boundary Problem

23

Let K(z, ζ )dζ be a unique meromorphic Abelian differential of the third kind on with simple poles at z and ∞ on the upper sheet with residues ±1 normalized to zero a-periods. (Note that as a function of the variable z it is multi-valued on .) Then 2∂ζ G(z, ζ )dζ − 2∂ζ G(∞, ζ )dζ = K(z, ζ )dζ + K(¯z, ζ )dζ and the differential d k (ζ ) can be represented in the form dζ d k (ζ ) = K(u, ζ )dBk (u) , 2πi ∞

(3.39)

(3.40)

where the u-integration goes along a big circle around infinity. Using (3.14) we obtain that K(u, ζ )du dζ Ak (z)d k (ζ ) = − = K(z, ζ )dζ . (3.41) 2πi ∞ u − z k≥1

Therefore, the following expansion of the derivative of the Green function holds: ¯ k (ζ ) . 2∂ζ G(z, ζ )dζ = d 0 (ζ ) − (3.42) Ak (z)d k (ζ ) + Ak (z)d k≥1

¯ k is a unique meromorphic differential on with the only pole at infinity on Here d the lower sheet with the principal part −dBk (z) and vanishing a-periods. This formula generalizes Eq. (3.8) from [2] to the multiply-connected case.

3.6. The F -function. Applying the variational formulas (1.4), (3.3), (3.4), we can find variations of the Green function, harmonic measure and period matrix under the elementary deformations. In this way we obtain a number of important relations which connect elementary deformations of these objects: δa G(b, c) = δb G(c, a) = δc G(a, b) , δa ωα (b) = δ (α) G(a, b) = δb ωα (a) , δ (α) ωβ (z) = δ (β) ωα (z) , δz αβ = δ (α) ωβ (z) , δ (α) βγ = δ (β) γ α = δ (γ ) αβ .

(3.43)

From (3.36), (3.37) it follows that the formulas (3.43) can be rewritten in terms of the differential operators ∇(z) and ∂α := ∂/∂φα = ∂/∂α : ∇(a)G(b, c) = ∇(b)G(c, a) = ∇(c)G(a, b) , ∇(a)ωα (b) = −2∂α G(a, b) , ∂α ωβ (z) = ∂β ωα (z) , ∇(z) αβ = −2∂α ωβ (z) , ∂α βγ = ∂β γ α = ∂γ αβ .

(3.44)

These integrability relations generalize formulas (2.15) to the multiply-connected case. The first line just coincides with (2.15) while the other ones extend the symmetry of the derivatives to the harmonic measure and the period matrix.

24

I. Krichever, A. Marshakov, A. Zabrodin

Again, (3.44) can be regarded as a set of compatibility conditions of an infinite hierarchy of differential equations. They imply that there exists a function F = F (α , τ ) such that 1 G(a, b) = log a −1 − b−1 + ∇(a)∇(b)F , (3.45) 2 ωα (z) = − ∂α ∇(z)F ,

(3.46)

Tαβ = iπ αβ = 2πi ∂α ∂β F .

(3.47)

The function F is the (logarithm of the) tau-function of multiply-connected domains.

3.7. Dual moments and integral representation of the tau-function. To obtain the integral representation of the function F , we proceed exactly in the same manner as in Sect. 2 (see also [2] for more details). ˜ ˜ Again, set (z) = ∇(z)F . Equations (3.45) and (3.46) determine the function (z) for z ∈ Dc via its variations under the elementary deformations: ˜ δa (z) = −2 log a −1 − z−1 + 2G(a, z) , (3.48) ˜ δ (α) (z) = 2ωα (z) . It is easy to verify that the function 2 ˜ (z) =− log |z−1 − ζ −1 |d 2 ζ = (z) − (0) + τ0 log |z|2 π D

(3.49)

satisfies (3.48). Indeed, using (3.33), variation of (3.49) reads 2 −1 −1 2 δa − log |z − ζ |d ζ = |dξ |∂n G(a, ξ ) log |z−1 − ξ −1 | π D π ∂Dc = |dξ |∂n G(a, ξ ) log |z−1 − ξ −1 | − G(z, ξ ) π ∂Dc = −2 log a −1 − z−1 + 2G(a, z) , where we have used properties of the Dirichlet Green function and the fact that the Dirichlet formula restores harmonic function from its value at the boundary. Similarly, for z ∈ Dc we obtain: 2 (α) −1 −1 2 δ − log |z − ζ |d ζ = |dξ |∂n ωα (ξ ) log |z−1 − ξ −1 | π D π ∂Dc = |dξ |∂n ωα (ξ ) log |z−1 − ξ −1 | − G(z, ξ ) π ∂Dc = |dξ |∂n log |z−1 − ξ −1 | − G(z, ξ ) π bα = 2ωα (z) .

Integrable Structure of the Dirichlet Boundary Problem

The same calculation for z ∈ D yields 0 (α) ˜ δ (z) = 2δαβ

25

if z ∈ D0 . if z ∈ Dβ , β = 1, . . . , g

(3.50)

˜ given by (2.32), where D is We see that the expression in (3.49) coincides with ˜ at infinity now understood as the union of all Dα ’s. The coefficients of an expansion of define the dual moments νk : 2 ˜ ∇(z)F = (z) =− log |z−1 − ζ −1 |d 2 ζ = v0 + 2Re νk Ak (z) . (3.51) π D k>0

The coefficients in the r.h.s. of (3.51) are moments of the union of the interior domains with respect to the dual basis 1 νk = Bk (z)d 2 z . (3.52) π D From Eq. (3.51) it follows that νk = ∂τk F .

(3.53)

The same arguments show that the derivatives sα := − ∂α F

(3.54)

are just areas of the holes (3.11). Indeed, Eqs. (3.46), (3.47) determine these quantities via their variations: δa sα = ωα (a), δ (β) sα = αβ . A direct check, using (3.33), shows that 1 1 2 2 Area(Dα ) ∇(z)F = − log − d ζ , ∂α F = − . (3.55) π D z ζ π For example, 1 (α) 2 ˜ (z)d δ z 2π D 1 ˜ ˜ ) d 2ζ . =− |dξ |∂n ωα (ξ )(ξ ) + δ (α) (ζ 4π ∂Dc 2π D

δ (α) F =

In the last term we use (3.50) and obtain the result: ˜ )+ ˜ d 2 ζ + sα = 2sα δ (α) F = − |dξ |∂n (ξ d 2ζ = − 4π bα π Dα 4π Dα (here = 4∂z ∂z¯ is the Laplace operator). The integral representation of F is found in the same way through its variations which are read from (3.55). The result is given by the same formulas (2.35) and (2.36) as in g the simply-connected case with the understanding that D = ∪α=0 Dα is now the union of all Dα ’s.

26

I. Krichever, A. Marshakov, A. Zabrodin

3.8. Algebraic domains. In what follows we restrict our analysis by the class of algebraic domains. In the simply-connected case dealt with in the previous section the algebraic domains are simply images of the exterior of the unit disk under one-to-one conformal maps given by rational functions whose singularities are all in the other “half” of the plane, i.e. inside the unit circle. Note that the boundary of the unit circle is the set of fixed points of the inversion w → 1/w¯ which is the antiholomorphic involution of the w-plane compactified by a point at infinity (the Riemann sphere). Planar multiply-connected algebraic domains can be defined as the domains for which the Schwartz function has a meromorphic extension to a higher genus Riemann surface (a complex algebraic curve) with antiholomorphic involution. More precisely, let be a real Riemann surface by which we mean a complex algebraic curve of genus g with an antiholomorphic involution such that the set of fixed points consists of exactly g + 1 closed contours (such curves are sometimes called M-curves). Then can be naturally divided in two “halves” (say upper and lower sheets) which are interchanged by the involution. Algebraic domains with g holes in the plane can be defined as images of the upper half of the real Riemann surface under bijective conformal maps given by rational (meromorphic) functions on (see, e.g. [21]). For the purpose of this paper, it is convenient to use another, more direct characterization of algebraic domains. The domain Dc is algebraic if and only if the Cauchy integrals (3.15) 1 ζ¯ dζ α C (z) = for z ∈ Dα 2πi ∂D ζ − z are extendable to a rational (meromorphic) function J (z) on the whole complex plane with a marked point at infinity (see [21]). It is important to stress that this function is required to be the same for all α. The equality S(z) = J (z) − C − (z) valid by definition for z ∈ ∂Dc can be used for analytic extension of the Schwarz function. The function C − (z) is analytic in Dc . Therefore, J (z) and S(z) have the same singular parts at their poles in Dc . One may treat S(z) as a function on the Schottky double extending it to the lower sheet as z¯ . It is also convenient to introduce z J (z)dz (3.56) V (z) = 0

which is multi-valued if J (z) has simple poles (to fix a single-valued branch, we make cuts from ∞ to all simple poles of J (z)). In fact we need only the real part of V (z). In neighborhoods of the points zα one has J (z)dz = τk dBk (z), V (z) = τk Bk (z) . (3.57) k≥1

k≥1

The formula (3.32) α = φα + 2 Re V (zα ),

zα ∈ Dα

(3.58)

shows that for the algebraic domains the variables α , introduced in the general case as formal quantities, are well-defined. It is easy to show that they are equal to the a-periods of the differential S(z)dz on the Schottky double . Indeed, using the fact that C 0 (z)

Integrable Structure of the Dirichlet Boundary Problem

27

and C α (z) represent restrictions of the same function J (z), one can rewrite (3.18) in the form zα ξα φα = −2 Re J (z)dz + (¯z + C − (z) − J (z))dz . 0

ξ0

Under the second integral we recognize the Schwarz function. Combining this equality with the definition of α (3.32), we obtain: ξα ξα α = 2 Re (S(z) − z¯ )dz = S(z)dz . (3.59) S(z)dz − z¯ dS(z) = ξ0

aα

ξ0

As an example of algebraic domains, it is instructive to consider the case when only a finite-number of the moments τk are non-zero. Let AN be the space of multiply-connected domains such that τk = 0,

(3.60)

k>N.

Then the arguments similar to the ones used above show that • S(z) extends to a meromorphic function on with a pole of order N − 1 at ∞ and a ¯ simple pole at ∞. The function z extended to the lower sheet of the Schottky double as S(z) has a simple ¯ For a domain Dc ∈ AN the moments with pole at ∞ and a pole of order N − 1 at ∞. respect to the Laurent basis (cf. (2.1)) 1 tk = − z−k d 2 z (3.61) πk Dc coincide with the coefficients of the expansion of the Schwarz function near ∞: S(z) =

N

ktk zk−1 + O(z−1 ),

z → ∞.

(3.62)

k=1

The normal displacement of the bondary of an algebraic domain, which changes the variable tk keeping all zthe other moments (and α ) fixed is defined by normal derivative ˜ k . Here d ˜ k is a unique normalized meromorphic differential d of the function 2 Re on with the only pole at ∞ of the form k = d(zk + O(z−1 )), k = 0 . d d (3.63) aα

k is well-defined for a generic, not necessarily algebraic Note that the differential d z ˜ k defines a tangent d domain. Therefore, the normal derivative of the function 2 Re vector field ∂t on the whole space of multiply-connected domains. k The space AN is a particular case of algebraic orbits of the universal Whitham hierarchy. In this case the general formula (7.42) from [4] for the τ -function of the Whitham hierarchy, after proper change of the notation, acquires the form 1 1 2F = − τ02 + τ0 v0 + (2 − k)(τk νk + τ¯k ν¯ k ) − α sα 2 2 g

k≥1

α=1

(3.64)

28

I. Krichever, A. Marshakov, A. Zabrodin

which is a quasi-homogeneity condition obeyed by F (compare with formula (5.11) from [2]). Let d 0 be a unique normalized meromorphic differential on with simple poles ¯ Its Abelian integral at the infinities ∞ and ∞. z log w(z) = d 0 (3.65) ξ0

defines in the neighborhood of ∞ a function w(z) which has a simple pole at infinity. The dependence of the inverse function z(w) on the variables tk is described by the Whitham equations for the two-dimensional Toda lattice hierarchy. These equations have the form k (w), z(w)} := ∂t z(w) = { k

k (w) d dz k (w) ∂t0 z(w) − ∂t0 . d log w d log w

(3.66)

Algebraic domains of a more general form correspond to the universal Whitham hierarchy. Let AN1 ,... ,Nl be the space of domains such that the extension of the Schwarz function S(z) to Dc has poles of orders Nj − 1 at some points zj (which possibly ¯ Then, according to [4], the variables include ∞ and ∞). zj 1 t0,j = S(z)dz, tk,j = reszj (z − zj )k−1 S(z)dz, k = 1, . . . , Nj − 1 (3.67) k ξ0 together with the variables sα (or α ) provide a set of local Whitham coordinates on the space AN1 ,... ,Nl . Note that the definition of the algebraic orbits of the universal Whitham hierarchy is a bit more general than the definition of algebraic domains given above. It corresponds to the case when the differential dS of the Schwarz function is extendable to Dc as a meromorphic differential (in [21] such domains are called Abelian domains). (1) For example, let AN be the space of multiply-connected domains such that (1) Tk

=

Ak dS =

∂D

S (z)Ak dz = − ∂D

∂D

Ak S(z)dz = 0,

k > N . (3.68)

This space is characterized by the following property: there are constants Kα such that S(z) + Kα extends to a meromorphic function on Dc with a pole of order N at ∞. The variables 1 Kα , sα , tk = z−k z¯ dz, k ≥ 1, (3.69) 2πik ∂D (1)

are local coordinates on AN . The two cases when S(z) or its derivative S (z) have a meromorphic extension to c D are particular examples of the whole hierarchy of integrable domains, which can be defined in a similar way by the condition that the mth order derivative of the Schwarz function S(z) admits a meromorphic extension to the Schottky double . For example, (2) let AN be the space of multiply-connected domains such that (2)

Tk

= ∂D

S (z)Ak dz = −

∂D

Ak S(z)dz = 0, k > N .

(3.70)

Integrable Structure of the Dirichlet Boundary Problem

29

This space is characterized by the following property: there are linear functions kα (z) = Kα0 + Kα1 z such that S(z) + kα (z) extends to a meromorphic function on Dc with a pole of order N + 1 at ∞. The variables 1 Kα (z), sα , tk = z−k z¯ dz, k ≥ 1, (3.71) 2πik ∂D (2)

(m)

are local coordinates on AN . The other spaces AN with m > 2 can be defined in a similar way. 4. The Duality Transformation The independent variables α (3.59) or φα used in the previous section are not as transparent as the dual variables sα (3.11), which are simply areas of the holes Dα . In this section we show how to pass to the set of independent variables s1 , . . . , sg (3.11) (together with the infinite set of τk ’s). This transformation is similar to the passing from “external” to “internal” moments in the simply-connected case (see Sect. 5 of [2]). The difference is that only a finite number of times are subject to the transformation while the infinite set of τk ’s remains the same. The change to the variables sα can be done in the general case of domains with smooth boundaries. However, it is the change α → sα rather than φα → sα that leads to a transparent duality. Since α ’s are only defined as formal (“virtual”) variables for domains with smooth boundaries, we shall restrict our consideration to the class of algebraic domains discussed at the end of the previous section. In this case the variables α are well defined. 4.1. The Legendre transform. Passing from α to sα is a particular duality transformation which is equivalent to the interchanging of the a and b cycles on the Schottky double . This is achieved by the (partial) Legendre transform F (α , τ ) −→ F˜ (sα , τ ), where F˜ = F +

g

(4.1)

α sα .

α=1

The function F˜ is the “dual” tau-function. Below in this section, it is shown that F˜ solves the modified Dirichlet problem and can be identified with the free energy of a matrix model in the planar large N limit in the case when the support of eigenvalues consists of a few disconnected domains (a so-called multi-support solution, see [22] and references therein). The main properties of F˜follow from those of F . According to (2.34), (3.55) we have dF = − α sα dα + k νk dτk (for brevity, k is assumed to run over all integer values, τ−k ≡ τ¯k , etc.), so d F˜ = α α dsα + k νk dτk . This gives the first order derivatives: ∂ F˜ ∂ F˜ α = , νk = . (4.2) ∂sα ∂τk The second order derivatives are transformed as follows (see e.g. [23]). Set Fαβ =

∂ 2F , ∂α ∂β

Fαk =

∂ 2F , ∂α ∂τk

Fik =

∂ 2F ∂τi ∂τk

30

I. Krichever, A. Marshakov, A. Zabrodin

and similarly for F˜ . Then Fαβ = −(F˜ −1 )αβ , g (F˜ −1 )αγ F˜γ k , Fαk = γ =1

Fik = F˜ik −

(4.3) g

F˜iγ (F˜ −1 )γ γ F˜γ k .

γ ,γ =1

Here (F˜ −1 )αβ means the matrix element of the matrix inverse to the g × g matrix F˜αβ . Using these formulas, it is easy to see that the main properties (3.45), (3.46) and (3.47) of the tau-function are translated to the dual tau-function as follows: 1 ˜ G(a, b) = log |a −1 − b−1 | + ∇(a)∇(b)F˜ , 2

(4.4)

2πi ω˜ α (z) = − ∂sα∇(z)F˜ ,

(4.5)

∂ 2 F˜ 2πi T˜αβ = , ∂sα ∂sβ

(4.6)

where τk -derivatives in ∇(z) are taken at fixed sα . The objects in the left-hand sides of these relations are: ˜ G(a, b) = G(a, b) + iπ

g

ωα (a)T˜αβ ωβ (b) ,

(4.7)

α,β=1

ω˜ α (z) =

g

T˜αβ ωβ (z) ,

(4.8)

β=1

i T˜ = −T −1 = −1 . π

(4.9)

˜ is the Green function of the modified Dirichlet problem to be discussed The function G below. The matrix T˜ is the matrix of a-periods of the holomorphic differentials d W˜ α on the double (so that ω˜ α (z) = W˜ α (z) + W˜ α (z)), normalized with respect to the b-cycles − d W˜ β = δαβ , d W˜ β = T˜αβ , bα

aα

i.e. more precisely, the change of cycles is aα → bα , bα → −aα . An important remark is in order. By a simple rescaling of the independent variables one is able to write the group of relations (4.4)–(4.6) for the function F˜ in exactly the same form as the ones for the function F (3.45)–(3.47), so that they differ merely by the notation. We use this fact in Sect. 5.

Integrable Structure of the Dirichlet Boundary Problem

31

4.2. The modified Dirichlet problem. The modified Green function (4.7) solves the modified Dirichlet problem which can be formulated in the following way. One may eliminate all except for one of the periods of the Green function G, thus making it similar, in this respect, to the Green function of a simply-connected domain (recall that the latter has the non-zero period 2π over the only boundary curve b0 ). This leads to the following modified Dirichlet problem (see e.g. [24]): given a function u0 (z) on the boundary, to find a harmonic function u(z) in Dc such that it is continuous up to the boundary and equals u0 (z) + Cα on the α’s boundary component. Here, Cα ’s are some constants. It is important to stress that they are not given a priori but have to be determined from the condition that the solution u(z) has vanishing periods over the boundaries b1 , . . . , bg . One of these constants can be put equal to zero. We set C0 = 0. This problem also has a unique solution. It is given by the same formula (1.1) in terms ˜ of the modified Green function G(z, ζ ). The definition of the latter is similar to that of the G(z, ζ ) but differs in two respects: ˜ ˜ (G1) G(z, ζ ) is required to have zero periods over the boundaries b1 , . . . , bg ; ˜ ˜ ˜ (G2) The derivative of G(z, ζ ) along the boundary (not G(z, ζ ) itself!) vanishes on the boundary. ˜ Under the condition that G(z, ζ ) = 0 on b0 such a function is unique. The function given by (4.7) just meets these requirements. We conclude that the modified Green function is expressed through the dual tau-function F˜ . Note that variations of the modified Green function under small deformations of the domain are described by the same Hadamard formula (1.4), where each Green function ˜ This follows, after some algebra, from the formula for G ˜ in terms of is replaced by G. G, ωα and αβ . Therefore, all the arguments of Sect. 3 could be repeated in a completely parallel way starting from the modified Dirichlet problem. One may also say that ˜ differ merely by a preferred basis of cycles on the double: the the functions G and G ˜ has vanishing differential ∂z Gdz has vanishing periods over the a-cycles while ∂z Gdz periods over the b-cycles. ˜ The function wa (z) such that G(z, a) = − log |wa (z)| maps Dc onto the exterior of the unit circle which is slit along g concentric circular arcs (see Fig. 5). Since the periods

˜ Fig. 5. The image of a triply-connected domain under the conformal map wa (z) such that G(z, a) = − log |wa (z)|. The function wa maps the domain onto the exterior of the unit (dashed) circle with g = 3 concentric circular cuts. Positions and lengths of the arcs depend on the shape of Dc in Fig. 3 and depend also on the normalization point z = a which is mapped to ∞

32

I. Krichever, A. Marshakov, A. Zabrodin

˜ vanish, the function wa is single-valued. Positions of the arcs depend of the function G on the shape of Dc as well as on the point a which is mapped to ∞. The radii of the arcs, Rα , are expressed through the dual tau-function as log Rα2 = ∂sα∇(a)F˜ .

(4.10)

In particular, for a = ∞ we have log Rα2 = ∂sα∂τ0 F˜ (cf. Eq. (2.20) for the conformal radius).

4.3. Relation to multi-support solutions of matrix models. The partition function of the model3 , written as an integral over eigenvalues, reads:   N N N 1 1 ZN = exp  log |zi − zj |2 + U (zi ) d 2 zj . (4.11) N! i<j

i=1

j =1

The matrix model potential U is usually chosen to be of the form U (z) = −z¯z + V (z) + V (z) ,

(4.12)

where V (z) is at the moment some polynomial; however, we will see immediately that the coincidence of the notation with (3.57) is not accidental. The parameter , in the large N limit, tends to zero simultaneously with N → ∞ in such a way that t0 = N is kept finite and fixed. In the leading order, one can apply the saddle point method. The saddle point condition is

N j =1,=i

1 + ∂zi U (zi ) = 0 . zi − z j

The density of eigenvalues is a sum of two-dimensional delta-functions: ρ(z) = π δ (2) (z − zi ) . i

In the large N limit, one treats it as a continuous function normalized as t0 . In terms of the continuous density the saddle point equation reads 1 π

ρ(ζ )d 2 ζ + ∂z U (z) = 0 . z−ζ

1 π

ρ(z)d 2 z =

(4.13)

The solution is well known and easy to obtain: the extremal density ρ0 (z) is constant in some domain D (the support of eigenvalues which can be a union of disconnected domains Dα ) and zero otherwise. More precisely, 1 z∈D ρ0 (z) = . 0 z ∈ C \ D ≡ Dc 3

One may have in mind the model of all complex or mutually hermitian conjugated or normal matrices.

Integrable Structure of the Dirichlet Boundary Problem

33

Note that the saddle point condition is imposed only for z inside the support of eigenvalues. Writing (cf. (3.57)) V (z) =

p

τk Bk (z) ,

k=1

multiplying Eq. (4.13) by Ak (z) and integrating it over the boundary of the support of eigenvalues, one finds that the domain D is such that the coefficients τk are moments of its complement with respect to the basis functions Ak and the higher moments (with numbers greater than p) vanish. As is proven in Sect. 3, these conditions, together with the normalization condition for the density, locally determine the shape of the support of p eigenvalues. Equivalently, one might parametrize the polynomial as V (z) = k=1 tk zk , −k then tk are moments of the C \ D with respect to the functions z . An interesting problem is to obtain, for a given value of t0 , necessary and sufficient conditions on the polynomial V for the support of eigenvalues to be a union of g + 1 disconnected domains (”droplets”) with non-zero filling. One may approach this problem from a “classical” limit of very small (point-like) droplets. Clearly, for our choice of the signs in (4.11) and (4.12), the stable point-like droplets are located at minima of −U (z), or equivalently at maxima of U (z). As soon as we use the basis which explicitly depends on the marked points zα ∈ Dα , it is natural to consider the germ configuration with point-like eigenvalue droplets at the points zα . It is easy to see that the sufficient conditions for the potential (4.12) to have maxima at the points zα are z¯ α = V (zα ),

|V (zα )| < 1

for all α .

(4.14)

The first one means that there is an extremum of the potential U (z) at the point zα . The second one ensures that eigenvalues of the matrix of second derivatives of the potential at the extremum are both negative, i.e. the extremum is actually a maximum of U (z). The first condition literally coincides with the one used in [22] for the completely degenerate curve (point-like droplets) while the second one requires that not all extrema are filled, so that the “smooth” genus g is always less than the maximal possible genus (p−1)2 −1. Let now V0 (z) be a minimal degree polynomial that obeys these conditions. Then perturbed polynomials of the form V (z) = V0 (z) + τk Bk (z) k≥g+2

obey the same conditions for sufficiently small τk , and this is the advantage of using the basis (3.10), (3.13) from the perspective of matrix models. The saddle point equation (4.13) just means that the “effective potential” π1 D log |z− ζ |2 d 2 ζ + U (z) is constant in each Dα , i.e. for z ∈ Dα it holds: 2 log |z − ζ |d 2 ζ + U (z) = v0 + α , (4.15) π D where v0 = π2 D log |z|d 2 z, and so in our normalization 0 = 0. Let Nα be the number of eigenvalues in Dα , then limN→∞ Nα = sα . First we find the free energy with some fixed sα : 1 lim 2 log ZN ρ0 (z) log |z − ζ |ρ0 (ζ )d 2 zd 2 ζ = 2 N→∞ π sα fixed 1 + ρ0 (z)U (z)d 2 z, π

34

I. Krichever, A. Marshakov, A. Zabrodin

where one should substitute (4.15). The result is g 1 lim 2 log ZN =− 2 log |z−1 − ζ −1 |d 2 zd 2 ζ + α sα N→∞ π D D sα fixed α=1

= F˜ (sα , τ ) .

(4.16)

This quantity depends on sα ’s. It is given by the value of the integrand in (4.11) at the saddle point with fixed sα . If one wants to take into account the “tunneling” of eigenvalues between different components of the support, sα are no longer free parameters, and the free energy F (0) of the “planar limit” of the matrix model should be obtained by extremizing F˜ with respect to sα (with fixed t0 ). It follows from the above that ∂sα F˜ = α and so the extremum is at α = 0. Therefore, F (0) = F , (4.17) α =0

where F is given by (2.35). The results of Sect. 3 imply that the growth of the support of eigenvalues, when N → N + δN and the tunneling is taken into account, is given by the normal displacement of the boundary proportional to the normal derivative of the Green function with the minus sign, like in (2.9). Since this quantity is always non-negative, we conclude that if a point belongs to the support of eigenvalues, it does so as t0 increases. In particular, if one starts with point-like droplets at the points zα , as is discussed above, then these points always remain inside the droplets. Let us note that different aspects of multi-support solutions of the 2-matrix model and matrix models with complex eigenvalues were discussed in [22, 26, 25]. For matrix models one usually restricts oneself to the algebraic case of a finite amount of nonvanishing moments playing the role of the coefficients of the matrix model potential (4.12). In such a case the corresponding complex curve can be described by an algebraic equation [22], which can be thought of as an auxiliary constraint to the second derivatives of F . These auxiliary constraints look similar to the reduction conditions in the case of Landau-Ginzburg topological theories (see, for example, discussion of such conditions in the context of dispersionless Hirota equations and WDVV equations in [7]). Their meaning is that the derivatives w.r.t. the times {τk } for k > g + 1 (where g is the genus of the corresponding algebraic complex curve) can be expressed through the derivatives restricted to k ≤ g + 1. 5. Green Function on the Schottky Double and Generalized Hirota Equations Let us now turn to the generalization of the dispersionless Hirota equations to the multiply-connected case. For a unified treatment of the two “dual” representations of the Dirichlet problem discussed in the two previous sections, we make a simple change of variables. Namely, let us introduce the generic “period” variables Xα which are identified either with α or 2πi sα depending on the choice of the set of cycles, and the function F(Xα , τ ) Xα equal to F (Xα , τ ) or F˜ ( 2πi , τ ) respectively. Then the main relations (3.45)–(3.47) and (4.4)–(4.6) acquire the form 1 G(a, b) = log |a −1 − b−1 | + ∇(a)∇(b)F , 2

(5.1)

Integrable Structure of the Dirichlet Boundary Problem

35

ωα (z) = − ∂α ∇(z)F ,

(5.2)

Tαβ = 2πi ∂α ∂β F ,

(5.3)

where ∂α := ∂Xα and G, ω and T stand for the corresponding objects with or without tilde, depending on the chosen basis of cycles. We will see below that, in analogy to the simply-conected case, any second order derivative of the function F w.r.t. τk (and τ¯k ), Fik , will be expressed through the derivatives {Fαβ }, where α, β = 0, . . . , g together with {Fατi } and their complex conjugated. To be more precise, one can consider all second derivatives as functions of {Fαβ , Fαk } modulo certain relations on the latter, like the relation (5.14) to be discussed below. Sometimes on this “small phase space” more extra constraints arise, which can be written in the form similar to the Hirota or WDVV equations [27]; we are not going to discuss this issue here, restricting ourselves to the generic situation. 5.1. The Abel map. To derive equations for the function F = F in Sect. 2, we used the representation (2.23) for the conformal map w(z) in terms of F and Eq. (1.2) relating the conformal map to the Green function which, in its turn, is expressed through the second order derivatives of F. In the multiply-connected case, our strategy is basically the same, with the suitable analog of the conformal map w(z) (or rather of log w(z)) being the embedding of Dc into the g-dimensional complex torus Jac, the Jacobi variety of the Schottky double. This embedding is given, up to an overall shift in Jac, by the Abel map z → W(z) := (W1 (z), . . . , Wg (z)), where z dWα (5.4) Wα (z) = ξ0

is the holomorphic part of the harmonic measure ωα . By virtue of (5.2), the Abel map is represented through the second order derivatives of the function F: z dWα = −∂α D(z)F , (5.5) Wα (z) − Wα (∞) = ∞

2 Re Wα (∞) = ωα (∞) = −∂τ0 ∂α F .

(5.6)

The last formula immediately follows from (5.2).

5.2. The Green function and the prime form. The Green function of the Dirichlet boundary problem, appearing in (5.1), can be written in terms of the prime form (A.4) on the Schottky double (cf. (1.2)): E(z, ζ ) . G(z, ζ ) = log (5.7) E(z, ζ¯ ) Here by ζ¯ we mean the (holomorphic) coordinate of the “mirror” point on the Schottky double, i.e. the “mirror” of ζ under the antiholomorphic involution. The pairs of such mirror points satisfy the condition Wα (ζ ) + Wα (ζ¯ ) = 0 in the Jacobian (i.e., the sum

36

I. Krichever, A. Marshakov, A. Zabrodin

should be zero modulo the lattice of periods). The prime form4 is written through the Riemann theta functions and the Abel map as follows: E(z, ζ ) =

θ∗ (W(z) − W(ζ )) h(z) h(ζ )

(5.8)

when the both points are on the upper sheet and E(z, ζ¯ ) =

θ∗ (W(z) + W(ζ ))

(5.9)

ih(z) h(ζ )

when z is on the upper sheet and ζ¯ is on the lower one (for other cases we define E(¯z, ζ¯ ) = E(z, ζ ), E(¯z, ζ ) = E(z, ζ¯ )). Here θ∗ (W) ≡ θδ ∗ (W|T ) is the Riemann theta function (A.2) with the period matrix Tαβ = 2πi ∂α ∂β F and any odd characteristics δ ∗ , and h2 (z) = −z2

g

θ∗,α (0)∂z Wα (z) = z2

α=1

g

θ∗,α (0)

α=1

Ak (z)∂α ∂τk F .

(5.10)

k≥1

Note that in the l.h.s. of (5.9) the bar means the reflection in the double while in the r.h.s. the bar means complex conjugation. The notation is consistent since the local coordinate in the lower sheet is just the complex conjugate one. However, one should remember that E(z, ζ¯ ) is not obtained from (5.8) by a simple substitution of the complex conjugated argument. On different sheets so defined prime “form” E is represented by different functions. In our normalization (5.9) iE(z, z¯ ) is real (see also Appendix B) and lim

E(z, ζ ) = 1. − ζ −1

ζ →z z−1

In particular, limz→∞ zE(z, ∞) = 1. 5.3. The prime form and the tau-function. In (5.7), the h-functions in the prime forms cancel, so the analog of (2.17) reads 2 θ∗ (W(z) − W(ζ )) 2 = log 1 − 1 + ∇(z)∇(ζ )F . log (5.11) z ζ θ∗ (W(z) + W(ζ )) This equation already explains the claim made in the beginning of this section. Indeed, the r.h.s. is the generating function for the derivatives Fik while the l.h.s. is expressed through derivatives of the form Fαk and Fαβ only. The expansion in powers of z, ζ allows one to express the former through the latter. The analogs of Eqs. (2.19), (2.20) are, respectively: θ∗ (W(z) − W(∞)) 2 = − log |z|2 + ∂τ ∇(z)F , log (5.12) 0 θ∗ (W(z) + W(∞)) Given a Riemann surface with local coordinates 1/z and 1/¯z we trivialize the bundle of − 21 -differentials and “redefine” the prime form E(z, ζ ) → E(z, ζ )(dz)1/2 (dζ )1/2 so that it becomes a function. However for different coordinate patches (the “upper” and “lower” sheets of the Schottky double) one gets different functions, see, for example, formulas (5.8) and (5.9) below. 4

Integrable Structure of the Dirichlet Boundary Problem

37

2 h (∞) 2 = ∂2 F . log τ0 θ (ω(∞)) ∗

(5.13)

Here ω(z) ≡ 2 Re W(z) = (ω1 (z), . . . , ωg (z)) and h2 (∞) = lim z θ∗ z→∞

z ∞

g dW = − θ∗,α (0)∂α ∂τ1 F . α=1

¯ A simple check shows that the l.h.s. of (5.13) can be written as −2 log(iE(∞, ∞)). ¯ + O(z−1 ) as As is seen from the expansion G(z, ∞) = − log |z| − log(iE(∞, ∞)) ¯ −1 is a natural analog of the conformal radius, and (5.13) indeed z → ∞, (iE(∞, ∞)) turns to (2.20) in the simply-connected case (see Appendix B for an explicit illustrative example). However, now it provides a nontrivial relation on Fαβ ’s and Fαi ’s: 

 2 θ∗,α ∂α ∂τ1 F  θ∗,β ∂β ∂τ¯1 F  = θ∗2 (ω(∞))e∂τ0 F (5.14) α

β

so that the “small phase space” contains the derivatives modulo this relation. The next steps are exactly the same as in Sect. 2: we are going to decompose these equalities into holomorphic and antiholomorphic parts. The results are conveniently written in terms of the prime form. The counterpart of (2.22) is E(ζ, z)E(∞, ζ¯ ) ζ log + D(z)∇(ζ )F . (5.15) = log 1 − z E(ζ, ∞)E(z, ζ¯ ) Tending ζ → ∞, we get: log

¯ E(z, ∞) ¯ − ∂τ0 D(z)F . = log z + log E(∞, ∞) E(z, ∞)

(5.16)

Separating holomorphic and antiholomorphic parts of (5.15) in ζ , we get analogs of (2.24) and (2.25): log

E(z, ζ ) = log(z − ζ ) + D(z)D(ζ )F E(z, ∞)E(∞, ζ ) − log

¯ E(z, ζ¯ )E(∞, ∞) ¯ ζ¯ )F . = D(z)D( ¯ E(z, ∞)E(∞, ζ¯ )

(5.17)

(5.18)

Combining these equalities (with merging points z → ζ in particular), one is able to obtain the following representations of the prime form itself: 1 2 E(z, ζ ) = (z−1 − ζ −1 )e− 2 (D(z)−D(ζ )) F ,

(5.19)

1 ¯ ¯ 2 iE(z, ζ¯ ) = e− 2 (∂τ0 +D(z)+D(ζ )) F .

(5.20)

Note also the nice formula 1

iE(z, z¯ ) = e− 2 ∇

2 (z)F

.

(5.21)

38

I. Krichever, A. Marshakov, A. Zabrodin

5.4. Generalized Hirota relations. For higher genus Riemann surfaces there are no simple universal relations connecting values of prime forms at different points, which, via (5.19), (5.20), could be used to generate equations on F. The best available relation [9] is the celebrated Fay identity (A.5). Although it contains not only prime forms but Riemann theta functions themselves, it is really a source of closed equations on F, since all the ingredients are in fact representable in terms of second order derivatives of F in different variables. An analog of the KP version of the Hirota equation (2.26) for the function F can be obtained by plugging Eqs. (5.5) and (5.19) into the Fay identity (A.5). As a result, one obtains a closed equation which contains second order derivatives of the F only (recall that the period matrix in the theta-functions is essentially the matrix of the derivatives Fαβ ). A few equivalent forms of this equation are available. First, shifting Z → Z − W3 + W4 in (A.5) and putting z4 = ∞, one gets the relation a c b D(a)D(b)F (a − b)e θ dW + dW − Z θ dW − Z ∞

+ (b − c)eD(b)D(c)F θ

+ (c − a)eD(c)D(a)F θ

∞

b ∞ c

∞

dW +

∞ a

dW +

∞

c

∞

dW − Z θ dW − Z θ

a ∞ b

∞

dW − Z

dW − Z

= 0 . (5.22)

The vector Z is arbitrary (in particular, zero). We see that (2.26) gets “dressed” by the theta-factors. Each theta-factor is expressed through F only. For example,   z 2 dW = exp −2π 2 nα nβ ∂αβ F − 2π i nα ∂α D(z)F  . θ ∞

nα ∈Z

α

αβ

Another form of this equation, obtained from (5.22) for a particular choice of Z, reads a −1 21 (D(a)+D(b))2 F h(c)θ∗ dW (a − b)c e +

b

∞

∞

dW + [cyclic per-s of a, b, c] = 0 .

(5.23)

Taking the limit c → ∞ in (5.22), one gets an analog of (2.27): a b θ ( ∞ dW + ∞ dW − Z) θ (Z) D(a)D(b)F 1− e a b θ ( ∞ dW − Z) θ( ∞ dW − Z) a g θ ( ∞ dW − Z) D(a) − D(b) 1 ∂ ∂τ1 F + = log b ∂α ∂τ1 F , (5.24) a−b a−b ∂Zα θ( W − Z) α=1

∞

which also follows from another Fay identity (A.6). Equations on F with τ¯k -derivatives follow from the general Fay identity (A.5) with some points on the lower sheet. Besides, many other equations can be derived as various combinations and specializations of the ones mentioned above. Altogether, they form an infinite hierarchy of consistent differential equations of a very complicated structure which deserves further investigation. The functions F corresponding to different choices of independent variables (i.e., to different bases in homology cycles on the Schottky double) provide different solutions to this hierarchy.

Integrable Structure of the Dirichlet Boundary Problem

39

5.5. Higher genus analogs of the dispersionless Toda equation. Let us show how the simplest equation of the hierarchy, the dispersionless Toda equation (2.30), is modified in the multiply-connected case. Applying ∂z ∂ζ¯ to both sides of (5.18) and setting ζ = z, we get: ¯ z))F = −∂z ∂z¯ log E(z, z¯ ) . (∂D(z))(∂¯ D(¯

Here ∂D(z) is the z-derivative of the operator D(z): ∂D(z) = k Ak (z)∂τk . To transform the r.h.s., we use the identity (A.8) (Appendix A) and specialize it to the particular local parameters on the two sheets: θ (ω(z) + Z)θ (ω(z) − Z) θ 2 (Z)E 2 (z, z¯ ) +|z|4 (log θ (Z)), αβ ∂z Wα (z)∂z¯ Wβ (z) .

|z|4 ∂z ∂z¯ log E(z, z¯ ) =

α,β

Tending z to ∞, we obtain a family of equations (parametrized by an arbitrary vector Z) which generalize the dispersionless Toda equation for the tau-function: ∂τ1 ∂τ¯1 F =

θ (ω(∞)+Z)θ (ω(∞)−Z) ∂τ2 F e 0 θ 2 (Z) g − (log θ (Z)), αβ (∂α ∂τ1 F)(∂β ∂τ¯1 F).

(5.25)

α,β=1

(Here we used the z → ∞ limits of (5.5) and (5.21).) The following two equations correspond to special choices of the vector Z: ∂τ1 ∂τ¯1 F +

g

(log θ (0)), αβ (∂α ∂τ1 F)(∂β ∂τ¯1 F) =

α,β=1

∂τ1 ∂τ¯1 F = −

g

log θ∗ (ω(∞))

,αβ

θ 2 (ω(∞)) ∂τ2 F e 0 , θ 2 (0)

(∂α ∂τ1 F)(∂β ∂τ¯1 F) .

(5.26)

(5.27)

α,β=1

Finally, let us specify Eq. (5.25) for the genus g = 1 case. In this case there is only one extra variable X := X1 (1 or 2πis1 ). The is

Riemann theta-function

θ(ω(∞) + Z) then replaced by the Jacobi theta-function ϑ ∂X ∂τ0 F − Z T ≡ ϑ3 ∂X ∂τ0 F − Z T , where the elliptic modular parameter is T = 2πi ∂X2 F, and the vector Z ≡ Z has only one component. The equation has the form:

ϑ3 ∂X ∂τ0 F +Z 2πi ∂X2 F ϑ3 ∂X ∂τ0 F −Z 2π i ∂X2 F ∂τ2 F

∂τ1 ∂τ¯1 F = e 0 ϑ32 Z| 2πi ∂X2 F (5.28) − ∂Z2 log ϑ3 Z| 2πi ∂X2 F (∂X ∂τ1 F)(∂X ∂τ¯1 F) . Note also that Eq. (5.14) acquires the form

2 2 ϑ1 ∂X ∂τ0 F 2π i ∂X2 F

(∂X ∂τ1 F)(∂X ∂τ¯1 F) = e ∂τ 0 F , 2 ϑ1 0| 2πi ∂X F

(5.29)

40

I. Krichever, A. Marshakov, A. Zabrodin

where ϑ∗ ≡ ϑ1 is the only odd Jacobi theta-function. Combining (5.28) and (5.29) one may also write the equation

ϑ3 ∂X ∂τ0 F +Z 2πi ∂X2 F ϑ3 ∂X ∂τ0 F −Z 2π i ∂X2 F

∂τ1 ∂τ¯1 F = ϑ32 Z| 2πi ∂X2 F 

2 2 ϑ1 ∂X ∂τ0 F 2πi ∂X2 F

∂Z2 log ϑ3 Z| 2π i ∂X2 F  e∂τ0 F (5.30) − 2 ϑ1 0| 2πi ∂X F whose form is close to (2.30) but differs by the nontrivial “coefficient” in the square brackets. In the limit T → i∞ the theta-function ϑ3 tends to unity, and we obtain the dispersionless Toda equation (2.30). 6. Conclusions In this paper we have considered the Dirichlet boundary problem in planar multiplyconnected domains. A planar multiply-connected domain Dc is the complex plane with several holes. We study how the solution of the Dirichlet problem depends on small deformations of boundaries of the holes. General properties of such deformations allow us to introduce the quasiclassical tau-function associated to the variety of planar multiply-connected domains. By the tau-function, we actually mean its logarithm, which only makes sense for the quasiclassical or Whitham-type integrable hierarchies. Namely, the key properties are the specific “exchange” relations (3.43) which follow from the Hadamard variational formula for the Green function and the harmonic measure. They have the form of integrability conditions and thus ensure the existence of the tau-function. The tau-function corresponds to a particular solution of the universal Whitham hierarchy [4] and generalizes the dispersionless tau-function which describes deformations of simply-connected domains. The algebro-geometric data associated with the multiply-connected geometry include a Riemann surface with antiholomorphic involution, the Schottky double of the domain Dc = C\D endowed with particular holomorphic coordinates z and z¯ on the two sheets of the double, respecting the involution. This Riemann surface has genus g = #{holes} − 1. The (logarithm of) tau-function, F, describes small deformations of these data as functions of an infinite set of independent deformation parameters which are basically harmonic moments of the domain. These variables can be equivalently redefined as periods of the generating one-form S(z)dz over non-trivial cycles on the double and the residues of the one-forms Ak (z)S(z)dz, where Ak (z) z−k is some proper global basis (3.10) z→∞

of harmonic functions. We have obtained simple expressions for the period matrix, the Abel map and the prime form on the Schottky double in terms of the function F. Specifically, all these objects are expressed through second order derivatives of the F in its independent variables. The generalized dispersionless Hirota equations on F for the multiply-connected case (equivalent to the Whitham hierarchy) are obtained by incorporating the above mentioned expressions into the Fay identities. As a result, one comes to a series of quite non-trivial equations for (second derivatives of) the function F , which have not been written before (except for certain relations for the second derivatives of the Seiberg-Witten prepotential [28]). When the Riemann surface degenerates to the Riemann

Integrable Structure of the Dirichlet Boundary Problem

41

sphere with two marked points, they turn into Hirota equations of the dispersionless Toda hierarchy. Algebraic orbits of the universal Whitham hierarchy describe the class of domains which can be obtained as conformal images of a “half” of a complex algebraic curve with the antiholomorphic involution under conformal maps given by rational functions on the curve. In particular, all domains having only a finite amount of non-vanishing harmonic moments, are in this class. In this case one can define the curve by a polynomial equation written explicitly in [22]. This situation is an analog of the (Laurent) polynomial conformal maps in the simply-connected case and literally corresponds to multi-support solutions of matrix models with polynomial potentials. The definition of the tau-function for multiply-connected domains proposed above holds in a broader set-up of general algebraic domains. It does not rely on the finiteness of the amount of non-vanishing moments. In general any effective way to describe the complex curve associated to a multiply-connected domain by a system of polynomial or algebraic equations is not known. The curve may be thought of as a spectral curve corresponding to a generic finite-gap solution to the Toda lattice hierarchy. Acknowledgement. We are indebted to V.Kazakov, M.Mineev-Weinstein, S.Natanzon, L.Takhtajan and P.Wiegmann for illuminating discussions and especially to A.Levin for very important comments on Sect. 5. We are also grateful to the referee for a very careful reading of the manuscript and valuable remarks. The work was also partially supported by NSF grant DMS-01-04621 (I.K.), RFBR under the grant 01-01-00539 (A.M. and A.Z.), by INTAS under the grants 00-00561 (A.M.), 99-0590 (A.Z.) and by the Program of support of scientific schools under the grants 1578.2003.2 (A.M.), 1999.2003.2 (A.Z.). The work of A.Z. was also partially supported by the NATO grant PST.CLG.978817. A.M. is grateful for the hospitality to the Max Planck Institute of Mathematics in Bonn, where an essential part of this work has been done.

Appendix A. Theta Functions and Fay Identities Here we present some definitions and useful formulas from [9]. The Riemann theta function θ (W) ≡ θ (W|T ) is defined as θ (W) = eiπn·T ·n + 2πin·W . (A.1) n∈Zg

The theta function with (half-integer) characteristics δ = (δ 1 , δ 2 ), where δα = Tαβ δ1,β + δ2,α and δ 1 , δ 2 ∈ 21 Zg is θδ (W) = eiπ δ 1 ·T ·δ 1 +2πi δ 1 ·(W+δ 2 ) θ(W + δ) = eiπ(n+δ 1 )·T ·(n+δ 1 )+2πi(n+δ 1 )·(W+δ 2 ) .

(A.2)

n∈Zg

Under shifts by a period of the lattice, it transforms according to θδ (W + eα ) = e2πiδ1,α θδ (W) ,

θδ (W + Tαβ eβ ) = e−2πiδ2,α −iπTαα −2πiWα θδ (W) .

(A.3)

The prime form E(z, ζ ) is defined as θ∗ (W(z) − W(ζ )) , α θ∗,α dWα (z) β θ∗,β dWβ (ζ )

E(z, ζ ) =

(A.4)

42

I. Krichever, A. Marshakov, A. Zabrodin

where θ∗ is any odd theta function, i.e., the theta function with any odd characteristic δ ∗ (the characteristics is odd if 4δ ∗1 · δ ∗2 = odd). The prime form does not depend on the particular choice of the odd characteristics. In the denominator, ∂θ∗ (W) θ∗,α = θ∗,α (0) = ∂Wα W=0 is the set of θ-constants. The data we use in the main text contain also distinguished coordinates on a Riemann surface: the holomorphic co-ordinates z and z¯ on two different sheets, and we do not distinguish, unless it is necessary between the prime form (A.4) and a function E(z, ζ ) ≡ E(z, ζ )(dz)1/2 (dζ )1/2 “normalized” onto the differentials of a distinguished co-ordinate. Let us now list the Fay identities [9] used in the paper. The basic one is Fay’s trisecant formula (Eq. (45) from p. 34 of [9]) θ (W1 − W3 − Z) θ(W2 − W4 − Z) E(z1 , z4 )E(z3 , z2 ) + θ (W1 − W4 − Z) θ(W2 − W3 − Z) E(z1 , z3 )E(z2 , z4 ) = θ (W1 + W2 − W3 − W4 − Z) θ(Z) E(z1 , z2 )E(z3 , z4 ) .

(A.5)

Here Wi ≡ W(zi ). This identity holds for any four points z1 , . . . , z4 on a Riemann surface and any vector Z ∈ Jac. In the limit z3 → z4 ≡ ∞ one gets (formula (38) from p. 25 of [9]) z z θ ( ∞1 dW + ∞2 dW − Z) θ(Z) E(z1 , z2 ) z1 z2 θ ( ∞ dW − Z) θ( ∞ dW − Z) E(z1 , ∞)E(z2 , ∞) z g θ( ∞1 dW − Z) (z1 ,z2 ) = d (∞) + dWα (∞) ∂Zα log z2 , (A.6) θ( ∞ dW − Z) α=1 where d (z1 ,z2 ) (∞) = dz log

E(z, z1 ) E(z, z2 )

(A.7)

is the normalized Abelian differential of the third kind with simple poles at z1 and z2 and residues ±1. Another relation from [9] we use (see e.g. (29) on p.20 and (39) on p.26) is θ (W1 − W2 − Z)θ (W1 − W2 + Z) θ 2 (Z)E 2 (z1 , z2 ) g = ω(z1 , z2 ) + (log θ (Z)),αβ dWα (z1 )dWβ (z2 ),

(A.8)

α,β=1

where (log θ (Z)), αβ =

∂ 2 log θ (Z) ∂Zα ∂Zβ

and ω(z1 , z2 ) = dz1 dz2 log E(z1 , z2 )

(A.9)

is the canonical bi-differential of the second kind with the double pole at z1 = z2 .

Integrable Structure of the Dirichlet Boundary Problem

43

Appendix B. Degenerate Schottky Double For an illustrative purpose we would like to adopt some of the above formulas to the simplest possible case, which is the Riemann sphere realized as the Schottky double of the complement to the disk of radius r. In this case dW =

dz d z¯ = idϕ = − z z¯

(B.1)

is purely imaginary on the circle and obviously satisfies the condition dW (z)+dW (¯z) = 0. Further, (cf. (3.65)) z z z W (z) = dW = dW = log (B.2) r ξ0 r and z (B.3) r which is nothing but the conformal map of the exterior of the circle |z| ≥ r onto the exterior of the unit circle |w| ≥ 1. Note that on the “lower” sheet of the double z¯ r dW = log , (B.4) W (¯z) = z¯ r w(z) = eW (z) =

and instead of (B.3) one gets w(¯z) =

r z¯

(B.5)

which is the conformal map of the exterior of the disk |¯z| ≥ r on the lower sheet onto the interior of the unit circle |w| ≤ 1. The prime form on the genus zero Riemann surface is (cf. (A.4)) w1 − w2 E(z1 , z2 ) = √ , √ dw1 dw2

(B.6)

¯ which is understood in the main text, where wi ≡ w(zi ). Let us compute E(∞, ∞), as “normalized” on the values of the local coordinates z∞ = 1/z and z¯ ∞ = 1/¯z in the ¯ on two sheets of the double. One gets (cf. (5.9)) points ∞ and ∞ ¯ =− E 2 (∞, ∞)

¯ 2 (w(∞) − w(∞)) . ¯ (dw(∞)/dz∞ )(dw(∞)/dz ¯ ) ∞

(B.7)

Substituting into (B.7) the formulas (B.3), (B.5) and dz z2 = − lim dz∞ , z→∞ r z→∞ r ¯ = r dz∞ dw(∞) ¯ ,

dw(∞) = lim

(B.8)

one finally gets 1 z2 /r 2 = 2, z→∞ (z2 /r)r r

¯ 2 = lim E(∞, ∞)

and this demonstrates that (5.14) indeed turns into (2.20) in the limit.

(B.9)

44

I. Krichever, A. Marshakov, A. Zabrodin

References 1. Hurwitz, A., Courant, R.: Vorlesungen u¨ ber allgemeine Funktionentheorie und elliptische Funktionen. Herausgegeben und erg¨anzt durch einen Abschnitt u¨ ber geometrische Funktionentheorie. Berlin-Heidelberg-New York: Springer-Verlag, 1964 (Russian translation, adapted by M.A. Evgrafov: Theory of functions, Moscow: Nauka, 1968) 2. Marshakov, A., Wiegmann, P., Zabrodin, A.: Commun. Math. Phys. 227, 131 (2002) 3. Krichever, I.M.: Funct. Anal. Appl. 22, 200–213 (1989) 4. Krichever, I.M.: Commun. Pure. Appl. Math. 47, 437 (1994) 5. Mineev-Weinstein, M., Wiegmann, P.B., Zabrodin, A.: Phys. Rev. Lett. 84, 5106 (2000) 6. Wiegmann, P.B., Zabrodin, A.: Commun. Math. Phys. 213, 523 (2000) 7. Boyarsky, A., Marshakov, A., Ruchayskiy, O., Wiegmann, P., Zabrodin, A.: Phys. Lett. B515, 483– 492 (2001) 8. Krichever, I., Novikov, S.: Funct. Anal. Appl. 21, No 2, 46–63 (1987) 9. Fay, J.D.:“Theta Functions on Riemann Surfaces”. Lect. Notes in Mathematics 352, Berlin-Heidelberg-New York: Springer-Verlag, 1973 10. Hadamard, J.: M´em. pr´esent´es par divers savants a` l’Acad. sci., 33, (1908) 11. Davis, P.J.: The Schwarz function and its applications. The Carus Mathematical Monographs, No. 17, Buffalo, N.Y.: The Math. Association of America, 1974 12. Krichever, I.: 2000, unpublished 13. Takhtajan, L.: Lett. Math. Phys. 56, 181–228 (2001) 14. Kostov, I.K., Krichever, I.M., Mineev-Weinstein, M., Wiegmann, P.B., Zabrodin, A.: τ -function for analytic curves. In: Random matrices and their applications, MSRI publications, 40, Cambridge: Cambridge Academic Press, 2001 15. Hille, E.: Analytic function theory. V.II, Oxford: Ginn and Company, 1962 16. Gibbons, J., Kodama, Y.: Proceedings of NATO ASI “Singular Limits of Dispersive Waves”. ed. N. Ercolani, London – New York: Plenum, 1994; Carroll, R., Kodama, Y.: J. Phys. A: Math. Gen. A28, 6373 (1995) 17. Takasaki, K., Takebe, T.: Rev. Math. Phys. 7, 743–808 (1995) 18. Schiffer, M., Spencer, D.C.: Functionals of finite Riemann surfaces. Princeton, NJ: Princeton University Press, 1954 19. Gustafsson, B.: Acta Applicandae Mathematicae 1, 209–240 (1983) 20. Aharonov, D., Shapiro, H.: J. Anal. Math. 30, 39–73 (1976); Shapiro, H.: The Schwarz function and its generalization to higher dimensions. University of Arkansas Lecture Notes in the Mathematical Sciences, Volume 9, W.H. Summers, Series Editor, New York: A Wiley-Interscience Publication, John Wiley and Sons, 1992 21. Etingof, P., Varchenko, A.: Why does the boundary of a round drop becomes a curve of order four? University Lecture Series. 3, Providence, RI: American Mathematical Society, 1992 22. Kazakov, V., Marshakov, A.: J. Phys. A: Math. Gen. 36, 3107–3136 (2003) 23. De Wit, B., Marshakov, A.: Theor. Math. Phys. 129, 1504 (2001) [Teor. Mat. Fiz. 129, 230 (2001)] 24. Gakhov, F.: The boundary value problems. Moscow: Nauka, 1977 (in Russian) 25. Eynard, B.: Large N expansion of the 2-matrix model, multicut case. http://arxiv:org/list/mathph/0307052, 2003 26. Bertola, M.: Free energy of the two-matrix model/dToda tau-function. Nucl. Phys. B669, 435–461 (2003) 27. Marshakov, A., Mironov, A., Morozov, A.: Phys. Lett. B389, 43–52 (1996); Braden, H., Marshakov, A.: Phys. Lett. B541, 376–383 (2002) 28. Gorsky, A., Marshakov, A., Mironov, A., Morozov, A.: Nucl. Phys. B527, 690–716 (1998) Communicated by L. Takhtajan

Commun. Math. Phys. 259, 45–69 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1378-6

Communications in

Mathematical Physics

Massless D-Branes on Calabi–Yau Threefolds and Monodromy Paul S. Aspinwall1 , R. Paul Horja2 , Robert L. Karp1 1

Center for Geometry and Theoretical Physics, Box 90318, Duke University, Durham, NC 27708-0318, USA 2 Department of Mathematics, University of Michigan, East Hall, 525 E University Avenue, Ann Arbor, MI 48109-1109, USA Received: 15 December 2003 / Accepted: 17 February 2004 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: We analyze the link between the occurrence of massless B-type D-branes for specific values of moduli and monodromy around such points in the moduli space. This allows us to propose a classification of all massless B-type D-branes at any point in the moduli space of Calabi–Yau’s. This classification then justifies a previous conjecture due to Horja for the general form of monodromy. Our analysis is based on using monodromies around points in moduli space where a single D-brane becomes massless to generate monodromies around points where an infinite number become massless. We discuss the various possibilities within the classification.

1. Introduction The derived category approach to B-type D-Branes [1–5] appears to be extremely powerful. It allows one to go beyond the picture of D-branes as vector bundles over submanifolds so that α -corrections can be correctly understood. For example, the fact that B-type D-branes must undergo monodromy as one moves about the moduli space of complexified K¨ahler forms can be expressed in the derived category language [6–8]. The main purpose of this paper is to try to classify which D-branes can become massless at a given point in the moduli space. Again the language of derived categories will be invaluable. In order for an object in the bounded derived category of coherent sheaves to represent a D-brane it must be “-stable”. Criteria for -stability have been discussed in [9–12] although it is not clear that we yet have a mathematically rigorous algorithm for determining stability. Despite this, in simple examples such as in the above references and [13] one can compute stability with a fair degree of confidence. In particular if you have reason to believe that a certain set of a D-branes is stable at a given point in the moduli space then one can move along a path in moduli space and see how the spectrum of stable states changes. There is considerable evidence [12] that such changes

46

P.S. Aspinwall, R.P. Horja, R.L. Karp

in -stability depend only on the homotopy class of the path in the moduli space of conformal field theories. The fact that changes in -stability do depend on the homotopy class of such paths was used in [12] to “derive” Kontsevich’s picture of monodromy at least in the case of the quintic Calabi–Yau threefold. The moduli space of conformal field theories may be compactified by including the “discriminant locus” consisting of badly-behaved worldsheet theories. Typically one expects such theories to be bad because some D-brane has become massless [14]. Indeed, the monodromy seen in [2, 13] around parts of this discriminant locus was intimately associated to massless D-branes. It is this link between massless D-branes and monodromy that we wish to study more deeply in this paper. In simple cases as one approaches a point in the discriminant locus, a single D-brane becomes massless. Of more interest to us is the case where an infinite number become massless. In [7,8] one of the authors studied components of the discriminant locus corresponding to what was called “EZ-transformations”. Namely if one has a Calabi–Yau threefold X with some complex subspace E, there may be a point in K¨ahler moduli space where E collapses to a complex subspace Z of lower dimension than E. We will see that it is then the derived category of Z that describes the massless D-branes associated to this transformation. A particular autoequivalence was naturally associated to a particular EZ-transformation and it was conjectured in [7,8] that such an autoequivalence resulted from the associated monodromy. We will call this conjecture the “EZ-monodromy conjecture”. One purpose of this paper is to justify this conjecture. Because of the nature of our understanding of D-branes and string theory it will not be possible to rigorously prove any hard theorems about D-branes. Instead we will have to play with a number of conjectures whose interdependence leads to considerable evidence of the validity of the overall story. In particular, on the one hand we have the EZ-monodromy conjecture and, on the other hand, we have our conjecture concerning which D-branes become massless. These two conjectures are interlinked by -stability as we discuss in Sect. 2. In particular, in Sect. 2.1 we discuss an older conjecture concerning single massless D-branes. In Sect. 2.2 we then review a framework for the more general case which is linked to the simpler case in Sect. 2.3 for a particular example. The physical interpretation of the general case is then given in Sect. 2.4. The link discussed in Sect. 2.3 between the simple case of a single D-brane becoming massless and an infinite number becoming massless depends upon a mathematical result which is derived in Sect. 3. This section is more technical than the other sections and may be omitted by the reader if need be. That said, it shows how the sophisticated methods of derived categories are directly relevant to the physics of D-branes. In Sect. 4 we discuss a natural hierarchy of cases. The familiar “conifold”-like situation arises where Z is a point and only one soliton becomes massless. If Z has dimension one then the derived category of Z has more structure. This case corresponds to the Seiberg–Witten theory of some nonabelian gauge group. We study an explicit example of this elsewhere [15]. The case where Z has complex dimension two is more complicated as the derived category now has a rich structure. We show that it appears to be similar to the spectrum of massless D-branes one gets from a decompactification. We also see that it demonstrates how 2-branes wrapped around a 2-torus can become massless. At first sight this appears to contradict T-duality but we will see that this is not actually the case. Finally, for completeness, in Sect. 4.4 we discuss the case of an exoflop which is awkward to fit into our general classification but still yields a simple result.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

47

2. Monodromy and Massless D-Branes 2.1. A single massless D-brane. B-type D-branes on X correspond to objects in the bounded derived category of coherent sheaves on X [1–5]. A given object A is represented by a complex. We may then construct another object A[n] by shifting this complex n places to the left. Such a shift or “translation” is a global symmetry of physics if it is applied simultaneously to all objects [2]. Relative shifts are significant — an open string stretched between A and B is not equivalent to an open string stretched between A[n] and B if n = 0. We would like to consider the case of moving to a point in moduli space where a single physical D-brane A becomes massless. Because of the global shift symmetry all of its translates A[n] are equally massless. Thus an infinite number of objects in D(X) are becoming massless even though only one D-brane counts towards any physical effects of this masslessness as it would be computed by Strominger [14] for example. The analysis of -stability in [12, 13] showed that monodromy is intimately associated to massless D-branes. This should not be surprising since monodromy can only occur around the discriminant and the discriminant is associated with singularities in the conformal field theory associated with massless solitons [14]. Consider an oriented open string f stretched between two D-branes in a Calabi–Yau threefold X. In the derived category language this is written as a morphism between two objects in D(X), f : A → B.

(1)

These two objects may or may not form a bound state according to the mass of the open string f . If f is tachyonic then we have a bound state a` la Sen [16]. (As we emphasize shortly A is really an anti-brane in such a bound state.) A real number1 (dubbed a “grade” in [2]) ϕ is associated to each stable D-brane. We assume ϕ varies continuously over the moduli space and is defined mod 2 by the central charge Z: ϕ=−

1 arg(Z) π

(mod 2).

(2)

The precise definition of ϕ is discussed at length in [12]. In [2] it was argued that the mass squared of the open string in (1) is then proportional to ϕ(B) − ϕ(A) − 1 allowing the stability of this bound state to be determined. One of the key features of the derived category which makes it so useful for the study of solitons is the way that bound states are described using distinguished triangles. The open string f between A and B is best represented in the context of a distinguished triangle

A

[1]

C _@ @@ @@ @@ f / B.

(3)

1 It has been suggested that ϕ is defined modulo some integer such as 6 [2, 17]. Periodicity can also appear, if desired, in Floer cohomology (see [18, 19] for example) which is supposedly mirror to the structure we are considering. For simplicity we ignore such a possibility. To take such an effect into account one should probably quotient the derived category by such translations.

48

P.S. Aspinwall, R.P. Horja, R.L. Karp

The “[1]” represents the fact that one must shift one place left when performing the corresponding map. The object C, which is equivalent to the “mapping cone” Cone(f : A → B), is then potentially a bound state of A[1] and B. As explained in [2], A[odd] should be thought of as an anti-A. The triangle also tells us that B is potentially a bound state of A and C. Equally A is a bound state of B and C[−1]. The “[1]” could be interpreted as keeping track of which brane should be treated as an anti-brane. The fact that D(X) copes so well with anti-branes demonstrates its power to analyze D-branes. The other approach, namely K-theory, should be considered the derived category’s weaker cousin since it only knows about D-brane charge! Now suppose that A is stable and becomes massless at a particular point P in the moduli space. Furthermore, let us assume that the only massless D-branes at P are of the form A[m] for any m. Let us take a generic complex plane with polar coordinates (r, θ ) passing through P at the origin and assume that Z(A) behaves as cr exp(−iθ ) near P for c some real and positive constant. That is, we assume that Z(A) has a simple zero at P . Suppose B does not have vanishing mass. It follows that Z(B) and Z(C) are equal at P and nonzero. In particular if we circle the point P by varying θ , these central charges will be constant close to P . Furthermore, if C can be a marginally bound state of anti-A and B near P , then, according to the rules of [12], we have ϕ(B) = ϕ(C) near P . This allows us to rewrite (3) including the differences in the ϕ’s for the open strings (i.e., sides of the triangle) to give 1+a−b+ πθ [1]

A

C _@ @@ @@0 @@ f / B,

(4)

b−a− πθ

where ϕ(B) = b and ϕ(A) = a at θ = 0. The stability of a given vertex of this triangle depends upon the number on the opposite side being less than 1. By “stability” we mean relative to this triangle only. A given D-brane may decay by other channels. It follows that C becomes stable for θ > π(b − a − 1) while B becomes unstable for θ > π(b − a). Note that A is always stable near P consistent with our assumptions. Based on this idea that we “gain” C and “lose” B as θ increases, we can try to formulate a picture for monodromy around P . The meaning of monodromy is that after traversing this loop in the moduli space we should be able to relabel the D-branes in such a way as to restore the physics we had before we traversed the loop. It is important to note that monodromy is not really the statement that a certain D-brane manifestly “becomes” another D-brane explicitly as we move through the moduli space. It is much more accurately described as a relabeling process. Since stability is a physical quality, we are forced to relabel B since it has decayed. The obvious candidate in the above case is to call it C. Thus monodromy would transform B into C. Life can be more complicated than this however. If we have an open string f : A → C, then, since ϕ(B) = ϕ(C) when B decays to C + A[1], C will immediately decay further to D = Cone(f : A → C) plus another A[1]. Suppose A is “spherical” in the sense of [20] which means Hom(A, A[m]) = C for m = 0 or 3, and Hom(A, A[m]) = 0 otherwise. This condition is always satisfied in the context of this subsection — i.e., only A and its translates become massless. A long exact sequence associated to (3) then implies dim Hom(A, C) = dim Hom(A, B) − 1.

(5)

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

49

It follows that this second decay will occur if dim Hom(A, B) > 1. Iterating this process one sees that B will decay splitting off an A[1] a total of dim Hom(A, B) times. Finally we should also worry about homomorphisms between B and A[m] for other values of m. We refer to the example in Sect. 4 of [13] for a detailed example of exactly how this happens in a fairly nontrivial example. All said, allowing for all these decays, C becomes a number of A’s (probably shifted) together with Cone . . . ⊕ Ab0 ⊕ A[−1]b1 ⊕ A[−2]b2 ⊕ . . . → B , (6) where bn = dim Hom(A[−n], B) = dim Extn (A, B).

(7)

The cone (6) may be written more compactly as KA (B) = Cone(hom(A, B) ⊗ A → B),

(8)

where hom(A, B) is the complex of C-vector spaces 0

0

0

0

. . . → Ext0 (A, B) → Ext1 (A, B) → Ext2 (A, B) → . . . .

(9)

We refer to [20] for further explanation of the notation.2 We can also write more heuristically   KA (B) = Cone 

As much massless stuff that can bind to B as possible.

→ B .

(10)

Interpreted na¨ıvely, we have shown that, upon increasing θ from −∞ to +∞, an object B will decay and a canonically associated object KA (B) will become stable and appear as one of the decay products of B. What is desired however is monodromy once around P , i.e., θ should only increase by 2π. We will indeed claim that monodromy once around P replaces B by KA (B). For some objects, increasing θ by only 2π (at the appropriate starting point) will cause the complete decay of B into KA (B). Thanks to its rather simple cohomology, this always happens for Ox , the structure sheaf of a point x ∈ X. Therefore the relabeling process under monodromy should replace B by KA (B). There are undoubtedly many other objects B under which this increase in θ by only 2π would not induce the entire decay to KA (B ). This doesn’t matter however, we can still leave physics invariant by relabeling B by KA (B ). For example in an extreme case, both B and KA (B ) may be stable with respect to the above triangle both before and after increasing θ by 2π . It is therefore harmless to relabel one of these states as the other. Well, it is fine saying that it is harmless to relabel B by KA (B ), but why are we forced to relabel like this? The reason is that we know that physics must be completely invariant under monodromy which implies that the relabeling must amount to an autoequivalence 2 Note that since “left-derived” L’s or “right-derived” R’s should be added to every functor in this paper, we may consistently omit them without introducing any ambiguities!

50

P.S. Aspinwall, R.P. Horja, R.L. Karp

of D(X). One can indeed show that KA defines an autoequivalence3 of D(X) so long as A is spherical [8, 20]. What’s more, as argued in [12, 21], it is pretty well the only autoequivalence that works. To be more precise, once we have argued that the specific objects Ox undergo monodromy given by KA then all other objects must undergo the same monodromy up to some possible multiplication by some fixed line bundle L. Note that the central charge Z is also a physical quantity. Insisting that monodromy acts correctly in this case amounts to insisting that the D-brane “charges” ch(B) transform under monodromy. This is precisely the same monodromy on H even (X, Z) that one deduces from mirror symmetry as in [22]. This determines that the above line bundle L is trivial (in a considerably overdetermined way!). It is known (see [7] for example) that KA then induces the correct transformation on these charges — indeed this was the reason why KA was conjectured as the monodromy action in the first place [6]! It is worth noting that in some special cases the transformation KA has nothing to do with decay. Consider how the spherical object A itself transforms: Cone(hom(A, A) ⊗ A → A) = Cone((C → 0 → 0 → C) ⊗ A → A) = Cone((A → 0 → 0 → A) → A) ∼ =0→0→A = A[−2],

(11)

where we use the convention of [4] by underlining the zero position when necessary. Such a transformation cannot be argued from -stability however. Clearly an open string between A and itself (perhaps translated) cannot have a mass that depends upon some angle as we orbit the conifold point as clearly the mass is constant. Instead one could argue that the transform (11) occurs simply because Z(A) has a simple zero at the conifold point and thus ϕ(A) shifts by −2 as we loop around the conifold point. Then we can apply the rule ϕ(A[n]) = ϕ(A) + n from [12].4 We must therefore view (8) as being motivated by -stability for most but not all of the objects in D(X). Note that the fact that the obvious physical requirement that monodromy be an autoequivalence of D(X) can force (8) to be the required transform for all the objects in D(X) once -stability has established it for a few elements. This was the basis of the proof in the case of the quintic in [12]. Anyway, all said we have motivated the following conjecture (which, in perhaps a slightly different form, is due to Kontsevich [6], Horja [7] and Morrison [23]): Conjecture 1. If we loop around a component of the discriminant locus associated with a single D-brane A (and thus its translates) becoming massless then this results in a relabeling of D-branes given by an autoequivalence of the derived category in which B becomes Cone(hom(A, B) ⊗ A → B). This transformation was also motivated by its relation to mirror symmetry and studied at length by Seidel and Thomas [20]. 3 Pedants will object that the cone construction is only defined up to a non-canonical isomorphism making the transformation on morphisms badly-defined. Fortunately, as is well-known and we discuss at length in Sect. 3, this transformation can be written as a Fourier–Mukai transform removing this objection. 4 In [17] it was suggested that the monodromy action on the derived category should be translated by 2 to undo this action on A. Since monodromy is a relabeling process, one is free to do this, but it looks unnatural from the perspective of associating monodromy with -stability.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

51

2.2. General monodromies. It is then natural to ask what happens more generally, i.e., if more than just a single D-brane A becomes massless. In order to answer this we need to set up a general description of how one might analyze monodromy in a multi-dimensional moduli space. There are two paradigms for monodromy — both of which are useful: 1. The discriminant locus decomposes into a sum of irreducible divisors. Pick some base point in the moduli space and loop around a component of the discriminant “close” to the base point. 2. Restrict attention to a special rational curve C in the moduli space. This rational curve contains two “phase limit points”, in a sense to be described below, and a single point in the discriminant. The loop in question is around this unique discriminant point. In the case of the one parameter models, such as the quintic, these two paradigms coincide. The moduli space is C ∼ = P1 and the discriminant locus is a single point. If a component of the discriminant intersects C transversely then we can again have agreement between these two pictures of monodromy. In general the discriminant need not intersect C transversely — a fact we use to our advantage in Sect. 2.3. We now recall the relationship between the discriminant locus and phases as analyzed in [24, 25]. The following is a very rapid review. Please refer to the references for more details. To make the discussion easier we suffer a little loss of generality and assume we are in the “Batyrev-like” [26] case X being a hypersurface in a toric variety. The data for X is then presented in the form of a point set A which is the intersection of some convex polytope with some lattice N . See [27], for example, for more details of this standard construction. The conformal field theory associated to this data then has a phase structure where each “phase” is associated to a regular triangulation of A [28, 29]. The real vector space in which the K¨ahler form lives is naturally divided into a “secondary fan” of all possible phases. One cone of this fan is the K¨ahler cone for X where we have the “Calabi–Yau” phase. Mirror to X, Y is described as the zero-set of a polynomial W in many variables. The points in A are associated one-to-one with each monomial in W . Thus the data A is associated to deformations of complex structure of Y via the monomial-divisor mirror map [30]. If we model the moduli space of complex structures on Y by the space of coefficients in W , then the discriminant locus can be computed by the failure of W to be transversal. This can be mapped back to the space of complexified K¨ahler forms on X. The result is that part of the discriminant asymptotically lives in each wall dividing adjacent phases in the space of K¨ahler forms. That is to say, if we tune the B-field suitably we can always hit a bad conformal field theory as we pass from one phase to another. Thus we may associate singular conformal field theories with phase transitions. The discriminant itself is generically reducible. The combinatorial structure of this reduction has been studied in detail in [31]. In particular, any time an m-dimensional face of the convex hull of the set A contains more than m + 1 points, the resulting linear relationship between these points yields a component of . One may then follow an algorithm presented in [25] to compute the explicit form of each component. The general picture then is of a discriminant with many components with each component having “fingers” which separate the phases from each other. Each phase transition is associated with fingers from one or more component of . Torically each maximal cone in the secondary fan is associated to a point in the moduli space which gives the limit point in the “deep interior” of the associated phase.

52

P.S. Aspinwall, R.P. Horja, R.L. Karp

The real codimension-one wall between two maximal cones corresponds to a rational curve C passing through two such limit points. The rational curve C will intersect the discriminant locus in one point as promised earlier in this section. One component of is distinguished — it corresponds to the case of viewing the full convex hull as a face of itself. This is called the “primary” component of . Closely tied in with Conjecture 1 (and at least partially attributed to the same authors) is the following conjecture: Conjecture 2. At any point on the primary component of (reached by a suitable path from a suitable basepoint) the 6-brane associated with the structure sheaf OX and its translates become massless. At a generic point no other D-branes become massless. This idea was perhaps first discussed in [32]. It is certainly a very natural conjecture — the primary component of the discriminant is a universal feature for any Calabi–Yau manifold and so must be associated with the masslessness of a very basic D-brane. The fact that it works for the quintic was explicitly computed in [12], and presumably it is possible to verify the conjecture in a much larger class of examples. We will assume this conjecture to be true. The K¨ahler cone is a particular maximal cone in the secondary fan corresponding to the “Calabi–Yau” phase. Let us concentrate on the walls of the K¨ahler cone. A typical situation as we approach the wall of the K¨ahler cone is that an exceptional set E collapses to some space Z. We depict this as

E

i

/X

(12)

q

Z

where i is an inclusion (which may well be the identity) and q is a fibration with a strict inequality dim(E) > dim(Z). Associated with such a wall in the secondary fan we have a rational curve C in the moduli space connecting the large radius limit point with some other limit point. We wish to consider the monodromy associated to circling the point in the discriminant in C. The resulting autoequivalence on the derived category has been studied in [8] where it was dubbed an “EZ-transformation”. The simplest example would be the case of the quintic Calabi–Yau threefold which has only one deformation of the K¨ahler form. This single component of the K¨ahler form gives the overall size of the manifold. Thus the “wall” (i.e., the origin) corresponds to X collapsing to a point. In this case i is the identity map and Z is a point. This is the case discussed above in Sect. 2.1. Indeed, it appears that in all cases where Z is a point, the resulting monodromy amounts to a transform of the type studied in Sect. 2.1. It is precisely when Z is more than just a point the case of interest to us. 2.3. New monodromies from old. The precise form of the “EZ-monodromy conjecture” which associates an autoequivalence of D(X) with a given EZ-transform was given in [8]. Rather than appealing this conjecture, let us derive the simplest example of a more general case, by assuming the conjectures above, dealing with the case of a single massless D-brane.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

53

We will look at the well-known example [33], where X is a degree 8 hypersurface in the resolution of a weighted projective space P4{2,2,2,1,1} . The mirror Y is then a quotient of the same hypersurface with defining equation a0 z1 z2 z3 z4 z5 + a1 z14 + a2 z24 + a3 z34 + a4 z48 + a5 z58 + a6 z44 z44 .

(13)

The “algebraic” coordinates on the moduli space are then given by a 4 a5 . a62

(14)

0 = (1 − 28 x)2 − 218 x 2 y.

(15)

x=

a1 a2 a3 a6 , a04

y=

The primary component of can be computed as

The edge of the convex hull containing the points labeled by a4 , a5 , a6 leads to another component 1 = 1 − 4y,

(16)

with = 0 1 . X can be viewed as a K3-fibration π : X → P1 . In this case the component of the 1 K¨ahler form given asymptotically (for x, y 1) by 2πi log(x) controls the size of the 1 K3 fibre. The component of the K¨ahler form given asymptotically by 2πi log(y) gives the size of the P1 base. The base P1 is made very large by setting y → 0. In this case, we hit the primary component 0 of the discriminant when x = 2−8 . Let us refer to this point as P1 . Increasing x beyond this value moves one out of the Calabi–Yau phase into the hybrid “P1 -phase” where the model is best viewed as a fibration with base P1 and a Landau– Ginzburg orbifold as fibre [29]. Fixing y = 0 and varying x spans a rational curve C in the moduli space shown in Fig. 1. We would like to analyze the monodromy around the singularity P1 in Fig. 1. Clearly the transition associated with this monodromy consists of collapsing X onto the P1 base. In the language of (12), E = X, i.e., the inclusion map i is the identity, and Z ∼ = P1 . The map q is given by the fibration map π. In a way, we have constructed the simplest possible example where Z is more than just a point.

Large Radius Limit ( x = 0)

Calabi-Yau Singularity P1 ( x = 1/256) Hybrid P

1

H x=∞

Fig. 1. Moduli Space for y = 0

54

Hybrid Limit

P.S. Aspinwall, R.P. Horja, R.L. Karp

L

P1

C

Large Radius Limit

H ∆0

K

Fig. 2. Full Moduli Space around P1

Now the useful trick is that the monodromy around the singularity in Fig. 1 can be written in terms of other monodromies that we already understand. This was originally described in [33], while this feature was also exploited in [7, 34]. The full moduli space near P1 is shown in Fig. 2 (with complex dimensions shown as real). The rational curve C given by y = 0 corresponds to an infinite radius limit and as such we understand the monodromy around it (see for example [7, 34]). (We will follow closely the notation and analysis of [34]). Let L refer to the autoequivalence of D(X) we apply upon looping this curve. It follows that L(B) = B ⊗ OX (S),

(17)

where S is the divisor class of a K3 fibre in X. Meanwhile let K refer to the autoequivalence of D(X) we apply upon looping the primary component 0 = 0 (i.e., denote KOX of Sect. 2.1 by K). Then from Conjectures 1 and 2 we know that K(B) = Cone(hom(OX , B) ⊗ OX → B).

(18)

It follows (see, for example Sect. 5.1 of [34] for an essentially identical computation) that the autoequivalence for the desired loop shown in Fig. 1 around P1 is given by L−1 KLK.

(19)

The desired goal therefore is to find the autoequivalence of D(X) obtained by combining the transforms in the above form. The result is that L−1 KLK = H,

(20)

where H is an autoequivalence that acts on D(X) by H(B) = Cone(π ∗ π∗ B → B).

(21)

Section 3 is devoted to the proof of this statement. Let us review briefly what is exactly meant by the rather concise notation of (21). Given the map π : X → Z and a sheaf E on X we may construct the “push-forward” sheaf π∗ E on Z by associating π∗ E (U ) with E (π −1 U ) for any open set U ⊂ Z. The π∗ appearing in (21) is the right-derived functor of this push-forward map. This π∗ “knows” about the cohomology of the fibre of π (see, for example, Chapter III of [35]). The pull-back map π ∗ is defined for sheaves of OZ -modules, and in particular for locally free sheaves, and thus for vector bundles.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

55

The map π ∗ appearing in (21) is the corresponding left-derived functor. It is a central result of the theory of derived categories [36] that π ∗ is the left-adjoint of π∗ : HomX (π ∗ E, F) ∼ = HomZ (E, π∗ F),

(22)

for any E ∈ D(Z) and F ∈ D(X). It follows that HomX (π ∗ π∗ B, B) ∼ = HomZ (π∗ B, π∗ B).

(23)

Thus the most natural morphism that would appear in (21) is the image of the identity on the right-hand side of Eq. (23) under this natural isomorphism. One can show that this is indeed the case. 2.4. Interpretation of monodromy. Let us interpret (21) in light of our discussion of monodromy from -stability in Sect. 2.1. To aid our discussion consider how one might rewrite the monodromy result (18) for the primary component of the discriminant. Let c : X → x be the constant map of X to a single point. One can then show, using the fact that sheaf cohomology is equivalent to c∗ , which, in turn, is also given by the global section functor Hom(OX , −) [35], that (18) is equivalent to K(B) = Cone(c∗ c∗ B → B).

(24)

Now, the only D-brane (up to translation in D(X)) which becomes massless in this case is OX , which is equal to c∗ C, where we denote the trivial (very trivial!) line bundle on the point x as C. That is, massless D-branes for the primary component of the discriminant are given by c∗ (something). The c∗ in (24) then gives a natural map to form a cone as required. The expression (10) immediately dictates that we may interpret Cone(π ∗ π∗ B → B) in a similar way. The D-branes becoming massless at the point P1 in the moduli space correspond to π ∗ z for some z ∈ D(Z). The push-forward map π∗ can be viewed as the natural ingredient required to form the cone from (23). There is one technical subtlety here which needs to be mentioned. The set of objects of the form π ∗ z for any z ∈ D(Z) is not closed under composition by the cone construction. If we write C = Cone(f : π ∗ a → π ∗ b),

(25)

for two objects a, b ∈ D(Z), then we may only write C = π ∗ Cone(f : a → b),

(26)

when there is a relationship between the morphisms f = π ∗ f . Unfortunately such an f need not exist for arbitrary f . Thus we might more properly say that the set of massless D-branes are generated by objects of the form π ∗ z, where we allow for composing such states. Flushed with success at interpreting the autoequivalence given by this example we now write down the most obvious generalization for the more general EZ-transform (12). First of all, we expect q ∗ (something) to be a massless D-brane on E. E is mapped into X by the inclusion map i. The push-forward map i∗ is then “extension by zero” of a sheaf which is the obvious way of mapping D-branes on E into D-branes on X. Our next conjecture is then

56

P.S. Aspinwall, R.P. Horja, R.L. Karp

Conjecture 3. Any D-Brane which becomes massless at a point on a component of the discriminant associated with an EZ-transform is generated by objects of the form i∗ q ∗ z for z ∈ D(Z). This implies a corresponding autoequivalence for the monodromy from -stability: B → Cone(i∗ q ∗ ζ B → B),

(27)

for some “natural” map ζ : D(X) → D(Z). To compute ζ we use the same trick as (23). Introduce the functor “i ! ” as the right-adjoint of i∗ : HomX (i∗ E, F) ∼ = HomE (E, i ! F),

(28)

for any E ∈ D(E) and F ∈ D(X). The existence of i ! is one of the most important features of the derived category in algebraic geometry [36] and (28) may be regarded as a generalization of Serre Duality. Now we have HomX (i∗ i ! B, B) ∼ = HomE (i ! B, i ! B).

(29)

This leads to a natural map i∗ q ∗ q∗ i ! B → i∗ i ! B → B. This implies Conjecture 4. The monodromy around the discriminant in the wall associated to a phase transition given by an EZ-transform leads to the following autoequivalence on D(X): B → Cone(i∗ q ∗ q∗ i ! B → B).

(30)

This is equivalent to the EZ-monodromy conjecture of [8]. In particular, it was proven there that this is indeed an autoequivalence of D(X). We should perhaps emphasize that we have not proven Conjectures 3 and 4. Rather we have used the known connection between -stability and monodromy and generalized the example we considered in the simplest and most obvious way. Note that we may derive Conjecture 1 from Conjectures 3 and 4 as follows. Suppose Z is a point, then i∗ q ∗ z can only be one thing, so we have a single massless D-brane. Furthermore, q∗ now becomes the cohomology functor giving i∗ q ∗ q∗ i ! B = i∗ (homE (OE , i ! B) ⊗ OE ) = homX (i∗ OE , B) ⊗ i∗ OE .

(31)

This also shows that the massless D-brane is i∗ OE — i.e., the D-brane wrapping around E as observed in [2, 10, 37]. For completeness we should also obtain the monodromy as one circles the discriminant in the opposite direction. Going back to the triangle (4) we see that decreasing θ would result in C being replaced by B = Cone(C → A[1])[−1]. This implies we modify the above monodromy arguments to consider the transformation under which C → Cone(C → i∗ q ∗ ηC)[−1],

(32)

for some “natural” map η : D(X) → D(Z). That is, the massless objects bind “to the right” of C in the mapping cone rather than to the left. The “[−1]” is needed because of the asymmetrical definition of the mapping cone — we need to keep C in its original position.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

57

The only nontrivial step in copying the above argument is that to construct η we need a left-adjoint functor for q ∗ . Given that q ! F = q ∗ F ⊗ q ! OZ we can construct such a functor from HomE (E, q ∗ F) = HomE (E ⊗ q ! OZ , q ! F) = HomZ (q∗ (E ⊗ q ! OZ ), F). Thus our desired transform is given by C → Cone C → i∗ q ∗ q∗ (i ∗ C ⊗ q ! OZ ) [−1].

(33)

(34)

It was shown in [8] that this is indeed the inverse of the transformation given in Conjecture 4. 3. Composing Transforms In this section we will prove (20). For completeness, we choose to adopt a more general point of view and work with the class of Calabi–Yau fibrations over projective spaces. Thus we cover examples such as elliptic fibrations over P2 as discussed in Sect. 4.3, as well as the case of K3 fibrations over P1 as desired in Sect. 2.3. Readers not familiar with manipulations in the derived category may well wish to accept the result and skip this section. Having said that, some of the methods used in this section are very powerful and may have many other applications to D-brane physics. For the sake of brevity we use the formalism of kernels to describe the Fourier-Mukai transforms. For the convenience of the reader we review some of the key notions involved. The notations follow those of [8]. For X a non-singular projective variety, an object G ∈ D(X × X) determines an exact functor of triangulated categories G : D(X) → D(X) by the formula G (−) := p2∗ (G ⊗ p1∗ (−)),

(35)

where p1 : X × X → X is projection on the first factor, while p2 is projection on the second factor. The object G ∈ D(X × X) is called the kernel. The convenience in using kernels comes about because of the following natural isomorphism of functors: G G ∼ = G ◦ G . The composition of the kernels G , G ∈ D(X × X) is defined as ∗ ∗ G G := p13∗ p23 (G ) ⊗ p12 (G) ,

(36)

(37)

where pij is the obvious projection from X×X×X to the relevant two factors. There is an identity element for the composition of kernels: (X )∗ (OX ), where X : X → X × X is the diagonal morphism. In this section X is assumed to be a smooth Calabi–Yau fibration of dimension n over Z ∼ = Pd , with π : X → Z the fibration map. For us, the Calabi–Yau fibration structure simply means that π : X → Z is a flat morphism (see Sect. III.9 of [35]) with the generic fibre a Calabi–Yau variety of dimension n − d. Further assumptions on the Calabi–Yau fibration will be added shortly.

58

P.S. Aspinwall, R.P. Horja, R.L. Karp

In order to set the functors L, K and H of the previous section on firm mathematical footing and to define them in the more general context of this section, we need to describe the kernels that induce them as exact functors according to formula (35). The following commutative diagram contains most of the maps that we use in the sequel: X _

(38)

j

p2 /X X×X E E z EE z EE π zz π1 zz EE 2 z π π π×π EE z z EE z z E z E" |zz / Z. Z × Z Zo s1 s2 Xo

p1

The maps are mostly projections, that are obvious from the context, except for j := X : X → X × X the diagonal of X and π : X → Z the fibration map. We now define the Fourier–Mukai functor L to be the autoequivalence of D(X) induced by the kernel L = j∗ (π ∗ OZ (1)). This functor acts on D(X) by (compare to (17)) L(B) = B ⊗ π ∗ OZ (1).

(39)

Note that the use of the notation L for this functor is consistent with the one used in the previous section, since in the case when X is a K3 fibration over Z ∼ = P1 we have ∗ OX (S) = π OZ (1). We also define the exact functor K induced by the kernel K = Cone(OX×X → O ), with O := j∗ OX and OX×X → O the natural restriction map. We can quote for example Lemma 3.2 of [20] to conclude that the action of this functor on D(X) is indeed given by (18). We make the assumption that the sheaf OX is spherical (as defined in Sect. 2.1). This ensures that the functor K is an autoequivalence of D(X). Finally, we define the exact functor H to be the so-called fibrewise Fourier– Mukai transform associated to the Calabi–Yau fibration π : X → Z (see, for example, [20, 38, 39]). The functor H is induced by the kernel H = Cone(OX×Z X → O ), with OX×Z X viewed as a sheaf on X × X (extension by zero), and OX×Z X → O the restriction map. To ensure that the functor H is indeed an autoequivalence (Fourier–Mukai functor), we assume that the sheaf OX is EZ–spherical. In the language of [8], this means that there exists a distinguished triangle in D(Z) (Z ∼ = Pd ) of the form5 OZ → π∗ OX → OZ (−d − 1)[−n + d] → OZ [1].

(40)

Note that the sphericity and EZ-sphericity conditions are both satisfied in the specific example of a K3 fibration over P1 analyzed in the previous section of this paper. We now justify the use of the notation H to denote the fibrewise Fourier–Mukai functor by showing that its action on D(X) is indeed, even in the higher dimensional situation, given by the formula (21) of the previous section. 5

Lemma 3.12 in [20], as well as Example 3.3 of [8] provide sufficient conditions for (40) to hold.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

59

The fibre product X ×Z X fits in the fibre square diagram X×Z X

q1

q2

X

/X

(41)

π

π

/ Z,

and let k : X ×Z X → X × X denote the canonical embedding. Since π is of finite type and flat, we can apply “cohomology commutes with the base change” (Prop. III 9.3 of [35] or for a more general form Prop. II 5.12 of [36]): π ∗ π∗ B ∼ = q2∗ q1∗ B,

(42)

for some B in D(X). On the other hand, q1 = p1 ◦ k, and q2 = p2 ◦ k, so we can write π ∗ π∗ B ∼ = q2∗ q1∗ B ∼ = p2∗ k∗ k ∗ p1∗ B ∼ = p2∗ (k∗ OX×Z X ⊗ p1∗ B).

(43)

Note that the last line in the previous formula represents the action on D(X) of the exact functor OX×Z X induced as in (35) by the kernel OX×Z X (shorthand for k∗ OX×Z X ). This shows that H(B) = Cone( OX×Z X (B) → B) = Cone(π ∗ π∗ B → B) as desired. We are now ready to start discussing the main goal of this section which is to prove the following relation between the defined Fourier–Mukai functors: (L−d KLd ) . . . (L−1 KL)K ∼ = H.

(44)

Equivalently, the same formula can be expressed using kernels as (L−d K Ld ) . . . (L−1 K L) K ∼ = H,

(45)

where, for any integer i, Li = j∗ (π ∗ OZ (i)). Of course, the case d = 1 (Z ∼ = P1 ) of (44) is precisely formula (20) of the previous section. Before we move on with the technicalities of the proof, a few remarks are in order. Note that the parentheses in the two formulae are simply decorative: the composition of functors, as well as the composition of kernels are associative (but, of course, not commutative!). For a fixed object G in D(X × X), the functors G − and − G from D(X × X) to D(X × X) are exact functors between triangulated categories (i.e. they preserve the distinguished triangles). Therefore, for any integer i, we can start with the distinguished triangle defining the kernel K, OX×X → O → K → OX×X [1],

(46)

and apply to it the operations L−i − and − Li from the left, and right, respectively. But L−i OX×X Li ∼ = (π × π )∗ (OZ (−i) OZ (i)), = π2∗ OZ (−i) ⊗ π1∗ OZ (i) ∼

(47)

and L−i O Li ∼ = O .

(48)

60

P.S. Aspinwall, R.P. Horja, R.L. Karp

The notation OZ (−i) OZ (i) (the exterior tensor product) will be used quite often in what follows and simply designates s2∗ OZ (−i) ⊗ s1∗ OZ (i). As a shorthand for later convenience we introduce the following objects in D(X×X) : Ti := (π × π)∗ (OZ (−i) OZ (i)) ∼ = π2∗ OZ (−i) ⊗ π1∗ OZ (i)

(49)

Ci := L−i K Li ∼ = Cone(Ti → O ),

(50)

and

for i ∈ Z. To justify the definition of the kernel Ci , we need to explain how to define the morphism Ti → O . We start with the canonical pairing map on Z × Z, OZ (−i) OZ (i) → OZ ,

(51)

(π × π)∗ (OZ (−i) OZ (i)) → (π × π )∗ (OZ ).

(52)

(π × π)∗ (OZ ) ∼ = OX×Z X .

(53)

and lift it to X × X ,

We claim that

Indeed, for the fibre square X ×Z X _

t

/Z

π×π

/ Z×Z

Z

k

X×X

(54)

with t : X ×Z X → Z the “diagonal map” of the fibre square (41), and π × π flat, we can apply again “cohomology commutes with the base change” to obtain (π × π )∗ (Z )∗ OZ = k∗ t ∗ OZ . Since t ∗ OZ = OX×Z X , the previous formula can be written as (π × π )∗ (Z )∗ OZ = OX×Z X , which is exactly (53). The morphism Ti → O is then defined as the composition Ti ∼ = OX×Z X → O . = (π × π )∗ (OZ (−i) OZ (i)) → (π × π)∗ (OZ ) ∼

(55)

Note that C0 ∼ = K. Therefore we have to show that Cd . . . C1 C0 ∼ = H.

(56)

Note that the case d = 0 is immediate since in that case Z reduces to a point, and the Fourier–Mukai transforms H and K coincide. An important rˆole in what follows will be played by Beilinson’s resolution of the diagonal in Z × Z ∼ = Pd × Pd [40], 0 → OZ (−d) dZ (d) → . . . → OZ (−1) 1Z (1) → OZ×Z → OZ → 0, (57) where iZ is the sheaf of holomorphic i-forms on Pd .

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

61

For any integer i, 0 ≤ i ≤ d, we define the complexes Si on Z ×Z to be the following truncated versions of Beilinson’s resolution 0 → OZ (−i) iZ (i) → . . . → OZ (−1) 1Z (1) → OZ×Z → 0,

(58)

arranged such that the sheaf OZ×Z is located at the 0th position. Define Si := (π × π)∗ (Si ).

(59)

We claim that there exists a natural map 6 Si → O ,

(60)

that can be defined at the level of complexes, where, as usual, O denotes the complex on X × X with the only non-zero component located at the 0th position. To see this, we first make the remark that there exists a map of complexes Si → OZ . Such a map of complexes is well defined, since the complex Si is a piece of Beilinson’s resolution. We can now proceed as in (55) and define the desired morphism in D(X × X) as the composition Si = (π × π)∗ (Si ) → (π × π)∗ (OZ ) ∼ = OX×Z X → O .

(61)

The key result to be proved in this section is the following: Claim. For any integer i, 0 ≤ i ≤ d, Ci . . . C1 C0 ∼ = Cone(Si → O ).

(62)

Before proving it, let us convince ourselves that the claim implies (56). The complex Sd is quasi-isomorphic to the sheaf OZ (more precisely, to the complex on Z × Z having the sheaf OZ at the 0th position). Therefore, the i = d case of the claim states that Cd . . . C1 C0 ∼ = Cone((π × π)∗ (OZ ) → O ).

(63)

Since by (53) (π × π)∗ (OZ ) ∼ = OX×Z X , we see that the kernel Cd . . . C1 C0 is indeed isomorphic to H. We now proceed with the inductive proof of the claim. The induction is performed with respect to i. The case i = 0 is clear, since S0 = OZ×Z and S0 = (π ×π )∗ (OZ×Z ) = OX×X . To prove the inductive step i ⇒ (i + 1), we start with the natural maps Si → O and Ti+1 → O . Applying the functors − Si and Ti+1 − we get two more maps, and we can form the commutative square Ti+1 Si

/ Si

Ti+1

/ O .

(64)

6 In fact, by assuming that the sheaf O is EZ–spherical, it can be shown that Hom ∼ X X×X (Si , O ) = C. Therefore, the described nonzero morphism from Si to O is essentially unique.

62

P.S. Aspinwall, R.P. Horja, R.L. Karp

There is a nice result due to Verdier, guaranteeing that a commuting square X

/ Y

X

/Y

(65)

extends to a “9–diagram” of the form X

/ Y

/ Z

/ X [1]

X

/Y

/Z

/ X[1]

X

/ Y

/ Z

/ X [1]

X [1]

/ Y [1]

/ Z [1]

/ X [2] ,

(66)

where all the rows and columns are distinguished triangles, every square commutes, except for the last one (containing the shift operator [2]), which anticommutes (for more details see p. 24 in [41]). Applying Verdier’s “9–diagram” construction to (64) yields Ti+1 Si

/ Si

/ Ci+1 Si

/ Ti+1 Si [1]

/ O

/ Ci+1

/ Ti+1 [1]

/ Ci . . . C0

/ Ci+1 (Ci . . . C0 )

/ Ti+1 (Ci . . . C0 )[1]

/ Si [1]

/ Ci+1 Si [1]

/ Ti+1 Si [2] .

Ti+1 Ti+1 (Ci . . . C0 ) Ti+1 Si [1]

(67)

We are interested in the term Ci+1 Ci . . . C0 . To compute it, return for a moment to the commutative diagram (66), and consider the “diagonal” map Y → Z . The axioms of a triangulated category guarantee that the morphism Y → Z can be included in a distinguished triangle of the form A → Y → Z → A[1].

(68)

A crucial piece of the proof of Verdier’s “9–diagram” (p. 24 in [41]) provides another distinguished triangle that involves A, namely A → X → Y [1] → A[1] .

(69)

Returning to diagram (67), we obtain the distinguished triangles X → O → Ci+1 Ci . . . C0 → X [1] ,

(70)

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

63

and X → Ti+1 Ci . . . C0 → Si [1] → X [1] ,

(71)

for some element X in D(X × X). We plan on using the latter triangle to compute the object X , but for that we need to first understand the term Ti+1 Ci . . . C0 . The leftmost column of diagram (67) shows that Ti+1 Ci . . . C0 ∼ = Cone(Ti+1 Si → Ti+1 ).

(72)

∗ T ∗ By definition Ti+1 Si = p13∗ (p23 i+1 ⊗ p12 Si ). After inspecting the definitions of the kernels Ti+1 and Si , it is not hard to see that computing Ti+1 Si requires the calculation of kernels of the type j ∗ ∗ (π3∗ OZ (−i − 1) ⊗ π2∗ OZ (i + 1)) ⊗ p12 (π2∗ OZ (−j ) ⊗ π1∗ Z (j ))) ∼ p13∗ (p23 = j ∗ ∗ ∼ (73) = (π × π ) OZ (−i − 1) (j ) ⊗ ( X )∗ (π OZ (i + 1 − j )) , Z

with 0 ≤ j ≤ i, and X : X → {pt} the projection to a point. But ( X )∗ (π ∗ OZ (i + 1 − j )) ∼ = ( Z )∗ (π∗ OX ⊗ OZ (i + 1 − j )). Since OX is EZ– spherical, the long exact cohomology sequence induced by the distinguished triangle (40) implies that ( Z )∗ (π∗ OX ⊗ OZ (i + 1 − j )) ∼ = HomZ (OZ , OZ (i + 1 − j )).

(74)

Summing up our work, we can conclude that Cone(Ti+1 Si → Ti+1 ) ∼ = (π × π)∗ (OZ (−i − 1) Ui ),

(75)

where Ui is the following complex in D(Z) : 0 → iZ (i) ⊗ HomZ (OZ , OZ (i + 1 − i)) j

→ . . . → Z (j ) ⊗ HomZ (OZ , OZ (i + 1 − j )) → . . . → 1Z (1) ⊗ HomZ (OZ , OZ (i + 1 − 1)) → OZ ⊗ HomZ (OZ , OZ (i + 1)) → OZ (i + 1) → 0,

(76)

arranged such that the sheaf OZ (i + 1) is located at the 0th position. But what is the complex Ui after all? Again the answer can be obtained by employing Beilinson’s resolution. Consider the following two quasi-isomorphic complexes obtained by truncating (57) at the appropriate place: 0 → dZ (d) OZ (−d) → . . . → i+1 Z (i + 1) OZ (−i − 1) → 0, 0 → iZ (i) OZ (−i) → . . . → OZ×Z → OZ → 0,

(77)

arranged such that the sheaf OZ in the second complex is located at the 0th position th (and, as a result, the sheaf i+1 Z (i + 1) OZ (−i − 1) is located at the (−i − 1) position in the first complex). Call these complexes Ai and Bi , respectively.

64

P.S. Aspinwall, R.P. Horja, R.L. Karp

On one hand, the well known properties of the cohomology groups of the projective space Z ∼ = Pd , give that s2∗ (Bi ⊗ s1∗ OZ (i + 1)) ∼ = Ui .

(78)

On the other hand, the same cohomology properties give that s2∗ (Ai ⊗ s1∗ OZ (i + 1)) ∼ = i+1 Z (i + 1)[i + 1].

(79)

But Bi and Ai are quasi-isomorphic (i.e. isomorphic in D(Z × Z)), hence Ui ∼ = i+1 Z (i + 1)[i + 1],

(80)

and Ti+1 Ci . . . C0 ∼ = Cone(Ti+1 Si → Ti+1 ) ∗ ∼ = (π × π) (OZ (−i − 1) i+1 (i + 1))[i + 1] . Z

(81)

The distinguished triangle (71) and the definition (59) of the complexes Si show that in fact our unknown complex X is nothing else but Si+1 . The proof by induction of the main claim of this section is then finished by invoking the distinguished triangle (70). 4. Applications 4.1. Z is a point. It was discussed above that the case of Z being a point amounts to a single D-brane becoming massless. This is the case originally studied by Strominger in [14] and yielding monodromy of the form studied in detail by Seidel and Thomas [20]. There are three possibilities: 1. E = X in which case we are looking at the primary component of the discriminant. The quintic was studied at length in [12]. 2. E is a complex surface of codimension one in X. This could arise from the blow-up of an isolated quotient singularity. This was studied for example in [37]. 3. E is a rational curve. This is the flop case and was studied in [13]. 4.2. Z is a curve. Now we have an infinite number of massless D-branes arising from the derived category of an algebraic curve. There are two possibilities: 1. E = X in which case X is a K3-fibration and Z ∼ = P1 . This is the case we studied in Sect. 2. 2. E is a ruled surface arising from blowing up a curve of quotient singularities in X. In either case we are essentially looking at nonperturbatively enhanced gauge symmetry [42, 43]. Putting z = Op for some point p ∈ Z we obtain a soliton q ∗ z = OF which is the structure sheaf of a single fibre F of the map q. These correspond to the charged vector bosons responsible for the enhanced gauge symmetry. Clearly these bosons classically have a moduli space given by Z since p may vary in Z. Upon including quantum effects this leads to a number of massless hypermultiplets in the theory given by the genus of Z [43, 44].

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

65

Putting z = OZ we obtain q ∗ z = OX which corresponds to one of the massless monopoles. In fact we may analyze the complete spectrum of solitons in Seiberg–Witten theory [45] by using the “geometric engineering” approach of [46] to “zoom in” on the point in the moduli space where the nonabelian gauge symmetry appears. We will explain exactly what happens in detail in [15]. It is worth speculating that such analysis of Seiberg–Witten theory may shed some light on the “local mirror symmetry” story of papers such as [47]. The massless B-type D-branes associated with the derived category of Z may be related to the A-type D-brane story of [48] by some kind of local homological mirror symmetry. 4.3. Z is a surface. There is only one possibility, namely X = E is an elliptic fibration over Z. We will denote this fibration π : X → S. Let z ∈ D(S) correspond to the skyscraper sheaf of a point s ∈ S. Then π ∗ z ∈ D(X) corresponds to the structure sheaf of an elliptic fibre e ⊂ X over s. Let us consider the case where the size of S becomes infinite. According to our rules then, the 2-brane wrapping e should become massless when we hit the discriminant moving from the large radius phase to the phase where the elliptic fibres such as e have collapsed. At first sight this looks peculiar. One does not usually expect a 2-brane wrapped around a 2-torus to become massless for a particular radius of the torus! Actually we will argue that this indeed happens and that it is when the 2-torus is zero sized that the 2-brane becomes massless. Why T-duality doesn’t interfere with this will become apparent.7 As a specific example let X be given by the following equation in P4{9,6,1,1,1} : x12 + x23 + x318 + x418 + x518 .

(82)

This has a quotient singularity which may be resolved with an exceptional divisor P2 . One may regard [x3 , x4 , x5 ] as homogeneous coordinates on this S ∼ = P2 . Fixing a point 2 on S one then has an elliptic fibre e given by a sextic in P{3,2,1} . See, for example, [49] for more details about such fibrations. The moduli space of interest to us regards the area of e. This area is infinite for the Calabi–Yau limit point and shrinks down as we approach the other limit point. For a more precise statement let us consider the mirror e˜ of e given by the following equation in P2{3,2,1} : 1

x12 + x23 + x36 + 432 6 ψx1 x2 x3 .

(83)

Varying the size of e is then mirror to varying the complex structure of e˜ by varying ψ. Going through the usual story of solving the Picard–Fuchs equation and using the mirror map to map back to the B + iJ plane for e we obtain the shaded region in Fig. 3. The result is that for the Calabi–Yau limit point we √ have ψ → ∞ and thus J → ∞ as expected. For the other limit point ψ = 0 and J = 3/2. The discriminant is given by ψ = 1 and corresponds to J = 0, i.e., e has zero area. This is where the 2-branes wrapping e (and all the other D-branes given by π ∗ z for z ∈ D(S)) become massless. So what about T-duality for e? Note that the moduli space in Fig. 3 corresponds to two fundamental regions for the action of SL(2, Z) on the upper-half plane. T-duality 7

We are grateful to R. Plesser for an invaluable discussion on this point.

66

P.S. Aspinwall, R.P. Horja, R.L. Karp

ψ

∞

ψ =0 ψ =1 Fig. 3. Moduli Space for Elliptic Curve

should map the point at ψ = 1 to the point at ψ = ∞. It is important to remember however that T-duality does not act only upon e — one must also shift the string dilaton by an amount related to the resulting change in the area of the torus. As such, the point at ψ = 1 is related to ψ = ∞ only with the string coupling shifted off to infinity, implying again that the D-brane mass is zero. Therefore if we fix the dilaton to be a finite value we cannot use T-duality to relate ψ = 1 to any large radius torus. This explains why we really do have a massless 2-brane appearing wrapped around a zero-sized torus. Note also that if we allow the base S ∼ = P2 to have finite size then the T-duality group ceases to exist anyway as in [50]. The fact that there is a T-duality relating ψ = ∞ to ψ = 1 shows that their physics must be similar. In particular ψ = 1 must be an infinite distance away in the moduli space. Indeed, in many respects the spectrum of stable D-branes at ψ = 1, coming from the derived category of P2 , must be similar to the spectrum one would see upon going to a large radius limit. It would be interesting to investigate this in more detail and generality. 4.4. The Exoflop. Finally let us note that not all the walls of the K¨ahler cone correspond directly to some subspace E collapsing to Z. Even so, it appears that we can still fit many, if not all examples into the general EZ language. We illustrate this with an “exoflop”. Let X be the degree 12 hypersurface in P4{3,3,3,2,1} . Its mirror, Y , then has defining equation a0 z1 z2 z3 z4 z5 + a1 z14 + a2 z24 + a3 z34 + a4 z46 + a5 z512 + a6 z42 z58 + a7 z44 z54 ,

(84)

and we may use the following algebraic coordinates on the moduli space of complex structures of Y : x=

a1 a2 a3 a62 a04 a5

,

y=

a 4 a6 , a72

z=

a 5 a7 . a62

(85)

We have chosen coordinates so that the K¨ahler cone of X appears naturally as a positive octant in the secondary fan. In particular this means that the 3 rational curves

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

67

in the moduli space connecting the large radius limit to each of the three neighbouring phase limit points are given by setting x = y = 0, x = z = 0 or y = z = 0 respectively. The discriminant has two components given by 0 = −1 − 64x + 768xz + 32768x 2 z − 196608x 2 z2 − 4194304x 3 z2 + 294912x 2 yz2 +16777216x 3 (yz2 + z3 ) − 75497472x 3 yz3 + 113246208x 3 y 2 z4 (86) and 1 = −1 + 4y + 4z − 18yz + 27y 2 z2 .

(87)

The phase transition of interest occurs for the rational curve C for which y = z = 0. For small x we are in the Calabi–Yau phase. For large x the Calabi–Yau X undergoes an “exoflop” [51]. That is X becomes reducible with one component consisting of a threefold with a singularity. The other component consists of a fibration of a Landau– Ginzburg orbifold theory over a P1 . These components intersect at a point which is the singularity in the threefold component. This then is not an EZ transformation. Note however that the discriminant 0 inter1 sects C transversely at (x, y, z) = (− 64 , 0, 0) and so the monodromy within C is exactly given by monodromy around the primary component of the discriminant. Therefore exactly one D-brane becomes massless at the transition point — the D6-brane wrapping X. This exoflop transition is equivalent, as far as monodromy is concerned, to an EZ transformation with X = E and Z given by a point. This is not entirely surprising given the following. Classically one would describe the exoflop wall of the K¨ahler cone as a wall where X J 3 = 0. Thus the classical volume of X is going to zero, even though the volume of some surfaces and curves within X, as measured by the K¨ahler form, do not vanish. Since the volume of X vanishes, one should expect the D6-brane to have vanishing mass. The fact that no other D-branes become massless is not obvious. It would be interesting to show that all phase transitions give rise to monodromies that can be associated with EZ transformations. Acknowledgements. It is a pleasure to thank J. Distler, D. Morrison and R. Plesser for useful conversations. P.S.A. is supported in part by NSF grant DMS-0074072 and by a research fellowship from the Alfred P. Sloan Foundation. R.L.K. was partly supported by NSF grants DMS-9983320 and DMS-0074072.

References 1. Kontsevich, M.: Homological Algebra of Mirror Symmetry. In: “Proceedings of the International Congress of Mathematicians”, Basel-Boston: Birkh¨auser, 1995, pp. 120–139 2. Douglas, M. R.: D-Branes, Categories and N=1 Supersymmetry. J. Math. Phys. 42, 2818–2843 (2001) 3. Lazaroiu, C. I.: Unitarity, D-Brane Dynamics and D-brane Categories. JHEP 12, 031 (2001) 4. Aspinwall, P. S., Lawrence, A. E.: Derived Categories and Zero-Brane Stability. JHEP 08, 004 (2001) 5. Diaconescu, D.-E.: Enhanced D-brane Categories from String Field Theory. JHEP 06, 016 (2001) 6. Kontsevich, M.: 1996, Rutgers Lecture, unpublished 7. Horja, R. P.: Hypergeometric Functions and Mirror Symmetry in Toric Varieties. http://arxic.org/list/math.AG/9912109, 1999 8. Horja, R. P.: Derived Category Automorphisms from Mirror Symmetry. Duke Math. J. 127, 1–34 (2005) 9. Douglas, M. R., Fiol, B., R¨omelsberger, C.: Stability and BPS Branes. http://arxiv.org/list/hepth/0002037, 2000 10. Douglas, M. R., Fiol, B., Romelsberger, C.: The Spectrum of BPS Branes on a Noncompact CalabiYau. http://arxiv.org/list/hep-th/0003263, 2000

68

P.S. Aspinwall, R.P. Horja, R.L. Karp

11. 12. 13. 14.

Douglas, M. R.: Topics in D-geometry. Class. Quant. Grav. 17, 1057–1070 (2000) Aspinwall, P. S., Douglas, M. R.: D-Brane Stability and Monodromy. JHEP 05, 031 (2002) Aspinwall, P. S.: A Point’s Point of View of Stringy Geometry. JHEP 01, 002 (2003) Strominger, A.: Massless Black Holes and Conifolds in String Theory. Nucl. Phys. B451, 96–108 (1995) Aspinwall, P. S., Karp, R. L.: Solitons in Seiberg–Witten Theory and D-Branes in the Derived Category. JHEP 04, 049 (2003) Sen, A.: Tachyon Condensation on the Brane Antibrane System. JHEP 08, 012 (1998) Distler, J., Jockers, H., Park, H.: D-Brane Monodromies, Derived Categories and Boundary Linear Sigma Models. http://arxiv.org/list/hep-th/0206242, 2002 Fukaya, K.: Floer Homology and Mirror Symmetry I. AMS/IP Stud. in Adv. Math. 23, Providence, RI: Amer. Math. Soc., 2001, pp. 15–43 Seidel, P.: Graded Lagrangian Submanifolds. Bull. Soc. Math. France 128, 103–149 (2000) Seidel, P., Thomas, R. P.: Braid Groups Actions on Derived Categories of Coherent Sheaves. Duke Math. J. 108, 37–108 (2001) Bridgeland, T., Maciocia, A.: Fourier-Mukai transforms for Quotient Varieties. http://arxiv.org/list/math.AG/9811101, 1998 Candelas, P., de la Ossa, X. C., Green, P. S., Parkes, L.: A Pair of Calabi–Yau Manifolds as an Exactly Soluble Superconformal Theory. Nucl. Phys. B359, 21–74 (1991) Morrison, D. R.: Geometric Aspects of Mirror Symmetry. In: Enquist, B., Schmid, W. (eds.), “Mathematics Unlimited – 2001 and Beyond”, Berlin-Heidelberg-NewYork: Springer-Verlag, 2001, pp. 899–918 Aspinwall, P. S., Greene, B. R., Morrison, D. R.: Measuring Small Distances in N = 2 Sigma Models. Nucl. Phys. B420, 184–242 (1994) Morrison, D. R., Plesser, M. R.: Summing the Instantons: Quantum Cohomology and Mirror Symmetry in Toric Varieties. Nucl. Phys. B440, 279–354 (1995) Batyrev, V. V.: Dual Polyhedra and Mirror Symmetry for Calabi–Yau Hypersurfaces in Toric Varieties. J. Alg. Geom. 3, 493–535 (1994) Aspinwall, P. S., Greene, B. R.: On the Geometric Interpretation of N = 2 Superconformal Theories. Nucl. Phys. B437, 205–230 (1995) Witten, E.: Phases of N = 2 Theories in Two Dimensions. Nucl. Phys. B403, 159–222 (1993) Aspinwall, P. S., Greene, B. R., Morrison, D. R.: Multiple Mirror Manifolds and Topology Change in String Theory. Phys. Lett. 303B, 249–259 (1993) Aspinwall, P. S., Greene, B. R., Morrison, D. R.: The Monomial-Divisor Mirror Map. Internat. Math. Res. Notices, 1993, pp. 319–338 Gelfand, I. M., Kapranov, M. M., Zelevinski, A. V.: Discriminants, Resultants and Multidimensional Determinants. Basel-Boston: Birkh¨auser, 1994 Greene, B. R., Kanter,Y.: Small Volumes in Compactified String Theory. Nucl. Phys. B497, 127–145 (1997) Candelas, P. et al.: Mirror Symmetry for Two Parameter Models — I. Nucl. Phys. B416, 481–562 (1994) Aspinwall, P. S.: Some Navigation Rules for D-brane Monodromy. J. Math. Phys. 42, 5534–5552 (2001) Hartshorne, R.: Algebraic Geometry. Graduate Texts in Mathematics 52, Berlin-Heidelberg-New York: Springer-Verlag, 1977 Hartshorne, R.: Residues and Duality. Lecture Notes in Math. 20, Berlin-Heidelberg-New York: Spinger-Verlag, 1966 Diaconescu, D.-E., Gomis, J.: Fractional Branes and Boundary States in Orbifold Theories. JHEP 10, 001 (2000) Bridgeland, T., Maciocia, A.: Fourier-Mukai Transforms for K3 and Elliptic Fibrations. http://arxiv.org/list/math.AG/9908022, 1999 Andreas, B., Curio, G., Hernandez Ruiperez, D., Yau, S.-T.: Fourier–Mukai Transforms and Mirror Symmetry for D-Branes on Elliptic Calabi–Yau. http://arxiv.org/list/math.AG/0012196, 2000 Beilinson, A. A.: Coherent Sheaves on Pn and Problems in Linear Algebra. Funct. Anal. Appl. 12, 214–216 (1978) Beilinson, A. A., Bernstein, J. N., Deligne, P.: Faisceaux pervers. Ast´erisque 100, (1982) Aspinwall, P. S.: Enhanced Gauge Symmetries and Calabi–Yau Threefolds. Phys. Lett. B371, 231– 237 (1996) Katz, S., Morrison, D. R., Plesser, M. R.: Enhanced Gauge Symmetry in Type II String Theory. Nucl. Phys. B477, 105–140 (1996) Witten, E.: Phase Transitions in M-Theory and F-Theory. Nucl. Phys. B471, 195–216 (1996)

15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41. 42. 43. 44.

Massless D-Branes on Calabi–Yau Threefolds and Monodromy

69

45. Seiberg, N., Witten, E.: Electric - Magnetic Duality, Monopole Condensation, and Confinement in N=2 Supersymmetric Yang-Mills Theory. Nucl. Phys. B426, 19–52 (1994) (erratum-ibid. B430, 485–486 (1994)) 46. Kachru, S. et al.: Nonperturbative Results on the Point Particle Limit of N=2 Heterotic String Compactifications. Nucl. Phys. B459, 537–558 (1996) 47. Katz, S., Mayr, P., Vafa, C.: Mirror Symmetry and Exact Solution of 4D N = 2 Gauge Theories. I. Adv. Theor. Math. Phys. 1, 53–114 (1998) 48. Klemm, A. et al.: Self-Dual Strings and N=2 Supersymmetric Field Theory. Nucl. Phys. B477, 746–766 (1996) 49. Vafa, C., Witten, E.: Dual String Pairs With N = 1 and N = 2 Supersymmetry in Four Dimensions. In: “S-Duality and Mirror Symmetry”, Nucl. Phys. (Proc. Suppl.) B46, 225–247 (1996) 50. Aspinwall, P. S., Plesser, M. R.: T-Duality Can Fail. J. High Energy Phys. 08, 001 (1999) 51. Aspinwall, P. S., Greene, B. R., Morrison, D. R.: Calabi–Yau Moduli Space, Mirror Manifolds and Spacetime Topology Change in String Theory. Nucl. Phys. B416, 414–480 (1994) Communicated by N.A. Nekrasov

Commun. Math. Phys. 259, 71–78 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1391-9

Communications in

Mathematical Physics

The First Eigenvalue of the Dirac Operator on Compact Spin Symmetric Spaces Jean-Louis Milhorat Laboratoire Jean Leray, UMR CNRS 6629, D´epartement de Math´ematiques, Universit´e de Nantes, 2, rue de la Houssini`ere, BP 92208, 44322 Nantes Cedex 03, France. E-mail: [email protected] Received: 12 February 2004 / Accepted: 21 March 2005 Published online: 8 July 2005 – © Springer-Verlag 2005

Abstract: We give a formula for the first eigenvalue of the Dirac operator acting on spinor fields of a spin compact irreducible symmetric space G/K. 1. Introduction It is well-known that symmetric spaces provide examples where detailed information on the spectrum of Laplace or Dirac operators can be obtained. Indeed, for those manifolds, the computation of the spectrum can be (theoretically) done using group theoretical methods. However the explicit computation is far from being simple in general and only a few examples are known. On the other hand, many results require some information about the first (nonzero) eigenvalue, so it seems interesting to get this eigenvalue without computing all the spectrum. In that direction, the aim of this paper is to prove the following formula for the first eigenvalue of the Dirac operator: Theorem 1. Let G/K be a compact, simply-connected, n-dimensional irreducible symmetric space with G compact and simply-connected, endowed with the metric induced by the Killing form of G sign-changed. Assume that G and K have the same rank and that G/K has a spin structure. Let βk , k = 1, . . . , p, be the K-dominant weights occurring in the decomposition into irreducible components of the spin representation under the action of K. Then the square of the first eigenvalue of the Dirac operator is 2 min βk 2 + n/8 , 1≤k≤p

(1)

where · is the norm associated to the scalar product < , > induced by the Killing form of G sign-changed. Remark 1. The proof uses a lemma of R. Parthasarathy in [Par71], which allows to express (1) in the following way. Let T be a fixed common maximal torus of G and K. Let be the set of non-zero roots of G with respect to T . Let δG , (resp. δK ) be

72

J.-L. Milhorat

the half-sum of the positive roots of G, (resp. K), with respect to a fixed lexicographic ordering in . Then the square of the first eigenvalue of the Dirac operator is given by 2 δG 2 + 2 δK 2 − 4 max < w · δG , δK > +n/8 , w∈W

(2)

where W is a certain (well-defined) subset of the Weyl group of G.

2. The Dirac Operator on a Spin Compact Symmetric Space We first review some results about the Dirac operator on a spin symmetric space, cf. for instance [CFG89] or [B¨ar91]. A detailed survey on the subject may be found, among other topics, in the reference [BHMM]. Let G/K be a spin compact symmetric space. We assume that G/K is simply connected, so G may be chosen to be compact and simply connected and K is the connected subgroup formed by the fixed elements of an involution σ of G, cf. [Hel78]. This involution induces the Cartan decomposition of the Lie algebra G of G into G = K ⊕ P, where K is the Lie algebra of K and P is the vector space {X ∈ G ; σ∗ · X = −X}. This space P is canonically identified with the tangent space to G/K at the point o, o being the class of the neutral element of G. We also assume that the symmetric space G/K is irreducible, so all the G-invariant scalar products on P, hence all the G-invariant Riemannian metrics on G/K are proportional. We consider the metric induced by the Killing form of G sign-changed. With this metric, G/K is an Einstein space with scalar curvature Scal = n/2, (cf. for instance Theorem 7.73 in [Bes87]). The spin condition implies that the homomorphism α : K → SO(P) SOn , k → AdG (k)|P lifts to a homomorphism α : K → Spinn , cf. [CG88]. Let ρ : Spinn → HomC (, ) be the spin representation. The composition ρ ◦ α defines a “spin” representation of K which is denoted ρK . The spinor bundle is then isomorphic to the vector bundle := G ×ρK . Spinor fields on G/K are then viewed as K-equivariant functions G → , i.e. functions: :G→

s.t.

∀g ∈ G , ∀k ∈ K , (gk) = ρK (k −1 ) · (g) .

Let L2K (G, ) be the Hilbert space of L2 K-equivariant functions G → . The Dirac operator D extends to a self-adjoint operator on L2K (G, ). Since it is an elliptic operator, it has a (real) discrete spectrum. Now if the spinor field is an eigenvector of D for the eigenvalue λ, then the spinor field σ ∗ · is an eigenvector for the eigenvalue −λ, hence the spectrum of the Dirac operator is symmetric with respect to the origin. Thus the spectrum of D may be deduced from the spectrum of its square D2 . By the PeterWeyl theorem, the natural unitary representation of G on the Hilbert space L2K (G, ) decomposes into the Hilbert sum ⊕ Vγ ⊗ HomK (Vγ , ) ,

γ ∈G

The First Eigenvalue of the Dirac Operator on Compact Spin Symmetric Spaces

73

is the set of equivalence classes of irreducible unitary complex representations where G and HomK (Vγ , ) is the vector space of of G, (ργ , Vγ ) represents an element γ ∈ G K-equivariant homomorphisms Vγ → , i.e. HomK (Vγ , ) = {A ∈ Hom(Vγ , ) s.t. ∀k ∈ K , A ◦ ργ (k) = ρK (k) ◦ A} . The injection Vγ ⊗ HomK (Vγ , ) → L2K (G, ) is given by v ⊗ A → g → (A ◦ ργ (g −1 ) ) · v . Note that Vγ ⊗ HomK (Vγ , ) consists of C ∞ spinor fields to which the Dirac operator can be applied. The restriction of D2 to the space Vγ ⊗ HomK (Vγ , ) is given by the Parthasaraty formula, [Par71]: Scal v ⊗ A, (3) 8 where Cγ is the Casimir operator of the representation (ργ , Vγ ). Now since the representation is irreducible, the Casimir operator is a scalar multiple of identity, Cγ = cγ id, Hence if HomK (Vγ , ) = {0}, where the eigenvalue cγ only depends on γ ∈ G. 2 cγ + n/16 belongs to the spectrum of D . Let ρK = ⊕ρK,k be the decomposition of the spin representation K → into irreducible components. Denote by m(ργ |K , ρK,k ) the multiplicity of the irreducible K-representation ρK,k in the representation ργ restricted to K. Then dim HomK (Vγ , ) = m(ργ |K , ρK,k ) . D2 (v ⊗ A) = v ⊗ (A ◦ Cγ ) +

k

So the spectrum of the square of the Dirac operator is s.t. ∃k s.t. m(ργ , ρK,k ) = 0} . Spec(D2 ) = {cγ + n/16 ; γ ∈ G |K

(4)

3. Proof of the Result We assume that G and K have the same rank. Let T be a fixed common maximal torus. Let be the set of non-zero roots of the group G with respect to T . According to a classical terminology, a root θ is called compact if the corresponding root space is contained in KC (that is, θ is a root of K with respect to T ) and noncompact if the root + space is contained in PC . Let + G be the set of positive roots of G, K be the set of + positive roots of K, and n be the set of positive noncompact roots with respect to a fixed lexicographic ordering in . The half-sums of the positive roots of G and K are respectively denoted δG and δK and the half-sum of noncompact positive roots is denoted by δn . The Weyl group of G is denoted WG . The space of weights is endowed with the WG -invariant scalar product < , > induced by the Killing form of G sign-changed. Let + W := {w ∈ WG ; w · + G ⊃ K } .

(5)

By a result of R. Parthasaraty, cf. Lemma 2.2 in [Par71], the spin representation ρK of K decomposes into the irreducible sum ρK,w , (6) ρK = w∈W

74

J.-L. Milhorat

where ρK,w has for dominant weight βw := w · δG − δK .

(7)

βw0 2 = min βw 2 ,

(8)

if ∃w1 = w0 ∈ W such that βw1 2 = min βw 2 , then βw1 ≺ βw0 ,

(9)

Now define w0 ∈ W such that w∈W

and w∈W

where ≺ is the usual ordering on weights. Lemma 1. The weight βwG0 := w0−1 · βw0 = δG − w0−1 · δK , is G-dominant. Proof. Let G = {θ1 , . . . , θr } ⊂ + G be the set of simple roots. It is sufficient to prove <β G ,θi >

that 2 <θwi0,θi > is a non-negative integer for any simple root θi . Since T is a maximal common torus of G and K, βw0 , which is an integral weight for K is also an integral weight for G. Now since the Weyl group WG permutes the weights, βwG0 = w0−1 · βw0 <β G ,θi >

is also a integral weight for G, hence 2 <θwi0,θi > is an integer for any simple root θi . So we only have to prove that this integer is non-negative. G ,θi > Let θi be a simple root. Since 2 <δ <θi ,θi > = 1, (see for instance § 10.2 in [Hum72]) and since the scalar product < ·, · > is WG -invariant, one gets 2

< βwG0 , θi > < θi , θi >

=1−2

< δK , w0 · θi > . < θi , θi >

(10)

Suppose first that w0 · θi ∈ K . If w0 · θi is positive then w0 · θi is necessarily a Ksimple root. Indeed let K = {θ1 , . . . , θl } ⊂ + K be the set of K-simple roots. One has w0 · θi = lj =1 bij θj , where the bij are non-negative integers. But since w0 ∈ W , there are l positive roots α1 , . . . , αl in + G such that w0 · αj = θj , j = 1, . . . , l. So l r θi = j =1 bij αj . Now each αj is a sum of simple roots k=1 aj k θk , where the aj k are non-negative integers. So θi = j,k bij aj k θk . By the linear independence of simple roots, one gets j bij aj k = 0 if k = i, and j bij aj i = 1. Hence there exists a j0 such that bij0 = aj0 i = 1, the other coefficients being zero. So w0 · θi = θj0 is a K-simple <δK ,w0 ·θi > K ,w0 ·θi > root. Now since 2 <δ<θ = 2 <w = 1, one gets 2 0 ·θi ,w0 ·θi > i ,θi >

2

G ,θ > <βw 0 i <θi ,θi >

G ,θ > <βw 0 i <θi ,θi >

= 0, hence

≥ 0. Now, the same conclusion holds if w0 · θi is a negative root of K, since <β G ,θi >

0 ·θi > K ,w0 ·θi > 2 <δ<θ = −2 <δK<θ,−w = −1, hence 2 <θwi0,θi > = 2. i ,θi > i ,θi > Suppose now that w0 · θi ∈ / K , that is w0 · θi is a noncompact root. This implies that w0 σi , where σi is the reflection across the hyperplane θi⊥ , is an element of W . Let α1 , . . . , αm be the positive roots in + G such that w0 · αj = αj , where the αj ,

The First Eigenvalue of the Dirac Operator on Compact Spin Symmetric Spaces

75

j = 1, . . . , m are the positive roots of K. Since σi permutes the positive roots other than θi , (cf. for instance Lemma B, § 10.2 in [Hum72]), and since θi can not be one of the roots α1 , . . . , αm (otherwise w0 · θi ∈ + K ), each root σi · αj is positive. So w0 σi ∈ W since w0 σi · (σi · αj ) = αj , j = 1, . . . , m. We now claim that 2 impossible. Suppose that

G ,θ > <βw 0 i <θi ,θi >

2

< δK , w0 · θi > > 1. < θi , θi >

Since δK can be expressed as δK = a K-simple root

θj

such that <

K ,w0 ·θi > < 0, which is equivalent to 2 <δ<θ > 1, is i ,θi >

l

i=1 ci θi , where the ci

θj , w0

· θi > 0, and since 2

(11) are nonnegative, there exists <θj ,w0 ·θi > <θj ,θj >

is an integer, this

implies that 2

< θj , w0 · θi > < θj , θj >

≥ 1.

(12)

So θj − w0 · θi is a root (cf. for instance § 9.4 in [Hum72]). Moreover, from the bracket relation [K, P] ⊂ P, it is a noncompact root. Now ±(θj − w0 · θi ) is a positive noncompact root, so by the description of the weights of the spin representation ρK , (they are of the form: δn −(a sum of distinct positive noncompact roots), cf. §2 in [Par71]), (w0 · δG − δK ) ± (θj − w0 · θi ) is a weight of ρK . Now, (w0 · δG − δK ) + (θj − w0 · θi ) can not be a weight of ρK . Otherwise since σi · δG = δG − θi , (w0 σi · δG − δK ) + θj is a weight of ρK . But since w0 σi ∈ W , µ := w0 σi · δG − δK is a dominant weight of ρK . So µ is a dominant weight but not the highest weight of an irreducible component of ρK . Hence there exists an irreducible representation of ρK with dominant weight λ = w · δG − δK , w ∈ W , whose set of weights

contains µ. Furthermore µ ≺ λ. Now since µ ∈ , µ + δK 2 ≤ λ + δK 2 , with equality only if µ = λ, (cf. for instance Lemma C, §13.4 in [Hum72]). But µ + δK 2 = δG 2 = λ + δK 2 , so µ = λ, contradicting the fact that µ ≺ λ. Thus only µ0 := (w0 · δG − δK ) − (θj − w0 · θi ) ,

(13)

can be a weight of ρK . Now one has µ0 2 = w0 · δG − δK + w0 · θi 2 −2 < w0 · δG − δK + w0 · θi , θj > +θj 2 . Since w0 · δG − δK is a dominant weight, < w0 · δG − δK , θj >≥ 0, and from (12), 2 < w0 · θi , θj > −θj 2 ≥ 0, so µ0 2 ≤ (w0 · δG − δK ) + w0 · θi 2 .

76

J.-L. Milhorat

Now (w0 · δG − δK ) + w0 · θi 2 = w0 · δG − δK 2 +2 < δG − w0−1 · δK , θi > +θi 2 . But, as we supposed 2 2 <

δG − w0−1

G ,θ > <βw 0 i <θi ,θi >

· δK , θi > +θi

< 0, one has 2

≤ 0, hence

2 <δG −w0−1 ·δK ,θi > θi 2

≤ −1, so

(w0 · δG − δK ) + w0 · θi 2 ≤ w0 · δG − δK 2 , so µ0 2 ≤ w0 · δG − δK 2 . Now, being a weight of ρK , µ0 is conjugate under the Weyl group of K to a dominant weight of ρK , say w1 · δG − δK , with w1 ∈ W . Note that w1 = w0 , otherwise since µ0 ≺ w1 · δG − δK , (cf. Lemma A, § 13.2 in [Hum72]), the noncompact root θj − w0 · θi should be a linear combination with integral coefficients of compact simple roots. But, by the bracket relation [K, K] ⊂ K, that is impossible. Thus, by the definition of w0 , cf. (8), w0 · δG − δK 2 ≤ w1 · δG − δK 2 = µ0 2 , so µ0 2 = w1 · δG − δK 2 = w0 · δG − δK 2 . But by the condition (9), the last equality is impossible, otherwise since µ0 ≺ w1 ·δG −δK and w1 · δG − δK ≺ w0 · δG − δK , the noncompact root θj − w0 · θi should be a linear combination with integral coefficients of compact simple roots. Hence 2 also if w0 · θi ∈ / K .

G ,θ > <βw 0 i <θi ,θi >

≥0

Now let (ρ0 , V0 ) be an irreducible representation of G with dominant weight βwG0 . The fact that βw0 = w0 · βwG0 is a weight of ρ0 is an indication that ρ0 |K may contain the irreducible representation ρK,w0 . This is actually true: Lemma 2. With the notations above, m(ρ0 |K , ρK,w0 ) ≥ 1 . Proof. Let v0 be the maximal vector in V0 , (it is unique up to a nonzero scalar multiple). Let g0 ∈ T be a representative of w0 . Then g0 · v0 is a weight vector for the weight βw0 , since for any X in the Lie algebra T of T : d d = dt (exp(tX) g0 ) · v0 g0 g0−1 exp(tX) g0 · v0 X · (g0 · v0 ) = dt |t=0 |t=0 = g0 · Ad(g0−1 ) · X · v0 = βwG0 (w0−1 · X) (g0 · v0 ) = (w0 · βwG0 )(X) (g0 · v0 )

= βw0 (X) (g0 · v0 ) .

In order to prove the result, we only have to prove that g0 · v0 is a maximal vector (for the action K), hence is killed by root-vectors corresponding to simple roots of K. So let θi be a simple root of K and Ei be a root-vector corresponding to that simple root.

The First Eigenvalue of the Dirac Operator on Compact Spin Symmetric Spaces

77

Since w0 ∈ W , there exists a positive root αi ∈ + G such that w0 · αi = θi . Then −1 Ei := Ad(g0 )(Ei ) is a root-vector corresponding to the root αi since for any X in T = Ad(g0−1 ) · [Ad(g [X, Ei ] = [X, Ad(g0−1 )(Ei )] 0 )(X), Ei ] −1 −1 −1 = Ad(g0 ) · [w0 · X, Ei ] = (w0 · θi )(X) Ad(g0 ) · Ei = αi (X) Ei .

But since v0 is killed by the action of the root-vectors corresponding to positive roots in + G , one gets d g0 g0−1 exp(t Ei )g0 · v0 Ei · (g0 · v0 ) = dt |t=0 d = dt g0 exp t Ad(g0−1 ) · Ei · v0 |t=0 = g0 · E i · v 0 = 0.

Hence the result.

From the result (4), we may then conclude: Lemma 3. 2 βw0 2 + n/8 , is an eigenvalue of the square of the Dirac operator. Proof. By the Freudenthal formula, the Casimir eigenvalue cγ0 of the representation (ρ0 , V0 ) is given by βwG0 + δG 2 − δG 2 = 3 δG 2 + δK 2 − 4 < w0 · δG , δK > . On the other hand βw0 2 = δG 2 + δK 2 − 2 < w0 · δG , δK > . Hence cγ0 = 2 βw0 2 + δG 2 − δK 2 . Now, the Casimir operator of K acts on the spin representation ρK as scalar multiplication by δG 2 − δK 2 , (cf. Lemma 2.2 in [Par71]). Indeed, each dominant weight of ρK being of the form w · δG − δK , w ∈ W , the eigenvalue of the Casimir operator on each irreducible component is given by: (w · δG − δK ) + δK 2 − δK 2 = w · δG 2 − δK 2 = δG 2 − δK 2 . On the other hand, the proof of the formula (3) shows that the Casimir operator of K acts on the spin representation ρK as scalar multiplication by Scal 8 = n/16 (cf. [Sul79]), hence δG 2 − δK 2 = n/16 . So cγ0 + n/16 = 2 βw0 2 + n/8 . Hence the result.

(14)

78

J.-L. Milhorat

In order to conclude, we have to prove that Lemma 4. 2 βw0 2 + n/8 , is the lowest eigenvalue of the square of the Dirac operator. be such that there exists w ∈ W such that m(ργ , ρK,w ) ≥ 1. Let Proof. Let γ ∈ G |K βγ be the dominant weight of ργ . First, since the Weyl group permutes the weights of ργ , w −1 · βw = δG − w −1 · δK is a weight of ργ . Hence βγ + δG 2 ≥ w −1 · βw + δG 2 , (cf. for instance Lemma C, §13.4 in [Hum72]). So, from the Freudenthal formula, cγ = βγ + δG 2 − δG 2 ≥ w −1 · βw + δG 2 − δG 2 . But, using (14) w−1 · βw + δG 2 − δG 2 = 2 βw 2 + δG 2 − δK 2 = 2 βw 2 + n/16 . Hence by the definition of βw0 , cγ ≥ 2 βw 2 + n/16 ≥ 2 βw0 2 + n/16 . Hence the result.

References [B¨ar91]

B¨ar, C.: Das Spektrum von Dirac-Operatoren. Dissertation, Universit¨at Bonn, 1991, Bonner Mathematische Schriften 217. [Bes87] Besse, A.: Einstein Manifolds. Berlin: Springer-Verlag, 1987 [BHMM] Bourguignon, J.P., Hijazi, O., Milhorat, J.-L., Moroianu, A.: A Spinorial Approach to Riemannian and Conformal Geometry. Monograph (in preparation) [CFG89] Cahen, M., Franc, A., Gutt, S.: Spectrum of the Dirac Operator on Complex Projective Space P2q−1 (C). Lett. Math. Phys. 18, 165–176 (1989) [CG88] Cahen, M., Gutt, S.: Spin Structures on Compact Simply Connected Riemannian Symmetric Spaces. Simon Stevin 62, 209–242 (1988) [Hel78] Helgason, S.: Differential Geometry, Lie Groups and Symmetric Spaces, Pure and Applied mathematics, Vol. 80. San Diego: Academic Press, 1978 [Hum72] Humphreys, J.E.: Introduction to Lie Algebras and Representation Theory. Berlin-Heidelberg New York: Springer-Verlag, 1972 [Par71] Parthasarathy, R.: Dirac operator and the discrete series. Ann. Math. 96, 1–30 (1971) [Sul79] Sulanke, S.: Die Berechnung des Spektrums des Quadrates des Dirac-Operators auf der Sph¨are. Doktorarbeit, Humboldt-Universit¨at, Berlin, 1979 Communicated by P. Sarnak

Commun. Math. Phys. 259, 79–102 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1396-4

Communications in

Mathematical Physics

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons Petar Ivanov, Stefan Ivanov University of Sofia “St. Kl. Ohridski”, Faculty of Mathematics and Informatics, Blvd. James Bourchier 5, 1164 Sofia, Bulgaria. E-mail: [email protected] (P. Ivanov); [email protected] (S. Ivanov) Received: 4 May 2004 / Accepted: 25 February 2005 Published online: 8 July 2005 – © Springer-Verlag 2005

Abstract: Necessary and sufficient conditions to the existence of a hermitian connection with totally skew-symmetric torsion and holonomy contained in SU (3) are given. A formula for the Riemannian scalar curvature is obtained. Non-compact solution to the supergravity-type I equations of motion with non-zero flux and non-constant dilaton is found in dimension 6. Non-conformally flat non-compact solutions to the supergravity-type I equations of motion with non-zero flux and non-constant dilaton are found in dimensions 7 and 8. A Riemannian metric with holonomy contained in G2 arises from our considerations and Hitchin’s flow equations, which seems to be new. Compact examples of SU (3), G2 and Spin(7) instanton satisfying the anomaly cancellation conditions are presented. Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2. General Properties of SU (3), G2 and Spin(7)-Structures . . . . . . . 2.1 SU(3)-structures in d = 6 . . . . . . . . . . . . . . . . . . . . . 2.2 G2 -structures in d = 7 . . . . . . . . . . . . . . . . . . . . . . 2.3 Spin(7)-structures in d = 8 . . . . . . . . . . . . . . . . . . . 3. The Supersymmetry Equations in Dimensions 6, 7 and 8 . . . . . . . 4. Non-Compact G2 -Solution Induced from a SU (3)-Instanton . . . . . 5. Non-Compact Spin(7)-Solution Induced from a G2 -Instanton . . . . 6. Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.1 (SU (3), G2 , Spin(7))-instanton and conformally flat non-compact solution . . . . . . . . . . . . . . . . . . . . . . . 6.2 (SU (3), G2 , Spin(7))-instanton and non-conformally flat non-compact solution . . . . . . . . . . . . 7. Almost Contact Metric Structures and Non-Compact SU (3)-Solutions in Dimension 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.1 Non-conformally flat local SU (3)-solutions on S 5 × S 1 . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

80 83 83 84 85 86 88 91 92

. . .

93

. . .

95

. . . . . .

97 99

80

P. Ivanov, S. Ivanov

1. Introduction Supersymmetric backgrounds of string/M theory with non-vanishing fluxes are currently an active area of study for at least two reasons. Firstly they provide a framework of searching for new models with realistic phenomenology and secondly, they appear in generalizations of the AdS/CFT correspondence. The supersymmetric geometries of the common NS-NS sector of type IIA, IIB and heterotic/type I supergravity are analyzed in [42]. The bosonic geometry is of the form R1,9−p × Mp , where the Riemannian metric g, the dilaton function φ and the three form H are non-trivial only on Mp but all R-R fields and fermions are set to zero in type II theories. The type I/heterotic geometries, which is the main object of interest in the present note, allow in addition non-trivial gauge field A with field strength F A . We recall the basic notations [73, 54, 52, 41, 42]. We search for solutions to lowest nontrivial order in α of the equations of motion that follow from the bosonic action 2 1 2 1 10 √ −2φ g g 2 Scal + 4(∇ φ) − H − α T r F A S = 2 d x −ge 2k 12 which also preserves at least one supersymmetry. The three form H satisfies a modified Bianchi identity dH = 2α T r(F A ∧ F A ). The so-called Bianchi identity reads ˜ ˜ dH = 2α T r F A ∧ F A − T r R ∇ ∧ R ∇ ,

(1.1)

(1.2)

˜ ˜ where R ∇ is the curvature of the metric connection ∇˜ with torsion T ∇ = −H related 1 g g to the Levi-Civita connection ∇ by ∇˜ = ∇ − 2 H . The second term on the right-hand side of (1.2) is the leading string correction to the supergravity expression arising from the anomaly cancellation but for the consistency of the theory, a modification to the action should be included [10] (see also [22, 45]). In terms of characteristic classes (1.2) means that dH is proportional to the difference ˜ of the connections A, ∇, ˜ respectively. of the first Pontrjagin 4-forms (p1 (A) − p1 (∇)) A heterotic/type I geometry will preserve supersymmetry if and only if, in 10 dimensions, there exists at least one Majorana-Weyl spinor such that the supersymmetry variations of the fermionic fields vanish, i.e. 1 g δλ = ∇m = ∇m + Hmnp np = 0, 8 1 δ = m ∂m φ + Hmnp mnp = 0, (1.3) 12 A mn δξ = Fmn = 0,

where λ, , ξ are the gravitino, the dilatino and the gaugino, fields, respectively. The equations of motion corresponding to the action S are presented explicitly in [42]. It is known [25, 41] that the equation of motions of type I supergravity are automatically satisfied if one imposes, in addition to the preserving supersymmetry equations (1.3), the modified Bianchi identity (1.1).

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

81

According to no-go (vanishing) theorems (a consequence of the equations of motion [31, 25]; a consequence of the supersymmetry [58] for SU(n)-case and [42] for the general case) there are no compact solutions with non-zero flux and non-constant dilaton satisfying the supersymmetry equations (1.3) and the modified Bianchi identity (1.1) simultaneously. In dimensions 7 and 8 the only known heterotic/type I solutions to the equations of motion preserving at least one supersymmetry, i.e. satisfying (1.3) and (1.1), are those constructed in [27, 37, 54] in dimension 8 and those presented in [52] in dimension 7. All these solutions are conformal to a flat space. In dimension 6, the possibility of the existence of a non-conformally flat solution on the complex Iwasawa manifold was discussed in [73, 21, 42, 63]. In the present note we concentrate our attention to find non-compact solutions to the supergravity equations (1.3) including the modified Bianchi identity (1.1) as well as the anomaly cancellation condition (1.2). In dimensions 7 and 8 we find non-locally-conformally flat non-compact solutions to the gravitino, gaugino and dilatino equations with non-zero flux and non-constant dilaton which obey the Bianchi identities (1.1) and (1.2) and therefore satisfy the equations of motion, due to the result in [41]. In dimension 6, we present a (non-conformally-flat) non-compact solution to the equations of motion showing that it obeys (1.3),(1.1),(1.2). All these non-compact solutions seem to be new. We present a non conformally flat (resp. conformally flat) SU (3), G2 and Spin(7)instantons which satisfy the anomaly cancellation condition (1.2) as well as the modified Bianchi identity (1.1). We obtain compact 6,7 and 8-manifolds which solve the gravitino and gaugino equations and satisfy the compatibility conditions (1.2) and (1.1) but do not solve the dilatino equation which is consistent with the no-go theorems. Geometrically, the vanishing of the gravitino variation is equivalent to the existence of a non-trivial spinor parallel with respect to a metric connection ∇ with totally skew symmetric torsion T = H which is related to the Levi-Civita connection ∇ g by 1 ∇ = ∇ g + H. 2 The presence of a ∇-parallel spinor leads to restriction of the holonomy group H ol(∇) of the torsion connection ∇. Namely, H ol(∇) has to be contained in SU (3), d = 6 [73, 59, 58, 51, 21, 6, 7], the exceptional group G2 , d = 7 [32, 40, 34], the Lie group Spin(7), d = 8 [40, 57]. A detailed analysis of the possible geometries is carried out in [42]. Complex Non-K¨ahler geometries appear in string compactifications and are studied intensively [73, 46, 42, 40, 41, 46, 45, 6, 7]. Some types of non-complex 6-manifold have been also invented recently in the string theory due to the mirror symmetry and T-duality [63, 50, 49, 11, 21, 22]. Another special dimension turns out to be dimension 5. The existence of a ∇-parallel spinor in dimension 5 determines an almost contact metric structure whose properties as well as solutions to gravitino and dilatino equations are investigated in [32, 33]. We use these considerations in our construction in Sect. 6 of a new SU (3)-instanton and non-compact solution to the equations of motion in dimension 6. Almost Hermitian manifolds with totally skew-symmetric Nijenhuis tensor arise as target spaces of a class of (2,0)-supersymmetric two-dimensional sigma models [69]. For the consistency of the theory, the Nijenhuis tensor has to be parallel with respect to the torsion connection with holonomy contained in SU (n). The known models are those on group manifolds. We present a 6-dimensional nil-manifold as an example which is not a group manifold.

82

P. Ivanov, S. Ivanov

Starting with a SU (3)-structure in dimension 6 we analyze the five classes discovered recently by Chiossi and Salamon [23] from the point of the existence of a SU (3)connection having totally skew-symmetric torsion. We obtain necessary and sufficient conditions for the existence of a connection solving the gravitino equation in dimension 6, i.e. the existence of a linear connection preserving the almost hermitian structure with torsion 3-form and holonomy contained in SU (3), in terms of the given SU (3)structure and present a formula for the Riemannian scalar curvature (Theorem 4.1). It turns out that the corresponding almost complex structure may not be integrable. In case that the almost complex structure is integrable, we derive that the SU (3)-structure is holomorphic if and only if the corresponding hermitian structure is balanced, i.e. has co-closed fundamental form (Corollary 4.3). On the other hand, any SU (3)-weak holonomy manifold (Nearly K¨ahler manifold) automatically solves both the gravitino and gaugino equations. It turns out that the Nearly K¨ahler 6-sphere S 6 satisfies in addition the compatibility conditions (1.2), (1.1). We present a six dimensional non-conformally flat nil-manifold Nil 6 = G/ , i.e. a compact quotient of a nilpotent Lie group G with a discrete subgroup , which solves both the gravitino and gaugino equations and satisfies the compatibility conditions (1.2), (1.1) but it is neither complex nor Nearly K¨ahler. Consequently, it does not solve the dilatino equation [73]. However, the SU (3)-structure on G is half-flat and therefore it determines a Riemannian metric with holonomy contained in G2 on G × R ∼ = R7 according to the procedure discovered by Hitchin [55], which seems to be a new one. We propose a simple way to lift a 6-dimensional solution to the gravitino and gaugino equations, i.e. a SU (3)-instanton solving the gravitino equation and satisfying the conditions (1.1) (resp. (1.2)), to a G2 -instanton on the product with the real line which solves the three supersymmetry equations (1.3) as well as the compatibility condition (1.1) (resp. (1.2)). We show that N il 6 ×R, (resp. S 6 ×R) is a non-compact solution to the equations of motion in dimension 7 with non-zero flux and non-constant dilaton which preserves at least one supersymmetry and is not locally conformally flat (resp. locally conformally flat). Consequently, the compact spaces N il 6 × S 1 , S 6 × S 1 admit a G2 instanton structure satisfying all the Eqs. (1.3), (1.2), (1.1) except the dilatino equation. It turns out that any G2 -weak holonomy manifold (Nearly-parallel manifold) automatically solves both the gravitino and gaugino equations. We show that the Nearly parallel 7-sphere satisfies in addition the compatibility conditions (1.2), (1.1). The same lifting procedure is applicable to the Spin(7) case. Namely, any G2 -instanton solving the gravitino equation and satisfying the conditions (1.1) (resp. (1.2)) can be lifted to a Spin(7)-instanton on the product with the real line which solves the three supersymmetry equations (1.3) as well as the compatibility condition (1.1) (resp. (1.2)). We show that N il 6 × R × R, (resp. S 6 × R × R, S 7 × R) is a non-compact solution to the equations of motion in dimension 8 with non-zero flux and non-constant dilaton which preserves at least one supersymmetry and is not locally conformally flat (resp. locally conformally flat). Consequently, the compact spaces Nil 6 ×S 1 ×S 1 , S 6 ×S 1 ×S 1 , S 7 ×S 1 admit a Spin(7)-instanton structure satisfying all Eqs. (1.3), (1.2), (1.1) except the dilatino equation. Starting with a Tanno deformed Einstein Sasaki structure in dimension 5, we lift it to a SU (3)-instanton on the product with the real line which satisfies all the supersymmetry equations (1.3) in dimension 6. In this way we obtain local solutions to the equations of motion in dimension 6. Consider S 5 as a Sasakian space form, i.e. a Tanno deformation of the standard Einstein-Sasaki structure, we show that S 5 ×R is a non-compact solution to the equations of motion in dimension 6 with non-zero flux and non-constant dilaton

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

83

which preserves at least one supersymmetry. Consequently, the compact space S 5 × S 1 admits a SU (3)-instanton structure satisfying all the Eqs. (1.3), (1.2), (1.1). 2. General Properties of SU (3), G2 and Spin(7)-Structures In this section we recall necessary properties of SU (3), G2 and Spin(7) structures. 2.1. SU(3)-structures in d = 6. Let (M 6 , g, J ) be an almost Hermitian 6-manifold with Riemannian metric g and an almost complex structure J , i.e. (g, J ) define an U (3)-structure. The Nijenhuis tensor N , the K¨ahler form F and the Lee form θ 6 are defined by N = [J., J.] − [., .] − J [J., .] − [., J.],

F = g(., J.),

θ 6 (.) = δF (J.), (2.4)

respectively. A SU √(3)-structure is determined by an additional non-degenerate (3,0)-form = + + −1 − , or equivalently by a non-trivial spinor. To be more explicit, we may choose a local orthonormal frame e1 , . . . , e6 , identifying it with the dual basis via the metric. Write ei1 i2 ...ip for the monomial ei1 ∧ei2 ∧· · ·∧eip . A SU (3)-structure is described locally by √ √ √ = −(e1 + −1e2 ) ∧ (e3 + −1e4 ) ∧ (e5 + −1e6 ), F = −e12 − e34 − e56 , (2.5) + = −e135 + e236 + e146 + e245 , − = −e136 − e145 − e235 + e246 . The subgroup of SO(6) fixing the forms F and simultaneously is SU (3). The two forms F and determine the metric completely. The Lie algebra of SU (3) is denoted su(3). The failure of the holonomy group of the Levi-Civita connection to reduce to SU (3) can be measured by the intrinsic torsion τ , which is identified with ∇ g F or ∇ g J and can be decomposed into five classes [23], τ ∈ W1 ⊕ · · · ⊕ W5 . The intrinsic torsion of an U (n)- structure belongs to the first four components described by Gray-Hervella [48]. The five components of a SU (3)-structure are first described by Chiossi-Salamon [23] (for interpretation in physics see [21, 50, 49]) and are determined by dF, d + , d − as well as by dF and N. We describe those of them which we will use later. τ ∈ W1 : The class of Nearly K¨ahler (weak holonomy) manifold defined by dF to be (3,0)+(0,3)-form. τ ∈ W2 : The class of almost K¨ahler manifolds, dF = 0. τ ∈ W3 : The class of balanced hermitian manifold determined by the conditions N = θ 6 = 0, i.e these are complex manifolds with vanishing Lee form. These spaces are investigated in [39, 68, 2]. τ ∈ W4 : The class of locally conformally K¨ahler spaces characterized by dF = θ 6 ∧ F . τ ∈ W1 ⊕ W3 ⊕ W4 : The class called by Gray-Hervella G1 -manifolds determined by the condition that the Nijenhuis tensor is totally skew-symmetric. This is the precise class which we are interested in. The class of a half-flat SU(3)-manifold [23] may be characterized by the conditions d + = 0, θ 6 = 0. The half-flat structures can be lifted to a G2 -holonomy metric on

84

P. Ivanov, S. Ivanov

the product with the real line and vice versa due to the Hitchin theorem [55]. In fact, many new G2 -holonomy metrics are obtained in this way [43, 14]. We recall [23] that the fifth component W5 and the two scalar components of W1 are determined by the expressions ∗d + ∧ + , d + ∧ F = W1+ vol., d − ∧ F = W1− vol., respectively. If all five components are zero then we have a Ricci-flat K¨ahler (Calabi-Yau) 3-fold. 2.2. G2 -structures in d = 7. Endow R7 with its standard orientation and inner product. Let e1 , . . . , e7 be an oriented orthonormal basis. Consider the three-form ω on R7 given by ω = e127 − e236 + e347 + e567 − e146 − e245 + e135 .

(2.6)

The subgroup of GL(7) fixing ω is the exceptional Lie group G2 . It is a compact, connected, simply-connected, simple Lie subgroup of SO(7) of dimension 14 [16]. The Lie algebra is denoted by g2 and it is isomorphic to the two forms satisfying 7 linear equations, namely g2 ∼ = {α ∈ 2 (M)|∗(α ∧ ω) = −α}. The 3-form ω corresponds to a real spinor and therefore, G2 can be identified as the isotropy group of a non-trivial real spinor. The Hodge star operator supplies the 4-form ∗ω given by ∗ω = e3456 + e1457 + e1256 + e1234 + e2357 + e1367 − e2467 .

(2.7)

A 7-dimensional Riemannian manifold is called a G2 -manifold if its structure group reduces to the exceptional Lie group G2 . The existence of a G2 -structure is equivalent to the existence of a global non-degenerate three-form which can be locally written as (2.6). The 3-form ω is called the fundamental form of the G2 -manifold [15]. From the purely topological point of view, a 7-dimensional paracompact manifold is a G2 -manifold if and only if it is an oriented spin manifold [66]. We will say that the pair (M, ω) is a G2 -manifold with G2 -structure (determined by) ω. The fundamental form of a G2 -manifold determines a Riemannian metric implicitly through gij = 16 kl ωikl ωj kl [47]. This is referred to as the metric induced by ω. We write ∇ g for the associated Levi-Civita connection. In [29], Fernandez and Gray divide G2 -manifolds into 16 classes according to how the covariant derivative of the fundamental three-form behaves with respect to its decomposition into G2 irreducible components (see also [23, 40]). If the fundamental form is parallel with respect to the Levi-Civita connection, ∇ g ω = 0, then the Riemannian holonomy group is contained in G2 . In this case the induced metric on the G2 -manifold is Ricci-flat, a fact first observed by Bonan [15]. It was shown by Gray [47] (see also [16, 71]) that a G2 -manifold is parallel precisely when the fundamental form is harmonic, i.e. dω = d ∗ ω = 0. The first examples of complete parallel G2 -manifolds were constructed by Bryant and Salamon [17, 44]. Compact examples of parallel G2 -manifolds were obtained first by Joyce [60–62] and recently by Kovalev [65]. The Lee form θ 7 is defined by [19] 1 θ 7 = − ∗(∗dω ∧ ω). 3

(2.8)

If the Lee form vanishes, θ 7 = 0, then the G2 -structure is said to be balanced. If the Lee form is closed, dθ 7 = 0, then the G2 -structure is locally conformally equivalent to a balanced one [34]. If the G2 -structure satisfies the condition d ∗ω = θ 7 ∧ ω then it is called integrable and an analog of the Dolbeault cohomology is investigated in [30].

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

85

2.3. Spin(7)-structures in d = 8. Now, let us consider R8 endowed with an orientation and its standard inner product. Let {e0 , ..., e7 } be an oriented orthonormal basis. Consider the 4-form on R8 given by = e0127 − e0236 + e0347 + e0567 − e0146 − e0245 + e0135 +e3456 + e1457 + e1256 + e1234 + e2357 + e1367 − e2467 .

(2.9)

The 4-form is self-dual ∗ = and the 8-form ∧ coincides with the volume form of R8 . The subgroup of GL(8, R) which fixes is isomorphic to the double covering Spin(7) of SO(7) [53]. Moreover, Spin(7) is a compact simply-connected Lie group of dimension 21 [16]. The Lie algebra of Spin(7) is denoted by spin(7) and it is isomorphic to the two forms satisfying 7 linear equations, namely spin(7) ∼ = {α ∈

2 (M)| ∗ (α ∧ ) = −α}. The 4-form corresponds to a real spinor φ and therefore, Spin(7) can be identified as the isotropy group of a non-trivial real spinor. A Spin(7)-structure on an 8-manifold M is by definition a reduction of the structure group of the tangent bundle to Spin(7); we shall also say that M is a Spin(7) manifold. This can be described geometrically by saying that there exists a nowhere vanishing global differential 4-form on M which can be locally written as (2.9). The 4-form is called the fundamental form of the Spin(7) manifold M [15]. The fundamental form of a Spin(7)-manifold determines a Riemannian metric implic1 itly through gij = 24 klm iklm j klm [47]. This is referred to as the metric induced by . In general, not every 8-dimensional Riemannian spin manifold M 8 admits a Spin(7)structure. We explain the precise condition [66]. Denote by p1 (M), p2 (M), X(M), X(S± ) the first and the second Pontrjagin classes, the Euler characteristic of M and the Euler characteristic of the positive and the negative spinor bundles, respectively. It is well known [66] that a spin 8-manifold admits a Spin(7) structure if and only if X(S+ ) = 0 or X(S− ) = 0. The latter conditions are equivalent to p12 (M) − 4p2 (M) + 8X(M) = 0, for an appropriate choice of the orientation [66]. Let us recall that a Spin(7) manifold (M, g, ) is said to be parallel (torsion-free [61]) if the holonomy of the metric H ol(g) is a subgroup of Spin(7). This is equivalent to saying that the fundamental form is parallel with respect to the Levi-Civita connection ∇ g of the metric g. Moreover, H ol(g) ⊂ Spin(7) if and only if d = 0 [16] (see also [71]) and any parallel Spin(7) manifold is Ricci flat [15]. The first known explicit example of complete parallel Spin(7) manifold with H ol(g) = Spin(7) was constructed by Bryant and Salamon [17, 44]. The first compact examples of parallel Spin(7) manifolds with H ol(g) = Spin(7) were constructed by Joyce [60, 61]. There are 4-classes of Spin(7) manifolds according to the Fernandez classification [28] obtained as irreducible representations of Spin(7) of the space ∇ g . The Lee form θ 8 is defined by [18] 1 θ 8 = − ∗ (∗d ∧ ) = 1/7 ∗ (δ ∧ ). 7

(2.10)

The 4 classes of Fernandez classification can be described in terms of the Lee form as follows [18]: W0 : d = 0; W1 : θ 8 = 0; W2 : d = θ 8 ∧ ; W : W = W1 ⊕ W2 . A Spin(7)-structure of the class W1 (i.e. Spin(7)-structure with zero Lee form) is called a balanced Spin(7)-structure. If the Lee form is closed, dθ 8 = 0 then the Spin(7)-structure is locally conformally equivalent to a balanced one [57]. It is shown in [18] that the Lee form of a Spin(7) structure in the class W2 is closed and therefore

86

P. Ivanov, S. Ivanov

such a manifold is locally conformally equivalent to a parallel Spin(7) manifold. The compact spaces with closed but not exact Lee form (i.e. the structure is not globally conformally parallel) have very different topology than the parallel ones [57]. Coeffective cohomology and coeffective numbers of Riemannian manifolds with Spin(7)-structure are studied in [74]. 3. The Supersymmetry Equations in Dimensions 6, 7 and 8 Dimension d=6 . Necessary conditions to have a solution to the system of dilatino and gravitino equations in dimension 6 were derived by Strominger in [73] and then studied by many authors [40–42, 21, 22, 6–8, 45] Necessary conditions to solve the gravitino equation are given in [32]. The presence of a parallel spinor in dimension 6 leads firstly to the reduction to U (3), i.e. the existence of an almost hermitian structure, secondly to the existence of a linear connection preserving the almost hermitian structure with torsion 3-form, and thirdly to the reduction of the holonomy group of the torsion connection to SU(3). It is shown in [32] that there exists a unique linear connection preserving an almost hermitian structure having totally skew-symmetric torsion if and only if the Nijenhuis tensor is a 3-form, i.e. the intrinsic torsion τ ∈ W1 ⊕ W3 ⊕ W4 . The torsion connection ∇ is determined by 1 ∇ = ∇g + T , 2 1 T = J dF + N = −dF (J., J., J.) + N = −dF + (J., J., J.) + N, 4

(3.11)

where dF + denotes the (1,2)+(2,1)-part of dF . The (3,0)+(0,3)-part dF − is determined completely by the Nijenhuis tensor [38]. If N is a three form then (see e.g. [32]) 3 dF − = − J N. 4

(3.12)

In addition, the dilatino equation forces the almost complex structure to be integrable and the Lee form to be closed (for applications in physics the Lee form has to be exact) determined by the dilaton due to θ 6 = 2dφ [73]. When the almost complex structure is integrable, N = 0, the torsion connection is also known as the Bismut connection and was used by Bismut to prove the local index theorem for the Dolbeault operator on the Hermitian non-K¨ahler manifold [12]. This formula was recently applied in string theory [7]. Vanishing theorems for the Dolbeault cohomology on the compact Hermitian non-K¨ahler manifold were found in terms of the Bismut connection [4, 58, 59]. Dimension d=7 . The precise conditions to have a solution to the gravitino equation in dimension 7 are found in [32]. Namely, there exists a non-trivial parallel spinor with respect to a G2 -connection with torsion 3-form T if and only if there exists a G2 -structure (ω, g) satisfying the equations d ∗ω = θ 7 ∧ ∗ω.

(3.13)

In this case the torsion connection ∇ is unique, the torsion 3-form T is given by 1 ∇ = ∇g + T , 2

H =T =

1 (dω, ∗ω)ω − ∗dω + ∗(θ 7 ∧ ω), 6

(3.14)

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

87

and the Riemannian scalar curvature has the following expression [34] sg =

1 1 (dω, ∗ω) + 2||θ 7 ||2 − ||T ||2 + 3δθ 7 . 18 12

(3.15)

The necessary conditions to have a solution to the system of dilatino and gravitino equations were derived in [40, 32, 34] and sufficiency was proved in [32, 34]. The general existence result [32, 34] states that there exists a (local) non-trivial solution to both dilatino and gravitino equations in dimension 7 if and only if there exists a G2 -structure (ω, g) satisfying the equations d ∗ω = θ 7 ∧ ∗ω,

dω ∧ ω = 0,

θ 7 = 2dφ.

(3.16)

The torsion 3-form (the flux H ) is given by 1 ∇ = ∇g + T , 2

H = T = −∗dω + 2∗(dφ ∧ ω).

The Riemannian scalar curvature satisfies s g = 8||dφ||2 −

1 2 12 ||T ||

(3.17)

+ 6δdφ.

Dimension d=8. It is shown in [57] that the gravitino equation always has a solution in dimension 8. Namely, any Spin(7)-structure admits a unique Spin(7)-connection with totally skew-symmetric torsion T = ∗d − ∗(θ 8 ∧ ). The necessary conditions to have a solution to the system of dilatino and gravitino equations were derived in [40, 57] and sufficiency was proved in [57]. The general existence result [57] states that there exists a (local) non-trivial solution to both dilatino and gravitino equations in dimension 8 if and only if there there exists a Spin(7)-structure (, g) with closed Lee form, dθ 8 = 0, which is equivalent to the statement that the Spin(7)-structure is locally conformally balanced. The torsion 3-form (the flux H ) and the Lee form are given by 1 ∇ = ∇g + T , 2

H = T = ∗d − 2∗(dφ ∧ ),

θ8 =

12 dφ. 7

(3.18)

1 ||T ||2 + 6δdφ. The Riemannian scalar curvature satisfies s g = 8||dφ||2 − 12 In addition to these equations, the vanishing of the gaugino variation requires the 2-form F A to be of instanton type: ([24, 73, 54, 70, 26, 42])

Case d=6 A Donaldson-Uhlenbeck-Yau SU (3)-instanton, i.e. the gauge field A is a SU (3)-connection with curvature 2-form F A ∈ su(3). The SU(3)-instanton condition can be written in local holomorphic coordinates in the form [24, 73] ¯

A = Fα¯Aβ¯ = 0, FαAβ¯ F α β = 0. Fαβ

(3.19)

Case d=7 A G2 -instanton, i.e. the gauge field A is a G2 -connection and its curvature 2-form F A ∈ g2 . The latter can be expressed in any of the following two equivalent ways A mn Fmn ω p=0

A ⇔ Fmn =

1 A F (∗ω)pq mn . 2 pq

(3.20)

88

P. Ivanov, S. Ivanov

Case d=8 A Spin(7)-instanton, i.e. the gauge field A is a Spin(7)-connection and its curvature 2-form F A ∈ spin(7). The latter is equivalent to A = Fmn

1 A pq F mn . 2 pq

(3.21)

4. Non-Compact G2 -Solution Induced from a SU (3)-Instanton In this section we show how to construct a local solution to the equations of motion in dimension 7 if we have a solution to the gravitino and gaugino equations satisfying the modified Bianchi identity (1.1) in dimension 6. We first investigate a necessary and sufficient condition to have a solution to the gravitino equation in dimension 6, i.e. to have a ∇-parallel spinors. We prove the following Theorem 4.1. Let (M 6 , g, J, ) be a 6-dimensional smooth manifold with a SU (3)structure (g, J, ) or equivalently, the almost hermitian manifold (M 6 , g, J ) has topologically trivial canonical bundle trivialized by a (3,0)-form . The next two conditions are equivalent a) There exists a unique SU (3)-connection with torsion 3-form, i.e. a linear connection with torsion 3-form which preserves the almost hermitian structure whose holonomy is contained in SU(3). b) The Nijenhuis tensor N is totally-skew symmetric and the following conditions hold 1 d + = θ 6 ∧ + − (N, + )∗F, 4

1 d − = θ 6 ∧ − − (N, − )∗F. (4.22) 4

The torsion is given by 1 1 T = −∗dF + ∗(θ 6 ∧ F ) + (N, + ) + + (N, − ) − . 4 4

(4.23)

The Riemannian scalar curvature is expressed in the following way sg =

1 1 1 (N, + )2 + (N, − )2 + 2||θ 6 ||2 − ||T ||2 + 3δθ 6 . 8 8 12

(4.24)

In particular, if the structure is complex and balanced then the Riemannian scalar curvature is non-positive. Proof. Suppose condition a) holds. Then the Nijenhuis tensor N is a three form due to Theorem 10.1 in [32]. The conditions ∇ = ∇ + = ∇ − = 0 imply the constraints (4.22) on the exterior derivative of the form . This can be checked directly using (3.11) and (3.12). To prove the converse we consider the product M 7 = M 6 × R with the G2 -structure ω defined by [55, 23] ω = −F ∧ e7 − + ,

(4.25)

where e7 is the standard 1-form on R. We adopt the convention to indicate the object on the product by a superscript 7, 7 i.e. ∗7 , T 7 , θ 7 , ∇ 7 , R ∇ are the Hodge star operator, the torsion 3-form, the Lee form,

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

89

the torsion connection and its curvature, respectively. The same objects on M 6 have superscript 6. Our idea is to check that the G2 -structure on the product M 7 = M 6 × R satisfies the conditions (3.13) and to apply the Friedrich-Ivanov result from [32] assuring the existence of G2 -connection ∇ 7 with torsion 3-form T 7 . We show that the torsion satisfies the condition T (e7 , ., .) = 0 and therefore ∇ 7 e7 = 0. Hence, the connection ∇ 7 descends on M 6 to a connection ∇ 6 which preserves the SU (3)-structure and has totally skew-symmetric torsion. We get from (4.25) applying (3.12) and (4.22) the following sequence of equalities 1 θ 7 = − ∗7 (∗7 dω ∧ ω) 3 1 6 6 = −∗ (∗ dF ∧ F ) + ∗6 (∗6 dF ∧ + )e7 − ∗6 ∗6 d + ∧ + 3 1 = θ 6 + (N, − )e7 , (4.26) 4 where we used the identities ∗6 (∗6 d + ∧ + ) = ∗6 (∗6 (θ ∧ + ) ∧ + ) = −2θ, ∗7 dω = ∗6 dF − ∗6 d + ∧ e7 , θ = −∗6 dF ∧ dF, 3 ∗6 (∗6 dF ∧ + ) = −( + , dF ) = − ( − , N ). 4

(4.27)

Applying the equalities d ∗6 F = −∗6 J θ 6 = θ 6 ∧ ∗6 F and the conditions (4.22) we obtain (3.13). Hence, there exists a G2 -connection ∇ 7 with torsion 3-form T 7 given by (3.14) on the product M 7 = M 6 × R. To compute T 7 we use (3.12) and (4.26). We have 3 (dω, ∗7 ω) = ∗7 (dω ∧ ω) = 2∗7 (dF ∧ + ) = − (N, + ), 2 (4.28) 1 6 6 6 6 + 7 7 ∗ (θ ∧ ω) = ∗ (θ ∧ F ) − ∗ (θ ∧ )e7 + (N, − ) − . 4 Now, the formula (3.14) and (4.22) give the desired expression (4.23) which implies that the torsion T 7 does not depend on e7 and therefore the connection ∇ 7 descends to M 6 . Substituting (4.28) and (4.26) into (3.15) we get (4.24) for the Riemannian scalar curvature on the product which clearly coincides with the scalar curvature on (M 6 , g 6 ). Corollary 4.2. In dimension 6 the following conditions are equivalent: a) There exists a non-trivial solution to the system of gravitino and dilatino equations with non-zero flux H and non-constant dilaton φ. b) There exists a SU (3)-structure (F, ) satisfying the conditions d + = 2dφ ∧ + ,

d − = 2dφ ∧ − .

The flux H is given by H = T = −∗dF + 2∗(dφ ∧ F ).

(4.29)

The Riemannian scalar curvature of the solution has the expression s g = 8||dφ||2 −

1 ||T ||2 + 6δdφ. 12

(4.30)

90

P. Ivanov, S. Ivanov

A part of the necessary conditions we presented are known [73, 42, 40, 41, 21]. The formula (4.29) was discovered in [40], the first formula in (4.27) has already appeared in [21]. Corollary 4.3. A closed SU (3)-structure (d + = d − =0) admits a SU (3)-connection with torsion 3-form if and only if the corresponding almost Hermitian structure is a balanced Hermitian structure, (N = θ 6 = 0). In particular, a holomorphic SU (3)-structure on a complex manifold supports a linear connection with torsion 3-form and holonomy contained in SU (3) if and only if it is balanced. In the latter case the Riemannian scalar curvature is non-positive. Consequently, such a structure is half-flat and therefore it determines a Riemannian metric with holonomy contained in G2 on the product with the real line . Remark 4.4. We note the coincidence of the formulas for the Riemannian scalar curvature of solutions to the gravitino and dilatino equations in dimensions 6, 7, and 8. Actually, it is proved in [59] (see also [45]) that the SU(n)-geometry arising from any solution to the gravitino and dilatino equations satisfies the identity Ricg (X, Y ) −

1 H (X, ei , ej )H (Y, ei , ej ) + 2∇ g X ∇ g Y φ 4 i,j

1 + dH (X, J Y, ei , J ei ) = 0, 4

(4.31)

i

which is consistent with the first equation of motion. The trace in (4.31) gives (4.30) due to the identity 1 1 dH (X, J Y, ei , J ei ) = 8||dφ||2 + 4δdφ − ||H ||2 4 3 i

shown in [4] (see also (3.24) in [59]). Remark 4.5. We note that the Riemannian scalar curvature for a half-flat SU (3)-structure is computed in [50]. On the other hand, not every half-flat structure admits an SU (3)connection with torsion 3-form since it may have a nonzero W2 component. For example, the SU (3)-structure on the nilpotent Lie algebras described in [23], Example 2 and 3, are half-flat but do not admit an SU (3) connection with torsion 3-form since d − = f F ∧F . Another consequence of Theorem 4.1 is the following Theorem 4.6. Let (M 6 , g, J, F, ) be a smooth almost complex 6-manifold with totally skew-symmetric Nijenhuis tensor equipped with a SU (3)-structure solving the gravitino equation, i.e. the conditions (4.22) hold. Assume also the conditions dθ 6 = 0,

(N, + ) = 0,

(N, − ) = const. = 0.

(4.32)

i) Then the G2 -structure ω = −F ∧ e7 − + defined on the product M 7 = M 6 × R solves both the gravitino and dilatino equations with non-constant dilaton. ii) If in addition the torsion connection ∇ 6 is a SU (3)-instanton then the corresponding torsion connection ∇ 7 is a G2 -instanton.

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

91

iii) Suppose moreover that the torsion connection ∇ 6 satisfies the modified Bianchi 6 identity (1.1), (resp. (1.2)) with F A = R ∇ . Then ∇ 7 also obeys (1.1), (resp. (1.2)) 7 with F A = R ∇ and therefore solves the equations of motion with non zero flux and non-constant dilaton provided θ 6 is exact. Proof. Equations (4.26) and (4.28) imply (3.16) due to the conditions of the theorem. 7 6 ˜7 ˜6 We know from Theorem 4.1 that T 7 = T 6 , R ∇ = R ∇ , R ∇ = R ∇ and conse7 7 6 6 7 7 ˜ ˜ ˜6 ˜6 quently, T r(R ∇ ∧ R ∇ ) = T r(R ∇ ∧ R ∇ ), T r(R ∇ ∧ R ∇ ) = T r(R ∇ ∧ R ∇ ). A 7 glance at the structure of Lie algebras su(3) and g2 implies that R ∇ satisfies the G2 6 instanton equations (3.20) provided R ∇ obeys the SU (3)-instanton equations (3.19) 5. Non-Compact Spin(7)-Solution Induced from a G2 -Instanton In this section we shall show that a G2 -instanton on (N 7 , ω) induces a Spin(7)-instanton on the product N 8 = N 7 × R. We denote the Hodge star operator on N 7 by ∗7 . On the product N 8 = N 7 × R there exists a Spin(7)-structure defined by = e0 ∧ ω + ∗7 ω,

(5.33)

where e0 = dt is the standard 1-form on R. We indicate the object on the product by a 8 superscript 8, i.e. ∗8 , T 8 , θ 8 , ∇ 8 , R ∇ are the Hodge star operator, the torsion 3-form, the Lee form, the torsion connection and its curvature, respectively. The same objects on N 7 have superscript 7. Theorem 5.1. Suppose (N 7 , ω7 , g 7 , ∇ 7 , T 7 ) is a smooth G2 -manifold which solves the gravitino equation, i.e. (3.13) holds. Then the Spin(7)-structure on the product N 7 ×R determined with (5.33) has the properties 6 7 1 θ + (dω, ∗7 ω)e0 , T 8 = T 7 , 7 7 8 8 7 7 T r(R ∇ ∧ R ∇ ) = T r(R ∇ ∧ R ∇ ).

θ8 =

R∇ = R∇ , 8

7

Assume in addition the conditions dθ 7 = 0,

(dω, ∗7 ω) = const. = 0.

i) Then the Spin(7)-structure defined on the product N 8 = N 7 × R solves both the gravitino and dilatino equations with non-zero flux and non-constant dilaton. ii) If in addition the torsion connection ∇ 7 is a G2 -instanton then the corresponding torsion connection ∇ 8 is a Spin(7)-instanton. iii) Suppose moreover that the torsion connection ∇ 7 satisfies the modified Bianchi 7 identity (1.1), (resp. (1.2)) with F A = R ∇ . Then ∇ 8 also obeys (1.1), (resp. (1.2)) 8 with F A = R ∇ and therefore solves the equations of motion with non-zero flux and non-constant dilaton provided θ 7 is exact.

92

P. Ivanov, S. Ivanov

Proof. Take the exterior derivative in (5.33) and use (3.16) to get the identity d = −e0 ∧ dω + θ 7 ∧ ∗7 ω. The latter yields (5.34) ∗8 d = −∗7 dω − e0 ∧ ∗7 (θ 7 ∧ ∗7 ω), 8 7 7 7 7 7 7 ∗ (θ ∧ ) = −∗ (θ ∧ ω) − e0 ∧ ∗ (θ ∧ ∗ ω), (5.35) 1 θ 8 = − ∗8 ∗8 d ∧ = (5.36) 7 1 − ∗8 e0 ∧ ∗7 dω ∧ ω − e0 ∧ ∗7 θ 7 ∧ ∗7 ω ∧ ∗7 ω 7

−∗8 (∗7 dω ∧ ∗7 ω) = 1 − ∗7 ∗7 dω ∧ ω − ∗7 ∗7 θ 7 ∧ ∗7 ω ∧ ∗7 ω 7 6 1 −(dω, ∗7 ω)e0 = θ 7 + (dω, ∗7 ω)e0 , 7 7

where we used the conditions (3.16), (2.8) and the general identity ∗7 (∗7 γ ∧ ∗7 ω ∧ ∗7 ω) = 3γ valid for any 1-form γ on (M 7 , ω). Substitute (5.34), (5.35) and (5.36) into the formula (3.18) and compare the result with (3.17) to get T 8 = T 7 . The vector field e0 is parallel with respect to the Levi-Civita connection of g 8 and satis8 7 ˜8 fies T 8 (e0 , ., .) = T 7 (e0 , ., .) = 0. Therefore ∇ 8 e0 = 0 yielding R ∇ = R ∇ , R ∇ = ˜7 R ∇ . Consequently, 7 0

∇ Rmnkl mn ij = 8

7

∇ Rmnkl (∗7 ω)mn ij = 2Rij∇ kl = 2Rij∇ kl , 7

7

8

1

since R ∇ is a G2 -instanton. Clearly, the modified Bianchi identity (1.1), (resp. (1.2)) is 8 satisfied for F A = R ∇ . 7

The reverse procedure to find types of SU (3)-structures on a 6-manifold induced by different types of G2 -structures on 7-manifold is discussed recently in [8, 11, 50, 49]. 6. Examples Theorem 4.6 and Theorem 5.1 allow us to produce a number of examples of G2 and Spin(7)-instantons and solutions to the equations of motion with gauge connection A = ∇ starting from certain types of almost complex 6-manifolds or certain types of G2 manifolds. We recall the well known curvature identity 1 ˜ R ∇ (X, Y, Z, V ) = R ∇ (Z, V , X, Y ) + dT (X, Y, Z, V ). 2

(6.37)

It helps us to handle the Bianchi identity (1.2) with F A = R ∇ provided the next equality holds R ∇ (X, Y, Z, V ) = R ∇ (Z, V , X, Y ).

(6.38)

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

93

Combine (6.37) and (6.38) to get 1 ˜ R ∇ = R ∇ + dT . 2 In view of (6.39), the Bianchi identity (1.2) with A = ∇ takes the form 1 dT = α T r(R ∇ ∧ dT ) − T r(dT ∧ dT ) . 2

(6.39)

Clearly, the condition (6.38) is a sufficient SU (3) (resp. G2 , Spin(7)) -connection to satisfy the SU (3) (resp. G2 , Spin(7)) -instanton condition (3.19) (resp. (3.20),(3.21)). The symmetry (6.38) of the curvature of a metric connection ∇ with torsion 3-form T holds exactly when ∇T is a 4-form which is equivalent to the condition ∇ g T = 41 dT [56]. In particular, if the torsion is ∇-parallel, ∇T = 0, then we have the additional relations 1 1 1 g Rij∇ kl = Rij∇ kl − Tij m Tkl m − Tj km Til m − Tkim Tj l m ; (6.40) 2 4 4

m m m dTij kl = 2 Tij m Tkl + Tj km Til + Tkim Tj l . 6.1. (SU (3), G2 , Spin(7))-instanton and conformally flat non-compact solution. Any Nearly-K¨ahler 6-manifold is an SU (3)-instanton since the torsion T = 41 N = J ∇ g J is ∇- parallel [64], (see also [9, 32]) and therefore the curvature R ∇ satisfies (6.38). Take + = dF ; we obtain a SU (3)-instanton solving the gravitino and gaugino equations according to Theorem 4.6 which, however, does not solve the dilatino equation since the almost complex structure is not integrable [73]. There are known only four compact Nearly K¨ahler 6-manifolds, namely S 6 , S 3 × S 3 , CP 3 and the flag F = U (3)/(U (1) × U (1) × U (1)) [55, 72]. We consider the six-sphere (S 6 , J, g) endowed with the standard nearly K¨ahler structure (g, J ) inherited from the imaginary octonions in R7 [47]. We claim that (S 6 , g, J, ∇, A = ∇) satisfies both the modified Bianchi identity (1.1) and the anomaly cancellation condition (1.2). It is well known that any 6-dimensional Nearly K¨ahler manifold is Einstein and of constant type. Consequently, the following identities hold [32]: 1 Tij m Tkl m = a 2 (gik gj l − gj k gil − Fik Fj l + Fj k Fil ), dT = a 2 F ∧ F = −2a 2 ∗F, 2 where a 2 is a non-zero constant which can be identified with the Riemannian scalar curvature s g , 15a 2 = s g . Applying the fact that (S 6 , g) is a space of constant sectional g curvature, i.e. Rij∇ kl = 21 a 2 (gj k gil − gik gj l ) and (6.39), we calculate the Pontrjagin forms 3a 2 ∇ ab 16π 2 p1 (∇) = T r(R ∇ ∧ R ∇ ) = Rij∇ ab Rkl dT ; dx i ∧ dx j ∧ dx k ∧ dx l = − 4 2 ˜ = T r(R ∇˜ ∧ R ∇˜ ) = 9a dT . 16π 2 p1 (∇) 4 Remark 6.1. Observe that if we rescale the metric homothetically by a constant c, g¯ = e2c g, then the new torsion T¯ = e2c T in the case of SU (3)-structure and T¯ = e4c T in the case of G2 or Spin(7)-structure while the Pontrjagin 4-forms remain unchanged, ¯

¯

T r(R ∇ ∧ R ∇ ) = T r(R ∇ ∧ R ∇ ),

¯˜

¯˜

˜

˜

T r(R ∇ ∧ R ∇ ) = T r(R ∇ ∧ R ∇ ) (see [22] for more

94

P. Ivanov, S. Ivanov

precise discussion of this phenomena). Hence, if dT is proportional to the difference of the Pontrjagin 4-forms with a constant then we can always rescale the structure by a suitable constant in order to get the formulas (1.1) and (1.2). Keeping Remark 6.1 in mind we obtain Theorem 6.2. The Nearly K¨ahler 6-sphere solves the gravitino equation, the gaugino equation with F A = R ∇ and satisfies the modified Bianchi identity (1.1) and (1.2) with negative α . Consequently: a) The product (S 6 × R, ω, A = ∇ 7 ) with the G2 -structure described in Sect. 4 solves all the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 7. The product (S 6 × S 1 , ω, A = ∇ 7 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 7. b) The product (S 6 ×R×R, , A = ∇ 8 ) with the Spin(7)-structure described in Sect. 5 solves the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 8. The product (S 6 × S 1 × S 1 , , A = ∇ 8 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 8. Similarly to the SU(3)-case, any G2 -weak holonomy manifold (nearly-parallel G2 manifold) is a G2 -instanton. Indeed, it is well known that any 7-dimensional nearly-parallel G2 manifold is Einstein and the following identities hold dω = −λ∗ω,

(dω, ∗ω) = −λ,

(6.41)

8 g where λ2 = 21 s is a non-zero constant. The torsion T = − 16 λω is ∇-parallel [32]. Hence, any nearly-parallel G2 -manifold is a G2 -instanton which solves the gravitino equation and the gaugino equation with A = ∇ 7 but does not solve the dilatino equation according to the result in [34]. There are many known examples of compact nearly parallel G2 -manifolds: S 7 , SO(5)/SO(3) [17, 71], the Aloff-Wallach spaces N (g, l) = SU (3)/U (1)gl [20], any Einstein-Sasakian and any 3-Sasakian 7-manifold [35, 36]. We consider the seven sphere (S 7 , ω, g) endowed with the standard nearly-parallel G2 -structure induced by the octonions in R8 , namely, consider the seven sphere as a totally umbilical hypersurface in R8 [29]. Clearly (S 7 , ω, g, ∇, A = ∇) is a G2 -instanton. We claim that it satisfies the modified Bianchi identity (1.1) and the anomaly cancellation condition (1.2). Indeed, we easily calculate from (6.40) applying (6.41), (6.39), the fact that (S 7 , g) is a space of constant sectional curvature and some G2 -algebra, that ∇ ab 16π 2 p1 (∇) = T r(R ∇ ∧ R ∇ ) = Rij∇ ab Rkl dx i ∧ dx j ∧ dx k ∧ dx l =

−

λ4 λ2 ∗ω = − dT ; 8.27 36

˜ = T r(R ∇˜ ∧ R ∇˜ ) = 1 λ4 ∗ω = 1 λ2 dT . 16π 2 p1 (∇) 54 9 We obtain using Remark 6.1 the following

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

95

Theorem 6.3. The nearly-parallel 7-sphere solves the gravitino equation, the gaugino equation with F A = R ∇ and satisfies the modified Bianchi identity (1.1) and (1.2) with negative α . Consequently, the product (S 7 ×R, , A = ∇ 8 ) with the Spin(7)-structure described in Sect. 5 solves the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 8. The product (S 7 × S 1 , , A = ∇ 8 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 8. We note that these sphere-solutions are (locally) conformally flat. 6.2. (SU (3), G2 , Spin(7))-instanton and non-conformally flat non-compact solution. In this section we present a non-locally-conformally flat solution starting with a nilpotent 6-dimensional Lie group. Let G be the six-dimensional connected simply connected and nilpotent Lie group, determined by the left invariant 1-forms {e1 , . . . , e6 } such that de2 = de3 = de6 = 0, de1 = e3 ∧ e6 , de4 = e2 ∧ e6 ,

(6.42) de5 = e2 ∧ e3 .

In terms of the standard coordinates x1 , . . . , x6 on R6 the left invariant forms {e1 , . . . , e6 } are described by the expressions e2 = dx2 , e3 = dx3 , e6 = dx6 , e1 = dx1 − x6 dx3 , e4 = dx4 − x6 dx2 , e5 = dx5 + x2 dx3 . 6 Consider the metric on G ∼ = R6 defined by g = i=1 ei2 , or equivalently

(6.43)

ds 2 = dx12 + (1 + x62 )dx22 + (1 + x22 + x62 )dx32 + dx42 + dx52 + dx62 −x6 (dx1 dx3 + dx2 dx4 ) + x2 dx3 dx5 . (6.44) Let (F, ) be the SU (3)-structure on G given by (2.5). Then(G, F, ) is an almost complex manifold with a SU (3)-structure. We show below that this space is a new non-conformally flat SU (3)-instanton solving both the gravitino and gaugino equations satisfying the modified Bianchi identity (1.1) as well as the anomaly cancellation condition (1.2) but not solving the dilatino equation. We compute the Riemannian curvature R g . The general Koszul formula 2g(∇ g X Y, Z) = Xg(Y, Z) + Y g(X, Z) − Zg(X, Y ) +g([XY ], Z) − g([Y, Z], X) − g([X, Z], Y )

(6.45)

gives the following essential non-zero terms 2∇ g e6 e3 = e1 , 2∇ g e2 e3 = −e5 , 2∇ g e3 e6 = −e1 , 2∇ g e3 e2 = e5 , 2∇ g e1 e6 = −e3 , 2∇ g e5 e2 = e3 ,

2∇ g e6 e2 = e4 , 2∇ g e2 e6 = −e4 , 2∇ g e4 e6 = −e2 .

(6.46)

Then we obtain R g (e5 , e6 , e2 , e1 ) = − 41 = 0. Hence, the metric is not locally conformally flat since the Weyl tensor W g (e5 , e6 , e2 , e1 ) = R g (e5 , e6 , e2 , e1 ) = − 41 = 0.

96

P. Ivanov, S. Ivanov

It is easy to verify using (2.4) and (6.42) that dF = −3e236 , N = − − , θ 6 = d + = (N, + ) = 0.

d − = ∗F,

(N, − ) = −4, (6.47)

Hence, (G, , g, J ) is neither complex nor Nearly K¨ahler manifold but it fulfills the conditions (4.22) of Theorem 4.1 and therefore there exists a SU (3)-holonomy connection with torsion 3-form on (G, , g, J ). The expression (4.23) and (6.47) give T = −2e145 + e136 + e235 − e246 ,

dT = −2(e1256 + e3456 + e1234 ) = 2∗F. (6.48)

Plug (6.46) and (6.48) into (3.11) to get that the nonzero essential terms of the torsion connection are ∇e1 e6 = −e3 ,

∇e5 e2 = e3 ,

∇e4 e5 = −e1 ,

∇e5 e1 = −e4 ,

∇e4 e6 = −e2 , ∇e1 e4 = −e5 .

(6.49)

It follows from (6.49) and (6.48) that the torsion tensor T as well as the Nijenhuis tensor N are parallel with respect to the connection ∇. Hence, ∇ defines an SU (3)-instanton. To verify the Bianchi identities for H we calculate the curvature R ∇ by means of (6.49). We obtain the following non-zero terms R ∇ (e6 , e2 , e6 , e2 ) = R ∇ (e6 , e3 , e6 , e3 ) = R ∇ (e2 , e3 , e2 , e3 ) = 1, R ∇ (e4 , e5 , e4 , e5 ) = R ∇ (e4 , e1 , e4 , e1 ) = R ∇ (e5 , e1 , e5 , e1 ) = 1, R ∇ (e2 , e6 , e5 , e1 ) = R ∇ (e3 , e6 , e4 , e5 ) = R ∇ (e2 , e3 , e1 , e4 ) = −1.

(6.50)

Applying (6.50), (6.48) and (6.39) it is straightforward to compute the first Pontrjagin ˜ Compare the result with the second equality in (6.48) to get 4-forms p1 (∇) and p1 (∇). dT =

1 ˜ ˜ T r(R ∇ ∧ R ∇ ) = −T r(R ∇ ∧ R ∇ ). 2

(6.51)

The coefficient of the structure equations of the Lie algebra given by (6.42) are integers. Therefore, the well-known theorem of Malcev [67] states that the group G has a uniform discrete subgroup such that N il 6 = G/ is a compact 6-dimensional nilmanifold. The SU (3)-structure, described above, descends to N il 6 and therefore we obtain a compact SU (3)-instanton. With the help of Remark 6.1 and (6.51) we derive from Theorem 4.6 and Theorem 5.1 the following Theorem 6.4. The non conformally flat almost hermitian 6-manifold (G, g, F, , A = ∇) solves the gravitino and gaugino equations and satisfies the Bianchi identity (1.1), (1.2) with positive α . Consequently, a) The product (G × R, ω, A = ∇ 7 ) with the G2 -structure described in Sect. 4 solves all the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 7. The product (N il 6 × S 1 , ω, A = ∇ 7 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1) and (1.2) in dimension 7;

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

97

b) The product (N il 6 × R × R, , A = ∇ 8 ) with the Spin(7)-structure described in Sect. 5 solves the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 8. The product (N il 6 × S 1 × S 1 , , A = ∇ 8 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 8. The G2 -analog of the Dolbeault cohomology on G2 -manifold was studied on N il 6 × S 1 in [30]. Remark 6.5. The space (G, g, J ) is an example of an almost complex 6-manifold with totally skew-symmetric Nijenhuis tensor N and zero Lee form θ 6 which is neither complex nor Nearly K¨ahler but it is half-flat and therefore it determines a Riemannian metric with holonomy contained in G2 on G × R ∼ = R7 [55] which seems to be new. The explicit expression of this metric can be found solving the Hitchin flow equations dF =

∂( + ) , ∂t

d − = −F ∧

∂(F ) , ∂t

where the SU (3)-structure depends on a real parameter t ∈ R [55]. G2 -holonomy metrics arising from types of Hermitian 6-manifolds are studied recently in [5]. Remark 6.6. The torsion tensor T as well as the Nijenhuis tensor N of (N il 6 , g, J ) are parallel with respect to the torsion connection ∇ but ∇R ∇ = 0 since the space is not naturally reductive due to the inequality −1 = g([e3 , e6 ], e1 ) = −g([e3 , e1 ], e6 ) = 0. Hence, (N il 6 , g, J ) is an example of a compact non-naturally reductive almost Hermitian 6-manifold with totally skew-symmetric Nijenhuis tensor which is neither complex nor Nearly K¨ahler. The torsion as well as the Nijenhuis tensor are parallel with respect to the torsion connection. Remark 6.7. Spaces for which the covariant derivative of the torsion is a four form (in particular zero) become automatically of ‘instanton type’. Spaces with parallel torsion are studied in [1] in connection with the string model; almost Hermitian 6-manifolds with parallel torsion are investigated very recently (after the first version of the present article was posted to the arXiv) in [3]. Remark 6.8. In general, on any almost hermitian manifold the Nijenhuis tensor N is the (3,0)+(0,3)-part of the torsion of any linear connection ∇ compatible with the almost hermitian structure (see e.g. [38]). Therefore, the condition ∇T = 0 always implies ∇N = 0 because ∇ preserves the type decomposition induced from the almost complex structure.

7. Almost Contact Metric Structures and Non-Compact SU (3)-Solutions in Dimension 6 We construct in this section a new non-compact solution to the type I-supergravity equations of motion in dimension 6. We derive our solution from Sasakian structures in dimension 5. Solutions to the gravitino and dilatino equations in dimension 5 are investigated in [32, 33]. In dimension five any solution to the gravitino equation, i.e. any parallel spinor

98

P. Ivanov, S. Ivanov

with respect to a metric connection with torsion 3-form defines an almost contact metric structure (g, ξ, η, ψ), which is preserved by the torsion connection [32, 33]. It is shown in [33] that solutions to both gravitino and dilatino equations are connected with a special type ‘conformal’ transformations of the almost contact structure introduced in [33]. We recall that an almost contact metric structure consists of an odd dimensional manifold M 2k+1 equipped with a Riemannian metric g, vector field ξ of length one, its dual 1-form η as well as an endomorphism ψ of the tangent bundle such that ψ(ξ ) = 0,

ψ 2 = −id + η ⊗ ξ,

g(ψ., ψ.) = g(., .) − η ⊗ η.

The Nijenhuis tensor N and the fundamental form F of an almost contact metric structure are defined by F (., .) = g(., ψ.),

N = [ψ, ψ] + dη ⊗ ξ.

There are many special types of almost contact metric structures. We introduce those which are relevant to our considerations: – – – –

normal almost contact structures determined by the condition N = 0; contact metric structures characterized by dη = 2F ; quasi-Sasaki structures, N = 0, dF = 0. Consequently, ξ is a Killing vector [13]; Sasaki structures, N = 0, dη = 2F . Consequently, ξ is a Killing vector [13].

An almost contact metric structure admits a linear connection ∇ with torsion 3-form preserving the structure, i.e. ∇g = ∇ξ = ∇ψ = 0, if and only if the Nijenhuis tensor is totally skew-symmetric and the vector field ξ is a Killing vector field [32]. In this case the torsion connection is unique. The torsion T of ∇ on a Sasakian manifold is expressed by T = η ∧ dη = 2η ∧ F and the torsion T is ∇-parallel, ∇T = 0 [32]. We restrict our attention to the Sasakian manifold in dimension five. The spinor bundle of a 5-dimensional contact metric spin manifold decomposes under the action of the fundamental 2-form F 5 into the sum = 0 ⊕ 1 ⊕ 2 , dim 0 = dim 2 = 1, dim 1 = 2. Spinors of type 1 parallel with respect to the torsion connection on quasi-Sasakian 5-manifold are studied in [33]. We are interested in ∇ 5 -parallel spinors of type 0 or 2 on Sasakian 5-manifold. We recall ([32], Theorem 9.2) that a 5-dimensional simply connected Sasakian manifold admits a ∇ 5 -parallel spinor of type 0 or 2 if and only if the Riemannian Ricci tensor Ricg has the form Ricg = 6g − 2η ⊗ η.

(7.52)

The Tanno deformation of a Sasakian structure satisfying (7.52), defined by the formulas ψ = ψ,

ξ =

3 ξ, 4

η =

4 η, 3

g =

4 4 g + η ⊗ η, 3 9

yields an Einstein-Sasakian structure with Ricci tensor Ricg = 4g and vice versa. We may choose locally an orthonormal basis e1 , e2 , e3 , e4 , e5 = ξ such that F 5 = e1 ∧ e 2 + e 3 ∧ e 4 ,

dη = de5 = 2F 5 ,

T 5 = 2η ∧ F 5 = 2e5 ∧ (e1 ∧ e2 + e3 ∧ e4 ).

(7.53)

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

99

Let M 6 = M 5 × R. We indicate the objects on M 6 with a superscript 6. Consider the product Riemannian manifold M 6 = M 5 ×R with the product metric and the compatible almost complex structure J determined by the fundamental form F 6 = F 5 + e 5 ∧ e6 ,

(7.54)

where e6 = dt is the standard 1-form on R. The identity (7.54) yields dF 6 = 2F 5 ∧ e6 = 2e6 ∧ F 6 . This equality tell us that the almost hermitian manifold (M 6 , g 6 , F 6 ) is locally conformally K¨ahler. In particular, the almost complex structure is integrable and the Lee form θ 6 = 2e6 = 2dt is a closed 1-form on R. It is easy to check that the torsion T 6 of the corresponding Bismut connection ∇ 6 is determined by the equality T 6 = T 5 . Consequently, ∇ 6 T 6 = 0. The Bismut connection defines an SU (3)-instanton on M 5 × R which solves the three supersymmetry equations (1.3) provided M 5 is a Sasakian 5-manifold whose Riemannian Ricci tensor satisfies (7.52). 7.1. Non-conformally flat local SU (3)-solutions on S 5 × S 1 . Let S 5 be the five-sphere with the standard Einstein Sasakian structure induced on S 5 by the usual complex structure on C3 considering S 5 as a totally umbilic hypersurface in the complex space C3 . We consider (S 5 , g, ψ, η, ξ ) as a Sasakian space form, i.e. a Tanno deformation of the standard Einstein-Sasakian structure on S 5 . We may assume that the Riemannian curvature is given by (see e.g. [13]) 1

4

g Rij kl = gj k gil − gik gj l + Fkj Fli − Fki Flj + 2Fij Flk 3 3 1

+ ηi ηk gj l − ηj ηk gil + ηj ηl gik − ηi ηl gj k . (7.55) 3 Consider the locally conformally K¨ahler structure on S 5 × R determined by (7.54). We claim that the corresponding Bismut connection satisfies the modified Bianchi identity (1.1) as well as the anomaly cancellation condition (1.2). Indeed, Eq. (7.53) yields dT 6 = dT 5 = dη ∧ dη = 4F 5 ∧ F 5 . Use (6.40), apply (7.53) and (7.55), to get 1 4

6 Rij∇ kl = gj k gil − gik gj l + ηi ηk gj l − ηj ηk gil + ηj ηl gik − ηi ηl gj k + dTij6 kl . 3 6 The latter equality as well as (6.39) help to calculate the Pontrjagin forms 8 6 6 16π 2 p1 (∇ 6 ) = T r(R ∇ ∧ R ∇ ) = − dT 6 , 3 Keeping Remark 6.1 in mind we obtain

16π 2 p1 (∇˜ 6 ) =

16 6 dT . 3

Theorem 7.1. The Sasakian space form (S 5 , g 5 , ψ, η, ξ ) solves the gravitino equation 5 and satisfies the modified Bianchi identity (1.1) and (1.2) for F A = R ∇ with negative α . Consequently: a) The product (S 5 × R, g 6 , F 6 , ∇ 6 , A = ∇ 6 ) solves all the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 6. The product (S 5 × S 1 , g 6 , F 6 , A = ∇ 6 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 6.

100

P. Ivanov, S. Ivanov

b) The product (S 5 × R × R, ω, A = ∇ 7 ) with the G2 -structure described in Sect. 4 solves all the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 7. The product (S 5 × S 1 × S 1 , ω, A = ∇ 7 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 7. c) The product (S 5 × R × R × R, , A = ∇ 8 ) with the Spin(7)-structure described in Sect. 5 solves the supersymmetry equations (1.3) with non-zero flux, non-constant dilaton and satisfies the Bianchi identity (1.1), (1.2). Therefore it solves the equations of motion in dimension 8. The product (S 5 × S 1 × S 1 × S 1 , , A = ∇ 8 ) is a compact space solving locally the supersymmetry equations (1.3) which satisfies the Bianchi identity (1.1), (1.2) in dimension 8. Remark 7.2. Note that all compact examples we have presented in Sects. 6 and 6 solve the supersymmetry equations only locally since the closed Lee form θ is actually a closed 1-form on a circle and therefore it can not be exact. This is consistent with the vanishing results claiming that there are no compact solutions with globally defined non-constant dilaton and non-zero flux in type II and type I supergravities. Acknowledgments. The research was done during the visit of S.I. at the Abdus Salam International Centre for Theoretical Physics, Trieste, Italy. S.I. thanks the Abdus Salam ICTP for providing support and an excellent research environment. S.I. is a member of the EDGE, Research Training Network HPRN-CT2000-00101, supported by the European Human Potential Programme. The research is partially supported by Contract MM 809/1998 with the Ministry of Science and Education of Bulgaria, Contract 586/2002 with the University of Sofia “St. Kl. Ohridski”. We thank Tony Pantev for his interest in this work, for the useful suggestions and stimulating discussions. We are also grateful to Jerome Gauntlett for his helpful comments and remarks. We would like to thank the referee for his valuable comments and suggestions on clarifying the phenomenological background, especially the form of the Bianchi identity including string corrections.

References 1. Agricola, I.: Connections on naturally reductive spaces, their Dirac operator and homogeneous models in string theory. Commun. Math. Phys. 232, 536–563 (2003) 2. Alessandrini, L., Bassanelli, G.: Metric properties of manifolds bimeromorphic to compact K¨ahler manifolds. J. Diff. Geom. 37, 95–121 (1993) 3. Alexandrov, B., Friedrich, Th., Schoemann, N.: Almost Hermitian 6-manifolds Revisited. J. Geom. Phys. 53, 1–30 (2005) 4. Alexandrov, B., Ivanov, S.: Vanishing theorems on Hermitian manifolds. Diff. Geom. Appl. 14(3), 251–265 (2001) 5. Apostolov, V., Salamon, S.: Kaehler reduction of metrics with holonomy G2 . Commun. Math. Phys. 246, 43–61 (2004) 6. Becker, K., Becker, M., Dasgupta, K., Green, P.S.: Compactifications of Heterotic Theory on NonKahler Complex Manifolds: I. JHEP 0304, 007 (2003) 7. Becker, K., Becker, M., Dasgupta, K., Green, P.S., Sharpe, E.: Compactifications of Heterotic Strings on Non-Kahler Complex Manifolds: II. Nucl. Phys. B678, 19–100 (2004) 8. Behrndt, K., Jeschek, C.: Fluxes in M-theory on 7-manifolds: G-structures and Superpotential. Nucl. Phys. B694, 99–114 (2004) 9. Belgun, F., Moroianu, A.: Nearly K¨ahler 6-manifolds with reduced holonomy. Ann. Global Anal. Geom. 19(4), 307–319 (2001) 10. Bergshoeff, E.A., de Roo, M.: The quartic effective action of the heterotic string and supersymmetry. Nucl. Phys. B328, 439 (1989) 11. Bilal, A., Derendinger, J.-P., Sfetsos, K.: (Weak) G2 Holonomy from Self-duality, Flux and Supersymmetry. Nucl.Phys. B628, 112–132 (2002)

SU (3)-Instantons and G2 , Spin(7)-Heterotic String Solitons

101

12. Bismut, J.-M.: A local index theorem for non-K¨ahler manifolds. Math. Ann. 284(4), 681–699 (1989) 13. Blair, D.: Contact manifolds in Riemannian geometry. Lect. Notes Math. Vol. 509, Berlin-Heidelberg-New York: Springer Verlag, 1976 14. Brandhuber, A., Gomis, J., Gubser, S., Gukov, S.: Gauge Theory at Large N and New G2 Holonomy Metrics. Nucl. Phys. B611, 179–204 (2001) 15. Bonan, E.: Sur le vari´et´es riemanniennes a groupe d’holonomie G2 ou Spin(7). C. R. Acad. Sci. Paris 262, 127–129 (1966) 16. Bryant, R.: Metrics with exeptional holonomy. Ann. Math. 126, 525–576 (1987) 17. Bryant, R., Salamon, S.: On the construction of some complete metrics with exceptional holonomy. Duke Math. J. 58, 829–850 (1989) 18. Cabrera, F.: On Riemannian manifolds with Spin(7)-structure. Publ. Math. Debrecen 46(3–4), 271– 283 (1995) 19. Cabrera, F.: On Riemannian manifolds with G2 -structure. Bolletino UMI A 10(7), 98–112 (1996) 20. Cabrera, F., Monar, M., Swann, A.: Classification of G2 -structures. J. Lond. Math. Soc. 53, 407–416 (1996) 21. Cardoso, G.L., Curio, G., Dall’Agata, G., Lust, D., Manousselis, P., Zoupanos, G.: Non-Kaehler String Backgrounds and their Five Torsion Classes. Nucl.Phys. B652, 5–34 (2003) 22. Cardoso, G.L., Curio, G., Dall’Agata, G., Lust, D.: BPS Action and Superpotential for Heterotic String Compactifications with Fluxes. JHEP 0310, 004 (2003) 23. Chiossi, S., Salamon, S.: The intrinsic torsion of SU(3) and G2 -structures. In: Differential Geometry, Valencia 2001, Singapore World Sci. Publishing, 2002, pp. 115–133 24. Corrigan, E., Devchand, C., Fairlie, D.B., Nuyts, J.: First-order equations for gauge fields in spaces of dimension greater than four. Nucl. Phys. B 214(3), 452–464 (1983) 25. de Wit, B., Smit, D.J., Hari Dass, N.D.: Residual Supersimmetry Of Compactified D=10 Supergravity. Nucl. Phys. B 283, 165 (1987) 26. Donaldson, S.K., Thomas, R.P.: Gauge theory in higher dimensions. In: The geometric universe (Oxford, 1996), Oxford: Oxford Univ. Press, 1998, pp 31–47 27. Fairlie, D.B., Nuyts, J.: Spherically symmetric solutions of gauge theories in eight dimensions. J. Phys. A17, 2867 (1984) 28. Fernandez, M.: A classification of Riemannian manifolds with structure group Spin(7). Ann. Mat. Pura Appl. 143, 101–122 (1982) 29. Fernandez, M., Gray, A.: Riemannian manifolds with structure group G2 . Ann. Mat. Pura Appl. 32(4), 19–45 (1982) 30. Fern´andez, M., Ugarte, L.: Dolbeault cohomology for G2 -manifolds. Geom. Dedicata 70(1), 57–86 (1998) 31. Freedman, D.Z., Gibbons, G.W., West, P.C.: Ten Into Four Won’t Go. Phys. Lett. B 124, 491 (1983) 32. Friedrich, Th., Ivanov, S.: Parallel spinors and connections with skew-symmetric torsion in string theory. Asian J. Math. 6, 3003–3036 (2002) 33. Friedrich, Th., Ivanov, S.: Almost contact manifolds, connections with torsion, parallel spinors. J. Reine Angew. Math. 559, 217–236 (2003) 34. Friedrich, Th., Ivanov, S.: Killing spinor equations in dimension 7 and geometry of integrable G2 manifolds. J. Geom. Phys 48, 1–11 (2003) 35. Friedrich, Th., Kath, I.: 7-dimensional compact Riemannian manifolds with Killing spinors. Commun. Math. Phys. 133(3), 543–561 (1990) 36. Friedrich, Th., Kath, I., Moroianu, A., Semmelmann, U.: On nearly parallel G2 -structures. J. Geom. Phys. 23(3-4), 259–286 (1997) 37. Fubini, S., Nikolai, H.: The octonionic instanton. Phys. Let. B 155, 369 (1985) 38. Gauduchon, P.: Hermitian connections and Dirac operators. Boll. Un. Mat. Ital. B (7) 11(2), Suppl., 257–288 (1997) 39. Gauduchon, P.: Fibr´es hermitiens a` endomorphisme de Ricci non n´egatif. Bull. Soc. Math. France 105(2), 113–140 (1977) 40. Gauntlett, J., Kim, N., Martelli, D., Waldram, D.: Fivebranes wrapped on SLAG three-cycles and related geometry. JHEP 0111, 018 (2001) 41. Gauntlett, J.P., Martelli, D., Pakis, S., Waldram, D.: G-Structures and Wrapped NS5-Branes. Commun. Math. Phys. 247, 421–445 (2004) 42. Gauntlett, J., Martelli, D., Waldram, D.: Superstrings with Intrinsic torsion. Phys. Rev. D69, 086002 (2004) 43. Gibbons, G.W., Lu, H., Pope, C.N., Stelle, K.S.: Supersymmetric Domain Walls from Metrics of Special Holonomy. Nucl. Phys. B623, 3–46 (2002) 44. Gibbons, G.W., Page, D.N., Pope, C.N.: Einstein metrics on S 3 , R3 , and R4 bundles. Commun. Math. Phys. 127, 529–553 (1990) 45. Gillard, J., Papadopoulos, G., Tsimpis, D.: Anomaly, Fluxes and (2,0) Heterotic-String Compactifications. JHEP 0306, 035 (2003)

102

P. Ivanov, S. Ivanov

46. Goldstein, E., Prokushkin, S.: Geometric Model for Complex Non-Kaehler Manifolds with SU(3) Structure. Commun. Math. Phys. 251, 65–78 (2004) 47. Gray, A.: Vector cross product on manifolds. Trans. Am. Math. Soc. 141, 463–504 (1969) Correction 148, 625 (1970) 48. Gray, A., Hervella, L.: The sixteen classes of almost Hermitian manifolds and their linear invariants. Ann. Mat. Pura Appl. 123(4), 35–58 (1980) 49. Gurrieri, S., Micu, A.: Type IIB Theory on Half-flat Manifolds. Class.Quant.Grav. 20, 2181–2192 (2003) 50. Gurrieri, S., Louis, J., Micu, A., Waldram, D.: Mirror Symmetry in Generalized Calabi-Yau Compactifications. Nucl.Phys. B654, 61–113 (2003) 51. Gutowski, J., Ivanov, S., Papadopoulos, G.: Deformations of generalized calibrations and compact non-Kahler manifolds with vanishing first Chern class. Asian J. Math. 7, 39–80 (2003) 52. G¨unaydin, M., Nikolai, H.: Seven-dimensional octonionic Yang-Mills instanton and its extension to an heterotic string soliton. Phys. Lett. B 353, 169 (1991) 53. Harvey, R., Lawson, H.B.: Calibrated geometries. Acta Math. 148, 47–157 (1982) 54. Harvey, J.A., Strominger, A.: Octonionic superstring solitons. Phys. Review Let. 66(5), 549 (1991) 55. Hitchin, N.: Stable forms and special metrics. In: Global differential geometry: the mathematical legacy of Alfred Gray (Bilbao, 2000), Contemp. Math. 288, Providence, RI: Amer. Math. Soc., 2001 pp. 70–89 56. Ivanov, S.: Geometry of quaternionic K¨ahler connections with torsion. J. Geom. Phys. 41(3), 235– 257 (2002) 57. Ivanov, S.: Connection with torsion, parallel spinors and geometry of Spin(7) manifolds. Math. Res. Lett. 11(2–3), 171–186 (2004) 58. Ivanov, S., Papadopoulos, G.: A no-go theorem for string warped compactifications. Phys.Lett. B497, 309–316 (2001) 59. Ivanov, S., Papadopoulos, G.: Vanishing Theorems and String Backgrounds. Class.Quant.Grav. 18, 1089–1110 (2001) 60. Joyce, D.: Compact Riemannian 7-manifolds with holonomy G2 . I. J.Diff. Geom. 43, 291–328 (1996) 61. Joyce, D.: Compact Riemannian 7-manifolds with holonomy G2 . II. J.Diff. Geom. 43, 329–375 (1996) 62. Joyce, D.: Compact Riemannian manifolds with special holonomy, Oxford Oxford University Press, 2000 63. Kachru, S., Schulz, M.B., Tripathy, P.K., Trivedi, S.P.: New Supersymmetric String Compactifications. JHEP 0303, 061 (2003) 64. Kirichenko, V.: K-spaces of maximal rank. Mat. Zam. 22, 465–476 (1977) (In Russian) 65. Kovalev, A.: Twisted connected sums and special Riemannian holonomy. J. Reine Angew. Math. 565, 125–160 (2003) 66. Lawson, B., Michelsohn, M.-L.: Spin Geometry. Princeton, NJ: Princeton University Press, 1989 67. Malcev, A.I.: On a class of homogeneous spaces. Reprinted in Amer. Math. Soc. Trans. Series 1, 9, 276–307 (1962) 68. Michelsohn, M.L.: On the existence of special metrics in complex geometry. Acta Math. 149(3–4), 261–295 (1982) 69. Papadopoulos, G.: (2,0)-supersymmetric sigma models and almost complex structures. Nucl.Phys. B448, 199–219 (1995) 70. Reyes Carrin, R.: A generalization of the notion of instanton. Diff. Geom. Appl. 8(1), 1–20 (1998) 71. Salamon, S.: Riemannian geometry and holonomy groups. Pitman Res. Notes Math. Ser. 201, London: Pitman 1989 72. Salamon, S.: Almost parallel structures. In: Global differential geometry: the mathematical legacy of Alfred Gray (Bilbao, 2000), Contemp. Math. 288, Providence, RI: Amer. Math. Soc. 2001 pp 162–181 73. Strominger, A.: Superstrings with torsion. Nucl. Phys. B 274, 253 (1986) 74. Ugarte, L.: Coeffective Numbers of Riemannian 8-manifold with Holonomy in Spin(7). Ann. Glob. Anal. Geom. 19, 35–53 (2001) Communicated by G. W. Gibbons

Commun. Math. Phys. 259, 103–128 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1297-6

Communications in

Mathematical Physics

Lagrangian Supersymmetries Depending on Derivatives. Global Analysis and Cohomology 1

Giovanni Giachetta , Luigi Mangiarotti1 , Gennadi Sardanashvily2 1 2

Department of Mathematics and Informatics, University of Camerino, 62032 Camerino (MC), Italy Department of Theoretical Physics, Physics Faculty, Moscow State University, 117234 Moscow, Russia

Received: 28 May 2004 / Accepted: 20 July 2004 Published online: 18 February 2005 – © Springer-Verlag 2005

Abstract: Lagrangian contact supersymmetries (depending on derivatives of arbitrary order) are treated in a very general setting. The cohomology of the variational bicomplex on an arbitrary graded manifold and the iterated cohomology of a generic nilpotent contact supersymmetry are computed. In particular, the first variational formula and conservation laws for Lagrangian systems on graded manifolds using contact supersymmetries are obtained. 1. Introduction At present, BRST transformations in the BV formalism [7, 24] provide the most interesting example of Lagrangian contact supersymmetries, depending on derivatives and preserving the contact ideal of graded exterior forms. Much that is already known regarding Lagrangian BRST theory (including the short variational complex, BRST cohomology [4, 5, 8], Noether’s conservation laws [5, 16, 19]) has been formulated in terms of jet manifolds of vector bundles (see [5] for a survey) since the jet manifold formalism provides the algebraic description of Lagrangian and Hamiltonian systems of both even and odd variables. In spite of this formulation, most authors however assume the base manifold X of these bundles to be contractible because, e.g., the relative (local in the terminology of [4, 5]) cohomology are not trivial even when X = Rn . Stimulated by the BRST theory, we consider Lagrangian systems of odd variables and contact supersymmetries in a very general setting. For this purpose, one usually calls into play fiber bundles over supermanifolds [12, 13, 17, 34]. We describe odd variables and their jets on an arbitrary smooth manifold X as generating elements of the structure ring of a graded manifold whose body is X [32, 38, 39]. This definition differs from that of jets of a graded fiber bundle [27], but reproduces the heuristic notion of jets of ghosts in the field-antifield BRST theory on Rn [5, 9]. Our goal is the following. Firstly, we construct the Z2 -graded variational bicomplex on a graded manifold with an arbitrary body X, and obtain the cohomology of its short

104

G. Giachetta, L. Mangiarotti, G. Sardanashvily

variational subcomplex and the complex of one-contact graded forms (Theorem 4.1). In particular, the first variational formula and conservation laws for Lagrangian systems on graded manifolds using contact supersymmetries are obtained (formulae (5.4) – (5.5)). Secondly, the iterated cohomology of a generic nilpotent contact supersymmetry is computed (Theorems 6.2, 6.4 and 6.5). In the most interesting case of the form degree n = dim X, it coincides with the above mentioned relative cohomology. Therefore, we extend the results of [5] and our recent work [21] to an arbitrary nilpotent contact supersymmetry. As is well-known, generalized (depending on derivatives) symmetries of differential equations have been intensively investigated [3, 11, 29, 30, 35]. Generalized symmetries of Lagrangian systems on a local coordinate domain have been described in detail [11, 35]. The variational bicomplex constructed in the framework of the infinite order jet formalism enables one to provide the global analysis of Lagrangian systems on a fiber bundle and their symmetries [2, 22, 32, 42]. Sketched in Sect. 2 of our work, this analysis is extended to Lagrangian systems on graded manifolds (Sect. 3). Recall that an r-order Lagrangian on a fiber bundle Y → X is defined as a horizontal n

density L : J r Y → ∧ T ∗ X, n = dim X, on the r-order jet manifold J r Y of sections of Y → X. With the inverse system of finite order jet manifolds r πr−1

π01

π

X ←− Y ←− J 1 Y ←− · · · J r−1 Y ←− J r Y ←− · · · ,

(1.1)

we have the direct system π∗

r ∗ πr−1

π01 ∗

∗ O∗ (X) −→ O∗ (Y ) −→ O1∗ −→ · · · Or−1 −→ Or∗ −→ · · ·

(1.2)

of graded differential algebras (henceforth GDAs) of exterior forms on jet manifolds r ∗ . Its direct limit is the GDA O ∗ with respect to the pull-back monomorphisms πr−1 ∞ consisting of all the exterior forms on finite order jet manifolds modulo the pull-back ∗ is decomposed into the sum d = d + d identification. The exterior differential on O∞ H V ∗ into a bicomplex. of the total and the vertical differentials. These differentials split O∞ Introducing the projector (2.3) and the variational operator δ, one obtains the var∗ . Its d - and δ-cohomology (Theorem 2.1) has been iational bicomplex (2.4) of O∞ H obtained in several steps [1, 2, 22, 35, 41–43]. In order to define the variational bicomplex on graded manifolds (Sect. 3), let us recall that, by virtue of Batchelor’s theorem [6], any graded manifold (A, X) with a body X is isomorphic to the one whose structure sheaf AQ is formed by germs of sections of the exterior product 2

∧Q∗ = R ⊕ Q∗ ⊕ ∧ Q∗ ⊕ · · · , X

X

(1.3)

X

where Q∗ is the dual of some real vector bundle Q → X. In field models, a vector bundle Q is usually given from the beginning. Therefore, we consider graded manifolds (X, AQ ) where Batchelor’s isomorphism holds. We agree to call (X, AQ ) the simple graded manifold constructed from Q. Accordingly, r-order jets of odd fields are defined as generating elements of the structure ring of the simple graded manifold (X, AJ r Q ) constructed from the jet bundle J r Q → X of Q which is also a vector bundle [32, 39]. Let CJ∗ r Q be the bigraded differential algebra (henceforth BGDA) of Z2 -graded (or, simply, graded) exterior forms on the graded manifold (X, AJ r Q ). A linear bundle r : J r Q → J r−1 Q yields the corresponding monomorphism of BGDAs morphism πr−1 ∗ ∗ CJ r−1 Q → CJ r Q [6, 32]. Hence, there is the direct system of BGDAs

Lagrangian Supersymmetries Depending on Derivatives π01∗

105

r ∗ πr−1

∗ CQ −→ CJ∗ 1 Q · · · −→ CJ∗ r Q −→ · · · ,

(1.4)

∗ consists of graded exterior forms on graded manifolds (X, A r ), whose direct limit C∞ J Q r ∈ N, modulo the pull-back identification. This definition of odd jets enables one to describe odd and even variables (e.g., fields, ghosts and antifields in BRST theory) on the same footing. Namely, let Y → X be an ∗ ⊂ O ∗ the C ∞ (X)-subalgebra of exterior forms whose coeffiaffine bundle and P∞ ∞ cients are polynomial in the fiber coordinates on jet bundles J r Y → X. This notion is ∗ is an exterior form on some finite order jet manifold intrinsic since any element of O∞ ∗ of graded and all jet bundles J r Y → X are affine. Let us consider the product S∞ ∗ ∗ ∗ algebras C∞ and P∞ over their common subalgebra O (X) of exterior forms on X. It is a BGDA which is split into the Z2 -graded variational bicomplex, analogous to that of ∗ . O∞ In Sect. 4, we obtain cohomology of some subcomplexes of the variational bicom∗ when X is an arbitrary manifold (Theorem 4.1). They are the short variational plex S∞ complex (4.1) of horizontal (local in the terminology of [5, 8]) graded exterior forms and the complex (4.2) of one-contact graded forms. For this purpose, one however must: (i) ∗ to the BGDA (S∗ ) of graded exterior forms of locally finite jet enlarge the BGDA S∞ ∞ order, (ii) compute the cohomology of the corresponding complexes of (S∗∞ ), and (iii) ∗ . Following this proceprove that this cohomology of (S∗∞ ) coincides with that of S∞ dure, we show that cohomology of the complex (4.1) equals the de Rham cohomology of X, while the complex (4.2) is globally exact. Note that the exactness of the short variational complex (4.1) on X = Rn has been repeatedly proved [5, 8, 14]. One has also considered its subcomplex of graded exterior forms whose coefficients are constant on Rn . Its dH -cohomology is not trivial [5]. The exactness of the complex (4.2) enables us to generalize the first variational formula and Lagrangian conservation laws in the calculus of variations on fiber bundles to graded Lagrangians and contact supersymmetries of arbitrary order (Sect. 5). Cohomology of the short variational complex (4.1) and its modification (6.2) is the main ingredient in a computation of the iterated cohomology of nilpotent contact supersymmetries. By analogy with a contact symmetry (Proposition 2.3), an infinitesimal contact supertransformation or, simply, a contact supersymmetry υ is defined as a graded derivation 0 such that the Lie derivative L preserves the contact ideal of the BGDA of the R-ring S∞ υ ∗ . The BRST transformation υ (5.7) in gauge theory on principal bundles exemplifies S∞ a first order contact supersymmetry such that the Lie derivative Lυ of horizontal graded exterior forms is nilpotent. This fact motivates us to study nilpotent contact supersymmetries in a general setting. The key point is that the Lie derivative Lυ along a contact supersymmetry and the total differential dH mutually commute. When Lυ is nilpotent (Lemma 5.3), we suppose 0,∗ that the dH -complex S∞ of horizontal graded exterior forms is split into the bicomplex k,m {S } with respect to the nilpotent operator

sυ φ = (−1)|φ| Lυ φ,

0,∗ φ ∈ S∞ ,

(1.5)

and the total differential dH . In the case of the above mentioned BRST transformation υ (5.7), sυ (1.5) is the BRST operator. One usually studies the relative cohomology H ∗,∗ (sυ /dH ) of sυ with respect to the total differential dH (see [15] for the BRST cohomology modulo the exterior differential d). This cohomology is not trivial even when X = Rn , but it can be related to the total (sυ + dH )-cohomology only in the form

106

G. Giachetta, L. Mangiarotti, G. Sardanashvily

degree n = dim X. We consider the iterated cohomology H ∗,∗ (sυ |dH ) of the bicomplex {S k,m } (Sect. 6). In the most interesting case of form degree n = dim X, relative and iterated cohomology groups coincide. They naturally characterize graded Lagrangians 0,∗ L ∈ S ∗,n , for which υ is a variational symmetry, modulo Lie derivatives Lυ ξ , ξ ∈ S∞ , and dH -exact graded exterior forms. Using the fact that dH -cocycles are represented by exterior forms on X and that any exterior form on X is sυ -closed, we obtain the iterated cohomology H ∗,m
y λ+ = i

∂x µ

i d y , λ µ ∂x

0 ≤ | |,

(2.1)

of J ∞ Y , where = (λk ...λ1 ) is a symmetric multi-index, λ + = (λλk ...λ1 ), and dλ = ∂λ +

| |≥0

d = dλr ◦ · · · ◦ dλ1 ,

i yλ+ ∂i ,

= (λr ...λ1 ),

(2.2)

are the total derivatives. Hereafter, we fix an atlas of Y and, consequently, that of J ∞ Y ∗ containing a finite number of charts [26]. Restricted to the chart (2.1), the GDA O∞ λ can be written in a coordinate form; horizontal forms {dx } and contact one-forms i = dy i − y i λ 0 ∗ {θ λ+ dx } make up a local basis for the O∞ -algebra O∞ .

∗ = ⊕O k,m of O ∗ into O 0 -modules O k,m There is the canonical decomposition O∞ ∞ ∞ ∞ ∞ of k-contact and m-horizontal forms together with the corresponding projections hk : ∗ → O k,∗ and hm : O ∗ → O ∗,m . Accordingly, the exterior differential on O ∗ is O∞ ∞ ∞ ∞ ∞ split into the sum d = dH + dV of the total and vertical differentials

dH ◦ hk = hk ◦ d ◦ hk , dV ◦ hm = hm ◦ d ◦ hm ,

dH ◦ h0 = h0 ◦ d, dH (φ) = dx λ ∧ dλ (φ), i ∗ dV (φ) = θ ∧ ∂i φ, φ ∈ O∞ .

One also introduces the R-module projector =

1 k>0

k

◦ hk ◦ hn ,

(φ) =

| |≥0

(−1)| | θ i ∧ [d (∂i φ)],

>0,n φ ∈ O∞ ,

(2.3)

Lagrangian Supersymmetries Depending on Derivatives

107

∗ such that ◦ d = 0 and the nilpotent variational operator δ = ◦ d on O ∗,n . of O∞ H V ∞ k,n ∗ is split into the variational bicomplex Put Ek = (O∞ ). Then the GDA O∞

dV

0 →

1,0 O∞ dV

0 → R →

−→ dH

0,1 O∞

π ∞∗ d

dV dH

−→ · · ·

6

−→

6 0

1,1 O∞

dV

O 0 (X) −→

6

6

dV dH

6

0 O∞ π ∞∗

0 → R →

6

dV dH

−→ · · ·

6

6 1,m O∞

π ∞∗ d

6

dV dH

−→ · · ·

6

0,m O∞

1,n O∞

dH

−→ · · ·

6

0,n O∞ π ∞∗

6

6

−δ

6

−→

E1 → 0

≡

6

0,n O∞

(2.4)

d

O m (X) −→ · · ·

O n (X) −→ 0

6

0

−δ

6

dV

d

O 1 (X) −→ · · ·

. . .

. . .

. . .

. . .

. . .

6

0

0

Its cohomology has been obtained in several steps (see the outline of proof of Theorem 2.1 in Appendix A). Theorem 2.1. (i) The second row from the bottom and the last column of this bicomplex make up the variational complex dH

dH

δ

δ

0 0,1 0,n 0 → R → O∞ −→ O∞ · · · −→ O∞ −→ E1 −→ E2 −→ · · · .

(2.5)

Its cohomology is isomorphic to the de Rham cohomology of the fiber bundle Y . (ii) The rows of contact forms of the bicomplex (2.4) are exact sequences. One can think of the elements 0,n L = Lω ∈ O∞ , δL = (−1)| | d (∂i L)θ i ∧ ω ∈ E1 ,

ω = dx 1 ∧ · · · ∧ dx n ,

| |≥0

of the variational complex (2.5) as being a finite order Lagrangian and its Euler–Lagrange operator, respectively. Corollary 2.2. The exactness of the row of one-contact forms of the variational bicom1,n plex (2.4) at the term O∞ relative to the projector provides the R-module decomposition 1,n 1,n−1 O∞ = E1 ⊕ dH (O∞ ) 0,n , the corresponding decomposition and, given a Lagrangian L ∈ O∞

dL = δL − dH .

(2.6)

The form in the decomposition (2.6) is not uniquely defined. It reads λν ...ν

= Fi s 1 θνis ...ν1 ∧ ωλ , Fiνk ...ν1 = ∂iνk ...ν1 L − dλ Fiλνk ...ν1 +hνi k ...ν1 , ωλ = ∂λ ω, s=0 (ν ν

)...ν

1 0 obey the relations hν = 0, h k k−1 = 0. It follows where local functions h ∈ O∞ i i that L = + L is a Lepagean equivalent of a finite order Lagrangian L [25]. The decomposition (2.6) leads to the first variational formula (2.15) and the Lagrangian conservation law (2.16) as follows.

108

G. Giachetta, L. Mangiarotti, G. Sardanashvily

0 of the R-ring O 0 such that the Lie derivative L preserves A derivation υ ∈ dO∞ υ ∞ ∗ (i.e., the Lie derivative L of a contact form is a contact the contact ideal of the GDA O∞ υ form) is called an infinitesimal contact transformation or, simply, a contact symmetry (by analogy with C-transformations in [30] though υ need not come from a morphism of J ∞ Y ). Proposition 2.3 below shows that, restricted to a coordinate chart (2.1) and a GDA Or∗ of finite jet order, any contact symmetry υ is the jet prolongation of a generalized vector field in [35]. 0 is isomorphic to the O 0 -dual (O 1 )∗ Proposition 2.3. (i) The derivation module dO∞ ∞ ∞ 1 0 is of the module of one-forms O∞ . (ii) Relative to an atlas (2.1), a derivation υ ∈ dO∞ given by the expression i υ = υ λ ∂λ + υ i ∂i + υ ∂i , (2.7) | |>0

i are local smooth functions of finite jet order obeying the transformation where υ λ , υ i , υ law

υ = λ

∂x λ µ υ , ∂x µ

υ = i

∂y i j ∂y i µ υ + µυ , ∂y j ∂x

υ = i

∂y i ||≤| |

j

∂y

j

υ +

∂y i µ υ . ∂x µ (2.8)

(iii) A derivation υ (2.7) is a contact symmetry iff i i υ = d (υ i − yµi υ µ ) + yµ+ υ µ,

0 < | |.

(2.9)

∗ is generated by elements df , f ∈ O 0 . It suffices Proof. (i) At first, let us show that O∞ ∞ 1 0 -linear combination of elements df , to justify that any element of O∞ is a finite O∞ 0 . Indeed, every φ ∈ O 1 is an exterior form on some finite order jet manifold f ∈ O∞ ∞ r J Y . By virtue of the Serre–Swan theorem extended to non-compact manifolds [36, 40], the C ∞ (J r Y )-module Or∗ of one-forms on J r Y is a projective module of finite rank, i.e., φ is represented by a finite C ∞ (J r Y )-linear combination of elements df , 0 . Any element ∈ (O 1 )∗ yields a derivation υ (f ) = (df ) f ∈ C ∞ (J r Y ) ⊂ O∞ ∞ 0 1 is generated by elements df , f ∈ O 0 , different of the R-ring O∞ . Since the module O∞ ∞ 1 )∗ provide different derivations of O 0 , i.e., there is a monomorphism elements of (O∞ ∞ 1 )∗ → dO 0 . By the same formula, any derivation υ ∈ dO 0 sends df → υ(f ) (O∞ ∞ ∞ 0 is generated by elements df , it defines a morphism : O 1 → O 0 . and, since O∞ υ ∞ ∞ Moreover, different derivations υ provide different morphisms υ . Thus, we have a 0 → (O 1 )∗ . monomorphism and, consequently, an isomorphism dO∞ ∞ 1 0 -module generated by the (ii) Restricted to a coordinate chart (2.1), O∞ is a free O∞ i . Then dO 0 = (O 1 )∗ restricted to this chart consists of eleexterior forms dx λ , θ ∞ ∞ i . The transformation rule (2.8) results ments (2.7), where ∂λ , ∂i are the duals of dx λ , θ from the transition functions (2.1). The interior product υ φ and the Lie derivative Lυ φ, ∗ , obey the standard formulae. Restricted to a coordinate chart, the Lie derivative φ ∈ O∞ Lυ sends each finite jet order GDA Or∗ to another finite jet order GDA Os∗ . Since the ∗ . atlas (2.1) is finite, Lυ φ preserves O∞ (iii) The expression (2.9) results from a direct computation similar to that of the first part of B¨acklund’s theorem [29]. One can then justify that local functions (2.9) satisfy the transformation law (2.8).

Lagrangian Supersymmetries Depending on Derivatives

109

Any contact symmetry admits the horizontal splitting υ = υH + υV = υ λ dλ + (ϑ i ∂i +

| |>0

ϑ i = υ i − yµi υ µ , (2.10)

d ϑ i ∂i ),

0 [32]. relative to the canonical connection ∇ = dx λ ⊗ dλ on the C ∞ (X)-ring O∞

Lemma 2.4. Any vertical contact symmetry υ = υV obeys the relations υ dH φ = −dH (υ φ), Lυ (dH φ) = dH (Lυ φ),

∗ φ ∈ O∞ .

(2.11) (2.12)

Proof. It is easily justified that, if φ and φ satisfy the relation (2.11), then φ ∧ φ does i . The well. Then it suffices to prove the relation (2.11) when φ is a function and φ = θ result follows from the equalities i i υ θ = υ ,

i i dH (υ ) = υλ+ dx λ ,

i dH θλi = dx λ ∧ θλ+ , (2.13)

i i ∂i = v ∂i ◦ dλ . dλ ◦ v

The relation (2.12) is a corollary of the equality (2.11).

(2.14)

0,n Proposition 2.5. Given a Lagrangian L = Lω ∈ O∞ , its Lie derivative Lυ L along a contact symmetry υ (2.10) fulfills the first variational formula

Lυ L = υV δL + dH (h0 (υ

L )) + LdV (υH ω),

(2.15)

where L is a Lepagean equivalent, e.g., a Poincar´e–Cartan form of L. Proof. The formula (2.15) comes from the splitting (2.6) and the relation (2.11) as follows: Lυ L = υ dL + d(υ L) = [υV dL − dV L ∧ υH ω] + [dH (υH L) + dV (LυH ω)] = υV dL + dH (υH L) + LdV (υH ω) = υV δL − υV dH + dH (υH L) + LdV (υH ω) = υV δL + dH (υV

+ υH L) + LdV (υH ω), where υV

= h0 (υ

) since is a one-contact form, υH L = h0 (υ L), and L =

+ L. Let υ be a variational symmetry of L (in the terminology of [35]), i.e., Lυ L = dH σ , 0,n−1 σ ∈ O∞ . By virtue of the expression (2.15), this condition implies that υ is projected onto X. Then the first variational formula (2.15) restricted to Ker δL leads to the weak conservation law 0 ≈ dH (h0 (υ

L ) − σ ).

(2.16)

110

G. Giachetta, L. Mangiarotti, G. Sardanashvily

3. Z2 -Graded Variational Bicomplex Let (X, AQ ) be the simple graded manifold constructed from a vector bundle Q → X of fiber dimension m. Its structure ring AQ of sections of AQ consists of sections of the exterior bundle (1.3) called graded functions. Given bundle coordinates (x λ , q a ) on Q with transition functions q a = ρba q b , let {ca } be the corresponding fiber bases for Q∗ → X, together with transition functions c a = ρba cb . Then (x λ , ca ) is called the local basis for the graded manifold (X, AQ ) [6, 32]. With respect to this basis, graded functions read f =

m 1 fa ...a ca1 · · · cak , k! 1 k k=0

where fa1 ···ak are local smooth real functions on X. Given a graded manifold (X, AQ ), by the sheaf dAQ of graded derivations of AQ is meant a subsheaf of endomorphisms of the structure sheaf AQ such that any section u of dAQ over an open subset U ⊂ X is a Z2 -graded derivation of the Z2 -graded ring AQ (U ) of graded functions on U , i.e., u(ff ) = u(f )f + (−1)[u][f ] f u(f ),

f, f ∈ AQ (U ),

where [.] denotes the Grassmann parity. One can show that sections of dAQ over U exhaust all Z2 -graded derivations of the ring AQ (U ) [6]. Let dAQ be the Lie superalgebra of Z2 -graded derivations of the R-ring AQ . Its elements are called Z2 -graded (or, simply, graded) vector fields on (X, AQ ). Due to the canonical splitting V Q = Q × Q, the vertical tangent bundle V Q → Q of Q → X can be provided with the fiber bases {∂a } which is the dual of {ca }. Then a graded vector field takes the local form u = uλ ∂λ + ua ∂a , where uλ , ua are local graded functions. It acts on AQ by the rule u(fa...b ca · · · cb ) = uλ ∂λ (fa...b )ca · · · cb + ud fa...b ∂d (ca · · · cb ).

(3.1)

This rule implies the corresponding transformation law u = uλ , λ

u = ρja uj + uλ ∂λ (ρja )cj . a

Then one can show [32, 38] that graded vector fields on a simple graded manifold can be represented by sections of the vector bundle VQ → X which is locally isomorphic to the vector bundle ∧Q∗ ⊗X (Q ⊕X T X), and is equipped with the bundle coordinates (x˙aλ1 ...ak , vbi 1 ...bk ), k = 0, . . . , m, together with the transition functions x˙ i1 ...ik = ρ −1 ia11 · · · ρ −1 iakk x˙aλ1 ...ak , j i −1 b1 −1 bk v j1 ...jk = ρ j1 · · · ρ jk ρji vb1 ...bk + λ

k! λ i ∂λ ρbk . x˙ (k − 1)! b1 ...bk−1

Using this fact, we can introduce graded exterior forms on the simple graded mani∗ , where V ∗ → X is the pointwise fold (X, AQ ) as sections of the exterior bundle ∧ VQ Q ∧Q∗ -dual of VQ . Relative to the dual bases {dx λ } for T ∗ X and {dcb } for Q∗ , graded one-forms read φ = φλ dx λ + φa dca ,

φa = ρ −1ba φb ,

φλ = φλ + ρ −1ba ∂λ (ρja )φb cj .

Lagrangian Supersymmetries Depending on Derivatives

111

The duality morphism is given by the interior product u φ = uλ φλ + (−1)[φa ] ua φa . ∗ with respect to the bigraded exterior Graded exterior forms constitute the BGDA CQ product ∧ and the exterior differential d. The standard formulae of a BGDA hold. Since the jet bundle J r Q → X of a vector bundle Q → X is a vector bundle, let us consider the simple graded manifold (X, AJ r Q ) constructed from J r Q → X. Its local a }, 0 ≤ | | ≤ r, together with the transition functions basis is {x λ , c a j a c λ+ = dλ (ρja c ), dλ = ∂λ + cλ+ ∂a , (3.2) | |
a. where ∂a are the duals of c ∗ Let CJ r Q be the BGDA of graded exterior forms on the graded manifold (X, AJ r Q ). ∗ of the direct system (1.4) inherits the BGDA operations intertwined The direct limit C∞ r ∗ . It is locally a free C ∞ (X)-algebra countably generated by the monomorphisms πr−1 a a = dca − ca λ by the elements (1, c , dx λ , θ λ+ dx ), 0 ≤ | |. It should be emphasized ∗ ∗ that, in contrast with the GDA O∞ , the BGDA C∞ consists of sections of sheaves over X. In order to regard these algebras on the same footing, let Y → X hereafter be an affine ∗ is an algebra of sections of some sheaf bundle. Then one can show that the GDA O∞ over X (see Appendix B). Let us consider the above mentioned polynomial subalgebra ∗ of O ∗ and the product C ∗ ∧ P ∗ of graded algebras C ∗ and P ∗ over their common P∞ ∞ ∞ ∞ ∞ ∞ graded subalgebra O∗ (X) of exterior forms on X. It consists of the elements ∗ ∗ ψi ⊗ φ i , φi ⊗ ψ i , ψ ∈ C∞ , φ ∈ P∞ , i

i ∗ C∞

∗ P∞

∗ ⊗ C ∗ of the C ∞ (X)-modules C ∗ and P ∗ of the tensor products ⊗ and P∞ ∞ ∞ ∞ which are subject to the commutation relations ∗ ψ ⊗ φ = (−1)|ψ||φ| φ ⊗ ψ, ψ ∈ C∞ , ∗ (ψ ∧ σ ) ⊗ φ = ψ ⊗ (σ ∧ φ), σ ∈ O (X),

∗ φ ∈ P∞ ,

(3.3)

and the multiplication

(ψ ⊗ φ) ∧ (ψ ⊗ φ ) := (−1)|ψ ||φ| (ψ ∧ ψ ) ⊗ (φ ∧ φ ).

(3.4)

Elements ψ ⊗φ are endowed with the total form degree |ψ|+|φ| and the total Grassmann parity [ψ]. Then the multiplication (3.4) obeys the relation

ϕ ∧ ϕ = (−1)|ϕ||ϕ |+[ϕ][ϕ ] ϕ ∧ ϕ,

∗ ϕ, ϕ ∈ S∞ ,

∗ ∧ P ∗ into a bigraded C ∞ (X)-algebra S ∗ , where the asterisk means the and makes C∞ ∞ ∞ total form degree. Due to the algebra monomorphisms ∗ ∗ C∞ ψ → ψ ⊗ 1 = 1 ⊗ ψ ∈ S∞ ,

∗ ∗ P∞ φ → φ ⊗ 1 = 1 ⊗ φ ∈ S∞ ,

∗ as being an algebra generated by elements of C ∗ and P ∗ . For one can think of S∞ ∞ ∞ 0 are polynomials of ca and y i with coefficients in instance, elements of the ring S∞ C ∞ (X).

112

G. Giachetta, L. Mangiarotti, G. Sardanashvily ∗ with the exterior differential Let us provide S∞

d(ψ ⊗ φ) := (dC ψ) ⊗ φ + (−1)|ψ| ψ ⊗ (dP φ),

∗ ψ ∈ C∞ ,

∗ φ ∈ P∞ , (3.5)

∗ and P ∗ , where dC and dP are exterior differentials on the differential algebras C∞ ∞ respectively. We obtain at once from the relation (3.3) that

d(φ ⊗ ψ) = (dP φ) ⊗ ψ + (−1)|φ| φ ⊗ (dC ψ),

∗ ψ ∈ C∞ ,

∗ φ ∈ P∞ .

The exterior differential d (3.5) is nilpotent. It obeys the equalities d(ϕ ∧ ϕ ) = dϕ ∧ ϕ + (−1)|ϕ| ϕ ∧ dϕ ,

∗ ϕ, ϕ ∈ S∞ ,

∗ into a BGDA, which is locally generated by the elements and makes S∞ a i a a a i i i (1, c , y , dx λ , θ = dc − cλ+ dx λ , θ = dy − yλ+ dx λ ),

| | ≥ 0.

A and θ A stand both for even and odd generatHereafter, let the collective symbols s a i a i ∞ ∗ which, thus, is locally generated ing elements c , y , θ , θ of the C (X)-algebra S∞ A A λ ∗ the graded exterior forms by (1, s , dx , θ ), | | ≥ 0. We agree to call elements of S∞ on X. ∗ , the BGDA S ∗ is decomposed into S 0 -modules S k,r of k-contact Similarly to O∞ ∞ ∞ ∞ and r-horizontal graded forms together with the corresponding projections hk and hr . ∗ is split into the sum d = d + d Accordingly, the exterior differential d (3.5) on S∞ H V of the total and vertical differentials

dH (φ) = dx λ ∧ dλ (φ),

∗ φ ∈ S∞ .

A dV (φ) = θ ∧ ∂A φ,

∗ is given by the expression The projection endomorphism of S∞ 1 = ◦ h k ◦ hn , (φ) = (−1)| | θ A ∧ [d (∂A φ)], k | |≥0

k>0

>0,n φ ∈ S∞ ,

similar to (2.3). The graded variational operator δ = ◦d is introduced. Then the BGDA ∗ is split into the Z -graded variational bicomplex S∞ 2 ∗,∗ k,n (O∗ (X), S∞ , Ek = (S∞ ); d, dH , dV , , δ),

(3.6)

analogous to the variational bicomplex (2.4). 4. Cohomology of Z2 -Graded Complexes We aim to study the cohomology of the short variational complex dH

dH

δ

0 0,1 0,n 0 −→ R −→ S∞ −→ S∞ · · · −→ S∞ −→ E1

(4.1)

and the complex of one-contact graded forms dH

dH

1,0 1,1 1,n 0 → S∞ −→ S∞ · · · −→ S∞ −→ E1 → 0

(4.2)

∗ . One can think of the elements of the BGDA S∞ 0,n L = Lω ∈ S∞ , δ(L) = (−1)| | θ A ∧ d (∂A L) ∈ E1 | |≥0

of the complexes (4.1) – (4.2) as being a graded Lagrangian and its Euler–Lagrange operator, respectively.

Lagrangian Supersymmetries Depending on Derivatives

113

Theorem 4.1. The cohomology of the complex (4.1) equals the de Rham cohomology H ∗ (X) of X. The complex (4.2) is exact. The proof of Theorem 4.1 follows the scheme of the proof of Theorem 2.1, but all sheaves are sheaves over X. The proof falls into the three steps. (i) We start by showing that the complexes (4.1) – (4.2) are locally exact. Lemma 4.2. The complex (4.1) on X = Rn is exact. Referring to [5], Theorems 4.1 – 4.2, for the proof, we summarize a few formulae 0,∗ quoted in the sequel. Any horizontal graded form φ ∈ S∞ admits the decomposition , φ = φ0 + φ

= φ

1 0

dλ A s ∂A φ, λ

(4.3)

| |≥0

0,m
D

+ν

1

φ= 0

dλ ν α1 α µ ...µ µ A A (x , λs kδ(µ1 δµ2 · · · δµk−1 λs(α ∂ 1 kφ , dx µ ). (4.4) 1 ...αk−1 ) A k) λ k≥0

holds, and leads to the desired expression = δµν φ The relation [D +ν , dµ ]φ ξ=

(n − m − 1)! k=0

(n − m + k)!

, D +ν Pk ∂ν φ

P0 = 1,

Pk = dν1 · · · dνk D +ν1 · · · D +νk . (4.5)

0,m
∗ are polynomials in s A , the sum in the expression (4.5) Remark 4.1. Since elements of S∞ is finite. However, the expression (4.5) contains a dH -exact summand which prevents its ∗ . In this respect, we also quote the homotopy operator (5.107) in [35] extension to O∞ which leads to the expression

1 ξ =

A I (φ)(x µ , λs , dx µ ) 0

I (φ) =

| |≥0 µ

dλ , λ

(4.7)

µ + 1 (µ + + )! A µ+ + s d ∂ A (−1) (∂µ φ)], d [ (µ + )! ! n−m + | |+1 | |≥0

(4.8) where ! = µ1 ! · · · µn ! and µ denotes the number of occurrences of the index µ in . The graded forms (4.6) and (4.7) differ in a dH -exact graded form.

114

G. Giachetta, L. Mangiarotti, G. Sardanashvily

Lemma 4.3. The complex (4.2) on X = Rn is exact. 1,m
whose coefficients are linear in the variables s A , and vice versa. Let us consider the modified total differential d H = dH + dx λ ∧ sA λ+ ∂ A , | |>0

acting on graded forms (4.10), where ∂ A is the dual of ds A . Comparing the equality λsA = dx and the last equality (2.13), one can easily justify that dH φ = d H φ. dH sA λ+ Let a graded (1, m)-form φ (4.9) be dH -closed. Then the associated horizontal graded m-form φ (4.10) is d H -closed and, by virtue of Lemma 4.2, it is d H -exact, i.e., φ = d H ξ , where ξ is a horizontal graded (m − 1)-form given by the expression (4.5) depending on additional variables s A A glance at this expression shows that, since φ is linear in . A. , so is ξ = ξA s A ξA ∧ θ the variables s A . It follows that φ = dH ξ , where ξ = It remains to prove the exactness of the complex (4.2) at the last term E1 . If (σ ) = (−1)| | θ A ∧ [d (∂A σ )] | |≥0

=

| |≥0

(−1)| | θ A ∧ [d σA ]ω = 0,

1,n σ ∈ S∞ ,

a direct computation gives σ = dH ξ,

ξ =−

| |≥0 + =

(−1)|| θ A ∧ d σA

µ+

ωµ .

(4.11)

Remark 4.2. The proof of Lemma 4.3 fails to be extended to complexes of higher contact A ∧ θ B and s A s B obey different commutation rules. forms because the products θ (ii) Let us associate to each open subset U ⊂ X the BGDA SU∗ of elements of the ∗ whose coefficients are restricted to U . These algebras make up a S∞ presheaf over X. Let S∗∞ be the sheaf of germs of this presheaf and (S∗∞ ) its structure module of sections. One can show that S∗∞ inherits the variational bicomplex operations, and (S∗∞ ) does so (see Appendix C). For short, we say that (S∗∞ ) consists of C ∞ (X)-algebra

Lagrangian Supersymmetries Depending on Derivatives

115

a , ds a of locally bounded jet order | |. There is the monomorphism polynomials in s ∗ ∗ S∞ → (S∞ ). Let us consider the complexes of sheaves dH

dH

δ

0,n 0 −→ R −→ S0∞ −→ S0,1 ∞ · · · −→ S∞ −→ E1 , dH

dH

E1 = (S1,n ∞ ), (4.12)

1,1 1,n 0 → S1,0 ∞ −→ S∞ · · · −→ S∞ −→ E1 → 0

(4.13)

over X and the complexes of their structure modules dH

dH

δ

0,n 0 −→ R −→ (S0∞ ) −→ (S0,1 ∞ ) · · · −→ (S∞ ) −→ (E1 ), dH

dH

1,1 1,n 0 → (S1,0 ∞ ) −→ (S∞ ) · · · −→ (S∞ ) −→ (E1 ) → 0.

(4.14) (4.15)

By virtue of Lemmas 4.2 – 4.3 and Theorem 8.3, the complexes of sheaves (4.12) – (4.13) ∞ are exact. The terms S∗,∗ ∞ of the complexes (4.12) – (4.13) are sheaves of C (X)-modules. Therefore, they are fine and, consequently, acyclic. By virtue of Theorem 8.4 (see Appendix D), the cohomology of the complex (4.14) equals the cohomology of X with coefficients in the constant sheaf R, i.e., the de Rham cohomology H ∗ (X) of X, whereas the complex (4.15) is globally exact. (iii) It remains to prove the following. Proposition 4.4. Cohomology of the complexes (4.1) – (4.2) equals that of the complexes (4.14) – (4.15). Let the common symbol D stand for the operators dH , δ and in the complexes ∗ denote the terms of these complexes. Since cohomology (4.14) – (4.15), and let ∞ groups of these complexes are either trivial or equal to the de Rham cohomology of X, ∗ takes the form one can say that any D-closed element φ ∈ ∞ φ = ψ + Dξ,

∗ ξ ∈ ∞ ,

(4.16)

where ψ is a closed exterior form on X which is not necessarily exact. Since all D-closed ∗ of finite jet order are also of form (4.16), it suffices to show that, if an elements of ∞ ∗ is D-exact in the module ∗ (i.e., φ = Dξ , ξ ∈ ∗ ), then it is also element φ ∈ S∞ ∞ ∞ ∗ ∗ ). in S∞ (i.e., φ = Dϕ, ϕ ∈ S∞ ∗ be D-exact in ∗ . Let X be a (contractible) domain, and let an element φ ∈ S∞ ∞ ∗ in accordance with Lemmas 4.2 and 4.3. Then, being D-closed, it is D-exact in S∞ Moreover, a glance at the expressions (4.5), (4.6) and (4.8) shows that the maximal jet order [ϕ] of ϕ is bounded by an integer N ([φ]) which depends only on the maximal jet order [φ] of φ. It follows that, if φ = Dϕ is an arbitrary D-exact form of the jet order less than k, then the jet order of ϕ does not exceed N (k). We agree to call this fact the finite exactness of an operator D. Let X be an arbitrary manifold and U a domain of X. By virtue of Lemmas 4.2 and ∗ | (or, roughly speaking, the operator D on 4.3, the restriction of the operator D to ∞ U U ) has the finite exactness property. Let us state the following. Lemma 4.5. Given a family {Uα } of disjoint open subsets of X, let us suppose that the finite exactness of the operator D takes place on each subset Uα separately. Then D on the union ∪ Uα also has the finite exactness property. α

∗ be a D-exact graded form on X. The finite exactness on ∪U holds Proof. Let φ ∈ S∞ α since φ = Dϕα on every Uα and all [ϕα ] < N ([φ]).

116

G. Giachetta, L. Mangiarotti, G. Sardanashvily

Lemma 4.6. Suppose that the finite exactness of an operator D takes place on open subsets U , V of X and their non-empty overlap U ∩ V . Then it is also true on U ∪ V . ∗ be a D-exact graded form on X. By assumption, it can be Proof. Let φ = Dϕ ∈ S∞ brought into the form DϕU on U and DϕV on V , where ϕU and ϕV are graded forms of bounded jet order. Due to the decomposition (4.16), one can choose the forms ϕU , ϕV such that ϕ − ϕU on U and ϕ − ϕV on V are D-exact. Let us consider the difference ϕU − ϕV on U ∩ V . It is a D-exact graded form of bounded jet order which, by assumption, can be written as ϕU − ϕV = Dσ , where σ is also of bounded jet order. Lemma 4.7 below shows that σ = σU + σV , where σU and σV are graded forms of bounded jet order on U and V , respectively. Then, putting

ϕU = ϕU − DσU ,

ϕV = ϕV + DσV ,

we have the graded form φ, equal to DϕU on U and DϕV on V , respectively. Since the difference ϕU − ϕV on U ∩ V vanishes, we obtain φ = Dϕ on U ∪ V , where def ϕ |U = ϕU ϕ = ϕ |V = ϕV is of bounded jet order.

Lemma 4.7. Let U and V be open subsets of X and σ a graded form of bounded jet order on U ∩ V . Then σ splits into the sum σU + σV of graded exterior forms σU on U and σV on V of bounded jet order. Proof. By taking a smooth partition of unity on U ∪ V subordinate to its cover {U, V } and passing to the function with support in V , we get a smooth real function f on U ∪ V which is 0 on a neighborhood UU −V of U − V and 1 on a neighborhood UV −U of V − U in U ∪ V . The graded form f σ vanishes on UU −V ∩ (U ∩ V ) and, therefore, can be extended by 0 to U . Let us denote it σU . Accordingly, the graded form (1 − f )σ has an extension σV by 0 to V . Then σ = σU + σV is a desired decomposition because σU and σV are of finite jet order which does not exceed that of σ . Lemma 9.5 in [10], Chapter V, states that, if some property holds on a domain and obeys the conditions of Lemmas 4.5 and 4.6, it holds on any open subset of Rn . Hence, the operator D has the jet exactness property on any open subset of Rn and, consequently, on any chart of the fiber bundle Q×X Y → X. Since the latter admits a finite bundle atlas with the transition functions (2.1) and (3.2) preserving the jet order, the finite exactness of D takes place on the whole manifold X in accordance with Lemma 4.6. This proves Proposition 4.4 and, consequently, Theorem 4.1. Remark 4.3. Let us consider the complex d

d

0 1 k 0 → R −→ S∞ −→ S∞ · · · −→ S∞ −→ · · · ,

(4.17)

∗ , d) is the differential calculus which we agree to call the de Rham complex because (S∞ 0 n over the R-ring S∞ . If X = R , it is exact ([14], Theorem 3.1). Similarly to the proof of Theorem 4.1, one can show that the cohomology of the de Rham complex (4.17) equals the de Rham cohomology of X.

Lagrangian Supersymmetries Depending on Derivatives

117

0,m
φ = ψ + dH ξ,

0,m−1 ξ ∈ S∞ ,

(4.18)

where ψ is a closed m-form on X. Every δ-closed graded Lagrangian L ∈ sum φ = ψ + dH ξ,

0,n−1 ξ ∈ S∞ ,

0,n S∞

is the (4.19)

where ψ is a non-exact n-form on X. 1,n The global exactness of the complex (4.2) at the term S∞ results in the following.

Proposition 4.9. Given a graded Lagrangian L = Lω, there is the decomposition 1,n−1 dL = δL − dH ,

∈ S∞ , (4.20) νk ...ν1 νk ...ν1 λνk ...ν1 νk ...ν1 λνs ...ν1 A

= θνs ...ν1 ∧ FA ωλ , FA = ∂A L − d λ FA + hA ,(4.21) s=0 (ν νk−1 )...ν1

where local graded functions h obey the relations hνa = 0, ha k

= 0.

Proof. The decomposition (4.20) is a straightforward consequence of the exactness of 1,n the complex (4.2) at the term S∞ and the fact that is a projector. The coordinate expression (4.21) results from a direct computation −dH = −dH [θ A FAλ + θνA FAλν + · · · + θνAs ...ν1 FAλνs ...ν1 λνs+1 νs ...ν1

+θνAs+1 νs ...ν1 ∧ FA = = =

+ · · · ] ∧ ωλ

[θ + θνA (FAν + dλ FAλν ) ν ν ...ν λν ν ...ν + · · · + θνAs+1 νs ...ν1 (FAs+1 s 1 + dλ FA s+1 s 1 ) + · · · ] ∧ ω ν ν ...ν [θ A dλ FAλ + θνA (∂Aν L) + · · · + θνAs+1 νs ...ν1 (∂As+1 s 1 L) + · · · ] ∧ ω θ A (dλ FAλ − ∂A L) ∧ ω + dL = −δL + dL. A

dλ FAλ

Proposition 4.9 states the existence of a global finite order Lepagean equivalent

L = + L of any graded Lagrangian L. Locally, one can always choose (4.21) where all functions h vanish. 5. Contact Supersymmetries 0 of the R-ring S 0 is said to be an infinitesimal contact A graded derivation υ ∈ dS∞ ∞ supertransformation or, simply, a contact supersymmetry if the Lie derivative Lυ pre∗ (i.e., the Lie derivative L of serves the ideal of contact graded forms of the BGDA S∞ υ a graded contact form is a graded contact form). A , dx λ , θ A ) for the BGDA S ∗ , Proposition 5.1. With respect to the local basis (x λ , s ∞ any contact supersymmetry takes the form υ = υH + υV = υ λ dλ + (υ A ∂A + d υ A ∂A ), (5.1) | |>0

where

υ λ,

υA

are local graded functions.

118

G. Giachetta, L. Mangiarotti, G. Sardanashvily

∗ can be identified as sections of a Proof. The key point is that, since elements of C∞ ∗ . finite-dimensional vector bundle over X, so can elements of the C ∞ (X)-algebra S∞ 0 Moreover, any graded form is a finite composition of df , f ∈ S∞ . Therefore, the proof follows that of Proposition 2.3. ∗ are defined by the The interior product υ φ and the Lie derivative Lυ φ, φ ∈ S∞ same formulae

υ φ = υ λ φλ + (−1)[φA ] υ A φA ,

1 φ ∈ S∞ ,

υ (φ ∧ σ ) = (υ φ) ∧ σ + (−1)|φ|+[φ][υ] φ ∧ (υ σ ), Lυ φ = υ dφ + d(υ φ),

∗ φ, σ ∈ S∞ ,

Lυ (φ ∧ σ ) = Lυ (φ) ∧ σ + (−1)[υ][φ] φ ∧ Lυ (σ ),

as those on a graded manifold. Following the proof of Lemma 2.4, one can justify that any vertical contact supersymmetry υ (5.1) satisfies the relations υ dH φ = −dH (υ φ), Lυ (dH φ) = dH (Lυ φ),

φ∈

∗ S∞ .

(5.2) (5.3)

0,n , its Lie derivative Lυ L along a Proposition 5.2. Given a graded Lagrangian L ∈ S∞ contact supersymmetry υ (5.1) fulfills the first variational formula

Lυ L = υV δL + dH (h0 (υ

L )) + dV (υH ω)L,

(5.4)

where L = + L is a Lepagean equivalent of L given by the coordinate expression (4.21). Proof. The proof follows that of Proposition 2.5 and results from the decomposition (4.20) and the relation (5.2). In particular, let υ be a variational symmetry of a graded Lagrangian L, i.e., Lυ L = 0,n−1 dH σ , σ ∈ S∞ . Then the first variational formula (5.4) restricted to Ker δL leads to the weak conservation law 0 ≈ dH (h0 (υ

L ) − σ ).

(5.5)

Remark 5.1. Let us consider the gauge theory of principal connections on a principal bundle P → X with a structure Lie group G. These connections are represented by sections of the quotient C = J 1 P /G → X [32]. This is an affine bundle coordinated by (x λ , aλr ) such that, given a section A of C → X, its components Arλ = aλr ◦ A are coefficients of the familiar local connection form (i.e., gauge potentials). Let J ∞ C be r ), 0 ≤ | |, and let the infinite order jet manifold of C → X coordinated by (x λ , aλ, ∗ ∗ P∞ (C) be the polynomial subalgebra of the GDA O∞ (C). Infinitesimal generators of one-parameter groups of vertical automorphisms (gauge transformations) of a principal bundle P are G-invariant vertical vector fields on P → X. They are associated to sections of the vector bundle VG P = V P /G → X of right Lie algebras of the group G. Let us consider the simple graded manifold (X, AVG Y ) constructed from this vector bundle. Its local basis is (x λ , C r ). Let CJ∗ r VG Y be the BGDA of graded exterior forms on ∗ (V P ) the direct limit of the direct system the graded manifold (X, AJ r VG P ), and C∞ G (1.4) of these algebras. Then the graded product ∗ ∗ ∗ S∞ (VG , C) = C∞ (VG P ) ∧ P∞ (C)

(5.6)

Lagrangian Supersymmetries Depending on Derivatives

119

describes gauge potentials, odd ghosts and their jets in the BRST theory. With respect to ∗ (V , C) (5.6), the BRST symmetry is given a local basis (x λ , aλr , C r ) for the BGDA S∞ G by the contact supersymmetry (d υλr ∂r ,λ + d υ r ∂r ), (5.7) υ = υλr ∂rλ + υ r ∂r + | |>0

υλr =

p r Cλr + cpq aλ C q ,

1 r p q υ r = − cpq C C , 2

r are structure constants of the Lie algebra of G and ∂ λ , ∂ , ∂ ,λ ∂ are the where cpq r r r r r r , respectively. A remarkable peculiarity of this contact duals of daλr , dC r , da ,λ and dC supersymmetry is that the Lie derivative Lυ along υ (5.7) is nilpotent on the module 0,∗ S∞ of horizontal graded forms.

In a general setting, a vertical contact supersymmetry υ (5.1) is said to be nilpotent if Lυ (Lυ φ) =

||≥0,| |≥0

B A (υ ∂B (υ )∂A + (−1)[s

B ][υ A ]

B A υ υ ∂B ∂A )φ = 0

(5.8)

0,∗ for any horizontal graded form φ ∈ S∞ .

Lemma 5.3. A contact supersymmetry υ is nilpotent iff it is odd and the equality B A Lυ (υ A ) = υ ∂B (υ ) = 0 ||≥0

holds for all υ A . Proof. There is the relation i λ i λ ∂i = v ∂i ◦ dλ , dλ ◦ v

(5.9)

similar to (2.14). Then the lemma follows from the equality (5.8) where one puts φ = s A AsB . and φ = s Remark 5.2. A useful example of a nilpotent contact supersymmetry is the supersymmetry υ = υ A (x)∂A + ∂ υ A ∂A , (5.10) | |>0

A , but all s A are odd. where all υ A are smooth real functions on X, ∂A are the duals of ds

6. Cohomology of Nilpotent Contact Supersymmetries Let υ be a nilpotent contact supersymmetry. Since the Lie derivative Lυ obeys the rela0,∗ tion (5.3), let us assume that the R-module S∞ of graded horizontal forms is split into a bicomplex {S k,m } with respect to the nilpotent operator sυ (1.5) and the total differential dH which obey the relation

120

G. Giachetta, L. Mangiarotti, G. Sardanashvily

sυ ◦ dH + dH ◦ sυ = 0.

(6.1)

This bicomplex dH : S k,m → S k,m+1 ,

sυ : S k,m → S k+1,m

is graded by the form degree 0 ≤ m ≤ n and an integer k ∈ Z, though it may happen that S k,∗ = 0 starting from some number k. For the sake of brevity, let us call k the charge number. 0,∗ For instance, the BRST bicomplex S∞ (C, VG P ) is graded by the charge number r . In this case, s k which is the polynomial degree of its elements in odd variables C υ r (1.5) is the BRST operator. Since the ghosts C are characterized by the ghost number 1, k ∈ N, is the ghost number. The bicomplex defined by the contact supersymmetry (5.10) has the similar gradation, but taken with the sign minus (i.e., k = 0, −1, . . . ) because the nilpotent operator sυ decreases the odd polynomial degree. Let us consider the relative and iterated cohomology of the nilpotent operator sυ (1.5) with respect to the total differential dH . Recall that a horizontal graded form φ ∈ S ∗,∗ is said to be a relative closed form, i.e., (sυ /dH )-closed form if sυ φ is a dH -exact form. This form is called exact if it is a sum of an sυ -exact form and a dH -exact form. Accordingly, we have the relative cohomology H ∗,∗ (sυ /dH ). If a (sυ /dH )-closed form φ is also dH -closed, it is called an iterated (sυ |dH )-closed form. This form φ is said to be exact if φ = sυ ξ + dH σ , where ξ is a dH -closed form. Thus, we obtain the iterated cohomology H ∗,∗ (sυ |dH ) of the (sυ , dH )-bicomplex S ∗,∗ . It is the term E2∗,∗ of the spectral sequence of this bicomplex [31]. There is an obvious isomorphism H ∗,n (sυ /dH ) = H ∗,n (sυ |dH ) of relative and iterated cohomology groups on horizontal graded densities. Forthcoming Theorems 6.2 and 6.5 extend our results on iterated cohomology in [21] to an arbitrary nilpotent contact supersymmetry. Proposition 6.1. Let us consider the complex dH

dH

dH

0 0,1 0,n 0 −→ R −→ S∞ −→ S∞ · · · −→ S∞ −→ 0.

(6.2)

Its cohomology groups H m
(6.3)

Proof. The complex (6.2) differs from the short variational complex (4.1) in the last term. Therefore its cohomology H m
(6.4)

of the de Rham cohomology H m (X) of X of form degree less than n onto the iterated cohomology H ∗,m
Lagrangian Supersymmetries Depending on Derivatives

121

Proof. Since a nilpotent contact supersymmetry υ is vertical, all exterior forms φ on X are sυ -closed. It follows that d-cocycles on X are (sυ |dH )-closed. Since any dH -exact horizontal graded form is also (sυ |dH )-exact, we have a morphism ζ (6.4). By virtue of Corollary 4.8 (and, equivalently, Proposition 6.1), any dH -closed horizontal graded (m < n)-form φ is split into the sum φ = ϕ + dH ξ (4.18) of a closed m-form ϕ on X and a dH -exact graded form. Therefore, any (sυ |dH )-cocycle is the sum of a closed exterior form on X and a dH -exact graded form. It follows that the morphism ζ (6.4) is an epimorphism. The kernel of the morphism ζ (6.4) consists of elements whose representatives are sυ -exact closed exterior forms on X. In particular, if X = Rn , the iterated cohomology H ∗,0<m
(6.5)

of the de Rham cohomology H
Theorem 6.5. Put H = H ∗ ( sυ )/Im γ , where the asterisk means the total charge. There is an isomorphism ∗

H ∗,n (sυ |dH )/H = Ker γ .

(6.6)

122

G. Giachetta, L. Mangiarotti, G. Sardanashvily

Proof. The proof falls into the following three steps. (i) First, we show the existence of a morphism η : H ∗,n (sυ |dH ) → Ker γ

(6.7)

from the iterated cohomology group H ∗,n (sυ |dH ) to Ker γ . Consider a horizontal graded n-form φn which is (sυ |dH )-closed. Then, by definition, sυ φn is dH -exact, i.e., sυ φn + dH φn−1 = 0.

(6.8)

Acting on this equality by sυ , we observe that sυ φn−1 is a dH -closed graded form, i.e., sυ φn−1 + dH φn−2 = ϕn−1 ,

(6.9)

where ϕn−1 is a closed (n − 1)-form on X in accordance with Corollary 4.8. Since sυ ϕn−1 = 0, an action of sυ on Eq. (6.9) shows that sυ φn−2 is a dH -closed graded form, i.e., sυ φn−2 + dH φn−3 = ϕn−2 , where ϕn−2 is a closed (n − 2)-form on X. Iterating the arguments, one comes to the system of equations sυ φn−k + dH φn−k−1 = ϕn−k ,

0 ≤ k < n,

sυ φ0 = ϕ0 = const, (6.10)

which can be assembled into descent equations = ϕ, sυ φ = φn + φn−1 + · · · + φ0 , φ

ϕ = ϕn−1 + · · · + ϕ0 .

(6.11) (6.12)

Thus, any (sυ |dH )-closed horizontal graded form defines descent equations (6.11) whose right-hand sides ϕ are closed exterior forms on X such that their de Rham classes belong to the kernel Ker γ of the morphism (6.5). For the sake of brevity, let us denote these (6.12) descent equations by ϕ . Accordingly, we say that a horizontal graded form φ is a solution of descent equations ϕ (6.12). Descent equations defined by a (sυ |dH ) be another solution of another closed horizontal graded form φn are not unique. Let φ set of descent equations ϕ such that φn = φn . Let us denote φk = φk − φk and ϕk = ϕk − ϕk . Then the equations (6.8) lead to the equation dH (φn−1 ) = 0. It follows from Corollary 4.8 that φn−1 = dH ξn−2 + αn−1 ,

(6.13)

where αn−1 is a closed (n − 1)-form on X. Accordingly, Eq. (6.10) leads to the equation sυ (φn−1 ) + dH (φn−2 ) = ϕn−1 . Substituting the equality (6.13) into this equation and bearing in mind the relation (6.1), we obtain the equality dH (−sυ ξn−2 + φn−2 ) = ϕn−1 . It follows that φn−2 = sυ ξn−2 + dH ξn−3 + αn−2 ,

ϕn−1 = dαn−2 ,

Lagrangian Supersymmetries Depending on Derivatives

123

where αn−2 is an exterior form on X. Iterating the arguments, one comes to the relations φn−k = sυ ξn−k + dH ξn−k−1 + αn−k ,

ϕn−k = dαn−k−1 ,

1 < k < n, (6.14)

where αn−k−1 are exterior forms on X and, finally, to the equalities φ0 = 0, ϕ0 = 0. Then it is easily justified that − φ = sυ σ + α, σ = ξn−2 + · · · ξ1 , φ ϕ− ϕ = d α, α = αn−1 + · · · + α1 .

(6.15) (6.16)

It follows that right-hand sides of any two descent equations defined by a (sυ |dH )-closed horizontal graded form φn differ from each other in an exact form on X. Moreover, let φn and φn be representatives of the same iterated cohomology class in H ∗,n (sυ |dH ), of i.e., φn = φn + sυ ψ + dH β, where ψ is dH -closed. Let φn provide a solution φ = φ + a descent equation ϕ . Then φn defines a solution φ sυ (ψ + β) of the same descent equation. Thus, the assignment φn → ϕ yields the desired morphism η (6.7). (ii) Let us show that the morphism η (6.7) is an epimorphism. Let ϕ be a closed exterior form on X whose de Rham cohomology class belongs to Ker γ . Let ϕ = ϕn−1 +· · ·+ϕ0 be its decomposition in k-forms ϕk , k = 1, . . . , n − 1. Then the family of exterior forms (ϕk ) yields a system of the equations (6.10) which can be assembled into the descent exists because equations ϕ (6.11). Its solution φ ϕ ∈ Ker γ . Let ϕ differ from ϕ in of the equation an exact form, i.e., let the relation (6.16) hold. Then any solution φ ϕ = φ − yields a solution φ α (6.15) of the equation ϕ such that φn = φn . It follows that the morphism η (6.7) is an epimorphism. (iii) The kernel of the morphism η (6.7) is represented by (sυ |dH )-closed horizontal graded forms φn which yield homogeneous descent equations = 0. sυ φ

(6.17)

sυ ) onto Ker η. For this purLet us define an epimorphism of the total cohomology H ∗ ( its higher term φn . The latter defines homopose, let us associate to each sυ -cocycle φ , i.e., φn ∈ Ker η. Let φ = geneous descent equations (6.17) whose solution is φ sυ ψ be a sυ -coboundary. Its higher term φn takes the form φn = sυ ψn + dH ψn−1 , i.e., it → φn provides the desired is an iterated coboundary. It follows that the assignment φ ∗ epimorphism τ : H ( sυ ) → Ker η. The kernel of this epimorphism is represented by of the descent equation (6.17) whose higher term vanishes. Following item solutions φ = (i), one can easily show that these solutions take the form φ sυ σ + α , where α is a closed exterior form on X of form degree < n. Cohomology classes of these solutions exhaust the image of the morphism γ (6.5), i.e., Im γ = Ker τ . In particular, if the morphism γ (6.5) is a monomorphism (i.e., no non-exact closed exterior form on X is sυ -exact), the isomorphism (6.6) gives the isomorphism sυ )/H
124

G. Giachetta, L. Mangiarotti, G. Sardanashvily

7. Conclusion In the present work, we follow the algebraic topological approach to describing Lagrangian field theories in terms of the variational bicomplex. This enables us to extend the cohomology analysis of Lagrangian BRST theory on Rn to a generic contact supersymmetry on an arbitrary manifold X. Since only vector and affine bundles over X are involved, the corresponding cohomology characteristic is represented by the de Rham cohomology of X. In a general case of nilpotent contact supersymmetry sυ , its contribution however is not trivial because exterior forms on X are sυ -closed, but need not be sυ -exact. For instance, this contribution is given by the kernel of the morphism γ (6.5) in Theorem 6.5. Our analysis seems important for BV quantization of field systems with non-contractible topologies, e.g., gravitation theory and topological field models. We also bear in mind the extension of BV quantization to field systems where parameters of gauge transformations may depend on field variables and their derivatives [18]. For instance, this is the case of spinor fields in gauge gravitation theory [37]. 8. Appendixes Appendix A. Since Y is a strong deformation retract of any finite order jet manifold ∗ is easily proved to equal the de Rham J r Y , the de Rham cohomology of the GDA O∞ ∗ cohomology H (Y ) of Y in accordance with Theorem 8.3 [2]. However, we must enlarge ∗ in order to find its d - and δ-cohomology. O∞ H Outline of proof of Theorem 2.1. One starts from the algebraic Poincar´e lemma [35, 43]. Lemma 8.1. If Y is a contractible bundle Rn+p → Rn , the variational bicomplex (2.4) is exact. For instance, the homotopy operators for dV , dH , δ and are given by the formulae (5.72), (5.109), (5.84) in [35] and (4.5) in [43], respectively. Let O∗r be the sheaf of germs of exterior forms on the r-order jet manifold J r Y , and ∗ let Or be its canonical presheaf. There is the direct system of presheaves ∗

π∗

1∗ ∗ π0

∗

r ∗ πr−1

∗

OX −→ O0 −→ O1 · · · −→ Or −→ · · · . ∗

Its direct limit O∞ is a presheaf of GDAs on the infinite order jet manifold J ∞ Y . Let T∗∞ ∗ be the sheaf of GDAs on J ∞ Y constructed from the presheaf O∞ , i.e., T∗∞ is the sheaf ∗ of germs of O∞ (we follow the terminology of [28]). The structure module (T∗∞ ) of ∗ sections of T∞ is a GDA such that, given an element φ ∈ (T∗∞ ) and a point z ∈ J ∞ Y , there exist an open neighbourhood U of z and an exterior form φ (k) on some finite order jet manifold J k Y so that φ|U = πk∞∗ φ (k) |U . In particular, there is the monomorphism ∗ → (T∗ ). The fact that the paracompact space J ∞ Y admits a partition of unity O∞ ∞ by elements of the ring (T0∞ ) [42], enables one to obtain dH - and δ-cohomology of (T∗∞ ) as follows [1, 2, 41, 42]. The sheaf T∗∞ is split into the bicomplex T∗,∗ ∞ . Let us consider its variational subcomplex and the complexes of sheaves of contact forms dH

dH

δ

δ

0,n 0 → R → T0∞ −→ T0,1 ∞ · · · −→ T∞ −→ E1 −→ E2 −→ · · · , dH

dH

k,1 k,n 0 → Tk,0 ∞ −→ T∞ · · · −→ T∞ −→ Ek → 0,

Ek = (Tk,n ∞ ), (8.1) (8.2)

Lagrangian Supersymmetries Depending on Derivatives

125

together with complexes of their structure modules dH

dH

δ

δ

0,n 0 → R → (T0∞ ) −→ (T0,1 ∞ ) · · · −→ (T∞ ) −→ (E1 ) −→ (E2 ) −→ · · · , (8.3) dH

dH

k,1 k,n 0 → (Tk,0 ∞ ) −→ (T∞ ) · · · −→ (T∞ ) −→ (Ek ) → 0.

(8.4)

By virtue of Lemma 8.1 and Theorem 8.3, the complexes (8.1) – (8.2) are exact. Since 0 T∗,∗ ∞ are sheaves of (T∞ )-modules, they are fine. The sheaves Ek are also proved to be fine [21, 41]. Consequently, all sheaves, except R, in the complexes (8.1) – (8.2) are acyclic. Therefore, these complexes are the resolutions of the constant sheaf R and the zero sheaf over J ∞ Y , respectively. In accordance with the abstract de Rham theorem ([28], Theorem 2.12.1), cohomology of the complex (8.3) equals the cohomology of J ∞ Y with coefficients in R, while the complex (8.4) is exact. Since Y is a strong deformation retract of J ∞ Y [2, 22], cohomology of the complex (8.3) is isomorphic to the de Rham cohomology of Y . Note that, in order to prove the exactness of the complex (8.4), one can use a minor generalization of the above mentioned abstract de Rham theorem (see Appendix D), and need not justify the acyclicity of the sheaves Ek [42]. ∗ ⊂ (T∗ ) is proved to have the same d - and δ-cohoFinally, the subalgebra O∞ H ∞ k,n ∗ mology as (T∞ ) [21, 41]. Similarly, one can show that, restricted to O∞ , the operator remains exact. The following is a corollary of item (i) of Theorem 2.1 (cf. Corollary 4.8). Corollary 8.2. Every dH -closed form φ ∈ O0,m
0,m−1 ξ ∈ O∞ ,

where ψ is a closed m-form on Y . Every δ-closed Lagrangian L ∈ L = h0 ψ + dH ξ,

ξ∈

0,n−1 O∞ ,

(8.5) 0,n O∞

is the sum (8.6)

where ψ is a closed n-form on Y . Note that the formulae (8.5) – (8.6) were obtained in [1] by computing cohomology of the fixed order variational sequence, but the proof of the local exactness of this sequence requires rather sophisticated ad hoc techniques. Appendix B. Let us consider the open surjection π ∞ : J ∞ Y → X and the direct image ∗ T∗ on X of the sheaf T∗ of exterior forms on J ∞ Y . Its stalk at a point x ∈ X conπ∞ ∞ ∞ sists of the equivalence classes of sections of the sheaf T∗∞ which coincide on the inverse images (π ∞ )−1 (Ux ) of open neighbourhoods Ux of X. Since (π ∞ )−1 (Ux ) is the infinite order jet manifold of sections of the fiber bundle π −1 (Ux ) → X, every point x ∈ X has a base of open neighbourhoods {Ux } such that the sheaves T∗,∗ ∞ and Ek in the proof of Theorem 2.1 are acyclic on the inverse images (π ∞ )−1 (Ux ) of these neighbourhoods. Then, in accordance with the Leray theorem [23], cohomology of J ∞ Y with coefficients in the sheaves T∗,∗ ∞ and Ek is isomorphic to that of X with coefficients in their direct ∗ T∗,∗ and π ∗ E , i.e., the sheaves π ∗ T∗,∗ and π ∗ E over X are acyclic. Let images π∞ ∞ ∞ k ∞ ∞ ∞ k Y → X be an affine bundle. Then X is a strong deformation retract of J ∞ Y . In this case, the inverse images (π ∞ )−1 (Ux ) of contractible neighbourhoods Ux are contractible and π∗∞ R = R. Then, by virtue of Lemma 8.1, the variational bicomplex T∗∞ of sheaves over (π ∞ )−1 (Ux ) is exact, and the variational bicomplex π∗∞ T∗∞ of sheaves over X is so. There is an R-algebra isomorphism of the GDA of sections of the sheaf π∗∞ T∗∞ over ∗ can be regarded X to the GDA (T∗∞ ). Thus, the GDA (T∗∞ ) and its subalgebra O∞ as algebras of sections of a sheaf over X.

126

G. Giachetta, L. Mangiarotti, G. Sardanashvily

Appendix C. Let us associate to each open subset U ⊂ X the bigraded algebra SU∗ ∗ whose coefficients are restricted to U . These of elements of the C ∞ (X)-algebra S∞ algebras make up a presheaf

SU∗ , rVU | rVU : SU∗ → SV∗

(8.7)

over X. Let S∗∞ be the sheaf constructed from this presheaf. Its stalk S∗x at a point x ∈ X is the direct limit of the direct system of R-modules {SU∗ , rVU }, indexed by the directed ∗ set of open neighbourhoods U of x. This stalk consists of the germs of elements of S∞ at x, i.e., elements of the presheaf (8.7) are identified if their restrictions (namely, the restrictions of their coefficients) to some open neighbourhood of x coincide with each ∗ on X = Rn which consists of elements other. Let Sc∗ be the subalgebra of the BGDA S∞ with constant coefficients. Then S∗x is the stalk of germs of Sc∗ -valued functions on X. It is a bigraded algebra isomorphic to the tensor product Cx∞ ⊗R Sc∗ of the R-algebra Cx∞ of the germs of smooth real functions on X at x and the R-algebra Sc∗ . This stalk is naturally decomposed into Cx∞ -modules Sk,m of the germs of graded (k, m)-forms on x X. Let the common symbol stand for all the operators (d,dH , dV , and δ) on the ∗ . It is an R-module morphism whose restrictions to S ∗ intertwined by BGDA S∞ U U the restriction morphisms rVU (8.7) constitute the direct system of morphisms

U , rVU | rVU ◦ U = V ◦ rVU ,

(8.8)

indexed by the directed set of open neighbourhoods U of x. Its direct limit x is an R-module morphism of the stalk S∗x . The properties of the direct limit of morphisms are summarized by the following theorem [33]. Theorem 8.3. The direct limit of a direct system of complexes {Ci∗ , i ∈ I } is a complex whose cohomology is the direct limit of that of the complexes Ci∗ . It follows that the stalk S∗x is a BGDA which contains the complexes correspond∗ . Since, ing to the subcomplexes of the variational bicomplex (3.6) of the BGDA S∞ dx = dH x + dV x and the operators dx , dH x , dV x are nilpotent, we have a bicomplex ∗ ∞ ∗ S∗,∗ x . Moreover, the vertical differential dV x on Sx = Cx ⊗R Sc comes from the oper∗ ator dV on Sc . Therefore, δx = x ◦ dV x and the subcomplexes of S∗x can be assembled into the variational bicomplex. Accordingly, the sheaf S∗∞ of germs of graded exterior forms on X constitutes the variational bicomplex ∗ k,n (OX , S∗,∗ ∞ , Ek = (S∞ ); d, dH , dV , , δ),

(8.9)

where the operators on S∗∞ are denoted by the same symbols as those on the BGDA ∗ . S∞ Let (S∗∞ ) be the bigraded algebra of sections of the sheaf S∗∞ . Given an arbitrary section s of (S∗∞ ), there exists an open neighbourhood U of each point x ∈ X such that s|U is an element of the presheaf (8.7). It follows that (S∗∞ ) can be provided with the ∗ which make it into the Z -graded same operators d, dH , dV , and δ as the BGDA S∞ 2 ∗ . The homomorphism S ∗ → (S∗ ) is variational bicomplex, analogous to that of S∞ ∞ ∞ a monomorphism.

Lagrangian Supersymmetries Depending on Derivatives

127

Appendix D. We quote the following minor generalization of the abstract de Rham theorem ([28], Theorem 2.12.1) [20, 22, 42]. Let h

h0

h1

hp−1

hp

0 → S −→ S0 −→ S1 −→ · · · −→ Sp −→ Sp+1 ,

p > 1,

be an exact sequence of sheaves of abelian groups over a paracompact topological space Z, where the sheaves Sq , 0 ≤ q < p, are acyclic, and let h∗

h0∗

h1∗

p−1

h∗

p

h∗

0 → (Z, S) −→ (Z, S0 ) −→ (Z, S1 ) −→ · · · −→ (Z, Sp ) −→ (Z, Sp+1 ) (8.10) be the corresponding cochain complex of structure groups of these sheaves. Theorem 8.4. The q-cohomology groups of the cochain complex (8.10) for 0 ≤ q ≤ p are isomorphic to the cohomology groups H q (Z, S) of Z with coefficients in the sheaf S. Acknowledgement. The authors would like to thank a referee for carefully reading the manuscript and numerous suggestions.

References 1. Anderson, I., Duchamp, T.: On the existence of global variational principles. Am. J. Math. 102, 781–868 (1980) 2. Anderson, I.: Introduction to the variational bicomplex. Contemp. Math. 132, 51–73 (1992) 3. Anderson, I., Kamran, N., Olver, P: Internal, external and generalized symmetries. Adv. Math. 100, 53–100 (1993) 4. Barnich, G., Brandt, F., Henneaux, M.: Local BRST cohomology in the antifield formalism. 1. General theorems. Commun. Math. Phys. 174, 57–91 (1995) 5. Barnich, G., Brandt, F., Henneaux, M.: Local BRST cohomology in gauge theories. Phys. Rep. 338, 439–569 (2000) 6. Bartocci, C., Bruzzo, U., Hern´andez Ruip´erez, D.: The Geometry of Supermanifolds. Dordrecht: Kluwer, 1991 7. Batalin, I., Vilkoviski, G.: Closure of the gauge algebra, generalized Lie algebra equations and Feynman rules. Nucl. Phys. B234, 106–124 (1984) 8. Brandt, F.: Local BRST cohomology and covariance. Commun. Math. Phys. 190, 459–489 (1997) 9. Brandt, F.: Jet coordinates for local BRST cohomology. Lett. Math. Phys. 55, 149–159 (2001) 10. Bredon, G.: Topology and Geometry. Berlin: Springer-Verlag, 1993 11. Bryant, R., Griffiths, P., Grossman, D.: Exterior Differential Systems and Euler–Lagrange Partial Differential Equations. Chicago, IL: Univ. of Chicago Press, 2003 12. Cari˜nena, J., Figueroa, H.: Hamiltonian versus Lagrangian formulations of supermechanics. J. Phys. A 30, 2705–2724 (1997) 13. Cianci, R., Francaviglia, M. Volovich, I.: Variational calculus and Poincar´e–Cartan formalism in supermanifolds. J. Phys. A. 28, 723–734 (1995) 14. Dragon, N.: BRS symmetry and cohomology. http://arxiv.org/list/hep-th/9602163, 1996 15. Dubois-Violette, M., Henneaux, M., Talon, M., Vialett, C.-M.: General solution of the consistence equation. Phys. Lett. B 289, 361–367 (1992) 16. Fatibene, L., Ferraris, M., Francaviglia, M., McLenaghan, R.: Generalized symmetries in mechanics and field theories. J. Math. Phys. 43, 3147–3161 (2002) 17. Franco, D., Polito, C.: Supersymmetric field-theoretic models on a supermanifold. J. Math. Phys. 45, 1447–1473 (2004) 18. Fulp, R., Lada, T., Stasheff, J.: Sh-Lie algebras induced by gauge transformations. Comm. Math. Phys. 231, 25–43 (2002) 19. Fulp, R., Lada, T., Stasheff, J.: Noether variational Theorem II and the BV formalism. Rend. Circ. Mat. Palermo (2) Suppl. (71), 115–126 (2003) 20. Giachetta, G., Mangiarotti, L., Sardanashvily, G.: Cohomology of the variational complex. http://arxiv.org/list/math-ph/0005010, 2000

128

G. Giachetta, L. Mangiarotti, G. Sardanashvily

21. Giachetta, G., Mangiarotti, L., Sardanashvily, G.: Iterated BRST cohomology. Lett. Math. Phys. 53, 143–156 (2000) 22. Giachetta, G., Mangiarotti, L., Sardanashvily, G.: Cohomology of the infinite-order jet space and the inverse problem. J. Math. Phys. 42, 4272–4282 (2001) 23. Godement, R.: Th´eorie des Faisceaux. Paris: Hermann, 1964. 24. Gomis, J., Par´ıs, J., Samuel, S.: Antibracket, antifields and gauge theory quantization. Phys. Rep 295, 1–145 (1995) 25. Gotay, M.: A multisymplectic framework for classical field theory and the calculus of variations. In: Mechanics, Analysis and Geometry: 200 Years after Lagrange. Amsterdam: North Holland, 1991, pp. 203–235 26. Greub, W., Halperin, S., Vanstone, R.: Connections, Curvature, and Cohomology, Vol. 1. NewYork: Academic Press, 1972 27. Hern´andez Ruip´erez, D., Mu˜noz Masqu´e, J.: Global variational calculus on graded manifolds. J. Math. Pures Appl. 63, 283–309 (1984) 28. Hirzebruch, F.: Topological Methods in Algebraic Geometry. Berlin: Springer-Verlag, 1966 29. Ibragimov, N.: Transformation Groups Applied to Mathematical Physics. Boston: Riedel, 1985 30. Krasil’shchik, I., Lychagin, V., Vinogradov, A.: Geometry of Jet Spaces and Nonlinear Partial Differential Equations. New York: Gordon and Breach, 1985 31. Mac Lane, S.: Homology. Berlin: Springer-Verlag, 1967 32. Mangiarotti, L., Sardanashvily, G.: Connections in Classical and Quantum Field Theory. Singapore: World Scientific, 2000 33. Massey, W.: Homology and Cohomology Theory. New York: Marcel Dekker, 1978 34. Monterde, J., Vallejo, J.: The symplectic structure of Euler–Lagrange superequations and Batalin– Vilkoviski formalism. J. Phys. A 36, 4993–5009 (2003) 35. Olver, P.: Applications of Lie Groups to Differential Equations. Berlin: Springer-Verlag, 1986 36. Rennie, A.: Smoothness and locality for nonunital spectral triples. K-Theory 28, 127–165 (2003) 37. Sardanashvily, G.: Covariant spin structure. J. Math. Phys. 39, 2714–2729 (1998) 38. Sardanashvily, G.: SUSY-extended field theory. Int. J. Mod. Phys. A 15, 3095–3112 (2000) 39. Sardanashvily, G.: Cohomology of the variational complex in field-antifield BRST theory. Mod. Phys. Lett. A 16, 1531–1541 (2001) 40. Sardanashvily, G.: Remark on the Serre–Swan theorem for non-compact manifolds. http://arxiv.org/list/math-ph/0102016, 2001 41. Sardanashvily, G.: Cohomology of the variational complex in the class of exterior forms of finite jet order. Int. J. Math. and Math. Sci. 30, 39–48 (2002) 42. Takens, F.: A global version of the inverse problem of the calculus of variations. J. Diff. Geom. 14, 543–562 (1979) 43. Tulczyiew, W.: The Euler–Lagrange resolution. In: Differential Geometric Methods in Mathematical Physics (Proc. Conf., Aix-en-Provence/Salamanca, 1979), Lecture Notes in Math. 836. Berlin: Springer, 1980, pp. 22–48 Communicated by N.A. Nekrasov

Commun. Math. Phys. 259, 129–138 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1361-2

Communications in

Mathematical Physics

A New Inequality for the von Neumann Entropy Noah Linden, Andreas Winter Department of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, U.K. E-mail: [email protected]; [email protected] Received: 2 July 2004 / Accepted: 13 December 2004 Published online: 19 May 2005 – © Springer-Verlag 2005

Abstract: Strong subadditivity of von Neumann entropy, proved in 1973 by Lieb and Ruskai, is a cornerstone of quantum coding theory. All other known inequalities for entropies of quantum systems may be derived from it. Here we prove a new inequality for the von Neumann entropy which we prove is independent of strong subadditivity: it is an inequality which is true for any four party quantum state, provided that it satisfies three linear relations (constraints) on the entropies of certain reduced states. 1. Introduction Entropy is a key concept both in classical and in quantum information theory: Shannon’s source and channel coding and half a century of work [11] have exhibited a vast range of operational coding problems whose solution can be expressed most naturally by (Shannon-Gibbs-Boltzmann) entropies of random variables: H (X) = − Pr{X = x} log2 Pr{X = x}. x

Quantum information theory [3] allows a wealth of new information processing possibilities, with the von Neumann (quantum) entropy playing a role analogous to Shannon’s (classical) entropy in classical information theory: for a density operator ρ, the von Neumann entropy is S(ρ) = −Trρ log2 ρ. Although the two entropy functionals exhibit similarities, they have many decidedly different properties. These properties however, because of the intimate relation of the entropy to operational properties of (classical and quantum) information, ultimately express statements about the “nature of information”. Furthermore, they are indispensable technical tools in proving the information theoretic optimality of constructions:

130

N. Linden, A. Winter

most importantly, there are inequalities governing the relative magnitude of entropies, conditional entropies and mutual information. In the quantum case, there is essentially only one known inequality (all others being derivable from it): strong subadditivity. Proved by Lieb and Ruskai [6] in 1973, it is the key result on which virtually every nontrivial quantum coding theorem relies. We prove here a new inequality for the von Neumann entropy, which we show cannot be derived from the known ones: it is a constrained inequality in that it is not true in general but only for states satisfying three particular linear constraints on their entropies. One starting point for our work was the desire to understand properties of quantum entropy. We were also motivated by investigations of multi-party entanglement in [7]: namely, the entropies of any set of parties of a multi-party pure states are invariants under asymptotically reversible (multi-copy) state transformations. In [7], “entropy” values were found which are allowed by strong subadditivity but for which the authors could not find quantum states. This led to the conjecture that it is impossible to realise those values by a quantum state, which we indeed prove here. The structure of our paper is as follows: the next section reviews the well-established convexity framework for (linear) information inequalities. In Sect. 3 we state and prove our result, while in Sect. 4 we explain why it does not follow from the standard inequalities. In Sect. 5 we present a number of alternative forms of our inequality. We close in Sect. 6 with a discussion and a conjectured unconstrained inequality. 2. Linear Inequalities Pippenger [8] initiated the programme of determining all (linear) inequalities satisfied by the classical entropy functional H . This question was based on the realisation of two facts (see Yeung’s work [12]): first, that in information theoretic applications, the properties one uses about the entropy to bound information quantities seem always to be 1. 2. 3. 4.

Nonnegativity of entropy H (X). Nonnegativity of conditional entropy H (X|Y ) = H (XY ) − H (Y ). Nonnegativity of mutual information I (X; Y ) = H (X) + H (Y ) − H (XY ). Nonnegativity of conditional mutual information I (X; Z|Y ) = H (XY ) + H (Y Z) − H (XY Z) − H (Y ),

for random variables X, Y, Z, the so-called basic inequalities. n Second, that for every number n of random variables, the points in R2 −1 given by the entropies of all possible subsets of the random variables, (H (XS ))∅=S⊂{1,... ,n} : X1 , . . . , Xn random variables (where all random variables are assumed to be discrete and indeed finite range) form “almost” a convex cone (i.e., closed under nonnegative linear combinations) in the ∗ positive orthant: one only needs to go to the topological closure, denoted n , and called the (classical) entropy cone. Surprisingly, the classical entropy cone can be strictly smaller than the cone cut out by the basic inequalities for all subsets of random variables (which we will call “Shannon cone” n ): while the two cones coincide for n ≤ 3, they differ for n = 4. Indeed, as Yeung and Zhang have shown, there are further inequalities satisfied by the entropy

A New Inequality for von Neumann Entropy

131

cone which are not dependent on the basic inequalities; i.e., they are violated by points in the Shannon cone. Pippenger [9] observed that a similar situation occurs in the quantum case: with an underlying multipartite state ρ, denote the entropy of its restriction to subsystems A, . . . , or groups of subsystems AB, . . . , by S(A), S(AB), etc. Then, there is a “von Neumann” cone n , defined by the basic inequalities 1. Nonnegativity of entropy S(A). 2. Nonnegativity of the quantity S(A|B) + S(A) = S(AB) − S(B) + S(A) (this is known as the triangle inequality [2] of Araki and Lieb). 3. Nonnegativity of S(C|A)+S(C|B) = S(CA)+S(CB)−S(A)−S(B) (this replaces nonnegativity of the conditional entropy, and is called “weak monotonicity”). 4. Nonnegativity of quantum mutual information I (A; B) = S(A) + S(B) − S(AB). 5. Nonnegativity of quantum conditional mutual information I (A; C|B) = S(AB) + S(BC) − S(ABC) − S(B). (Note that the names of the quantities are given based on straightforward analogy, with no operational significance implied at this point.) The latter two are simply subadditivity and strong subadditivity [6] of the quantum entropy. The properties 2) [and 3)] above, can actually be derived from 4) [and 5)] (and vice versa) by viewing the state as the restriction of a pure state on the given parties plus one, and the fact that for a pure state, the entropy of a subset of the parties equals the entropy of the complementary set; the first equivalence was observed in [2], the second in [6]. They are actually a consequence of a linear algebra fact, namely the Schmidt decomposition of bipartite pure states, whose coefficients are the eigenvalues of both reduced states (for a more detailed discussion, see [10]). Note also that choosing trivial B (i.e., with Hilbert space C) reduces weak monotonicity to the triangle inequality, and conditional mutual information to mutual information (compare the classical case). Thus all non-trivial inequalities may be derived from strong subadditivity. We are particularly interested in the values of entropies which are actually realised by quantum states. Given a quantum state on the tensor product of n finite quantum systems, there is the 2n − 1-dimensional vector of the entropies of the 2n − 1 nontrivial ∗ marginals. Hence there is the cone of quantum entropies n : the closure of the set of all vectors realised by entropies of quantum states on finite quantum systems. That it is indeed a cone is proved in the same way as for the classical case [9], and we briefly sketch the argument. We leave it to the reader to convince him/herself that it is enough to show the following. a) For any entropy vectors v, w ∈ n∗ , also v + w ∈ n∗ ; this is proved by considering the tensor product of states representing v and w. b) For any > 0 there is δ > 0 such that for all 0 ≤ λ ≤ δ and v ∈ n∗ , there is u ∈ n∗ such that λv − u ≤ ; for this, consider the state λρ + (1 − λ)|0 . . . 0 0 . . . 0|, where ρ represents v and |0 . . . 0 0 . . . 0| is some n-party product (pure) state. The faces of the cone n are given by certain entropies being zero (which means that the corresponding subsystem is in a pure state), certain mutual information being zero (which means that certain pairs of subsystems are in a product state), certain conditional mutual information being zero, etc. The latter is fully analogous to the classical case of a Markov chain, where A and C are independent conditional on B (which we call the “pivot” of the chain), as explained in [4]. All this raises the following natural question: are there any further linear inequal∗ ities for the quantum entropy than those above? More precisely: is n n , and if ∗ so, can we find a hyperplane intersecting the interior of n but having n entirely in

132

N. Linden, A. Winter

one halfspace? It should not be too surprising that the question for inqualities beyond strong subadditivity, or for strengthenings of it, has been asked before: one of the earliest discussions of this problem seems to be by Lieb [5]. In our case, an additional incentive was provided by the two extremal rays of 4 mentioned in the introduction and given in Sect. 4 below, which in [7] were studied, but without being able to find a matching state. Since our main result proves that such states do in fact not exist, one arrives at the conjecture that any face of 4 which contains one of these rays is not entirely contained in 4∗ — indeed, this would be a consequence of our conjecture 6 below. 3. The New Inequality Our main result is the following theorem, which gives an answer to the question at the end of Sect. 2 in its first form, and provides evidence for a positive answer to the second. Theorem 1. Let ρ ABCD be a state of a quadripartite quantum system, such that strong subadditivity is saturated for the three triples ABC, CAB and ADB (pivot always in the middle). Then, I (C; D) ≥ I (C; AB). Proof. The proof relies heavily on the recent characterisation of states which saturate strong subadditivity [4], which is stated below as Proposition 2. First of all, since we have strong subadditivity saturated for CAB, Ca L aR B ρ CAB = pi ρ i i ⊗ ρ i i , i

by Proposition 2. To this we apply Proposition 2 once more, for the triple ABC: there exists the recovery map RB→C (we duplicate C, and attach a prime, to distinguish the

two incarnations of C) mapping ρ AB to ρ ABC . It maps the above ρ CAB to a R bL bR C

Ca L ρ CABC = pi pj |i ρi i ⊗ ρiji j ⊗ ρj j . ij

C

A

B

D Fig. 1. The angles in the figure represent the strong subadditivity constraints which are saturated in the conditions of Theorem 1 (i.e. strong subadditivity is saturated for the triples ABC, CAB, ADB; pivot always in the middle). Theorem 1 then states that under these conditions, the correlation between C and D is not smaller than that between C and AB

A New Inequality for von Neumann Entropy

133

(Notice that the states on the far right, by the structure of RB→C can only depend on j , the Hilbert space sector measured by the map, as described in Proposition 2 below.) But the two states obtained by tracing out C and C , respectively (and identifying C with C again), must coincide: ρ CAB = ρ ABC =

CaiL

a R bjL

⊗ ρiji

ij

pij ρi

ij

pij ρi i ⊗ ρiji

a R bjL

aL

bR

⊗ ρj j , bR C

⊗ ρj j .

Comparing these two, for a given sector labelled ij , with pij > 0, we obtain that both CaiL

ρi

bR C

and ρj j

are actually product states: CaiL

ρi

bR C

ρj j

aL

= ρjC ⊗ ρi i , bR

= ρj j ⊗ ρiC .

(1)

That the right-hand sides contain both i and j (whereas the left hand sides mention only i and only j , respectively) is no error, but in fact the main point: it means that for actually occurring ij , i.e., pij > 0, the state of C belonging to this sector depends only on i and only on j — in other words, it must be a common function of i and j : ρkC , with k = f (i) = g(j ), with certain (deterministic) functions f and g. We note that if all the pij are strictly positive, then the only way for ρkC to be consistent with Eq. (1) is for it to be constant, i.e. independent of i and j . Situations in which ρkC can vary with i and j are only possible when some of the pij are zero. As an illustration, consider a state ρ ABC which has i, j = 1, 2, 3, and p11 > 0, p22 > 0, p23 > 0, p32 > 0 and p33 > 0, but pij = 0 otherwise; then the non-constant possibility ρkC = ρ1 for i = j = 1 and ρkC = ρ2 = ρ1 for (i, j ) = (2, 2), (2, 3), (3, 2), (3, 3) (i.e. f (1) = g(1) = 1, f (2) = f (3) = g(2) = g(3) = 2) is consistent with Eq. (1). Returning to the general situation, with k as described in the previous paragraph but one, we can rewrite ρ ABC again, ρ ABC =

a R bjL

aL

pij ρi i ⊗ ρiji

bR

⊗ ρj j ⊗ ρkC .

ij

In fact, let us introduce quantum registers KA and KB holding k explicitly (of course, by our observation, they will be perfectly correlated) — and note that their content can be extracted locally at A and B, respectively, without disturbing the state ρ ABC , by a measurement of the orthogonal subspace sector i and j , respectively: ρ KA ABKB C =

aL

a R bjL

pij |k k|KA ⊗ ρi i ⊗ ρiji

bR

⊗ ρj j ⊗ |k k|KB ⊗ ρkC .

ij

We shall use the following convention: some of the registers are classical (such as KA and KB ) in as much as they come with a distinguished basis, and the global state is written as a mixture of states which have the classical registers in one of their distinguished basis states. These classical registers we identify with random variables, by the same name, for example KA with distribution

134

N. Linden, A. Winter

Pr{KA = k} = pk =

pi .

i:f (i)=k

This will allow us to speak about the state in quantum theoretical language, and interchangeably about its classical properties in random variable language. For example, as random variables, KA = KB with probability 1, by our earlier observation. The proof will now be completed by showing two things: first, that I (C; AB) equals the Holevo quantity of the ensemble of the ρkC , which is I (KA ; C); second, that k is also “known at D” by which we mean that there is a measurement on D extracting a random variable KD perfectly correlated with KA = KB . First, the first claim: the system AB, by our above characterisation, falls into orthog(AB)k onal sectors (AB)k , labelled by k, and the state in this sector is some σk ⊗ ρkC , a R bL

aL

bR

because it is a convex combination of states ρi i ⊗ ρiji j ⊗ ρj j ⊗ ρkC , with ij consistent with k, so they all have the same state ρkC on C. Hence, there is a quantum operation extracting KA from AB (a coarse-graining of the disturbance-free measurement of i (AB)k and j ), as well as a reverse, creating σk in AB from KA . By monotonicity of the quantum mutual information, I (C; AB) = I (KA ; C). The second claim is seen as follows: using the third constraint, I (A; B|D) = 0, with the monotonicity of the quantum conditional mutual information under the local maps extracting KA (from A) and KB (from B), gives I (KA ; KB |D) = 0. Proposition 2 guarantees the existence of a measurement (whose outcome we think of being stored in a classical register KD ) such that conditional on each measurement outcome, KA and KB are in a product state. It is straightforward to check that then I (KA ; KB |KD ) = 0, and because KA and KB are perfectly correlated, they must also be perfectly correlated with KD : KA = KB = KD with probability 1, as random variables. These two facts, by monotonicity of the quantum mutual information under quantum operations, finally yield I (C; AB) = I (KA ; C) = I (KD ; C) ≤ I (C; D). Proposition 2 (Hayden, Jozsa, Petz and Winter [4]). A state ρ ABC saturating strong subadditivity at pivot B, i.e., satisfying S(AB) + S(BC) = S(ABC) + S(B), must have the following form. There exists an orthogonal decomposition of B’s Hilbert space HB into subspaces HBj , each of which has a natural presentation as tensor product of two Hilbert spaces: HB = HbL ⊗ HbR , j

j AbjL

such that (with states ρj

j

bR C

on HA ⊗ HbL and ρj j j

ρ ABC =

AbjL

pj ρ j

on HbR ⊗ HC ) j

bR C

⊗ ρj j .

j

This can be operationally rephrased as follows: there is a quantum operation RB→C from B to BC such that ρ ABC = (idA ⊗ RB→C )ρ AB , which has the following form: 1. Perform a projective measurement associated with an orthogonal decomposition of B into sectors Bj .

A New Inequality for von Neumann Entropy

135

2. Each sector has a tensor product structure Bj = bjL bjR ; having measured j in step bR C

1, the map discards the state on bjR and replaces it by ρj j bjR C.

on the composite system

Remark 3. One can easily construct states where our inequality is strict, and others where it is tight: a state of the form ρ ABCD = qj ρjA ⊗ σjB ⊗ ϕjCD , j

with ϕjCD being arbitrary and having the marginal states τj and ζj on C and D, respectively, will generically have I (C; D) > I (C; AB). If however ϕjCD = τjC ⊗ ζjD , with mutually orthogonal τjC , we have equality. Also, it is easy to see that no proper subset of our three constraints can imply I (C; D) ≥ I (C; AB): 1. Saturation of strong subadditivity for ABC and CAB: consider ρ ABCD =

ABC 1 |000 000| + |111 111| ⊗ |0 0|D . 2

It satisfies these two constraints (and many more), but has I (C; D) = 0 and, however, I (C; AB) = 1. 2. Saturation of strong subadditivity for ABC and ADB: consider ρ ABCD =

BC 1 ⊗ |00 00|AD . |00 00| + |11 11| 2

It satisfies these two constraints (and many more), but has I (C; D) = 0 and, however, I (C; AB) = 1. Remark 4. It is worth pointing out that not every application of Proposition 2 along the lines of our proof of Theorem 1 yields a nontrivial result, even though it may seem so at first sight: for example, consider a tripartite state ρ ABC which saturates strong subadditivity for ABC. Then, the characterisation of such states implies that ρ AC is separable, which is well-known to imply S(AC) ≥ S(C). Since this inequality is false for general states, have we found a new constrained inequality? Actually no: it can be checked immediately that in generality, 2S(A|C) + I (A; C|B) = S(A|B) + S(A|C) + I (A; B|C) ≥ 0, by the basic inequalities (weak monotonicity and strong subadditivity): hence, if I (A; C|B) = 0, then necessarily S(A|C) ≥ 0. 4. Why the Constrained Inequality is New In the introduction we have explained already why for three parties there cannot be an ∗ information inequality independent of the basic ones, as 3 = 3 [9, 7]. Indeed, as

136

N. Linden, A. Winter

one can see from these papers there cannot even be a constrained inequality, since on each of the 8 extremal rays of 3 there are (nonzero) entropy vectors realised by certain states. We re-iterate here that if we speak of an inequality being “dependent” on others, we mean it in the sense of convex geometry: that it is a positive linear combination of these other inequalities. In particular, the equivalence between, say, subadditivity and triangle inequality, is not of this type. The four party case is studied in [7] with particular interest in the insights to be gained about multi-party entanglement. There it is shown that 4 has 76 extremal rays, which fall naturally into 8 symmetry classes under permutation of the parties. For 6 of them [7] gives states realising entropy vectors on the rays. The two remaining classes are represented by the rays spanned by the following vectors (the first row gives the combinations of subsystems in lexicographic order; below are their “entropies”): I II

A B C D AB AC AD BC BD CD ABC ABD ACD BCD ABCD 3 3 2 2 4 3 3 3 3 4 4 4 3 3 2 3 3 3 3 4 4 4 4 4 6 5 5 5 5 2

Clearly, if one could find states realising these vectors (or nonzero multiples), this would ∗ prove 4 = 4 . However, it is readily verified that both these rays satisfy the condition of Theorem 1, but not the conclusion: both vectors given above have I (C; D) = S(C) + S(D) − S(CD) = 0 but I (C; AB) = S(C) + S(AB) − S(ABC) = 2 > 0. Corollary 5. There are no quantum states of finite systems realising entropy vectors on the rays I and II above (except for the origin). In fact, in the face of the cone 4 described by the three constraint equations of Theorem 1, the new inequality I (C; D) ≥ I (C; AB) cuts off a slice, which contains the rays I and II. In other words, the two entropy vectors satisfy all the basic inequalities, but by Theorem 1 there can be no non-trivial quantum state with entropy vector in these rays. Thus, Theorem 1 cannot be derived from the constraints in its statement using only the basic inequalities and positive linear combinations, and so the new inequality is indeed independent of all previously known inequalities. We hasten to add, however, that this does not exclude the possibility of a proof of it using other, “non-linear”, means, like the purification trick relating some of the basic inequalities. 5. Alternative Forms of the Inequality We have presented the new inequality in Theorem 1 in a form which reflects our way of proving it. Writing out the mutual information in terms of entropies, one notices that some terms cancel, and we arrive at the following reformulation of our result: Theorem 1 . Let ρ ABCD be a state of a quadripartite quantum system such that S(AB) + S(BC) − S(B) − S(ABC) = 0, S(CA) + S(AB) − S(A) − S(CAB) = 0, S(AD) + S(DB) − S(D) − S(ADB) = 0. Then, S(ABC) + S(D) ≥ S(AB) + S(CD), i.e., I (ABC; D) ≥ I (AB; CD).

A New Inequality for von Neumann Entropy

137

We present this reformulation mainly because it may help understanding and applying the result. There is another one, however, which is less trivial: we can apply the purification trick that is used to relate strong subadditivity and weak monotonicity (see Sect. 2). In detail, we construct a purification ABCDE of the given state ρ ABCD and can apply Theorem 1 or 1 to three situations: strong subadditivity saturated for the triples ABC, CAB and AEB; second, for AEC, CAE and ADE; third, for ABE, EAB and ADB. If we then systematically eliminate all entropies involving E by substituting the complementary group, we get the following statements: Theorem 1

. Let ρ ABCD be a state of a quadripartite quantum system. Consider the following three properties this state could have: I (A; C|B) = I (B; C|A) = I (A; B|CD) = 0, S(C|A) + S(C|BD) = I (A; C|BD) = S(A|D) + S(A|BC) = 0, S(B|A) + S(B|CD) = S(A|B) + S(A|CD) = I (A; B|D) = 0.

(1) (2) (3)

Then, (1) ⇒ S(C|AB) + S(C|ABD) ≥ 0, (2) ⇒ S(C|D) + S(C|BD) ≤ 0, (3) ⇒ S(D) + S(CD) ≥ S(AB) + S(ABC).

6. Discussion Although we believe that the discovery of a new constrained information inequality is ∗ interesting in itself, our Theorem 1 is not enough to conclude 4 4 because it may be that there are states realising entropy vectors arbitrarily close to the points I and II in the previous section. Such a possibility could be ruled out by finding an unconstrained inequality satisfied by 4∗ but violated by points on the rays I and II. Note that indeed for n = 3, in both the quantum and classical version of the question, the set of entropic ∗ ∗ vectors is not closed, so is not identical to the entropy cone 3 , 3 . On the other hand, it is still the case that the extremal rays are indeed populated by distributions/states. We may remark that in the classical variant of the question, Yeung and Zhang also at first only found a constrained inequality [13], and only somewhat later their unconstrained inequality in [14], whose proof indeed uses ideas from constrained inequalities. We think, however, that our result provides some evidence towards the existence of such an inequality for the quantum entropy cone; in fact, we believe that it way well be possible to prove an inequality ruling out the approximability of I and II, based on the following: in [4], it is conjectured that there is a robust version of that paper’s main theorem — characterising the states that come close to saturating strong subadditivity. It seems likely that with such a theorem one could perform an approximation version of the proof of Theorem 1, and conclude a new “constrained” inequality if the three constraint equations of Theorem 1 are only almost satisfied. In other words, there would be a trade-off between the degree by which I (A; C|B), I (C; B|A), I (A; B|D) are nonzero, and the negativity of I (C; D) − I (C; AB). This rationalises the following conjecture, with which we close the paper:

138

N. Linden, A. Winter

Conjecture 6. There exist positive constants κ1 , κ2 and κ3 , such that for all quadripartite states, κ1 I (A; C|B) + κ2 I (C; B|A) + κ3 I (A; B|D) + I (C; D) − I (C; AB) ≥ 0. Acknowledgements. We thank E. Maneva, S. Massar, S. Popescu, D. Roberts, B. Schumacher, J. A. Smolin and A. V. Thapliyal, for illuminating discussions on the subjects of this paper over many years and for allowing us to use their results prior to publication. We also thank M. Christandl and T. Osborne for helpful remarks. Both authors received support from the EU under European Commission project RESQ (contract IST-2001-37559).

References 1. Accardi, L., Frigerio, A.: Markovian cocycles. Proc. Roy. Irish Acad. 83A(2), 251–263 (1983) 2. Araki, H., Lieb, E.H.: Entropy inequalities. Commun. Math. Phys. 18, 160–170 (1970) 3. Bennett, C.H., Shor, P.W.: Quantum information theory. IEEE Trans. Inf. Theory 44(6), 2724–2742 (1998) 4. Hayden, P., Jozsa, R., Petz, D., Winter, A.: Structure of states which satisfy strong subadditivity of quantum entropy with equality. Commun. Math. Phys. 246(2), 359–374 (2004) 5. Lieb, E.H.: Some Convexity and Subadditivity Properties of Entropy. Bull. Amer. Math. Soc. 81, 1–13 (1975) 6. Lieb, E.H., Ruskai, M.B.: Proof of the strong subadditivity of quantum-mechanical entropy. J. Math. Phys. 14, 1938–1941 (1973) 7. Linden, N., Maneva, E., Massar, S., Popescu, S., Roberts, D., Schumacher, B., Smolin, J.A., Thapliyal, A.V.: In preparation 8. Pippenger, N.: What are the laws of information theory? 1986 Special Problems in Communication and Computation Conference, Palo Alto, CA, 3–5 September 1986 9. Pippenger, N.: The inequalities of quantum information theory. IEEE Trans. Inf. Theory 49(4), 773–789 (2003) 10. Ruskai, M.B.: Inequalities for quantum entropy: A review with conditions for equality. J. Math. Phys. 43(9), 4358–4375 (2002) 11. Shannon, C.E.: A Mathematical Theory of Communication. Bell System Tech. J. 27, 379–423 and 623–656 (1948) Shannon Theory demi-centennial issue of IEEE Trans. Inf. Theory: 44(6), (1998) 12. Yeung, R.W.: A Framework for Linear Information Inequalities. IEEE Trans. Inf. Theory 43(6), 11924–1934 (1997) 13. Zhang, Z., Yeung, R.W.: A Non-Shannon Type Conditional Inequality of Information Quantities. IEEE Trans. Inf. Theory 43(6), 1982–1985 (1997) 14. Yeung, R.W., Zhang, Z.: On Characterization of Entropy Function via Information Inequalities. IEEE Trans. Inf. Theory 44(4), 1440–1452 (1998) Communicated by M.B. Ruskai

Commun. Math. Phys. 259, 139–183 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1347-0

Communications in

Mathematical Physics

Instability Zones of a Periodic 1D Dirac Operator and Smoothness of its Potential Plamen Djakov1, , Boris Mityagin2 1

Department of Mathematics, Sofia University, 1164 Sofia, Bulgaria. E-mail: [email protected] 2 Department of Mathematics, The Ohio State University, 231 West 18th Ave, Columbus, OH 43210, USA. E-mail: [email protected] Received: 26 August 2004 / Accepted: 8 November 2004 Published online: 15 April 2005 – © Springer-Verlag 2005

Abstract: Let L be the differential operator 0 P (x) 1 0 dy + y, Ly = i Q(x) 0 0 −1 dx

y y= 1 , y2

where P (x), Q(x) are 1-periodic functions such that Q(x) = P (x). The operator L, considered on [0, 1] with periodic (y(0) = y(1)), or antiperiodic (y(0) = −y(1)) boundary conditions, is self-adjoint, and moreover, for large |n| it has, close to nπ, a pair of periodic − (if n is even), or antiperiodic (if n is odd) eigenvalues λ+ n , λn . We study the relationship − between the decay rate of the instability zone sequence γn = λ+ n − λn , n → ±∞, and the smoothness of the potential function P (x). 1. Introduction 1. The operator

0 P (x) 1 0 dy Ly = i y, + 0 −1 dx P (x) 0

y y= 1 , y2

(1.1)

with a periodic function P (x) of period 1, P ∈ L2 ([0, 1]), is a self-adjoint operator on the real line R. Its spectrum σ (L) is absolutely continuous and has “a band structure”, i.e., + σ (L) = R \ ∪n∈Z (λ− n , λn ), The first author acknowledges the hospitality of The Mathematics Department of The Ohio State University during academic year 2003/2004. His research is partially supported by Grant MM–1401/04 of the Bulgarian Ministry of Education and Science.

140

P. Djakov, B. Mityagin

where − + + · · · < λ− n ≤ λn < λn+1 ≤ λn+1 < · · · , + and λ− n , λn is a pair of eigenvalues of the same differential operator L, but considered on the interval [0, 1], respectively with periodic (for even n), and antiperiodic (for odd n) boundary conditions (bc):

P er + : y(0) = y(1),

P er − : y(0) = −y(1).

See basic facts and further references on 1-D Dirac operators in [21, 23, 25, 29]. After Zakharov-Shabat [32–34] the operators (1.1) or (2.4), with Q = −P , and their spectra are closely related to analysis of the complete integrability of a dynamic system NLS, the nonlinear (cubic) Schr¨odinger equation. See [1, 31, 12] and references there. − 2. Let γn = λ+ n − λn , n ∈ Z, be the lengths of spectral gaps, or instability zones, + − (λn , λn ). What is the relationship between the decay rate of γn , n → ±∞, and the smoothness of a potential P ? In the case of Schr¨odinger (Hill) operators this question has a long history. Let us recall a few results and steps in the understanding of this relation. H. Hochstadt [17] proved that if a real-valued L2 -potential v of the Schr¨odinger operator My = −y + v(x)y, C ∞ -function

is a that is

v(x + 1) = v(x), x ∈ R,

then the gap sequence (γn )∞ 1 decays faster than any power of 1/n, (γn ) ∈ a = {(xn ) :

∞

|xn |2 (1 + n2 )a < ∞}

(1.2)

n=1

for every a > 0. He proved also [18] by using the trace formula [13] that a finite-zone potential (i.e., γn = 0 for all but finitely many n) is a C ∞ -function. For H. Hochstadt, this was an important step in analysis of finite-zone potentials; as soon as one knew that such potential is a C ∞ - function, it was possible [14, 17, 18] to use derivatives and derive polynomial identities involving v, v , v , . . . to determine v. Further analysis of finite-zone potentials [10, 28] led to the Dubrovin equations (see [11]). H. McKean and E. Trubowitz [24] used the trace formula in general setting to prove that (1.2) implies v ∈ C ∞ . E. Trubowitz [30] extended the analysis of [24], complemented by Dubrovin equations [10], to show that a real-valued L2 -potential v(x) is analytic if and only if the gap sequence (γn ) decays exponentially fast, that is ∃a > 0, C > 0 :

γn ≤ C exp(−an) ∀n ≥ 0.

In terms of the weighted sequence spaces 2 = {(xn ) : |xn |2 2 (n) < ∞}, Sobolev or analytic functions v, v(x) =

vk exp(2πikx),

can be characterized as having their Fourier-coefficient sequences in 2 , where = (1 + n2 )a/2 , or = exp(an), a > 0, respectively. T. Kappeler and B. Mityagin [19, 20] raised the general question about the relationship between the two conditions v ∈ H () and (γn ) ∈ 2 , where

1D Dirac Operators

141

H () = {v : (vk ) ∈ 2 },

(1.3)

for general (submultiplicative) weights. They showed that v ∈ H () ⇒ (γn ) ∈ 2 .

(1.4)

(γn ) ∈ 2 ⇒ v ∈ H ()

(1.5)

The opposite implication

required a delicate analysis of special non-linear equations in sequence spaces and a priori estimates of the Sobolev norms of their solutions. This has been done in [3–5] for, roughly speaking, each submultiplicative weight sequence of subexponential growth, i.e., lim (log n ) /n = 0. This is not just a technical restriction. For with superexponential growth like exp(|n|b ), b > 1, the implications (1.4) and (1.5) are not valid, but the proper adjustment can be made, and this is presented in [6]. Analysis of non-self-adjoint Hill operators, i.e., the case of complex-valued potentials, is done in [7]; see further references there. 3. Let us return to Dirac operators. Surprisingly enough, we could not find in the literature even a Hochstadt–McKean–Trubowitz [17, 18, 24] type statement in this case. Still, after [19, 20] the approach developed there for the Schr¨odinger-Hill case has been used in the Dirac case in [15, 16] to get claims about the decay rate of spectral gaps: γn2 2n < ∞ (1.6) P ∈ H () ⇒ n∈Z

under some rigid and (as we will see) unnecessary restrictions on . The main goal of the present paper is to show that for subexponential weights the H ()-smoothness of a potential P , i.e., the condition P ∈ H (), follows from 2 -decay of the two-sided sequence (γn ), i.e., (1.7) γn2 2n < ∞ ⇒ P ∈ H () (see Theorem 11, Sect. 4, for accurate formulation). This result has been announced in [8], Thm 2(i). Maybe, it’s worth mentioning that there is an analogue of this implication (and equivalence) in the non-self-adjoint case (see [8], Thm 2(ii); this result will be given with detailed proofs in [9]). In particular, (1.6) and (1.7) tell us the following. (A) (γn ) decays faster than any power of 1/n if and only if P ∈ C ∞ (compare to [17, 24]). (B) (γn ) decays faster than exp(−an) for some a > 0 if and only if P is analytic in a strip around the real axis (compare to [30]). (C) (γn ) decays faster than exp(−anβ ), β ∈ (0, 1), for some a > 0 if and only if the Fourier coefficients (pk ) of P decay faster than exp(−A|k|β ), for some A > 0 (compare to [5]). The statements (A), (B) and (C) are given in a precise form in Sect. 5.1. In the case of Schr¨odinger - Hill operators we have proven similar statements in [5] and [7]. The general scheme of the present paper is close to the scheme of our paper [5]. However, the technical details and difficulties are quite different, because

142

P. Djakov, B. Mityagin

(i) the Dirac operator is not semibounded; (ii) its resolvent is not a trace class operator. We are going to make this point explicit and specific in our proofs and comments below. The structure of our paper is as follows. Abstract Introduction Basic equation and formulae for gaps Weights; Carleman sequences Basic results: estimates on the smoothness of the potential in terms of the decay rate of spectral gaps 5. Conclusions and comments References 1. 2. 3. 4.

2. Basic Equation and Formulae for Spectral Gaps 1. The Dirac operator

L0 y = i

1 0 dy , 0 −1 dx

y=

y1 , y2

(2.1)

considered on the interval [0, 1] with periodic (y(0) = y(1)) or antiperiodic (y(0) = −y(1)) boundary conditions, has a discrete spectrum {2kπ, k ∈ Z} or {(2k+1)π, k ∈ Z}, respectively. Each eigenvalue nπ, both for periodic (if n is even), or antiperiodic (if n is odd) boundary conditions has multiplicity 2, and 1 −inπx 0 inπx 1 2 en (x) = e , en (x) = e (2.2) 0 1 are eigenfunctions corresponding to the eigenvalue nπ. Moreover, if the Hilbert space H = L2 [0, 1] × L2 [0, 1] is equipped with the scalar product 1 g f1 , 1 = (2.3) f1 (x)g1 (x) + f2 (x)g2 (x) dx, f2 g2 0 1 , e2 , k ∈ Z} and {e1 2 then each of the systems {e2k 2k 2k+1 , e2k+1 , k ∈ Z} is an orthonormal basis in H. The operator 0 P (x) 0 L = L + V, V = , (2.4) Q(x) 0

where P and Q are 1-periodic functions, may be considered as a perturbation of L0 . Further we always assume that P , Q ∈ L2 [0, 1]; then the operator L, considered with periodic or antiperiodic boundary conditions, has also a discrete spectrum. The following statement is known (see, for example [22, 23, 25, 26], in particular, [27], Thm. 4.1 and Prop. 4.3). Lemma 1. There exists N0 = N0 (P , Q) such that for each |n| ≥ N0 the open disc with center π n and radius π/2 contains exactly two (counted with multiplicity) periodic (if + n is even), or antiperiodic (if n is odd) eigenvalues {λ− n , λn } of L, i.e., |λ± n − πn| < π/2,

|n| ≥ N0 .

(2.5)

1D Dirac Operators

143

2. Suppose that λ = nπ + z, |n| ≥ N0 , is a periodic (or antiperiodic) eigenvalue of L with |z| < π/2 and y = 0 is a corresponding eigenvector. Let En0 = [en1 , en2 ] be the eigenspace of L0 that corresponds to nπ, and let H(n) be its orthogonal complement. We denote by Pn0 and Q0n , respectively, the orthogonal projectors on En0 and H(n). Then the equation (nπ + z − L)y = 0 is equivalent to the following system of two equations: Q0n (nπ + z − L0 − V )Q0n y + Q0n (nπ + z − L0 − V )Pn0 y = 0,

(2.6)

Pn0 (nπ + z − L0 − V )Q0n y + Pn0 (nπ + z − L0 − V )Pn0 y = 0.

(2.7)

Taking into account that Pn0 Q0n = Q0n Pn0 = 0 and Pn0 L0 Q0n = Q0n L0 Pn0 = 0 we obtain that (2.6) and (2.7) can be written as Q0n (nπ + z − L0 − V )Q0n y − Q0n V Pn0 y = 0,

(2.8)

−Pn0 V Q0n y − Pn0 V Pn0 y + zPn0 y = 0.

(2.9)

A = A(n, z) := Q0n (nπ + z − L0 − V )Q0n : H(n) → H(n)

(2.10)

The operator

is invertible for large |n| (see below (2.18), (2.26) and (2.27)). Thus, solving (2.8) for Q0n y, we obtain Q0n y = A−1 Q0n V Pn0 y, where Pn0 y = 0 (otherwise Q0n y = 0 which implies y = Pn0 y + Q0n y = 0). Now (2.9) implies (after plugging the above expression for Q0n y into (2.9)) that (S − z)Pn0 y = 0, where the operator S is given by S := Pn0 V A−1 Q0n V Pn0 + Pn0 V Pn0 : En0 → En0 .

(2.11)

12

S 11 S be the matrix representation of the two-dimensional operator S with S 21 S 22 respect to the basis en1 , en2 ; then Let

S 11 = en1 , Sen1 ,

S 22 = en2 , Sen2 ,

S 12 = en1 , Sen2 ,

S 21 = en2 , Sen1 . (2.12)

Hence we obtain (since Pn0 y = 0) 11 S − z S 12 = 0. det S 21 S 22 − z

(2.13)

In the self-adjoint case (Q(x) = P (x)), if λ is a double eigenvalue, then there exists another eigenvector y˜ (corresponding to λ), such that y and y˜ are linearly independent. Then Pn0 y and Pn0 y˜ are linearly independent also. Indeed, if Pn0 y = cPn0 y˜ then ˜ Q0n y = A−1 Q0n V Pn0 y = cA−1 Q0n V Pn0 y˜ = cQ0n y, which leads to a contradiction:

y = Pn0 y + Q0n y = c Pn0 y˜ + Q0n y˜ = cy. ˜

144

P. Djakov, B. Mityagin

Thus S ≡ 0, i.e., if λ = πn + z is a double eigenvalue of a self-adjoint Dirac operator L, then (for large enough n) S 11 − z = 0,

S 12 = 0,

S 21 = 0,

S 22 − z = 0.

(2.14)

1 , m ∈ Z} and 3. Let H1 and H2 be the subspaces of H generated, respectively, by {em 2 1 2 {em , m ∈ Z}, and let H (n) and H (n) be, respectively, the intersections of these spaces with H(n). Then H = H1 ⊕ H2 , so each operator B : H → H may be identified with a 2 × 2 operator matrix (B ij ), where B ij : Hj → Hi , i, j = 1, 2. If we consider the 1 , e2 , k ∈ Z} (or {e1 2 matrix representation of B in the basis {e2k 2k 2k+1 , e2k+1 , k ∈ Z}) ij then this matrix itself combines the matrix representations of B . Of course, a similar remark holds for operators acting in H(n). Further we always work with one of the bases (2.2) (respectively, using the first basis in the case of periodic boundary conditions, and the second one in the case of antiperiodic boundary conditions). However, we don’t specify below which basis is used because the formulas for the matrix representations in these bases are formally the same (with running indices even in the first case and odd in the second case). Let P (x) = p(m)eimπx and Q(x) = q(m)eimπx , (2.15) m∈Z

m∈Z

where p(m) = q(m) = 0 for odd m, be the Fourier expansions of the functions P and Q. It is easy to see that the operator V has the following matrix representation 0 V 12 12 21 = p(−k − m), Vkm = q(k + m). (2.16) V = , Vkm V 21 0 The diagonal operator Q0n (nπ + z − L0 )Q0n : H(n) → H(n) is invertible in H(n) for any z with |z| ≤ π/2. Let Dn denote its inverse operator; then the matrix representation of Dn is 11 δkm Dn 0 11 22 , D . (2.17) = D = Dn = 22 n n 0 Dn km km π(n − k) + z The operator A defined in (2.10) can be written as A = Q0n (nπ + z − L0 )Q0n (1 − Tn Q0n ),

(2.18)

Tn = Dn Q0n V : H → H(n).

(2.19)

where

Thus A = A(n, z) is invertible if and only if 1 − Tn Q0n is invertible in H(n). By (2.16) and (2.17) one can easily see that the operator (2.19) has a matrix representation 0 Tn12 , (2.20) Tn = Tn21 0 where (Tn12 )km =

p(−k − m) , π(n − k) + z

(Tn21 )km =

q(k + m) , π(n − k) + z

k, m ∈ Z, k = n. (2.21)

1D Dirac Operators

145

We need also to know the matrix representation of its square Tn2 . From (2.20) and (2.21) it follows that 12 21 T T 0 Tn2 = n n , (2.22) 0 Tn21 Tn12 where (Tn12 Tn21 )km =

p(−k − j )q(j + m) , [π(n − k) + z][π(n − j ) + z]

j =n

(Tn21 Tn12 )km =

(k, m ∈ Z, k = n) q(k + j )p(−j − m) . [π(n − k) + z][π(n − j ) + z]

j =n

(2.23)

Lemma 2. The norm of the operator Tn2 : H → H(n) tends to 0 as |n| → ∞. More precisely, if |z| < π/2, then 1/2 1/2   P Q Tn2 ≤ C √ |q(k)|2  + CQ  |p(k)|2  , + CP  |n| |k|≥|n| |k|≥|n| (2.24) where C is an absolute constant. Proof. The norm of Tn2 does not exceed its Hilbert-Schmidt norm, so, by (2.22), it is less than the sum of the Hilbert-Schmidt norms of the operators Tn12 Tn21 and Tn21 Tn12 . We estimate in detail only the Hilbert-Schmidt norm Tn12 Tn21 H S because 21 Tn Tn12 H S could be estimated in the same way. One can easily see that 1 1 ≤ |π(n − k) + z| |n − k|

for

|z| < π/2, k = n,

(2.25)

so by (2.23) we have Tn12 Tn21 2H S

 2 |p(−k − j )||q(j + m)|   ≤ 1 + 2 + 3 , ≤ |n − k||n − j | m k =n

j =n

where

1 =



 ... ,

m |k−n|≥ |n| 2



2

2 =

j =n

3 =

  k =n m

  

m |k−n|< |n| 2

|j −n|< |n| 2

2  ... .

|j −n|≥ |n| 2

2  ...

146

P. Djakov, B. Mityagin

Now we estimate each of these sums separately. By the Cauchy inequality we obtain    1  

1 ≤ |p(−k − j )|2 |q(j + m)|2  2 (n − j )2 (n − k) |n| m |k−n|≥

≤

π2 3

j =n

2

|k−n|≥|n|/2

j =n

1 C1 |p(−k − j )|2 |q(j + m)|2 ≤ P 2 Q2 . 2 (n − k) n m j =m

The sum 2 can be estimated in an analogous way, so

2 ≤

C1 P 2 Q2 . n

Finally, the sum 3 does not exceed     |p(−k − j )|2 |q(j + m)|2  1    2 2 (n − k) (n − j ) |n| m |n| |k−n|<

≤

2

|j −n|<

j =n

2

1 |p(−k − j )|2 |q(j + m)|2 2 (n − k) m |k−n|<|n|/2 |j −n|<|n|/2 ≤ C2 Q2 |p(ν)|2 , π2 3

|ν|≥|n|

which completes the proof.

By Lemma 2, for each potential matrix V there exists N1 = N1 (V ) such that Tn2 ≤ 1/2

for |n| ≥ N1 .

(2.26)

Since Tn2k ≤ Tn2 k

and Tn2k+1 ≤ Tn Tn2 k ,

the series (1 − Tn Q0n )−1 =

∞

Tn Q0n

=0

converges. Thus, in view of (2.18), A−1 exists and A−1 =

∞

Tn Dn ,

|n| ≥ N1 .

(2.27)

=0

Now, from (2.11) and (2.19) it follows that S = Pn0 V Pn0 +

∞ =0

Pn0 V Tn Dn Q0n V Pn0 =

∞ k=0

Pn0 V Tnk Pn0 ,

(2.28)

1D Dirac Operators

147

so, in view of (2.12), we have ∞ j ij S ij = eni , Sen = Sk ,

(2.29)

k=0

where

j Sνij = eni , V Tnk en , k = 0, 1, 2, . . . .

From (2.16) and (2.21 - 2.23) it follows that 12 T 21 T 12 ν 0 V 2ν n n ν , V Tn = V 21 Tn12 Tn21 0 12 21 12 21 ν 0 V Tn Tn Tn V Tn2ν+1 = . ν 0 V 21 Tn12 Tn21 Tn21 It is easy to see that eni , V Tn2ν eni = 0,

i = 1, 2;

(2.30)

(2.31) (2.32)

ν = 0, 1, 2, . . . ,

therefore by (2.12), (2.28), (2.29) and (2.32) we obtain S 11 =

∞

11 S2ν+1 ,

S 22 =

ν=0

where

∞

22 S2ν+1 ,

ν 11 S2ν+1 = en1 , V Tn2ν+1 en1 = en1 , V 12 Tn21 Tn12 Tn21 en1 =

j0 ,j1 ,... ,j2ν

i0 ,i1 ,... ,i2ν

(2.34)

p(−n − j0 )q(j0 + j1 )p(−j1 − j2 )q(j2 + j3 ) . . . p(−j2ν−1 − j2ν )q(j2ν + n) , [π(n − j0 ) + z][π(n − j1 ) + z] . . . [π(n − j2ν−1 ) + z][π(n − j2ν ) + z] =n

22 S2ν+1 = en2 , V Tn2ν+1 en2 = en2 , V 21 Tn12 (Tn21 Tn12 )ν en2 =

(2.33)

ν=0

(2.35)

q(n + i0 )p(−i0 − i1 )q(i1 + i2 )p(−i2 − i3 ) . . . q(j2ν−1 + j2ν )p(−j2ν − n) . [π(n − i0 ) + z][π(n − i1 ) + z] . . . [π(n − i2ν−1 ) + z][π(n − i2ν ) + z] n =

In an analogous way we obtain formulas for S 12 and S 21 . Indeed, en2 , V Tn2ν+1 en1 = 0, ν = 0, 1, 2, . . . , en1 , V Tn2ν+1 en2 = 0, and therefore, from (2.12), (2.28), (2.29) and (2.31) it follows that S 12 =

∞ ν=0

12 S2ν ,

S 21 =

∞ ν=0

21 S2ν ,

(2.36)

148

P. Djakov, B. Mityagin

where

S012 = en1 , V en2 = p(−2n),

S021 = en2 , V en1 = q(2n),

(2.37)

and for ν = 1, 2 . . .

12 = en1 , V Tn2ν en2 = en1 , V 12 (Tn21 Tn12 )ν en2 = S2ν

j1 ,... ,j2ν

p(−n − j1 )q(j1 + j2 )p(−j2 − j3 )q(j3 + j4 ) . . . q(j2ν−1 + j2ν )p(−j2ν − n) , [π(n − j1 ) + z][π(n − j2 ) + z] . . . [π(n − j2ν−1 ) + z][π(n − j2ν ) + z] n =

21 S2ν = en2 , V Tn2ν en1 = en2 , V 12 (Tn21 Tn12 )ν en1 = j1 ,... ,j2ν

(2.38)

(2.39)

q(n + j1 )p(−j1 − j2 )q(j2 + j3 )p(−j3 − j4 ) . . . p(−j2ν−1 − j2ν )q(j2ν + n) . [π(n − j1 ) + z][π(n − j2 ) + z] . . . [π(n − j2ν−1 ) + z][π(n − j2ν ) + z] n =

Lemma 3. (a) For any potential functions P , Q we have S 11 (n, z) = S 22 (n, z).

(2.40)

S 12 (n, z) = S 21 (n, z).

(2.41)

(b) If Q(x) = P (x), then

Proof. (a) Changing the summation indices in (2.35) by putting js = i2ν−s ,

s = 0, 1, . . . , 2ν,

we obtain (by (2.34)) that 22 11 = S2ν+1 , S2ν+1

ν = 0, 1, 2, . . . ,

and therefore, by (2.33), we have S 22 = S 11 . (b) If Q(x) = P (x), then q(m) = p(−m) ∀ m ∈ Z, and therefore, (2.37), (2.38) and 21 (n, z) = S 12 (n, z) for each ν = 0, 1, 2, . . . , so (2.36) implies (2.41). (2.39) yield S2ν 2ν 4. Let us set for convenience αn (z) := S 11 (n, z)

βn (z) := S 21 (n, z).

(2.42)

Lemma 4. For each pair P (x), Q(x) of potential functions there exists N2 > 0 such that for |n| ≥ N2 and |z| ≤ π/2, αn (z) and βn (z) are well defined, differentiable, and sup |αn (z)| → 0,

|z|≤π/2

sup |βn (z)| → 0 as |n| → ∞.

|z|≤π/2

(2.43)

1D Dirac Operators

149

Proof. By (2.10) and (2.11), d S(n, z) = −Pn0 V (A−1 )2 Q0n V Pn0 , dz and therefore, in view of (2.12) and (2.42), we have αn (z) = − Pn0 V (A−1 )2 Q0n V Pn0 en1 , en1 ,

(2.44)

βn (z)

(2.45)

=

− Pn0 V (A−1 )2 Q0n V Pn0 en1 , en2 .

By (2.27) A−1 Q0n V =

∞

Tn Dn Q0n V =

−1

A

A

−1

Q0n V

=

∞

Tn Dn

=0

Tnk ,

k=1

=0

and therefore,

∞

∞

Tnk

= Dn Tn + Tn Dn Tn + R,

(2.46)

as |n| → ∞.

(2.47)

k=1

where, in view of Lemma 2,

R = O Tn2 → 0

Thus, by (2.44) and (2.45), we have αn (z) = − V Dn Tn en1 , en1 − V Tn Dn Tn en1 , en1 − V Ren1 , en1 , βn (z) = − V Dn Tn en1 , en2 − V Tn Dn Tn en1 , en2 − V Ren1 , en2 .

(2.48) (2.49)

We are going to show that all terms on the right of the above formulae go to 0 uniformly in z, |z| ≤ π/2, as |n| → ∞. From (2.47) it follows that

V Ren1 , en1 → 0,

V Ren1 , en1 → 0

as |n| → ∞.

By (2.16), (2.20), (2.21) and (2.25), p(n + k)q(−k − n) ≤ 1 + 2 , V Dn Tn en1 , en1 = π(n − k) + z k =n

(2.50)

(2.51)

where

1 =

|n−k|≤|n|/2

|p(n + k)||q(−k − n)| , |n − k|

2 =

|n−k|>|n|/2

|p(n + k)||q(−k − n)| . |n − k| (2.52)

Let us change the summation index k in 1 to i = n + k. Then, since |i| = |2n − (n − k)| ≥ 2|n| − |n − k| > |n|,

150

P. Djakov, B. Mityagin

we obtain

1 ≤

 |p(i)||q(−i)| ≤ 

|i|>|n|

1/2  |p(i)|2 



|i|>|n|

1/2 |q(i)|2 

→0

as |n| → ∞.

|i|>|n|

(2.53) The Cauchy inequality yields 

2 ≤ P Q 

|n−k|>|n|/2

1/2 1  (n − k)2

= O(1/ |n|).

(2.54)

On the other hand, (2.16), (2.20) and (2.21) imply that

V Tn Dn Tn en1 , en1 = 0,

V Dn Tn en1 , en2 = 0.

(2.55)

Next we estimate V Tn Dn Tn en1 , en2 . Set Un = Tn Dn Tn ; then, by (2.16) and (2.20)– (2.23) the absolute value of each term in the matrix representation of Un does not exceed the absolute value of the corresponding term in the matrix representation of (Tn )2 , and therefore, by the proof of Lemma 2, Un = Tn Dn Tn → 0

as |n| → ∞.

(2.56)

Of course, (2.56) implies that

V Tn Dn Tn en1 , en2 → 0

as |n| → ∞.

(2.57)

Now, in view of (2.48) and (2.49), the formulae (2.50)–(2.57) show that (2.43) holds. Theorem 5. Let L be a self-adjoint Dirac operator given by (1.1), and let (γn ) be the sequence of its spectral gaps. Then there exist N2 > 0 and a sequence of positive numbers (εn ), εn → 0, such that 2|βn (z)|(1 − εn ) ≤ γn ≤ 2|βn (z)|(1 + εn ),

|n| ≥ N2 ,

(2.58)

where z = zn , |zn | ≤ π/2.

(2.59)

2 ± Proof. By Lemma 1, if |n| ≥ N0 , then there are exactly two eigenvalues λ± n = n + zn ± of L (periodic for even n and antiperiodic for odd n) such that |zn | < π/2. Moreover, we know (see (2.26) and (2.27)) that there exists N1 > N0 such that, for |n| ≥ N1 , zn− and zn+ are roots of the quasi-quadratic equation (2.13). Since the operator L is self-adjoint, zn− and zn+ are real numbers, zn− ≤ zn+ , and

γn = zn+ − zn− .

(2.60)

By (2.40) and (2.41) in Lemma 3, the quasi-quadratic equation (2.13) becomes the equation (z − αn (z))2 − |βn (z)|2 = 0,

(2.61)

1D Dirac Operators

151

which splits into two equations z − αn (z) − |βn (z)| = 0,

(2.62)

z − αn (z) + |βn (z)| = 0.

(2.63)

δn = sup |α (n, z)| + sup |β (n, z)|.

(2.64)

Set |z|≤π/2

|z|≤π/2

By Lemma 4, δn → 0 as |n| → ∞. Choose N2 > N1 so that for |n| ≥ N2 .

δn < 1/8

(2.65)

+ Fix an n such that |n| ≥ N2 . If γn = 0, then λ− n = λn is a double eigenvalue of L, so (2.14) and (2.42) yield (2.58). If zn− < zn+ , set

ζn+ = zn+ − αn (zn+ ),

ζn− = zn− − αn (zn− ).

(2.66)

|ζn− | = |βn (zn− )|.

(2.67)

Then, by (2.62) and (2.63), |ζn+ | = |βn (zn+ )|, By (2.66), ζn+

− ζn+

=

zn+

zn−

1 − αn (z) dz.

Thus, in view of Lemma 4, (zn+ − zn− )(1 − δn ) ≤ |ζn+ − ζn− | ≤ (zn+ − zn− )(1 + δn ),

(2.68)

which yields (since δn < 1/8 by (2.65)) the inequalities |ζn+ − ζn− | (1 − δn ) ≤ zn+ − zn− ≤ |ζn+ − ζn− | (1 + 2δn ) ≤ 2|ζn+ − ζn− |.

(2.69)

Since zn+ and zn− are roots of (2.61), each of these numbers is a root of either (2.62) or (2.63). Hypothetically, there are two cases: (i) zn+ and zn− are roots of different equations; (ii) zn+ and zn− are roots of one and the same equation. In Case (i) we have, by (2.62), (2.63) and (2.67), that |ζn+ − ζn− | = |βn (zn+ )| + |βn (zn− )| = |ζn+ | + |ζn− |. On the other hand, since βn (zn+ ) − βn (zn− ) =

zn+ zn−

βn (t)dt, (2.64) and (2.66) imply that

|βn (zn+ ) − βn (zn− )| ≤ (zn+ − zn− )δn ≤ |ζn+ − ζn− | · 2δn . Thus, (2.67) and (2.70) yield + |ζ | − |ζ − | = |βn (z+ )| − |βn (z− )| ≤ |ζ + | + |ζ − | · 2δn n

n

n

n

(2.70)

n

n

(2.71)

152

P. Djakov, B. Mityagin

so, since 2|ζn+ | = |ζn+ | + |ζn− | + |ζn+ | − |ζn− | , + |ζn | + |ζn− | (1 − 2δn ) ≤ 2|ζn+ | ≤ |ζn+ | + |ζn− | (1 + 2δn ) , and therefore, since δn < 1/8, 2|ζn+ | (1 − 2δn ) ≤ |ζn+ | + |ζn− | ≤ 2|ζn+ | (1 + 4δn ) .

(2.72)

Finally, using again that δn < 1/8, we obtain by (2.69), (2.70) and (2.72) that (2.58) holds with z = zn+ and εn = 8δn . Case (ii), where zn+ and zn− are simultaneously roots of one of Eqs. (2.62) and (2.63), is impossible. Indeed, by (2.71), we would have, since δn < 1/8, 1 |ζn+ − ζn− | = |βn (zn+ )| − |βn (zn− )| ≤ |ζn+ − ζn− | · 2δn ≤ |ζn+ − ζn− |, 4 which implies ζn+ = ζn− . But then (2.69) yields zn+ = zn− , which contradicts our assump tion that zn+ = zn− . 3. Weights and Carleman sequences 1. A sequence of positive numbers (n), n ∈ Z, is called a weight, or a weight sequence, if (−n) = (n),

(n) ∞

as

n ∞, n ≥ n0 > 0.

(3.1)

Each weight generates the weighted 2 -space, 2 (, Z) = {x = (xn )n∈Z : x2 =

|xn |2 ((n))2 < ∞}.

n∈Z

We say that two weights 1 and 2 are equivalent if ∃C > 0 :

C −1 1 (n) ≤ 2 (n) ≤ C1 (n),

n ∈ Z.

(3.2)

Obviously equivalent weights yield equivalent norms, so they generate one and the same weighted 2 -space. A weight is called submultiplicative if ∃C > 0 :

(n + m) ≤ C(n)(m),

n, m ∈ Z.

(3.3)

Of course, if 1 and 2 are equivalent weights, then whenever one of them is submultiplicative, the other one is submultiplicative also. Obviously, if satisfies (3.3), then ˜ = C satisfies (3.3) with C = 1. Therefore, we may assume that (3.3) holds with C = 1 by passing to an equivalent weight. Moreover, it is easy to see that if (3.3) holds for |n|, |m| ≥ n0 , then it holds for all n, m ∈ Z, maybe with another constant C. A weight is said to be slowly increasing if sup (2n)/ (n) < ∞.

(3.4)

n

It is easy to see that (3.4) implies ∃ m > 0, C > 0 :

(n) ≤ C|n|m ,

for |n| ≥ 1.

(3.5)

1D Dirac Operators

153

Indeed, if M = supn≥1 (2n)/ (n), then (3.4) implies that (2k ) ≤ (1)M k = (1)(2k )m ,

m = log2 (M).

Now (3.5) follows (since is monotone increasing for n ≥ n0 ) : if n0 ≤ 2k ≤ n < 2k+1 then (n) ≤ (2k+1 ) ≤ M(2k ) ≤ M(1)(2k )m ≤ M(1)nm . Further we consider weights of the form (n) = exp(h(|n|)),

|n| ≥ n0 > 0,

(3.6)

or (n) = exp(ϕ(log |n|)),

|n| ≥ n0 > 0,

(3.7)

and characterize some properties of in terms of the functions h and ϕ. Remark. Observe that in (3.6) or (3.7) we don’t care to define for all n because our main object is the corresponding weighted 2 -space. Therefore, weights are important only “up to equivalence” and the values of (n) for |n| < n0 may be chosen in an arbitrary way since the corresponding 2 -spaces will coincide. Of course, with the formulae ϕ(t) = h(et ),

h(n) = ϕ(log(n)),

one can easily pass from representation (3.6) to (3.7), and back. It is more convenient to give concrete weights in the form (3.6). For example, m (n) = |n|m ,

m > 0,

(3.8)

are known as the Sobolev weights, and a,b (n) = exp(a|n|b ),

a > 0, b ∈ (0, 1),

(3.9)

are the Gevrey weights. Lemma 6. A weight of the form (3.6) is submultiplicative if h is an increasing concave function. Proof. Indeed, one can easily see that if h : [n0 , ∞) → R is an increasing concave function, then there exists an increasing concave function h1 : [0, ∞) → [0, ∞) such that h1 (n) = h(n) + C for n ≥ n0 , n ∈ N. Then the weight is equivalent to the weight 1 (·) = exp(h1 (·)), so it is enough to show that 1 is submultiplicative. On the other hand, since h1 is concave we have for m, n > 0, h1 (0) + h1 (m + n) ≤ h1 (m) + h1 (n), which implies (in view of (3.3) and (3.6)) that the weight 1 is submultiplicative.

2. The next lemma brings our attention to a class of rapidly increasing submultiplicative weights of the form (3.7). In particular, this class contains the Gevrey weights (3.9).

154

P. Djakov, B. Mityagin

Lemma 7. Suppose ϕ : [0, ∞) → [0, ∞), ϕ(0) = 0, is a twice differentiable function such that the following conditions hold: ϕ (t) ∞ as t ∞;

(3.10)

et /ϕ (t) ∞ as t ∞.

(3.11)

(a) Let ψ(s) be the Young dual function of ϕ, i.e. ψ(s) = sup[st − ϕ(t)],

s ≥ 0.

(3.12)

t≥0

Then ek :=

1 exp(ψ (k)) ∞ as k ∞. k

(3.13)

ϕ (t) − ϕ (t) > 1, log ϕ (t)

(3.14)

(b) In addition, if lim inf t→∞

then ∃p ∈ N, τ > 1 :

k

τ

ek epk

k ≤ 1 for k ≥ k0 .

(3.15)

Proof. (a) Since (st −ϕ(t))t = s −ϕ (t) one can easily see, by (3.10), that the expression st − ϕ(t) attains its maximum at the point t (s) = (ϕ )−1 (s),

(3.16)

ψ(s) = st (s) − ϕ(t (s)).

(3.17)

thus

The function s → t (s) is increasing because ϕ is increasing. From the identity ϕ (t (s)) = s and (3.17) it follows that ψ (s) = t (s) + st (s) − ϕ (t (s))t (s) = t (s),

(3.18)

ψ (s) = t (s) = 1/ϕ (t (s)).

(3.19)

Therefore, (3.11) implies that

ek = eψ (k) /k = et (k) /ϕ (t (k)) ∞. (b) One can easily see that (3.15) is equivalent to epk k ∃p ∈ N : lim inf > 1. log k log k ek

(3.20)

1D Dirac Operators

155

By (3.13), we have log ek = ψ (k) − log k, and therefore, (3.16) and (3.19) imply that epk log = [ψ (pk) − log(pk)] − [ψ (k) − log k] ek p p 1 1 1 =k ψ (uk) − du = k − du. uk ϕ [t (uk)] ϕ [t (uk)] 1 1 For large enough k it follows from (3.14) and (3.16) that uk = ϕ [t (uk)] > ϕ [t (uk)], so 1 1 ϕ [t (uk)] − ϕ [t (uk)] − > . ϕ [t (uk)] ϕ [t (uk)] u2 k 2 Thus (again, by (3.14)) we obtain that p epk k ϕ [t (uk)] − ϕ [t (uk)] 1 > log · 2 du log k ek log ϕ [t (uk)] u 1

(3.21)

for large enough k. Let > 1 be the liminf in (3.14). Choose p ∈ N so that +1 2 (1 − 1/p) > 1. Since ( + 1)/2 < there exists k0 such that for k ≥ k0 , (3.21) holds and there the integral is greater than p +1 1 +1 · 2 du = (1 − 1/p) > 1. 2 u 2 1 This completes the proof of the lemma.

Remark. Obviously, if ϕ satisfies the condition ϕ (t) − ϕ (t) =∞ t→∞ log ϕ (t) lim

(3.22)

then (3.14) holds. One can easily see that the Gevrey weights (3.9) satisfy (3.22). Now we present a family of weights that satisfy (3.14) but don’t satisfy (3.22). Consider the weights (3.7) generated by t ϕ(t) = eω(u) du, 0

where ω(u) = βu − (1 − β) cos u + αue−βu ,

α > 1, β ∈ (0, 1).

Then ϕ (t) = eω(t) ,

ϕ (t) = eω(t) β + (1 − β) sin t + α(1 − βt)e−βt ,

so ϕ (t) − ϕ (t) eω(t) = (1 − β)(1 − sin t) + α(βt − 1)e−βt log ϕ (t) ω(t)

(3.23)

156

P. Djakov, B. Mityagin

which is greater than α

eω(t) βt − 1 (βt − 1)e−βt = α exp[(β − 1) cos t + αte−βt ]. ω(t) ω(t)

(3.24)

Let (tk ) be a sequence of positive numbers such that tk → ∞. Observe that if lim inf k (1− sin(tk )) > 0 then the expression (3.23) with t = tk goes to ∞ as k → ∞, while whenever limk (1 − sin(tk )) = 0 the expression (3.24) with t = tk tends to α. On the other hand for t = tk = (4k + 1)π/2, k = 1, 2, . . . the expressions (3.23) and (3.24) coincide. By these observations it is easy to see that lim inf [ϕ (t) − ϕ (t)]/ log ϕ (t) = α. t→∞

Since α > 1, the inequality (3.14) holds, while (3.22) fails. 3. We say that a sequence of positive numbers (Mk )∞ k=0 is a Carleman sequence if M0 = 1,

Mk / (kMk−1 ) ∞.

(3.25)

We attach to any Carleman sequence (Mk ) the following sequences: m0 = 1,

mk = Mk /Mk−1 ,

e0 = 1,

ek = mk /k, k ≥ 1.

(3.26)

We set also E0 := e0 ,

Ek = e1 . . . ek = Mk /k!,

k ≥ 1.

(3.27)

Observe that if a sequence (ek )∞ k=0 satisfies the condition ek ∞, then it generates a corresponding Carleman sequence Mk = k!Ek with Ek defined by (3.27). Suppose ϕ ∈ (3.7) is a weight that grows faster than any power of n. For a technical reason we need to characterize the relation x = (xn ) ∈ 2 () by the sequence of 1 -norms xk = x0 + |xn ||n|k , k = 1, 2, . . . . (3.28) It turns out that this can be done in terms of an appropriate Carleman sequence generated by the function ϕ. For every function ϕ such that (3.10), (3.11) and (3.14) hold we denote by (Mk (ϕ)) the Carleman sequence generated by the formula mk (ϕ) = exp(ψ (k)),

k = 1, 2, . . . ,

that is Mk (ϕ) = exp(ψ (1) + · · · + ψ (k)),

k = 1, 2, . . . ,

where ψ is the Young dual function of ϕ. We may assume, without loss of generality, that the function ϕ is defined on [0, ∞), and moreover, that the condition ϕ (t) − ϕ (t) > 0 holds for t ≥ 0 (since otherwise one can consider an equivalent weight generated by a suitable function ϕ). ˜ Moreover, the condition (3.11) implies that the weight ϕ is submultiplicative. Indeed, since ϕ (n) = exp[ϕ(log |n|)] = eψ(|n|)

with

ψ(s) = ϕ(log s),

1D Dirac Operators

157

we obtain, in view of (3.11), that the derivative ϕ(log s) ϕ(log s) = s elog s is decreasing, so ψ(s) is a concave function. Thus, by Lemma 6, the weight ϕ is submultiplicative. ψ (s) =

Lemma 8. If ϕ satisfies (3.10), (3.11), (3.14) and ϕ (|n|) = exp(ϕ(log |n|)) then we have (a)

x = (xn ) ∈ 2 (ϕ ) ⇒ xk ≤ CMk (ϕ1 ),

ϕ1 (t) = ϕ(t) − t;

(3.29)

(b)

xk ≤ CMk (ϕ) ⇒ x = (xn ) ∈ 2 (ϕ2 ),

ϕ2 (t) = ϕ(t) − 4t.

(3.30)

Proof. (a) Observe that (with ψ(0) = 0 ) ψ(k) ≤ ψ (1) + · · · + ψ (k) ≤ ψ(k + 1). Therefore sup n

|n|k = exp sup k log |n| − ϕ(log |n|) ϕ (n) n ≤ exp(ψ(k)) ≤ Mk (ϕ) = exp(ψ (1) + · · · + ψ (k)).

If x = (xn ) ∈ 2 (ϕ ) then, with = ϕ , by the Cauchy inequality we obtain xk = |xn ||n|k = |xn |ϕ (n) |n|k / ϕ (n) 1/2 1 |n|k+1 2 |n|k ≤ x2 (ϕ ) · ≤ C sup ≤ CMk (ϕ1 ), n2 ϕ (n) ϕ (n)/n n where ϕ1 (t) = ϕ(t) − t. (b) Suppose that xk ≤ CMk (ϕ); then for large |n| we have xk |n|k ≥ |xn | sup Mk k k Mk ≥ |xn | exp sup k log |n| − (ψ (1) + · · · + ψ (k)) k ≥ |xn | exp sup k log |n| − ψ(k + 1) k −2 ≥ |xn ||n| exp sup s log |n| − ψ(s) = |xn ||n|−2 exp(ϕ(log |n|)),

C ≥ sup

s>0

that is |xn

||n|−2

ϕ (n)

≤ C. Therefore 1 ϕ (n) |xn | 4 ≤ C < ∞, n n2 n n

which implies that x = (xn ) ∈ 1 (ϕ2 ) ⊂ 2 (ϕ2 ) with ϕ2 (t) = ϕ(t) − 4t.

158

P. Djakov, B. Mityagin

Lemma 9. Suppose (ek )∞ k=1 is a sequence of positive numbers such that ek ↑ ∞ and let E0 = 1,

Ek =

k

k ≥ 1.

ej ,

1

Then the following implications hold: sup k τ (ek /epk )k < ∞

∃ p ∈ N, τ > 0 :

⇒

sup k τ (Ek )2 /E2k < ∞;

k ∞

(ek /epk )k < ∞

⇒

k=1 ∞

(3.31)

k ∞

(Ek )2 /E2k < ∞;

(3.32)

k=1

(Ek )2 /E2k < ∞

⇒

m Ej Em−j

Q := sup m

k=1

j =0

Em

< ∞,

(3.33)

and moreover, sup

m s +···+s µ 0

Es0 . . . Esµ < Qµ , E m =m

µ = 1, 2, . . . .

(3.34)

Remark. This lemma is a “multidimensional” version of the statements on p. 164 in [3]. It improves Lemma 5 on p. 251 in [4], where we can now omit the factor k p−2 in the hypothesis (5.7). Proof. If k = pν + r with 0 ≤ r < p, then we have (Ek )2 e1 . . . eν eν+1 . . . ek e1 . . . eν = · ≤ ≤ E2k ek+1 . . . ek+ν ek+ν+1 . . . e2k ek+1 . . . ek+ν

eν epν

ν

(because ei < ej for i < j ). Thus (3.31) and (3.32) hold. To prove (3.33) and (3.34) let us consider the sums Tm =

m Ej Em−j j =0

Em

.

Then Ej Em−j Ej Em+1−j ≤ , Em+1 Em

0 ≤ j ≤ m,

because (3.35) is equivalent to em+1−j ≤ em+1 , which holds since the sequence (ek ) is increasing. By symmetry Tm = 2

0≤j ≤m/2

Ej Em−j − δm , Em

(3.35)

1D Dirac Operators

159

where

δm =

0 , m = 2n + 1 En2 /E2n , m = 2n.

The next sum is

Tm+1 = 2

0≤j ≤m/2

Ej Em+1−j + δm+1 , Em+1

and (3.35) implies that Tm+1 ≤ Tm + δm + δm+1 . Therefore, we have Tm ≤ 2 +

∞

(δk + δk+1 ) = 2 + 2

∞

En2 /E2n < ∞,

n=1

k=1

thus (3.33) holds. Now we prove (3.34) by induction in µ. Let us denote by Sµ (m) the set of all (µ+1)tuples of integers s = (s0 , . . . , sµ ) such that 0 ≤ si ≤ m and |s| = s0 + · · · + sµ = m, i.e., Sµ (m) = {s = (s0 , . . . , sµ ) :

0 ≤ si ≤ m, |s| = m}.

(3.36)

By (3.33), the inequality (3.34) holds for µ = 1. Assume that (3.34) holds for some µ ≥ 1. Then we have   m Es0 . . . Esµ+1 Es0 . . . Esµ Esµ+1 Em−sµ+1   = Em Em−sµ+1 Em sµ+1 =0

s∈Sµ+1 (m)

≤ Qµ

s∈Sµ (m−sµ+1 )

m Esµ+1 Em−sµ+1

Em

sµ+1 =0

This proves (3.34).

≤ Qµ+1 .

4. The next statement (Lemma 10) has as its prototypes Lemma 6 in [3] and Theorem 3 in [4] (see also the proof of Prop. 4 there). But, influenced by the proof of Lemma 1.1 in [2], now we use “maxima” instead of “sums” in the statement, which makes the lemma more convenient for applications. The proof of Lemma 10 uses the same idea that was used to prove its prototypes, but it is simpler. Lemma 10. Let (fk )∞ k=1 be a sequence of positive numbers such that fk ∞,

(3.37)

and let F0 = 1,

Fk =

k j =1

fj ,

k = 1, 2, . . . .

(3.38)

160

P. Djakov, B. Mityagin

If T > 0 and (Xk )∞ k=0 is a sequence of positive numbers such that X0 = 1 and Fs0 . . . Fsµ Xk ≤ max T , sup max (3.39) Xs0 . . . Xsµ , k ≥ 2, Fk µ si <|s|=k where s = (s0 , . . . , sµ ) and |s| = s0 + · · · + sµ , then the sequence (Xk ) is bounded. Proof. For convenience the proof is subdivided into 3 steps. Step 1. We may assume without loss of generality that X1 ≤ T /F1 ,

1 ≤ T /F2

(3.40)

(otherwise T could be replaced by a larger constant). Let k0 = min{k ≥ 2 : T > T k+1 /Fk+1 }.

(3.41)

It is easy to see, by (3.37) and (3.38), that T k /Fk → 0 as k → ∞; thus, k0 is well defined, and moreover, in view of (3.40) and (3.41), we have T ≤

Tk Fk

2 ≤ k ≤ k0

for

and T >

Tk Fk

for

k ≥ k0 + 1,

(3.42)

and for k > k0 .

fk > T

(3.43)

Step 2. Claim. The following inequalities hold: Xk ≤ T k /Fk ,

k = 0, 1, . . . , k0 .

(3.44)

We prove (3.44) by induction. In view of (3.38) and (3.40) our claim holds for k = 0, 1. Let Ps =

Fs0 . . . Fsµ Xs0 . . . Xsµ , F|s|

s = (s0 , . . . , sµ ).

(3.45)

Assume that (3.44) holds for k = 1, . . . , m for some m with 1 ≤ m < k0 . Then, for each µ and for each (µ + 1)-tuple s = (s0 , . . . , sµ ) ∈ Sµ (m + 1), we have by (3.44), Ps ≤

Fs0 . . . Fsµ T s0 T sµ T m+1 · ··· = . Fm+1 Fs0 Fsµ Fm+1

By (3.42) T m+1 /Fm+1 ≥ T ; thus, (3.39) implies that Xm+1 ≤ T m+1 /Fm+1 , i.e., (3.44) holds for k = m + 1. The claim is proven. Step 3. Here we show that Xk ≤ T

for k ≥ k0 + 1.

(3.46)

For technical convenience we prove also that Ps < T

for s = (s0 , . . . , sµ ) with

sj < |s| = k,

k ≥ k0 + 1.

(3.47)

1D Dirac Operators

161

Observe that in view of (3.39), if the inequalities (3.47) hold for some k, then (3.46) holds for the same k also. We are proving (3.46) and (3.47) by induction for k ≥ k0 + 1. Let k = k0 + 1. For each (µ + 1)-tuple s = (s0 , . . . , sµ ) ∈ Sµ (k0 + 1), with sj < k0 + 1, we obtain, by (3.44) and (3.42), that Ps ≤

Fs0 . . . Fsµ T s0 T sµ T k0 +1 · ... = < T. Fk+1 Fs0 Fsµ Fk0 +1

Thus, (3.47), and of course (3.46), hold for k = k0 + 1. Let m ≥ k0 + 1; assume that (3.46) and (3.47) hold for every k, k0 + 1 ≤ k ≤ m. Then, we claim that (3.46) and (3.47) hold for k = m + 1. Indeed, fix any (µ + 1)-tuple s = (s0 , . . . , sµ ), with |s| = m + 1 and sj < m + 1. There are several cases: (a) If sj ≤ k0 for every j = 0, . . . , µ, then the numbers Xsj satisfy the estimates (3.44). Thus, one can easily see (as in the proof for k = k0 + 1 ) that Ps ≤ T m+1 /Fm+1 < T , so in this case (3.47) holds. (b) Suppose that there exists j with sj > k0 , say j = 0. (Since a transposition of s0 , . . . , sµ does not change Ps , one may assume without loss of generality that j = 0.) Then we have two subcases: (b1) where m+1−s0 ≤ k0 , and (b2) where m+1−s0 > k0 . In the subcase (b1) we estimate Ps by using (3.46) for Xs0 and (3.44) for Xs1 , . . . , Xsµ . Since T < fk for k > k0 by (3.43), we obtain that Ps ≤

Fs0 Fs1 . . . Fsµ T s1 T sµ T m+1−s0 ·T · ··· =T · < T; Fm+1 Fs1 Fsµ fs0 +1 . . . fm+1

thus, (3.47) holds for k = m + 1. In the case (b2) we have Xs Fs Fm+1−s0 Ps = 0 0 Fm+1

Fs1 . . . Fsµ Xs1 . . . Xsµ . Fm+1−s0

The expression in the brackets equals Ps˜ with s˜ = (s1 , . . . , sµ ), |˜s | = m + 1 − s0 . Thus, by the inductive assumption Ps˜ < T . Since Xs0 < T (by (3.46) with k = s0 ) we have T Fs0 Fm+1−s0 Ps < · T, Fm+1 so it remains to show that the expression in the square brackets does not exceed 1. By (3.42) Fk0 ≤ T k0 −1 , and therefore, Fs0 = Fk0 fk0 +1 . . . fs0 ≤ T k0 −1 fk0 +1 . . . fs0 . Thus, T Fs0 Fm+1−s0 T k0 fk0 +1 . . . fs0 ≤ < 1, Fm+1 fm+2−s0 . . . fm+1 because, due to (3.37) and (3.43), each factor in the numerator of the latter fraction is strictly less than the corresponding factor in the denominator.

162

P. Djakov, B. Mityagin

4. Basic Results; Estimates on the Smoothness of the Potential in Terms of the Decay Rate of Spectral Gaps 1. Our main result is the following statement. Theorem 11. Let L be a self-adjoint Dirac operator given by (1.1), with a potential function P ∈ L2 ([0, 1])), P (x) = p(2n)e2πinx . Let be a submultiplicative weight (see (3.1) and (3.3)) such that either is slowly increasing (i.e., (3.4) holds), or is a rapidly increasing weight of the form (n) = exp(ϕ(log |n|)), where ϕ has the properties (3.10), (3.11) and (3.14). Then − 2 2 |λ+ ⇒ |p(2n)(2n)|2 < ∞. (4.1) n − λn | ((2n)) < ∞ n∈Z

n∈Z

An implication in the opposite direction is given by Theorem 12 below; see further comments in Sect. 5.1. Theorem 12. Let L be a self-adjoint given by (1.1), with a potential Dirac operator function P ∈ L2 ([0, 1])), P (x) = p(2n)e2πinx . If is a submultiplicative weight, then − 2 2 |p(2n)|2 ((2n))2 < ∞ ⇒ |λ+ (4.2) n − λn | ((2n)) < ∞. Proof. By Theorem 5, for large enough |n|, − γn = λ+ n − λn 2|βn (zn )| with

|zn | ≤ π/2,

(4.3)

where, in view of (2.42) and (2.36)–(2.38), βn (n, zn ) = p(−2n) +

∞

21 S2ν (n, zn ).

(4.4)

ν=1

Therefore, by (2.25), we have |βn (zn )| ≤ |p(−2n)| +

∞ ∞ 21 σν (n, r), S2ν (n, zn ) ≤ |r(2n)| + ν=1

(4.5)

ν=1

where r = (r(m))m∈Z , and σν (n, r) =

j1 ,... ,j2ν =n

r(m) = max(|p(m)|, |p(−m)|),

(4.6)

r(n + j1 )r(−j1 − j2 )r(j2 + j3 ) . . . r(−j2ν−1 − j2ν )r(j2ν + n) . |n − j1 ||n − j2 | . . . |n − j2ν | (4.7)

Consider the operator σ :

r = (r(m)) ∈ 2 (Z) → (σ (n, r)) ∈ 2 (Z),

(4.8)

where σ (n, r) =

∞

σν (n, r).

(4.9)

ν=1

Thus, in view of (4.3)–(4.9), the following statement completes the proof of Theorem 12.

1D Dirac Operators

163

Proposition 13. If is a submultiplicative weight, then for each sequence of nonnegative numbers, |r(2n)|2 ((2n))2 < ∞ ⇒ |σ (n, r)|2 ((2n))2 < ∞. (4.10) Proposition 13 is proven in Sect. 4.2 as a corollary of some basic properties of the operator σ. Proof of Theorem 11. The proof of Theorem 11 follows from the properties of the operator σ also, but it is much more complicated. Set ζ (n) = |β(n, zn )|; then Theorem 11 will be proven if we show that |p(2n)|2 ((2n))2 < ∞. |ζ (n)|2 ((2n))2 < ∞ ⇒

(4.11)

Under the above notations we have, by (4.4), that |p(−2n)| ≤ |βn (zn )| +

∞ ∞ 21 σν (n, r). S2ν (n, zn ) ≤ |ζ (n)| + ν=1

(4.12)

ν=1

In the same way, changing n to −n one can see that ∞ ∞ 21 |p(2n)| ≤ |β−n (z−n )| + σν (n, r). S2ν (−n, z−n ) ≤ |ζ (−n)| + ν=1

(4.13)

ν=1

Thus, by (4.12) and (4.13), we obtain, with ξ(n) = max(ζ (n), ζ (−n)), r(2n) ≤ ξ(n) +

∞

σν (n, r) = ξ(n) + σ (n, r).

ν=1

Thus, in view of the above discussion, Theorem 11 would be proven if we prove the following statement. Theorem 14. Let be a submultiplicative weight (see (3.1) and (3.3)) such that either slowly increasing (i.e., (3.4) holds), or is a rapidly increasing weight of the form (n) = exp(ϕ(log n)), where ϕ has the properties (3.10), (3.11) and (3.14). If ξ = (ξ(m))m∈Z and r = (r(m))m∈Z are two sequences of non-negative numbers such that r ∈ 2 (Z),

r(m) = 0 for odd m,

r(2n) ≤ ξ(n) + σ (n, r),

|n| ≥ n∗ ,

(4.14) (4.15)

then n∈Z

|ξ(n)(2n)|2 < ∞ ⇒

n∈Z

|r(2n)(2n)|2 < ∞.

(4.16)

164

P. Djakov, B. Mityagin

The remaining part of this section is devoted to the proof of Theorem 14. Some of the steps of this proof are interesting by themselves (e.g., Lemma 15 and Proposition 16 give a proof of Proposition 13). Therefore, the claims that follow below are formulated and proven as independent statements, although they are steps in the proof of Theorem 14. 2. Throughout the paper we assume that the weights are submultiplicative. The following property of the operator σ (n, r) reveals why this assumption is so important. Lemma 15. If is a submultiplicative weight such that (k + m) ≤ (k)(m) ∀k, m ∈ Z, (i.e., (3.3) holds with C = 1) then, for each sequence of non-negative numbers r = (r(m))m∈Z , σ (n, r)(2n) ≤ σ (n, r˜ ) where r˜ = (r(m)(m))m∈Z .

(4.17)

Proof. Since the weight is submultiplicative, we have, for each 2ν-tuple (j1 , . . . , j2ν ), that (2n) ≤ (n + j1 )(−j1 − j2 )(j2 + j3 ) · · · (−j2ν+1 − j2ν )(j2ν + n), and therefore, r(n + j1 )r(−j1 − j2 ) · · · r( j2ν + n)(2n) ≤ r˜ (n + j1 )˜r (−j1 − j2 ) · · · r˜( j2ν + n). Thus, in view of (4.7), we obtain σν (n, r)(2n) ≤ σν (n, r˜ )

ν = 1, 2, . . . ,

so, by (4.9), σ (n, r)(2n) =

∞

σν (n, r)(2n) ≤

ν=1

∞

σν (n, r˜ ) = σ (n, r˜ ).

ν=1

Next, we use the properties of the operator σ to prove the following crucial estimate. Proposition 16. Under the above notations |n|≥N

|σ (n, r)|2 ≤

2 + (R(N ))2 , N

N > N ∗,

(4.18)

where R(N) :=

|n|≥N

|r(n)|2 .

(4.19)

1D Dirac Operators

165

Proof. By (4.7), the sequence (σ (n, r)) is the sum of the sequences (σν (n, r)), and therefore, by the triangle inequality for 2 -norms, we have 1/2  1/2  ∞ 2 2   |σ (n, r)|  ≤ |σν (n, r)|  . (4.20) |n|≥N

To estimate in (4.7)

|n|≥N

|n|≥N

ν=1

|σν (n, r)|2 , for fixed ν ∈ N, we divide the set of summation indices j1 , . . . , j2ν = n}

J (n) = {j = (j1 , . . . , j2ν ) : into several subsets by setting

a = {α = (α1 , . . . , α2ν ) : αs ∈ {0, 1}}, and

J (n) = (j1 , . . . , j2ν ) ∈ J (n) : α

|α| = α1 + · · · + α2ν ,

|n − js | ≤ |n|/2 if αs = 0 |n − js | > |n|/2 if αs = 1 .

Notice that card (a) = 22ν . Then J (n) =

J α (n), α∈a

so

··· =

··· ,

α∈a j ∈J α (n)

J (n)

and therefore, the triangle inequality implies that  1/2    |σν (n)|2  ≤  |n|≥N

α∈a

2 1/2  · · ·  . |n|≥N j ∈J α (n)

(4.21)

By the Cauchy inequality, |n|≥N

 

2

··· ≤

Aα (n)Bα (n),

(4.22)

|n|≥N

j ∈J α (n)

where Aα (n) =

j ∈J α (n)

 ≤

|n−k|≤|n|/2

2ν−|α|  1   (n − k)2

1 (n − j1 )2 . . . (n − j2ν )2

|n−k|>|n|/2

(4.23)

|α| 2 2ν−|α| |α| 1 4  ≤ π , (n − k)2 3 N

166

P. Djakov, B. Mityagin

and

Bα (n) =

|r(n + j1 )|2 |r(−j1 − j2 )|2 · · · |r(j2ν + n)|2 .

(4.24)

j ∈J α (n)

In order to estimate

|n|≥N

Bα (n) we change the indices of summation to

i1 = n + j1 , i2 = −j1 − j2 , . . . , i2ν = −j2ν−1 − j2ν , i2ν+1 = j2ν + n. Then

Bα (n) ≤

|n|≥N

|r(i1 )|2 · · · |r(i2ν+1 )|2 ,

(4.25)

i∈I α

where I α = I α (N ) is the set of indices i = (i1 , . . . , i2ν+1 ) given by I α = I1 (α) × · · · × I2ν+1 (α), where Is (α) =

Z {is : |is | ≥ N }

if αs = 1 if αs = 0

for

s = 1, 2ν + 1

and Is (α) =

Z {is : |is | ≥ N}

if αs−1 = 1 or αs = 1 , if αs−1 = 0 and αs = 0

2 ≤ s ≤ 2ν.

Indeed, α1 = 0 (or α2ν+1 = 0) means that |n − j1 | ≤ |n|/2 (respectively, |n − j2ν | ≤ |n|/2). Thus, |i1 | = |n + j1 | = |2n − (n − j1 )| ≥ |2n| − |n|/2 > |n| ≥ N, and the same argument shows that |i2ν+1 | ≥ N. Fix an s such that 2 ≤ s ≤ 2ν. If αs−1 = αs = 0 then |n − js−1 | ≤ |n|/2,

|n − js | ≤ |n|/2;

thus, |is | = |js−1 + js | = |2n − (n − js−1 ) − (n − js )| ≥ |2n| − 2(|n|/2) ≥ |n| ≥ N. Now we have |n|≥N

Bα (n) ≤

2ν+1

|r(is )|2 ≤ (R(N ))γ (α) (r2 )2ν+1−γ (α) ,

(4.26)

s=1 is ∈Is (α)

where γ (α) := card{s : Is (α) = Z} ≥ 2ν + 1 − 2|α|.

(4.27)

1D Dirac Operators

167

Indeed, one can easily see, by the definition of Is (α), that γ (α) = (1 − α1 ) + (1 − α2ν ) +

2ν

(1 − αs )(1 − αs−1 )

s=2 2ν

≥ (1 − α1 ) + (1 − α2ν ) +

(1 − αs − αs−1 ) = 2ν + 1 − 2|α|.

s=2

Taking into account (4.22), (4.23), (4.26) and (4.27), we obtain  2 2 2ν 2ν+1−γ (α) 2 2|α| π   ··· ≤ (4.28) √ (R(N ))γ (α) r2 3 N α |n|≥N

j ∈J (n)

≤ K 2ν+1 (ρ(N ))2|α|+γ (α)) ≤ K 2ν+1 (ρ(N ))2ν+1 , where 2 ρ(N ) = √ + R(N), N

K=

π2 (r2 + 1). 3

Obviously ρ(N ) → 0 as N → ∞, so there is N ∗ such that −1 ρ(N ) < 256K 3 for N ≥ N ∗ .

(4.29)

Since card(a) = 22ν , the inequalities (4.21), (4.28) and (4.29) imply, for N ≥ N ∗ , that  1/2 ∞ ∞  |σν (n, r)|2  ≤ 4ν (Kρ(N ))ν+1/2 ν=1

|n|≥N

ν=1

≤ 4(Kρ(N ))3/2

∞

2−ν ≤ 8(Kρ(N ))3/2 ≤

ν=0

1 ρ(N ). 2

Thus, by (4.20),

|σ (n.r)|2 ≤

|n|≥N

which completes the proof.

1 2 (ρ(N ))2 ≤ + (R(N ))2 , 4 N

Proof of Proposition 13. Suppose that is a submultiplicative weight (we may assume that (3.3) holds with C = 1) and r = (r(n))n∈Z is a sequence of non-negative numbers such that r(m) = 0 for odd m and (4.30) (r(2n)(2n))2 < ∞. Lemma 15 implies that σ (n, r)(2n) ≤ σ (n, r˜ ),

where

r˜ = (r(m)(m)) .

(4.31)

168

P. Djakov, B. Mityagin

Therefore, in view of (4.30), we have r˜ ∈ 2 (Z); thus, by Proposition 16, there exists N∗ > 0 such that 2  2 + |˜r (n)|2  < ∞. (σ (n, r˜ ))2 ≤ N |n|≥N∗

|n|≥N∗

Thus, by (4.31), we obtain that (σ (n, r)(2n))2 ≤ (σ (n, r˜ ))2 < ∞, which proves Proposition 13. 3. Two elementary lemmas. ∞ Lemma 17. If (B(n))∞ 1 and (R(n))1 are decreasing sequences of positive real numbers such that

B(n) 0,

R(n) 0,

R(2n) ≤ C1 B(n) + C1 (R(n))2 ,

(4.32)

C1 > 0, n = 1, 2, . . . ,

(4.33)

and B(n) ≤ C2 B(2n),

C2 > 0, n = 1, 2, . . . ,

(4.34)

then there exists a constant C > 0 such that R(2n) ≤ CB(n),

n = 1, 2, . . . .

(4.35)

Proof. By (4.32) there exists n1 such that R(n) <

1 2C1 C2

for n ≥ n1 .

Therefore, by (4.33) and (4.34), we obtain R(4n) 1 R(2n) 1 R(2n) ≤ C1 + ≤ C1 + , B(2n) 2C2 B(2n) 2 B(n)

n ≥ n1 .

(4.36)

Consider the sequence Xk = R(2k+1 n1 )/B(2k n1 ),

k = 1, 2, . . . .

(4.37)

From (4.36) it follows that 1 Xk+1 ≤ C1 + Xk , 2

k = 1, 2, . . . .

One can easily derive from (4.38), by induction, that Xk+1 ≤ C1

k j =0

2−j + 2−k X1 ;

(4.38)

1D Dirac Operators

169

thus, the sequence (Xk ) is bounded: Xk ≤ 2C1 + X1 .

(4.39)

Fix an arbitrary n ≥ n1 . Then, for some k ≥ 0, we have 2k n1 ≤ n < 2k+1 n1 . Since R(m) is decreasing, (4.34), (4.37) and (4.39) yield R(2n) ≤ R(2k+1 n1 ) = Xk B(2k n1 ) ≤ Xk C2 B(2k+1 n1 ) ≤ CB(n), with C = C2 (2C1 + X1 ), i.e., (4.35) holds.

The next lemma explains that, due to Abel’s transform, a sequence (x(n)) ∈ 1 belongs to a weighted 1 -space generated by a weight T (n) if and only if the sequence (X(N )), X(N) = |x(n)|, |n|≥N

belongs to the weighted 1 -space generated by the weight T (N ) − T (N − 1). Lemma 18. If (T (n)), n ∈ Z is a weight sequence then the following conditions are equivalent:

(i)

|x(n)|T (n) < ∞;

n

(ii)

(ii.a) (ii.b)

X(N )T (N ) → 0 as N → ∞, n X(n)[T (n) − T (n − 1)] < ∞.

X(N ) =

|n|≥N

|x(n)|;

Proof. (i) ⇒ (ii). If (i) holds, then X(N )T (N ) ≤

|x(n)|T (n) → 0;

|n|≥N

thus, part (a) of (ii) holds. Moreover, if 0 < M < N then M≤|n|≤N

|x(n)| T (n) =

N

(|x(−n)| + |x(n)|)T (n)

(4.40)

n=M

= X(M)T (M) − X(N + 1)T (N ) +

N

X(n) (T (n) − T (n − 1)) .

n=M+1

By (i) the left-hand side of the above identity goes to 0 as M → ∞. Since X(N + 1)T (N ) ≤ X(N + 1)T (N + 1) → 0,

170

P. Djakov, B. Mityagin

we have, by part (a) of (ii), N

X(n) (T (n) − T (n − 1)) → 0

as

M → ∞;

n=M+1

thus, the series in (ii.b) satisfies the Cauchy convergence condition. The implication (ii) ⇒ (i) follows from (4.40) also. Indeed, by part (ii.a), X(M)T (M) → 0 as M → ∞; thus, the Cauchy convergence condition for the series in (ii.b) implies the Cauchy convergence condition for the series in (i). 4. Now we prove Theorem 14 in the case where is a slowly increasing submultiplicative weight. Proposition 19. Suppose is a slowly increasing (i.e. ∈ (3.4)) submultiplicative weight. Recall that the operator σ is defined by (4.8), (4.9) and (4.7). If r = (r(n))n∈Z and ξ = (ξ(n))n∈Z are two sequences of non-negative numbers such that r(n) = 0 for odd n,

r ∈ 2 ,

and r(2n) ≤ ξ(n) + σ (n, r), then

|ξ(n)|2 ((2n))2 < ∞

⇒

n

|r(2n)|2 ((2n))2 < ∞.

(4.41)

(4.42)

n

Proof. Since the weight is slowly increasing we have, by (3.5), that ∃a > 0 :

(m) ≤ |m|a

for |m| > 1.

(4.43)

For convenience the proof is divided into two steps. Step 1. Proof of the claim in the case where a < 1/4. By (4.41), |r(2n)|2 ≤ 2 |ξ(n)|2 + 2 |σ (n, r)|2 , |n|≥N

|n|≥N

(4.44)

|n|≥N

and therefore, by Proposition 16, we have for N ≥ 4 (since ((N ))2 ≤ N 1/2 ≤ N/2), R(2N ) ≤ 2X(N) + where X(n) =

2 + 2(R(N ))2 , ((N ))2

|ξ(n)|2 .

|n|≥N

On the other hand, we have εN := X(N )((N ))2 ≤

|n|≥N

|ξ(n)|2 ((n))2 → 0,

(4.45)

(4.46)

1D Dirac Operators

171

and therefore, X(N) = εN /((N ))2

with

εN → 0.

(4.47)

Consider the sequence (B(N )) given by B(N) := X(N) +

1 . ((N ))2

(4.48)

Since the weight is slowly increasing, (4.47) and (4.48) imply that sup B(N )/B(2N ) < ∞,

˜ B(N ) ≤ C/((N ))2 .

(4.49)

N

By (4.45), we have R(2N ) ≤ B(N) + 2(R(N ))2 ; thus, in view of (4.49), Lemma 17 gives us that R(2N ) ≤ C1 B(N) ≤ C1

C˜ . ((N ))2

(4.50)

On the other hand, by (4.44) and Proposition 16, we obtain R(2N ) ≤ 2X(N) + 4/N + 2(R(N ))2 .

(4.51)

Notice that (4.43), with a < 1/4, implies ((N ))4 /N → 0. Thus, since is slowly increasing weight, (4.50) and (4.51) yield R(2N ) ≤ 2X(N) +

C2 . ((N ))4

(4.52)

Now (4.47) and (4.52) imply that R(2N )(N )2 → 0

as

N → ∞.

(4.53)

Moreover, (4.52) implies R(2N ) ((N ))2 − ((N − 1))2 < ∞.

(4.54)

N

Indeed, since (ξn ) ∈ 2 (), by Lemma 18 we have that X(N) ((N ))2 − ((N − 1))2 < ∞. N

On the other hand, ((N ))2 − ((N − 1))2 1 1 < ∞; ≤ − ((N ))4 ((N − 1))2 ((N ))2 N

N

thus, (4.54) holds. By (4.53), (4.54) and Lemma 18, we have i.e., (4.42) holds if a < 1/4.

|r(m)|2 ((m))2 < ∞,

172

P. Djakov, B. Mityagin

Step 2. Proof of the claim in the case where a ≥ 1/4. If (m) ≤ |m|a with a ≥ 1/4, then we choose k0 so that a/k0 < 1/4, set k (m) = ((m))k/k0 ,

k = 1, . . . , k0 ,

(4.55)

and prove that the claim holds for k by induction in k. Since 1 (m) ≤ |m|1/4 , by Step 1, the claim holds for k = 1. Assume that r = (r(m)) ∈ 2 (k ) for some k, 1 ≤ k < k0 . Multiplying both sides of (4.41) by k (2n) and using that k is submultiplicative, we obtain r˜ (2n) ≤ ξ˜ (n) + σ (n, r˜ ),

(4.56)

where r˜ (m) = r(m)k (m),

ξ˜ (m) = ξ(m)k (2m),

m ∈ Z.

Since (r(m)) ∈ 2 (k ) and (ξm ) ∈ 2 () we have r˜ = (˜r (m)) ∈ 2 ,

(ξ˜ (m)) ∈ 2 (k0 −k ) ⊂ 2 (1 ).

By Step 1, it follows that (˜r (m)) ∈ 2 (1 ); thus, r = (r(m)) ∈ 2 (k+1 ). Hence, r = (r(m)) ∈ 2 (k ) for k = 1, . . . , k0 . By (4.55), k0 = . This proves Proposition 19. 5. Finally, we prove Theorem 14 for rapidly increasing weights of the form ϕ (|n|)) = exp(ϕ(log |n|)). Proposition 20. Suppose (Mk )∞ k=0 is a Carleman sequence (see (3.25) - (3.27)) such that Mk = k!Ek with √ k(Ek )2 /E2k → 0, (4.57)

∃τ ∈ (0, 1) :

∞

(Ek )2 /E2k

τ

→ 0.

(4.58)

k=1

If r = (r(n))n∈Z and ξ = (ξn )n∈Z are sequences of non-negative numbers such that r(n) = 0 for odd n,

r ∈ 2

and r(2n) ≤ ξ(n) + σ (n, r),

|n| ≥ n∗ ,

(4.59)

where σ is the operator defined by (4.7) - (4.9), then ˜ k ∀k. |ξ |k := |ξn ||2n|k ≤ CMk ∀k ⇒ rk = |r(2n)||2n|k ≤ CM n

n

(4.60)

1D Dirac Operators

173

Proof. By Proposition 19, |ξ |k < ∞

⇒

rk < ∞,

∀k ∈ N.

Set rk = Xk Mk r0 ,

k = 0, 1, 2, . . . .

(4.61)

The lemma will be proven if we show that the sequence (Xk ) is bounded. Let us multiply both sides of (4.59) by |2n|k+1 and sum up over n, |n| ≥ N∗ > n∗ . (Later N∗ will be chosen large enough.) This leads us to the inequality |r(2n)||2n|k+1 + |ξ(n)||2n|k+1 + σ (n, r)|2n|k+1 rk+1 ≤ |n|
|n|≥N∗ ∞

≤ C1 (2N∗ )k+1 + |ξ |k+1 +

|n|≥N∗

|2n|k+1 σν (n, r),

(4.62)

ν=1 |n|≥N∗

where C1 = maxm |r(m)|. Next, we fix ν ∈ N and estimate the sum |2n|k+1 σν (n, r). Sν := |n|≥N∗

Observe that, by (4.7), Sν =

|n|≥N∗

|2n|k+1

j1 ,... ,j2ν =n

r(n + j1 )r(−j1 − j2 ) . . . r(j2ν + n) . |n − j1 ||n − j2 | . . . |n − j2ν |

(4.63)

As in the proof of Proposition 16 we divide the set of indices in the above sum into subsets J α (n), J (n) = {j = (j1 , . . . , j2ν ) : j1 , . . . , j2ν = n} = α∈a

where a is the set of all 2ν-tuples α = (α1 , . . . , α2ν ) with αi ∈ {0, 1}, and |n − js | ≤ |n|/2 if αs = 0 α J (n) = (j1 , . . . , j2ν ) ∈ J (n) : . |n − js | > |n|/2 if αs = 1 By the definition of J α (n) we have |n − j1 | . . . |n − j2ν | ≥ (|n|/2)|α| ≥ (N∗ /2)|α|

for

j ∈ J α (n).

With this estimate for the denominator in (4.63), we have (N∗ /2)−|α| |2n|k+1 r(n + j1 . . . r(j2ν + n). Sν ≤ α∈a

(4.64)

|n|≥N∗ J α (n)

Set a = {α ∈ a : |α| = |α1 + · · · α2ν | ≥ ν},

a = a \ a ,

(4.65)

174

P. Djakov, B. Mityagin

and split the sum in (4.64) into two subsums: ··· = ··· + ··· . α∈a

α∈a

(4.66)

α∈a

First, we estimate α∈a . Taking into account that card (a) = 22ν we obtain · · · ≤ (8/N∗ )ν |2n|k+1 r(n + j1 ) . . . r(j2ν + n). (4.67) α∈a

n j1 ,... ,j2ν

By the multinomial formula, |2n|k+1 = |(n + j1 ) + (−j1 − j2 ) + (j2 + j3 ) + · · · + (j2ν + n)|k+1 k + 1 |n + j1 |s0 · | − j1 − j2 |s1 · · · |j2ν + n|s2ν ; ≤ s |s|=k+1

thus, (4.67) implies ν k+1 8 rs0 rs1 . . . rs2ν ··· ≤ s N∗ |s|=k+1 α∈a ν ν k+1 8 8 2ν rs0 rs1 . . . rs2ν . = (2ν + 1)r0 rk+1 + s N∗ N∗ |s| = k + 1 si < k + 1 Obviously, there exists N1 > 0 such that ∞

(8/N∗ )ν (2ν + 1)r2ν 0 < 1/2

for N∗ > N1 .

ν=1

Thus, we have ∞ ν=1 α∈aν

∞ 8 ν 1 ≤ rk+1 + 2 N∗ ν=1

|s| = k + 1 si < k + 1

k+1 rs0 rs1 . . . rs2ν s (4.68)

aν

a

to show the dependence of on ν). (where the notation is used Next, we estimate the sum α∈a . Consider the new indices i1 = n + j1 , i2 = −j1 − j2 , . . . , i2ν = −j2ν−1 − j2ν , i2ν+1 = j2ν + n.

(4.69)

Such change of the summation indices has been used in the proof of Proposition 16. It is easy to check, by the definition of J α (n), that if j ∈ J α (n) then α1 = 0 ⇒ |i1 | = |n + j1 | > |n|,

α2ν = 0 ⇒ |i2ν+1 | = |j2ν + n| > |n|

and αs−1 = αs = 0 ⇒ |is | = |js−1 + js | > |n|, (see the proof of Proposition 16 for details).

2 ≤ s ≤ 2ν

1D Dirac Operators

175

Let γ (α) denote the number of expressions is in (4.69) such that |is | > n for j ∈ J α (n). Of course, γ (α) is the same function that has been used in the proof of Proposition 16; thus, by (4.27), we have γ (α) ≥ 2ν + 1 − 2|α|. In particular, since |α| ≤ ν − 1 for α ∈ a , we obtain for α ∈ a .

γ (α) ≥ 3

Choose indices s1 , s2 so that the corresponding is1 and is2 in (4.69) satisfy the inequalities |is1 | > |n|,

|is2 | > |n| for j = (j1 , . . . , j2ν ) ∈ J α (n).

Set k1 = k2 =

k+1 for odd k, 2

k1 =

k k , k2 = 1 + for even k. 2 2

(4.70)

Then |n|k+1 ≤ |is1 |k1 |is2 |k2

for j ∈ J α (n).

Thus, by changing the indices of summation according to the formulae (4.69), we have by (4.64), 2 |α| ··· ≤ 2k+1 |is1 |k1 |is2 |k2 r(i1 ) . . . r(i2ν+1 ), N ∗ |n|≥N∗ J α (n) α∈a α∈a ! 2|α| 2 ≤ 2k+1 |is1 |k1 |is2 |k2 r(i1 ) . . . r(i2ν+1 ), (4.71) N ∗ α I (N∗ )

α∈a

α where I α (N∗ ) = I1α × · · · × I2ν+1 with

Isα = {m ∈ Z : and Isα = Z otherwise. Let R(N∗ ) =

|m| > N∗ }

|r(n)|,

if |is | > n ∀j ∈ J α (n),

ρ(N∗ ) =

2/N∗ + R(N∗ ).

(4.72)

|n|>N∗

With these notations, (4.71) implies the inequality ! 2|α| 2 2ν+1−γ (α) ··· ≤ 2k+1 rk1 rk2 | (R(N∗ ))γ (α)−2 r0 N ∗ I α (N∗ ) α∈a α∈a ≤ (ρ(N∗ ))2|α|+γ (α)−2 2k+1 rk1 rk2 |K 2ν−1 , α∈a

where K = max(1, r0 ). Since card a ≤ 2ν ,

2|α| + γ (α) ≥ 2ν + 1,

176

P. Djakov, B. Mityagin

by (4.27), we have

· · · ≤ (2Kρ(N∗ ))2ν−1 2k+1 rk1 rk2 |.

(4.73)

α∈a

From (4.72) it follows that ρ(N∗ ) → 0 as N∗ → ∞, so there exists N2 > 0 such that 2Kρ(N∗ ) < 1/2

for N∗ ≥ N2 .

Thus, (4.73) implies that ∞ ν=1

· · · ≤ 2k+1 rk1 rk2

for N∗ ≥ N2 .

(4.74)

α∈a

Now we sum up the above inequalities. From (4.62), (4.64), (4.68) and (4.74) it follows that for N∗ > max(N1 , N2 ), rk+1 ≤ C1 (2N∗ )k+1 + ξ k+1 + 2k+1 rk1 rk2 ∞ 8 ν k+1 +2 rs0 rs1 . . . rs2ν . N∗ s ν=1 |s| = k + 1 si < k + 1 By substituting the norms of r from (4.61), and estimating from above the norm of ξ by (4.60), we obtain (with Mk = k!Ek , and after dividing with (k + 1)!Ek+1 r0 ): 2C1 k1 !k2 ! Ek1 Ek2 (2N∗ )k+1 · · + 2C + 2k+2 r0 (k + 1)!Ek+1 (k + 1)! Ek+1 ν ∞ 8r20 Es0 . . . Es2ν +2 Xs0 . . . Xs2ν , N∗ Ek+1 ν=1 |s| = k + 1 si < k + 1

Xk+1 ≤

(4.75)

where k1 and k2 are given in (4.70). Obviously, the first term in the above estimate of Xk+1 goes to 0 as k → ∞, so it is bounded. The same holds for the third term. Indeed, if k + 1 is even, say k + 1 = 2m, then k1 = k2 = m, and by the Stirling formula we have, in view of (4.57), that 22m

√ m!m! Em Em · m(Em )2 /E2m → 0. (2m)! E2m

If k + 1 is odd, say k + 1 = 2m + 1, then k1 = m, k2 = m + 1, and we obtain em+1 2m+1 m!(m + 1)! Em Em+1 2m m!m! Em Em 2m + 2 2 → 0, · · · =2 (2m + 1)! E2m+1 (2m)! E2m 2m + 1 e2m+1 because the expression in the square brackets is bounded. Thus, we have 2C1 (2N∗ )k+1 Ek1 Ek2 k+2 k1 !k2 ! D := sup < ∞. · + 2C + 2 · r0 (k + 1)!Ek+1 (k + 1)! Ek+1

(4.76)

1D Dirac Operators

177

By Lemma 9, the assumption (4.58) implies that Es . . . Es τ 0 2ν ∃Q > 0 : sup < Q2ν . E k+1 k |s|=k+1

Therefore, the double sum in (4.75) does not exceed the expression ∞ ν 8r2 Q2 Es0 . . . Es2ν 1−τ 0 2 sup max Xs0 . . . Xs2ν · . Ek+1 N∗ ν≥1 |s| = k + 1 ν=1 si < k + 1 If N∗ > N3 := 40r20 Q2 , then the sum in the above expression is less than 1/4. By (4.76), we have     Es0 . . . Es2ν 1−τ   max Xs0 . . . Xs2ν  . Xk+1 ≤ max 2D, sup   E k+1 ν≥1 |s| = k + 1 si < k + 1 Hence, by Lemma 10 (with T = 2D, Fk = (Ek )1−τ ) we obtain that the sequence (Xk ) is bounded. This completes the proof of Proposition 20. Now we complete the proof of Theorem 14 for weights of the form ϕ (n) = exp(ϕ(|n|)), where ϕ has the properties (3.10), (3.11) and (3.14). Let ξ = (ξ(m))m∈Z and r = (r(m))m∈Z be sequences with non-negative terms such that r(2n) ≤ ξ(n) + σ (n, r), We have to prove that 2 ξ(n)ϕ (2n) < ∞

⇒

n ≥ n∗ .

(4.77) 2

r(m)ϕ (m)

< ∞.

(4.78)

Let ξ = (ξ (m)), By part (a) of Lemma 9, 2 ξ (m)ϕ (m) < ∞

ξ (2m) = ξ(m),

⇒

∃C > 0 :

ξ 2m+1 = 0.

ξ = |ξ |k ≤ CMk (ϕ1 ),

where ϕ1 (t) = ϕ(t) − t and (Mk (ϕ1 )) is the Carleman sequence generated by ϕ1 (see the text after (3.4) prior to Lemma 8). By Proposition 20 there exists C˜ > 0 such that ξ k = |ξ |k ≤ CMk (ϕ1 )

⇒

˜ k (ϕ1 ) rk ≤ CM

k = 0, 1, 2, . . . .

On the other hand, the part (b) of Lemma 8 yields 2 ˜ k (ϕ1 ) ∀k ⇒ rk ≤ CM r(m)ϕˆ (m) < ∞, with ϕ(t) ˆ = ϕ1 (t) − 4t = ϕ(t) − 5t.

(4.79)

178

P. Djakov, B. Mityagin

Consider the sequences rˆ = r(k)ϕˆ (k) ,

ξˆ = ξ(k)ϕˆ (2k) .

By multiplying (4.77) by ϕˆ (2n), we obtain, by Lemma 15, that rˆ (2n) ≤ ξˆ (n) + σ (n, rˆ ).

(4.80)

By (4.79), we have rˆ ∈ 2 (Z). Since ϕ (m) = ϕˆ (m) · |m|5 , the left side of (4.16) yields 2 2 ξˆ (k)|2m|5 = ξ(m)ϕ (2m) < ∞.

(4.81)

In view of (4.80) and (4.81), Proposition 19 can be applied to the sequences rˆ , ξˆ and the 2 weight (n) = |n|5 , so we have rˆ (m)|m|5 < ∞. Thus,

2

r(m)ϕ (m)

=

rˆ (m)|m|5

2

< ∞.

This completes the proof of Theorem 14. Therefore, Theorem 11 has been proven as well.

5. Conclusions and Comments 1. Theorems 12, 14 and our main result, Theorem 11, are formulated and proven in terms of weight sequences . This could be a slight obstruction in understanding these statements as results about classes of smooth (or infinitely differentiable) functions. Since these classes are basic for many analytic problems, let us write a few examples of weight sequences [satisfying the hypotheses of Theorem 11], and the corresponding corollaries of Theorem 11. Observe, that the weights in Examples 1-3 below are subexponential, i.e., lim (log (n)) /n = 0 (compare to (110) in [5]). Example 1. Consider the slowly increasing weight (n) = (1 + |n|)α , α > 0. Then H () = H α is a Sobolev space. Corollary 21. Let L be a self-adjoint Dirac operator given by (1.1). Then λ+ − λ− 2 (1 + n2 )α < ∞ n n Z

if and only if P ∈ H α . Of course, Corollary 21 explains Statement (A) of Sect. 1.

1D Dirac Operators

179

Example 2. Let (n) = (1 + |n|)α exp a|n|b ,

α ∈ R, b ∈ (0, 1), a > 0.

(5.1)

These weights are more general than (3.9). Let us denote by G(b, a; α) the corresponding Gevrey space H (). We rewrite as (n) = exp h(|n|),

h(t) = at b + α log(1 + t), t > 0,

or (n) = exp (ϕ(log(|n|)) ,

where ϕ(τ ) = h(eτ ) = aebτ + α log 1 + eτ , τ > 0.

Then −1 ϕ (τ ) = abebτ + α 1 + e−τ , and the conditions (3.10) and (3.11) hold. In addition, ϕ (τ ) − ϕ (τ ) ab(1 − b)ebτ ∞, log ϕ (τ ) bτ so (3.14) holds also. These elementary calculus exercises show that ∈ (5.1) is a rapidly increasing weight with the properties (3.10), (3.11) and (3.14). Therefore, Theorem 11 gives the following. Corollary 22. Let L be as in Corollary 21, and let α ∈ R, a > 0, b ∈ (0, 1). Then λ+ − λ− 2 (1 + n2 )α exp 2a|n|b < ∞ n

n

Z

if and only if P is a function in the Gevrey class G(b, a; α). Corollary 22 explains Statement (C) of Sect. 1. Example 3. Let (n) = 1 + log(e + |n|)γ ,

γ > 0,

(5.2)

or (n) = (1 + |n|)α 1 + log(e + |n|)β ,

α > 0, β ∈ R.

(5.3)

Like in Example 1, this is a slowly increasing weight. We do not write explicitly the corresponding corollary of Theorem 11. But it is worth to mention that in the case (5.2) the space H (γ ) is N OT a subspace of any Sobolev space H α , α > 0, although, vice versa, H α ⊂ H (γ ) for any α, γ > 0.

180

P. Djakov, B. Mityagin

Example 4. Let (n) = a (n) = exp(a|n|),

a > 0.

(5.4)

In this case the functions P ∈ H () are analytic in the strip {z = x+iy | |y| < a/(2π )}. If P (x), P (x + 1) = P (x) is analytic in the strip |y| < b, then P ∈ H (α ) for any a < 2π b. We could be more specific and talk about the boundary values of H (α )functions being in L2 (I ±),

I ± = {x + iy | 0 ≤ x ≤ 1, y = ±a/(2π )}.

But (n) = ea|n| is not a subexponential weight: lim (log (n)) /n = a > 0.

n→∞

(5.5)

Still, the hypotheses of Theorem 12 hold and Theorem 12 implies: Corollary 23. If P ∈ H (a ) then λ+ − λ− 2 e2a|n| < ∞. n n Z

In a weaker form, if P (x), P (x + 1) = P (x) is real-valued on R and analytic in the strip |y| < b/(2π ) then λ+ − λ− 2 e2a|n| < ∞ (5.6) n n Z

for any a, ; 0 < a < b. However, with (5.5), a does not satisfy the hypotheses of Theorem 11 and we cannot use it immediately. Still, we can modify our constructions from Sects. 3 and 4 to prove an analogue of Theorem 11 in the case of weights (5.4). Proposition 24. If P (x) = p(k)e2πikx is an L2 -potential function, and the sequence − of spectral gaps (γn ) = (λ+ n − λn ) of Dirac operator (1.1) satisfies (5.6), then P (x) can be extended as an analytic function in the strip {x + iy | |y| < A} with some A > 0. The constant A cannot be chosen as a function of a; it depends on the norm P 2 as well. We’ll give all technical details and necessary adjustments of the constructions of Sects. 3 and 4 in [9]. We have done such analysis and adjustments of our constructions in Sect. 5.4, Prop. 15, in [5], in the case of Schr¨odinger operators to give an alternative proof of E. Trubowitz’ result [30]. Now, in the case of Dirac operators, we have Statement (B) of Sect. 1, as a corollary of Corollary 23 and Proposition 24. 2. If L is a Dirac operator of the form (2.4), not necessarily self-adjoint, then the left − side inequality in (2.58), Theorem 5, does not hold. But in any case, |λ+ n − λn | could be estimated from above if we use the basic equation (2.13) and Lemma 4. More precisely, the following is true.

1D Dirac Operators

181

Lemma 25. If L is a Dirac operator of the form (2.4), then there exists n∗ > 0 such that 12 21 − |λ+ − λ | ≤ 2 max (n, z) + 2 max (n, z) (5.7) S S , |n| ≥ n∗ . n n |z|≤π/2

|z|≤π/2

Proof. Indeed, with αn (z) = S 11 (n, z) = S 22 (n, z) and ζ = z − αn (z),

(5.8)

ζ 2 = S 11 (n, z)S 22 (n, z).

(5.9)

Eqn. (2.13) becomes

By Lemma 4, there exists n∗ > N0 , where N0 is the constant from Lemma 1, such that dαn (z) 1 dz ≤ 2 for |z| ≤ π/2, |n| ≥ n∗ . Thus, (5.8) defines a holomorphic mapping ζ (z) = z − αn (z) in the disc |z| < π/2 such that 1/2 ≤ |dζ /dz| ≤ 3/2. From here, it follows that 1 + |z − zn− | ≤ |ζ (zn+ ) − ζ (zn− )| ≤ 2|zn+ − zn− |, 2 n where, in view of Lemma 1, |zn± | < π/2 for |n| ≥ n∗ . So, by taking into account that − + − |λ+ n − λn | = |zn − zn |, we have 1 + + − + − |λ − λ− n | ≤ |ζn − ζn | ≤ 2|λn − λn |, 2 n where ζn+ = ζ (zn+ ) and ζn− = ζ (zn− ). On the other hand, (5.9) implies that 1/2 ± 12 ζ = S (n, z± )S 21 (n, z± ) ≤ 1 S 12 (n, z± ) + 1 S 21 (n, z± ) . n n n n n 2 2 Therefore, |ζn+ − ζn− | ≤ |ζn+ | + |ζn− | ≤ max S 12 (n, z) + max S 21 (n, z) ; |z|≤π/2

hence, (5.7) holds.

|z|≤π/2

Theorem 26. Let L be a Dirac operator of the form (2.4) with potential functions P (x) = p(2n)e2π inx and Q(x) = q(2n)e2πinx . If is a submultiplicative weight, then − 2 2 |p(2n)|2 + |q(2n)|2 ((2n))2 < ∞ ⇒ |λ+ n − λn | ((2n)) < ∞. (5.10)

182

P. Djakov, B. Mityagin

Proof. Set r(m) = max(|p(−m)|, |p(m)|, |q(−m)|, |q(m)|).

(5.11)

In view of (2.36)–(2.39), we obtain, by (2.25), that max |S 12 (n, z)| ≤ σ (n, r),

|z|≤π/2

max |S 21 (n, z)| ≤ σ (n, r),

|z|≤π/2

(5.12)

where r = (r(m)) and σ (n, r) is defined by (4.7)–(4.9). Now, the claim follows from Proposition 13. Under some rigid assumptions on , which, a for example, exclude such weights as (m) = exp(a|m|), or (k) = log(e + |k|) , a > 0, the claim (5.10) can be found in [15] or [16]. 3. The present paper deals only with the case of subexponential growth of the weight , i.e., (m) ≤ ea|m| , a > 0. The case of superexponential weights could be analyzed as well. We will present this analysis elsewhere. For Hill–Schr¨odinger operators see such analysis in [6]. References 1. Ablowitz, M. A., Segur, H.: Solitons and the inverse scattering transform. Philadelphia: SIAM, 1981 2. Costin, O., Kruskal, M.: Optimal uniform estimates and rigorous asymptotics beyond all orders for a class of ordinary differential equations. Proc. Roy. Soc. London Ser. A 452, 1057–1085 (1996) 3. Djakov, P., Mityagin, B.: Smoothness of solutions of a nonlinear ODE. Integral Equations and Operator Theory 44, 149–171 (2002) 4. Djakov, P., Mityagin, B.: Smoothness of solutions of nonlinear ODE’s. Math. Ann. 324, 225–254 (2002) 5. Djakov, P., Mityagin, B.: Smoothness of Schr¨odinger operator potential in the case of Gevrey type asymptotics of the gaps. J. Funct. Anal. 195, 89–128 (2002) 6. Djakov, P., Mityagin, B.: Spectral gaps of the periodic Schr¨odinger operator when its potential is an entire function, Adv. in Appl. Math. 31(3), 562–596 (2003) 7. Djakov, P., Mityagin, B.: Spectral triangles of Schr¨odinger operators with complex potentials. Selecta Mathematica 9, 495–528 (2003) 8. Djakov, P., Mityagin, B.: Spectra of 1D periodic Dirac operators and smoothness of potentials. Math. Reports Acad. Sci. Royal Soc. Canada 25, 121–125 (2003) 9. Djakov, P., Mityagin, B.: Uspehi Mat. Nauk, in preparation 10. Dubrovin, B. A.: The inverse problem of scattering theory for periodic finite-zone potentials. Funktsional. Anal. i Prilozhen. 9, 65–66 (1975) 11. Dubrovin, B. A., Krichever, I.M., Novikov, S.P.: Integrable systems I. In: Encycl. of Math. Sci., Dynamical systems IV, Arnold, V. I., Novikov, S.P. (eds.), Berlin-Heidelberg-NewYork: Springer, 1990, pp. 173–283 12. Faddeev, L. D., Takhtajan, L. A.: Hamiltonian methods in the theory of solitons. Berlin, New York: Springer-Verlag, 1987 13. Gelfand, I. M., Levitan, B. M.: On a simple identity for the eigenvalues of a second order differential operator. Dokl. Akad. Nauk SSSR 88, 593–596 (1953) (Russian) 14. Goldberg, W.: On the determination of a Hill’s equation from its spectrum. Bull. Amer. Math. Soc. 80, 1111–1112 (1974) 15. Gr´ebert, B., Kappeler, T., Mityagin, B.: Gap estimates of the spectrum of the Zakharov-Shabat system. Appl. Math. Lett. 11, 95–97 (1998) 16. Gr´ebert, B., Kappeler, T.: Estimates on periodic and Dirichlet eigenvalues for the Zakharov-Shabat system. Asymptotic Analysis 25, 201–237 (2001) 17. Hochstadt, H.: Estimates on the stability intervals for the Hill’s equation. Proc. Amer. Math. Soc. 14, 930–932 (1963) 18. Hochstadt, H.: On the determination of a Hill’s equation from its spectrum. Arch. Ration. Mech. Anal. 19, 353–362 (1965)

1D Dirac Operators

183

19. Kappeler, T., Mityagin, B.: Gap estimates of the spectrum of Hill’s Equation and Action Variables for KdV. Trans. AMS 351, 619–646 (1999) 20. Kappeler, T., Mityagin, B.: Estimates for periodic and Dirichlet eigenvalues of the Schr¨odinger operator. SIAM J. Math. Anal. 33, 113–152 (2001) 21. Levitan, B. M., Sargsian.: “Introduction to spectral theory; Self-adjoint ordinary differential operators”. Translation of Mathematics Monographs, Vol. 39, Providence, RI: AMS, 1975 22. Li, Y., McLaughlin, D.: Morse and Melnikov functions for NLS PDEs. Commun. Math. Phys. 162, 175–214 (1994) 23. Marchenko, V. A.: Sturm-Liouville operators and applications. Oper. Theory Adv. Appl., Vol. 22, Basel-Boston: Birkh¨auser, 1986 24. McKean, H., Trubowitz, E.: Hill’s operator and hyperelliptic function theory in the presence of infinitely many branch points. Comm. Pure Appl. Math. 29, 143–226 (1976) 25. Misyura, T.: Properties of the spectra of periodic and antiperiodic boundary value problems generated Dirac operators I, II (in Russian). Teor. Funktsii Funktsional. Anal. i Prilozhen. 30, 90–101 (1978); 31, 102–109 (1979) 26. Mityagin, B.: Convergence of expansions in the eigenfunctions of the Dirac operator, (Russian), Dokl. Acad. Nauk 393, 456–459 (2003). [English transl.: Doklady Math. 68, 388–391 (2003)] 27. Mityagin, B.: Spectral expansions of one-dimensional periodic Dirac operator. Dynamics of PDE 1, 125–191 (2004) 28. Novikov, S.P.: The periodic problem for Korteweg-De Vries equation. Funktsional. Anal. i Prilozhen. 8:3 , 54–66 (1974). English transl., Functional Analysis and its applications, 8, 236–246 January 1975 29. Tkachenko, V.: Non-selfadjoint periodic Dirac operators, Operator Theory; Advances and Applications, Vol. 123, Basel: Birkh¨auser Verlag, 2001, pp. 485–512 30. Trubowitz, E.: The inverse problem for periodic potentials. CPAM 30, 321–342 (1977) 31. Zakharov, V., Manakov, S., Novikov, S., Pitayevskii, L. : Theory of solitons; the inverse scattering method. New York: Consultants Bureau, 1984 32. Zakharov, V., Shabat, A.: Exact theory of two-dimensional self-focusing and one-dimensional selfmodulation of waves in non-linear media. Sov. Phys. JETP 34, 62–69 (1971) 33. Zakharov, V., Shabat, A.: A scheme of integrating the non-linear equations of mathematical physics by the method of inverse scattering problem I. J. Funct. Anal. Appl. 8, 226–235 (1974) 34. Zakharov, V., Shabat, A.: Integration of the non-linear equations of mathematical physics by the method of inverse scattering problem II. J. Funct. Anal. Appl. 13, 166–174 (1979) Communicated by B. Simon

Commun. Math. Phys. 259, 185–221 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1369-7

Communications in

Mathematical Physics

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap A. Elgart1 , G.M. Graf 2 , J.H. Schenker2 1 2

Department of Mathematics, Stanford University, Stanford, CA 94305-2125, USA Theoretische Physik, ETH Z¨urich, 8093 Z¨urich, Switzerland

Received: 8 September 2004 / Accepted: 7 February 2005 Published online: 21 June 2005 – © Springer-Verlag 2005

Abstract: We consider the edge and bulk conductances for 2D quantum Hall systems in which the Fermi energy falls in a band where bulk states are localized. We show that the resulting quantities are equal, when appropriately defined. An appropriate definition of the edge conductance may be obtained through a suitable time averaging procedure or by including a contribution from states in the localized band. In a further result on the Harper Hamiltonian, we show that this contribution is essential. In an appendix we establish quantized plateaus for the conductance of systems which need not be translation ergodic. 1. Introduction Two conductances, σB and σE , are associated to the Quantum Hall Effect (QHE), depending on whether the currents are ascribed to the bulk or to the edge. The equality σB = σE , suggested by Halperin’s analysis [17] of the Laughlin argument [21], has been established in the context of an effective field theory description [14]. It was later derived in a microscopic treatment of the integral QHE [32, 12, 24] for the case that the Fermi energy lies in a spectral gap of the single-particle Hamiltonian HB . We prove this equality, by quite different means, in the more general setting that HB exhibits Anderson localization in – more precisely, dynamical localization (see (1.2) below). The result applies to Schr¨odinger operators which are random, but does not depend on that property. We therefore formulate the result for deterministic operators. The relation to recent work [7] will be discussed below. The bulk is represented by the lattice Z2 x = (x1 , x2 ) with Hamiltonian HB = HB∗ on 2 (Z2 ). We assume its matrix elements HB (x, x ), x, x ∈ Z2 , to be of short range in the sense that HB (x, x ) (eµ|x−x | − 1) =: C1 < ∞ (1.1) sup x∈Z2 x ∈Z2

186

A. Elgart, G.M. Graf, J.H. Schenker

for some µ > 0, where |x| = |x1 | + |x2 |. Our hypothesis on the bounded open interval ⊂ R is that for some ν ≥ 0, g(HB )(x, x ) (1 + |x|)−ν eµ|x−x | =: C2 < ∞, sup (1.2) g∈B1 ()

x,x ∈Z2

where B1 () denotes the set of Borel measurable functions g which are constant in {λ|λ < } and in {λ|λ > } with |g(x)| ≤ 1 for every x. In particular C2 is a bound when g is of the form gt (λ) = e−itλ E (λ) and the supremum is over t ∈ R, which is a statement of dynamical localization. By the RAGE theorem this implies that the spectrum of HB is pure point in (see [20] or [10, Theorem 9.21] for details). We denote the corresponding eigen-projections by E{λ} (HB ) for λ ∈ E , the set of eigenvalues λ ∈ . We assume that no eigenvalue in E is infinitely degenerate, dim E{λ} (HB ) < ∞ ,

λ ∈ E .

(1.3)

The validity of these assumptions is discussed below (but see also [2, 4]). The zero temperature bulk Hall conductance at Fermi energy λ is defined by the Kubo-Stˇreda formula [5] σB (λ) = −i tr Pλ [ [Pλ , 1 ] , [Pλ , 2 ] ] ,

(1.4)

where Pλ = E(−∞,λ) (HB ) and i (x) is the characteristic function of x = (x1 , x2 ) ∈ Z2 | xi < 0 . Under the above assumptions σB (λ) is well-defined for λ ∈ , but independent thereof, i.e., it shows a plateau. (This result, first proved in [6], is strengthened here in an appendix, since we do not assume translation covariance or ergodicity of the Schr¨odinger operator. We also show the integrality of 2πσB therein, though it is not needed in the sequel.) We remark that (1.3) is essential for a plateau: for the Landau Hamiltonian (though defined on the continuum rather than on the lattice) Eqs. (1.1, 1.2) hold if properly interpreted, but (1.3) fails in an interval containing a Landau level, where indeed σB (λ) jumps. The sample with an edge is modeled as a half-plane Z × Za , where Za = {n ∈ Z | n ≥ −a}, with the height −a of the edge eventually tending to −∞. The Hamiltonian Ha = Ha∗ on 2 (Z × Za ) is obtained by restriction of HB under some largely arbitrary boundary condition. More precisely, we assume that Ea = Ja Ha − HB Ja : 2 (Z × Za ) → 2 (Z2 ) satisfies sup

x∈Z2 x ∈Z×Z

Ea (x, x ) eµ(|x2 +a|+|x1 −x1 |) ≤ C3 < ∞ ,

(1.5)

(1.6)

a

where Ja : 2 (Z × Za ) → 2 (Z2 ) denotes extension by 0. For instance with Dirichlet boundary conditions, Ha = Ja∗ HB Ja , we have Ea = (Ja Ja∗ − 1)HB Ja , i.e., −HB (x, x ) , x2 < −a , Ea (x, x ) = 0, x2 ≥ −a ,

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

187

whence (1.6) follows from (1.1). We remark that Eq. (1.1) is inherited by Ha with a constant C1 that is uniform in a, but not so for Eq. (1.2) as a rule. The definition of the edge Hall conductance requires some preparation. The current operator across the line x1 = 0 is −i [Ha , 1 ]. Matters are simpler if we temporarily assume that is a gap for HB , i.e., if σ (HB ) ∩ = ∅, in which case one may set [32] σE := −i tr ρ (Ha ) [Ha , 1 ] , where ρ ∈ C ∞ (R) satisfies

(1.7)

ρ(λ) =

1, λ < , 0, λ > .

(1.8)

The heuristic motivation for (1.7) is as follows. We interpret ρ(Ha ) as the 1-particle density matrix of a stationary quantum state. Though some current is flowing near the edge we should discard it, as it is supposed to be canceled by current flowing at an opposite edge located at x2 = +∞. If the chemical potential is now lowered by δ at the first edge, but not at the second, a net current δ I = −i tr ((ρ(Ha + δ) − ρ(Ha )) [Ha , 1 ]) = −i dt tr ρ (Ha − t) [Ha , 1 ] 0

is flowing. Since σE is independent of ρ as long as it conforms with (1.8), see [32] and Theorem 1 below, it is indeed the conductance σE = I /δ for sufficiently small δ. The operator in (1.7) is trace class essentially because i [H, 1 ] is relevant only on (single-particle) states near x1 = 0, and ρ (Ha ) only near the edge x2 = −a, so that the intersection of the two strips is compact. In the situation (1.2) considered in this paper the operator appearing in (1.7) is not trace class, since the bulk operator may have spectrum in , which can cause the above stated property to fail for ρ (Ha ). In search of a proper definition of σE , we consider only the current flowing across the line x1 = 0 within a finite window −a ≤ x2 < 0 next to the edge. This amounts to modifying the current operator to be i i − (2 [Ha , 1 ] + [Ha , 1 ] 2 ) = − { [Ha , 1 ] , 2 } , 2 2

(1.9)

with which one may be tempted to use i lim − tr ρ (Ha ) { [Ha , 1 ] , 2 } 2

a→∞

(1.10)

as a definition for σE . Though we show that this limit exists, it is not the physically correct choice. We may in fact expect that the dynamics of e−itHa acting on states supported far away from the edge resembles for quite some time the dynamics generated by HB . Being bound states or, more likely, resonances, such states may carry persistent currents (whence the operator in (1.7) is not trace class), but no or little net current across the line x1 = 0. This cancelation is the rationale for ignoring the part x2 ≥ 0 of the line x1 = 0 by means of the cutoff 2 in (1.9), however the cancelation is not achieved on states located near the end point x = (0, 0). In the limit a → ∞ we pretend these states are bound, which yields the contribution missed by (1.10): i − (ψλ , { [HB , 1 ] , 1 − 2 } ψλ ) = Im (ψλ , 1 HB 2 ψλ ) , 2

(1.11)

188

A. Elgart, G.M. Graf, J.H. Schenker

from each bound state ψλ of HB , with corresponding energy λ ∈ E . We incorporate them with weight ρ (λ) in our definition of the edge conductance: i (1) σE := lim − tr ρ (Ha ) { [Ha , 1 ] , 2 } a→∞ 2 + ρ (λ) Im tr E{λ} 1 HB 2 E{λ} .

(1.12)

λ∈E

We will show that the sum on the r.h.s. is absolutely convergent, and its physical meaning will be further discussed at the end of the introduction. We will also show it to be nonzero on average for the Harper Hamiltonian with an i.i.d. random potential in Theorem 3. The terms of this sum involve HB , though the few states for which they are sizeable are supported near x = (0, 0) and hence far from the edge x2 = −a. Since the mere appearance of HB in the definition of an edge property may be objectionable, we present an alternative. The basic fact that the net current of a bound state is zero, −i (ψλ , [HB , 1 ] ψλ ) = 0 ,

(1.13)

can be preserved by the regularization provided the spatial cutoff 2 is time averaged. In fact, let 1 T iHa t −iHa t AT ,a (X) = e Xe dt (1.14) T 0 be the time average over [0, T ] of a (bounded) operator X with respect to the Heisenberg evolution generated by Ha , with a finite or a = B. If a limit ∞ 2 = lim T →∞ AT ,B (2 ) were to exist, it would commute with HB so that −

i ψλ , [HB , 1 ] , ∞ 2 ψλ = 0 . 2

This motivates our second definition, (2)

σE

i := lim lim − tr ρ (Ha ) [Ha , 1 ] , AT ,a (2 ) . T →∞ a→∞ 2

(1.15)

The two definitions allow for the following result. Theorem 1. Under the assumptions (1.1, 1.2, 1.3, 1.6, 1.8) the sum in (1.12) is absolutely convergent, the limits there and in (1.15) exist, and (1)

σE

(2)

= σE

= σB .

In particular (1.12, 1.15) depend neither on the choice of ρ nor on that of Ea . Remark 1. i) The hypotheses (1.1, 1.2) hold almost surely for ergodic Schr¨odinger operators whose Green’s function G(x, x ; z) = (HB − z)−1 (x, x ) satisfies a moment condition [3] of the form s

sup lim sup E G(x, x ; E + iη) ≤ Ce−µ|x−x | (1.16) E∈

η→0

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

for some s < 1. The implication is through the dynamical localization bound E sup g(HB )(x, x ) ≤ Ce−µ|x−x | ,

189

(1.17)

g∈B1 ()

although (1.2) has also been obtained by different means, e.g., [16]. The implication (1.16) ⇒ (1.17) was proved in [1] (see also [2, 11, 4]). The bound (1.17) may be better known for supp g ⊂ , but is true as stated since it also holds [6, 2] for the projections g(HB ) = Pλ , Pλ⊥ = 1 − Pλ , (λ ∈ ). ii) Condition (1.3), in fact simple spectrum, follows from the arguments in [34], at least for operators with nearest neighbor hopping, HB (x, y) = 0 if |x − y| > 1. iii) When σ (HB ) ∩ = ∅, the operator appearing in (1.7) is known to be trace class. (1) (2) In this case, the conductance σE = σE defined here coincides with σE defined in (1.7). This statement follows from Theorem 1 and the known equality σE = σB [32, 12], but can also be seen directly. For completeness, we include a proof of this fact in Sect. 2 below. A point of view which combines both definitions of the edge conductance is expressed by the following result. Theorem 2. Under the assumptions of Theorem 1, i lim − tr ρ (Ha ) [Ha , 1 ] , 2;a (t) a→∞ 2 = σB + ρ (λ) Im tr E{λ} [HB , 1 ] eiHB t 2 e−iHB t E{λ} ,

(1.18)

λ∈E

with 2;a (t) = eiHa t 2 e−iHa t . (1)

In particular, this reduces to σE = σB for t = 0 by (1.11, 1.12). On the other hand, (2) σE = σB results, as we will show, from the time average of (1.18). A recent preprint [7] contains results which are topically related, to but substantially different from, those presented here. In that work, two contiguous media are modeled by positing a potential of the form U (x1 , x2 ) = V0 (x2 )χ (x2 < 0) + V (x1 , x2 )χ (x2 ≥ 0) (in our notation), where V0 is independent of x1 . The role of V is that of a bulk potential, and that of V0 as of a wall, provided it is large. The kinetic term is given by the Landau Hamiltonian on the continuum L2 (R2 ), whose unperturbed spectrum is the familiar set (2N+1)B, with B the magnitude of the constant magnetic field. A result is the following: if model (a), with V0 = 0, exhibits localization in ⊂ [(2N − 1)B, (2N + 1)B] for some positive integer N, and hence σE = 0, then model (b), with V0 (x2 ) ≥ (2N + 1)B, has 2π σE = N. The result is established by showing that the difference between 2π σE in cases (b) and (a) is independent of V , and equals N if V = 0, the two models then being solvable thanks to the translation invariance w.r.t. x1 . In comparison to our work, the following features may be noted: i) The localization assumption on the reference model (a) is made for a system which has itself an interface. (Our Eq. (1.2) concerns a bulk model serving as reference.) ii) The validity of that assumption is limited to small V , because the interface of (a) will otherwise produce extended edge states with energies in . The result σE = σB thus applies to perturbations of the free Landau Hamiltonian of size B. (Our comparison σE = σB does not require either side to be explicitly computable.)

190

A. Elgart, G.M. Graf, J.H. Schenker (1)

iii) The definition of σE for (b) depends on eigenstates in of (a), like our σE , but (2) not σE . A model without bulk potential, but allowing interactions between diluted particles, was studied from a related perspective in [25]. In (1.11, 1.12) we argued that the limit (1.10) is not identical to σB . To indeed prove this, we show that the sum on the right-hand side of (1.12) does not vanish for the Harper Hamiltonian with i.i.d. Cauchy randomness on the diagonal. The Harper Hamiltonian models the hopping of a tightly bound charged particle in a uniform magnetic field. The hopping terms H (x, x ) are zero except for nearest neighbor pairs, for which they are of modulus one, H (x, x ) = 0, |x − x | = 1 , (1.19) 1, |x − x | = 1 , where the non-zero matrix elements are interpreted as H (x, x ) = ei

x

x

A(y)· d1 y

,

with A the magnetic vector potential and the line integral computed along the bond connecting x, x . The magnetic flux through any region D ⊂ R2 is B(x)d2 x = A(y) · d1 y , D

∂D

so, for a uniform field, the flux is proportional to the area A(y) · d1 y = φ|D| . ∂D

Thus, we require that H (x (1) , x (4) )H (x (4) , x (3) )H (x (3) , x (2) )H (x (2) , x (1) ) = ei

∂P

A(y)·d1 y

= eiφ , (1.20)

where x (1) , x (2) , x (3) , x (4) are the vertices of a plaquette P , listed in counter-clockwise order, and φ is the flux through any plaquette. There are many choices of nearest neighbor hopping terms which satisfy (1.19) and (1.20), all interrelated by gauge transformations. For our purposes, it suffices to fix a gauge and take   x = x ± e1 , 1 , iφx 1 Hφ (x, x ) := e (1.21) , x = x + e2 ,  e−iφx1 , x = x − e , 2 with e1 = (1, 0) and e2 = (0, 1) the lattice generators. This choice of Hφ comes from representing the constant field B = φ via the vector potential A = φ(0, x1 ). We note that the bulk and edge Hall conductances are gauge invariant quantities, so Theorem 3 stated below holds for any other choice of Hφ . We refer the reader to ref. [26] and references therein for further discussion of the Harper Hamiltonian.

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

191

To guarantee localized spectrum, we consider a bulk Hamiltonian which consists of Hφ plus a diagonal random potential, HB = Hφ + αV , where V ψ(x) = V (x)ψ(x) and V (x), x ∈ Z2 are independent identically distributed Cauchy random variables. Here α is a coupling parameter (the “disorder strength”) and “Cauchy” signifies that the distribution of v = V (x) is 1 1 dv . π 1 + v2 We use Cauchy variables because it is possible to calculate certain quantities explicitly for such variables: E (f (v)) = f (i) for a function f having a bounded analytic continuation to the upper half plane. It is clear that HB is short range, i.e., (1.1) holds. For simplicity we consider Ha which are defined via non-random boundary conditions, i.e., the operators Ea appearing in (1.5) do not depend on the random couplings V (x). We then have the following result. Theorem 3. For HB , Ha as above, there is jB ∈ C ∞ such that i E − lim tr ρ (Ha ) { [Ha , 1 ] , 2 } = − ρ (λ)jB (λ)dλ , 2 a→∞

(1.22)

whenever ρ ∈ C0∞ (R). The expectation is well defined and may be interchanged with the limit. Furthermore, jB (λ) has the following asymptotic behavior jB (λ) = −

4|α| sin(φ)(cos(φ) + 1)λ−5 + O(λ−6 ) , π

|λ| → ∞ .

(1.23)

The result is relevant in relation to (1.12) since it has in fact been shown that (1.2) holds for HB at large energies: Theorem ([1]). There is E0 (α) such that (1.17) holds for HB and = ± with − = (−∞, −E0 (α)] and + = [E0 (α), ∞). Hence (1.2) holds almost surely. Remark 2. i) For any α = 0 the spectrum of HB is (almost surely) the entire real line, so the eigenvalues of HB in ± make up a (random) dense subset which we denote E± . In fact, this pure point spectrum is almost surely simple, as can be shown using the methods in [34]. ii) For sufficiently large α we have E0 (α) = 0, i.e., the spectrum is completely localized. iii) Localization also holds inside the spectral gaps of Hφ , for small α, via the methods in [1, 4]. The mentioned result implies σB (λ) = 0 for λ ∈ ± , because σB is insensitive to λ in that range and Pλ → 1 or 0 as λ → ∞ or −∞, respectively. Thus for ρ as in (1.8) (1) with supp ρ ⊂ ± we have σE = 0 by Theorem 1. On the other hand, for the first term on the r.h.s. of (1.12), JB (ρ), we have by Theorem 3, 4|α| (1.24) sin(φ)(cos(φ) + 1) ρ (λ)λ−5 dλ + O λ−6 E (JB (ρ)) = 0 π |λ|≥E0 (α) as λ0 = inf |λ||λ ∈ supp ρ → ∞. Clearly the right-hand side can be non-zero for appropriately chosen ρ, and the same then holds for the expectation of the last term in (1.12).

192

A. Elgart, G.M. Graf, J.H. Schenker

The definitions (1.12, 1.15) may be related, heuristically, to concepts from classical electro-magnetism of material media [31]. There the macroscopic (or average) current is split as jf + ∂ P /∂t + rot M into free, polarization, and magnetization currents. (The magnetization M is a scalar in two dimensions.) The distinction depends on the existence of units (free electrons, atoms, molecules, ...) each with conserved charge, whose current densities are effectively of the form ∂ j( x , t) = q r˙ (t)δ( x − r(t)) + δ( x − r(t))p(t) + rot (δ( x − r(t))m(t)) , (1.25) ∂t where q, p(t), m(t) are the unit’s charge and electric/magnetic moments respectively. The macroscopic quantities emerge as a weak limit of the microscopic ones     x , t)   qk r˙ k (t)   jf ( δ( x − rk (t)) , p (t) P ( x , t)  k    m (t) k M( x , t) k or more precisely after integration against compactly supported test functions which vary slowly over the interatomic distance. The microscopic current across the portion x2 ≤ 0 of the line x1 = 0 is then d2 x (x1 )(x2 )jk,1 ( x , t) I =− k

=−

+

∂ P1 x , t) + d x (x1 )(x2 ) jf,1 ( ( x , t) ∂t 2

d2 x (x1 ) (x2 )M( x , t) .

(1.26)

The derivation assumes that is smooth over interatomic distances. The last term in x − rk (t))mk (t). It (1.26) comes from the corresponding term in (1.25), which is ∂2 δ( cannot be replaced by adding (rotM)1 = ∂2 M within the square brackets, which would correspond to the macroscopic current. In fact, it differs from that by a boundary term, which would vanish if (x2 ) were compactly supported. Let now the macroscopic fields be stationary and slowly varying on the scale of . In the QHE we expect that the (free) edge currents are located near the edge, so that (1.26) becomes dx2 jf,1 ( x ) + M(0) . I = x1 =0

When M(0) is subtracted from the l.h.s., we obtain an expression for the edge current, which is the role of the second term in (1.12). In this analogy the definition (1.15) corresponds to replacing (x2 ) in the first line of (1.26) by (e2 · rk,T ), where rk,T is the time average of rk (t). Then the last term no longer arises. The above discussion neglects the weighting ρ (λ) of energies in (1.12). This will be remedied in the following heuristic argument in support of σB = σE . In a finite sample of volume V the Stˇreda relation [35] asserts ∂N ∼ = σB V , ∂φ

(1.27)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

193

where N is the total charge of carriers, i.e., N = tr ρ(HV ) in the situation considered here. For the total magnetization M we have −

∂HV ∂M = tr ρ (HV ) , ∂µ ∂φ

(1.28)

where µ is the chemical potential, as can be seen from the Maxwell relation [15] −

∂N ∂M = . ∂µ ∂φ

(1.29)

To compute ∂HV /∂φ we use a gauge equivalent to (1.21), with trivial phases along bonds in direction e2 , and obtain for (1.28), −

1 i 1 ∂M = tr ρ (HV ) { [HV , X1 ] , X2 } . V 2 V ∂µ (1)

By (1.27, 1.29) this quantity is formally σB . To relate it to σE it should be noted that the total magnetization is not the integral of the bulk magnetization, even in the thermodynamic limit. For instance, for classical, spinless particles M vanishes [22], but consists [27] of a diamagnetic, bulk contribution and an opposite contribution from states close to the edge. These two contributions (in reverse order) may be identified in the quantum mechanical context with the two terms of (1.12). In this example, the expected edge term is negative for φ > 0. This should also emerge from (1.24) when sup supp ρ → −∞, and it does if one also takes into account that −H is the counterpart to the continuum Hamiltonian. In Sect. 2 we will present the main steps in the proof of Theorems 1 and 2, with details supplied in Sect. 3. The proof of Theorem 3 will be given in Sect. 4. The appendix is about properties of σB . 2. Outline of the Proof A reasonable first step is to make sure that the traces in (1.12, 1.15) are well-defined. We will show this for σE (a, t) := −i tr ρ (Ha ) [Ha , 1 ] 2;a (t) ,

(2.1)

with 2;a (t) = eitHa 2 e−itHa , by proving that i [Ha , 1 ] 2;a (t) ∈ I1 in Lemma 5. Here, I1 denotes the ideal of trace class operators, and we denote the trace norm by ·1 . Then σE (a, t) = −i tr 2;a (t) [Ha , 1 ] ρ (Ha ) = −i tr ρ (Ha )2;a (t) [Ha , 1 ] , where we used that tr AB = tr BA if AB , BA ∈ I1 , e.g., [33, Corollary 3.8]. The definition (1.15) then reads 1 T (2) σE = lim lim dt Re σE (a, t) . T →∞ a→∞ T 0

(2.2)

(2.3)

194

A. Elgart, G.M. Graf, J.H. Schenker

By the argument given in the Introduction, the trace norm of the operator in (2.1) diverges as a → ∞. To see that its trace nevertheless converges we subtract from it an operator Z(a, t) ∈ I1 , to be specified below, with tr Z(a, t) = 0, implying

σE (a, t) = −i tr ρ (Ha ) [Ha , 1 ] 2;a (t) − Z(a, t) . (2.4) The idea, of course, is to choose Z(a, t) so that sup ρ (Ha ) [Ha , 1 ] 2;a (t) − Z(a, t)1 < ∞ .

(2.5)

a

An operator of zero trace is [ρ(Ha ), 1 ] 2 ; it is trace class (see Lemma 5) and its trace, computed in the position basis, is seen to vanish. Though it does not quite suffice for (2.5), we consider it since [ρ(Ha ), 1 ] and ρ (Ha ) [Ha , 1 ] are closely related: From the Helffer-Sj¨ostrand representations (see Sect. 3 for details) 1 ρ(Ha ) = (2.6a) dm(z)∂z¯ ρ(z)R(z), 2π 1 ρ (Ha ) = − dm(z)∂z¯ ρ(z)R(z)2 , (2.6b) 2π with R(z) = (Ha − z)−1 , we obtain

1 dm(z)∂z¯ ρ(z)R(z) [Ha , 1 ] R(z), 2π 1 ρ (Ha ) [Ha , 1 ] = − dm(z)∂z¯ ρ(z)R(z)2 [Ha , 1 ] . 2π [ρ(Ha ), 1 ] = −

(2.7a) (2.7b)

The two expressions, multiplied from the right by 2 , respectively by 2;a (t) as in (2.1), would have an even more similar structure if in the second a resolvent could be moved to the right. This can be achieved under the trace by setting Z(a, t) = [ρ(Ha ), 1 ] 2

1 − dm(z)∂z¯ ρ(z)R(z) R(z) [Ha , 1 ] 2;a (t)−[Ha , 1 ] 2;a (t)R(z) , 2π (2.8) for which tr Z(a, t) = 0. Then (2.4) reads σE (a, t) = tr a (t) with a i

ia (t) := − [ρ(Ha ), 1 ] 2 1 + − dm(z)∂z¯ ρ(z)R(z) [Ha , 1 ] 2;a (t)R(z) 2π

(2.9)

a (t) i

ia (t)

(2.10) = [ρ(Ha ), 1 ] 2;a (t) − 2 " ! 1 + − dm(z)∂z¯ ρ(z)R(z) [Ha , 1 ] R(z) Ha , 2;a (t) R(z), 2π ia (t)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

195

where, to obtain! the last expression, (2.7a) " ! multiplied "by 2;a (t) has been added and subtracted, and R(z), 2;a (t) = −R(z) Ha , 2;a (t) R(z) has been used. We remark that equality of (2.9) and (2.10) also holds for HB , i.e., if we replace Ha by HB and 2;a (t) by 2;B (t) = eiHB t 2 e−iHB t , and set R(z) = (HB − z)−1 . We will show σE (a, t) = tr a (t) −−−→ tr B (t) a→∞

(2.11)

and, incidentally, (2.5) by establishing: Lemma 1. Under assumptions (1.1, 1.6), but without making use of (1.2, 1.3, 1.8), we have for ρ ∈ C0∞ (R), Ja (t)J ∗ − (t) −−−→ 0 , (2.12) a a B 1 a→∞ Ja (t)J ∗ − (t) −−−→ 0, (2.13) a a B 1 a→∞

uniformly for t in a compact interval. ∗ on 2 (Z × Note that the replacement A → Ja AJ extends by zero an operator a simply 2 2 ∗ ∗ Za ) to one on (Z ). In particular Ja AJa 1 = A1 and tr Ja AJa = tr A. For the rest of this section we shall only be concerned with bulk quantities like tr B (t). By (2.1, 2.3, 2.11), the statements to be proven are ρ (λ) Im tr E{λ} [HB , 1 ] eiHB t 2 e−iHB t E{λ} Re tr B (t) = σB + λ∈E

for Theorem 2 and part of Theorem 1, and 1 T lim tr B (t)dt = σB T →∞ T 0

(2.14)

for the other part, where actually the real part of the l.h.s. would suffice. It may be noted (1) (2) that the ρ’s allowed by (1.8) form an affine space and that B (t), like σE , σE , is affine in ρ. The relation to σB will be made through the following decomposition, which exhibits the same property for this quantity. Lemma 2. Let ⊂ R be as in Theorem 1 and let E− , E+ be the spectral projections for HB onto {λ | λ < }, resp. {λ | λ > }. Then, for λ0 ∈ , ! " ! " σB (λ0 ) = i tr E− Pλ0 , 1 2 E− + i tr E+ Pλ0 , 1 2 E+ + i tr E Tλ0 E , (2.15) where Pλ0 = E(−∞,λ0 ) , Tλ0 = Pλ0 1 Pλ⊥0 2 Pλ0 − Pλ⊥0 1 Pλ0 2 Pλ⊥0 ,

(2.16)

and the traces are well defined. Moreover, the last term in (2.15) can be further decomposed as ! " i tr E{λ} Pλ0 , 1 2 E{λ} , (2.17) i tr E Tλ0 E = λ∈E

with absolutely convergent sum.

196

A. Elgart, G.M. Graf, J.H. Schenker

Since σB is independent of λ0 ∈ , (2.15) with the last term replaced by the r.h.s. of (2.17) also holds if Pλ0 is replaced by ρ satisfying (1.8), since ρ(HB ) = − dλ0 ρ (λ0 )Pλ0 . The proof of Lemma 2, which is given in Sect. 3, makes use of E{λ} , (2.18) 1 = E− + E+ + E , E = λ∈E

where the sum is strongly convergent. Using this decomposition on B (t) ∈ I1 we obtain

B (t) E− + tr E+ B (t) E+ B + B + tr B (t) = tr E− + tr E B (t)E .

(2.19)

Though the two contributions (2.9) to B (t) are not separately trace class, they become E± also appear in (2.15), and E± (t)E± vanish so in (2.19). In fact, those of E± B B by integration by parts since E± R(z) and R(z)E± are analytic on the support of ρ(z) or of ρ(z) − 1. We thus find that tr B (t) = σB + i dλ0 ρ (λ0 ) tr E Tλ0 E + tr E B (t)E . (2.20) At this point the analysis of the last term splits into two tracks with the purpose of (1) (2) showing σE = σB , resp. σE = σB . 2.1. Track 1. We decompose the projection E into its atoms as in (2.18), which by s Xn − → 0 , Y ∈ I1 ⇒ Xn Y 1 → 0 , Y Xn∗ 1 → 0 (2.21) yields a trace class norm convergent sum for E B (t)E . Thus

B (t) E{λ} . B + tr E{λ} tr E B (t)E = λ∈E

E{λ} are themselves trace class as they match those of Again, the contributions E{λ} B (2.17), canceling the second term of (2.20). We conclude that tr B (t) = σB = σB

i dm(z)∂z¯ ρ(z) tr E{λ} R(z) [HB , 1 ] 2;B (t)R(z)E{λ} + 2π λ∈E − i ρ (λ) tr E{λ} e−iHB t [HB , 1 ] eiHB t 2 E{λ} , (2.22) λ∈E

where we used that f (HB )E{λ} = f (λ)E{λ} . By its derivation this sum is absolutely (1) convergent for each t. This proves Thm. 2 and hence σE = σB . 2.2. Track 2. Here we do not decompose E , but use (2.10) whose two terms are separately trace class, tr E B (t)E = tr E B (t)E + tr E B (t)E .

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

197

Lemma 3. For ⊂ R as in Theorem 1 we have 1 T

T

0

tr E B (t)E dt −−−→ 0 , T →∞

(2.23)

and ! "

−i tr E Pλ0 , 1 AT ,B (2 ) − 2 E −−−→ i tr E Tλ0 E T →∞

(2.24)

for λ0 ∈ , the expression on the l.h.s. being uniformly bounded in λ0 ∈ , T > 0. By dominated convergence (2.24) implies 1 T

0

T

tr E B (t)E dt −−−→ −i T →∞

dλ0 ρ (λ0 ) tr E Tλ0 E . (2)

Together with (2.20, 2.23), this proves (2.14) and hence σE = σB . 2.3. Alternate Track 2. We now show that the last result can also be inferred from (2.22), at least if assumption (1.3) is strengthened to a uniform upper bound on the degeneracies: dim E{λ} (HB ) ≤ C4 < ∞ ,

λ ∈ E .

(2.25)

Then, the sum (2.22) is uniformly convergent in t ∈ R, as stated in Lemma 4. Assuming (1.1, 1.2, 2.25), we have sup tr E{λ} e−iHB t [HB , 1 ] eiHB t 2 E{λ} < ∞ . λ∈E t∈R

(2.26)

In order to prove (2.14), it suffices in view of (2.26) to show 1 lim T →∞ T

0

T

tr E{λ} e−iHB t [HB , 1 ] eiHB t 2 E{λ} = 0

(2.27)

for each λ ∈ E . Because tr E{λ} e−iHB t [HB , 1 ] eiHB t 2 E{λ} = i

d tr E{λ} e−iHB t 1 eiHB t 2 E{λ} , dt

(2.28)

the expression under the limit is just $ i # tr E{λ} e−iHB t 1 eiHB t 2 − tr E{λ} 1 2 . T

(2.29)

Since each term inside the square brackets is bounded by C4 < ∞, Eq. (2.27) follows. (2) This concludes the alternate proof of σE = σB .

198

A. Elgart, G.M. Graf, J.H. Schenker

2.4. Edge conductance in a spectral gap. We conclude this section by showing as mentioned above in the remark following Theorem 1 that −i tr ρ (Ha ) [Ha , 1 ] = σB if σ (HB ) ∩ = ∅. By translation invariance of σB , see Lemma 7 below, it suffices to show this for a = 0, in which case we drop the subscript a of the edge Hamiltonian. It s has been shown in (A.8) of [12] that ρ (H ) [H, 1 ] ∈ I1 . Since 2,a := 2 (· − a) − →1 as a → ∞, we have by (2.21), −i tr ρ (H ) [H, 1 ] = −i lim tr ρ (H ) [H, 1 ] 2,a a→∞ " ! = −i lim tr ρ (H a ) H a , 1 2 . a→∞

(2.30)

Here H a is the operator on 2 (Z × Za ) obtained from H by a shift (0, −a); it is not the restriction to Z × Za of a fixed Bulk Hamiltonian HB , as Ha was, but instead of an equally shifted one, HBa . The estimates (1.1, 1.6) therefore still apply, which is all that matters for (2.12, 2.13). The r.h.s. of (2.30) thus equals lima→∞ tr Ba (0), where Ba (t) pertains to HBa . Since the sum in (2.22) vanishes, tr Ba (t) = σBa , which is independent of a. 3. Details of the Proof We give some details about the Helffer-Sj¨ostrand representations (2.6). The integral is over z = x + iy ∈ C with measure dm(z) = dxdy, ∂z¯ = ∂x + i∂y , and ρ(z) is a quasianalytic extension of ρ(x) which, see [18], for given n can be chosen so that

dm(z) |∂z¯ ρ(z)| |y|−p−1 ≤ C

n+2 (k) ρ k=0

k−p−1

(3.1)

k for p = 1, ..., n, provided the appearing norms f k = dx(1 + x 2 ) 2 |f (x)| are finite. This is the case for ρ with ρ ∈ C0∞ (R). For p = 1 this shows that (2.6b) is norm convergent. The integral (2.6a), which would correspond to the case p = 0, is nevertheless a strongly convergent improper integral, see e.g., (A.12) of [12]. A further preliminary is the Combes-Thomas bound [8] δ(x) Ra (z)e−δ(x) ≤ e where δ can be chosen as

C , | Im z|

δ −1 = C 1 + | Im z|−1

(3.2)

(3.3)

for some (large) C > 0 and (x) is any Lipschitz function on Z2 with |(x) − (y)| ≤ |x − y| (see e.g. [2, Appendix D] for details).

(3.4)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

199

Lemma 5. We have [Ha , 1 ] 2;a (t) ∈ I1 ,

(3.5)

and for ρ ∈ C ∞ (R) with supp ρ compact also [ρ(Ha ), 1 ] 2;a (t) ∈ I1 .

(3.6)

In particular, Z(a, t) as given in (2.8) is trace class. Proof. We first prove the finite propagation speed estimate (see [13] and [23]): Let µ > 0 be as in (1.1). Then, for 0 ≤ δ ≤ µ and as (3.4), δ(x) iHa t −δ(x) e e (3.7) ≤ eC|t| e for some C < ∞. Indeed, let A(t) = eδ(x) eiHa t e−δ(x) , d −δ(x) −iHa t 2δ(x) iHa t −δ(x) d e e e e = A(t)∗ BA(t) , A(t)∗ A(t) = e dt dt ! " where B = −ie−δ(x) Ha , e2δ(x) e−δ(x) has matrix elements

iB(x, x ) = Ha (x, x )(eδ((x )−(x)) − eδ((x)−(x )) ). By (1.1) which, as remarked in the Introduction, is inherited by Ha , and by Holmgren’s bound B ≤ max sup |B(x, x )| , sup |B(x, x )| , (3.8) x

x

x

x

we have 2C := B < ∞ and hence A(t)2 = A(t)∗ A(t) ≤ e2C|t| . We factorize [Ha , 1 ] 2;a (t) = [Ha , 1 ] eδ|x1 | · e−δ|x1 | e−δ|x2 | · eδ|x2 | 2;a (t) , and note that

−δ|x1 | −δ|x2 | e ≤ Cδ −2 , e

(3.9)

1

since this is a summable function of (x1 , x2 ) ∈ Z2 . It is therefore enough for (3.5) to show (3.10) [Ha , 1 ] eδ|x1 | ≤ C , δ|x2 | 2;a (t) ≤ Ceδa (3.11) e for small δ, where the first estimate also holds for a = B. Indeed, the first operator has matrix elements

T (x, x ) = Ha (x, x )(1 (x ) − 1 (x))eδ|x1 | .

200

A. Elgart, G.M. Graf, J.H. Schenker

They vanish if |x1 − x1 | ≤ |x1 | since x1 ≥ 0 (resp. x1 < 0) then implies the same for x1 . Therefore T (x, x ) ≤ 2 Ha (x, x ) eδ|x1 −x1 | ≤ 2 Ha (x, x ) eδ|x−x | ,

which together with T (x, x) = 0 yields |T (x, x )| ≤ C|Ha (x, x )|(eδ|x−x | − 1). Now (3.10) follows from (1.1) and (3.8). The estimate for eδ|x2 | 2;a (t) = eδ|x2 | eiHa t e−δ|x2 | · eδ|x2 | 2 e−iHa t follows from (3.7) and from eδ|x2 | 2 = eδa < ∞. The proof of (3.6) is similar: Using (2.6) we write 1 dm(z)∂z¯ ρ(z) [Ra (z), 1 ] [ρ(Ha ), 1 ] = 2π

(3.12)

and claim that [Ra (z), 1 ] eδ|x1 | ≤

C | Im z|2

(3.13)

for δ = δ(z) as in (3.3). Together with (3.1, 3.9, 3.11) this implies (3.6). To derive (3.13), note that the operator to be bounded is −Ra (z)[Ha , 1 ]eδ|x1 | · e−δ|x1 | Ra (z)eδ|x1 | and the bound follows from (3.2, 3.10). The conclusion about Z(a, t) follows from (3.6) at t = 0 and (3.1, 3.5). 3.1. Proof of Lemma 1. It follows from (3.6) that a (t) is trace class. While (3.13) holds uniformly in a, including the bulk case, (3.11) fails in this respect. Nevertheless B (t) ∈ I1 , since (3.14) sup eδ|x2 | (2;a (t) − 2 ) ≤ C a,B

for t in a compact interval. In fact e

δ|x2 |

2;a (t) − 2 = e

δ|x2 |

t

eiHa s i [Ha , 2 ] e−iHa s ds

0

with

sup eδ|x2 | eiHa t [Ha , 2 ] e−iHa t ≤ C ,

(3.15)

a,B

because of (3.7) and of eδ|x2 | [Ha , 2 ] ≤ C, cf. (3.10). To prove (2.12) we use (3.12) and Ja∗ Ja = 1 to write 1 ∗ dm(z)∂z¯ ρ(z) Ja a (t)Ja = − 2π

×Ja [Ra (z), 1 ] eδ|x1 | Ja∗ · e−δ|x1 | e−δ|x2 | · Ja eδ|x2 | 2;a (t) − 2 Ja∗ .

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

201

It is enough to establish convergence to the bulk expression pointwise in z, since domination is provided by (3.13, 3.9, 3.14, 3.1). We thus may show s

Ja [Ra (z), 1 ] eδ|x1 | Ja∗ −−−→ [RB (z), 1 ] eδ|x1 | , a→∞

s Ja 2;a (t) − 2 Ja∗ eδ|x2 | −−−→ (2B (t) − 2 ) eδ|x2 | .

(3.16) (3.17)

a→∞

Since the l.h.s.’s are uniformly bounded in a by (3.13, 3.14) it suffices to prove convergence on the dense subspace of compactly supported states in 2 (Z×Z), which amounts to dropping eδ|xi | in (3.16, 3.17). Equation (1.5) implies the geometric resolvent identity Ja Ra (z) − RB (z)Ja = −RB (z)Ea Ra (z), and by taking the adjoint

s Ja Ra (z)Ja∗ − RB (z) = − Ja Ra (z)Ea∗ + 1 − Ja Ja∗ RB (z) −−−→ 0, a→∞

s

s

because Ea∗ −−−→ 0 by (1.6) and because 1 − Ja Ja∗ −−−→ 0 is the projection onto a→∞ a→∞ states supported in {x2 < −a}. This implies [30, Thm. VIII.20] s-lim Ja f (Ha )Ja∗ = f (HB )

(3.18)

a→∞

for any bounded continuous function f , and in particular the modified limits (3.16, 3.17). The proof of (2.13) is similar. We write the integrand of Ja a (t)Ja∗ as " ! Ja [Ra (z), 1 ] eδ|x1 | Ja∗ · e−δ|x1 | e−δ|x2 | · Ja eδ|x2 | Ha , 2;a (t) Ra (z)Ja∗ . Since the estimates for the first two factors have already been given, all we need are ! " sup eδ|x2 | Ha , 2;a (t) ≤ C , !

a,B

" s Ja Ha , 2;a (t) Ja∗ −−−→ [HB , 2B (t)] . a→∞

The first estimate is just (3.15) and the second is again implied by (3.18).

3.2. Proof of Lemma 2. Let Pλ⊥0 = 1 − Pλ0 . By the definition (1.4) we have σB (λ0 ) = i tr Pλ0 1 Pλ⊥0 2 Pλ0 − Pλ0 2 Pλ⊥0 1 Pλ0 . Since the two terms are separately trace class by (A.2), we also have −iσB (λ0 ) = tr Tλ0 with Tλ0 as in (2.16); see (2.2). Now (2.18) yields   −iσB (λ0 ) = tr E− Tλ0 E− + E+ Tλ0 E+ + E{λ} Tλ0 E{λ}  , λ∈E

and the claim follows from " ! tr P Tλ0 P = tr P Pλ0 , 1 2 P

202

A. Elgart, G.M. Graf, J.H. Schenker

for P = P ∗ with P Pλ⊥0 = 0 or P Pλ0 = 0, since one or the other holds true for P = E± , E{λ} . Indeed, in the first case, which also entails Pλ⊥0 P = 0, we have

P Tλ0 P = P Pλ0 1 Pλ⊥0 2 Pλ0 P = P Pλ0 1 2 − 1 Pλ0 2 P ! " = P Pλ0 , 1 2 P . The other case is similar:

P Tλ0 P = −P Pλ⊥0 1 Pλ0 2 Pλ⊥0 P = −P Pλ⊥0 1 2 − 1 Pλ⊥0 2 P # $ " ! = −P Pλ⊥0 , 1 2 P = P Pλ0 , 1 2 P .

3.3. Consequences of localization. We now discuss the technical consequences of assumption (1.2). In fact, all that we say in this section is a consequence of the following (weaker) estimate: g(HB )(x, x ) e−ε|x| eµ|x−x | =: Dε < ∞ , sup (3.19) g∈B1 ()

x,x ∈Z2

for every ε > 0, where the factor (1+|x|)−ν of (1.2) has been replaced by an exponential. Note that (3.19) follows from (1.2) since e−ε|x| ≤ Cε,ν (1 + |x|)−ν . (We require (1.2) to prove integrality of 2πσB (Prop. 3 below), otherwise (3.19) would suffice for the results described here.) In terms of operators, rather than of matrix elements, (3.19) implies that for some µ > 0 and all ε > 0, sup eµ(x) e−ε|x| g(HB )e−µ(x) ≤ Dε < ∞ , (3.20) g,

where the supremum with g ∈ B1 () is also taken over Lipschitz functions as in (3.4). In fact, the norm in (3.20) is estimated by Holmgren’s bound (3.8) as the larger of sup eµ((x)−(x )) e−ε|x| g(HB )(x, x ) (3.21) x

x

and a similar quantity with x, x under the supremum and summation interchanged. After bounding the supremum by a sum, both quantities are estimated by (3.19). Conversely, we take (x) = |x − x | and consider the (x, x ) matrix element of the operator in (3.20), eµ|x−x | e−ε|x| g(HB )(x, x ) ≤ Dε . (3.22) The sum in (3.19) is finite if µ is replaced there by µ/2 and ε by 2ε. We say that a bounded operator X is confined in direction i (i = 1, 2) if for some δ > 0 and all (small) ε > 0, −ε|x| δ|xi | X(i) (3.23) := e <∞. Xe ε,δ Bounds of a similar form are (3.13, 3.14), where a weight was applied to an operator X, which could have as well been replaced by X ∗ . Equivalently, the weight could have been placed on either side of X. Here, by contrast, dynamical localization will allow to

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

203

establish (3.23) for some operators X, but not for their adjoints. The asymmetry originates from the following: if X is confined, so are BX for B bounded and Xg(HB ) for g ∈ B1 (), with (i) BX(i) ε,δ ≤ B Xε,δ ,

(3.24)

(i) ε Xg(HB )(i) ε,δ ≤ D 2 X ε ,δ

(3.25)

2

for small δ > 0. In fact, Xg(HB )e−ε|x| eδ|x2 | ε ε ε ε ≤ Xe− 2 |x| eδ|x2 | · e−(δ|x2 |− 2 |x|) g(HB )e− 2 |x| e(δ|x2 |− 2 |x|) , and for sufficiently small ε, δ > 0 the Lipschitz norm of δ|x2 | − 2ε |x| is smaller than µ, whence (3.20) applies. Lemma 6. Let S ⊂ R be a Borel set that either contains or is disjoint from {λ|λ < } and similarly for {λ|λ > }, i.e., ES ∈ B1 (). Let X be a confined operator in direction i (i = 1, 2). i) The following operators are also confined in direction i, as indicated by the estimates: (i) [X, g(HB )](i) ε,δ ≤ C X 2ε ,δ , (g ∈ B1 ()) , ⊥ E XES (i) ≤ C X(i) . ε S ,δ ε,δ

(3.26) (3.27)

2

ii) If in addition S ⊂ , then the following operators are also confined: ! " HB , AT ,B (X) ES (i) ≤ C X(i) , ε ε,δ 2 ,δ T

(i) (i) AT ,B (X) − X ES ≤ C X ε ,δ , ε,δ

(3.28) (3.29)

2

and given S ⊂ R with d = dist(S, S ) > 0, ES AT ,B (X)ES (i) ≤ C X(i) . ε ε,δ 2 ,δ T

(3.30)

(i)

iii) Properties (i, ii) also hold for X = i , with Xε,δ replaced by 1. The constants C depend on ε, δ, but not on the remaining quantities, except for (3.30) which depends on d. The main use of confined operators will be through the following remark: If Xi , (i = 1, 2), is confined in direction i, then X2 X1∗ ∈ I1 with X2 X ∗ ≤ C X2 (2) X1 (1) 1 1 ε,δ ε,δ

(3.31)

for 2ε < δ. In particular, if also X1∗ X2 ∈ I1 , (3.31) is a bound for tr X1∗ X2 = tr X2 X1∗ . Indeed, (3.31) follows from e−δ|x2 | e2ε|x| e−δ|x1 | = e−(δ−2ε)|x| ∈ I1 .

204

A. Elgart, G.M. Graf, J.H. Schenker

3.4. Proof of Lemma 6. For X confined, (3.26) is implied by (3.24, 3.25). We thus consider X = i , where it is enough to estimate [i , g(HB )] e−ε|x| e±δxi = i g(HB )(1 − i )e−ε|x| e±δxi +(1 − i )g(HB )i e−ε|x| e±δxi . In the + case, for instance, the second term is bounded because i eδxi is. By (3.20) this holds for the first one too. From now on the switch functions and the confined operators will be treated simultaneously. Equation (3.27) follows from (3.26) and ES⊥ XES = ES⊥ [X, ES ]. To prove (3.28) we consider ! " T · i HB , AT ,B (X) ES = (eiHB T Xe−iHB T − X)ES = eiHB T Xe−iHB T ES − e−iHB T ES X ES − ES⊥ XES . (3.32) The term in parentheses is bounded by (3.26) for g(λ) = e−iλT ES (λ). The norm (3.23) of (3.32) is uniformly bounded in T ∈ R by (3.24, 3.25, 3.27). The same bound applies to 1 T dt (eiHB t Xe−iHB t − X)ES . (AT ,B (X) − X)ES = T 0 We now turn to (3.30), which is related to an integration by parts lemma of [19]. Since S ⊂ and d > 0, there is a contour γ in the complex plane (of length ≤ 4|| + 2d) encircling S once, but not S , at a distance ≥ d/2 from both. Then 1 X = dzR(z)ES XES R(z) 2π γ is convergent in the norm (3.23) because of (3.24, 3.25, 3.27) (note that (2/d)·ES (λ)(z− λ)−1 ∈ B1 ()). Its commutator with HB is ! " =− 1 i HB , X dz [HB − z, R(z)ES XES R(z)] 2πi γ 1 dz(ES XES R(z) − R(z)ES XES ) = ES XES . =− 2πi γ ! " ES and the claim Therefore, ES AT ,B (X)ES = AT ,B (ES XES ) = ES i HB , AT ,B (X) follows from (3.28). 3.5. Proof of Lemma 3. We first prove (2.23) and begin by recalling, see (2.10, 1.14), that 1 T i tr E B (t)E = dm(z)∂z¯ ρ(z) tr E R(z) [HB , 1 ] · T 0 2π ! " ·R(z) HB , AT ,B (2 ) R(z)E . (3.33) By (3.24, 3.25, 3.28) we have for small δ > 0, ! " R(z) HB , AT ,B (2 ) R(z)E (2) ≤ C | Im z|−2 , ε,δ T

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

205

and, together with (3.10), −1 [HB , 1 ] R(z)E (1) . ε,δ ≤ C| Im z|

By (3.31) the trace in (3.33) is bounded by a constant times T −1 | Im z|−3 . As the constant is independent of z, (2.23) now follows by means of (3.1). The operator under the trace in (2.24) is E Pλ0 1 (AT ,B (2 ) − 2 )E − E 1 Pλ0 (AT ,B (2 ) − 2 )E = E Pλ0 1 Pλ⊥0 · (AT ,B (2 ) − 2 )E −E Pλ⊥0 1 Pλ0 · (AT ,B (2 ) − 2 )E .

(3.34)

We claim that the two terms on the r.h.s. are separately trace class. In fact (3.27) implies Pλ0 1 Pλ⊥0 e−ε|x| eδ|x1 | ≤ C, and similarly with Pλ0 , Pλ⊥0 interchanged, and the bound (3.14) also applies with AT ,B (2 ) in place of 2,B (t). (Note however that the bound so obtained is not uniform in T .) A factor Pλ0 , resp. Pλ⊥0 , may now be cycled around the traces of the two terms on the r.h.s. of (3.34). The trace (2.24) thus equals tr E Pλ0 1 Pλ⊥0 · Pλ⊥0 AT ,B (2 )Pλ0 E − tr E Pλ⊥0 1 Pλ0 · Pλ0 AT ,B (2 )Pλ⊥0 E − tr E Tλ0 E ,

(3.35)

where we used that the two terms of Tλ0 , see (2.16), are separately trace class. We next show that the first two terms of (3.35) are uniformly bounded in λ0 ∈ , T > 0. Indeed, X1 = Pλ⊥0 1 Pλ0 E and X2 = Pλ⊥0 AT ,B (2 )Pλ0 E = Pλ⊥0 (AT ,B (2 )− 2 )Pλ0 E + Pλ⊥0 2 Pλ0 E are uniformly confined by (3.27, 3.29) and the conclusion is by (3.31). Finally, we will show that these two terms vanish as T → ∞, pointwise in λ0 ∈ . The first one is split according to Pλ0 = Pλ + (Pλ0 − Pλ ) for any λ < λ0 , λ ∈ : tr Pλ⊥0 AT ,B (2 )Pλ0 E · E Pλ0 1 Pλ⊥0 = tr Pλ⊥0 AT ,B (2 )Pλ E · E Pλ0 1 Pλ⊥0 + tr Pλ⊥0 AT ,B (2 )Pλ0 E · (Pλ0 − Pλ ) · E Pλ0 1 Pλ⊥0 ≡ I + II . In II, we extract the weights of the confined operators, so that the middle factor becomes e2ε|x| e− 2 (|x1 |+|x2 |) · e−ε|x| e 2 (|x1 |−|x2 |) (Pλ0 − Pλ )e 2 (|x2 |−|x1 |) e−ε|x| · δ

δ

δ

· e− 2 (|x1 |+|x2 |) e2ε|x| . δ

For δ/2 > 2ε the operators on the sides are trace class, and the middle one is uniformly bounded in λ ∈ by (3.20). Moreover, it converges weakly to zero as λ ↑ λ0 , as this s holds true by Pλ0 − Pλ − → 0 for matrix elements between states from the dense subspace of compactly supported states in 2 (Z2 ). Using w

Xn − →0,

Y1 , Y2 ∈ I1

⇒

Y1 Xn Y2 1 → 0 ,

206

A. Elgart, G.M. Graf, J.H. Schenker

we conclude that II can be made uniformly small in T by picking λ close to λ0 . The term I is then seen to be O(T −1 ) by (3.30) with S = (−∞, λ) ∩ and S = [λ0 , ∞). The second trace in (3.35) is dealt with slightly differently. We insert Pλ0 = Pλ + E (Pλ0 − Pλ )E for λ < λ0 , λ ∈ , which yields two well-defined traces. The second can be made uniformly small in T , as was the case for II above. The first one, which by (2.2) equals tr Pλ AT ,B (2 )Pλ⊥0 E · E Pλ⊥0 1 Pλ , is O(T −1 ) by (3.30), this time with S = [λ0 , ∞) ∩ , S = (−∞, λ). 3.6. Proof of Lemma 4. We shall need a particular choice of basis ψλ;j for ran E{λ} , which is related to a SULE basis [11]. (The issue is only of relevance if λ ∈ E is degenerate, since otherwise ψλ is unique up to a phase.) a basis

can be chosen ) We claim so that (3.20) applies not only to g(Hλ ) = E{λ} = ψλ;j ψλ;j , · , but also to the rank one projections into which it is decomposed (upon changing µ, Dε , depending on C4 ). Since φ (ψ, · ) = φ ψ, this amounts to (3.36) sup eµ(x) e−ε|x| ψλ;j e−µ(x) ψλ;j ≤ Dε . )

In fact, since x E{λ} (x, x)=tr E{λ}≤C4 , we may pick x0 ∈ Z2 such that E{λ} (x0 , x0 ) = maxx E{λ} (x, x). Let ψλ;0 (x) = E{λ} (x, x0 )/E{λ} (x0 , x0 )1/2 . This normalized eigenfunction satisfies the bounds ε|x0 | e−µ|x−x0 | /E (x , x )1/2 , {λ} 0 0 ψλ;0 (x) ≤ Dε e E{λ} (x0 , x0 )1/2 . The first one follows from (3.22) for g(HB ) = E{λ} , and the second from E{λ} (x, x0 ) ≤ E{λ} (x, x)1/2 E{λ} (x0 , x0 )1/2 ≤ E{λ} (x0 , x0 ) . 1

µ

Combining them into a geometric mean yields |ψ(x)| ≤ Dε2 e 2 |x0 | e− 2 |x−x0 | and, by the triangle inequality, µ µ ψλ;0 (x)ψ λ;0 (x ) ≤ Dε eε|x0 | e− 2 (|x−x0 |+|x −x0 |) ≤ Dε eε|x| e−( 2 −ε)|x−x | .

For small ε the bound (3.22) is reproduced for ψλ;0 ψλ;0 , · in place of E{λ} , with a smaller value of µ. Since the rank of E{λ} − ψλ;0 ψλ;0 , · is one less than the rank of E{λ} , the task is completed by induction. After these preliminaries, we turn to the proof of Lemma 4 proper. We denote by E the eigenvalues in E listed according to multiplicity. More precisely, we let E be the set of pairs ζ = (λ; n) with λ ∈ E and n a non-negative integer less than the multiplicity of λ. The eigenvectors {ψζ , ζ ∈ E } constructed above are an ortho-normal basis for ranE . Let, for ζ ∈ E ,

Mζ = min 1 ψζ , (1 − 1 )ψζ , 2 ψζ , (1 − 2 )ψζ . We claim that

ζ ∈E

Mζ < ∞ .

ε

(3.37)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

207

This states that almost all eigenfunctions are localized in at least one among the left, right, upper, and lower half planes, and hence in at most two (intersecting) ones. In particular almost no eigenfunction encircles the origin, which makes them insensitive to a flux tube applied there – a fact

used in some explanations [17, 28] of the QHE. We apply (3.36) to ψζ ψζ , · and use that for rank one operators φ (ψ, ·) = φ ψ to obtain eµ(x) e−ε|x| ψζ e−µ(x) ψζ ≤ Dε . For (x) = x1 we have 1 (x) ≤ e−µ(x) , implying −1 Dε 1 ψζ ≥ eµx1 e−ε|x| ψζ , similar estimates for 1 − 1 , 2 , and 1 − 2 have x1 on the r.h.s. replaced by −x1 , x2 , and −x2 respectively. Therefore, −2 −2 −2 −2 Mζ−2 = max 1 ψζ , (1 − 1 )ψζ , 2 ψζ , (1 − 2 )ψζ 1 1 ψζ −2 + (1 − 1 )ψζ −2 + 2 ψζ −2 + (1 − 2 )ψζ −2 ≥ 4 1 −2ε|x| 2µx1 −2µx1 2µx2 −2µx2 ≥ ψ e ψζ , e + e + e + e ζ 4Dε2 1 (µ−2ε)|x| , e ψ ψ ≥ ζ ζ 4Dε2 −1 1 −(µ−2ε)|x| ψ , e ψ , ≥ ζ ζ 4Dε2 where we use e2µ|x1 | + e2µ|x2 | ≥ eµ(|x1 |+|x2 |) and, in the last step, the Cauchy-Schwarz inequality 2

2 δ δ 1 = ψζ , ψζ = e 2 |x| ψζ , e− 2 |x| ψζ ≤ ψζ , eδ|x| ψζ ψζ , e−δ|x| ψζ . Now let ε > 0 be small enough that δ := µ − 2ε > 0. We can then estimate the distribution function N(t) of the Mζ : + * t2 N(t) := # ζ ∈ E | Mζ > t ≤ # ζ ∈ E | ψζ , e−δ|x| ψζ > 4Dε2 * + t2 ≤ # x ∈ Z2 | e−δ|x| > 4Dε2 ≤ C ln t , where in the step before last we used the min-max principle, see e.g., [29, Theorem XIII.1]. Together with N (t) = 0 for t ≥ 1, we have ζ ∈E

proving (3.37).

Mζ = −

1

0+

t dN (t) =

1 0+

dt N (t) < ∞ ,

208

A. Elgart, G.M. Graf, J.H. Schenker

We can now estimate the traces in (2.26): tr E{λ} e−iHB t [HB , 1 ] eiHB t 2 E{λ} ≤ ψζ , [HB , 1 ] eiHB t 2 ψζ . ζ =(λ;·)

(3.38) By inserting 2 = 1 − (1 − 2 ), the terms on the right-hand side may also be expressed as ψζ , [HB , 1 ] eiHB t (1 − 2 )ψζ . Using

ψζ , [HB , 1 ] φ = 1 ψζ , (λ − HB )φ = − (1 − 1 )ψζ , (λ − HB )φ , ) one sees that (3.38) is bounded by a constant times ζ =(λ;·) Mζ , so the right-hand side ) of (2.26) is bounded by ζ Mζ . 4. Analysis of the Harper Hamiltonian In this section we prove Theorem 3 which shows that the contribution from bulk states in (1.12) can be non-zero. We begin with the following proposition: Proposition 1. Let f ({Vx }x∈Zd ) be a function which is bounded and continuous in the Zd product topology on {Vx }x∈Zd | Im Vx ≤ 0 = C− . If f is separately analytic in each Vx , then E (f ) = f ({−i}x∈Zd ) ,

(4.1)

where E (·) represents the average with respect to the product measure dP({Vx }x∈Zd ) :=

, x∈Zd

dVx , π(1 + Vx2 )

d supported on {Vx }x∈Zd |Vx ∈ R = RZ . The same statement holds for C+ , +i in place of C− , −i. Proof. Let Sj be an increasing sequence of finite sets with limj Sj = ∪j Sj = Zd , and let Fjc denote the σ -algebra generated by {Vx }x∈Sjc . So conditional expectation with respect to Fjc is given by “averaging out” the variables {Vx }x∈Sj . Thus fj ({Vx }x∈Sjc ) := E

f |Fjc

=

, x∈Sj

dVx f ({Vx }x∈Sj × {Vx }x∈Sjc ) . π(1 + Vx2 )

Because f is bounded and separately analytic in each Vx , we may evaluate the integrals on the right-hand side by residues to obtain fj ({Vx }x∈Sjc ) = f ({−i}x∈Sj × {Vx }x∈Sjc ) .

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

209

Because f is continuous and limj →∞ {−i}x∈Sj × {Vx }x∈Sjc = {−i}x∈Zd in the product topology on C−

Zd

, we have lim fj ({Vx }x∈Sjc ) = f ({−i}x∈Zd )

j →∞

d for any {Vx }x∈Zd ∈ RZ . Since fj are uniformly bounded and E fj = E (f ) for every j , we conclude by dominated convergence that (4.1) holds. Turning now to the proof of Theorem 3, we first recall that, by Lemma 1, −

i lim tr ρ (Ha ) {[Ha , 1 ] , 2 } = Re tr B (0) , 2 a→∞

where iB (0)

1 =− 2π

(4.2)

! ! " " dm(z)∂z¯ ρ(z) tr RB (z) Hφ , 1 RB (z) Hφ , 2 RB (z) . TB (z)

In going from (2.10) to the above expression for B (0) we have replaced HB by Hφ in the commutators [HB , i ] since the random potential commutes with each switch function i . By Lemma 1, we have supa tr ρ (Ha ) { [Ha , 1 ] , 2 } ≤ C < ∞, with a constant C that depends on ρ and on the bounds C1 , C3 in (1.1, 1.6), but not on the random constant C2 in (1.2). Since the constants C1 , C3 are non-random in our setup, the expectation in (1.22) is well defined, and furthermore can be exchanged with the limit. We claim that for Im z = 0, E (tr TB (z)) = tr Tφ (z + iασ (z)) , (4.3) ! ! " " where Tφ (z) = Rφ (z) Hφ , 1 Rφ (z) Hφ , 2 Rφ (z), with Rφ (z) = (Hφ − z)−1 , and σ (z) = Im z/| Im z| denotes the sign of the imaginary part of z. Indeed, for Im z > 0, it suffices to verify that fz ({Vx }) = tr TB (z) obeys the hypotheses of Proposition 1. For that purpose, it is useful to note that Gz ({Vx }x∈Zd ) := (Hφ + αV − z)−1 is a continuous map from {Vx }x∈Zd | Im Vx ≤ 0 to the bounded operators on 2 (Z2 ) endowed with the strong operator topology. Indeed, z is in the resolvent set of Hφ + αV since the numerical range of this operator is contained in the closed lower half plane. Thus Gz is well defined, SOT-continuous (since {Vx }x → Hφ + αV and A → A−1 are SOT-continuous), and Gz ({Vx }

≤

x∈Zd )

1 1 . ≤ dist(z, num. range(Hφ + αV )) | Im z|

Furthermore, the Combes-Thomas bound (3.2) extends to Gz , i.e., C δ(x) Gz ({Vx }x∈Zd )e−δ(x) ≤ , δ −1 = C 1 + | Im z|−1 , e | Im z|

(4.4)

(4.5)

210

A. Elgart, G.M. Graf, J.H. Schenker

with (x) as in (3.4). The resolvent of e±δ(x) (Hφ + αV )e∓δ(x) , considered as a perturbation of Hφ + αV , is in fact as stable as in (3.2), where Hφ was self-adjoint, since the same bound (4.4) still holds for Im z > 0. Furthermore, we see in this way that {Vx }x∈Zd → eδ(x) Gz ({Vx }x∈Zd )e−δ(x) is SOT-continuous. Thus, for Im z > 0,

" ! tr TB (z) = tr Gz ({Vx }x∈Zd ) Hφ , 1 eδ|x1 | · e−δ|x1 | Gz ({Vx }x∈Zd )eδ|x1 | · ! " · e−δ(|x1 |+|x2 |) · eδ|x2 | Hφ , 2 Gz ({Vx }x∈Zd ) ,

is a continuous function, which is bounded by |tr TB (z)| ≤

! " δ|x | C Gz ({Vx } d )2 H e 1 · , φ 1 x∈Z δ2 ! " · e−δ|x1 | Gz ({Vx }x∈Zd )eδ|x1 | eδ|x2 | Hφ , 2

≤C

(4.6)

(1 + | Im z|−1 )2 , | Im z|3

with the factor of 1/δ 2 coming from the estimate (3.9) on the trace of e−δ|x| . A similar argument is used for Im z < 0. Since the separate analyticity of fz (·) = tr TB (z) is clear, Proposition 1 applies. We see that

1 (4.7) Im dm(z)∂z¯ ρ(z) tr Tφ (z + iασ (z)) , E Re tr B (0) = − 2π

where the interchange of dm(z) and E is justified by Fubini’s theorem and (4.6) since we may arrange for ∂z¯ ρ(z) to vanish faster than | Im z|5 as z approaches the real axis. We note that Cα tr Tφ (z + iασ (z)) ≤ . (4.8) [x 2 + (|y| + α)2 ]3/2 In fact, now that V = 0, | Im z|−1 in (4.4) may be replaced by dist(z, σ (Hφ ))−1 ≤ dist(z, [−2, 2])−1 and the same replacement carries over to the denominator in the estimate (4.6) for tr Tφ (z). The only singularities in the integrand on the right - hand side of (4.7) are jump discontinuities at Im z = 0. Integrating by parts, on the upper and lower half planes separately, we find ∞

1 E Re tr B (0) = dxρ(x) tr Tφ (x + αi) − Tφ (x − αi) , (4.9) Re 2π −∞ since by (4.8)

∞ there are no contributions from the boundary at infinity. Upon writing ρ(x) = − x ρ (λ)dλ, and interchanging λ and x integration we obtain ∞ λ

1 E Re tr B (0) = − dλρ (λ) Re tr Tφ (x + αi) − Tφ (x − αi) dx . 2π −∞ −∞ (4.10)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

211

This proves (1.22) with jB (λ) =

1 Re 2π

λ

−∞

tr Tφ (x + αi) − Tφ (x − αi) dx .

To obtain the asymptotic expression (1.23), note that for |λ| > 2, α 1 idη tr Tφ (λ + iη) , Re jB (λ) = 2π −α

(4.11)

(4.12)

because the difference of the right-hand sides of (4.11, 4.12) is the real part of an integral around a closed contour, which may be deformed to infinity, of the analytic function tr Tφ (z), which vanishes like 1/|z|2 as z → ∞. (It is of interest to note that for λ in an internal gap of the spectrum of Hφ , the corresponding contour integral gives (φ) the bulk conductance σB (λ) for the Hamiltonian Hφ at Fermi energy λ, so jB (λ) =

(φ) α 1 σB (λ) + 2π Re i −α dη tr Tφ (λ + iη).) It is useful to rewrite (4.12) as α 1 idη tr Tφ (λ + iη) − tr Tφ (λ − iη) , (4.13) Re jB (λ) = 2π 0 which follows by considering the contributions from η < 0 and η > 0 separately, and using Re i w = − Re i w. We obtain (1.23) from the series for TB (λ+iη)−tr TB (λ − iη) produced by expanding each resolvent in a Neumann series. For sufficiently large |λ|, Rφ (λ + iη) = −

. ∞ 1 Hφ − iη n λ λ

(4.14)

n=0

is absolutely convergent, and ∞ "

n ! 1 1 tr Hφ − iη 1 Hφ , 1 · λ3 λN N=0 n1 +n2 +n3 =N

n2 !

n " · Hφ − iη Hφ , 2 Hφ − iη 3 .

tr Tφ (λ + iη) = −

To prove convergence here, it is useful to note that in addition to (4.14), the series e

δ|x|

Rφ (λ + iη)e

−δ|x|

.n ∞ 1 eδ|x| Hφ e−δ|x| − iη = − λ λ n=0

is also absolutely convergent, in light of (1.1). By cyclicity of the trace tr Tφ (λ + iη) = −

∞ N=0

1 λN+3

N n=0

n ! " (n + 1) tr Hφ − iη Hφ , 1 ·

"

N−n ! · Hφ − iη Hφ , 2 ,

212

A. Elgart, G.M. Graf, J.H. Schenker

and, making use of the identity tr T = tr T ∗ , ∞

N

"

n ! (N − n + 1) tr Hφ − iη Hφ , 1 · λN+3 n=0 N=0 "

N−n ! · Hφ − iη Hφ , 2 .

tr Tφ (λ − iη) = −

1

Thus tr Tφ (λ + iη) − tr Tφ (λ − iη) = −

∞ N=0

N

1 λN+3

n !

N−n ! " " (2n − N ) tr Hφ − iη Hφ , 1 · Hφ − iη Hφ , 2 ,

n=0

which is the desired expansion. The first term (N = 0) of this series vanishes trivially. The second (N = 1) also vanishes, because ! " " "! "

!

! tr Hφ , 1 Hφ − iη Hφ , 2 − tr Hφ − iη Hφ , 1 Hφ , 2 ! ! "" ! " = − tr Hφ , Hφ , 1 Hφ , 2 ! ! "" ! " =− Hφ , Hφ , 1 (x, y) Hφ , 2 (y, x) = 0 , (4.15) x

y

" ! "" ! ! since Hφ , 2 (y, x) = 0 only for |x − y| = 1 and Hφ , Hφ , 1 (x, y) = 0 only for |x − y| = 0, 2, as only nearest neighbor hopping terms are present in Hφ . However the coefficient of λ−5 (N = 2) is non-zero, and given by ! " " "! "

2 !

2 ! 2 tr Hφ , 1 Hφ − iη Hφ , 2 − 2 tr Hφ − iη Hφ , 1 Hφ , 2 ! " ! " ! "! " = 2 tr Hφ , 1 Hφ2 Hφ , 2 − 2 tr Hφ2 Hφ , 1 Hφ , 2 !! " ! "" = −2 tr Hφ2 Hφ , 1 , Hφ , 2 , 2 since the term proportional " ! by (4.15) "" and the term proportional to η is the !! to η vanishes trace of a commutator, tr Hφ , 1 , Hφ , 2 = 0. To calculate this term explicitly, recall that i = I [xi < 0] so, by (1.21), " ! Hφ , 1 (x, x ) = (1 (x ) − 1 (x))Hφ (x, x )   x = (0, x2 ) , x = (−1, x2 ) , 1 , = −1 , x = (−1, x2 ) , x = (0, x2 ) ,  0 , all other x, x ,

which is more succinctly expressed in Dirac notation: " ! |0, a −1, a| − |−1, a 0, a| . Hφ , 1 = a∈Z

Similarly, " eiφa |a, 0 a, −1| − e−iφa |a, −1 a, 0| . Hφ , 2 =

!

a∈Z

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

Thus

213

!! " ! "" Hφ , 1 , Hφ , 2 = (e−iφ − 1) |0, 0 −1, −1| + |−1, 0 0, −1| −(eiφ − 1) |0, −1 −1, 0| + |−1, −1 0, 0| ,

and " ! "" Hφ , 1 , Hφ , 2 = (e−iφ − 1) −1, −1| Hφ2 |0, 0 + 0, −1| Hφ2 |−1, 0 − c.c.

tr Hφ2

!!

Finally, since −1, −1| Hφ2 |0, 0 = 1 + e−iφ ,

0, −1| Hφ2 |−1, 0 = 1 + eiφ ,

we have 2 tr Hφ2

!!

" ! "" Hφ , 1 , Hφ , 2 = 4(e−iφ − 1) (cos(φ) + 1) − c.c. = −8i sin(φ)(cos(φ) + 1) .

Therefore tr Tφ (λ + iη) − tr Tφ (λ − iη) = 8i sin(φ)(cos(φ) + 1)λ−5 + O(λ−6 ) , and jB (λ) = −

4α sin(φ)(cos(φ) + 1)λ−5 + O(λ−6 ) , π

which gives (1.23). This completes the proof of Theorem 3.

Appendix: Conductance Plateaus Localization is an essential prerequisite for the QHE. Some localization condition, valid at energies in an interval , is proven and used in [6, 2]. It ensures that σB (λ) is 1. well defined as given by (1.4), 2. constant in λ ∈ , and 3. 2π σB (λ) ∈ Z. These results also rest on a homogeneity assumption for the Hamiltonian HB , or on its Fermi projections Pλ , namely that they be invariant or ergodic under magnetic translations. The purpose of the Appendix is to establish (1.–3.) under assumptions (1.1–1.3), which do not entail translation invariance. Proposition 2. Assume (1.1) and (1.2). Then σB (λ) is well-defined. If in addition (1.3) holds, then σB (λ) is constant in λ ∈ . Proposition 3. Assume (1.1) and (1.2). Then 2πσB (λ) ∈ Z for λ ∈ . We remark that here constancy is proven without combining integrality and continuity.

214

A. Elgart, G.M. Graf, J.H. Schenker

A.1. Proof of Prop. 2. We consider Borel sets S ⊂ R that either contain or are disjoint from {λ|λ < } and similarly for {λ|λ > }. The class of such sets S is closed under unions and complements. We associate a bulk Hall conductance to S by setting σB (S) = −i tr ES [[ES , 1 ] , [ES , 2 ]] = i tr ES 1 ES⊥ 2 ES − ES 2 ES⊥ 1 ES ,

(A.1)

where ES⊥ = 1 − ES and the second line follows from ES [ES , 1 ] = ES [ES , 1 ] ES⊥ = ES 1 ES⊥ . Note that σB (λ0 ) = σB ((−∞, λ0 )). We claim that, if S1 ∩ S2 = ∅, then ES1 1 ES2 2 ES1 ∈ I1 , σB (S1 ∪ S2 ) = σB (S1 ) + σB (S2 ) ,

(A.2) (A.3)

lim σB (Sn ) = 0 if Sn ↓ ∅ .

(A.4)

and moreover n→∞

In particular, (A.2) and its adjoint for S1 = S, S2 = R \ S imply that the two terms in the final expression of (A.1) are separately trace class. (A.2): In the factorization ES1 1 ES2 2 ES1 = ES1 1 ES2 e3δ|x1 | e−δ|x| · e−δ|x| · e−δ|x| e3δ|x2 | ES2 2 ES1 ,

(A.5)

the middle e−δ|x| = e−δ|x1 | e−δ|x2 | is trace class by (3.9), so that we need to show (A.6) ES1 i ES2 e3δ|xi | e−δ|x| < ∞ , (i = 1, 2) . This follows from (3.25, 3.27) and part (iii) of Lemma 6, with a bound which is uniform in S1 , S2 . (A.4): By (A.1, A.5) and (2.21) it suffices to show s

ESn i ES⊥n e3δ|xi | e−δ|x| −−−→ 0 . n→∞

Since the l.h.s. is uniformly bounded in norm by the remark just made, we may drop the exponentials as explained in connection with (3.16, 3.17). Then the claim becomes obvious. (A.3): From ES1 ∪S2 = ES1 + ES2 and (2.2) we have σB (S1 ∪ S2 ) =

2

tr ESi i ES⊥1 ∪S2 2 ESi − tr ES⊥1 ∪S2 1 ESi 2 ES⊥1 ∪S2

i=1

We use ES⊥1 ∪S2 = ES⊥i − ESi+1 (with i + 1 defined mod 2) and obtain σB (S1 ∪ S2 ) =

2

σB (Si ) −

i=1

+

2

2

tr ESi 1 ESi+1 2 ESi

i=1

tr ESi+1 1 ESi 2 ESi+1

i=1

= σB (S1 ) + σB (S2 ) .

.

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

215

We finally prove constancy by showing that σB ([a, b]) = 0 for any [a, b] ⊂ . Since σ (HB ) is pure point in we have En :=

n i=1

s

E{λi } −−−→ EE[a,b] = E[a,b] , n→∞

where λi is any labeling of the eigenvalues λ ∈ E[a,b] . Now En is a finite dimensional projection by (1.3), whence the two terms in

σB ∪ni=1 {λi } = −i tr (En 1 En 2 En − En 2 En 1 En ) = 0 are separately trace class. They cancel by (2.2). We conclude by (A.3, A.4) that

σB ([a, b]) = σB ∪ni=1 {λi } + σB E[a,b] \ ∪ni=1 {λi } −−−→ 0 . n→∞

A.2. Proof of Prop. 3. As in [5] we are going to establish that 2π σB (λ) is an integer by relating it to the index of a pair of projections. We first allow the functions i in (1.4) to switch values at points other than the origin. Let p = (p1 , p2 ) ∈ Z2∗ = Z2 + ( 21 , 21 ) be the center of a plaquette and set !! " ! "" σp = −i tr Pλ Pλ , 1,p , Pλ , 2,p ! " ! " ! " ! " = i tr Pλ , 1,p Pλ⊥ Pλ , 2,p − Pλ , 2,p Pλ⊥ Pλ , 1,p , (A.7) where i,p = (xi − pi ), (i = 1, 2). (Since (n) = (n + 21 ) for n ∈ Z, σB (λ) is just σp for p = −( 21 , 21 ).) To define the index, let θp (x) = arg(x −p) be the angle of sight of x ∈ Z2 from p, and set Up (x) = eiθp (x) . The relevant index is Np = Ind(Up Pλ Up∗ , Pλ ), where Ind(P , Q) denotes the index of a pair of projections introduced in ref. [5]: Ind(P , Q) := dim ran P ∩ ker Q − dim ran Q ∩ ker P .

(A.8)

We recall the following basic properties of Ind(·, ·): 1. If P − Q is compact, Ind(P , Q) is well defined and finite. 2. If (P − Q)2n+1 is trace class for some integer n ≥ 0, then tr(P − Q)2n+1 = Ind(P , Q) .

(A.9)

Since Np is an integer by (A.8), Prop. 3 is a consequence of the identity 2πσB (λ) = Np , to be proved below. Indeed, this is the same strategy employed in refs. [5, 2]. The starting point for our proof is the observation that σp and Np are independent of p even without ergodicity for the underlying projection. Lemma 7. The index Np is well defined for any p ∈ Z2∗ , and for any a ∈ Z2 , i) Np+a = Np , ii) σp+a = σp .

216

A. Elgart, G.M. Graf, J.H. Schenker

Proof. Part (i) follows from [5, Prop. 3.8] once we verify that Np is well defined. For this we follow [2] and show that (Pλ − Up Pλ Up∗ )3 is trace class, using Lemma ([2, Lemma 1]). For an operator with the matrix elements Tx,y , 1/3 3 1/3 3 T 3 ≡ (tr |T | ) ≤ |Tx+b,x | . x

b

In our case, with T =

Pλ − Up Pλ Up∗ ,

we have (see [2, Eq. (4.13)])

|T (x + b, x)| = |1 − ei(θp (x+b)−θp (x)) ||Pλ (x + b, x)| |b| |b| ≤ C |Pλ (x + b, x)| ≤ C(1 + |p|) |Pλ (x + b, x)| . 1 + |x − p| 1 + |x| (Here and in the sequel, C denotes a generic constant, whose value is independent of any lattice sites in the given inequality, though that value may change from line to line.) Since (1.2) holds for g(HB ) = Pλ , we have |Pλ (x + b, x)| ≤ C2 (1 + |x|)ν e−µ|b| , but we also have |Pλ (x + b, x)| ≤ 1, because Pλ ≤ 1. Combing these two estimates gives 1 |b| ≤ 2ν µ ln(|x| + 1) , µ (A.10) |Pλ (x + b, x)| ≤ 2ν − 2 |b| |b| > µ ln(|x| + 1) . C2 e Thus

x

1/3 |T (x + b, x)|

3



 ≤ C(1 + |p|)|b| 

µ

#

|x|<e 2ν

≤ C(1 + |p|)|b| e

− µ2 |b|

|b|

µ

C2 e− 2 |b|

−1

+ e

(1 + |x|)3 µ − 6ν |b|

1/3

$3 +

µ

|x|≥e 2ν

|b|

−1

1   (1 + |x|)3

.

Since the last line is clearly summable over b, we see that (Up Pλ Up∗ − P )3 is trace class, and therefore the index Np is well defined. Turning now to part (ii), we note that we may just treat the case p = −( 21 , 21 ), a = (a1 , 0), the case of translation in the 2-direction being similar. By (A.1, A.2, 2.2) we need to show that tr(Pλ (1 )Pλ⊥ 2 Pλ ) − tr(Pλ 2 Pλ⊥ (1 )Pλ ) = tr(Pλ (1 )Pλ⊥ 2 Pλ ) − tr(Pλ⊥ (1 )Pλ 2 Pλ⊥ )

(A.11)

vanishes, where 1 (x) = (x1 ) − (x1 − a1 ) is compactly supported in x1 . We claim that (1 )Pλ⊥ 2 Pλ ∈ I1 . This follows like (A.2) through the factorization (1 )Pλ⊥ 2 Pλ = (1 )e3δ|x1 | e−δ|x| · e−δ|x| · e−δ|x| e3δ|x2 | Pλ⊥ 2 Pλ ,

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

217

by noticing that the first factor, which is new, is bounded. Likewise (1 )Pλ 2 Pλ⊥ ∈ I1 . Therefore (A.11) equals tr(1 )Pλ⊥ 2 Pλ − tr(1 )Pλ 2 Pλ⊥ = tr(1 ) [2 , Pλ ] = 0 , by evaluating the trace in the position basis.

The proof of Prop. 3 is now completed by the following result, with the translation invariance required in the argument of [5] now provided by Lemma 7. Lemma 8. Let L = {−L, . . . , L}2 ⊂ Z2 . Then + −2i N/2π Pλ (x, y)Pλ (y, z)Pλ (z, x) Area(x, y, z) , = lim 2 σB (λ) L→∞ (2L + 1) 2 y,z∈Z x∈L

(A.12) where N, resp. σB (λ) are the translation invariant values of Np , resp. σp , and Area (x, y, z) is the triangle’s oriented area, namely 21 (x − y) ∧ (y − z). Remark 3. The r.h.s. of (A.12) is the trace per unit volume of −iPλ [[Pλ , X1 ] , [Pλ , X2 ]] , which may be interpreted as the macroscopic version of (1.4). Proof. The first statement makes use of Connes’ area formula [9] in the version [5] adapted to the lattice [2]: For a fixed triplet u(1) , u(2) , u(3) ∈ Z2 , let αi (p) ∈ (−π, π ) be the angle of view from p ∈ Z2∗ of u(i+2) relative to u(i+1) (with αi (p) = 0 if p lies between them). Then 3

sin αi (p) = 2π Area(u(1) , u(2) , u(3) ) .

(A.13)

p∈Z2∗ i=1

By the computation of [5], Np = tr(Up Pλ Up − Pλ )3 = −2i

Pλ (x, y)Pλ (y, z)Pλ (z, x)S(p, x, y, z),

x,y,z∈Z2

with S(p, x, y, z) = sin ∠(x, p, y) + sin ∠(y, p, z) + sin ∠(z, p, x). Letting ∗L = 2 −L + 21 , . . . , L + 21 ⊂ Z2∗ we have that N (2L + 1)2 is the sum of the r.h.s. over ∗ p ∈ L . We would like to replace the sum over x ∈ Z2 , p ∈ ∗L by that over x ∈ L , p ∈ Z2∗ . The error is estimated by |f (p, x)| + |f (p, x)| , (A.14) x∈Z2 \L p∈∗L

x∈L p∈Z2∗ \∗L

218

A. Elgart, G.M. Graf, J.H. Schenker

where f (p, x) := −2i

Pλ (x, y)Pλ (y, z)Pλ (z, x)S(p, x, y, z) .

y,z∈Z2

By (1.2) for g(HB ) = Pλ the points y, z are exponentially clustered around x, so we have |f (p, x)| ≤ Cx (1 + |p − x|)−3 . However because of the pre-factor (1 + |x|)ν in (1.2), the constant Cx carries some dependence on x (as indicated), which must be controlled in order to bound (A.14). In fact, the following estimate for |f (p, x)| is true: |f (p, x)| ≤ C

[1 + ln(1 + |x|)]5 . 1 + |x − p|3

(A.15)

Before proving (A.15), let us see how it allows us to complete the proof. Indeed, since 1 = O (L ln L) , L → ∞ , (1 + |x − p|)3 x∈L p∈Z2∗ \∗L

as far as the second term of (A.14) is concerned, we have 1 |f (p, x)| ≤ C[ln L]5 = O(L[ln L]6 ) . (1 + |x − p|)3 x∈L p∈Z2∗ \∗L

x∈L p∈Z2∗ \∗L

For the first term we note that [1 + ln(1 + |x|)]5 ≤ C(ln L)5 [1 + ln(1 + |x − p|)]5 , for x, p in the indicated range and large L, resulting in

|f (p, x)| ≤ C[ln L]5

p∈∗L x∈Z2 \L

x∈Z2 \L p∈∗L

Therefore,

N(2L + 1)2 =

= −2i

[1 + ln(1 + |x − p|)]5 = O(L[ln L]11 ) . (1 + |x − p|)3

f (p, x) + O(L[ln L]11 )

x∈L p∈Z2∗

Pλ (x, y)Pλ (y, z)Pλ (z, x)

S(p, x, y, z) + O(L[ln L]11 ) ,

p∈Z2∗

x∈L y,z∈Z2

which gives (A.12) for N/2π after applying Connes’ area formula and taking the limit L → ∞. As for the proof of (A.15), we consider separately the cases (i) |p−x| < 2ν µ ln(|x|+1) and (ii) |p − x| ≥ conclude

2ν µ

|f (p, x)| ≤ 6

ln(|x| + 1). In case (i), we use the bound |S(p, x, y, z)| ≤ 3 to y,z∈Z2

|Pλ (x, y)Pλ (y, z)Pλ (z, x)| ≤ 6

y∈Z2

|Pλ (x, y)| ,

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

219

since

 |Pλ (y, z)Pλ (z, x)| ≤ 

z∈Z2

|Pλ (y, z)|2

z∈Z2

z∈Z2 1/2

≤ [Pλ (y, y)Pλ (x, x)]

1/2 |Pλ (z, x)|2  ≤ 1.

Now by (A.10), 2 ν |Pλ (x, y)| ≤ 4 ln(|x| + 1) + 1 + C2 µ 2

y∈Z

|b|> 2ν µ ln(|x|+1)

≤ C [1 + ln(|x| + 1)]2 ≤ C

µ

e− 2 |b|

[1 + ln(|x| + 1)]5 , (1 + |x − p|)3

where in the last step we have used that |x − p| ≤ 2ν µ ln(|x| + 1). This implies (A.15) in case (i). To prove (A.15) in case (ii), consider separately the contributions to f (p, x) coming when both y and z fall inside the ball of radius |p − x| around x and when one of y or z falls outside the ball. The latter contribution is exponentially small in |x − p|, since it is bounded by     6 

|y−x|≥|p−x| z∈Z2

≤ 12

+

|z−x|≥|p−x| y∈Z2

  |Pλ (x, y)Pλ (y, z)Pλ (z, x)|  µ

|Pλ (x, y)| ≤ Ce− 2 |x−p| ,

|y−x|≥|p−x|

where in the last step we have used (A.10) and the fact that |x − p| > 2ν µ ln(|x| + 1). To bound the former contribution note that in this case both |∠(y, p, x)| and |∠(z, p, x)| are smaller than π2 , and make use of the following estimates: (1) given α, β ∈ (− π2 , π2 ), |sin α + sin β − sin(α + β)| ≤ |sin α|3 + |sin β|3 , and (2) given y with |y − x| < |p − x|, |sin ∠(y, p, x)| ≤

|y − x| . 1 + |p − x|

Putting these two estimates together gives the following bound for the contribution with y, z in the ball of radius |x − p| around x, C (1 + |p − x|)3 ≤

|Pλ (x, y)Pλ (y, z)Pλ (z, x)| |y − x|3 + |z − x|3

|y−x|,|z−x|<|p−x|

C [1 + ln(|x| + 1)]5 3 |P (x, y)| |y − x| ≤ C , λ (1 + |p − x|)3 (1 + |p − x|)3 2 y∈Z

220

A. Elgart, G.M. Graf, J.H. Schenker

where in the last step we have used (A.10). This proves (A.15) in case (ii) and completes the proof of (A.12) for N/2π . The proof for σB is similar. By evaluating (A.7) in the position basis as in [5] we obtain σp = i Pλ (x, y)Pλ⊥ (y, z)Pλ (z, x) · x,y,z∈Z2

· [((y1 − p1 )−(x1 − p1 ))((z2 − p2 )−(y2 − p2 ))−(1 ↔ 2)] .(A.16) We then sum over p ∈ ∗L and move the anchor from p to x (in this case the corresponding f (p, x) decays exponentially in |p − x|, again with logarithmic growth in |x|). The sum over p ∈ Z2∗ of the square bracket in (A.16) involves ((yi − pi ) − (xi − pi )) = xi − yi , pi ∈Z∗

and thus equals (x1 − y1 )(y2 − z2 ) − (x2 − y2 )(y1 − z1 ) = 2 Area(x, y, z). The proof is completed by Pλ⊥ (y, z) = δyz − Pλ (y, z). Note added in proof. After submission of this paper we learned of the following early result on equality of conductances in the case of a spectral gap: Hatsugai, Y.: Chern number and edge states in the integer quantum Hall effect. Phys. Rev. Lett. 71, 3697–3700 (1993) Acknowledgements. We thank M. Aizenman,Y. Avron, J. Bellissard, J.-M. Combes, J. Fr¨ohlich, F. Germinet, and H. Schulz-Baldes for useful discussions.

References 1. Aizenman, M.: Localization at weak disorder: some elementary bounds. Rev. Math. Phys. 6(5A), 1163–1182 (1994) 2. Aizenman, M., Graf, G. M.: Localization bounds for an electron gas. J. Phys. A 31(32), 6783–6806 (1998) 3. Aizenman, M., Molchanov, S.: Localization at large disorder and at extreme energies: an elementary derivation. Commun. Math. Phys. 157, 245–278 (1993) 4. Aizenman, M., Schenker, J. H., Friedrich, R. M., Hundertmark, D.: Finite volume fractional moment criteria for Anderson localization. Commun. Math. Phys. 224, 219–253 (2001) 5. Avron, J. E., Seiler, R., Simon, B.: Charge deficiency, charge transport and comparison of dimensions. Commun. Math. Phys. 159(2), 399–422 (1994) 6. Bellissard, J., van Elst, A., Schulz-Baldes, H.: The noncommutative geometry of the quantum Hall effect. J. Math. Phys. 35(10), 5373–5451 (1994) 7. Combes, J.-M., Germinet, F: Edge and impurity effects on quantization of Hall currents. Commun. Math. Phys. 256, 159–180 (2005) 8. Combes, J.-M., Thomas, L.: Asymptotic behaviour of eigenfunctions for multiparticle Schr¨odinger operators. Commun. Math. Phys. 34, 251–270 (1973) ´ 9. Connes, A.: Noncommutative differential geometry. Inst. Hautes Etudes Sci. Publ. Math. 62, 257– 360 (1985) 10. Cycon, H. L., Froese, R. G., Kirsch, W., Simon, B.: Schr¨odinger operators with application to quantum mechanics and global geometry. Texts and Monographs in Physics. Berlin: Springer-Verlag, study edition, 1987 11. del Rio, R., Jitomirskaya, S., Last, Y., Simon, B.: Operators with singular continuous spectrum. IV. Hausdorff dimensions, rank one perturbations, and localization. J. Anal. Math. 69, 153–200 (1996) 12. Elbau, P., Graf, G. M.: Equality of bulk and edge Hall conductance revisited. Commun. Math. Phys. 229(3), 415–432 (2002)

Equality of the Bulk and Edge Hall Conductances in a Mobility Gap

221

13. Elgart, A., Schenker, J. H.: Dynamical localization for an adiabatically driven particle in the presence of disorder. In preparation 14. Fr¨ohlich, J., Studer, U. M.: Gauge invariance and current algebra in nonrelativistic many-body theory. Rev. Mod. Phys. 65(3, part 1), 733–802 (1993) 15. Gat, O., Avron, J. E.: Magnetic fingerprints of fractal spectra and the duality of Hofstadter models. New J. Phys. 5, 44.1–44.8 (2003) 16. Germinet, F., De Bi`evre, S.: Dynamical localization for discrete and continuous random Schr¨odinger operators. Commun. Math. Phys. 194(2), 323–341 (1998) 17. Halperin, B. I.: Quantized hall conductance, current carrying edge states, and the existence of extended states in a two-dimensional disordered potential. Phys. Rev. B25, 2185–2190 (1982) 18. Hunziker, W., Sigal, I. M.: Time-dependent scattering theory of N-body quantum systems. Rev. Math. Phys. 12(8), 1033–1084 (2000) 19. Kato, T.: On the adiabatic theorem of quantum mechanics. Phys. Soc. Jap. 5, 435–9 (1958) 20. Kunz, H., Souillard, B.: Sur le spectre des op´erateurs aux diff´erences finies al´eatoires. Commun. Math. Phys. 78(2), 201–246 (1980/81) 21. Laughlin, R. B.: Quantized hall conductivity in two-dimensions. Phys. Rev. B23, 5632–5733 (1981) 22. van Leeuwen, H.: Probl`emes de la th´eorie e´ lectronique du magn´etisme. J. de Phys. 2, 361–377 (1921) 23. Lieb, E. H., Robinson, D. W.: The finite group velocity of quantum spin systems. Commun. Math. Phys. 28, 251–257 (1972) 24. Macris, N.: On the equality of bulk and edge conductance in the integer Hall effect: microscopic analysis. Preprint, 2003 25. Macris, N., Martin, Ph. A., Pul´e, J. V.: Diamagnetic currents. Commun. Math. Phys. 117, 215–241 (1988) 26. Osadchy, D., Avron, J.E.: Hofstadter butterfly as quantum phase diagram. J. Math. Phys. 42(12), 5665–5671 (2001) 27. Peierls, R.: Surprises in theoretical physics. Princeton, NJ: Princeton University Press, 1979 28. Prange, R. E.: In: Prange, R., Girvin, S. M. (eds.), The Quantum Hall Effect, Berlin-Heidelberg-New York: Springer Verlag, 1987, pp. 1–34 29. Reed, M., Simon, B.: Methods of modern mathematical physics. IV. Analysis of operators. Second edition, New York: Academic Press [Harcourt Brace Jovanovich Publishers], 1978 30. Reed, M., Simon, B.: Methods of modern mathematical physics. I. Functional analysis. Academic Press Inc. [Harcourt Brace Jovanovich Publishers], 1980 31. Robinson, F. N. H.: Macroscopic Electromagnetism. Oxford: Pergamon, 1973 32. Schulz-Baldes, H., Kellendonk, J., Richter, T.: Simultaneous quantization of edge and bulk Hall conductivity. J. Phys. A 33(2), L27–L32 (2000) 33. Simon, B.: Trace ideals and their applications, Volume 35 of London Mathematical Society Lecture Note Series. Cambridge: Cambridge University Press, 1979 34. Simon, B.: Cyclic vectors in the Anderson model. Rev. Math. Phys. 6(5A), 1183–1185 (1994) 35. Stˇreda, P.: Theory of quantized Hall conductivity in two dimensions. J. Phys. C 15, L717–L721 (1982) Communicated by M. Aizenman

Commun. Math. Phys. 259, 223–256 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1371-0

Communications in

Mathematical Physics

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD : The Case of Ill-Prepared Data F. Rousset CNRS, Laboratoire J.-A Dieudonne, UMR 6621, Universite de Nice, Parc Valrose, 06108 Nice Cedex 02, France. E-mail: [email protected] Received: 22 September 2004 / Accepted: 18 January 2005 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: In this paper, we study an incompressible highly rotating fluid submitted to a high magnetic field between two planes with a Dirichlet boundary condition. We investigate the nonlinear stability of Ekman-Hartmann boundary layers under a spectral assumption for general initial data; this means that the data can be chosen as an arbitrary (but smooth enough) three-dimensional divergence free vector field independent of the small parameter. Introduction We study the equations describing a three dimensional highly rotating fluid submitted to the action of a magnetic field of high intensity in the domain = R2 × (0, 1): e × jε ∇p ε e × uε + + = εuε , ε ε ε j ε = ∇ϕ ε − e × uε , ∇ · uε = 0, ∇ · j ε = 0. ∂t uε + uε · ∇uε +

(1) (2) (3)

In this model uε and pε are the velocity and the pressure of the fluid which is assumed to be incompressible and j ε and ϕ ε are the current density and the magnetic potential. We assume that e which gives both the direction of the rotation axis and the averaged magnetic field is constant: we set e = (0, 0, 1). Here ε > 0 is a small parameter which will tend to zero and > 0, which is the Elsasser number, is considered to be fixed O(1). We add to this system the boundary conditions ε uε\∂ = 0, j\∂ · n = 0,

(4)

where n is the normal to the boundary of the domain and the initial condition uε (0, x) = u0 (x),

(5)

224

F. Rousset

where u0 is independent of ε and divergence free. This system with such a scaling is relevant to modelize the motion of the Earth liquid core, ε is known to be very small (ε ∼ 10−7 ) and hence it is physically relevant to investigate the limit of the model when ε goes to zero. The possible instabilities in the boundary layers could have some effect in the geodynamo process. We refer to [11] for more details on this model and the underlying physics. For ε > 0 fixed, the mathematical status of the system (1), (2), (3), (4), (5) is very similar to the one of the incompressible Navier-Stokes equation: there is a classical theory of local in time existence and uniqueness of strong solutions (for example in H s with s sufficiently large) that we shall use in this article and a theory of global weak solutions (without uniqueness in 3-d) analogous to Leray solutions of the incompressible Navier-Stokes equation, we refer to [24] for a survey on this topic. In this paper we shall focus on the study of the asymptotic behavior of the solution when ε goes to zero. In a formal way, when ε tends to zero, because of the stiff terms in (1), uε tends to be in the kernel of the operator Lu = P e × u + e × P (e × u) , where P is the Leray projection on divergence free vector fields. The kernel of this operator is described by the classical Taylor-Proudman theorem which gives that u is a two-dimensional vector field, u = (u1 (t, x1 , x2 , 0), u2 (t, x1 , x2 , 0), 0) and j = 0. Since a two-dimensional vector field cannot match the boundary conditions (4), a boundary layer which is of size ε (the Ekman-Hartmann boundary layer) appears in the vicinity of the boundary. This boundary layer which is well-known in physics was mathematically studied in [8, 22]: a matched asymptotic expansion using boundary layers was computed in [8] and its justification was performed in [8] under a smallness assumption and in [22] under a more general spectral assumption. Actually, the model considered in [8, 22] is slightly more general: there is an evolution equation for the magnetic field which replaces (2). Nevertheless, as described in [11], the model that we consider here is already physically relevant and moreover the stability problems only involve part (2) of the equation as it was proved in [22]. It seems possible to generalize the results of this paper to the more general model, but computations should be more complicated. The main restriction in the two works [8, 22] is that the problem was studied for well-prepared data; this means that it was assumed that at t = 0 the initial datum of (1), (2), (3) tends sufficiently quickly to a two-dimensional vector field when ε goes to zero. Here, we shall study (1), (2), (3) for ill-prepared data, i.e. with a data u0 in (5) which is an arbitrary three-dimensional vector field which is divergence free and verifies the boundary condition (4). The aim of this paper is to prove that the main result of [22] that is the convergence under the assumption of linear stability of the boundary layers of uε to a two-dimensional vector field solution of an Euler equation is still true in this general setting. More precisely, we assume that u0 is a three dimensional vector field which matches the boundary condition (4) and we define 1 1 0,wp 0,wp u1 = u01 (t, x1 , x2 , x3 ) dx3 , u2 = u02 (t, x1 , x2 , x3 ) dx3 . 0

0

int Note that u0,wp is divergence free. Let uint = (uint 1 (t, x1 , x2 ), u2 (t, x1 , x2 )) be the solution of the two-dimensional Euler equation with damping

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

∂t uint + uint · ∇uint + ∇p + β uint = 0, where

β=

2 , tan τ2

tan

225

∇ · uint = 0, t > 0, (x1 , x2 ) ∈ R2 , (6)

τ 1 . = √ 2 + 1 + 2

We recall that the damping term in this equation which was computed in [8] is due to energy dissipation in the boundary layers. We add to this equation the initial condition uint (0, x1 , x2 ) = u0,wp (x1 , x2 ).

(7)

Next, for each uint (t, x1 , x2 ) = q, we can define near each boundary x3 = 0, x3 = 1, an Ekman-Hartmann boundary layer ub (q, Z) = (ub1 , ub2 , 0), j b (q, Z) = (j1b , j2b , 0), where Z = x3 /ε or (1 − x3 )/ε . The profile ub is a solution of the ordinary differential equation −1 b b ∂ZZ u = Bu , B = 1 such that ub (q, 0) = −q, ub (q, +∞) = 0. We easily find that the solution of this problem is given by √ +iZ

ub1 + iub2 = −e−

(q1 + iq2 ).

Moreover, the current in the boundary layer is given by j1b + ij2b = −i(ub1 + iub2 ). To characterize the stability of the lower boundary layer, we can linearize (1), (2), (3) about (q + ub , j b ) and set (t , x1 , x2 , Z) = (t/ε, x1 /ε, x2 /ε, x3 /ε). Dropping the ’, and keeping only the 1/ε terms in the equations yield the evolution problem in R2 ×(0, +∞): ∂t v + (q + ub ) · ∇v + v3 ∂Z ub + ∇p + e × u + e × j = v, j = ∇ϕ − e × v, ∇ · v = 0, ∇ · j = 0. Note that, if we set j = curl b, we recover exactly the same system as in [22], Sect. 2. We add to this system the boundary condition u = 0, j3 = 0 for Z = 0, and we shall say that the boundary layer is linearly stable if this system does not have any solution which grows exponentially in time. By a normal mode analysis, we can look for solutions under the form (u, j ) = e(γ +iτ )t+iξ ·(x1 ,x2 ) (U (Z), J (Z)), γ ≥ 0, τ ∈ R, ξ ∈ R2 . This yields an ordinary differential equation with ζ = (γ , τ, ξ ) as parameters and by setting V = U3 , ∂Z2 U3 , iξ1 U2 − iξ2 U1 , J3 , ∂Z U3 , ∂Z3 U3 , ∂Z (iξ1 U2 − iξ2 U1 ), ∂Z J3 , we reduce the eigenvalue problem to the search of solutions of ∂Z V = A(Z, ζ, q)V ,

Z>0

such that V (0) = 0, V (+∞) = 0, where V = (V1 , V3 , V4 , V5 ) and

(8)

226

F. Rousset

A(Z, ζ, q) = 

0 4 I4 M A

0  −|ξ |2 P1 − iξ · ∂ 2 ub θ M=  i∂θ ub · ξ ⊥ 0

1 P2 0 0

, 0 0 P1 0

 0 0  , 0  |ξ |2

P1 = γ + iτ + i(q + ub ) · ξ + |ξ |2 , P2 = γ + iτ + i(q + ub ) · ξ + 2|ξ |2 + ,   0 0 0 0  0 0 1 0  A= . −1 0 0 −  0 0 −1 0 We recall from [22], that the space of solutions of ∂Z Z = AV which tend to zero as Z tends to +∞ has dimension four and is smooth for ξ = 0, γ ≥ 0. Hence by choosing a basis (V 1 , V 2 , V 3 , V 4 ) of this stable subspace, we can define an Evans function for this problem by D(ζ, q) = det V 1 (0, ζ, q), V 2 (0, ζ, q), V 3 (0, ζ, q), V 4 (0, ζ, q) . By definition, there is an unstable mode if and only if the Evans function vanishes for γ > 0. This kind of construction was very much used to characterize the stability of travelling waves in reaction-diffusion equations [1, 17] or viscous conservation laws [12]. It was also used in the setting of hydrodynamic stability to perform numerical computations [3]. Finally, we point out that as in [22], the same Evans function characterizes the stability of the upper boundary layer. With the help of this Evans function, we can state our main theorem. It just remains to define the anisotropic Sobolev space H m,0 () = {u ∈ L2 (), ∇xm1 ,x2 u ∈ L2 ()}. Theorem 1. Let u0 be a divergence free vector field such that u0/∂ = 0 and that u0 , ∇u0 ∈ H m+s,0 (), and ∇ 2 u0 ∈ H m,0 () for m ≥ 2 and s > 6. Let uint be the solution of (6), (7), assuming that D(ζ, uint (t, x1 , x2 )) = 0, ∀ζ = (γ , τ, ξ ), γ ≥ 0, ξ = 0

(H) for every t ∈

[0, T ∗ ], (x1 , x2 ) ||u − u ε

+ε

int

−1 2

∈

(9)

R2 , then the solution of (1), (2), (3), (4), (5) is such that

||L∞ ((0,T ∗ ),L2 ()) + ||uε − uint ||L2 ((0,T ∗ )×)) loc

||j ε ||L2 ((0,T ∗ )×) → 0

(10)

when ε goes to zero. This theorem says that we have a nonlinear stability result as long as the boundary layer remains linearly stable which is the meaning of the Assumption (H). The method that we use needs a lot of regularity for u0 . Nevertheless, the regularity that we require here is not too surprising when we compare it to other results (see [21] for example). As it was proved in [8] by direct L2 energy estimates, this assumption is matched when a Reynolds number R = sup ||uint ||L∞ t

is sufficiently small (R 5). More generally, it can be checked numerically (see [9]) that Assumption (H) is still matched for larger Reynolds number R 40. When the

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

227

Reynolds number becomes larger than a critical value Rc ∼ 40 the Ekman-Hartmann boundary layer becomes linearly unstable [9] and it was proved in [10] that it also implies nonlinear instability. Consequently, the combination of Theorem 1 and the result of [10] gives a complete description of the problem: there is linear and nonlinear stability for general data when R < Rc and linear and nonlinear instability for R > Rc . There are basically two parts in the proof of Theorem 1. At first, though it does not appear in the statement of the theorem, we need to generalize the construction of an approximate solution to absorb the part of the initial data which depends on the vertical variable: this new approximate solution will be made of various waves, initial layers and boundary layers. In the second part, we have to prove that the interaction of these new waves with the main Ekman-Hartmann boundary layer does not affect its stability, and hence that we can still get from our Assumption (H) a good energy estimate as in [22]. In the case of rotating fluid (that we recover by taking formally = 0), the vertical part of the initial data creates waves, called inertial waves, the situation was mathematically investigated in [6] for an anisotropic viscosity where ε is replaced by ε∂x23 x3 + H u. The crucial part in the convergence proof in [6] was the study of the dispersion of the inertial waves. Here, we shall see that the inertial waves are damped by magnetic effect; hence the situation is less subtle, nevertheless, we have to take into account the possible instabilities in the boundary layer. This last phenomenon is not present in [6] since the Ekman layer is always stable in the case of anisotropic viscosities. In our setting, for boundary layers of moderate amplitude which verify our Assumption (H) but not the smallness assumption of [8], the L2 norm is not decreasing any more and hence for the stability part, the method of [8] does not apply. To get a result under the sharp Assumption (H), we shall use the microlocal analysis of [22, 23] which was inspired from [21]. In this part of the proof, the fact that the inertial waves are damped is crucial; our analysis does not readily extend to the Ekman case ( = 0). For related works in the periodic case, we refer to [5, 4, 20]. The first part of the paper is devoted to the construction of an approximate solution U app for (1), (2), (3), (5), (4). This means that U app will match exactly the divergence free condition (3) and the boundary condition (4) and that U app makes an error which is O(εM ) with M sufficiently large when we plug it into Eqs. (1), (2). Our approximate solution is basically made of three pieces. The first part U wp is just the approximate solution which corresponds to the two-dimensional part of the initial data and that was already used in [8, 13]. Next, we add a second part U ip in order to lift the remaining part of the initial data. In this part, we shall study precisely the damping of the inertial waves. Each wave will create a boundary layer of size ε. At this stage, we have an approximate solution such that √ ||u0 − (uwp + uip )/t=0 ||L2 ε. Actually, the right-hand side comes from the L2 norm of the boundary layers which do not vanish at t = 0. This estimate is not sufficient to close a nonlinear stability argument. Actually, when we deal with large linearly stable boundary layers as in Theorem 1, we cannot obtain an L2 energy estimate by the standard energy method. We still get an L2 energy estimate on the linearized problem thanks to (H) through a microlocal analysis. Due to this method we cannot work in the class of Leray’s weak solutions of (1), the existence and uniqueness of a solution of the nonlinear problem in an interval of time independent of ε come from a fixed point argument as in [18]. To realize this program, app we need that ||u0 − u/t=0 || = O(εk ) with k > 21 . Consequently, we need to add a third part in the approximate solution to cancel the boundary layers at t = 0; this part U tl will

228

F. Rousset

be made of initial layers. The crucial point is that the special structure of our system (1), (2), (3) gives for utl a linear equation which is just a heat equation with damping. The second part of the paper is devoted to the study of the linearization of (1), (2), (3) about the approximate solution U app . The main result of this part is that despite the presence of the new waves and boundary layers, Assumption (H) on the main boundary layer still allows to get the same type of energy estimate as in [22]. Actually, we need to derive a better energy estimate to deal with the nonlinear problem: here the use of time derivatives is not appropriate in this setting for non-zero initial data because of the initial layers. In the last part, we give the proof of Theorem 1 by a fixed point argument. Notations. At first, we set U = (u, j, p, ϕ) and we define the operators corresponding to (1), (2): e×u e×j ∇p + + − εu, ε ε ε M(U ) = j − (∇ϕ − e × u).

N S(U ) = ∂t u + u · ∇u +

Throughout this paper, we use the notation ˜ for horizontal operators and quantities: ˜ = ∂1 f , u ˜ = ∂12 u + ∂22 u, u˜ = u1 . ∇f ∂2 f u2 Moreover, we shall often use for the variables the notations z = x3 and y = (x1 , x2 ). Next, we need to define some norms. For u, v ∈ R3 , |u|2 and u · v are the standard euclidean norm and scalar product. We also set

|∂yα u(t, y, x3 )|2 dy, |u|2m = |u(t, ·, x3 )|2H m (R2 ) = 2 |α|≤m R

||u||2m = ||u||H m,0 ()2 =

|α|≤m

|∂yα u(t, x)|2 dx.

The norm || · ||0 will be denoted by || · || and the associated scalar product will be denoted by (·, ·). Note that the norm || · ||m is not the usual norm of H m (), we do not require any regularity with respect to the z variable. Moreover, we deal throughout the paper with m such that m > 1 so that H m (R2 ) is an algebra. In a similar way, we define

|u||m,p = ||u||W m,p,0 () = ||∂yα u||Lp () , for p ∈ [1, +∞]. |α|≤m

As previously, for p = 2, we omit the index p. Actually, we only need the cases p = 1 and p = +∞. We shall also use weighted norms; we set for a parameter γ ≥ 1 which will be chosen sufficiently large (and independent of ε)

α ||u||2m,γ = ||Zαm u||2 , Zm u = γ m−|α| ∂yα u |α|≤m

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

and |||u|||2m,γ =

|α|≤m

|||Zαm u|||2 , |||u|||2 =

+∞

229

||u(t)||2 dt.

0

The notation || · ||m,p , || · ||m,γ may seem a little confusing, nevertheless, we will always use the letter p or p = 1, ∞ for the W m,p,0 norm and the letter γ for the weighted norm. Throughout the paper , C, O(1) stand for harmless numbers which are independent of ε and γ ≥ 1. We shall sometimes need to make precise the dependence of these numbers with respect to the regularity of the initial value u0 : we will use the notations Cm for a number which only depends on ||u0 ||m and Cm,β for a number which only depends on k≤β ||∂zk u0 ||m := ||u0 ||H m,β . Finally a(τ ) stands for a nonnegative function such that +∞ a(τ ) dτ < +∞ 0

which may change from line to line. 1. Construction of an Approximate Solution The aim of this section is to construct an approximate solution U a = (ua , pa , j a , ϕ a ) of (1, 2, 3, 4, 5). This means that U a matches exactly the boundary condition (4) and the divergence free condition (3) and is an approximate solution of (1), (2), in the sense that N S(U a ) = R1ε , M(U a ) = R2ε , where R1ε and R2ε goes to zero when ε goes to zero (we will give a precise statement later). As in [6], we can use a special basis of L2 (0, 1) to decompose the initial value under the form 0 0,k u1 u1 = cos(kπ z), (11) u02 u0,k 2 k≥0

and since

u0

must be divergence free, we have the corresponding decomposition

1 ∇˜ · u˜ 0,k sin(kπ z). (12) u03 = − kπ k≥1

Thanks to (11), (12), we can set 0,k  0,0    u1 u1 cos kπ z .  and u0,ip =  k≥1 u0,k u0,wp =  u0,0 2 2 1 0,k 0 − k≥1 kπ ∇˜ · u˜ sin kπ z We consider three different parts in the construction of the approximate solution. The first part corresponds to u0,wp in the decomposition of u0 , hence this part is associated to a two dimensional initial value, this is the well prepared part of the solution since the high rotation forces the fluid to be invariant in the direction of the rotation axis and hence to be two-dimensional. This part of the approximate solution was already built in [8, 13]: it is made of an interior part which is two dimensional and a boundary layer part which is used to match the boundary conditions.

230

F. Rousset

Next we consider a second part in the approximate solution, this part is associated to the remaining components in the decomposition of the initial data; this part is made of waves and boundary layers. The waves are oscillating at high frequency in time and are damped by magnetic effect. This is the main difference with the pure Ekman case studied in [6] where the waves are not damped; they tend to zero due to dispersion effects. Finally the third part of the approximate solution is made of an initial layer: this part is used to make the approximate solution sufficiently close to u0 at t = 0 to apply our nonlinear argument as explained in the introdution. 1.1. The well-prepared part. In this section, we recall the construction of [8, 13] Proposition 2 ([8, 13]). There exists U wp under the form U wp (t, x) =

M

k=0

z 1 − z ε k U wp,k t, x, , ε ε

(13)

such that U wp matches exactly the boundary condition (4) and the divergence free condition (3). Moreover, we have N S(U wp ) = R11 , M(U wp ) = R21 , where for every m ≥ 0, T ε ||R11 ||2m dt + 0

T 0

||R11 ||m dt

2

+ ε −1

T 0

||R21 ||2m dt ≤ Cm+2 ε 2 .

Moreover, at t = 0, we have wp u (0, x) − u0,wp ≤ C e−αx/ε + e−α(1−x)/ε + r 1 ,

(14)

(15)

||r 1 ||2m + ε 2 ||∇r 1 ||2m + ε 4 ||∇ 2 r 1 ||2m ≤ Cm+2 ε 2 . Proof. We shall not give the complete proof of this lemma, but for the sake of clarity and since it will be useful later, we describe more precisely the approximate solution U wp . In (87) each term is under the form z 1 − z z 1−z U wp,k t, x, , = U int,k (t, x)+ Uˇ wp,k (t, x1 , x2 , )+ Uˆ wp,k (t, x1 , x2 , ), ε ε ε ε where the boundary layer terms Uˆ 1,k , Uˇ 1,k are exponentially decreasing with respect to Z = z/ε or (1 − z)/ε: there exists α, C > 0 such that |∂Zα Uˇ wp,k (t, x1 , x2 , Z)| + |∂Zα Uˆ wp,k (t, x1 , x2 , Z)| ≤ Ce−αZ , ∀t, x, Z.

(16)

Moreover, uint,0 = (uint,0 (t, x1 , x2 ), uint,0 (t, x1 , x2 ), 0), j int,0 = 0 and uint,0 = uint 1 2 is the solution of the two dimensional Euler equation (6) with the initial condition (7). The other important term in (87) is the leading term of the boundary layer terms. We have pˇ 1,0 = ϕˇ 1,0 = ˇ31,0 = uˇ 1,0 3 = 0.

(17)

1,0 ˇ 1,0 ˜ = (uˇ 1,0 , uˇ 2,0 ) The remaining components are such that ˇ11,0 = uˇ 1,0 2 , ˇ2 = −u 1 and u is a solution of the linear ordinary differential equation

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

∂θ2 u˜ = B u, ˜ B=

−1 1

231

(18)

with the constraints u(+∞) ˜ = 0, u˜ /θ=0 = −uint,0 . It is easy to show that this ordinary differential equation has a unique solution which moreover decreases at an exponential rate. We have just given the form of the leading term of the approximate solutions U wp , as usual in WKB type expansions the next terms U wp,k are solutions of linearized equations about the leading order solution U wp,0 with a source term which depends only on (U wp,l )l≤k−1 . In particular, in our case, we have the same equations for the boundary layer terms because the leading order boundary layer equations are linear and we have a linearized Euler equation for the interior part uint,k which has the form ∂t uint,k + uint,0 · ∇uint,k + uint,k · ∇uint,0 + βuint,k + ∇p = Fk , wp,0

= uˆ 3

wp,0

εe

−αZ

A useful remark is that since uˇ 3 |ub3 |m

∇ · uint,k = 0.

= 0, we have the improved estimates

, |∂z u3,b |, 1.

(19)

Finally, (15) just comes from the fact that the difference between the approximate solution and u0,wp is made of the leading boundary layer term plus correction terms which are at least of order ε. As explained in the introduction, the property (15) is not good enough for our nonlinear argument; we have to add an additional corrector which is an initial layer to cancel the boundary layers in the approximate solution at the initial time t = 0. 1.2. Additional initial layer. In this section, we consider an initial layer under the form t z t 1−z U tl = Uˇ tl,0 ( , x1 , x2 , ) + Uˆ tl,0 ( , x1 , x2 , ) + ε··· , ε ε ε ε and we look for functions which are fastly decreasing with respect to the variables τ = t/ε and Z = z/ε or (1 − z)/ε. Moreover, we want that the initial layer corrects the term created by the boundary layer at t = 0: wp,0

wp,0

uˇ tl,0 ˇ /t=0 , uˆ tl,0 ˆ /t=0 . /t=0 = −u /t=0 = −u We will also require that the initial layer matches exactly the boundary condition (4). By plugging this ansatz in the equation, we first find as usual for boundary layer terms that tl,0 ptl,0 = ϕ tl,0 = utl,0 =0 3 = j3

and then, we get that

(uˇ tl,0 ˇ tl,0 1 ,u 2 )

and

(uˆ tl,0 ˆ tl,0 1 ,u 2 )

(20)

are solutions of

∂τ v − ∂ZZ v + Bv = 0

(21)

v(τ, 0) = 0,

(22)

v(0, Z) = v0 (Z).

(23)

with the boundary condition and the initial condition Here, B is the two by two matrix defined in (18) and v0 (Z) stands for −uˇ wp,0 (0, x1 , x2 , Z) or −uˆ wp,0 (0, x1 , x2 , Z) which are both exponentially decreasing with respect to Z.

232

F. Rousset

Moreover, the current in the initial layer is given by tl,0 = utl,0 j2tl,0 = −utl,0 1 , j1 2 .

(24)

Finally, note that the variables (x1 , x2 ) are parameters in this problem they just appear through the dependence of the initial condition with respect to them. We also point out that the initial condition for (21) does not verify the boundary condition (22), hence the solution of this parabolic equation will have a singularity at t = 0, z = 0 that we shall need to consider carefully in the future estimates. The aim of the next lemma is to study the properties of the solutions of the system (21), (22), (23). Lemma 3. Let v be a solution of ∂τ v − ∂ZZ v + Bv = F, Z > 0, τ > 0

(25)

with the boundary condition (22) and the initial value f ; we assume that the source term and the initial value are such that |F (τ, ·, Z)|m + |∂Z F (τ, ·, Z)|m ≤ Km e−αZ a(τ ), |f (·, Z)|m + |∂Z f (·, Z)|m ≤ Km e−αZ ,

(26) (27)

for some α > 0. Then we have for every p ∈ [1, +∞], the estimates ||v(τ )||m,p Km ,

(28)

||v(τ )||m,p + ||v(τ )||2m,p + ||∂Z v(τ )||m,p + ||∂Z v(τ )||2m Km a(τ ), ε−1 |∂Zk v(τ, ·, ε−1 )|m + ε −2 |∂Zk v(τ, ·, ε−1 )|2m Km a(τ ), k ≤ 2,

(29)

1 2

||Z ∂Z v||m Km .

(30) (31)

The proof of this lemma which only relies on elementary convolution estimates on the explicit formula for the solution of (25) which is just a heat equation with damping is postponed to Appendix A. Thanks to this lemma, we are now able to give the precise construction of our initial layer corrector; we take U tl = U tl,0 + ε(U tl,1 + U tl,2 ), where U tl is given by (20), (21), (22), (23), (24). Since the initial condition for (21) is a boundary layer, the assumption (26), (27) of Lemma 3 is verified (note that in this case, there is no source term) and hence the estimates of Lemma 3 hold for U tl,0 . As usual, the two correctors, U tl,1 , U tl,2 are needed to recover exactly the divergence free condition (3), and the boundary condition (4). To recover the divergence free condition, we first choose +∞ uˇ tl,1 (τ, x , x , Z) = (∂1 uˇ tl,0 ˇ tl,0 1 2 3 1 + ∂2 u 2 )(τ, x1 , x2 , θ ), dθ, Z +∞ (∂1 ˇ1tl,0 + ∂2 ˇ2tl,0 )(τ, x1 , x2 , θ ), dθ, ˇ3tl,1 (τ, x1 , x2 , Z) = Z +∞ tl,1 (∂1 uˆ tl,0 ˆ tl,0 uˆ 3 (τ, x1 , x2 , Z) = − 1 + ∂2 u 2 )(τ, x1 , x2 , θ ), dθ, Z +∞ (∂1 ˆ1tl,0 + ∂2 ˆ2tl,0 )(τ, x1 , x2 , θ ), dθ. ˆ3tl,1 (τ, x1 , x2 , Z) = − Z

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

233

Note that thanks to Lemma 3, the corrector U tl,1 enjoys the estimates ||U tl,1 ||m,∞ + ||∂Z U tl,1 ||m,∞ ≤ Cm+1 ,

(32)

||U ||m,∞ + ||U tl,1 ||2m,∞ ≤ Cm+1 a(τ ), ||∂Z U tl,1 ||m,∞ + ||∂Z U tl,1 ||2m,∞ ≤ Cm+1 a(τ ), ||∂Zk U tl,1 ||m + ||∂Zk U tl,1 ||2m ≤ Cm+1 a(τ ), ∀k ≤

(33)

tl,1

(34) 2.

(35)

Moreover, note that thanks to (21), we have for example +∞ tl,1 ∂1 (−B uˇ tl,0 )1 + ∂2 (−B uˇ tl,0 )2 dθ − ∂1Z uˇ tl,0 ˇ tl,0 ∂τ uˇ 3 = 1 − ∂2Z u 2 , Z

hence thanks to Lemma 3, we get the estimate ||∂τ U tl,1 ||m + ||∂τ U tl,1 ||2m ≤ Cm+1 a(τ ).

(36)

Finally, the second corrector U tl,2 (τ, x1 , x2 , z) will be used to recover the exact boundary condition. Consequently, we choose it as in [13] such that in : ∇ · utl,2 = 0, ∇ · j tl,2 = 0, and with the boundary conditions utl,2 (τ, x1 , x2 , 0) = −ε−1 uˆ tl,0 (τ, x1 , x2 , ε−1 )− uˇ tl,1 (τ, x1 , x2 , 0)− uˆ tl,1 (τ, x1 , x2 , ε−1 ), utl,2 (τ, x1 , x2 , 1) = −ε−1 uˇ tl,0 (τ, x1 , x2 , ε−1 )− uˇ tl,1 (τ, x1 , x2 , ε−1 )− uˆ tl,1 (τ, x1 , x2 , 0), j3tl,2 (τ, x1 , x2 , 0) = −ε −1 ˆ3tl,0 (τ, x1 , x2 , ε−1 )−ˇ3tl,1 (τ, x1 , x2 , 0)−ˆ3tl,1 (τ, x1 , x2 , ε−1 ), j3tl,2 (τ, x1 , x2 , 1) = −ε −1 ˆ3tl,0 (τ, x1 , x2 , ε−1 )−ˇ3tl,1 (τ, x1 , x2 , ε−1 )−ˆ3tl,1 (τ, x1 , x2 , 0). Note that thanks to (31), the terms like ε−1 uˆ tl,0 (τ, x1 , x2 , ε−1 ) are uniformly bounded in ε and integrable with respect to τ . By using the same technique as in [13] and thanks to the estimates (29), (30) and (33), we find that the solution of the previous system enjoys the estimates: ||U tl,2 ||m,∞ + ||∂z U tl,2 ||m,∞ ≤ Cm+1 , ||U ||m,∞ + ||U tl,2 ||2m,∞ + ||∂z U tl,2 ||m,∞ + ||∂z U tl,2 ||2m,∞ ||∂zk U tl,2 ||m + ||∂zk U tl,2 ||2m ≤ Cm+1 a(τ ), ∀k ≤ 2. tl,2

(37) ≤ Cm+1 a(τ ),

(38) (39)

Moreover, note that thanks to (30), Eq. (25) also gives that ε−1 |∂τ v(τ, ·, ε−1 )|m + ε −2 |∂τ v(τ, ·, ε−1 )|2m Km a(τ ), where v stands for uˆ tl,0 or uˇ tl,0 and hence this implies that ||∂τ utl,2 ||m + ||∂τ utl,2 ||2m ≤ Cm+1 a(τ ).

(40)

We are now able to improve the statement of Proposition 2. Lemma 4. Let U = U wp + U tl , then U verifies exactly the boundary condition (4), and the divergence free condition (3); moreover, we have

234

F. Rousset

N S(U ) = R 1 , M(U ) = R 2 , where ||R 1 ||m + ||R 1 ||2m + ε −2 ||R 2 ||2m Cm+2 (ε + a(τ )), and at t = 0, let

r = U /t=0 −

u˜ 00 0

(41)

,

then ||r||2m + ε 2 ||∇r||2m + ε 4 ||∇ 2 r||2m ≤ Cm+2 ε 2 .

(42)

The main improvement in the estimates of Lemma 4 with respect to Lemma 2 is that (15) is changed in (42) which will be sufficient for our nonlinear argument. Proof. We begin with the estimate of ||R 2 ||m which is the easiest one. By definition, we have R 2 = R21 + ε(j tl,1 + j tl,2 − e × (utl,1 + utl,2 )), and hence the estimate (41) for R 2 follows from (14) and (35). Next, we note that R 1 = E1 + E2 + E3 , where 1

E1 = R 1 + (∂τ utl,1 (t/ε) + ∂τ utl,2 (t/ε)) + e × utl,2 ˜ tl , −ε2 (utl,1 + utl,2 ) − ε u

(43)

E2 = uwp · ∇utl + utl · ∇uwp , E3 = utl · ∇utl .

(44) (45)

The estimate (41) for E1 follows from (14), Lemma 3 and (32–36), (37–40). The fact ˜ tl . that this estimate depends on Cm+2 comes from the term u ∞ To estimate the second line, we use L estimates for uwp , (19) and the fact that wp utl3 = 0, u3 = 0 to get ˜ tl ||m + ε||∂z utl ||m , ||E2 ||m ||utl ||m + ||∇u and hence, thanks to (29), (35), (39), we get the desired estimate. Finally, for E3 , we use (28), (32) and (37), to get ˜ tl ||m + ε||∂z utl ||m , ||E3 ||m ∇u which gives the wanted estimate thanks to (29), (35), (39). 1.3. The ill-prepared part. We construct in this section an approximate solution U ip for the linear problem e×j ∇p e×u ∂t u + + + − εu = 0, (46) ε ε ε j = ∇ϕ − e × u, (47) ∇ · u = 0, ∇ · j = 0, (48) in with the boundary condition (4) such that at t = 0, uip is sufficiently close to the part of the initial condition that is not absorbed in the construction of the previous

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

235

paragraph: this means that we shall require: ip

||u/t=0 − u0,ip ||2m ε2 . For this construction, the first step is to get a good spectral representation of the operator Lu = P e × u + e × (P (e × u)) , where P is the Leray projection on divergence free vector fields or equivalently, to understand the behaviour of the solutions of the linear equation ∂t u +

Lu =0 ε

in R2 × (0, 1) with the boundary condition u3/∂ = 0. As in [6], the first step is to reformulate (46), (47), (48). Let us denote by F the Fourier transform with respect to horizontal variables (x1 , x2 ) and by ξ the variable dual to (x1 , x2 ), we define ω = F(∂1 u2 − ∂2 u1 ), d = F(∂1 u1 + ∂2 u2 ), w = F(u3 ), η = F(∂1 j2 − ∂2 j1 ), c = F(∂1 j1 + ∂2 j2 ), J = F(j3 ). And we rewrite the system with new unknowns (ω, d, w, η, c, J ) (in this section with an abuse of notation we still denote by p and ϕ the Fourier transform of p and ϕ). The new system becomes ω d + = |ξ |2 ϕ, ε ε ε d ω |ξ |2 p ∂t d + − = , ε ε ε −∂z p , ∂t w = ε c = −|ξ |2 ϕ + ω, η = −d, J = ∂z ϕ, ∂z w = −d, ∂z J = −c.

∂t ω +

(49) (50) (51) (52) (53)

Following the method of [6], we can use the basis (cos kπ z)k≥0 of L2 (0, 1) to decompose v = (ω, d) as

v= vk (t) cos kπ z, k≥1

where vk = (ωk , dk ). Thanks to the divergence free condition, w must be under the form

w=− (kπ )−1 dk (t) sin kπ z. k≥1

As usual, the divergence free conditions (53) determine the pressure and the magnetic potential ((kπ)2 + |ξ |2 )ϕk = ωk , ((kπ )2 + |ξ |2 )pk = dk − ωk . Next we get thanks to (49), (50), ∂ t vk =

Dk vk , ε

236

F. Rousset

where D = k

−αk2 −1 αk2 −αk2

, αk =

(kπ )2 (kπ )2 + |ξ |2

21 .

The resolution of this system gives vk (t) = Lk (t/ε)v k (0), where 2 cos(αk τ ) sin(αk τ )/αk Lk (τ ) = exp(τ D k ) = e−αk τ . −αk sin(αk τ ) cos(αk τ ) Note that when = 0, we have the same expression as in [6]. Here, when is positive, the oscillatory waves created by the high rotation are damped by the high magnetic field, hence the situation is very different and less subtle. The aim of the following lemma is to study precisely this damping effect: Lemma 5. Let (u, j ) be a solution of the linear equation ∂τ u + e × u + e × j + ∇p = 0, j = ∇ϕ − e × u, ∇ · u = 0, ∇ · j = 0,

(54) (55) (56)

in with the boundary condition u3/∂ = j3/∂ = 0

(57)

and an initial condition u0 such that ∇ · u0 = 0 and u0/∂ = 0. Then, we have for s > 4, the following estimates: ||∂zβ u(τ )||m ≤ C(||u0 ||m+s+β + ||u0 ||H m,β ), ||∂zβ u(τ )||m

≤ C(||∂z u ||m+s+β + ||u ||H m,β )a(τ ), 0

0

||u(τ )||m,∞ ≤ C||∂z u ||m+s a(τ ). 0

(58) (59) (60)

Remark 6. Note that by obvious interpolation between (58) and (59), we also have ∞ ||∂zβ u(τ )||2m dτ ≤ C(||∂z u0 ||2m+s+β + ||u0 ||2H m,β ). (61) 0

Proof. At first, we prove (58), (59) for β = 0. In this case, we use the explicit formula for the solution of (54), (55), (56). We have, thanks to the same notations as previously

v(τ, ξ, z) = Lk (τ )vk0 cos(kπ z), w(τ, ξ, z) = − (kπ )−1 (Lk (τ )vk0 )2 sin(kπ z). k≥1

k≥1

(62) Thanks to the Bessel identity, we shall get ||Fξ−1 v(τ, ·, ·)||2m + ||Fξ−1 w(τ, ·, ·)||2m + ||Fξ−1 ∂z w(τ, ·, ·)||2m ≤ C||u0 ||2m+1 . (63) Indeed, since ∀τ ≥ 0, ||Lk (τ )|| ≤ Ce−αk τ (1 + |ξ |) ≤ C(1 + |ξ |), 2

we have

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

||v(τ, ·, ·)||2m =

k≥1 ||v 0 ||2m+1

≤

|Lk (τ )vk0 |2m ≤ ≤

2 k≥1 R C||u0 ||2m+2 .

237

(1 + |ξ |2 )(1 + |ξ |2 )m |vk0 |2 dξ

In a similar way, we get the estimates for w and ∂z w. We deduce (58) from (63) and classical estimates for the inversion of the Laplacian in R2 since ˜ 1 = F −1 (−iξ2 ω + iξ1 ∂z u3 ). ˜ 2 = F −1 (iξ1 ω − iξ2 ∂z u3 ), u u ξ ξ

(64)

Next, we prove (59) for β = 0. Using (62), we rewrite

vk (τ, ξ ) cos(kπ z) v(τ, ξ, z) =

(65)

k≥1

and we note that vk (τ, ξ, z) is a linear combination of terms under the form 0 (ξ ), Ik (τ, ξ ) = e−αk τ ak (ξ )vk,i 2

(66)

0 stands for a component of v 0 (ξ ). where |ak (ξ )| ≤ C(1 + |ξ |) and vk,i k +∞ We want to estimate 0 |Ik (τ, ·)|L2 dτ . Towards this, we use that ξ

+∞ 0

|Ik (τ, ·)|L2 dτ = ξ

+∞

sup

(Ik (τ, ·), ϕ(τ, ·))L2 dτ

(67)

ξ

|ϕ(τ,·)|L2 ≤1 0 ξ

and we perform the decomposition +∞

|(Ik (τ, ·), ϕ(τ, ·))L2 | dτ ≤ ξ

0

+∞

l≥−1 0

|(Ikl (τ, ·), ϕ(τ, ·))L2 | dτ,

(68)

ξ

where Ikl (τ, ξ ) = ϕ l (ξ )Ik (ξ ) and ϕ l = ϕ(ξ/2l ) is the usual dyadic partition of unity of the Littlewood-Paley decomposition (see [2] or [7] for example). Now, by the CauchySchwarz inequality, we get that +∞ +∞ 2 l 0 |(Ik (τ, ·), ϕ(τ, ·))L2 | dτ |e−αk τ ak (ξ )ϕ l (ξ )vk,i (ξ )ϕ(τ, ξ )| dξ dτ 0

0

+∞ 0 |vk,i ϕ l |L2 ξ

ξ

0

ξ ∈supp(ϕ l )

ξ

1 2 2 e−2αk τ |ak (ξ )|2 |ϕ(τ, ξ )|2 dξ dτ.

Now, since on the support of ϕ l , we have |ξ | 2l , we get that e−αk τ e−αk (2 )τ , |ak (ξ )|2 22l , 2

2

l

and we find +∞ 0 |(Ikl (τ, ·), ϕ(τ, ·))L2 | dτ 2l |vk,i ϕ l |L2 sup(|ϕ(τ, ·)|L2 ) 0

ξ

2

3l

ξ τ 0 l |vk,i ϕ |L2 . ξ

ξ

0

+∞

e−αk (2 )τ dτ 2

l

238

F. Rousset

Thanks to (68), this yields for s > 3, +∞

0 (Ik (τ, ·), ϕ(τ, ·))L2 dτ 23l |vk,i ϕ l |L2 ξ

0

ξ

l≥−1

1 2

0 22sl |vk,i ϕ l |2L2

l≥−1 0 |vk,i |s .

ξ

2(6−2s)l

1 2

l≥−1

In the last step, we have used the classical characterization of Sobolev spaces by dyadic decomposition (again see [2] or [7] for example. Consequently, going back to (67), we have proven that +∞ 0 |Ik (τ, ·)|L2 dτ |vk,i |s . ξ

0

Consequently, we easily get thanks to (65) by a combination of the last estimate and the Sobolev embedding that for s > 4, +∞ +∞

−1 ||Fξ v(τ )||m,∞ + ||Fξ−1 v(τ )||m ≤ C ||Fξ−1 vk0 ||m+s . 0

0

Moreover, since

u0/∂

k≥1

= 0, we have thanks to an integration by parts

||Fξ−1 vk0 ||m+s ≤ C(kπ )−1 ||Fξ−1 (∂z v 0 )k ||m+s and hence the Cauchy Schwarz inequality gives +∞ +∞ −1 ||Fξ v(τ )||m,∞ + ||Fξ−1 v(τ )||m ≤ C||Fξ−1 ∂z v 0 ||m+s Cm+s,1 . (69) 0

0

By the same technique, we can prove thanks to (62) that the same inequality as (69) holds for w and ∂z w with Cm+s,1 replaced by Cm+s+1,1 . At last, we get (59) for β = 0 by using again (64). It remains to prove (58), (59) for β > 0. Note that it seems difficult to use directly the representation (62) since the series obtained by taking the derivative with respect to z of each term of (62) are not convergent any more. To overcome this difficulty, we shall make direct energy estimates on the partial differential equations (54), (55), (56). At first, we notice that the divergence free relations (56) determine the pressure and the magnetic potential as ϕ = ∇ · (e × u) = −(∂1 u2 − ∂2 u1 ), p = −∇ · e × u + e × j = ∂1 u2 − ∂2 u1 + (∂1 j2 − ∂2 j1 ) .

(70) (71)

Consequently, by classical elliptic regularity, we get for β ≥ 1 the estimates ||∇∂zβ ϕ||m ||∂zβ−1 u||m+1 , ||∂zβ−1 ϕ||m+2 ||∂zβ−1 u||m+1 ,

(72)

||∇∂zβ p||m ||∂zβ−1 u||m+1 + ||∂zβ−1 j ||m+1 .

(73)

Note that thanks to (55) we also have ||∂zβ j || ||∇∂zβ ϕ||m + ||∂zβ u||m , hence the combination of (73) and (74), (72) gives

(74)

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

239

||∇∂zβ p||m ||∂zβ−1 u||m+1 .

(75)

Next, by a standard energy estimate on (54), we have 1 d ||∂ β u(τ )||2m + (e × ∂zβ j, ∂zβ u)m + (∇∂zβ p, ∂zβ u)m = 0, 2 dτ z and hence thanks to (55), we get 1 d ˜ 2m (||∇∂zβ p||m + ||∇∂zβ ϕ||m )||∂zβ u||m . ||∂ β u(τ )||2m + ||∂zβ u|| 2 dτ z Since by the divergence free condition, we also have ||∂zβ u3 ||2m ||∂zβ−1 u|| ˜ m+1 ||∂zβ u3 ||m , we finally get 1 d ||∂ β u(τ )||2m + ||∂zβ u||2m (||∂zβ−1 u|| ˜ m+1 + ||∇∂zβ p||m + ||∇∂zβ ϕ||m )||∂zβ u||m . 2 dτ z Consequently, the use of (72) and (75) and an integration in time yield τ τ β β ||∂z u(τ )||m + ||∂z u(s)||m ds ≤ C ||∂zβ−1 u(s)||m+1 ds + ||∂zβ u(0)||m , (76) 0

0

where C is independent of τ . Since we have already proven (58), (59) for β = 0, we get the estimates in the general case thanks to (76) by induction. Thanks to Lemma 5, we can now give the construction of the approximate solution which corresponds to the ill-prepared part of the initial data. Lemma 7. There exists an approximate solution U ip of (1), (2), (3) under the form U ip = U int,ip (t/ε, x) + Uˇ ip (t/ε, y, z/ε) + Uˆ ip (t/ε, y, (1 − z)/ε) +εU c (t, t/ε, x, z/ε, (1 − z)/ε), where U int,ip is the solution of the linear equation (54), (55), (56) given by Lemma 5, Uˇ ip , and Uˆ ip are boundary layer terms and U c is an higher order correction term. This approximate solution matches exactly the divergence free conditions (3) and the boundary condition (4), and we have ip

ip

N S(U ip ) = R1 , M(uip ) = R2 , with the estimates ||R1 ||m + ||R1 (τ )||2m + ε −2 ||R2 ||2m ≤ (Cm+s+2,1 + Cm,2 )(ε + a(τ )). (77) ip

ip

ip

Moreover, at t = 0, we have ip

ip

ip

||u/t=0 − u0,ip ||2m + ε 2 ||∇(u/t=0 − u0,ip )||2m + ε 4 ||∇ 2 (u/t=0 − u0,ip )||2m ≤ (Cm+s+2,1 + Cm,2 )ε 2 .

(78)

Finally, the boundary terms Uˇ ip and Uˆ ip satisfy the estimates (28), (29), (30), (31) of Lemma 3 and the corrector term U c verifies the estimates (33), (34), (35).

240

F. Rousset

Proof. We first choose U int,ip as the solution of (54), (55), (56) with the boundary condition (57) given by Lemma 5. Since it does not match the full boundary condition (4), we must add the boundary layers Uˇ ip , Uˆ ip . As usual for boundary layers, we find p = ϕ = u3 = j3 = 0, the remaining components (u1 , u2 ) = v are solutions of ∂τ v − ∂ZZ v + Bv = 0, τ > 0, Z > 0 with the prescribed boundary condition int,ip

int,ip

vˇ/Z=0 = −(u1/z=0 , u2/z=0 ) for the boundary layer corresponding to the lower boundary and int,ip

int,ip

vˆ/Z=0 = −(u1/z=1 , u2/z=1 ) for the boundary layer corresponding to the upper one. To get an approximate solution sufficiently accurate in order to get the estimate (78) at t = 0, we choose the initial condition v /t=0 = 0. Again, note that the initial condition does not match the boundary condition, hence the boundary layers are not smooth in the close domain (τ ≥ 0, Z ≥ 0). Since we are studying an initial boundary value problem with an inhomogeneous boundary condition, it is more convenient to change it into an homogeneous one. We set v = v − v(τ, y, 0)e−Z . After this change of unknown, we find for v the inhomogeneous equation with homogeneous boundary condition ∂τ v − ∂ZZ v + Bv = F, v(τ, y, 0) = 0 with the source term F = e−Z (∂τ + B + Id)v(τ, y, 0). The initial condition is now v(0, y, Z) = −v(0, y, 0)e−Z . Next, we note that thanks to Lemma 5, the assumptions (26), (27) of Lemma 3 is matched with Km replaced by Cm+s,1 . Consequently, we get the existence of Uˇ ip and Uˆ ip which satisfy the estimates (28), (29), (30), (31) thanks to Lemma 5. Next, the construction of the corrector εU c = ε(U c,1 + U c,2 ) to recover the divergence free conditions and the exact boundary condition is similar as in Sect. 1.2. Moreover, the estimates on these terms are the same with Cm+1 replaced by Cm+s+1,1 since they only depend on the estimates on Lemma 3. ip ip Finally, we have to compute the error terms R1 , R2 . Again since the estimates on the terms Uˆ ip , Uˇ ip and U c are the same as in Sect. 1.2, most of the terms except one type of terms can be estimated as in this section, and we shall not give the details again. The only difference occurs for the estimates of the terms involving quadratic quantities in the Navier-Stokes equation and more precisely for the terms under the form (there are similar terms involving the other boundary layer that we can estimate in the same way): N = u3 ∂z uˇ ip = ε−1 u3 ip

int,ip

∂Z Uˇ ip + uc3 ∂Z Uˇ ip .

ip ip The last equality comes from the fact that uˇ 3 = uˆ 3 term, we use the same technique as for the term E3 in the

(79)

= 0. To estimate the second proof of Lemma 4. It remains to estimate the first term. Note that this term did not appear in Sect. 1.2 since for the

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

241

well-prepared part, we have uint 3 = 0. For the estimate of this term, we use the classical Hardy - like estimate which is classical in boundary layer stability problems [14–16]: int,ip since ∂ym u3 (τ, y, 0) = 0, we have int,ip

|u3

int,ip

(τ, z)|2m ≤ Cz||∂z u3

hence we write int,ip

||u3

∂Z uˇ ip ||2m ε −2 ||∂z u3

int,ip

int,ip

||∂z u3

(τ )||2m

(τ )||2m ,

+∞ 0

z|∂Z uˇ ip (τ, ·, z/ε)|2m dz

(80)

1

(τ )||2m ||Z 2 ∂Z uˇ ip (τ, ·, ·)||2m int,ip

Cm+s+1 ||∂z u3

(τ )||2m .

(81)

Here, we have used the uniform bound (31). Finally, thanks to (59), (61), we get int,ip

||u3

int,ip

∂Z uˇ ip ||2m + ||u3

∂Z uˇ ip ||m (Cm+s+1 + Cm,1 )a(τ )

which is a part of (77). 1.4. Final approximate solution. By collecting the results of Lemma 4–7, we can finally get our final approximate solution Theorem 8. Let U app = U + U ip , then U app verifies the boundary condition (4), the divergence free condition (3) and app

app

N S(U app ) = R1 , M(U app ) = R2 , with the estimates ||R1 (τ )||m + ||R2 (τ )||2m + ε −2 ||R2 ||2m ≤ (Cm+s+2,1 + Cm,2 )(ε + a(τ )), app

app

app

(82)

where a ∈ + ). Moreover, at t = 0, we have L1 (R

app

app

app

||u/t=0 − u0 ||2m + ε 2 ||∇(u/t=0 − u0 )||2m + ε 4 ||(u/t=0 − u0 )||2m ≤ (Cm+s+2,1 + Cm,2 )ε 2 .

(83)

We do not detail the proof of this theorem since the estimates (82), (83) are obtained by collecting the results of Lemma 4–7 and by estimating the new interaction terms uip · ∇u + u · ∇uip . Nevertheless there is no new difficulty for the estimate of these terms: as previously, we get a bound on the most difficult term (which appears only in the first term of the above int,wp sum since u3 = 0) thanks to (80). 2. Linear Stability In this section, we set V = e−γ t (U ε − U app ), where γ is a parameter which will be chosen sufficiently large (but independent of ε) later and the aim is to derive an energy estimate for V which is the solution of the system

242

F. Rousset e×j ∂t v + γ v + uwp · ∇v + v · ∇uwp + e×v ε + ε + j = ∇ϕ − e × v + F2 , ∇ · v = 0, ∇ · j = 0,

∇p ε

= εv + F1 ,

(84) (85) (86)

where app

F1 = R1 F2 =

− (uapp − uwp ) · ∇v − v · ∇(uapp − uwp ) − eγ t v · ∇v,

app R2

(87) (88)

with the boundary condition (4) and the initial condition v(0, x) = v0 (x).

(89)

Note that in the Navier-Stokes equation, we have incorporated both the nonlinear term and some part of the linear terms in the source term. In this section, we shall make an estimate of the solution of (84), (85), (86) with a general source term F = (F1 , F2 ). In the next section, we shall prove Theorem 1 by using this estimate in a fixed point argument which will take into account the form of the source term given by (87), (88). We have chosen to put the terms (uapp − uwp ) · ∇v + v · ∇(uapp − uwp ) which are linear in v in the source term and to postone their estimate to the next section because they need to be estimated in a different way: the fast decay in time of the ill-prepared part and time layer part of the approximate solution makes them easier to handle, nevertheless, we have to be careful with the singularity that they carry at the space time corner t = 0, z = 0. The main theorem of this section is Theorem 9. Assume (H), then there exists γ0 > 0 and ε0 > 0 such that for every γ ≥ γ0 , ε ≤ ε0 and m ∈ N, there exists C > 0 such that ∀T ≥ 0, T ||v(T )||m,γ + ε 2 ||∇v(T )||2m,γ + ε||∇v(s)||2m,γ + ε −1 ||j (s)||2m,γ ds 0 ≤ C ||v0 ||2m,γ + ε 2 ||∇v0 ||2m,γ + ε 4 ||v0 ||2m,γ T T 2 2 −1 2 . (90) + ε||F1 (s)||m,γ + ε ||F2 (s)||m,γ ds + ||F1 (s)||m,γ ds 0

0

Note that the estimate (90) can be seen as an improvement of the estimate in Theorem 1 3 of [23]: the main amelioration is that we have an estimate L∞ T (H ()) for v which is 2 1 better than the LT (H ) given in [23]. Moreover, there are no derivatives with respect to z of the source term in the estimate (90). These two points are crucial for the nonlinear stability argument since we cannot use time and normal derivative estimates as in [23] because of the singularities in the initial layers. The proof of this theorem is quite long and hence, we split it in various lemmas. The aim of the two first lemmas is to show that we get (90) as soon as we have a good estimate T of ε 0 ||∇v||2m,γ . Hence the situation will be reduced to the one in [23, 22], and the aim of the next three lemmas will be to sharpen the estimates in [23, 22] to control the singularity in the source term.

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

243

Lemma 10. There exists C1 > 0 such that for γ ≥ γ0 and ε ≤ ε0 , we have T 2 ||v(T )||m,γ + ε −1 ||j (s)||2m,γ + ε||∇v||2m,γ ds ≤ C1 ||v0 ||2m,γ

T

+C1 0

0

ε||∇v(s)||2m,γ +||F1 (s)||m,γ ||v(s)||m,γ +ε−1 ||F2 (s)||2m,γ ds. (91)

Note that in this lemma, we do not use the spectral Assumption (H). Nevertheless, for the moment this estimate does not allow to get (90) by the Gronwall inequality because of the term C1 ε||∇v||2m,γ , which cannot be absorbed by the left-hand side when C1 is large. To control this term, we need other estimates which use the spectral Assumption (H) as in [23]. α to (84), (85), (86), we get Proof. After the application of the operator Zm α α α α v + γ Zm v + uwp · Zm ∇v + Zm v · ∇uwp ∂t Zm αv αj αp e × Zm e × Zm ∇Zm + + + ε ε ε α α v + Zm F1 − C α , = εZm α α α j = ∇Zm ϕ − e × Zm v + Zm F2 , α α ∇ · Zm v = 0, ∇ · Zm j = 0,

(92) (93) (94)

where the commutator C α is defined as α α , uwp · ∇]v + [Zm , ∇uwp ]v. C α = [Zm

A standard energy estimate for this system gives d 1 α 2 α α α ||Z v|| + γ ||Zm v||2 + ε −1 ||Zm j ||2 + ε||∇Zm v||2 dt 2 m α α α α α ||Zαm F1 || ||Zm v|| + ε −1 ||Zm F2 ||2 + |(C α , Zm v)| + |(Zm v · ∇uwp , Zm v)|. (95) α v, Z α v) cancels since ∇ · uwp = 0 and that We just recall that the term (uwp · ∇Zm m thanks to (93), (94), we have α α α α α α α j, Zm u) = −(Zm j, e × Zm u) = ||Zm j ||2 − (Zm j, ∇Zm ϕ) (e × Zm α α +O(1)||Zm j || ||Zm F2 || 1 α 2 α α α α = ||Zm j ||2 + O(1)||Zm j || ||Zm F2 || ≥ ||Zm j || − C||Zm F2 ||2 . 2

Next, we estimate the last term in (95) in a classical way by the same method as in (80): α α (96) v · ∇uwp , Zm v)| ≤ C ||v||2m,γ + ε||∇v||2m,γ . |(Zm We recall that C is large when the boundary layer has large amplitude. The commutator C α was already estimated in [23](estimates (40) and (41)), we have

α |(C α,wp , Zm v)| ||v||2m + (ε 2 + εγ −1 )||∇v||2m . (97) |α|≤m

Finally, (91) follows by integrating (95) in time and by using (97) and (96).

244

F. Rousset

The next step towards the proof of Theorem 9 is to estimate ε 2 ||∇v(T )||2m,γ . Lemma 11. There exists C2 > 0 such that for γ ≥ γ0 and ε ≤ ε0 we have T 2 2 ||∂t v(s)||2m,γ ds ε ||∇v(T )||m,γ + ε 0

≤ C2 ε 2 ||∇v0 ||2m,γ + +ε−1 ||j ||2m,γ

(98)

T

||v||2m,γ + ε||∇v||2m,γ + ε||F1 ||2m,γ + ε −1 ||F2 ||2m,γ ds .

(99)

0

(100)

Note that this lemma will give control of ε2 ||∇v(T )||2m,γ . This estimate will be crucial in the nonlinear stability argument and is better than in the pure rotating fluid case where only an estimate of ε3 ||∇v(T )||2 was derived in [8]. α v, a standard energy estimate, thanks to the boundary Proof. We multiply (84) by ε∂t Zm condition (4) and the divergence free condition (94) gives

d α α α α α α v||2 + ε||∂t Zm v||2 + (e × Zm v, ∂t Zm v) + (e × Zm j, ∂t Zm v) (101) ||∇Zm dt α wp α α (u · ∇v)||2 + ||Zm (v · ∇uwp )||2 + ||Zm F1 ||2 . (102) ε ||Zm

ε2

Next, we write thanks to the Young inequality, α α α α |(e × Zm j, ∂t Zm v)| ≤ ε η||∂t Zm v||2 + c(η)ε −1 ||Zm j ||2 ,

(103)

where η > 0 independent of ε and γ will be chosen sufficiently small later. Moreover, by a new use of (93) and (94), we have α α α α α α α α (e × Zm v, ∂t Zm u) = (∇Zm ϕ, ∂t Zm v) − (Zm j, ∂t Zm v) + (Zm F2 , ∂t Zm v) α α α α = −(Zm j, ∂t Zm v) + (Zm F2 , ∂t Zm v),

hence we get α α α α α |(e × Zm v, ∂t Zm v)| ≤ ηε||∂t Zm v||2 + ε −1 c(η)(||Zm j ||2 + ||Zm F2 ||2 ).

(104)

By choosing η sufficiently small, we get from (101), (103), (104) that T ||∂t v(s)||2m,γ ds ε2 ||∇v(T )||2m,γ + ε 0

T

α α ε −1 ||j ||2m,γ + ε −1 ||Zm F2 ||2 + ε||Zm F1 ||2 0 α α app +ε||Zm (u · ∇v)||2m,γ + ε||Zm (v · ∇uapp )||2m,γ ds . (105)

ε

2

||∇v(0)||2m,γ

+

To estimate the two last terms in the last inequality, we write α wp ε||Zm (u · ∇v)||2 ε||uwp ||2m,∞ ||∇v||2m,γ ε||∇v||2m,γ

and α α ˜ wp ||2m,∞ ||v||2m,γ + ε||Zm ε||Zm (v · ∇uwp )||2m ε||∇u (v3 ∂z uwp )||2m α (v3 ∂z uapp )||2m . ||v||2m,γ + ε||Zm

(106)

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

245

To estimate the second term, we use that α α ε||Zm (v3 ∂z uwp )||2 ε||v||2m,γ + ε −1 ||Zm (v3 ∂Z uwp,b )||2 .

Again, thanks to (80), we have α ε−1 ||Zm (v3 ∂Z uwp,b )||2 ε||∇v||2m,γ .

Consequently, we have proved that α ε||Zm (v · ∇uwp )||2m ||v||2m,γ + ε||∇v||2m,γ .

We end the proof by collecting (105) and (106), (107).

(107)

For the moment, the estimates in Lemmas 10–11 are not sufficient to conclude since T the term ε 0 ||∇v||2m,γ cannot be absorbed by the left-hand side when the boundary layer has large amplitude. As in [23, 22], we must use other estimates which come from the spectral Assumption (H). These estimates rely on a cut-off of the frequency domain following the idea of [21]. To use the Fourier transform in time, it is more convenient to deal with a vanishing initial value, hence we set v = v − e−t/ε v0 for t ≥ 0. Since v0 is such that ∇ · v0 = 0 and v0/∂ = 0, v is now a solution of ∂t v + γ v + uwp · ∇v + v · ∇uwp +

e×v e×j ∇p + + ε ε ε

(108)

= εv + F1 + F10 , j = ∇ϕ − e × v + F2 + F20 , ∇ · v = 0, ∇ · j = 0,

(109) (110)

where e × v0 F10 = e−t/ε (γ − ε −1 )v0 + uwp · ∇v0 + v0 · ∇uwp + − εv0 , ε F20 = −e−t/ε e × v0 , which still satisfies the boundary condition (4). Next we continue the source terms F , F0 by 0 for t < 0 and t > T , and we choose a smooth continuation of uwp for t ∈ R. By standard arguments for parabolic equations (see [19, 21]), the solution of the new equation will coincide with v on [0, T ], that’s why we do not use different letters for the functions and their continuation. Finally, since v /t=0 = 0, we set v = 0 for t < 0 and v will be a solution of (108) for t ∈ R. To get better estimates than in [23], we use a slightly different frequency partition. The first step here will be to elimitate low spatial frequencies. Let χ l (ξ ) ∈ Cc∞ (R2 ) such that χ l = 1 for |ξ | ≤ r and χ l = 0 for |ξ | ≥ 2r. We define v l as v l (t, y, z) = χ l (ε∂y )v(t, y, z), where χ l (ε∂y ) is the Fourier multiplier defined as Fy (χ l (ε∂y )v)(t, ξ, z) = χ l (εξ )Fy (v)(t, ξ, z). Note that the variables t and z are considered as parameters in this definition. By choosing r sufficiently small, we can get a better estimate for v l than the classical one which was obtained in Lemma 10. This is the aim of the following lemma.

246

F. Rousset

Lemma 12. There exists r > 0, C3 > 0, such that for γ ≥ γ0 , and ε ≤ ε0 , we have γ |||v l (s)|||2m,γ + ε|||∇v l (s)|||2m,γ + ε −1 |||j l |||2m,γ ≤ C3 |||v|||2m,γ + ε(γ −1 + ε)|||∇v|||2m,γ T (||F1 (s)||m,γ + ||F10 (s)||m,γ )||v(s)||m,γ + 0 +ε −1 (||F2 (s)||2m,γ + ||F20 (s)||2m,γ ) ds .

(111)

α to (108), (109), (110), this yields Proof. We apply the operator χ (ε∂y )Zm α l α l α α l ∂t Zm v + γ Zm v + uwp · Zm ∇v l + Zm v · ∇uwp +

α vl e × Zm ε

α jl α pl e × Zm ∇Zm α l α l α 0,l = εZm v + Zm F1 + Z m F1 − C α,l , + ε ε α l α l α l ϕ − e × Zm v + Zm F2 , j l = ∇Zm

+

∇

α l · Zm v

= 0, ∇

α l · Zm j

= 0,

(112) (113) (114)

where the commutator C α,l is defined by C α,l = C1α,l + C2α,l + C3α,l , where α α C1α,l = [Zm , uwp · ∇]v l + [Zm , ∇uwp ]v l , α C2α,l = Zm [χ l , uwp · ∇]v , α C3α,l = Zm [χ l , ∇uwp ]v .

We perform the same kind of energy estimate as in the proof of Lemma 10. Nevetheless, to get the improved bounds (111), we have to estimate in a different way the crucial term α v l · ∇uwp , Z α v l ). As in [23] Sect. 2.3 and [22] Sect. 3.3, we have (Zm m α l α l |(Zm v · ∇uwp , Zm v )| ≤ Cεr||∇v l ||2m,γ .

(115)

The factor r in this estimate comes from the spectral localization of χ l . Moreover, thanks to the proof of [23] Lemma 1, we also have α l |(C α,l , Zm v )| ||v||2m,γ + ε 2 ||∇v||2m,γ + εγ −1 ||∇v||2m,γ .

(116)

Note that the estimate of these commutators rely on [21] Appendix B and in particular on the fact that t and x3 playing the part of parameters: |[χ (ε∂y ), m]v(t, x3 , ·)| ≤ Cε|v(t, x3 , ·)| for m ∈ W 1,∞ .

(117)

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

247

Consequently, the usual energy estimate gives that γ |||v l |||2m,γ + ε|||∇v l |||2m,γ + ε −1 |||j l |||2m,γ ≤ C εr|||∇v l |||2m,γ + |||v|||2m,γ + ε(γ −1 + ε)|||∇v|||2m,γ T (||F1 (s)||m,γ + ||F10 (s)||m,γ )||v(s)||m,γ + 0 +ε −1 (||F2 (s)||2m,γ + ||F20 ||2m,γ ) ds, and hence we find (111) by choosing r sufficiently small (Cr < 1).

The previous lemma gives a good estimate of γ |||v l |||2m,γ + ε|||∇v l |||2m,γ . The aim of the two following lemmas is to estimate the remaining part, i.e. γ |||(1 − χ l (ε∂y )v|||2m,γ + ε|||∇(1 − χ l (ε∂y ))v|||2m,γ . Towards this, we need to consider differently large and bounded space-time frequencies. We choose a smooth bounded function ψ(s) on R such that ψ = 1 for |s| ≤ R and ψ = 0 for |s| ≥ 2R and we define χ (ζ ) = ψ(< ζ >), where ζ = (γ , τ, ξ ) and 1 < ζ >= (γ 2 + τ 2 + |ξ |4 ) 4 . Further, we write (1 − χ l (ε ∂y ))v = (1 − χ l )χ (ε ∂)v + (1 − χ l )(1 − χ (ε ∂))v = χ s (ε ∂) v + χ L (ε ∂)v = vs + vL, where χ (ε ∂) is the Fourier multiplier defined by Ft,y (χ (ε ∂)f ) = χ (ε ζ )Ft,y f. Note that the spectrum of v s is supported in {< εζ >≤ 2R} ∩ {ε|ξ | ≥ r} and that of v L is supported in {< εζ >≥ R} ∩ {ε|ξ | ≥ r}. We first give an estimate for v L : Lemma 13. There exists R > 0, C4 > 0 such that for γ ≥ γ0 and ε ≤ ε0 , we have γ |||v L |||2m,γ + ε|||∇v L |||2m,γ + ε −1 |||j L |||2m,γ ≤ C4 |||v|||2m,γ + ε(γ −1 + ε)|||∇v|||2m,γ

+ε(|||F1 |||2m,γ + |||F10 |||2m,γ ) + ε −1 (|||F2 ||2m,γ + |||F20 |||2m,γ ) .

(118)

α to (108), Proof. We use the same technique as in [22] Lemma 5, we apply χ L (ε∂)Zm α v L , where κ(ζ ) = i/sgn τ for |τ | ≥ 1. (109), (110) and we multiply it by (1 + κ(ε∂))Zm A standard energy estimate together with the fact that the spectrum of v L is contained in < εζ >≥ R gives

γ |||v L |||2m,γ + ε|||∇v L |||2m,γ + R 2 ε −1 |||v L |||2m,γ + ε −1 |||j L |||2m,γ (119) −1 L 2 −1 2 ≤ C ε |||v |||m,γ + ε(ε + γ )|||∇v|||m,γ + |||F1 |||m,γ + |||F10 |||m,γ |||v|||m,γ + ε −1 (|||F2 |||2m,γ + |||F20 |||2m,γ ) , (120) where C is independent of R, ε and γ . Consequently, we find (118) for R sufficiently large.

248

F. Rousset

It remains to estimate v s ; this is the crucial part, where the spectral Assumption (H) is needed. Lemma 14. Assuming (H), there exists C5 > 0, such that for γ ≥ γ0 and ε ≤ ε0 , we have ε|||∇v s |||2m,γ ≤ C5 ε|||v|||2m,γ + (ε 2 + εγ −1 )|||∇v|||2m,γ +ε(|||F1 |||2m,γ + |||F10 |||2m,γ ) + ε −1 (|||F2 |||2m,γ + |||F20 |||2m,γ ) . (121) Proof. Again we apply the operator χ s (ε∂) to (108), (109), (110), and we rewrite the equation obtained from (108) as ∂t v s + uwp,0 · ∇v s + v s · ∇uwp,0 +

e × vs e × js + = εv s + H, ε ε

(122)

where H = F1s + F10,s − (uwp − uwp,0 ) · ∇v s − v s · ∇(uwp − uwp,0 ) + C, where C is the commutator defined by C = [χ s , uwp · ∇]v + [χ s , ∇uwp ]v. Moreover, we recall that uwp,0 stands for the leading term in the asymptotic expansion (87). Note that thanks to a new use of (117), we have the estimate |||H |||2m,γ |||F1 |||2m,γ + |||F10 |||2m,γ + ε 2 |||∇v|||2m,γ + |||v|||2m,γ .

(123)

Now, we introduce the operators ˜ + , T1 (ε∂) = ε∂t + εγ + u˜ wp,0 · ∇˜ − ε wp,0 ˜ ˜ T2 (ε∂) = ε∂t + εγ + u˜ · ∇ − ε , and we rewrite (122), (109), (110) in spatial dynamics. We set t ps ϕ s V = v s1 , v s2 , ε∂z v s1 , ε∂z v s2 , v 3 , , , j3s ε ε and we get the system 1 G(ε∂, t, x1 , x2 , z/ε)V + H, ε where the symbol of the operator is given by  0 0 1 0 0 0 0  0 0 0 1 0 0 0   T1 (ζ ) −1 0 0 ∂Z uwp,b,0 ξ1 −ξ2  1  1 T (ζ ) 0 0 ∂ uwp,b,0 ξ ξ1  1 Z 2 2 G(ζ, q, z/ε) =  0 0 0  −ξ1 −ξ2 0 0  0 0 −ξ1 −ξ2 −T2 (ζ ) 0 0   0 0 0 0 0 0 0 ξ2 −ξ1 0 0 0 0 −ξ12 − ξ22 ∂z V =

where q is a placeholder for uint,0 (t, x1 , x2 ). Moreover, H is defined as

(124)  0 0  0  0 , 0  0  1 0

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

0,s s − F2,3 , −∇˜ · (F˜2s + F˜20,s ) . 0, 0, −H1 , −H2 , 0, H3 , −F2,3

249

(125)

The boundary condition for (124) becomes V1 = V2 = V5 = V8 = 0 on ∂.

(126)

Next, we notice that for ξ = 0, the matrix G∞ (ζ, q) = limZ→+∞ G(ζ, q, Z) is similar to A∞ (ζ, q) = limZ→+∞ A(ζ, q, Z), hence the dimension of the stable subspace of G∞ is also 4. Since we have four boundary conditions, we can construct an Evans function E by the same argument as in the introduction. Moreover, the existence of a nontrivial solution to (8) is equivalent to the existence of a nontrivial solution for (124), (126). Hence the nonvanishing of D is equivalent to the nonvanishing of E for ξ = 0. Consequently, since we have a problem under the abstract form (124), (126), we can use the fact that the Evans function E does not vanish as in [21, 23, 22] to get an energy estimate for such an elliptic boundary value problem. We find |||V |||2m,γ ε2 |||H|||2m,γ . This yields, in particular, thanks to (125) and (123), ε −1 |||v s |||2m,γ + ε|||∂z v s1 |||2m,γ + ε|||∂z v s2 |||2m,γ ε(|||F1 |||2m,γ + |||F10 |||2m,γ ) + ε|||∇˜ · (F˜2s + F˜20,s )|||2m,γ +(ε 2 + εγ −1 )|||∇v|||2m,γ + ε|||v|||2m,γ . Finally, we note that |ξ |2 ≤ C(R)ε −2 on the support of χ s , hence we get an estimate for ε|||∂z v s1 |||2m,γ + ε|||∂z v s2 |||2m,γ as in (121) by using that ε|||∇˜ · (F˜2s + F˜20,s )|||2m,γ ≤ C(R)ε −1 (|||F2 |||2m,γ + |||F20 |||2m,γ ). Next, we use that thanks to (110), we also have ε|||∂z v s3 |||2m,γ ≤ C(R)ε −1 |||v s |||2m,γ and that ˜ s |||2m,γ ≤ C(R)ε −1 |||v s |||2m,γ ε|||∇v to conclude the proof of Lemma 14.

2.1. Proof of Theorem 9. To prove (90), it suffices to collect carefully the estimates of the previous lemmas. At first, by summing (111), (118) and (121), we get γ |||v|||2m,γ + ε|||∇v|||2m,γ ≤ C6 |||v|||2m,γ + ε(γ −1 + ε)|||∇v|||2m,γ T ||F1 (s)||m,γ + ||F10 (s)||m,γ ||v(s)||m,γ ds + 0

+ε(|||F1 |||2m,γ + ||||F10 |||2m,γ ) + ε −1 (|||F2 |||2m,γ + |||F20 |||2m,γ ,

250

F. Rousset

where C6 is independent of γ ≥ γ0 and ε ≤ ε0 . Consequently, we can enlarge γ0 to get γ |||v|||2m,γ + ε|||∇v|||2m,γ T ||F1 (s)||m,γ + ||F10 (s)||m,γ ||v(s)||m,γ ds 0

+ε(|||F1 |||2m,γ + ||||F10 |||2m,γ ) + ε −1 (|||F2 |||2m,γ + |||F20 |||2m,γ .

(127)

Now, thanks to the expressions of F10 and F20 , we have that ||F1 (t)||m,γ e−t/ε (γ + ε −1 )||v0 ||m,γ + ||∇v0 ||m,γ + ε||v0 ||m,γ , (128) ||F2 (t)||m,γ e−t/ε ||v0 ||m,γ .

(129)

Consequently, we get from (127) and (128), (129) the estimate T 1 2 2 γ |||v|||m,γ + ε|||∇v|||m,γ Nγ ,ε (v0 ) + ε −1 a(t/ε)Nγ ,ε (v0 ) 2 ||v(s)||m,γ ds + 0

0

T

||F1 (s)||m,γ ||v(s)||m,γ ds + ε|||F1 |||2m,γ + ε −1 |||F2 |||2m,γ ,

where we have set Nε,γ (v0 ) = ||v0 ||2m,γ + ε 2 ||∇v0 ||2m,γ + ε 4 ||v0 ||2m,γ . Finally since v = v − e−t/ε v0 , we find that γ |||v|||2m,γ + ε|||∇v|||2m,γ T 1 Nε,γ (v0 ) + ε −1 a(t/ε)Nε,γ (v0 ) 2 ||v(s)||m,γ ds +

0

T

||F1 (s)||m,γ ||v(s)||m,γ ds + ε|||F1 |||2m,γ + ε −1 |||F2 |||2m,γ . (130)

0

Now, consider A(130) + (91) + η(98), where A and η are chosen independent of ε and γ and such that A > C1 + ηC2 and 1 − C2 η > 0. This yields for every T > 0, ||v(T )||2m,γ + ε 2 ||∇v(T )||2m,γ T + ε −1 ||j (s)||2m,γ + ε||∇v||2m,γ + ε||∂t v(s)||2m,γ ds 0

Nε,γ (v0 ) + +

T

T

ε −1 ||F2 ||2m,γ + ε||F1 ||2m,γ ds

0

1 ||F1 (s)||m,γ + ε −1 a(s/ε)Nε,γ (v0 ) 2 ||v(s)||m,γ ds.

0

We conclude by a Gronwall type inequality, we define for T ∈ [0, t]; t z(T ) = Nε,γ (v0 ) + ε −1 ||F2 ||2m,γ + ε||F1 ||2m,γ ds + 0

T

0

1 ||F1 (s)||m,γ + ε −1 a(s/ε)Nε,γ (v0 ) 2 ||v(s)||m,γ ds.

(131)

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

251

Note that the second member of (131) is majorated by z(T ) for T ∈ [0, t], hence we find 1 z (T ) ||F1 (T )||m,γ + ε −1 a(T /ε)Nε,γ (v0 ) 2 z(T ), and the integration of this inequality gives z(T ) z(0) + Nε,γ (v0 ) + Since z(0) = Nε,γ (v0 ) +

T 0

||F1 (s)||m,γ ds

2 .

(132)

t 0

ε −1 ||F2 ||2m,γ + ε||F1 ||2m,γ ds,

we get (90) by combining (131) and (132) for T = t. 3. Nonlinear Stability In this section, we prove a refined version of Theorem 1. We will deduce (10) in Theorem 1 as a consequence. Let U app be the approximate solution given by Theorem 8. Theorem 15. Under the same assumptions as in Theorem 1, there exists C > 0 such that the solution of (1), (2), (3), (4), (5) is defined on [0, T ∗ ] and such that ∀t ∈ [0, T ∗ ], t ε app 2 ||u (t) − u (t)||m + ε −1 ||j ε − j app ||2m + ε||∇(uε − uapp )||2m ds ≤ Cε2 . 0

Proof. As in Sect. 2, we set V = e−γ t (U ε − U app ) and hence, we study the system (84), (85), (86), but now, we take into account that app

F1 = R1

+ L + Q,

L = −v ip · ∇v − v · ∇v ip ,

Q = −eγ t v · ∇v,

app

F2 = R2 , where v ip = uapp − uwp . Note that by Theorem 8, we have Nε,γ (v0 ) ≤ C0 ε 2 . The existence of classical solutions for (84), (85), (86) is classical and we can define T ε ≤ T ∗ , the maximum of times T such that there exists a solution of (84), (85), (86) such that u ∈ C([0, T ), H m,1 ) and t Ym,γ ,ε (v(t)) + ε||∇v(s)||2m,γ ds ≤ Rε2 , ∀t ∈ [0, T ), (133) 0

where R is chosen such that R > C0 and Ym,γ ,ε (v(t)) = ||v(t)||2m,γ + ε 2 ||∇v(t)||2m,γ . As usual, to prove that T ε = T ∗ , it suffices to prove that we can never reach equality in (133) when ε is sufficiently small. Towards this, we shall use the a priori estimate of Theorem 9. To evaluate the right-hand side of (90), we note thanks to Theorem 8 that T app app ε ||R1 ||2m,γ + ε −1 ||R2 ||2m,γ ds Nm,ε,γ (v0 ) + +

0

0

T

1 ||Rapp ||m,γ ds

2

≤ Cε 2 ,

(134)

252

F. Rousset

where C is independent of ε and R. In this section, γ is fixed and hence in this section C and stand for a number which is independent of ε and R. Next, we have, thanks to Theorem 8 and Lemma 4, 5, 7, that ||v ip · ∇v||m,γ ||v ip ||m,∞ ||∇v||m,γ a(t/ε)||∇v||m,γ , ||v ip · ∇v||2m,γ ||v ip ||2m,∞ ||∇v||2m,γ a(t/ε)||∇v||2m,γ , which gives

t

0

||v ip · ∇v||m,γ ds

t

ε 0

||v ip · ∇v||2m,γ

2

2 a(s/ε)||∇v||m,γ ds 0 t t a(s/ε) a(s/ε)||∇v||2m,γ ds 0 0 t ε a(s/ε)||∇v||2m,γ ds, 0 t ds ε a(s/ε)||∇v||2m,γ ds.

t

(135) (136)

0

In a similar way, by classical Sobolev embeddings for m ≥ 2, we have that 1 1 1 1 1 2 2 2 2 ||v · ∇v ip (t)||m,γ ||v||m,γ ||∇v||m,γ ||∇uip ||m,γ √ ||v||m,γ ||∇v||m,γ a(t/ε), ε

hence we find by a successive use of the Cauchy-Schwarz and Young inequalities that T 2 ||v · ∇v ip ||m,γ ds 0

2 T 1 1 1 T 2 2 ||v||m,γ ||∇v||m,γ a(t/ε) dt ||v||m,γ ||∇v||m,γ a(t/ε) dt ε 0 0 T ε −1 a(t/ε)||v||2m,γ + εa(t/ε)||∇v||2mγ ds (137)

0

and

T

ε 0

||v · ∇v ip ||2m,γ

T 0

0

T

||v||m,γ ||∇v||m,γ a(t/ε) dt ε −1 a(t/ε)||v||2m,γ + εa(t/ε)||∇v||2m,γ dt.

(138)

Finally, we evaluate the nonlinear term Q. For m ≥ 3, we have for every T ∈ [0, T ε ) that T eγ t ||v · ∇v||2m,γ dt ε 0

T

≤C ε T∗

0

≤ CT ∗ Rε 2

||v||m,γ ||∇v||3m,γ

T 0

||∇v||2m,γ dt

dt ≤ C ε sup (||v||m,γ ||∇v||m,γ ) T∗

[0,T ]

0

T

||∇v||2m,γ dt (139)

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

253

and that 2 T eγ t ||v · ∇v||m,γ dt 0

≤ CT ∗

T 0 3

≤ CT ∗ R ε 2

1

3

2 2 ||v||m,γ ||∇v||m,γ dt

T 0

2

≤ CT ∗

T 0

||v||2m,γ dt

1

T

2

0

||∇v||2m,γ dt.

||∇v||2m,γ dt

3 2

(140)

Finally, we collect (90) and (134), (135), (136), (137), (138), (139), (140) and we note 3 that for ε sufficiently small, we have that CT ∗ R(ε 2 + ε 2 ) < 1/2; hence, we find ε t Ym,γ ,ε (v(t)) + ||∇v||2m,γ 2 0 t ≤ C1 ε 2 + C2 ε −1 a(s/ε)Ym,γ ,ε (v(s)) ds ∀t ∈ [0, T ε ) 0

and the Gronwall inequality gives that ∀t ∈ [0, T ε ), t ε t Ym,γ ,ε (v(t)) + ||∇v||2m,γ ≤ C1 ε 2 exp C2 ε −1 a(s/ε) ds ≤ C1 eC3 ε 2 . 2 0 0 To conclude, we choose R such that R > C1 eC3 ; this shows that inequality is impossible in (133). By classical arguments we get that T ε = T ∗ . This ends the proof of Theorem 15. 3.1. Proof of Theorem 1. We recall from Theorem 8 that T ||U ip (s)||2 + ||U tl ||2 ds ε. 0

The crucial part in the obtention of this estimate is Lemma 5. We get at once from Theorem 15 that ||uε − uint ||2L2 ((0,T ∗ )×) + ε −1 ||j ε ||2L2 ((0,T ∗ )×) ε. ∗ 2 It remains to prove that uε − uint goes to zero in L∞ loc ((0, T ), L ()). Again, we easily get from Theorem 15 that √ ||uε − uint ||L∞ ((0,T ∗ ),L2 ()) ||uip ||L∞ ((0,T ∗ ),L2 ()) + ε. loc

loc

||v ip ||

Consequently, it suffices to prove that tends to zero. From (65) ∗ 2 L∞ loc ((0,T ),L ()) −1 and (66), it suffices to prove that ||F Ik (·/ε)||L∞ ((0,T ∗ ),L2 ()) tends to zero. Let η > 0, loc we choose R such that 0 ||1|ξ |≥R ak (ξ )vk,i || ≤ η.

Then, we get ||Ik (t/ε)|| ≤ ||1|ξ |≤R Ik || + η ≤ Ce and hence, we easily get (10).

−

π 2 t ε(π 2 +R 2 )

+ η,

254

F. Rousset

4. Proof of Lemma 3 We set v = v1 + iv2 , F = F1 + iF2 and f = f1 + if2 , then v is a solution of ∂τ v + ( + i)v − ∂ZZ v = F,

(141)

in the domain Z > 0 with the initial condition v(0, y, Z) = f(y, Z) and the boundary condition v(τ, y, Z) = 0. As usual for boundary value problems for the heat equation, we can choose odd continuations of F and f and then it is equivalent to solve (141) for Z ∈ R with F and f replaced by their continuation (by an abuse of notations we will use the same letters for the functions and their continuations). Note that f is not continuous since we did not assume that f(0) = 0. The solution of this problem is then given by τ v(τ, y, z) = e−(+i)τ kτ f + e−(+i)(τ −s) kτ −s F(s) ds, (142) 0

where

Z2 exp − 4τ 4πτ is the standard one dimensional heat kernel and stands for the convolution with respect to the Z variable. We recall that in this problem the variable y only plays the part of a parameter. By standard convolution estimates, we have τ ||v(τ )||m,p e−τ ||f||m,p + e−(τ −s) ||F(s)||m,p ds kτ (Z) = √

1

0

which gives (28). Moreover, since by (26), (27), we also have +∞ τ +∞ e−(τ −s) ||F(s)||m,p dsdτ Km a(τ ) dτ, 0

and

0

+∞ τ

0

0

0

e−(τ −s) ||F(s)||m,p ds

2

2 dτ Km

2

+∞

a(τ ) dτ

,

0

we also get (29). To prove the estimates involving Z derivatives, we note, thanks to an integration by parts, that ∂Z (kτ f) = ∂Z kτ f = − ∂µ kτ (Z − µ)f(µ) dµ R

= 2kτ (Z)f(y, 0) + kτ ∂Z f. Consequently, we get

τ

kτ (Z)f(y, 0) + 2 e−(+i)(τ −s) kτ −s (Z)F(s)Z=0 ds 0 τ +e−(+i)τ kτ ∂Z f + e−(+i)(τ −s) kτ −s ∂Z F(s). (143)

∂Z v = 2e

−(+i)τ

0

We deduce from this that τ 1 1 ||∂Z v||2m Kn e−τ (1 + √ ) + e−(τ −s) a(s)(1 + √ ) ds , τ τ −s 0 +∞ e−s √ ds, < +∞, we get (29). and hence, since 0 s

Stability of Large Amplitude Ekman-Hartmann Boundary Layers in MHD

255

We now turn to the proof of (30). We only prove the case k = 2, the others being easier. Thanks to (143), we get τ e−(+i)(τ −s) ∂Z kτ −s (Z)F(s)Z=0 ds ∂Z2 v = 2e−(+i)τ ∂Z kτ (Z)f(y, 0) + 2 0 τ −(+i)τ +e ∂ Z kτ ∂ Z f + e−(+i)(τ −s) ∂Z kτ −s ∂Z F(s). 0

Again by standard convolution estimates, this yields τ 1 1 − 2 −1 −1 −τ − 4ε2 τ e + e−(τ −s) e 4ε2 (τ −s) a(s) ds . |∂Z v(τ, ·, ε )|m Km τ e 0

We deduce from this that +∞ |∂Z2 v(τ, ·, ε−1 )|m dτ Km 1 + 0

and since τ −1 e

−

1 4ε 2 τ

+∞

+∞

a(s) ds 0

e−τ τ −1 e

−

1 4ε 2 τ

dτ,

0

ε2 for every τ ≥ 0, we get +∞ |∂Z2 v(τ, ·, ε−1 )|m dτ Km ε 2 . 0

This gives the first part of (30). In a similar way, we have +∞ |∂Z2 v(τ, ·, ε−1 )|2m dτ 0 +∞ 2 +∞ − 1 a(s) ds e−τ τ −2 e 2ε2 τ Km ε 4 , Km 1 + 0

0

and hence the second part of (30) is proved. It remains to prove (31). Again, we will use (143). At first, we notice that 2 1 2 2 ||z kτ ∂Z f||m |Z| kτ (Z − µ)|∂Z f(µ)|m dµ dZ R R 2 1 |z − µ| 2 kτ (Z − µ)|∂Z f(µ)|m dµ dZ R R 2 1 + kτ (Z − µ)|µ| 2 |∂Z f(µ)|m dµ dZ R R 1 1 |Z 2 kτ |2L1 (Z) + |kτ |2L1 (Z) ||∂z f||2m + ||Z 2 ∂Z f||2m 1

Km (τ 2 + 1), thanks to (26), (27). Next, thanks to the previous computation and (143), we get that 1 1 1 ||Z 2 ∂Z v||m Km e−τ (||Z 2 kτ ||m + 1 + τ 4 ) τ 1 1 + e−(τ −s) (||Z 2 kτ −s ||m + 1 + (τ − s) 4 )a(s) ds , 0

1

1

and hence we find (31) since τ 4 e−τ is uniformly bounded and ||Z 2 kτ −s ||m = O(1).

256

F. Rousset

References 1. Alexander, J., Gardner, R., Jones, C.: A topological invariant arising in the stability analysis of travelling waves. J. Reine Angew. Math. 410, 167–212 (1990) 2. Alinhac, S., G´erard, P.: Op´erateurs pseudo-diff´erentiels et th´eor`eme de Nash-Moser. Savoirs Actuels. [Current Scholarship]. Paris: InterEditions, 1991 3. Allen, L., Bridges, T.J.: Hydrodynamic stability of the ekman boundary layer including interaction with a compliant surface: a numerical framework. European J. Mech. B Fluids 22, 239–258 (2003) 4. Babin, A., Mahalov, A., Nicolaenko, B.: Global splitting, integrability and regularity of 3D Euler and Navier-Stokes equations for uniformly rotating fluids. European J. Mech. B Fluids 15(3), 291–300 (1996) 5. Babin, A., Mahalov, A., Nicolaenko, B.: Global regularity of 3D rotating Navier-Stokes equations for resonant domains. Indiana Univ. Math. J. 48(3), 1133–1176 (1999) 6. Chemin, J.-Y., Desjardins, B., Gallagher, I., Grenier, E.: Ekman boundary layers in rotating fluids. ESAIM Control Optim. Calc. Var. 8, 441–466 (electronic), 2002. A tribute to J. L. Lions 7. Chemin, J.-Y.: Perfect incompressible fluids. In: Volume 14 of Oxford Lecture Series in Mathematics and its Applications. New York: The Clarendon Press Oxford University Press, 1998. Translated from the 1995 French original by Isabelle Gallagher and Dragos Iftimie 8. Desjardins, B., Dormy, E., Grenier, E.: Stability of mixed Ekman-Hartmann boundary layers. Nonlinearity 12(2), 181–199 (1999) 9. Desjardins, B., Dormy. E., Grenier, E.: Instability of Ekman-Hartmann boundary layers, with application to the fluid flow near the core mantle boundary. Physics of the Earth and Planetary Interior 124, 283–294 (2001) 10. Desjardins, B., Grenier, E.: Linear instability implies nonlinear instability for various boundary layers. Ann. Inst. H. Poincar´e Anal. Non-Lineaire 20(1), 87–106 (2000) 11. Dormy, E.: Mod´elisation num´erique de la dynamo terrestre. PhD thesis, Institut de Physique du Globe, 1997 12. Gardner, R.A., Zumbrun, K.: The gap lemma and geometric criteria for instability of viscous shock profiles. Comm. Pure Appl. Math. 51(7), 797–855 (1998) 13. Gerard-Varet, D.: A geometric optics type approach to fluid boundary layers. Comm. Partial Differ. Eqs. 28(9–10), 1605–1626 (2003) ´ 14. Gisclon, M., Serre, D.: Etude des conditions aux limites pour un syst`eme strictement hyberbolique via l’approximation parabolique. C. R. Acad. Sci. Paris S´er. I Math. 319(4), 377–382 (1994) 15. Grenier, E., Gu`es, O.: Boundary layers for viscous perturbations of noncharacteristic quasilinear hyperbolic problems. J. Differ. Eqs. 143(1), 110–146 (1998) 16. Grenier, E., Masmoudi, N.: Ekman layers of rotating fluids, the case of well prepared initial data. Comm. Partial Differ. Eqs. 22(5–6), 953–975 (1997) 17. Kapitula, T., Sandstede, B.: Stability of bright solitary-wave solutions to perturbed nonlinear Schr¨odinger equations. Phys. D 124(1–3), 58–103 (1998) 18. Kato, T., Fujita, H.: On the nonstationary Navier-Stokes system. Rend. Sem. Mat. Univ. Padova 32, 243–260 (1962) 19. Kreiss, H.-O., Lorenz, J.: Initial-boundary value problems and the Navier-Stokes equations. Boston, MA: Academic Press Inc., 1989 20. Masmoudi, N.: Ekman layers of rotating fluids: the case of general initial data. Comm. Pure Appl. Math. 53(4), 432–483 (2000) 21. M´etivier, G., Zumbrun, K.: Large viscous boundary layers for noncharacteristic nonlinear hyperbolic problems. To appear in Mem. of the Amer. Math. Soc. available at http://www.ufr-mi.u-bordeaux.fr/˜ metivier/preprints. html, Preprint, 2002 22. Rousset, F.: Large mixed Ekman-Hartmann boundary layers in magnetohydrodynamics. Nonlinearity 17(2), 503–518 (2004) 23. Rousset, F.: Stability of large Ekman boundary layers in rotating fluids. Arch. Rat. Mech. Anal. 172(2), 213–245 (2004) 24. Sermange, M., Temam, R.: Some mathematical questions related to the MHD equations. Comm. Pure Appl. Math. 36(5), 635–664 (1983) Communicated by P. Constantin

Commun. Math. Phys. 259, 257–286 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1388-4

Communications in

Mathematical Physics

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above Itai Benjamini, Gady Kozma Department of Mathematics, Weizmann Institute of Science, Rehorot 76100, Israel Received: 7 October 2003 / Accepted: 28 February 2005 Published online: 15 July 2005 – © Springer-Verlag 2005

Abstract: We show that the statistics of loop erased random walks above the upper critical dimension, 4, are different between the torus and the full space. The typical length of the path connecting a pair of sites at distance L, which scales as L2 in the full space, changes under the periodic boundary conditions to Ld/2 . The results are precise for dimensions ≥ 5; for the dimension d = 4 we prove an upper bound, conjecturally sharp up to subpolyonmial factors. 1. Introduction A well known phenomenon in probabilistic constructions in Rd or Zd is that usually some critical dimension d exists, above which the geometry of Rd ceases to play any significant role, and the process behaves like a similar non-geometric object, usually a tree. At the critical dimension itself, similar behavior is also expected, but when compared to the non-geometric object one gets a logarithmic correction. The phenomenon was discovered in statistical mechanics and field theory, (see e.g. [W71]) for which it is known that various critical exponents stabilize beyond suitable, model dependent, upper critical dimensions. Many of these physical models can be translated to questions in probability, some of which were solved rigorously. Notable examples include the Ising model [A82], the self-avoiding walk [BS85, HS92], percolation [HS90], directed percolation [NY95], and lattice trees [DS98]. In particular, the problem of loop-erased random walk on Zd is well studied. Loop-erased random walk is a process that starts from a random walk on some graph and then removes all loops in chronological order, or in other words, whenever the random walk hits the partial path, the loop just created is erased and the process continues. The result is a random simple path. Originally [L80] suggested as a model for the selfavoiding walk (a random walk conditioned not to hit itself), better understanding of its structure has situated it as an important object in combinatorics and mathematical physics. In particular, loop-erased random walk can be used to describe the uniform spanning

258

I. Benjamini, G. Kozma

tree, which is the limit of the q-Potts model as q → 0. See [S00] for a survey, and the complementary [LSW04]. For a survey with a different focus, see [L99]. Note also the recent [BKPS04] for the rich structure of the phase transitions of the uniform spanning tree in various dimensions. For other recent results of interest, see [BLPS01, K, LPS03]. It is well known that the critical dimension of loop-erased random walk on Zd is 4, since above this dimension a random walk does not intersect itself enough and the process of loop-erasure is local and uninteresting. See [L96, Chap. 7]. Further, loop-erased walk is one of the few models where the logarithmic correction is known precisely, with a correction of log−1/3 loop-erased random walk on Z4 is similar to the regular random walk on Z4 , see [L95]. With so much known, it seems strange that a small change in settings could provoke significant difficulties. To understand why, let us examine the question we are interested in precisely. Let T be a discrete torus, Zd /(N Z)d for some large N . Let b and e be two points on a torus, and let R be a random walk starting from b and stopped on e. We wish to say something about the loop-erasure of R. The results for Zd all use the fact that the random walk does not intersect itself enough. However, in our settings the random walk does a very long walk — of the order of N d — in a relatively small space, and intersects itself over and over again. Thus it is definitely not true that the random walk and its loop erasure are similar! The random walk is essentially a random set that covers a large portion of the torus. Its loop-erasure is much thinner — as we will see, the expected size is N d/2 . The estimates discussed here may be related to a phenomenon which is of broader interest, that for certain problems, periodic boundary conditions change the answer in a fundamental way. Very roughly, if the problem features long paths with high winding number, then the correct geometry less model to consider is actually the complete graph. A few conjectures of similar nature pertaining to critical Ising model and percolation were communicated to us by Michael Aizenman together with convincing heuristic arguments. To the best of our knowledge, the results of this paper are the first rigorous demonstration of this phenomenon. Let us therefore analyze the complete graph. There are a number of ways it can be done, but our favorite is using the notion of the Laplacian random walk. A Laplacian random walk from b to e, two points on an arbitrary graph G, is constructed inductively by solving, at each step, the discrete Dirichlet problem f (e) = 1, f |γ ≡ 0, f |G\(γ ∪{e}) ≡ 0,

(1)

where γ is the partially constructed path and is the discrete Laplacian. The walk then continues to the next point using f as weights. This model was suggested in [LEP86] and was shown to be equivalent to loop-erased random walk in [L87] (though the core Markov property is already in [L80]). The case of the complete graph is very easy to analyze, since if the partially constructed curve γ has length i then   v∈γ 0 , f (v) = 1 v=e   1 otherwise i+1 and then the probability of the walk to terminate in the next step is closed formula P(# LE(R) = k) = k/n

k−1 i=2

1−

i+1 . n

i+1 n .

This gives a

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

259

√ √ In particular, we see that the correct scaling is N and that # LE(R)/ N converges 2 to a limiting distribution with density te−t /2 . Unfortunately, we do not know how to analyze more interesting graphs using the Laplacian random walk, nor can we show the existence of a limiting distribution for # LE(R) on, say, the torus 1 . have a good basis to claim that mean field behavior in our case should be √ Thus wed/2 |T | = N . For d < 4 this does not happen — indeed known results for d = 1 (trivial) and d = 2 ([K00a, K00b], see also [LSW04]) and computer simulations for d = 3 [GB90] show that even a single branch of the loop-erased walk is too big2 (the sizes of a loop-erased random walk reaching to distance N are, respectively, N , N 5/4 and N 1.62±0.01 ). We shall show that mean field behavior does occur for d > 4. In the critical dimension itself, we can only show an upper bound, and we do not calculate the precise logarithmic correction (we do have some good evidence for a conjecture on the precise logarithmic correction needed — log1/6 N — see the end of this section). Namely, our results are Theorem 1. If d > 4 then a loop-erased random walk L on the (N, d)-torus starting from a point b and stopped when hitting a point e has the estimate P(#L > λN d/2 ) ≤ Ce−cλ . If d = 4 then P(#L > λN 2+ ) ≤ Ce−cλ ∀ > 0. Where the constants C and c may depend on d and on . Theorem 2. Let d ≥ 5. Let b be a point in T = TNd and let e be a random, uniform point in T . Let R be a random walk on T starting from b and stopped at e. Let λ ≥ N −1/2 . Then P(# LE(R) ≤ λN d/2 ) ≤ Cλ log λ−1 . Returning to the cases of d ≤ 3, we see that the reason for non-mean-field behavior is strong local intersections and these increase the size of the loop-erased walk. Therefore we are tempted to conjecture Conjecture. Let G be a vertex transitive finite graph, and let b and e be two random points in G. Let R be a random walk starting from b and stopped when hitting e. Then E# LE(R) ≥ c |G|. A graph G is vertex transitive when, for every two vertices v and w there exists a graph automorphism of G carrying v to w. The requirement that G is vertex transitive is supported by the standard “extreme non-transitive” example of a tree of size N , where the loop-erased random walk between b and e is of course the only path between b and e and its length is bounded by C log N . We wish to end this introduction with one last conjecture. Returning to the analysis of the complete graph using the Laplacian random walk, we note that this analysis does 1 As this paper was being prepared to print, this was proved (in dimension 5 and above) by Peres and Revelle, see [PR]. 2 We believe that the growth exponents in d = 2, 3 are the same on Zd and T d , but this is beyond the N scope of this paper.

260

I. Benjamini, G. Kozma

not change by much if one considers α-power Laplacian random walk, which is a walk one gets if one takes as weights for any step the function f α , where f is defined by (1)— this generalization was also discussed in [LEP86], and has since been investigated non-rigorously by physicists, see e.g. [H02]. For the complete graph we get that the size of a typical path is N 1/(1+α) . We ask: is this behavior replicated in a d-dimensional torus for d > dαcrit ? 2(1+α) Conjecture. For α ≤ 1, dαcrit = 2(1+α) the typical path of a α , i.e. for any d > α α-weighted Laplacian random walk on a d-dimensional torus is of size N d/(1+α) while for smaller d’s this does not hold.

We have no good conjecture on the value of the critical dimension for α > 1, though it does seem (again, we have no proof of that) that for α = ∞ (which corresponds to a deterministic process which simply proceeds to the point where f attains its maximum) the process gives a straight line from b to e in all dimensions, so one might say the critical dimension is 1. Also note that we do not believe this also describes the critical dimension in Zd . 1.1. About the proof. The basic question behind the solution is “what is the probability of a random walk of length L will hit a loop-erased walk of length L?” (in dimension 4 we need to differentiate between these two lengths, but only by a sub-polynomial factor). When the probability is larger than some constant c > 0, then this is the L we seek, as this means that the probability of a loop-erased random walk to go further than λL is exponentially small in λ. Since a loop-erased walk is a complicated object, let us first ask “what is the probability of a random walk of length L will hit some set of size L?” This probability is largest when is rather spread out. Take as an example to be a random collection of points on the torus. It is easy to calculate the expected number of intersections of a random walk with and the second moment and to derive from both the estimate that the probability is ≈L2 N −d , so this gives that the L we look for is 4 two or at most three steps are necessary to get the true estimate, N d/2 . This argument is done in Lemma 1. 1.2. Reading recommendations. Section 2 is probably the one deserving most attention. While the main ideas are sketched above, the devil is in the details and the interested reader might want to read through the proof and do √ the “exercise” — not so designated explicitly — of simplifying the proof with a cost of log in the final result. Section 3 is technical and most readers would probably agree that the conclusion (Theorem 3) is not surprising. The proof of Lemma 5 is the core — as for Lemma 4, you might opt to read its statement but skip its proof. And again, verify that the claim is trivial if one is willing to lose a factor of log (the argument is contained in the first half-page of the proof of

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

261

Lemma 5). Section 4 contains the proof of Theorem 2 and is quite short. While there are alternative, more complicated approaches that might prove a little more we have not included them. There are some comments and hints at the end of Sect. 4 — we hope they make at least some sense. We have collected some well known and unsurprising facts we use (and their proofs) in the Appendix. We hope this makes the paper more accessible to non-experts and students. Lemmas with numbers like “A.7” are to be found in the appendix.

1.3. Standard notations. In the sequel we denote by C and c positive constants which may depend on the dimension but on nothing else. C will usually pertain to constants which are “large enough” and c to constants which are “small enough”. The notation x ≈ y is a short-hand for cx ≤ y ≤ Cx. In dimension 4 we shall prove only imprecise estimates, namely that the length of the loop-erased walk is < N 2+ . All constants C and c may depend on this as well. Similarly, all constants implicit in notations such as O and ≈ might depend on d and . Occasionally we shall number constants for clarity. When we write log x we always mean max{log x, 1} and log 0 = 1. The (N, d)-torus, denoted by TNd is the set Zd /(N Z)d endowed with the graph structure derived from Zd and the distance derived from the l2 norm on Zd . The distance of v and w will be denoted by |v − w|, while distance of sets will be denoted by d(·, ·). A ball of radius r and center v in either Zd or TNd will be denoted by B(v, r) and its inner boundary (namely, all points in B with an edge leading outside of B) by ∂B(v, r). 2. The Upper Bound We will need to examine the effect of adding a section to a path and how it might increase the length of its loop-erasure. We shall always assume that the section we add starts at 0, so that we are looking at a path γ : {−m, . . . , n} → T and define, in addition to the usual loop-erasure of γ , which we will denote by LE(γ ), the continued loop-erasure, which we shall denote by LE+ (γ ). Here are both definitions: Definition. For a finite path γ : {−m, . . . , n} → T in a graph T we define its loop erasure, LE(γ ), which is a simple path in T , by the consecutive removal of loops from γ . Formally, LE(γ )0 := γ (−m), LE(γ )i+1 := γ (ji + 1) ji := max{j : γ (j ) = LE(γ )i }. Naturally, this is defined for all i such that ji < n. The continued loop-erasure is a subset of LE(γ ) defined by LE+ (γ )i := LE(γ )I +i I := min{i : ji ≥ 0}. The notations LE(γ [A, B]) and LE+ (γ [A, B]) stand for the loop-erasure and continued loop-erasure of the segment of γ going from A to B. When we write −∞ in place of A we just mean the beginning of the path, nothing more. Definition. Let d ≥ 4 be the dimension and let N ∈ N. Let R be a path in T = TNd such that the negative part is fixed and the positive part is a random walk on T . Let

262

I. Benjamini, G. Kozma

b = R(0). Let v ∈ T and 0 < r < 18 N , and assume for simplicity that b ∈ B(v, 2r). Let ti be stopping times defined by t0 = 0 and then inductively t2i+1 := min{t ≥ t2i : R(t) ∈ ∂B(v, 2r)}, t2i := min{t ≥ t2i−1 : R(t) ∈ ∂B(v, 4r)} .

(2)

Let f : R → R be an increasing function. Then we say that the (d-dimensional) random walk has the f -property if one has P(#(LE+ (R[−∞, ti ]) ∩ B(v, r)) > λf (r) | R[t2j , t2j +1 ]∀j ) ≤ Ce−cλ

(3)

which should hold for every such v and r, every λ > 0, every i ∈ N and any path we put in the negative portion of R. The conditioning here, in words, is on any arbitrary set of paths between t2j and t2j +1 , and in particular on the points R(ti ) themselves. Notice that we do not condition on the value of the ti ’s. Let us remark that for the proof of the upper bound it is enough to consider the case where R has no negative part, and then LE+ ≡ LE. Lemma 1. Let d ≥ 4. Then 1. If the d-dimensional random walk satisfies the r α logβ r-property for r α logβ r r d−2 log−3 r, then it also satisfies the r α/2+1 log(β+3)/2 r-property. d−2 log−3 r-property, then it also 2. If the d-dimensional √ random walk satisfies the r d/2 satisfies the r log log r-property. 3. If it satisfies the r α logβ -property for r α logβ r r d−2 log−3 r, then it satisfies the r d/2 -property. Case 2 is not really necessary for the proof of the theorem, we include it here mainly for completeness. Proof. Denote the function given to us (e.g. r α logβ r) by f (r) and the result (e.g. r α/2+1 log(β+3)/2 r) by g(r). Let ti be the stopping times from the definition of the f -property. The main part of the lemma will consider the events in R[ti , ti+1 ] for some particular odd i. Therefore let us fix i > 0. Denote Li,v,r := #(LE+ (R[−∞, ti ]) ∩ B(v, r)). Clearly L2i+1,v,r ≤ L2i,v,r so if we prove the lemma for all i odd it will also hold fori even. To fix notations, we consider the time span [−∞, ti ] as the “past” and ti , ti+1 is the “present”. We start by examining the past. Let w ∈ B(v, r) and s ≤ 18 r. The first step is to show that (3) holds if we replace the ball but keep the stopping times, i.e P(#(LE+ (R[−∞, ti ]) ∩ B(w, s)) > λf (s) | R[t2j , t2j +1 ]∀j ) ≤ Ce−cλ .

(4)

We generalize the notation Li,v,r to Li,w,s := #(LE+ (R[−∞, ti ]) ∩ B(w, s)), that is, again, the loop-erased random walk inside a smaller ball measured at the stopping times pertaining to the larger ball. Here our conditioning by everything outside the ball is crucial. Let Kj ∈ N be some arbitrary numbers, and let γj,k be paths (1 ≤ k ≤ Kj ) in B(v, 4r) \ B(w, 2s) such that γj,1 is a path going from R(t2j −1 ) ∈ ∂B(v, 2r) to ∂B(w, 2s), γj,k for 1 < k < Kj is a path from ∂B(w, 4s) to ∂B(w, 2s) and γj,Kj is a path from ∂B(w, 4s) to R(t2j ) ∈ ∂B(v, 4r). If Kj = 1 then let γj,1 be a path from R(t2j −1 ) to R(t2j ). Then we can sum

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

263

over all such combinations of K and γ as follows. Denote by X the event Li,w,s > λf (s). Let YK,γ be the event that for all j , the random walk on [t2j −1 , t2j ] follows γj,1 until ∂B(w, 2s), then stays within B(w, 4s), then follows γj,2 , etc. until finally exiting from B(v, 4r). Then P(X | R[t2j , t2j +1 ]∀j ∩ YK,γ ) · P(YK,γ | R[t2j , t2j +1 ]∀j ) P(X | R[t2j , t2j +1 ]∀j ) = K,γ

≤ Ce−cλ

P(YK,γ | R[t2j , t2j +1 ]∀j ) = Ce−cλ .

(5)

K,γ

(i−1)/2 Of course, we used the f -property for w, s and the index j =1 Kj ; and the fact that B(w, 4s) ⊂ B(v, 2r). The inequality (4) is not as useful as it should be since most balls of radius s (for s r) are empty anyway. However, another consequence of the conditioning is the fact that (4) is independent from the event LE+ (R[−∞, ti ]) ∩ B(w, 4s) = ∅. The reason is that Li,w,4s = 0 if and only if the segment inside B(w, 4s) is cut “from the root”, i.e. for some u1 < u2 < · · · < u2n , n ∈ {1, 2, . . . } we must have R[u2i−1 , u2i ] ∩ B(w, 4s) = ∅ and R(u2i ) = R(u2i+1 ). Whether this happens in the positive or negative part of R is immaterial — in both cases this is an event that happens outside B(w, 4s), therefore it is an event we condition on. We get P(Li,w,s > λf (s) | R[t2j , t2j +1 ]∀j ) ≤ Ce−cλ P(Li,w,4s = 0 | R[t2j , t2j +1 ]∀j ) . (6) Let γ = γi be the (chronologically) first G elements of LE+ (R[−∞, ti ]) ∩ B(v, r), where G is some number. If LE+ (R[−∞, ti ]) ∩ B(v, r) contains less than G elements, take γ = LE+ (R[−∞, ti ]) ∩ B(v, r). Equations (4) and (6) allow us to get a “secondorder estimate for γ ”. By this we mean the quantity Vs := #{w1 , w2 ∈ γ : |w1 − w2 | ≤ s} which has the estimate P(#γ > δELi,v,r and Vs > λ log(s/δ)f (s)#γ ) ≤ Ce−cλ

(7)

for any parameters λ > 0 and 0 < δ < 1. Before starting the proof of (7) let us just remark that the first condition and the variable δ are unfortunate technicalities. The “essentials” of (7) are really the stronger claim P(Vs > λ(log s)f (s)#γ ) ≤ Ce−cλ , but we don’t know how to prove it. Also note that it is rather easy to show P(Vs > λ(log r)f (s)#γ√) ≤ Ce−cλ , saving us all the mucking with δ later on, but this inequality will cost us a log r in the final result of Theorem 1. Proof of (7). Cover B(v, r) by balls {Bj } of radius 2s such that any two points of distance ≤ s are inside at least one Bj , and such that each point is covered ≤ C times. Examine one Bj = B(wj , 2s). We have (not writing the “| R[t2j , t2j +1 ]∀j ” for brevity) ELi,v,r > c

j

(6)

P(Li,wj ,8s > 0) ≥ cecλ

P(Li,wj ,2s > λf (s)) ∀λ.

j

Denote by Xµ the total volume of the balls Bj , where Li,wj ,2s > µf (s) and get EXµ ≤ Ce−cµ s d ELi,v,r . This gives, using P(Xµ > ecµ EXµ ) ≤ e−cµ , P(Xµ > Cs d e−cµ ELi,v,r ) ≤ e−cµ

∀µ,

264

I. Benjamini, G. Kozma

and shoving in #γ in a way that might look, for now, a little artificial, we get P(#γ > δELi,v,r and Xµ > Cs d δ −1 e−cµ #γ ) ≤ e−cµ

∀µ.

Taking µk = λ log(s/δ) + Ck and assuming that λ > C for some C sufficiently large (as we may, without loss of generality), we get P(#γ > δELi,v,r and for some k, Xµk > c−k #γ ) ≤ Ce−cλ Now, since Vs ≤ j #(γ ∩ Bj ) · Li,wj ,2s , then Vs ≤ #γ (λ log(s/δ) + C)f (s) +

∞

.

(8)

Xµk (λ log(s/δ) + Ck)f (s).

k=1

If it happens that Xµk ≤ c−k #γ for all k, i.e. the opposite of the second half of the event in (8), then Vs ≤ λ log(s/δ)f (s)#γ +

∞

(c−k #γ )f (s)(λ log(s/δ) + Ck)

k=1

≤ Cλ log(s/δ)f (s)#γ , 1 r but (7) holds for larger s too and we get (7). This argument works for any s ≤ 16 (there’s not much point in s > 2r of course) — we only have to pay in the constant C.

We want (7) to hold not for one particular s but for all s and the simplest version of such an inequality is P(#γ > δELi,v,r and ∃s s.t. Vs > λ log2 (s/δ)f (s)#γ ) ≤ Ce−c1 λ

(9)

which follows from using (7) with λs := λ log(s/δ) and summing over s. Continuing the proof of the lemma, it is now time to examine the present. We keep the notations of G, γ and Vs . For an odd i we want to estimate the probability pi := P(R[ti , ti+1 ] ∩ γ = ∅)

.

Lemma A.5 allows us to consider a unconditioned random walk starting from R(ti ) and stopped on ∂B(v, 4r) instead of R. Denote it by R . Denote by Xi the number of intersections of R with γ , so pi ≈ P(Xi > 0). We have E(Xi | past) =

ti+1

P(R (t) = w | past)

.

t=ti w∈γ

For r 2 ≤ t − ti ≤ 2r 2 we have for half of the w ∈ B(v, r) that P({R (t) = w} ∩ {t < ti+1 }) > cr −d (“half of the w’s” means that we need t − ti + ||w − R(ti )||1 to be even, otherwise the probability is zero). Therefore E(Xi | past) > cr 2−d #γ

.

(10)

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

265

Next estimate E(Xi2 | past). Assume until further notice that Vs ≤ λ log2 (s/δ)f (s)#γ for some δ and λ and for all s. Then E(Xi2 | past) = P(R (ti ) = wi ) ≤ t1 ,t2 ,w1 ,w2 ∞ ∞

P R(t) = w1 , R(t + ) = w2 ,

≤2

=0 k=0 t,w1 ,w2

√ √ k ≤ |w1 − w2 | < (k + 1) .

(11)

√ Examine one couple of w1 , w2 ∈ γ with k ≤ |w1 − w2 |. Remembering the independence of the past from the present we can estimate the probability of one summand with a standard estimate on the end point of a random walk of length starting from w1 . We get P(R(t) = w1 , R(t + ) = w2 ) ≤ Cr −d −d/2 e−k

2 /2

.

We sum over all t. Since, easily, P(ti+1 − ti > ) ≤ Ce−c/r and since E(ti+1 − ti | ti+1 − ti > ) ≤ C max{r 2 , }, we get 2 2 P(R(t) = w1 , R(t + ) = w2 ) ≤ Ce−c/r max{r 2 , }r −d −d/2 e−k /2 . 2

t

Plugging this into (11) we get E(Xi2 | past) ≤ C

∞ ∞

e−c/r max{r 2 , }r −d −d/2 e−k 2

2 /2

V(k+1)√ .

(12)

=0 k=0

For all our functions f (that is, all the specific functions we named in the statement of the lemma) we have ∞

e−k

2 /2

V(k+1)√ ≤ λ#γ

∞

e−(k−1)

2 /2

√ √ f (k ) log2 (k /δ) ≤

k=1

k=0

√ ≤ Cλ#γf ( ) log2 (/δ)

.

Similarly, for all our functions f we have ∞

√ 2 e−c/r max{r 2 , }−d/2 f ( ) log2 (/δ) ≤

=0 2

≤ Cr

2

r

√ −d/2 f ( ) log2 (/δ)

.

=0

Equations (12) and (13) give 2

E(Xi2 | past)

≤ Cλr

2−d

#γ

r =0

√ −d/2 f ( ) log2 (/δ)

(13)

266

I. Benjamini, G. Kozma

and then with (10) and the standard inequality P(X > 0) ≥ (EX)2 /EX2 we get r 2−d #γ P(Xi > 0 | past) > c √ λ −d/2 f ( ) log2 (/δ)

(14)

.

This inequality is the heart of the proof. We recall that we assumed Vs ≤ λ log2 (s/δ) · f (s)#γ to get it. Fix G = µg(r), where µ > 1 is some variable which we will fix

later and where g is as defined in the beginning of the lemma. Let H = 2 g(r)r −2 , where · is the integer value. Let X1 = X1 (µ) be the event that #γi = G, let X2 = X2 (λ, δ, µ) be the event that Vs ≤ λ log2 (s/δ)f (s)G for all s (λ and δ are two additional variables) and let X3 = X3 (µ) be the event that R[tj +1 − tj ] ∩ γ = ∅ for all odd i ≤ j ≤ i + H . The events comprising X3 are (conditioning on the R(tj )) independent, therefore we may use (14) 21 H times to get

r 2−d µg(r) P(X3 | X1 ∩ X2 ) ≤ 1 − c 2 √ λ r=1 −d/2 f ( ) log2 (/δ) cµ ≤1− . λ log2 δ −1

1H 2

(15)

To see the rightmost inequality in (15), for each of the cases in the formulation of the lemma, apply the corresponding f and g and estimate the sum. Indeed, (15) is the inequality that governs the connection between f and g. Note that the formulation of the lemma is a little lax: if f (r) = r α logβ r with √ α > d − 2 then we can actually prove the lemma with g = r α/2 log(β+2)/2 , i.e. one log r factor better than the formulation √ of the lemma. This additional log r factor is here only for the case α = d − 2 and β < −3. Have no fear — this factor will disappear in the conclusion of Theorem 1. The proof of the lemma will now follow by induction over i. We use a “jumping induction” that assumes that for some k and K we have the inequality P(Li,v,r > νg(r)) ≤ Ke−kν for all ν > 0 and then proves the same for Li+H,v,r (the case i = 0 needs no explanation). Therefore we need first to calculate how much Li,v,r can change in between. Clearly, if R([tj , tj +1 ]) does not intersect LE([R[0, tj ]) then Lj +1,v,r − Lj,v,r ≤ tj +1 − tj

.

These variables have the simple estimate P(tj +1 − tj > νr 2 ) ≤ Ce−cν

(16)

irrespectively of R(tj +1 ) and R(tj ) for all j odd. Denote by Ai the sum of 21 H of those, and get a similar estimate (see Lemma A.9): P (Ai > νg(r)) ≤ Ce−c2 ν

Ai :=

i+H

tj +1 − tj

.

(17)

j =i j odd

Next we make the following important assumption: G > δELi,v,r

∀i.

(18)

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

267

Actually, we want it to be true independently of the value of µ, so we really need g(r) > δELi,v,r . This holds for δ sufficiently small, but it is inconvenient to fix the value of δ at this point, as it depends on some constants (depending on d only) which are determined only later. Therefore we shall perform the necessary calculations with δ a variable and finally fix its value as some constant when we have all the information at hand, see (20). With a value of δ satisfying (20), or smaller, (18) will hold. It is time to compare Li,v,r with Li+H,v,r . Li+H,v,r might be larger than νg(r) for the simple reason that Ai is very large. Let τ ≤ ν be yet another variable describing what “very large” means and we may estimate this phenomenon simply by ν

P({Li,v,r > (ν − n − 1)g(r)} ∩ {Ai > ng(r)})

.

n=τ

“Simply” because we ignore any effect of intersections. If, however, Ai is not as large we need both Li,v,r to be rather large, and X3 , i.e. to have no intersections with a path of length G = µg(r) during the last H “moves”. We need to assume µ + τ < ν for this to make sense, and this assumption holds until (19) below and we will not repeat it. All in all we get P(Li+H,v,r > νg(r)) ≤ P({Li,v,r > (ν − τ )g(r)} ∩ X3 ) ν + P({Li,v,r > (ν − n − 1)g(r)} ∩ {Ai > ng(r)}) ∀i, ν, τ, µ n=τ

(the parameter µ hides in the definition of X3 ). For the first summand we have by (9), (18), (15) and the induction hypothesis that P({Li,v,r > (ν − τ )g(r)} ∩ X3 ) ≤ ≤ P({Li,v,r > (ν − τ )g(r)} \ X2 ) + P({Li,v,r > (ν − τ )g(r)} ∩ X3 ∩ X2 ) cµ ≤ Ce−c1 λ + Ke−k(ν−τ ) 1 − ∀i, ν, τ, λ, µ, δ, λ log2 δ −1 and estimating the other summands using (17) we get cµ P(Li+H,v,r > νg(r)) ≤ Ke−k(ν−τ ) 1 − + Ce−c1 λ + λ log2 δ −1 ν Ke−k(ν−n−1) · Ce−c2 n ∀i, ν, τ, λ, µ, δ. +

(19)

n=τ

Having arrived at this closed formula, we only need to pick our variables carefully. First pick τ = C log δ −1 for some C sufficiently large. This will give, if k < c2 /2, that ν n=τ

Ke−k(ν−n−1) · Ce−c2 n ≤ C

−1

e−C log δ Ke−k(ν−τ ) ≤ CδKe−k(ν−τ ) 1 − e−c2 /2

.

Next we pick λ = Cν and µ = 21 ν, and the requirement µ + τ < ν translates to ν > C log δ −1 . We get from everything that

−1 1 − log2cδ −1 + Cδ + Ce−cν . P(Li+H,v,r > νg(r)) ≤ Ke−kv ekC log δ

268

I. Benjamini, G. Kozma

Pick k = c log−3 δ −1 and get, for δ sufficiently small and ν > C log δ −1 that c . P(Li+H,v,r > νg(r)) ≤ Ke−kν 1 − log2 δ −1 Pick K sufficiently large so that the inequality P(Li,v,r > νg(r)) ≤ Ke−kν will hold trivially for ν ≤ C log δ −1 — notice that because k = c log−3 δ −1 we have that K does not depend on δ — and our induction is complete. With these k and K, the inequality P(Li,v,r > νg(r)) ≤ Ke−kν is preserved from i to i + H and since it clearly holds for i ≤ H then it holds for all i. Is this the end of the lemma? Almost. We still need to justify the assumption (18). The estimate P(Li,v,r > νg(r)) ≤ Ke−kν gives ELi,v,r ≤ g(r) Kk ≤ Cg(r) log3 δ −1 . Therefore (remember that G > g(r)) the assumption reduces to the inequality g(r) > g(r) · (Cδ log3 δ −1 )

(20)

.

Taking δ sufficiently small this will hold, and the lemma is proved.

Lemma 2. The d-dimensional random walk has the f -property for r d/2 d > 4 . fd (r) := 2+ r d=4

(21)

Proof. Trivially, the d-dimensional random walk has the r d -property. Therefore we may apply Lemma 1 twice for d > 6, thrice for d = 6 or 5 and log −1 times for d = 4. Proof of Theorem 1. Lemma 2 gives P(Li,v,r > λf (r)) ≤ Ce−cλ . where Li,v,r = #(LE(R[0, ti ]) ∩ B(v, r)) for any v and r satisfying b ∈ B(v, 2r), where f is defined by (21). Note that at this point we do not need the formulation in terms of continued process, and we may set the negative part of R to empty. If in addition e ∈ B(v, 4r), then the event that R is stopped between tI and tI +1 is external to the ball, therefore we get that (21) holds for I . Since the section of the walk from tI until the time when R hits e can only decrease LE(R) ∩ B(v, r) we get P(#(L ∩ B(v, r)) > λf (r)) ≤ Ce−cλ . However, we can cover our torus by balls B(vi,j , N 2−i ) with the property b, e ∈ B(vi,j , 4N 2−i ) and with the number of j ’s corresponding to each i bounded by a constant. Therefore for some constant c3 sufficiently small we have P(#L > λf (N )) ≤ P(∃i, j s.t. L ∩ B(vi,j , r) > c3 λ2i/4 f (r))

c log N

≤

Ce−cλ2

i/4

≤ Ce−cλ

.

(22)

i=0

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

269

Remark. The same techniques can be improved to show that P(#L > λf (N )) ≤ Ce−cλ , 2

where f is given by (21). The basic phenomenon behind this estimate is that to get a path of length λf (N ), we need to have that each of the λ sections of the random walk, which are essentially independent, would not intersect any other. Since there are cλ2 couples, the true estimate of the probability is square-exponential, as above. The analysis required to get this estimate is not inherently more difficult than that of the exponential estimate, but is more technical and we decided to represent the simpler exponential estimate. On the other hand, we are not aware of a simpler version of the proof that gives an estimate of the decay of the probability worse than exponential. This follows from the recursive character of the proof. Thus, Lemma 1 may be simplified by removing the requirement that the probability decays exponentially, but it then cannot be used recursively to get a reasonable final result. Similarly, the very strong independence condition in Lemma 1, that the probability estimate inside every ball is independent of everything that happens outside the ball, cannot be relaxed without destroying the ability of the lemma to be used recursively. We wish to reiterate that the only major simplification we are aware of of this proof is the one discussed after (7). It saves the discussion after (5), i.e. the one leading √ to (6), as well as each and every appearance of the parameter δ. The cost is an added log factor in the formulation of the theorem. Conjecture. The accurate upper bound in dimension 4 is N 2 log1/6 N . The method above may be refined in many points and an estimate of the type N 2 logα N may be achieved for rather small α’s. However, a fundamental difficulty is the fact that the sum in the denominator of (14) truly depends on N , which means that the second moment methods used here alone cannot give a precise result. 3. Absolute Times The proof of the lower bound is, as will be seen in Sect. 4, quite simple once a good estimate of the upper bound is available. Actually, one might think about the recursive nature of the proof of the upper bound in the following terms: “the proof of the upper bound was only possible once a good estimate of the upper bound was available”. Unfortunately, we were not able to get a reasonable proof of the lower bound using only Lemma 1. The problem is that we need to know what happens at absolute times, i.e. to fix some t and get an estimate for LE(R[0, t]). Calculations true for ti do not hold automatically for a fixed t. Apriori, one cannot rule out behavior such as “the loop-erased random walk is much denser if t is divisible by 1024”, since the ti ’s might avoid those “bad absolute times”. The purpose of this section is to show that this ridiculous behavior does not occur. The first step is to learn something about the distribution of the ti ’s. Since ti is a sum of the return times to some sphere, and these return times are more-or-less independent, we would expect a central limit theorem. We don’t need something so precise — we shall prove below (Lemma 4) a large deviation estimate of the sort one would expect from a Gaussian variable, and this will be enough. We start with

270

I. Benjamini, G. Kozma

Lemma 3. Let X1 , . . . , Xn be variables with the properties P(|Xi | > λ | X1 , . . . , Xi−1 , Xi+1 , . . . , Xn ) ≤ Ce−cλ , E(Xi1 · · · Xik | Xik+1 , . . . , Xil ) ≤

k

C exp(−c min |ij − im |), 1≤m≤l m=j

j =1

(23) (24)

where (24) needs to hold only for i1 , . . . , il all different. Then for all λ < cn1/4 ,

√ 2 P Xi > λ n ≤ Ce−cλ . We interpret the condition (24) in the case k = l = 1 as saying EXi = 0 for all i. In the case k > 1, we call (24) a “pseudo independence” relation, because, rather than claiming that E Xi = 0, as we would have for independent variables, we get that it is exponentially small in the distance, so that if the ik ’s are relatively sparse, it will be extremely small. Actually, it is possible to replace exp(−ck) with any sequence ak with ak < C. The proof is a pretty standard exercise: a calculation (which can be done either directly or√by comparing to the case of independent exponential variables) can show that for k < c n, E

2k Xi

≤ (Ckn)k

.

Taking k = cλ2 and using Markov’s inequality will give the lemma. We skip the gory details. Lemma 4. Let b ∈ TNd and let R be a random walk on T starting from b. Let C < r < 1 d 8 N, v ∈ TN and let ti be the stopping times defined by (2). Then there exists numbers E = E(r) ≈ N d r 2−d and σ = σ (r) ≈ E such that √ 2 P(|tn − nE| > λσ n) ≤ Ce−cλ

(25)

for all n ∈ N and λ < cn1/4 . Proof. The point is of course to show that the variables ti+1 − ti are pseudo independent and apply Lemma 3. The first thing to note is that the distributions of R(ti ) converge exponentially. Let q1 and q2 be two distributions on ∂B(v, 2r), and denote :=

|q1 (x) − q2 (x)|

.

x∈B(v,2r)

Let Rµ , µ = 1, 2 be random walks starting from a point on ∂B(v, 2r) chosen with the distribution qµ and stopped when hitting ∂B(v, 4r). Let pµ be the distributions on the hit points of Rµ . Then p1 (w) − p2 (w) =

x∈∂B(v,2r)

(q1 (x) − q2 (x))π(x, w),

(26)

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

271

where π(x, w) is the probability of a random walk starting from x to hit w. Let A+ ⊂ ∂B(v, 2r) be the set where q1 (x) ≥ q2 (x), and define D + (w) = |q1 (x) − q2 (x)|π(x, w) . x∈A+

Clearly w∈∂B(v,4r)

D + (w) =

|q1 (x) − q2 (x)|π(x, w) =

x∈A+ w

1 2

and similarly for D − defined equivalently using A− := ∂B(v, 2r)\A+ . Furthermore, the inequality π(x, w) ≈ r 1−d (see Lemma A.4) gives that D ± (w) ≈ r 1−d and therefore |D + (w) − D − (w)| ≤ (1 − c)(D + (w) + D − (w)) for some constant c > 0. This gives |p1 (w) − p2 (w)| = |D + (w) − D − (w)| w

w∈∂B(v,4r)

≤ (1 − c)

D + (w) + D − (w) = (1 − c)

(27)

w

and we see that the L1 distance between the distributions has contracted. An identical calculation works when the random walk starts from ∂B(v, 4r) and stops at ∂B(v, 2r) (see the remark following Lemma A.4) therefore we see that there is only one limiting distribution as i increases, and that the L1 distance to this distribution decreases expoµ nentially with i. In other words, if ti are stopping times defined by (2) for the walks Rµ then we get |P(R1 (ti1 ) = w) − P(R2 (ti2 ) = w)| ≤ e−ci . (28) w

This

L1

estimate allows to get a uniform estimate for every w and i > 0: |P(R1 (ti1 ) = w) − P(R2 (ti2 ) = w)| ≤ Ce−ci min P(Rµ (ti ) = w). µ

µ=1,2

(29)

µ

Indeed, take the distributions of Ri−1 as the qµ ’s in (26) and together with (28) and π(x, w) ≤ Cr 1−d get that |P(R1 (ti1 ) = w) − P(R2 (ti2 ) = w)| ≤ Cr 1−d e−ci

.

µ P(Rµ (ti )

In the other direction, π(x, w) ≥ cr 1−d gives = w) ≥ cr 1−d and we get (29). To make notations simpler, let Bi be ∂B(v, 2r) if i is odd and ∂B(v, 4r) if i is even. Now, each ti+1 − ti has an exponential distribution3 , with its expectation being less than or equal to Cr 2 i is odd Ui := (30) CN d r 2−d i is even 3 For i even, t 2 i+1 − ti has a rather large (>c) probability to be very small, of the order of r . However, since there is also a probability >c to escape B(v, 21 N), this fact has negligible impact on the moments of ti+1 − ti .

272

I. Benjamini, G. Kozma

even after conditioning on the entry and exit points. In a formula, P(ti+1 − ti > λUi | R(ui ) = y1 and R(ui+1 ) = y2 ) ≤ Ce−cλ

(31)

for every y1 ∈ Bi and y2 ∈ Bi+1 (see Lemmas A.8 and A.11). Define the variables Xi := (ti+1 − ti − E(ti+1 − ti ))/U0

.

We wish to use Lemma 3 for the Xi ’s. To get (23) we use (31) to see that E(ti+1 −ti )/U0 ≤ CUi /U0 ≤ C and then use (31) again to get P(Xi > λ | R(ti ), R(ti+1 )) ≤ Ce−cλ

(32)

.

Denote by X the event X1 , . . . , Xi−1 , Xi+1 , . . . , Xn and then P(Xi > λ | X ) = EP(Xi > λ | R(ti ), R(ti+1 ), X ) = EP(Xi > λ | R(ti ), R(ti+1 )) ≤ ECe−cλ = Ce−cλ , where the expectation above is with respect to R(ti ) and R(ti+1 ). This gives (23). The argument for (24) requires the convergence of the distributions. Start with the case of one i. Denote by Y the event R(ti ), R(ti+1 ) and by Z the event R(ti− ), R(ti+1+ ) for some ∈ {0, 1, . . . }. Then P(Y = y | Z) · E(Xi | Y = y) E(Xi | Z) = y∈Bi ×Bi+1

=

P(Y = y | Z) − P(Y = y) · E(Xi | Y = y)

y∈Bi ×Bi+1

≤C

P(Y = y | Z) − P(Y = y),

(33) (34)

y∈Bi ×Bi+1

where the equality (33) is due to EXi = 0. Denote by πk (w, x) the probability to start from w and hit x after k moves of going from Bj to Bj +1 . In a formula πk (w, x) := P(R(uj +k ) = x | R(uj ) = w)

.

Of course, we mean that if w ∈ ∂B(v, 2r) then we take j odd and in the opposite case we take j > 0 even. Other than that the value of πk is independent of j . With these notations we get P(Y = (y1 , y2 )) = P(R(ti ) = y1 )π1 (y1 , y2 ), π (z1 , y1 )π1 (y1 , y2 )π (y2 , z2 ) P(Y = (y1 , y2 ) | Z = (z1 , z2 )) = , π2+1 (z1 , z2 ) so

|P(Y = y | Z = z) − P(Y = y)| ≤ π1 (y1 , y2 ) |P(R(ti ) = y1 ) − π (z1 , y1 )| + π (y2 , z2 ) + . (35) − 1 π (z1 , y1 ) π2+1 (z1 , z2 )

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

273

Summing over y the first half of (35) we get π1 (y1 , y2 )|P(R(ti ) = y1 ) − π (z1 , y1 )| y1 ,y2

=

|P(R(ti ) = y1 ) − π (z1 , y1 )| ≤ 2e−c ,

(36)

y1

where the last inequality is due to the exponential convergence of the distributions in the form (28) — take q1 to be the distribution of R(ti− ) and q2 = δ{z1 } (the distance between any two distributions is always ≤ 2). For the second half of (35), we use the form (29) for and get, under the assumption > 0, π (y2 , z2 ) π (z1 , y1 )π1 (y1 , y2 ) − 1 π (z , z ) y1 ,y2

≤ Ce−c

2+1

1

2

π (z1 , y1 )π1 (y1 , y2 ) = Ce−c

.

(37)

y1 ,y2

We used here (29) with q1 = δ{y2 } and q2 the distribution of R(uj ++1 ) | R(uj ) = z1 for the point z2 . Using (36), (37) and (35) in (34) gives E(Xi | R(ti− ), R(ti+1+ )) ≤ Ce−c

(38)

(the case = 0 doesn’t follow from the argumentation above, but can be deduced, say, from (32)). With (38), proving (24) is easy. Let i1 , . . . , il be some integers, all different, and let 1 , min |ij − im | − 1 j = 2 1≤m≤l m=j

so that the intervals ij − j , ij + 1 + j are disjoint. Let X be the event R(ti1 −1 ), R(ti1 +1+1 ), . . . , R(tik −k ), R(tik +1+k )

.

Then conditioning by X the events Xi are independent so we get E(Xi1 · · · Xik | X ) =

k

E(Xj | X ) =

j =1 (38)

≤

k

k

E(Xj | R(tij −j ), R(tij +1+j ))

j =1

Ce−cj ≤

j =1

k

C exp(−c min |ij − im |)

j =1

1≤m≤l m=j

which immediately gives (24) since E(Xi1 · · · Xik | Xik+1 , . . . , Xil ) = E E(Xi1 · · · Xik | X ) Xik+1 , . . . , Xil k ≤E C exp(−c min |ij − im |) Xik+1 , . . . , Xil j =1

=

k j =1

1≤m≤l m=j

C exp(−c min |ij − im |) 1≤m≤l m=j

274

I. Benjamini, G. Kozma

with (23) and (24) established we can invoke Lemma 3 and get P(|tn − Etn | > λU0 ) ≤ Ce−cλ

2

.

Lemma 4 now follows since (29) shows that Et2i+1 −t2i converge exponentially to some Eeven and Et2i+2 − t2i+1 converge exponentially to some Eodd so Etn − n 1 (Eeven + Eodd ) ≤ CU0 2 √ and for λ < C n this translation affects only the multiplicative constant. Therefore taking E = 21 (Eeven + Eodd ) ≈ N d r 2−d and σ = U0 ≈ E we are done. Lemma 5. Let b ∈ TNd and let R be a random walk on T starting from b. Let r < 18 N , v ∈ TNd . Let t ∈ N be some time. Then P(LE(R[0, t]) ∩ B(v, r) > λf (r)) ≤ Ce−cλ ∀λ > 0,

(39)

where f is defined by (21). Proof. Let λ > 0 be some number. We note that we may assume t < λN d since in time λN d the probability to hit b is > 1 − Ce−cλ and in this case the process starts afresh, memoryless. Let ti be stopping times defined by (2). Let E and σ be defined by Lemma 4 so that (25) holds. The first case is λ > C1 log r for some C1 sufficiently large. This case is uninteresting for the following reason: Lemma 4 gives that for some C2 sufficiently large, if

n = C2 λr d−2 then P(tn ≤ t) ≤ Ce−cλ . Let k := max{l : tl ≤ t}. If k is even then LE(R[0, t]) ∩ B(v, r) ⊂ LE(R[0, tk ]) ∩ B(v, r). If k is odd then #((LE(R[0, t]) \ LE(R[0, tk ])) ∩ B(v, r)) ≤ tk+1 − tk and this variable has the estimate (16) so it is uninteresting. Therefore it is enough to calculate the loop-erased at the times tk . We get P(#(LE(R[0, t]) ∩ B(v, r)) > λf (r)) n 1 ≤ Ce−cλ + P(#(LE(R[0, tk ]) ∩ B(v, r)) > λf (r)) 2 k=1

≤ Cne−cλ ≤ Cr d−2 λe−cλ ≤ Cr d−2 e−cλ ≤ Cr d−2−cC1 e−cλ (of course, all c’s in the last line are different). This shows that for C1 sufficiently large— namely, (d − 2)/c, where c is the last c on the last line above, (39) holds. Thus this case is proved. Therefore we shall assume that λ < C log r. Let n± 1 be defined by √ n− 1 := max{n even : t − nE > λσ n}, √ n+ 1 := min{n : t − nE < −λσ n}. Note that

− 3/2 n+ r d−2 . 1 − n1 ≤ Cλ t/E ≤ Cλ

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

275

− + + − n E| > λσ Let E1 be the event |tn− − n− E| > λσ n or |t n+ n 1 1 1 1 . Lemma 4 gives 1

1

us that P(E1 ) < Ce−cλ . We note that under ¬E1 we can “locate” t, tn− < t < tn+ 1 1 √ and the interval is not very large, tn+ − tn− < CN d λ3/2 r 2−d . Let E2 be the event 1

1

# LE(R[0, tn1 ]) ∩ B(v, r)) > λf (r). Lemma 2 gives us that P(E2 ) < Ce−cλ . We continue to define a short sequence of n± j inductively: √ n− j := max{n even : t − tNj−−1 − nE > λσ n}, √ n+ j := min{n : t − tN − − nE < −λσ n}, j −1

Nj± := n± j +

j −1

n− k.

k=1 ± Unlike n± 1 which are just numbers, nj , j

E2i−1 to be the event |tN ± i

> 1 are events depending on R[0, tN − ]. Define i−1 − t| > λσ n± (as before, we mean that either happens). i

Again, we get P(E2i−1 ) < Ce−cλ . Under ¬(E1 ∪ E3 ∪ · · · ∪ E2i−3 ) we have n± i
t − tN −

i−1

E

≤C

tN + − tN − i−1

i−1

E

(41)

≤ C(i)λ2−2

−i+1

r2

−i+1 (d−2)

(40)

(the use of (41) is inductively, for i − 1). The addition of ¬E2i−1 gives tN − < t < tN + i i and (40) −i −i + ni + n− ≤ C(i)N d λ2−2 r (1−2 )(2−d) . (41) tN + − tN − ≤ λσ i i

i

To use Lemma 2, we need to define an auxiliary walk R , Ri (u) = R(u + tN − ) Ri : {−tN − , . . . , t − tN − } → T . i−1

i−1

i−1

In other words, we consider the part of the walk until tN − as fixed, and the part from i−1 tN − to t as the probabilistic part. Of course, the stopping times tj corresponding to Ri i−1

are simply tj = tN − +j . The fact that Ni− is even means that Ri (0) ∈ B(v, 2r) and then i−1 Lemma 2 will give that P(E2i ) ≤ Ce−cλ , E2i := {#(Li ∩ B(v, r)) > λf (r)}, Li := LE+ (Ri [−tN − , tN − − tN − ]), i−1

i

i−1

(42) (43)

Note that we have now defined all the exceptional events Ei : the even ones are (42) and the odd ones have been defined slightly above. When we said that the series n± i is short, we meant that we shall take it until I defined by   d≥7 2 I= 3 d = 5, 6 ,  C log −1 d = 4

276

I. Benjamini, G. Kozma

where is from (21), which we consider as a constant, so I ≤ C. In particular P(E1 ∪ · · · ∪ E2I ) ≤ CI e−cλ ≤ Ce−cλ . The reason for this selection of I is that with this I it is possible to do a simple estimate of the path between tN − and t. For any i we have I

Ni+

< Cλ (t − tN − )/E ≤ Cλ (tN + − tN − )/E

− Ni−

i−1

(41)

2−2−i

≤ Cλ

r

2−i (d−2)

i−1

≤ Cr

2−i (d−2)

i−1

log2 r

(remember that λ < C log r) and for I this gives NI+ − NI− ≤ Cf (r)r −2

.

Therefore we may use (16) NI+ − NI− times, to get P

NI+

tj +1 − tj > λf (r) ≤ Ce−cλ ,

(44)

j =NI− j odd

which of course bounds also #(LE(R[0, t]) ∩ B(v, r)) − #(LE(R[0, tN − ]) ∩ B(v, r)). I Finally, the definitions of Ri , LE, LE+ and Li (43) give LE(R[0, tN − ]) ⊂ L1 ∪ L2 ∪ · · · ∪ LI , I

and assuming ¬(E2 ∪ E4 ∪ · · · ∪ E2i ) we have from (42) that #(LE(R[0, tN − ]) ∩ B(v, r)) ≤ I λf (r) ≤ Cλf (r) I

and with (44) we finally get P(#(LE(R[0, t]) ∩ B(v, r)) > λf (r)) ≤ Ce−cλ and the lemma is proved.

,

Remark. By now the reader would not be surprised to learn that here too, if one is willing to let go of a log factor then the proof gets much simpler. Indeed, the arguments used for the case λ > C log r can be used for any λ to get this result, and for this case one does not need the precise estimates of Lemma 4 either, and the entire section may be reduced to half a page. Theorem 3. Let b ∈ TNd and let R be a random walk on T starting from b. Let t ∈ N be some time. Then P(LE(R[0, t]) > λf (N )) ≤ Ce−cλ ∀λ > 0, where f is defined by (21). The theorem follows from Lemma 5 like Theorem 1 follows from Lemma 2 (cover T by balls, etc.) and we shall omit the proof.

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

277

4. The Lower Bound We will use the concept of a cut time Definition. Let R be a random walk on a graph, possibly with a stopping condition. A time t is called a cut time for R if R[0, t] ∩ R ]t, ∞[ = ∅. Clearly, if t is a cut time then R(t) ∈ LE(R). Further, all R(ti )’s for different cut times ti are different. Therefore it is possible to estimate the length of a loop-erased random walk by counting cut times. Lemma 6. Let d ≥ 5. Let R be a random walk on TNd of length L for some L = N d/2 , sufficiently small and N > N0 (). Let X be the number of cut times of R. Then EX > cL VX < C 2 L2 .

(45)

As usual V denotes the variance, i.e. VX := EX 2 − (EX)2 . Proof. Denote by Et the event that t is a cut time. Easily, 1 − P(Et ) ≤

L t

P(R(s1 ) = R(s2 ))

.

s1 =0 s2 =t+1

Now for |si − t| ≤ N this is identical to the equivalent problem on Zd which is well known (see [L96]) so we get P(R[max{0, t − N}, t] ∩ R ]t, min{L, t + N }] = ∅) < 1 − c

.

For other si we use the easy P(R(s1 ) = R(s2 )) ≤ C min{N 2 , |s1 − s2 |}−d/2

(46)

to get 1 − P(Et ) < 1 − c + C 2 + CN 2−d/2 , therefore for sufficiently small and N sufficiently large we get P(Et ) > c which gives the first part of (45) — EX > cL. For the second part,

we examine the covariance of Et1 and Et2 for some t1 < t2 . Denote t = 21 (t1 + t2 ) and E1 = P(R[0, t1 ] ∩ ]t1 , t] = ∅) E2 = P(R[t, t2 ] ∩ R ]t2 , L] = ∅)

.

We note that E1 and E2 are independent and therefore cov E1 , E2 = 0. On the other hand, summing (46) we get |P(E1 ) − P(Et1 )|

≤ P(R[0, t1 ] ∩ R[t, L] = ∅) ≤

t1 L

C min{N 2 , |s2 − s1 |}−d/2

s1 =0 s2 =t

≤C

t1

|t − s|1−d/2 + N −d/2 ≤ C(|t2 − t1 |2−d/2 + 2 ),

s1 =0

so we get the same for the covariance of Eti , cov Et1 , Et2 ≤ C|t2 − t1 |2−d/2 + C 2

.

Summing these for all ti ’s we get the second half of the lemma.

278

I. Benjamini, G. Kozma

Lemma 7. Let d ≥ 5. Let b ∈ TNd and let R be a random walk on T starting from b. Let t ∈ N, t > N d/2 and λ > N −1/2 . Then P(# LE(R[0, t]) ≤ λN d/2 ) ≤ Cλ. Proof. We may assume without loss of generality that λ ≤ c for some constant. Let C1 be some constant which will be fixed later. Define u := t − C1 λN d/2 (we assume here λ < 1/C1 , as we may). Denote by X the number of cut times in the segment [u, t]. Lemma 6 shows that EX > c(t − u) = cC1 λN d/2 . Pick C1 sufficiently large such that EX > 3λN d/2 . Lemma 6 also gives VX ≤ Cλ4 N d/2 and then P(X ≤ 2λN d/2 ) ≤ Cλ2 . Next we want to estimate P(LE(R[0, u]) ∩ R[u + N 2 , t] = ∅). Define Y = #{LE(R[0, u]) ∩ R[u + N 2 , t]}. If we assume # LE(R[0, u]) ≤ µN d/2 , then because R(u + N 2 ) is distributed ≈ uniformly on T we get E(Y | # LE(R[0, u]) ≤ µN d/2 ) ≈ N −d (# LE(R[0, u]))(t − u − N 2 ) ≈ µλ (this is the only place we use the assumption λ > N −1/2 ). Without the assumption # LE(R[0, u]) ≤ µN d/2 we get EY ≤ ≤

∞ µ=0 ∞

P(# LE(R[0, u]) ≤ µN d/2 ) · E(Y | # LE(R[0, u]) ≤ (µ + 1)N d/2 ) Ce−cµ µλ ≤ Cλ,

µ=0

and hence P(Y > 0) ≤ Cλ. Under the assumption Y = 0 every cut point of R[u, t] above u + N 2 is in LE(R[0, t]) and the lemma follows. Proof of Theorem 2. Let R be a random walk starting from b with no stopping condition. Define events X (v, t) = {R (t) = v ∧ v ∈ R [0, t[}, Y(v, t) = {R (t) = v ∧ # LE(R[0, t]) ≤ λN d/2 }.

Now, v P(X (v, t)) is simply the probability that a random walk reaches its end point for the first time, or equivalently by symmetry, the probability that it never returned to its starting point, therefore it is easy to calculate −d P(X (v, t)) ≤ Ce−ctN ∀t. v∈T

Next, for t > N d/2 , Lemma 7 gives P(Y(v, t)) ≤ Cλ v∈T

∀t > N d/2 .

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

279

Finally, note that ∞

P(X (v, t)) = 1

∀v ∈ T .

t=0

With these three facts we get, for any parameter µ > 0, P(X (v, t) \ Y(v, t)) t,v



≥N − d

d/2 N

t=0

+

∞ t=µN d

 

d µN

t=N d/2

v∈T

P(X (v, t)) −

v∈T

P(Y(v, t))

≥ N d (1 − Ce−cµ − N −d/2 − Cµλ). Picking µ = C log λ for some C sufficiently large will prove the theorem.

4.1. Remarks on alternative approaches. The first alternative approach to the proof of the lower bound is as follows: prove a conditioned version of Lemma 6, namely Lemma. Let b and e be two points on TNd with |b − e| > cN . Let R be a random walk on TNd of length L for some L = N d/2 , sufficiently small starting from b and conditioned to end at e. Let X be the number of cut times of R. Then EX > cL VX < C 2 L2 . This lemma allows to prove a version of Theorem 2 for any points far enough, not just two random points. Further, it allows to avoid the need to use absolute times, and just work directly with the times ti for some arbitrary ball. In other words, to show that the loop-erased random walk from b to e is long with high probability, define an arbitrary ball B, show that at the stopping times ti corresponding to B the entire loop-erased random walk is quite small (this is quite simple) and then show that the random walk from the last ti to e has many cut points using the lemma above. The proof of this lemma requires no new ideas when compared with Lemma 6. However, it is very technical, and quite long, which is the main reason we chose the approach above. In some sense we do not consider the length of Sect. 3 as an indication that the approach we chose is more complicated because the result (Theorem 3) is trivial if one can afford to lose a log factor (and also because the result is quite natural). Another approach is the use of the uniform spanning tree and Wilson’s algorithm (see [W96]). Roughly, one might hope to show that the loop-erased random walk is long by constructing an appropriate partial UST, and then showing that the random walk R starting from some point b and stopped on the partial UST is not too long (therefore no complicated self interactions, as in Lemma 6) and not too short, so LE(R) can be proved to be long. Since the loop-erased random walk from b to some other point e (say inside the partial UST) contains LE(R), this will be enough. Alternatively, one can take two random walks R and R starting from b and e respectively and stopped on the partially constructed UST, and calculate the probabilities that at least one is long and that they do not intersect. Both approaches allow to generalize Theorem 2 from a random end point to any end point (naturally, if b and e are very close then with positive probability the

280

I. Benjamini, G. Kozma

loop-erased random walk from b to e is short. However, one can show that there is a positive probability for the loop-erased random walk to be long, i.e. ≈N d/2 ). A third strategy using the UST is as follows. Notice that the harmonic measure on a partially constructed UST is roughly uniform — this follows since the escape probabilities from a typical small ball are positive. If one wants to estimate the probability that the loop-erased random walk between b and e is ≤ λN d/2 , construct a partial UST containing b up until its size is ≈ (1/λ)N d/2 , and then estimate that the number of vertices in the tree with distant ≤ λN d/2 from b is ≈ λN d/2 , so the harmonic measure is ≈ λ2 . This approach gives (in addition to the fact that e may be arbitrary) stronger estimates than the λ log λ−1 of Theorem 2 — formalizing these arguments we were able to show P(# LE(R) ≤ λN d/2 ) ≤ Cλ2 log λ−1 , and we believe that the true value is, as in the case of the complete graph, λ2 . The difference between λ log λ−1 and λ2 log λ−1 is significant in the following sense: the weaker estimate does not prove that the UST has a true branching nature: even points that are distributed linearly along a path of length N d/2 satisfy the requirement that P(LE ≤ λN d/2 ) ≤ Cλ. However, the estimate λ2 log λ−1 allows to deduce non-trivial facts about the branching structure of the UST. None of these methods work in dimension 4, and the culprit is always the same: in dimension 4 our methods do not show that within the mixing time the probability of hitting the loop-erased random walk is small. In other words, to get a lower bound for dimension 4 one must either show a very precise upper estimate (not much different from the conjectured precise value) or alternatively show indirectly that the mixing time is smaller than the hitting time of the loop-erased random walk. Appendix A. Proofs of Known and Unsurprising Facts The harmonic potential on Zd , d > 2, is the unique bounded function a satisfying 1 z = 0 a(z) = 0 otherwise and a(∞) = 0, where stands for the discrete Laplacian. It is well known (see e.g. [L96, Theorem 1.5.4] 4 or [KS04, Theorem 5]) that a(v) = α|v|2−d + O(|v|−d ). Lemma A.1. Let B1 = B(x1 , r1 ) ⊂ B2 = B(x2 , r2 ) ⊂ TNd , r2 ≤ C1 r1 . Let v ∈ B2 \ B1 satisfy d(v, ∂B2 ) ≥ c1 r1 . Let R be a random walk starting from v and stopped on ∂B1 ∪ ∂B2 . Let p be the probability that R hits ∂B1 . Then p ≥ c(c1 , C1 ). We assume here that a ball (e.g. B2 ) satisfies r2 < 21 N i.e. it does not wrap itself because we are on a torus. This assumption holds for all balls in this appendix, and we will not repeat it. Proof. Clearly, we may assume r1 is sufficiently large in the sense that r1 > C(c1 , C1 ). Since we are dealing with a process completely inside B2 , we may assume we are in Zd . Assume first that |x1 − v| < 21 d(x1 , ∂B2 ). Since a1 (v) := a(v − x1 ) is harmonic on B2 \ B1 , a1 (R) is a martingale, and if we define τ to be the stopping time on ∂B1 ∪ ∂B2 then we get a1 (v) = Ea1 (R(τ )), so a1 (v) = pE(a1 (R(τ )) | R(τ ) ∈ ∂B1 ) + (1 − p)E(a1 (R(τ )) | R(τ ) ∈ ∂B2 ) ≤ pαr12−d (1 + o(1)) + (1 − p)αd(x1 , ∂B2 )2−d (1 + o(1)) 4

(47)

[L96] only shows a(v) = α|v|2−d + O(|v|−d ), but this is completely sufficient for our purposes.

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

281

(the o(1) notations are as r1 → ∞ and may depend on c1 and C1 ) and from a1 (v) = α|x1 − v|2−d (1 + o(1)) we get p≥

|x1 − v|2−d − d(x1 , ∂B2 )2−d r12−d − d(x1 , ∂B2 )2−d

(1 + o(1)) ≥ c

(48)

for r1 sufficiently large. In the case |x1 − v| ≥ 21 d(x1 , ∂B2 ), we can find a sequence of balls B(yn , sn ) of length ≤ C(c1 , C1 ) and each sn ≥ c(c1 , C1 )r1 such that |y1 − v| ≤ 21 d(y1 , ∂B2 ) and ∀w ∈ ∂B(yi , si ), |yi+1 − w| ≤ 21 d(yi+1 , ∂B2 ). Notice that this is possible because d(v, ∂B2 ) ≥ c1 r1 . The previous case now gives that the probability that the random walk, after hitting B(yi , si ) will continue to B(yi+1 , si+1 ) is ≥ c2 (c1 , C1 ). Since it needs to perform only C(c1 , C1 ) such steps in order to hit B1 , we get p ≥ c2C = c3 (c1 , C1 ). Lemma A.2. Let B(x1 , r1 ), B(x2 , r2 ) be two disjoint balls, r2 ≤ C1 r1 and |x1 − x2 | ≤ C1 r1 ; and let v ∈ B2 ∪ B1 satisfy d(v, B2 ) ≥ c1 r2 . Then p ≥ c(c1 , C1 ), where p is as above. Proof. Assume first that d(v, B1 ) ≤ 21 d(B2 , B1 ) where d(B1 , B2 ) stands for the distance between the two balls in the usual sense. Let Bi be the sets Bi considered as subsets of Zd and let Si = Bi + N Zd , i.e. Si is the preimage of Bi by the quotient map Zd → TNd . Let R be a simple random walk on Zd starting from v (we consider v and the Bi ’s as subsets of Zd as well, say by locating them in [0, N]d ). Then p = P(R hits S1 before S2 ) ≥ P(R hits B1 before S2 ) (∗)

≥ P(R hits B1 before ∂B(x1 , r1 + d(B1 , B2 ))) ≥ c, where (∗) comes from the same harmonic potential arguments as (47)-(48). If d(v, B1 ) > 21 d(B2 , B1 ) but we have both d(v, B1 ) ≤ (2C1 + 2)r1 , d(v, B2 ) ≥ c1 r1 , then the same ball-sequence argument as in the previous lemma gives p ≥ c; If d(v, B1 ) > (2C1 + 2)r1 , let τ be the hitting time of B3 := B(x1 , (2C1 + 2)r1 ), then p = EP(R starting from R(τ ) hits B1 before B2 ) ≥ Ec = c,

(49)

where here R is a simple random walk on TNd (differing from R only by the starting point), the expectation E is over the distribution of R(τ ) and the inequality comes from the previous two cases. Finally, if d(v, B2 ) < c1 r1 define τ the hitting time of B4 := B(x2 , c1 r1 ). The harmonic potential at x2 with a calculation similar to (47)-(48) shows that the probability to hit B4 before B2 is ≥ c. After hitting B4 a calculation similar to (49) gives that p ≥ c. Lemma A.3. Let d(B(x1 , r1 ), B(x2 , r2 )) ≥ c1 r1 , r2 ≥ c1 r1 and |x1 − x2 | ≤ C1 r2 ; and let v ∈ ∂B1 . Let R be a random walk starting from v and stopped on ∂B2 ∪ {x1 }. Let p be the probability that R hits x1 . Then p ≈ r12−d .

282

I. Benjamini, G. Kozma

In the formulation of the lemma, and in its proof, all constants implicit in the ≈ signs might depend on c1 and C1 . Proof. Let B3 = B(x1 , 21 r1 ). Define stopping times ti similarly to (2), as follows: t0 := 0 and t2i+1 := {t > t2i : R(t) ∈ ∂B3 ), t2i := {t > t2i−1 : R(t) ∈ ∂B1 ∪ {x1 }}. Define also τ the hitting time of ∂B2 . The usual harmonic potential calculations (use the harmonic potential around x1 ) show that the probability of a random walk starting from any v ∈ ∂B3 to hit x1 before exiting B1 is ≈ r12−d . Hence, P(R(t2i ) = x1 |R(t2i−1 )) ≤ Cr12−d . Lemma A.2 shows that a random walk starting from any point in ∂B1 has probability ≥ c to hit B2 before hitting B3 . Therefore, the probability to get ti > τ decreases exponentially in i, i.e. P(t2i < τ ) ≤ Ce−ci and hence P(R(t2i ) = x1 and t2i < τ ) ≤ Ce−ci r12−d so p≤

∞

P(R(t2i+1 ) = x1 and t2i+1 < τ ) ≤ Cr12−d .

i=0

The inequality p ≥ cr12−d follows easily from (47)–(48) for the harmonic potential at x1 and the requirement d(B1 , B2 ) ≥ c1 r1 . Lemma A.4. Let |v| < (1−c1 )r and w ∈ ∂B(0, r). Then the probability p that a random walk starting from v will exit B in w is ≈ r 1−d . Without the restriction |v| < (1 − c1 )r one has p ≤ C(r − |v|)1−d In the formulation of the lemma, and in its proof, all constants implicit in the ≈ signs might depend on c1 . Proof. In Zd , d > 2, the probability of a walk starting from v to never return is > c. Hence the probability to hit ∂B(0, r) before returning to v is > c, and this event is identical on Zd and on the torus. The symmetry of the random walk shows that p is ≈ to the probability that a random walk starting from w will hit ∂B ∪ {v} in v (the quotient is exactly the probability of a random walk starting from v to return to v before exiting ∂B, which, as we just discussed, is ≈ 1) 5 . This probability can be calculated in three steps as follows. First, the probability of a random walk starting from w to hit ∂B(0, 23 r + 13 |v|) is ≈ (r − |v|)−1 : this uses an argument similar to (47)-(48) using the harmonic potential a at 0, but here we need the precise estimate a(x) = |x|2−d + O(|x|−d ) or at least a(x) = |x|2−d + O(|x|1−d ) (see, e.g. [K87, Lemma 3] for a detailed version of this calculation). Next, if |v| < r(1−c1 ), use Lemma A.1 to show that continuing from any point on ∂B(0, 23 r + 13 |v|) the probability to hit B(v, 13 (r − |v|)) is ≈ 1 — if |v| ≥ r(1 − c1 ), we only estimate that this probability is ≤ 1. Finally, the same (47)-(48) argument with the harmonic potential at v shows that starting from any point on ∂B(v, 13 (r − |v|)), the probability to hit v before hitting ∂B is ≈ (r − |v|)2−d . 5

When we say “hit” we mean at time > 0, so that these probabilities are not simply 1.

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

283

A similar calculation works when (1 + c1 )r ≤ ||v|| and p is the probability the random walk will hit B in w, using Lemma A.2 instead of Lemma A.1 and Lemma A.3 in the third step. Lemma A.5. Let v ∈ B(0, r) ⊂ B(0, 2r) ⊂ TNd . Let R be a random walk starting from v and stopped on ∂B(0, 2r). Let Rx be a random walk starting from v and conditioned to hit ∂B(0, 2r) at a specific point x. Then R ∩ B(0, r) ≈ Rx ∩ B(0, r), where ≈ means that the probabilities of any event are equal up to a constant. Proof. Let t be the last time when R(t) ∈ B(0, r). Let w = R(t). For any w, the probability of an (unconditioned) random walk starting from w to hit B(0, r) ∪ ∂B(0, 2r) in ∂B(0, 2r) is ≈ r −1 . The probability to hit x is ≈ r −d . This independence from w finishes the lemma. Both estimates are easily proved as in the previous lemma. Lemma A.6. Let R be a random walk starting from v ∈ B(0, r) and let w ∈ B(0, r). Let t > r 2 . Let p be the probability that R[0, t] ⊂ B(0, r) and R(t) = w. Then p ≤ Cr −d e−ctr

−2

.

Proof. Starting from any v ∈ B(0, r), after r 2 steps the random walk has probability > c to exit B(0, r). This shows, clearly, that the probability that R[0, t − r 2 ] ⊂ B(0, r) is −2 ≤ Ce−ctr . For any x ∈ B(0, r), the probability that a random walk R starting from x satisfies R (r 2 ) = w is ≤ Cr −d . Lemma A.7. Let v ∈ B(0, r) and let R be a random walk starting from v and stopped on ∂B(0, r). Let w ∈ ∂B(0, r). Then the probability p that R hits w satisfies p ≤ C1 |v − w|1−d . Proof. Denote s = |v − w|. We shall prove the lemma using an induction process that assumes the lemma holds for 1, . . . , 21 s and proves it for 21 s + 1, . . . , s. Denote d(v, ∂B(0, r)) = s. The first thing to note is that the lemma holds if > c with no need for induction (in the sense that p ≤ C2 (c)|v − w|1−d ), due to the second part of Lemma A.4. It is for the case of small that we need the induction process. Let δ = c1 log−1 −1 for some c1 which will be fixed later. Let D := B(0, r) \ B(0, r − δs), 1 E : = D ∩ B(v, s) . 2 Examine the exit probabilities of R from E. Let p2 be the probability that R exits E at ∂B(0, r − δs) ∩ ∂E. Then p2 ≤ P(R exits D at ∂B(0, r − δs)) ≤ C/δ, where the second inequality comes from the harmonic potential at zero. Let p3 be the probability that R exits E at ∂B(v, 21 s) ∩ ∂E. It is easy to see that for any x ∈ D, the probability that R exits B(x, 2δs) without hitting ∂D is <1 − c. Therefore −1

p3 < (1 − c)(s/2)/(2δs) = e−cc1

log −1

.

284

I. Benjamini, G. Kozma

Therefore for c1 sufficiently small we would get p3 ≤ . Together these two give p2 + p3 ≤ C log −1 . Since E ∩ B(w, 21 s) = ∅ we get that the probability of R to hit ∂B(w, 21 s) before exiting B(0, r) is ≤ C log −1 . The induction assumption gives that the probability to hit w after hitting ∂B(w, 21 s) is ≤ C1 ( 21 s)1−d . Therefore p ≤ C1 s 1−d · (C2d−1 log −1 ). For < c2 this will be ≤ C1 s 1−d and this case is finished too. The lemma is now finished because for < c2 the induction process works for any C1 , and for ≥ c2 the first case allows to define C1 := C2 (c2 ). A similar calculation works when v is outside B(0, r), and the random walk is stopped when hitting B(0, r) and the conclusion is p ≤ C min{|v − w|, r}1−d . Lemma A.8. Let v ∈ B(0, 21 r) and w ∈ ∂B(0, r). Let R be a random walk started from v and conditioned to exit B(0, r) in w. Let t be the exit time. Then P(t > λr 2 ) ≤ Ce−cλ . Proof. We may assume λ > 1. Let R be an unconditioned walk starting from v. Let An = (B(w, 2n ) \ B(w, 2n−1 )) ∩ B(0, r). Lemma A.6 shows that P(R [0, λr 2 ] ⊂ B(0, r) and R (λr 2 ) ∈ An ) ≤ C#An r −d e−cλ . Lemma A.7 shows that for every x ∈ An , the probability of a random walk starting at x to exit B(0, r) at w is ≤ C2n(1−d) . Therefore P(R [0, λr 2 ] ⊂ B(0, r) and R (λr 2 ) ∈ An and R hits w) ≤ C(#An )2n(1−d) r −d e−cλ ≤ Cr −d 2n e−cλ , and we get

P(R [0, λr ] ⊂ B(0, r) and R hits w) ≤ Cr 2

1−d −cλ

e

log r

r −1 2n ≤ Cr 1−d e−cλ .

n=1

Since the probability of R to hit w is > cr 1−d (by Lemma A.4), we are done.

Lemma A.9. Let Xi be events with a past-independent exponential estimate, namely P(Xi > λE | Xi−1 , . . . , X1 ) ≤ C1 e−c1 λ . Then P

n

Xi > λnE

≤ Ce−cλ .

i=1

As usual, C, c and all constants in the proof might depend on C1 and c1 .

Loop-Erased Random Walk on a Torus in Dimensions 4 and Above

285

Proof. Clearly we may assume E = 1. Let Yi be i.i.d. variables with Yi ∼ C2 (1 + G), where G is a standard exponential variable (namely with density e−t ) and C2 is some constant sufficiently large such that P(Yi > λ) ≥ min{1, C1 e−c1 λ }. A simple induction now shows that

P Xi > λn ≤ P Yi > λn , n−1

t .A and the sum of the Yi has the distribution C2 (n + ), where has density e−t (n−1)! −cλ simple calculation shows that P( > λn) ≤ Ce .

Lemma A.10. Let v ∈ TNd , and let R be a random walk starting from v going to a length of C1 N d r 2−d for some C1 sufficiently large. Then P(R hits B(0, r)) ≥

1 2

∀v ∈ TNd .

Proof. Define S to be the preimage of B in Zd (namely B +N Zd ) and let R be a random walk on Zd starting from some preimage of v. Then P(R hits B(0, r)) = P(R [0, N d r 2−d ] ∩ S = ∅). Define stopping times ti as follows: t0 = 0 and for every i let zi be the element of N Zd closest to R(ti ). Define inductively ti+1 = min{t > ti : R(t) ∈ ∂B(zi , r) ∪ ∂B(zi , 2N )}. Since d(R(ti ), zi ) ≤ N then using the harmonic potential at zi shows that P(R(ti+1 ) ∈ S) ≥ P(R(ti+1 ) ∈ ∂B(zi , r)) ≥ c(r/N )d−2 independently of the value of R(ti ). This immediately gives that, for C2 sufficiently large,

r d−2 C2 (N/r)d−2 1 P(R[0, tC2 (N/r)d−2 ] ∩ S = ∅) ≤ 1 − c ≤ . (50) N 4 On the other hand, it is easy to see that P(ti+1 − ti > λN 2 ) ≤ Ce−cλ independently of the past, and using Lemma A.9 we get that P(tn > λnN 2 ) ≤ Ce−cλ . Using this for n = C2 (N/r)d−2 and λ sufficiently large we get that 1 . 4 Equations (50) and (51) together show that the C in (51) may serve as our C1 . P(tn > CN d r 2−d ) ≤

(51)

Lemma A.11. Let v ∈ ∂B(0, 2r) and w ∈ ∂B(0, r). Let R be a random walk started from v and conditioned to hit B(0, r) in w. Let t be the hitting time. Then P(t > λN d r 2−d ) ≤ Ce−cλ . The proof is identical to that of Lemma A.8 with the use of Lemmas A.4 and A.7 replaced by the comments following them, respectively, and using Lemma A.10 to show that the probability to not hit a ball of radius r after λN d r 2−d steps is ≤ Ce−cλ and hence the equivalent of Lemma A.6. We omit the details. Acknowledgement. We wish to thank Chris Hoffman, Dan Romik and Oded Schramm for useful discussions.

286

I. Benjamini, G. Kozma

References Aizenman, M.: Geometric analysis of φ 4 fields and Ising models. Parts I and II. Commun. Math. Phys. 86:1, 1–48 (1982) [BKPS04] Benjamini, I., Kesten, H., Peres,Y., Schramm, O.: Geometry of the Uniform Spanning Forest: Transitions in Dimensions 4, 8, 12. Ann. of Math. 160(2), 465–491 (2004) [BLPS01] Benjamini, I., Lyons, R., Peres, Y., Schramm, O.: Uniform spanning forests. Ann. Probab. 29:1, 1–65 (2001) [BS85] Brydges, D.C., Spencer, T.: Self-avoiding walk in 5 or more dimensions. Commun. Math. Phys. 97:1-2, 125–148 (1985) [DS98] Debez, E., Slade, G.: The scaling limit of lattice trees in high dimensions. Commun. Math. Phys. 193:1, 69–104 (1998) [GB90] Guttmann, A.J., Bursill, R.J.: Critical exponents for the loop erased self-avoiding walk by Monte Carlo methods. J. Stat. Phys. 59:1/2, 1–9 (1990) [K00a] Kenyon, R.: The asymptotic distribution of the discrete Laplacian. Acta Mathematica 185:2, 239–286 (2000) [K00b] Kenyon, R.: Long range properties of spanning trees. J. Math. Phys. 41:3, 1338–1363 (2000) [HS90] Hara, T., Slade, G.: Mean-field critical behavior for percolation in high dimensions. Commun. Math. Phys. 128, 333–391 (1990) [HS92] Hara, T., Slade, G.: Self-avoiding walk in five or more dimensions. I. The critical behaviour. Commun. Math. Phys. 147:1, 101–136 (1992) [H02] Hastings, M.B.: Exact Multifractal Spectra for Arbitrary Laplacian Random Walks. Phys. Rev. Lett. 88, 055506 (2002) [K87] Kesten, H.: Hitting probabilities of random walks on Zd . Stochastic Processes and their Applications 25, 165–184 (1987) [K] Kozma, G.: Scaling limit of loop erased random walk — a naive approach. http://arXiv.org/ abs/math.PR/0212338, 2002 [KS04] Kozma, G., Schreiber, E.: An asymptotic expansion for the discrete harmonic potential. Electron. J. Proab. 9(1), 1–17 (2004) [L80] Lawler, G.F.: A self-avoiding random walk. Duke Math. J. 47:3, 655–693 (1980) [L87] Lawler, G.F.: Loop-erased self-avoiding random walk and the Laplacian random walk. J. Phys. A 20:13, 4565 (1987) [L95] Lawler, G.F.: The logarithmic correction for loop-erased walk in four dimensions. Proceedings of the conference in honor of Jean-Pierre Kahane (Orsay, 1993), special issue of J. Fourier Anal. Appl. 347–362 (1995) [L96] Lawler, G.F.: Intersections of random walks. Birkhäuser Boston, 1996 [L99] Lawler, G.F.: Loop-erased random walk. In: Perplexing problems in probability, Boston: Birkhäuser 1999, pp 197–217 [LSW04] Lawler, G.F., Schramm, O., Werner, W.: Conformal invariance of planar loop-erased random walk and uniform spanning trees. Ann. Prob. 32(1B), 939–995 (2004) [LEP86] Lyklema, J.W., Evertz, C., Pietronero, L.: The Laplacian random walk. Europhysics-Letters 2:2, 77–82 (1986) [LPS03] Lyons, R., Peres, Y., Schramm, O.: Markov Chain Intersections and the Loop-Erased Walk. Ann. Inst. H. Poincaré Probab. Statist. 39(5), 779–791 (2003) [NY95] Nguyen, B.G., Yang, W.-S.: Gaussian limit for critical oriented percolation in high dimensions. J. Stat. Phys. 78:3–4, 841–876 (1995) [P91] Pemantle, R.: Choosing a spanning tree for the integer lattice uniformly. Ann. Probab. 19:4, 1559–1574 (1991) [PR] Peres, Y., Revelle, D.: Scaling limits of the uniform spanning tree and loop-erased random walk on finite graphs. http://arxiv.org/abs/math.PR/0410430, 2004 [S00] Schramm, O.: Scaling limits of random walks. Israel J. Math. 118, 221–288 (2000) [W96] Wilson, D.: Generating random spanning trees more quickly than the cover time, TwentyEighth Annual ACM symposium on Theory of Computing, Math. New York: ACM Press, pp. 293–303, 1996 [W71] Wilson, K.G.: Renormalization Group and Critical Phenomena. II. Phase-Space Cell Analysis of Critical Behavior. Phys. Rev. B 4:9, 3184–3205 (1971) [A82]

Communicated by M. Aizenman

Commun. Math. Phys. 259, 287–305 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1394-6

Communications in

Mathematical Physics

Topological Calculation of the Phase of the Determinant of a Non Self-Adjoint Elliptic Operator Alexander G. Abanov1, , Maxim Braverman2, 1 2

Department of Physics and Astronomy, Stony Brook University, Stony Brook, NY 11794, USA. E-mail: [email protected] Department of Mathematics, Northeastern University, Boston, MA 02115, USA. E-mail: [email protected]

Received: 30 January 2004 / Accepted: 6 April 2005 Published online: 15 July 2005 – © Springer-Verlag 2005

Abstract: We study the zeta-regularized determinant of a non self-adjoint elliptic operator on a closed odd-dimensional manifold. We show that, if the spectrum of the operator is symmetric with respect to the imaginary axis, then the determinant is real and its sign is determined by the parity of the number of the eigenvalues of the operator, which lie on the positive part of the imaginary axis. It follows that, for many geometrically defined operators, the phase of the determinant is a topological invariant. In numerous examples, coming from geometry and physics, we calculate the phase of the determinants in purely topological terms. Some of those examples were known in physical literature, but no mathematically rigorous proofs and no general theory were available until now. Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . 2. Preliminaries on Determinants of Elliptic Operators . . . . . 3. Operators Whose Spectrum is Symmetric with Respect to the Imaginary Axis . . . . . . . . . . . . . . . . . . . . . . . . 4. First Examples . . . . . . . . . . . . . . . . . . . . . . . . 5. A Dirac-Type Operator on a Circle . . . . . . . . . . . . . . 6. The Phase of the Determinant and the Degree of the Map . .

. . . . . . . . . . . . . . . .

287 288

. . . .

291 294 296 298

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

1. Introduction In the recent years several examples appeared in the physical literature when the phase of the determinant of a geometrically defined non-self-adjoint Dirac-type operator is a topological invariant (see e.g., [11, 12, 2, 1]). Many of those examples appear in the study of the non-linear σ -model for Dirac fermions coupled to chiral bosonic fields

The first author was partially supported by the Alfred P. Sloan foundation. The second author was partially supported by the NSF grant DMS-0204421.

288

A.G. Abanov, M. Braverman

[2, 1]. The topologically invariant phase is called the θ-term. It has a dramatic effect on the dynamics of the Goldstone bosons but also has a great interest for geometers. Unfortunately, no mathematically rigorous proofs of the topological invariance of the phase of the determinant were available until now. This paper is an attempt to better understand the above phenomenon. In particular, we find a large class of operators whose determinants have a topologically invariant phase. We also develop a technique for calculation of this phase. In particular, we get a first mathematically rigorous derivation of several examples which appeared in the physical literature. In many cases, we also improve and generalize those examples. Our first result is Theorem 3.1 which states that the determinant of an elliptic operator D with a self-adjoint leading symbol, which acts on an odd-dimensional manifold and whose spectrum is symmetric with respect to the imaginary axis, is real. Moreover, the sign of this determinant is equal to (−1)m+ , where m+ is the number of the eigenvalues of D (counted with multiplicities) which lie on the positive part of the imaginary axis. Note that this result is somewhat surprising. Indeed, if one calculates the determinant of a finite matrix D with the spectrum symmetric with respect to an imaginary axis, then one comes to a different result. E.g., the determinant is not necessarily real. Suppose now that we are given a family D(t) of operators as above. Assuming that the eigenvalues of D(t) depend continuously on t one easily concludes (cf. Theorem 3.2) that the sign of the determinant of D(t) is independent of t. In particular, it follows that, if the definition of the operator D depends on some geometric data (Riemannian metric on a manifold, Hermitian metric on a vector bundle, etc.), then (provided the spectrum of D is symmetric) the sign of the determinant is independent of these data, i.e., is a topological invariant. We present numerous examples of this phenomenon. In all those examples we calculate the signs of the determinants in terms of the standard topological invariants, such as the Betti numbers or the degree of a map. The paper is organized as follows: In Sect. 2, we briefly recall the basic facts about the ζ -regularized determinants of elliptic operators. In Sect. 3, we formulate and prove our main result (Theorem 3.1) and discuss its main implications. In Sect. 4, we present the simplest (but still interesting) geometric examples of applications of Theorem 3.1. In Sect. 5, we consider an operator D on a circle, which appeared in the study of a quantum spin in the presence of a planar, time-dependent magnetic field. This operator depends on a map from a circle to itself. We calculate the phase of the determinant of D in terms of the winding number of this map. In Sect. 6, we extend some of the examples considered by P. Wiegmann and the first author in [2]. The operator in question is a Dirac type operator D on an odd dimensional manifold M, whose potential depends on a section n of the bundle of spheres in R ⊕ T M. In particular, if the manifold M is parallelizable, n is a map from M to a dim M-dimensional sphere. We show that the sign of the determinant of D is equal to (−1)deg n , where deg n is the topological degree of n. 2. Preliminaries on Determinants of Elliptic Operators Let E be a vector bundle over a smooth compact manifold M and let D : C ∞ (M, E) → C ∞ (M, E) be an elliptic differential operator of order m ≥ 1. Let σL (D) denote the leading symbol of D.

Topological Calculation of the Phase of the Determinant

289

2.1. The choice of an angle. Our aim is to define the ζ -function and the determinant of D. For this we will need to define the complex powers of D. As usual, to define complex powers we need to choose a spectral cut in the complex plane. We will restrict ourselves to the simplest spectral cuts given by a ray Rθ = ρeiθ : 0 ≤ ρ < ∞ , 0 ≤ θ ≤ 2π. (2.1) Consequently, we have to choose an angle θ ∈ [0, 2π ). Definition 2.1. The angle θ is a principal angle for an elliptic operator D if spec σL (D)(x, ξ ) ∩ Rθ = ∅, for all x ∈ M, ξ ∈ Tx∗ M\{0}. If I ⊂ R we denote by LI the solid angle LI = ρeiθ : 0 < ρ < ∞, θ ∈ I . Definition 2.2. The angle θ is an Agmon angle for an elliptic operator D if it is principal angle for D and there exists ε > 0 such that spec (D) ∩ L[θ−ε,θ+ε] = ∅. 2.2. The ζ -function and the determinant. Let θ be an Agmon angle for D. Assume, in addition, that D is injective. The ζ -function ζθ (s, D) of D is defined as follows. Let ρ0 > 0 be a small number such that spec (D) ∩ z ∈ C; |z| < 2ρ0 = ∅. Define the contour = θ,ρ0 ⊂ C consisting of three curves = 1 ∪ 2 ∪ 3 , where 1 = ρeiθ : ρ0 ≤ ρ < ∞ , 2 = ρ0 eiα : θ < α < θ + 2π , 3 = ρei(θ+2π) : ρ0 ≤ ρ < ∞ . (2.2) Assume that θ = 0. For Re s > Dθ−s =

dim M m ,

i 2π

the operator

θ,ρ0

−1 λ−s dλ θ (D − λ)

(2.3)

is a pseudo-differential operator with smooth kernel Dθ−s (x, y), cf. [14, 15]. Here λ−s θ := e−s logθ λ , where logθ λ denotes the branch of the logarithm in C\Rθ which takes real values on the positive real axis. We define dim M . (2.4) ζθ (s, D) = Tr Dθ−s = tr Dθ−s (x, x) dx, Re s > m M It was shown by Seeley [14] (see also [15]) that ζθ (s, D) has a meromorphic extension to the whole complex plane and that 0 is a regular value of ζθ (s, D). More generally, let Q be a pseudo-differential operator of order q. We set ζθ (s, Q, D) = Tr Q Dθ−s ,

Re s > (q + dim M)/m.

(2.5)

290

A.G. Abanov, M. Braverman

If Q is a projection, i.e., Q2 = Q then [18, §6], [19] (see also [10] for a shorter proof), the function ζθ (s, D; Q) also has a meromorphic extension to the whole complex plane which is regular at 0. Finally, we define the ζ -regularized determinant of D by the formula d (2.6) Det θ (D) = exp − s=0 ζθ (s, D) . ds 2.3. The case of an operator close to self-adjoint. Let us assume now that σL (D)∗ = σL (D),

(2.7)

where σL (D)∗ denotes the dual of σL (D) with respect to some fixed scalar product on the fibers on E. This assumption implies that D can be written as a sum D = D + A, where D is self-adjoint and A is a differential operator of a smaller order. In this situation we say that D is an operator close to self-adjoint, cf. [3, §6.2], [9, §I.10]. Though the operator D is not self-adjoint in general, the assumption 2.7 guarantees that it has nice spectral properties. More precisely, cf. [9, §I.6], the space L2 (M, E) of square integrable sections of E is the closure of the algebraic direct sum of finite dimensional D-invariant subspaces L2 (M, E) =

k

(2.8)

such that the restriction of D to k has a unique eigenvalue λk and limk→∞ |λk | = ∞. In general, the sum 2.8 is not a sum of mutually orthogonal subspaces. The spaces k are called the space of root vectors of D with eigenvalue λk . We call the dimension of the space k the multiplicity of the eigenvalue λk and we denote it by mk . By Lidskii’s theorem [8], [13, Ch. XI], the ζ -function 2.4 is equal to the sum (including the multiplicities) of the eigenvalues of Dθ−s . Hence, ζθ (s, D) =

∞

k=1

mk λ−s k

=

∞

mk e−s logθ λk ,

(2.9)

k=1

where logθ (λk ) denotes the branch of the logarithm in C\Rθ which take the real values on the positive real axis. 2.4. Dependence of the determinant on the angle. Assume now that θ is only a principal angle for D. Then, cf. [14, 15], there exists ε > 0 such that spec (D)∩L[θ−ε,θ+ε] is finite and spec (σL (D)) ∩ L[θ−ε,θ+ε] = ∅. Thus we can choose an Agmon angle θ ∈ (θ − ε, θ + ε) for D. In this subsection we show that Detθ (D) is independent of the choice of this angle θ . For simplicity, we will restrict ourselves with the case when D is an operator close to self-adjoint, cf. Subsect. 2.3. Let θ

> θ be another Agmon angle for D in (θ − ε, θ + ε). Then there are only finitely many eigenvalues λr1 , . . . , λrk of D in the solid angle L[θ ,θ

] . We have logθ λk , if k ∈ {r1 , . . . , rk }; logθ

λk = (2.10) logθ λk + 2πi, if k ∈ {r1 , . . . , rk }.

Topological Calculation of the Phase of the Determinant

291

Hence, ζθ (0, D) − ζθ

(0, D) =

k

d mk e−s logθ (λri ) (1 − e−2πis ) ds s=0 i=1

= 2π i

k

(2.11)

mri ,

i=1

and Det θ

D = Detθ D.

(2.12)

Note that the equality 2.12 holds only because both angles θ and θ

are close to a given principal angle θ so that the intersection spec (D) ∩ L[θ ,θ

] is finite. If there are infinitely many eigenvalues of D in the solid angle L[θ ,θ

] then Det θ

(D) and Det θ (D) might be quite different. 3. Operators Whose Spectrum is Symmetric with Respect to the Imaginary Axis In this section M is an odd-dimensional closed manifold, E → M is a complex vector bundle over M, and D is a differential operator of order m ≥ 1 which is close to self-adjoint (cf. Subsect. 2.3) and invertible. 3.1. The phase of the determinant and the imaginary eigenvalues. Suppose that the spectrum of D is symmetric with respect to the imaginary axis. More precisely, we assume that, if λ = ρeiα is an eigenvalue of D with multiplicity m, then ρe−i(π+α) is also an eigenvalue of D with the same multiplicity. Since the leading symbol of D is self-adjoint, ± π2 are principal angles of D, cf. Definition 2.1. Hence, cf. Subsect. 2.4, we can choose an Agmon angle θ ∈ ( π2 , π) such that there are no eigenvalues of D in the solid angles L(π/2,θ] and L(−π/2,θ −π ] . Let m+ denote the number of eigenvalues of D (counted with multiplicities) on the positive part of the imaginary axis, i.e., on the ray Rπ/2 (cf. Definition 2.1). Our first result is the following Theorem 3.1. In the situation described above Im ζθ (0, D) = − π m+ . (3.1) In particular, Detθ D = exp − ζθ (0, D) is a real number, whose sign is equal to (−1)m+ . Remark 3.1. a. For 3.1 to hold we need the precise assumption on θ which we specified above. However, if we are only interested in the sign of the determinant of D, the result remains true for all θ ∈ (−π, π). This follows from 2.12. b. Note that only the eigenvalues on the positive part of the imaginary axis contribute to the sign of the determinant. This asymmetry between the positive and the negative part of the imaginary axis is coursed by our choice of the spectral cut Rθ in the upper half plane. If we have chosen the spectral cut in the lower half plane the sign of the determinant would be determined by the eigenvalues on the negative imaginary axis.

292

A.G. Abanov, M. Braverman

Proof. Let π , j = 1, 2, . . . 2 be all the eigenvalues of D which lie in the solid angle L(θ−π,π/2) (here and below all the eigenvalues appear in the list the number of times equal to their multiplicities). Since the spectrum of D is symmetric with respect to the imaginary axis, ρj e−i(π+αj ) (j = 1, 2, . . . ) are all the eigenvalues of D in the solid angle L(θ−2π,−π/2) . 1 Finally, let θ − π < αj <

ρj eiαj ,

i2 + i2 µ+ 1 e , . . . , µm+ e , π

−i 2 −i 2 µ− , . . . , µ− m− e 1 e

π

π

π

be all the imaginary eigenvalues of D (since ± π2 are principal angles for D, there are only finitely many of those, cf. Subsect. 2.4). Then ζθ (s, D) =

∞

j =1



= 2

m+ m−

−s −i π2 s −s i π2 s ρj−s e−iαj s + ei(αj +π)s + (µ+ ) e + (µ− e j j ) ∞

j =1

m−

+

j =1



ρj−s cos (αj +

j =1

j =1

m+

π  iπs −s −i π2 s )s e 2 + (µ+ e j ) 2 j =1

−s i 2 s (µ− e . j ) π

Set z(s) := 2

∞

j =1

ρj−s cos (αj +

π )s. 2

Then ζθ (s, D) = z(s) e

i π2 s

+

m+

j =1

and ζθ (0, D) = z (0) + i

−s −i π2 s (µ+ e j )

+

m−

j =1

−s i 2 s (µ− e , j ) π

π π z(0) + log µ± m− − m+ . j + i 2 2

Note that z(s) and z (s) are real for s ∈ R. Hence, we obtain π z(0) + m− − m+ . Im ζθ (0, D) = 2

(3.2)

We will now calculate z(0) by comparing it with the ζ -function of the operator D 2 . The angle 2θ is a principal angle for D 2 and

−s −i π2 s (µ± ρj−s e−iαj s + eiαj s + e ζ2θ (s/2, D 2 ) = j )

−s ± −s −i π2 s = 2 ρj cos(αj s) + (µj ) e . 1 Since we have chosen a spectral cut along the ray R with θ ∈ (π/2, π) we write all the eigenvalues θ −s −iαs . in the form λ = ρeiα with α ∈ (θ − 2π, θ ) so that λ−s θ =ρ e

Topological Calculation of the Phase of the Determinant

293

Hence, ζ2θ (s/2, D 2 ) − z(s) = 4 +

ρj−s sin(αj +

π π )s sin s 4 4

−s −i 2 s (µ± e . j ) π

(3.3)

Let (−π/2,π/2) , (π/2,3π/2) : L2 (M, E) → L2 (M, E) be the orthogonal projections onto the spans of the eigensections of D corresponding to the eigenvalues in L(−π/2,π/2) and in L(π/2,3π/2) respectively. Then, using the notation introduced in (2.5), we obtain 3π

ζθ (s, (−π/2,π/2) , D) e−i 4 s − ζθ (s, (π/2,3π/2) , D) e−i 4 s

π 3π = ρj−s e−iαj s e−i 4 s − ρj−s ei(αj +π)s e−i 4 s

π π π = ρj−s e−i(αj + 4 ) − ei(αj + 4 ) = −2i ρj−s sin(αj + )s. 4 −s Hence, cf. the discussion in the end of Subsect. 2.2, the function ρj sin(αj + π4 )s has a meromorphic extension to the whole complex plane, which is regular at 0. Thus, the first term in the RHS of (3.3) vanishes when s = 0. The equality (3.3) implies now that π

ζ2θ (0, D 2 ) − z(0) = m+ + m− . It is well known, cf. [14], that the ζ -function of a differential operator of even order on an odd-dimensional manifold vanishes at 0. In particular, ζ2θ (0, D 2 ) = 0. Thus, z(0) = − m+ + m− . Substituting this equality into (3.2), we obtain (3.1).

Remark 3.2. Note that the result of Theorem 3.1 is somewhat surprising. Indeed, if one thinks about Detθ D as a formal product of the eigenvalues of D, then one can do the following formal computation (where we shall use the notation introduced in the proof of Theorem 3.1): for each j = 1, 2, . . . the product of the eigenvalues ρj eiαj π and ρj ei(π −αj ) is a real number. Hence, one expects Detθ D = ± Detθ D ei 2 (m+ −m− ) , which is quite different from the correct answer given by Theorem 3.1. This example illustrates the danger of formal manipulations with determinants 2 .

3.2. Stability of the phase of the determinant. Suppose now that D(t) is a family of close to self-adjoint operators, depending on a real parameter t. We will say that the spectrum of D(t) depends continuously on t if, for each t, we can represent L2 (M, E) as a closure of a sum of D(t)-invariant finite dimensional subspaces, L2 (M, E) =

k (t),

(cf. 2.8) such that 2 The fact that formal computations often lead to wrong answers is well known. In particular, Det D θ might not be real even if D = D ∗ so that all the eigenvalues of D are real, cf., for example, [20].

294

A.G. Abanov, M. Braverman

• dim k (t) is independent of t; • the restriction of D(t) to k has a unique eigenvalue λk (t) and limk→∞ |λk (t)| = 0; • for every k = 1, 2, . . . , the function λk (t) is continuous in t. Theorem 3.2. Let now D(t) be a family of operators depending on a real parameter t. We assume that for each t ∈ R the operator D(t) satisfies all the assumptions of Theorem 3.1. In particular, its spectrum is symmetric with respect to the imaginary axis. Assume, in addition, that the eigenvalues of D(t) depend continuously on t. For each t let us choose an Agmon angle θ (t) ∈ (π/2, π). Then Detθ(t) D(t) is real and its sign is independent of t. Proof. By our assumptions, the eigenvalues of D(t) are symmetric with respect to the imaginary axis and never pass through zero. It follows that when one of the eigenvalues reaches Rπ/2 from the left the other must reach it from the right. In other words, the parity of the number of the eigenvalues on the imaginary axis is independent of t. The theorem follows now from Theorem 3.1 and Remark 3.1. 3.3. Topological invariance of the phase of the determinant. The eigenvalues always depend continuously on t if D(t) = D0 + tB, where D0 is an elliptic differential operator of order m ≥ 1 and B is a differential operator whose order is less than m, cf. [7]. They also often depend continuously on t when we have a smooth family of geometric structures (i.e, Riemannian metrics on a manifold, Hermitian metrics on a vector bundle, etc.) and D(t) is a family of geometrically defined operators (Dirac operators, Laplacians, etc.) depending on these geometric structures. Suppose, in addition, the spectrum of D(t) is symmetric with respect to the imaginary axis. Then, in view of Theorem 3.2, it is natural to expect that the phase of the determinant is a topological invariant. A natural question is how to relate this invariant to the other topological invariants. In other words, we would like to find a topological method of calculating the phase of the determinant of geometric operators, whose spectrum is symmetric with respect to the imaginary axis. In the rest of the paper we present numerous examples in which such a calculation is indeed possible. 4. First Examples In this section we present some simple examples of applications of Theorem 3.1. More sophisticated examples will be considered in the subsequent sections.

4.1. The circle. The simplest possible example of an operator satisfying the conditions of Theorem 3.1 is the operator Da = −i

d + ia, dt

a ∈ R,

acting on the space of function on the circle S 1 . Clearly, the only imaginary eigenvalue of Da is ia. Hence, Theorem 3.1 implies that, for θ ∈ (0, π ), we have Detθ Da < 0, if a > 0,

and

Detθ Da > 0 if a < 0.

(4.1)

Topological Calculation of the Phase of the Determinant

295

Remark 4.1. The determinant of the operator Da can be calculated explicitly. In fact [6] provides a formula for this determinant in terms of the monodromy operator associated to Da . Using this formula, one easily gets Det θ Da = e−aβ − 1, where β is the length of the circle. Clearly, that agrees with 4.1. 4.2. The deformed DeRham-Dirac operator. Suppose M is a closed manifold of odd dimension N = 2l + 1. Let d : ∗ (M) → ∗+1 (M) be the DeRham differential and let d ∗ : ∗ (M) → ∗−1 (M) be the adjoint of d with respect to a fixed Riemannian metric on M. Let βj (M) = dim H j (M) (j = 0, . . . , N) denote the Betti numbers of M. The operator Da := d + d ∗ + ia,

a ∈ R, a = 0

has exactly one imaginary eigenvalue λ = ia and its multiplicity is equal to the sum N e duality, this sum is j =0 βj (M) of the Betti numbers of M. Because of the Poincar´ an even number. Thus Theorem 3.1 implies that Det θ Da > 0,

a ∈ R, 0 < θ < π.

for all

To construct a more interesting example let us fix non-zero real numbers a0 , . . . , aN and consider the operator A : ∗ (M) → ∗ (M) defined by the formula A : ω → aj ω,

if

ω ∈ j (M).

Then one easily concludes from Theorem 3.1 that βj (M) Det θ d + d ∗ + iA . Detθ d + d ∗ + iA = (−1) {j :aj >0} Another interesting example can be constructed as follows. Let ∗ : ∗ (M) → N−∗ (M) denote the Hodge-star operator. Consider the operator : ∗ (M) → N−∗ (M), defined by the formula : α → i

N (N +1) 2

(−1)

j (j +1) 2

∗ α = i l+1 (−1)

j (j +1) 2

∗ α,

α ∈ j (M). (4.2)

Since N = 2l + 1 is odd, is self-adjoint, satisfies 2 = 1, and commutes with d + d ∗ . In particular, acts on Ker(d + d ∗ ) and this action has exactly 2 eigenvalues ±1, which have equal multiplicities 21 N j =0 βj (M). Hence, the operator D := d + d ∗ + i , has exactly 2 imaginary eigenvalues ±i, and multiplicities of these eigenvalues are equal to 21 N j =0 βj (M). Theorem 3.1 implies now that 1

Det θ D = (−1) 2

N

j =0

βj (M)

Det θ D .

Remark 4.2. All the results of this subsection can be easily extended to operators acting on the space of differential forms with values in a flat vector bundle F → M. (The DeRham differential should be replaced by the covariant differential and the Betti numbers should be replaced by the dimensions of the cohomology of M with coefficients in F ). We leave the details to the interested reader.

296

A.G. Abanov, M. Braverman

5. A Dirac-Type Operator on a Circle In this section we consider the operator D on the circle, which appears, e.g., in the study of a quantum spin in the presence of a planar, time-dependent magnetic field. In the case when the magnetic field is changing adiabatically in time the wave function of spin 1/2 acquires the phase π k, where k is an integer number of rotations that the direction n of magnetic field makes around the origin during the time evolution. This adiabatic phase is called the Berry phase [5]. For adiabatic evolution of magnetic field the Berry phase is equal up to a trivial dynamic factor to the determinant of the operator D defined in 5.2 below, cf., e.g., [2]. The main result of this section is Theorem 5.1 which calculates the phase of this determinant. This theorem is known in physical literature (see e.g., [2]), but no mathematically rigorous proofs were available until now. 5.1. The setting. Let S 1 be the circle, which we view as the interval [0, β] (β > 0) with identified ends. Let n : S 1 −→ z ∈ C : |z| = 1 be a smooth map. Then there exists a smooth function φ : R → R, satisfying the periodicity conditions φ(t + β) = 2πk + φ(t),

k ∈ Z,

(5.1)

such that n = eiφ . The number k above is called the topological degree (or the winding number) of the map n. 0 eiφ and consider the family of operators depending on a real paramSet n = −iφ e 0 eter m d d 0 eiφ D = i + im n = i + im −iφ , (5.2) e 0 dt dt acting on the space of vector-functions ξ : [0, β] → C2 with boundary conditions ξ(β) = eiπν ξ(0),

ξ˙ (β) = eiπν ξ˙ (0),

ν = 0, 1.

(5.3)

We shall study the determinant of D. The following lemma shows that this determinant is non-zero for m sufficiently large. ˙ Lemma 5.1. For m > maxt∈[0,β] |φ(t)|, zero is not in the spectrum of D. Proof. Consider the following scalar product on the vector valued functions on [0, β]: β (ξ, η) = ξ(t), η(t) dt, 0

where ·, · stands for the standard scalar product on C2 . Let ξ = (ξ, ξ )1/2 denote the norm of the vector function ξ . Integrating by parts the expression for Dξ 2 we obtain, for ξ satisfying the boundary condition (5.3), Dξ 2 = (ξ˙ , ξ˙ ) + m( nξ, ξ˙ ) + m(ξ˙ , nξ ) + m2 ξ 2 ≥ −m( n˙ ξ, ξ ) − m( nξ˙ , ξ ) + m(ξ˙ , nξ ) + m2 ξ 2 ˙ = −m( n˙ ξ, ξ ) + m2 ξ 2 ≥ m m − max |φ(t)| ξ 2 . t∈[0,β]

Topological Calculation of the Phase of the Determinant

297

˙ Theorem 5.1. Let m > maxt∈[0,β] |φ(t)|. For every θ ∈ (0, π ) such that there are no eigenvalues of D on the ray Rθ the following equality holds Det θ D = −(−1)k+ν Det θ D , (5.4) where k is defined in 5.1 and ν is defined in 5.3. Remark 5.1. Theorem 5.1 relates the sign of Det θ D with the topological invariant of the map eiφ . This realizes the program outlined in Subsect. 3.3. We precede the proof of the theorem with some discussion of the spectral properties of D. 5.2. The spectral properties of D. In order to study the spectrum of D it is convenient to replace it by a conjugate operator as follows. The operator iφ/2 e 0 Uφ := 0 e−iφ/2 maps the space of vector-functions with boundary conditions 5.3 to the space of functions ξ : [0, β] → C2 with new boundary conditions ξ(β) = eiπ(ν+k) ξ(0), Thus the operator

ξ˙ (β) = eiπ(ν+k) ξ˙ (0).

(5.5)

˙ := U −1 ◦ D ◦ Uφ = i d + −φ/2 im D φ ˙ im φ/2 dt

(5.6)

acting on the space of vector functions with boundary conditions 5.5 is isospectral to D. We now consider the following deformation of D: ˙ a := i d + −a φ/2 im , a ∈ [0, 1]. (5.7) D ˙ im a φ/2 dt The same arguments which were used in the proof of Lemma 5.1 show that a is invertible for all a ∈ [0, 1], and all sufficiently large Lemma 5.2. The operator D m > 0. Let

˙ −im d −a φ/2 Da = −i + , ˙ −im a φ/2 dt

a . be the complex conjugate of the operator D a (and, hence, of D) is symmetric The following lemma shows that the spectrum of D with respect to both the real and the imaginary axis. a , −D a∗ are conjugate to each other. Therefore, a , and D Lemma 5.3. The operators D they have the same spectral decomposition (2.8). In particular, the operators D, −D, and D ∗ are conjugate to each other. Proof. An easy calculation shows that 1 0 1 0 a∗ , · Da · = D 0 −1 0 −1

01 01 a . · Da · = −D 10 10

298

A.G. Abanov, M. Braverman

are conjugate to each other 5.3. Proof of Theorem 5.1. Since the operators D and D =D a is their determinants are equal. By Theorem 3.2, the sign of the determinant of D equal to the sign of the determinant of the operator d 0 im + . D0 = i im 0 dt 0 are given by the formula It is easy to see that all the eigenvalues of D λ± n = ±im +

π (2n − k − ν), β

n ∈ Z.

0 does not have any eigenvalues on the ray Rπ/2 if k + ν is odd and has exactly Hence, D one eigenvalue λ+ (k+ν)/2 = im on this ray if k + ν is even. Theorem 5.1 follows now from Theorem 3.1. 6. The Phase of the Determinant and the Degree of the Map This section essentially generalizes the previous section to manifolds of higher dimensions. For the case of a sphere of dimension N = 4l + 1 the results of this section have been obtained in [2] as topological terms in non-linear σ -models emerging as effective models for Dirac fermions coupled to chiral bosonic fields. However, no mathematically rigorous proofs were available until now. Note also that our result is more precise, since the equality 6.6 was obtained in [2] from the gradient expansion, i.e., only asymptotically for m → ∞. The section is organized as follows: first we formulate the problem in purely geometric terms as a question about the determinant of the DeRham-Dirac operator with potential. We state our main result as Theorem 6.6. Then, in Subsect. 6.3, we reformulate the result in terms of an operator acting on the tensor product of the two spaces of spinors. This formulation is closer to the one considered in physical literature. Finally, we present the proof of Theorem 6.6 based on the application of Theorems 3.1 and 3.2. 6.1. The setting. Let M be a closed oriented manifold of odd dimension N = 2r + 1. We fix a Riemannian metric on M and use it to identify the tangent and the cotangent j bundles, T M T ∗ M. Let ∗ T M = N j =0 T M denote the exterior algebra of T M viewed as a vector bundle over M. The space ∗ (M) of complex-valued differential forms on M coincides with the space of sections of the complexification ∗ T M ⊗ C of this bundle. The bundle ∗ T M ⊗ C (and, hence, the space ∗ (M)) carries 2 anti-commuting actions of the Clifford algebra of T M (the “left” and the “right” action) defined as follows cL (v) ω = v ∧ ω − ιv ω, cR (v) ω = v ∧ ω + ιv ω,

v ∈ T M, ω ∈ ∗ (M),

where ιv denotes the interior multiplication by v.

(6.1)

Topological Calculation of the Phase of the Determinant

299

The DeRham-Dirac operator ∂ can be written now (cf. [4, Prop. 3.53]) as ∂ = d + d∗ =

N

j =1

cL (ej ) ∇eLC , j

(6.2)

where ∇ LC denotes the Levi-Civita covariant derivative and e1 , . . . , eN is an orthonormal frame of T M. We view the direct sum R ⊕ T M as a vector bundle over M. Consider the corresponding sphere bundle S := (t, a) ∈ R ⊕ T M : t 2 + |a|2 = 1 . (6.3) Let n be a smooth section of the bundle S. In other words, n = (n0 , n), where n0 ∈ C ∞ (M), n ∈ C ∞ (M, T M) and n20 + |n|2 = 1. Remark 6.1. Suppose M is a parallelizable manifold, i.e., there given an identification between T M and the product M × RN . Then n can be considered as a map (6.4) M −→ S N := y ∈ RN+1 : |y|2 = 1 . Also, if M ⊂ RN+1 is a hypersurface, then, for every x ∈ M, the space R ⊕ Tx M is naturally identified with RN+1 . Hence, n again can be considered as a map M → S N . Note, however, that, even if M is parallelizable, this map is different from 6.4. Consider the map : R ⊕ T M → End ∗ T M ⊗ C,

: n = (n0 , n) → in0 + cR (n),

and define the family of deformed DeRham-Dirac operators Dmn = ∂ + m (n) : ∗ (M) −→ ∗ (M).

(6.5)

We are interested in the phase of Detθ Dmn for sufficiently large m. The following lemma shows that this determinant is well defined. Lemma 6.1. Fix an orthonormal frame e1 , . . . , eN of T M and set N

LC LC ∇ n(x) = ∇ n(x) , ej

For m > maxx∈M

j =1

|(∇ LC n(x)| + |∇n

N

∇n0 (x) = ∇e n0 (x) . j j =1

0 (x)| , zero is not in the spectrum of D.

The lemma is a particular case of a more general Lemma 6.2, cf. below. 6.2. The degree of a section. Note that the bundle S → M has a natural section σ : M → S, σ (x) = (1, 0). Definition 6.1. The topological degree deg(n) of the map n is the intersection number of the manifolds σ (M) and n(M) inside S. Remark 6.2. Suppose M is parallelizable and consider n and σ as maps M → S N , cf. Remark 6.1. Then σ is the constant map σ (x) = (1, 0). Hence, deg(n) is the usual topological degree of the map n : M → S N . Theorem 6.1. Let m > maxx∈M |(∇ LC n(x)| + |∇n0 (x)| . For every θ ∈ (0, π ) such that there are no eigenvalues of Dmn on the ray Rθ the following equality holds: (6.6) Detθ Dmn = (−1)deg n Det θ Dmn .

300

A.G. Abanov, M. Braverman

6.3. Reformulation in terms of spinors. Consider the (left) chirality operator L := i

N +1 2

cL (e1 ) cL (e2 ) · · · cL (eN ),

where e1 , . . . , eN is an orthonormal frame of T M. This operator is independent of the choice of the frame [4, Lemma 3.17] (in fact, it coincides with the operator defined in 4.2). Moreover, L2 = 1 and L commutes with cL (v) and anti-commutes with cR (v) for all v ∈ T M. Consider the map : R ⊕ T M → End ∗ T M,

n = (n0 , n) → n := i L n0 + L cR (n).

Then n2 = − n20 + |n|2 . Hence, the map n → n defines a Clifford action of R ⊕ T M on ∗ T M. Assume now that M is a spin-manifold (without this assumption the construction of this subsection is true only locally, in any coordinate neighborhood). In particular, there exists a bundle S → M whose fibers are isomorphic to the space of spinors over R⊕T M. Then (cf. [4, Prop. 3.35]) there exists a bundle S → M, such that ∗ T M ⊗ C → M can be decomposed as the tensor product S ⊗ S, and the operators n (n ∈ R ⊕ T M) act only on the second factor. More precisely, if we denote by cS : R ⊕ T M → End S the Clifford action of R ⊕ T M on S, then n = 1 ⊗ cS (n). We introduce now a new Clifford action c : T M → End ∗ T M ⊗ C of T M on ∗ T M ⊗ C, defined by the formula c(v) = L cL (v),

v ∈ T M.

(6.7)

One readily sees that n and c(v) commute for all v ∈ T M, n ∈ R ⊕ T M. It follows (cf. [4, Prop. 3.27]) that there is a Clifford action cS : T M → End S such that c(v) = cS (v) ⊗ 1. Comparing dimensions we conclude that S is a spinor bundle over M. It follows from (6.2), that ∂ = L ∂ S ⊗ 1, where ∂ S is the Dirac operator on S. Hence, the operator (6.5) takes the form Dmn = ∂ + m L n = L ∂ S ⊗ 1 + m · 1 ⊗ cS (n) : C ∞ (S ⊗ S) −→ C ∞ (S ⊗ S). In this form this and similar operators have appeared in physical literature. In particular, for the case when M is a (4l + 1)-dimensional sphere this operator 3 was considered in [2]. Also a result similar to our Theorem 6.6 was obtained in [2] for the operator L · Dmn = ∂ S ⊗ 1 + m 1 ⊗ cS (n) on a (4l + 3)-dimensional sphere. 3

Note, however, that there is a sign discrepancy between our notation and the notation accepted in physical literature. Our operators cS (v), cS (n) are skew-adjoint and satisfy the equalities cS (v)2 = −|v|2 , cS (n) = −|n|2 . Consequently, the operator ∂ S is self-adjoint.

Topological Calculation of the Phase of the Determinant

301

6.4. The idea of the proof. The rest of this section is devoted to the proof of Theorem 6.6, which is based on an application of Theorems 3.1 and 3.2. More pre whose determinant has the same cisely, we will deform operator Dmn to an operator D sign (in view of Theorem 3.2). We then calculate the number of imaginary eigenvalues which, in view of Theorem 3.1, will give us the sign of the determinants of D and of D, Dmn . First, we need to define the class of operators in which we will perform our deformation. This is done in the next subsection. 6.5. Extension of the class of operators. Let a : M → R and v : M → T M be a smooth function and a smooth vector field on M respectively. Set D(a, v) := ∂ + ia + cR (v). Clearly, Dmn = D(mn0 , mn). Also the following analogue of Lemma 6.1 holds: Lemma 6.2. Suppose a(x)2 + |v(x)|2 > 0 for all x ∈ M. Fix m0 > max |(∇ LC v(x)| + |∇a(x)| . x∈M

Then, for all m ≥ m0 , zero is not in the spectrum of D(m0 a, mv). Proof. Set ∂ m = ∂ + (m − m0 )cR (v). Then D(m0 a, mv) = ∂ m + i m0 a + m0 cR (v). ∗ (M).

Let α ∈ Using 6.2, we obtain, D(m0 a, mv) α 2 2 = ∂ m α + m20 α2 + m0 ∂ m (cR (v) + ia) + (cR (v) − ia) ∂ m α, α ≥ m20 α2 + m0

N

j =1

cL (ej ) cR (∇eLC v) + ∇ej a α, α j

+ 2 m0 (m − m0 ) |v|2 α, α ≥ m0 m0 − max |(∇ LC v(x)| + |∇a(x)| α2 . x∈M

The following lemma shows that we can apply Theorem 3.1 to the study of Detθ (a, v) (and, hence, of Detθ Dmn ). Lemma 6.3. The operators D(a, v) and −D(a, v)∗ are conjugate to each other. Consequently, they have the same spectral decomposition (2.8). ∗ are conjugate to each other. In particular, the operators Dmn and −Dmn Proof. Let N : ∗ (M) → ∗ (M) be the grading operator defined by the formula N ω = (−1)j ω,

ω ∈ j (M).

(6.8)

Then N ◦ D(a, v) ◦ N = − ∂ + ia − cR (v) = −D(a, v)∗ .

(6.9)

302

A.G. Abanov, M. Braverman

6.6. Deformation of Dmn . Let n = (n0 , n) be as in Theorem 6.6. Suppose that deg(n) = ±k, where k is a non-negative integer. Then there exists a section n = (n 0 , n ) of S, which is homotopic to n and has the following properties: • There exist k distinct points x1 , . . . , xk ∈ M such that n 0 (xj ) = 1,

n (xj ) = 0,

j = 1, . . . , k.

• There exists a Morse function f : M → R and a neighborhood U of the set {x1 , . . . , xk } such that n (x) = ∇f (x),

for all

x ∈ U,

and n (x) = 0 for all x ∈ U \{x1 , . . . , xk }. • If n (x) = 0 and x ∈ {x1 , . . . , xk }, then n 0 (x) = −1 and ∇f (x) = 0. Let xk+1 , . . . , xl be the rest of the critical points of f . Then n (x) = 0 for all x ∈ {x1 , . . . , xl }. Fix open neighborhoods Vj (j = 1, . . . , l) of xj whose closures are mutually disjoint and such that Vj ⊂ U for all j = 1, . . . , k. We will assume that Vj are small enough so that n 0 (x) = 0 and n (x) = 0 for all x ∈ Vj \{xj }. For each j = 1, . . . , l fix a neighborhood Wj of xj , whose closure lies inside Vj . Let a : M → [−1, 1] be a smooth function such that 1, if x ∈ kj =1 Wj ; a(x) = (6.10) −1, if x ∈ kj =1 Vj . Consider the deformation (n0 (t), n(t)) of the section (n 0 , n ) ∈ S given by the formulas ta + (1 − t)n 0 , 0≤t ≤1 n0 (t) = , a, 1≤t ≤2 n , 0≤t ≤1 n(t) = .

(t − 1)∇f + (2 − t)n , 1≤t ≤2 Clearly, (n0 (t), n(t)) = 0 for all t ∈ [0, 2]. Hence, by Lemma 6.2, for large m0 and every m > m0 , t ∈ [0, 2], zero is not in the spectrum of the operator D m0 n0 (t), mn(t) . Theorem 3.2 implies now that the determinant of Dmn has the same sign as the determinant of D m0 n0 (2), mn(2) = D(m0 a, m∇f ). 6.7. The spectrum of the operator D(0, m∇f ). Before investigating the operator D(m0 a, m∇f ) we consider a simpler operator ∗ D(0, m∇f ) = ∂ + m cR (∇f ) = e−mf d emf + e−mf d emf . This is a self-adjoint operator whose spectrum was studied by Witten [17] (see, for example, [16] for a mathematically rigorous exposition of the subject). In particular, D(0, m∇f ) has the following properties:

Topological Calculation of the Phase of the Determinant

303

• There exist a constant C > 0 and a function r(m) > 0 such that limt→0 r(m) = 0 and, for all sufficiently large m > 0, the spectrum of D(0, m∇f ) lies inside the set √ √ − ∞, −C m ∪ − r(m), r(m) ∪ C m, ∞). • Let Em denote the span of the eigenvectors of D(0, m∇f ) with eigenvalues in the interval − r(m), r(m) . Then, for all sufficiently large m, the space Em has a basis α1,m , . . . , αl,m (αj,m = 1) such that each αj,m is concentrated in Wj in the following sense: αj,m ∧ ∗αj,m = 1 − o(1), as m → ∞. (6.11) Wj

(Here o(1) stands for a vector whose norm tends to 0 as m → ∞.) In particular, dim Em = l. Note that (6.11) and (6.10) imply that αj,m (x) + o(1), a(x) αj,m (x) = −αj,m (x) + o(1),

for j = 1, . . . , k; for j = k + 1, . . . , l.

(6.12)

6.8. The spectrum of the operator D(m0 a, m∇f ). Let m0 be as in Subsect. 6.6 and let C be as in Subsect. 6.7. Choose m large enough so that 4(m0 + 1) 2 m > , C and r(m) < 1. We view the operator D(m0 a, m∇f ) = D(0, m∇f ) + i m0 a(x) as a perturbation of D(0, m∇f ). Lemma 6.4. The number of eigenvalues λ (counting with multiplicities) of D(m0 a, m∇f ) which satisfy |λ| < 2(m0 + 1),

Im λ > 0,

is equal to k = deg n. Proof. The spectral projection of the operator D(0, m∇f ) onto the space Em (cf. Subsect. 6.7) is given by the Cauchy integral −1 1 λ − D(0, m∇f ) dλ, (6.13) Pm = 2πi γ where γ is the boundary of the disk B = z ∈ C : |z| < 2(m0 + 1) . Note that, for all λ ∈ γ , we have −1 1 1 = . (6.14) ≤ λ − D(0, m∇f ) 2m0 + 1 dist λ, spec D(0, m∇f )

304

A.G. Abanov, M. Braverman

Hence, for all λ ∈ γ , we obtain −1 λ − D(m0 a, m∇f ) −1 −1 ≤ λ − D(0, m∇f ) · 1 − (λ − D(0, m∇f ))−1 m0 a ≤

1 1 1 = · . 2m0 + 1 1 − 2mm00+1 m0 + 1

(6.15)

In particular, γ is contained in the resolvent set of D(m0 a, m∇f ).

denote the span of the root vectors of D(m a, m∇f ) with eigenvalues in B. Let Em 0

is given by the formula The spectral projection of D(m0 a, m∇f ) onto Em Pm =

1 2πi

λ − D(m0 a, m∇f )

−1

dλ.

γ

Using (6.14) and (6.15), we obtain −1 −1 1 λ − D(0, m∇f ) m a λ − D(m a, m∇f ) dλ 0 0 2π γ 1 1 2m0 · m0 · = < 1. (6.16) ≤ 2(m0 + 1) · 2m0 + 1 m0 + 1 2m0 + 1

Pm − P = m

In particular,

dim Em = dim Em = l,

. Recall that the basis α and the projection Pm maps Em isomorphically onto Em 1,m

, . . . , αl,m of Em was defined in Subsect. 6.7. Then Pm α1,m , . . . , Pm αl,m is a basis of

. Em From (6.12), we get

D(m0 a, m∇f ) αj,m =

i m0 αj,m + o(1), −i m0 αj,m + o(1),

for j = 1, . . . , k for j = k + 1, . . . , l.

Since the operators D(m0 a, m∇f ) and Pm commute we obtain D(m0 a, m∇f ) Pm αj,m =

i m0 Pm αj,m + o(1), −i m0 Pm αj,m + o(1),

for j = 1, . . . , k for j = k + 1, . . . , l.

has exactly k eigenvalues (counting with Hence, the restriction of D(m0 a, m∇f ) to Em multiplicities) with positive imaginary part.

Topological Calculation of the Phase of the Determinant

305

6.9. Proof of Theorem 6.1. Clearly, all the eigenvalues of D(m0 a, m∇f ) satisfy Im λ ≤ m0 . (6.17) In particular, all the eigenvalues of D(m0 a,m∇f ) which lie on the ray Rπ/2 belong to the disc B = z ∈ C : |z| < 2(m0 + 1) . Since the spectrum of D(m0 a, m∇f ) is symmetric with respect to the imaginary axis the number of these eigenvalues (counting with multiplicities) has the same parity as the number of all eigenvalues, which lie in B and have positive imaginary part. Theorem 6.6 follows now from Theorem 3.1 and Lemma 6.4. Acknowledgements. The first author would like to thank the Theory Institute of Strongly Correlated and Complex Systems at Brookhaven for hospitality and support. The second author would like to thank the Max-Planck-Institut f¨ur Mathematik, where most of this work was completed, for hospitality and providing the excellent working conditions. He would also like to thank Mikhail Shubin and Raphael Ponge for valuable discussions.

References 1. Abanov, A.G.: Hopf term induced by fermions. Phys.Lett. B492, 321–323 (2000) 2. Abanov, A.G., Wiegmann, P.B.: Theta-terms in nonlinear sigma-models. Nucl.Phys. B570, 685–698 (2000) 3. Agranovich, M.S.: Elliptic operators on closed manifolds. Current problems in mathematics. Fundamental directions, Vol. 63 (Russian), Itogi Nauki i Tekhniki, Moscow: Akad. Nauk SSSR Vsesoyuz. Inst. Nauchn. i Tekhn. Inform., 1990, pp. 5–129 4. Berline, N., Getzler, E., Vergne, M.: Heat kernels and Dirac operators. Berlin-Heidelberg-New York: Springer-Verlag, 1992 5. Berry, M.: Quantal phase factors accompanying adiabatic changes. Proc. R. Soc. Lond. A 392, 45–57 (1984) 6. Burghelea, D., Friedlander, L., Kappeler, T.: On the determinant of elliptic differential and finite difference operators in vector bundles over S 1 . Comm. Math. Phys. 138(1), 1–18 (1991) 7. Kato, T.: Perturbation theory for linear operators. Berlin-Heidelberg-New York: Springer-Verlag, 1966 8. Lidski˘ı, V.B.: Non-selfadjoint operators with a trace. Dokl. Akad. Nauk SSSR 125, 485–487 (1959) 9. Markus, A.S.: Introduction to the spectral theory of polynomial operator pencils. Translations of Mathematical Monographs, Vol. 71, Providences RI: Amer. Math, Soc. 1998 10. Ponge, R.: Spectral asymetry, zeta function and the noncommutative residue. Preprint, to appear in J. Funct. Anal. 11. Redlich, A.N.: Gauge Noninvariance and Parity Nonconservation of Three-Dimensional Fermions. Phys. Rev. Lett. 52, 18–21 (1984) 12. Redlich, A.N.: Parity violation and gauge noninvariance of the effective gauge field action in three dimensions. Phys. Rev. D 29, 2366–2374 (1984) 13. Retherford, J.R.: Hilbert space: compact operators and the trace theorem. London Mathematical Society Student Texts, Vol. 27, Cambridge: Cambridge University Press, 1993 14. Seeley, R.: Complex powers of elliptic operators. Proc. Symp. Pure and Appl. Math. AMS 10, 288–307 (1967) 15. Shubin, M.A.: Pseudodifferential operators and spectral theory. Berlin, New York: Springer Verlag, 1980 16. Shubin, M.A.: Semiclassical asymptotics on covering manifolds and Morse inequalities. Geom. Funct. Anal. 6, 370–409 (1996) 17. Witten, E.: Supersymmetry and Morse theory. J. of Diff. Geom. 17, 661–692 (1982) 18. Wodzicki, M.: Local invariants of spectral asymmetry. Invent. Math. 75(1), 143–177 (1984) 19. Wodzicki, M.: Noncommutative residue. I. Fundamentals. K-theory, arithmetic and geometry (Moscow, 1984–1986), Lecture Notes in Math., Vol. 1289, Berlin: Springer, 1987 pp. 320–399 20. Wojciechowski, K.P.: Heat equation and spectral geometry. Introduction for beginners. Geometric methods for quantum field theory (Villa de Leyva, 1999), River Edge, NJ: World Sci. Publishing, 2001, pp. 238–292 Communicated by P. Sarnak

Commun. Math. Phys. 259, 307–324 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1392-8

Communications in

Mathematical Physics

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4 Ingemar Bengtsson1 , Åsa Ericsson1 , Marek Ku´s2 , Wojciech Tadej3 , 2,4 ˙ Karol Zyczkowski 1

Stockholm University, AlbaNova, Fysikum, 106 91 Stockholm, Sweden. E-mail: [email protected]; [email protected] 2 Centrum Fizyki Teoretycznej, Polska Akademia Nauk, Al. Lotników 32/44, 02–668 Warszawa, Poland. E-mail: [email protected] 3 Cardinal Stefan Wyszynski University, Warszawa, Poland. E-mail: [email protected] 4 Instytut Fizyki im. Smoluchowskiego, Uniwersytet Jagiello´ nski, ul. Reymonta 4, 30–059 Kraków, Poland. E-mail: [email protected] Received: 8 March 2004 / Accepted: 24 March 2005 Published online: 15 July 2005 – © Springer-Verlag 2005

Abstract: The set of bistochastic or doubly stochastic N × N matrices is a convex set called Birkhoff’s polytope, which we describe in some detail. Our problem is to characterize the set of unistochastic matrices as a subset of Birkhoff’s polytope. For N = 3 we present fairly complete results. For N = 4 partial results are obtained. An interesting difference between the two cases is that there is a ball of unistochastic matrices around the van der Waerden matrix for N = 3, while this is not the case for N = 4. 1. Introduction Unistochastic matrices arise in many different contexts including error correcting codes, quantum information theory and particle physics. To define them, we first recall that an N × N matrix B is said to be bistochastic if its matrix elements satisfy i: Bij ≥ 0, ii: Bij = 1, iii: Bij = 1 . (1) i

j

The set of bistochastic matrices is a convex polytope known as Birkhoff’s polytope. One way of constructing a bistochastic matrix is to begin with a unitary matrix U and let Bij = |Uij |2 .

(2)

However, it is well-known [1] that not all bistochastic matrices arise in this way. If there is such a U , then we will call B unistochastic. If U is also real, that is orthogonal, then we call B orthostochastic. (Much of the mathematics literature uses the term orthostochastic to mean any matrix satisfying (2) and does not distinguish the subclass for which U is real. We will see later that the distinction is important.) In this paper, we consider the problem of characterizing the unistochastic subset of Birkhoff’s polytope. Before summarizing our results, we mention some physical applications. In quantum mechanics, the transition probabilities associated with a finite basis form bistochastic

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

308

matrices. In studies of the foundations of quantum theory, the attempt to build some group structure into these transition probabilities leads to the requirement that they form unistochastic matrices. A sample of the literature includes Landé [2], Rovelli [3] and Khrennikov [4]. In the attempt to formulate quantum mechanics on graphs (in the laboratory on thin strips of, say, gold film) the question of what Markov processes have quantum counterparts in the given setting again leads to unistochastic matrices [5–7]. In this connection studies of the spectra and entropies of unistochastic matrices chosen at random have been made [8]. In particle physics, a related question arises. In the theory of weak interactions one encounters the unitary Kobayashi-Maskawa matrices (one for quarks and one for neutrinos), and Jarlskog raised the question to what extent such a matrix can be parametrized by the easily measured moduli of its matrix elements. The physically interesting case here is N = 3 [9], and possibly also N = 4, should a fourth generation of quarks be discovered [10]. The question of determining U from B also arises in scattering theory, with no restriction on N [11]. Our main result involves the van der Waerden matrix JN , whose matrix elements satisfy (JN )ij = N1 . This matrix is unistochastic, and any corresponding unitary matrix is known as a complex Hadamard matrix. An example is the Fourier matrix, whose matrix elements are 1 Uj k = √ q j k , N

0 ≤ j, k ≤ N − 1 .

(3)

Here q = e2π i/N is a root of unity. Complex Hadamard matrices have a long history in mathematics [12–14], and have recently arisen in quantum information theory [15–17]. In this paper we study the set of unistochastic matrices, and the precise way in which it forms a subset of Birkhoff’s polytope. Our main result is that for N = 4 every neighborhood of the van der Waerden matrix contains matrices that are not unistochastic. This is in striking contrast with the N = 3 case for which J3 is at the center of a ball of unistochastic matrices inside a star-shaped region bounded by the set of orthostochastic matrices. This paper is organized as follows. In Sect. 2 we consider the set of all bistochastic matrices, and describe the cases N = 3 and N = 4 in detail (N = 2 is trivial). In Sect. 3 we discuss some generalities concerning unistochastic matrices, and then characterize the unistochastic subset in the case N = 3. Most of our results can be found elsewhere but, we believe, not in this coherent form. In Sect. 4 we consider N = 4, prove our main result, and relate some already known facts [10] to our explicit description of Birkhoff’s polytope. Section 5 summarises our conclusions. Some technical matters are found in three appendices. 2. Birkhoff’s Polytope The set BN of bistochastic N × N matrices has (N − 1)2 dimensions. To see this, note that the last row and the last column are fixed by the conditions that the row and column sums should equal one. The remaining (N − 1)2 entries can be chosen freely, within limits. Birkhoff proved that BN is a convex polytope whose extreme points, or corners, are the N ! permutation matrices [18]. It is called Birkhoff’s polytope. All its corners are equivalent in the sense that they can be transformed into each other by means of orthogonal transformations. A bistochastic matrix belongs to the boundary of BN if and only if at least one of its entries is zero. The boundary consists of corners, edges, faces,

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

309

3-faces and so on; the highest dimensional faces are called facets and consist of matrices with only one zero entry. For a detailed account of BN , especially its face structure, see Brualdi et al. [19]. We will be even more detailed concerning B3 and B4 . We will use a quite explicit notation for the 24 permutation matrices in B4 ; see Appendix A for the details. It is convenient to regard the convex polytope BN as a subset of a vector space, with the van der Waerden matrix JN as its origin. The distance squared between two matrices is chosen to be D 2 (A, B) = Tr(A − B)(A† − B † ) ,

(4)

where the dagger denotes Hermitian conjugation. The distance squared between an arbitrary bistochastic matrix B and the van der Waerden matrix JN is then given by D 2 (B, JN ) = Bij2 − 1 . (5) i,j

In particular, the distance between JN and a corner of the polytope becomes √ D = N − 1. Permutations of rows or columns are orthogonal transformations of the polytope, since they preserve distance and leave the van der Waerden matrix invariant. They also take permutation matrices (corners) into permutation matrices, hence they are symmetry operations of Birkhoff’s polytope as well. The (Shannon) entropy of a bistochastic matrix is defined as the entropy of the rows averaged over the columns, 1 S=− Bij ln Bij . (6) N i

j

Its maximum value ln N is attained at JN . For some of its properties consult Słomczy´nski [20] et al. [8]. When N = 2 there are just two permutation matrices and B2 is a line segment between these two points. A general bistochastic matrix can be parametrized as 2 2 π c s B= 2 2 , c ≡ cos θ , s ≡ sin θ , 0≤θ ≤ . (7) s c 2 When N = 3 we have six permutation matrices forming the vertices of a four dimensional polytope. It admits a simple description: Theorem 1. The 6 corners of B3 are the corners of two equilateral triangles placed in two totally orthogonal 2-planes and centered at J3 . To prove this we form two triangles as convex combinations of permutation matrices. Using a notation that is consistent with Appendix A they are   p0 p3 p4 1 = p0 P0 + p3 P3 + p4 P4 =  p4 p0 p3  , p0 + p3 + p4 = 1 (8) p 3 p4 p 0 and

 p1 p2 p5 2 = p1 P1 + p2 P2 + p5 P5 =  p2 p5 p1  , p5 p 1 p 2 

p1 + p2 + p5 = 1 .

(9)

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

310

The calculation we have to do is to check that D 2 (P0 , P3 ) = D 2 (P0 , P4 ) = D 2 (P3 , P4 ) = 6 and similarly for the other triangle, and also that Tr(1 − J3 )(†2 − J3 ) = 0

(10)

for all values of pi . This is so. There are thus 6 corners and 6 · 5/2 = 15 edges, all of which are extremal. The last is a rather exceptional property; in 3 dimensions only the simplex has it. There are 9 short edges of length squared D 2 = 4 and 6 long edges of length squared D 2 = 6, namely the sides of the two equilateral triangles. A useful overview of B3 is given by its graph, where we exhibit all corners and all edges (see Fig. 1). All the 2-faces are triangles with one long and two short edges. The 3-faces in a 4 dimensional polytope are facets and here they are made of matrices with a single zero. They are irregular tetrahedra with two long edges, one from each equilateral triangle (see Fig. 4). The volume of B3 is readily computed because it can be triangulated using only three simplices. The total volume is 9/8. As N grows the total volume of BN becomes increasingly hard to compute; mathematicians know it for N ≤ 10 [21]. The next case is the 9 dimensional polytope B4 . It has 24 corners and 276 edges. The latter come in four types and we give the classification including the angle they subtend at J4 and whether they consist of unistochastic matrices or not (see Sects. 3 and 4): 4U 6 8 8U

Length squared Unistochastic Angle at origin Number of edges 4 Yes Acute 72 6 No 90 degrees 96 8 No Obtuse 72 8 Yes Obtuse 36

All edges except the 8U ones are extremal. The 2-faces consist of triangles and squares. (Interestingly, for all N it is true that the 2-faces of Birkhoff’s polytope BN are either triangles or rectangles [19].) There are 18 squares bounded by edges of type 4U and their diagonals are of type 8U . Three squares meet at each corner. If we pick four permutation matrices we obtain a 3-face, with six exceptions. The exceptions form 6 regular tetrahedra centered at J4 , whose edges are non-extremal 8U edges. They are denoted Ti

Fig. 1. Left: Birkhoff’s polytope for N = 2 (centered at J2 ). Right: The graph of Birkhoff’s polytope for N = 3; single lines have D 2 = 4 and double D 2 = 6. The double edges form the triangles mentioned in Theorem 1

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

311

Fig. 2. How to begin to draw the surface of B4 . Two tetrahedra whose edges are the non-extremal diagonals of squares are shown. The dashed line goes through the polytope; it connects the midpoints of two opposing 8U edges of two tetrahedra that are otherwise disjoint

and explicitly listed in Appendix A. When regular tetrahedra are mentioned below it is understood that we refer to one of these six. In a sense the structure can now be drawn; see Fig. 2. The facets consist of matrices with one zero, so there are 16 facets. A subset of B4 that has no counterpart for B3 is the set of matrices that are tensor products of two by two bistochastic matrices. This subset splits naturally into several two dimensional components, and it turns out that they sit in B4 as doubly ruled surfaces inside the regular tetrahedra. Thus the following matrix, parametrised with two angles, is a tensor product of two matrices of the form (7): 

c12 c22 c12 s22 s12 c22 s12 s22



 c2 s 2 c2 c2 s 2 s 2 s 2 c2   1 2 1 2 1 2 1 2  2 2 2 2 2 2 2 2 ,  s 1 c 2 s 1 s 2 c1 c2 c 1 s 2 

c1 ≡ cos θ1 , etc.

(11)

s12 s22 s12 c22 c12 s22 c12 c22 These matrices form a doubly ruled surface inside the regular tetrahedron T1 , analogous to that depicted in Fig. 4. An interesting way to view B4 , and one that will recur in Sect. 4, stems from the following observation: Theorem 2. The 24 corners of B4 belong to a set of nine orthogonal hyperplanes through J4 . Each regular tetrahedron belongs to six hyperplanes and contains the normal vectors of the remaining three hyperplanes. Each hyperplane contains four regular tetrahedra and its normal vector is the intersection of the remaining two regular tetrahedra. Again the proof is a simple calculation, once the explicit form of the hyperplanes is known. They are denoted i and listed in Appendix A. From now on, hyperplane always refers to one of these nine. Figure 3 in a sense illustrates the theorem.

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

312

Fig. 3. A regular tetrahedron centered at J4 . It contains the normal vectors of three orthogonal hyperplanes and belongs entirely to another six. There are six such regular tetrahedra and pairs of them intersect along the normal vectors they contain. (Note that the dashed line in Fig. 2 represents such a normal vector.)

It is quite helpful to have an incidence table for tetrahedra and hyperplanes available. It is T1 T2 T3 T4 T5 T6

1 2 X X X X X X X X

3 4 5 6 X X X X X X X X X X X X X X X X

7 8 9 X X X X X X X X X X X X

(12)

where the tetrahedra Ti and the hyperplanes i are listed in Appendix A. For later purposes we will need some information about exactly how the hyperplanes divide the space into 29 hyperoctants. For this reason we look at the rays Bi (t) = J4 + tVi ,

(13)

where Vi is a vector constructed in terms of the normal vectors n1 , . . . , n9 of the hyperplanes (see Appendix A), namely   9 −3 −3 −3 1  −3 1 1 1  V1 ≡ n1 + n2 + n3 + n4 + n5 + n6 + n7 + n8 + n9 =   , (14) 4 −3 1 1 1 −3 1 1 1 

 7 −1 −1 −5 1  −1 −1 −1 3  V2 ≡ n1 + n2 + n3 + n4 + n5 + n6 + n7 + n8 − n9 =   , (15) 4 −1 −1 −1 3 −5 3 3 −1

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

313



 5 1 −3 −3 1  1 −3 1 1  V3 ≡ n1 + n2 + n3 + n4 − n5 + n6 + n7 + n8 − n9 =   . (16) 4 −3 1 −3 5 −3 1 5 −3 All other cases can be obtained from one of these three by permutations of rows and columns. The various hyperoctants are convex cones centered on these rays. This gives a classification of the hyperoctants into six different types (since the parameter t can be positive or negative) called respectively type I± , II± and III± . Type I has 16 representatives and is especially noteworthy. For type I− the centered ray hits the boundary in the center of one of the 16 facets, at the matrix B1 (− 19 ). In the other direction we also hit a quite distinguished point. There are 16 ways of setting one entry of a bistochastic matrix equal to one, and this gives rise to 16 copies of B3 sitting in the boundary of B4 . For the octants I+ the centered ray hits the boundary precisely at the center of such a B3 , at the matrix B1 ( 13 ). In Sect. 4 we will see how the structure of the unistochastic subset is related to the structure of Birkhoff’s polytope, and in particular to those of its features that we have stressed. 3. The Unistochastic Subset, N = 3 Let us begin with some generalities concerning the unistochastic subset UN of BN . The dimension of BN is (N −1)2 , and the dimension of the group of unitary N by N matrices, U (N ), is N 2 . Therefore the map U (N ) → BN cannot be one-to-one. Now it is clear that multiplying a row or a column by a phase factor—an operation that we refer to as rephasing—will result in the same bistochastic matrix via Eq. (2). Therefore the map is naturally defined as a map from a double coset space to BN . The double coset space is U (1) × · · · × U (1) \ U (N) / U (1) × · · · × U (1) ,

(17)

with N U (1) factors acting from the right and N − 1 factors from the left, say. The dimension of this set is (N − 1)2 , so now the dimensions match. There is a complication because the double coset space is not a smooth manifold. The action from the left of the U (1) factors on the right coset space (in itself a well behaved flag manifold) has fixed points. These fixed points are easy to locate however (and always map to the boundary of BN ), so that for most practical purposes we can think of our map as a map between smooth manifolds. In general we will see that the image of our map is a proper subset of BN , and the map is many-to-one. There is not much we can usefully say about the general case, except for two remarks: The unistochastic subset UN has the full dimension (N − 1)2 while the unistochastic subset of the boundary of BN has dimension (N − 1)2 − 2; why this is so will presently become clear. For N = 2 every bistochastic matrix is orthostochastic. A unitary matrix that maps to the matrix in Eq. (7) is π c s U= , c ≡ cos θ , s ≡ sin θ , 0≤θ ≤ . (18) s −c 2 The matrix is given in dephased form. This means that the first row and the first column are real and positive. This fixes the U (1) factors mentioned above (unless there is a zero entry in one of these places) and from now on we shall present all unitary matrices

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

314

in this form. For any N it is straightforward to check whether a given edge of BN is unistochastic. For N = 3 the edges of length squared equal to 4 are unistochastic, and for N = 4 we have the results given in table (11). Given a 3 × 3 bistochastic matrix

it is easy to check whether it is unistochastic or not [22, 9]. We form the moduli rij = Bij and write down the matrix 

 r00 r01 • U =  r10 r11 eiφ11 •  . r20 r21 eiφ21 •

(19)

This matrix is given in dephased form. If it is unitary, the original matrix B is unistochastic. The unitarity conditions simply say that the first two columns are orthogonal. The last column by construction has the right moduli and does not impose any further restrictions, hence it is not written explicitly. The problem is whether phases φ11 and φ21 can be found so that the matrix is unitary. This problem can be translated into the problem of forming a triangle from three line segments of given lengths L0 = r00 r01 ,

L1 = r10 r11 ,

L2 = r20 r21 .

(20)

This is possible if and only if the “chain–links” conditions are fulfilled, i.e. |L1 − L2 | ≤ L0 ≤ L1 + L2 .

(21)

The bistochastic matrix B corresponding to U sits at the boundary of U3 if and only if one of these inequalities is saturated. When the inequalities (21) hold the solution for the phases is cos φ11 =

L22 − L20 − L21 , 2L0 L1

cos (φ11 − φ21 ) =

cos φ21 =

L21 − L22 − L20 , 2L0 L2

L20 − L21 − L22 . 2L1 L2

(22)

(23)

There is a two-fold ambiguity (corresponding to taking the complex conjugate of the matrix, U → U ∗ ). The area A of the triangle is easily computed and the chain–links conditions are equivalent to the single inequality A ≥ 0. As a matter of fact we can form six so called unitarity triangles in this way, depending on what pair of columns or rows that we choose. Although their shapes differ their area is the same, by unitarity [9]. Because we can easily decide if a given matrix is unistochastic, it is easy to characterize the unistochastic set U3 . We single out the following facts (some of which are known [22, 23]) for attention: Theorem 3. The unistochastic subset U3 of B3 is a non-convex star shaped four dimensional set whose boundary consists of√the set of orthostochastic matrices. It contains a unistochastic ball of maximal radius 2/3, centered at J3 . The set meets the boundary of B3 in a doubly ruled surface in each facet.

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

315

Fig. 4. Birkhoff’s polytope for N = 3. Left: One of the two orthogonal equilateral triangles centered at J3 , with its unistochastic subset (the boundary is the famous hypocycloid). Right: A facet, an irregular tetrahedron, with its doubly ruled surface of unistochastic matrices

The relative volume of the unistochastic subset is, according to our numerics, vol(U3 ) ≈ 0.7520 ± 0.0005 . vol(B3 )

(24)

We did not attempt an analytical calculation; details of our numerics are in Appendix B. Theorem 3 is easy to prove. To see that U3 is non-convex we just draw its intersection with one of the equilateral triangles that went into the definition of the polytope, and look at it (see Fig. 4). An amusing side remark is that the boundary of the unistochastic set in this picture is a 3-hypocycloid [8]. It can be obtained by rolling a circle of radius 1/3 inside the unit circle. The maximal unistochastic ball is centered at J3 and touches the boundary at the hypocycloid, as one might guess from the picture; its radius was deduced from results presented in ref. [24]. To see that the boundary consists of orthostochastic matrices, observe that when the chain–links conditions are saturated the phases in U will equal ±1. That the set is star shaped then follows from an explicit check that there is only one orthostochastic matrix on any ray from J3 . Finally Fig. 4 includes a picture of the unistochastic subset of a facet. The reason why it has codimension one is that a matrix on the boundary of BN has a zero entry, which means that the number of phases available in the dephased unitary matrix drops with one, and then the dimension of the unistochastic set also drops with one; the argument goes through for any N . Finally let us make some remarks on entropy. First we compare the Shannon entropy averaged over B3 to the Shannon entropy averaged over U3 , using the flat measure in both cases. Numerically we find that S B3 ≈ 0.883

and

S U3 ≈ 0.908 ,

(25)

with all digits significant. Observe that the latter average is larger since some matrices of small entropy close to the boundary of B3 are not unistochastic and do not contribute to the average over U3 . The above data may be compared with the maximal possible entropy Smax = ln 3 ≈ 1.099, attained at J3 , and also with S Haar =

1 1 + ≈ 0.833 , 2 3

(26)

which is the average taken over U3 with respect to the measure induced by the Haar measure on U (3). This analytical result follows from the observation that S Haar coincides

316

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

with the average entropy of squared components of complex random vectors, which was computed by Jones [25]. 4. The Unistochastic Subset, N = 4 The case N = 4 is more difficult. It is also clear from the outset that it will be qualitatively different—thus the dimension of the orthogonal group is too small for the boundary of the unistochastic set U4 to be formed by orthostochastic matrices alone. There are other differences too, as we will see.

Given a bistochastic matrix we can again define rij = Bij and consider   r00 r01 r02 •  r10 r11 eiφ11 r12 eiφ12 •   U = (27)  r20 r21 eiφ21 r22 eiφ22 •  . r30 r31 eiφ31 r32 eiφ32 • Phases must now be chosen so that this matrix is unitary, and more especially so that the three columns we focus on are orthogonal. Geometrically this is the problem of forming three quadrilaterals with their sides given and six free angles. This is not a simple problem, and in practice we have to resort to numerics to see whether a given bistochastic matrix is unistochastic (see Appendix B for details). There are some easy special cases though. One easy case is that of a matrix belonging to the boundary of BN . Then the matrix U must contain one zero entry and when we check the orthogonality of our three columns two of the equations reduce to the problem of forming triangles. This fixes four of the angles, and the final orthogonality relation is easily dealt with. Another easy case concerns the regular tetrahedra. They turn out to consist of orthostochastic matrices; for the tetrahedron T1 (see Appendix A) a corresponding orthonormal matrix is √ √ √   √ √p0 √p7 √p16 √p23  p − p − √p23 √p16  O1 =  √ 7 √ 0 (28)  . √p16 √p23 −√p0 −√p7 p23 − p16 p 7 − p0 This saturates a bound saying that the maximum number of N × N permutation matriN

ces whose convex hull is unistochastic is not larger than 2 2 , where [N/2] denotes the integer part of N/2 [26]. Let us now turn our attention to J4 . Hadamard [27] observed that up to permutations of rows and columns the most general form of the complex Hadamard matrix is   1 1 1 1 1  1 eiφ −1 −eiφ  . H (φ) =  (29) 2  1 −1 1 −1  1 −eiφ −1 eiφ One can show that this is a geodesic in U (N ). What is new, compared to N = 3, is that the van der Waerden matrix is orthostochastic because H (0) is real. Moreover, there is a continuous set of dephased unitaries mapping to the same B. In a calculational tour de force, Auberson et al. [10] were able to determine all bistochastic matrices whose dephased unitary preimages contain a continuous ambiguity (and they found that the

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

317

ambiguity is given by one parameter in all cases). There are three such families. Using the notation of ref. [10] they consist of matrices of the following form:  Type A:

a b e f

b a f e

c d g h

 d c , h g 

Type B:

 Type C:

a a

b b   c c d d

s12 s22 c12 s22 c32 c22 s32 c22

1 2 1 2 1 2 1 2

−a −b −c −d

1 2 1 2 1 2 1 2

−a



−b  , −c

(30)

−d



 s 2 c2 c2 c2 c2 s 2 s 2 s 2   1 2 1 2 3 2 3 2  2 2 2 2 2 2 2 2 .  c1 c4 s 1 c4 s 3 s 4 c3 s 4 

(31)

c12 s42 s12 s42 s32 c42 c32 c42 Here c1 = cos θ1 , s1 ≡ sin θ1 , and so on. Type A consists of nine five dimensional sets, type B of nine four dimensional sets, and type C of six three dimensional sets. In trying to understand their location in B4 the observation in Sect. 2 concerning the nine orthogonal hyperplanes begins to pay dividends. (In particular, consult the incidence table (12).) Type A consists of the linear subspaces obtained by taking all intersections of four hyperplanes that contain exactly two regular tetrahedra. Type C consists of the linear subspaces obtained by taking all intersections of six hyperplanes that contain no permutation matrices at all. Type B finally consists of curved manifolds confined to one hyperplane. Auberson’s families are not exclusive. In particular tensor product matrices belong to families A and B, which means that there are two genuinely different ways of introducing a free phase in the corresponding unitary matrix. Outside the three sets A, B and C Auberson et al. find a 12-fold discrete ambiguity in the dephased unitaries, dropping to 4-fold for symmetric matrices [10]. Tensor product matrices B4 = B2 ⊗ B2 appear because 4 = 2 × 2 is a composite number. That they are always unistochastic follows from a more general result: Lemma 1. Let BK and BM be unistochastic matrices of size K and M, respectively. Then the matrix BN = BK ⊗ BM of size KM is unistochastic. The corresponding dephased unitary matrices contain at least (K − 1)(M − 1) free phases. That BN is unistochastic follows from properties of the Hadamard and the tensor products. By definition, the Hadamard product A ◦ B of two matrices is the matrix whose matrix elements are the products of the corresponding matrix elements of A and B. Then ∗ implies that B = (U ◦ U ∗ ) ⊗ (U ◦ U ∗ ) = BK = UK ◦ UK∗ and BM = UM ◦ UM N K M K M ∗ ∗ (UK ⊗ UM ) ◦ (UK ⊗ UM ), so it is unistochastic. The existence of free phases is an easy generalization of Proposition 2.9 in Haagerup [28]. The hyperplane structure of B4 reverberates in the structure of the unistochastic set in several ways. Let us consider how the tangent space of U (N ) behaves under the map to BN . In equations, this means that we fix a unitary matrix U0 and expand 1 U (t) = eiht U0 = (1 + iht − h2 t 2 + . . . )U0 , 2

(32)

where h is an Hermitian matrix. Then we study bistochastic matrices with elements Bij (t) = |Uij (t)|2 to first order in t. The following features are true for all N :

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

318

• Generically the tangent space of U (N ) maps onto the tangent space of BN . We checked this statement by generating unitary matrices at random using the Haar measure on the group. It implies that the dimension of the unistochastic set is equal to that of BN . • A matrix element in B receives a first order contribution only if it is non-vanishing. Hence the map of the tangent space of U (N ) to the tangent space of BN is degenerate at the boundary of the polytope. In general such behaviour is to be expected at the boundary of the unistochastic set UN . • If U0 is real the map is degenerate in the sense that the tangent space maps to an N(N − 1)/2 dimensional subspace of the tangent space of BN . • If U0 maps to a corner of the polytope then the first order contributions vanish. To second order we pick up the tip of a convex cone whose extreme rays are the N (N −1)/2 edges of type 4U , emanating from that corner. For N = 4 the story becomes interesting when we choose U0 equal to the Hadamard matrix H (φ). Then we find that the tangent space at U0 maps into one of the nine hyperplanes; which particular one depends on how we permute rows and columns in Eq. (29). The question therefore arises whether the orthostochastic van der Waerden matrix belongs to the boundary of the unistochastic set—or not since a priori such degeneracies can occur also in the interior of the set. We know that we can form curves of unistochastic matrices starting from J4 and moving out into the nine hyperplanes. Can we form such curves that go directly out into one of the 29 hyperoctants? Here the division of the 29 hyperoctants into six different types becomes relevant. We have investigated whether their central rays given in Eqs. (13–16) consist of unistochastic matrices, or not. Let us begin with the 16 hyperoctants of type I, where the central ray B1 (t) = J4 + tV1 hits the boundary in the center of one of the 16 B3 sitting in the boundary (at t = 1/3), and in the center of one of the 16 facets (at t = −1/9). Of these two points, the first is unistochastic, the second is not. A one parameter family of candidate unitary matrices that maps to the central ray is √ √  √ • √1 − 3t iφ √1 − 3t √1 − 3t φ 11 12 1  1+t •  , √1 + te √1 + te √ (33) U (t) =  2  √1 + t √1 + teiφ21 √1 + teφ22 •  φ iφ 1+t 1 + te 31 1 + te 32 • where t > 0 and we permuted the columns relative to Eq. (14) in order to get the unitarity equations in a pleasant form. (We do not need to give the phases for the last column.) The conditions that the first three columns be orthogonal read eiφ11 + eiφ21 + eiφ31 + L = 0, eiφ12 + eiφ22 + eiφ32 + L = 0, i(φ11 −φ12 ) i(φ21 −φ22 ) e +e + ei(φ31 −φ32 ) + L = 0,

(34) (35) (36)

where L=

1 − 3t . 1+t

In Appendix C we prove that the system of equations (34–36) 1. has no real solutions for L > 1,

(37)

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

319

2. for 0 < L < 1 has the solution φ11 = 0, φ21 = φ, φ31 = −φ , φ12 = φ, φ22 = 0, φ32 = −φ

cos φ =

L+1 t −1 =− . t +1 2

(38)

It follows that the central ray is unistochastic for the hyperoctants of type I+ (and the unitary matrices on the central ray tend to the real Hadamard matrix at t = 0). In the other direction the central ray is not unistochastic for type I− . Thus we have proved Theorem 4. For N = 4 there are non-unistochastic matrices in every neighbourhood of the van der Waerden matrix J4 . At J4 the map U (4) → B4 aligns the tangent space of U (4) with one of the nine orthogonal hyperplanes. The structure of the unistochastic set is dramatically different depending on whether N = 3 or N = 4. It is only in the former case that there is a ball of unistochastic matrices surrounding the van der Waerden matrix. On the other hand, the hyperoctants are not empty—some of them do contain unistochastic matrices all the way down to J4 . Concerning the other hyperoctants, for types II− , III+ , and III− the central rays hit the boundary of the polytope in points that are not unistochastic, but numerically we find that a part of the ray close to J4 is unistochastic. For type II+ we hit the boundary in a unistochastic point and numerically we find the entire ray to be unistochastic. There is still much that we do not know. We do not know if the hyperoctants of type I− are entirely free of unistochastic matrices, nor do we know if U4 is star shaped, or what its relative volume may be. What is clear from the results that we do have is that the global structure of Birkhoff’s polytope reverberates in the structure of the unistochastic subset in an interesting way—it is a little bit like a nine dimensional snowflake, because the nine hyperplanes in B4 can be found through an analysis of the behaviour of U4 in the neighbourhood of J4 . 5. Conclusions Our reasons for studying the unistochastic subset of Birkhoff’s polytope have been summarized in the introduction. Because the problem is a difficult one we concentrated on the cases N = 3 and N = 4. Our descriptions of Birkhoff’s polytope for these two cases are given in Theorems 1 and 2, respectively, and a characterization sufficient for our purposes of the unistochastic set for N = 3 is given in Theorem 3. For N = 4 the dimension of the unistochastic set is again equal to that of the polytope itself, but its structure differs dramatically from the N = 3 case. In particular Theorem 4 states that for N = 4 there are non-unistochastic matrices in every neighbourhood of the van der Waerden matrix. Hence there does not exist a unistochastic ball surrounding the van der Waerden matrix. We observed that the structure of the unistochastic set at the center of the polytope reflects the global structure of the latter in an interesting way. It is natural to ask to what extent the difference between the two cases is due to the fact that 3 is prime while 4 is not. Although this is not the place to discuss the cases N > 4, let us mention that we have reasons to believe that the dimension of the unistochastic set is equal to that of BN for all values of N [29]. On the other hand it is only when N is a prime number that we have been able to show that there is a unistochastic ball surrounding the van der Waerden matrix.

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

320

Acknowledgements. We thank Göran Björck, Prot Pako´nski, Wojciech Słomczy´nski, and Gregor Tanner for discussions, Petre D˘i¸ta for email correspondence, Uffe Haagerup for supplying us with a copy of Petrescu’s thesis, and an anonymous referee for comments. Financial support from the Swedish Research CouncilVR, and from the Polish Ministry of Scientific Research under grant No PBZ-MIN-008/P03/2003, is gratefully acknowledged.

Appendix A: Notation For N = 4 we have defined the 24 permutation P0 , . . . , P23 matrices in a lexicographical order. They can be regarded as the corners of 6 regular tetrahedra, that can be written in the form   p0 p7 p16 p23 p p p p  T1 = p0 P0 + p7 P7 + p16 P16 + p23 P23 =  7 0 23 16  , p16 p23 p0 p7 p23 p16 p7 p0 

T2 = p1 P1 + p6 P6 + p17 P17 + p22 P22

p1  p6 = p22 p17

p6 p1 p17 p22



T3 = p2 P2 + p10 P10 + p13 P13 + p21 P21

p2  p13 = p10 p21 

T4 = p3 P3 + p11 P11 + p12 P12 + p20 P20

p3  p12 = p20 p11 

T5 = p4 P4 + p8 P8 + p15 P15 + p19 P19

p4  p19 = p8 p15 

T6 = p5 P5 + p9 P9 + p14 P14 + p18 P18

p5  p18 = p14 p9

p17 p22 p6 p1

 p22 p17  , p1  p6

p10 p21 p2 p13

p13 p2 p21 p10

 p21 p10  , p13  p2

p11 p20 p12 p3

p12 p3 p11 p20

 p20 p11  , p3  p12

p8 p15 p4 p19

p15 p8 p19 p4

 p19 p4  , p15  p8

p9 p14 p18 p5

p14 p9 p5 p18

 p18 p5  . p9  p14

These expressions also implicitly define our numbering convention for the permutation matrices. The nine hyperplanes mentioned in Theorem 2 consist of matrices of the form     B00 B01 • • B00 B01 • • B B • •  • • • • 1 =  10 11 , 2 =  , B20 B21 • •  • • • • • • •• • • ••

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4



B00  • 3 =  • B30 

B00  • 5 =  B20 • 

B00  B10 7 =  • •

• • • •

 • • , • •

B02 • B22 •

 • • , • •

B01 • • B31 • • • •

•• •• •• ••

 B03 B13  , •  • 

B00  • 9 =  • B30



B00  B10 4 =  • • 

B00  • 6 =  • B30 

B00  • 8 =  B20 • •• •• •• ••

321

• • • •

B02 B12 • •

 • • , • •

• • • •

B02 • • B32

 • • , • •

•• •• •• ••

 B03 •  , B23  •

 B03 •  , •  B33

where the matrix elements that are explicitly written are assumed to sum to one (hence this holds also for the remaining three blocks taken separately). The normal vectors of these hyperplanes are the matrices   1 1 −1 −1 1  1 1 −1 −1  n1 =   4 −1 −1 1 1 −1 −1 1 1 and so on. Appendix B: Numerics I. Entropy averaged over B3 . To generate a random bistochastic matrix from the flat measure on B3 ⊂ R4 , we have drawn at random a point (x, y, z, t) in the 4-dimensional hypercube. It determines a minor of a N = 3 matrix B, and the remaining five elements of B may be determined by the unit sum conditions in Eq. (1). Condition i is fulfilled if the sums in both rows and both columns of the minor does not exceed unity, and the sum of all four elements is not smaller than one. If this was the case, the random matrix B was accepted to the ensemble of random bistochastic matrices. If additionally, the chain links condition (21) were satisfied, the matrix was accepted to the ensemble of unistochastic matrices, generated with respect to the flat measure on U3 . The mean entropies, (25), were computed by taking an average over both ensembles consisting of 107 random matrices, respectively. II. Numerical verification, whether a given bistochastic matrix B is unistochastic. We have performed a random walk in the space of unitary matrices. Starting from an arbitrary random initial point U0 we computed B0 = U0 ◦ U0∗ and its distance to the analyzed matrix, D0 = D(B0 , B), as defined in (4). We fixed a small parameter α ≈ 0.1, generated a random Hermitian matrix H from the Gaussian unitary

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

322

ensemble [30], and found a unitary perturbation V = exp(−iαH ). The matrix Un+1 = V Un was accepted as a next point of the random trajectory if the distance Dn+1 was smaller than the previous one, Dn . If a certain number (say 100) of random matrices V did not allow us to decrease the distance, we reduced the angle α by half, to start a finer search. A single run was stopped if the distance D was smaller than = 10−6 (numerical solution found), or α got smaller than a fixed cut off value (say αmin = 10−4 ). In the latter case, the entire procedure was repeated a hundred times, starting from various unitary random matrices U0 , generated from the Haar measure on U (4) [31]. The smallest distance Dmin and the closest unistochastic matrix Bmin = Un ◦ U¯ n were recorded. To check the accuracy of the algorithm we constructed several random unistochastic matrices, B = U ◦ U ∗ , and verified that the random walk procedure gave their approximations with Dmin < . Appendix C: A System of Equations In order to curtail a plethora of indices in Eqs. (34–36) and ease the subsequent notation, let us introduce shorthand: ϕj = φj 1 , ψj = −φj 2 , j = 1, 2, 3. With that the system reads eiϕ1 + eiϕ2 + eiϕ3 = −L, eiψ1 + eiψ2 + eiψ3 = −L, ei(ϕ1 +ψ1 ) + ei(ϕ2 +ψ2 ) + ei(ϕ3 +ψ3 ) = −L.

(39) (40) (41)

We shall prove the following: Lemma 2. The system of Eqs. (39–41) 1. has no real solutions for L > 1, 2. for 0 < L < 1 has the solution ϕ1 = 0, ϕ2 = φ, ϕ3 = −φ , ψ1 = −φ, ψ2 = 0, ψ3 = φ

cos φ =

L+1 t −1 =− , t +1 2

(42)

unique up to obvious permutations, 3. has continuous families of solutions for L = 0, 1. Indeed, each of the unimodal numbers eiϕk , k = 1, 2, 3 is a root of: P (λ) = (λ − eiϕ1 )(λ − eiϕ2 )(λ − eiϕ3 ) = λ3 − (eiϕ1 + eiϕ2 + eiϕ3 )λ2 + (ei(ϕ1 +ϕ2 ) + ei(ϕ1 +ϕ3 ) + ei(ϕ2 +ϕ3 ) )λ −e(iϕ1 +ϕ2 +ϕ3 ) (43) = λ3 − (eiϕ1 + eiϕ2 + eiϕ3 )λ2 + (e−iϕ3 + e−iϕ2 + e−iϕ1 )e(iϕ1 +ϕ2 +ϕ3 ) λ −e(iϕ1 +ϕ2 +ϕ3 ) = λ3 + λ2 L − λLei − ei = λ2 (λ + L) − (1 + λL)ei , where = ϕ1 + ϕ2 + ϕ3 , and we used (39) and the reality of L. Thus each λ = eiϕk , (k = 1, 2, 3), fulfills: λ2 (λ + L) = (1 + λL)ei .

(44)

Birkhoff’s Polytope and Unistochastic Matrices, N = 3 and N = 4

323

Analogously, µ = eiψk , (k = 1, 2, 3), fulfills µ2 (µ + L) = (1 + µL)ei ,

(45)

with = ψ1 + ψ2 + ψ3 . Observe now, that if λ = eiϕk and µ = eiψk are solutions of (39–41) with the same number k (k = 1, 2, 3) then, upon the same reasoning applied to (41), λµ fulfills λ2 µ2 (λµ + L) = (1 + λµL)ei(+ ) .

(46)

Multiplying (44) by (45) and finally by (46) after exchanging its sides, we obtain, after division by λ2 µ2 ei(+ ) = 0, (L + λ)(L + µ)(Lλµ + 1) = (Lλ + 1)(Lµ + 1)(L + λµ),

(47)

which, upon substitution λ = eiϕk , µ = eiψk and putting everything on one side factorizes to L(L − 1)(eiϕk − 1)(eiψk − 1)(ei(ϕk +ψk ) − 1) = 0,

(48)

(any computer symbolic manipulation program can be helpful in revealing (48) from (47)). Hence, if L = 0, 1, then for each pair (ϕk , ψk ), k = 1, 2, 3, either: a) one of the angles is zero or b) they are opposite. The latter case can not occur for all three pairs since then ei(ϕ1 +ψ1 ) + ei(ϕ2 +ψ2 ) + ei(ϕ3 +ψ3 ) = 3 = −L, hence at least one of ϕk or ψk equals zero. Up to unimportant permutations we can assume ϕ3 = 0, but then, since eiϕ1 + eiϕ2 + eiϕ3 = −L ∈ R, we immediately get ϕ1 = −ϕ2 . This determines also all other angles (also up to some unimportant permutation), and we end up with the solution announced in point 2 above as the only possibility, but such a solution exists only if L ≤ 1. To prove point 3, observe that 1. for L = 0, ϕ1 = ϕ, ψ1 = ψ,

ϕ2 = ϕ + 2π/3, ϕ3 = ϕ + 4π/3, ψ2 = ψ + 2π/3, ψ3 = ψ + 4π/3,

(49) (50)

is a legitimate solution of (39–41) for arbitrary ϕ and ψ, 2. for L = 1, ϕ1 = ϕ, ϕ2 = π, ϕ3 = ϕ + π, ψ1 = −ϕ + π, ψ2 = π, ψ3 = −ϕ, is a solution for an arbitrary ϕ.

(51) (52)

324

˙ I. Bengtsson, Å. Ericsson, M. Ku´s, W. Tadej, K. Zyczkowski

References 1. Marshall, A.W., Olkin, I.: Inequalities: Theory of Majorization and its Applications. New York: Academic Press 1979 2. Landé, A.: From Dualism to Unity in Quantum Physics. Cambridge: Cambridge U. P. 1960 3. Rovelli, C.: Relational Quantum Mechanics. Int. J. of Theor. Phys. 35, 1637 (1996) 4. Khrennikov, A.: Linear representations of probabilistic transformations induced by context transition. J. Phys. A34, 9965 (2001) 5. Tanner, G.: Unitary-stochastic matrix ensembles, and spectral statistics. J. Phys. A34, 8485 (2001) ˙ 6. Pako´nski, P., Zyczkowski, K., Ku´s, M.: Classical 1D maps, quantum graphs and ensembles of unitary matrices. J. Phys. A34, 9303 (2001) ˙ 7. Pako´nski, P., Tanner, G., Zyczkowski, K.: Families of Line-Graphs and Their Quantization. J. Stat. Phys. 111, 1331 (2003) ˙ 8. Zyczkowski, K., Ku´s, M., Słomczy´nski, W., Sommers, H.-J.: Random unistochastic matrices. J. Phys. A36, 3425 (2003) 9. Jarlskog, C., Stora, R.: Unitarity polygons and CP violation areas and phases in the standard electroweak model. Phys. Lett. B208, 268 (1988) 10. Auberson, G., Martin,A., Mennessier, G.: On the Reconstruction of a Unitary Matrix from its Moduli. Commun. Math. Phys. 140, 523 (1991) 11. Mennessier, G., Nyuts, J.: Some unitarity bounds for finite matrices. J. Math. Phys. 15, 1525 (1974) 12. Sylvester, J.J.: Thoughts on inverse orthogonal matrices, simultaneous sign-successions, and tessellated pavements in two or more colours, with applications to Newton’s rule, ornamental tile-work, and the theory of numbers. Phil. Mag. 34, 461 (1867) 13. Petrescu, M.: Existence of continuous families of complex Hadamard matrices of certain prime dimensions and related results. UCLA thesis, Los Angeles 1997 14. D˘i¸ta, P.: Some results on the parametrization of complex Hadamard matrices. J. Phys. A37, 5355 (2004) ˙ 15. Zeilinger, A., Zukowski, M., Horne, M.A., Bernstein, H.J., Greenberger, D.M.: Einstein-PodolskyRosen correlations in higher dimensions. In: J. Anandan, J. L. Safko, eds. Fundamental Aspects of Quantum Theory, Singapore: World Scientific, 1993 16. Törmä, P., Stenholm, S., Jex, I.: Hamiltonian theory of symmetric optical network transforms. Phys. Rev. A52, 4853 (1995) 17. Werner, R.F.: All Teleportation and Dense Coding Schemes. J. Phys. A34, 7081 (2001) 18. Birkhoff, G.: Tres observaciones sobre el algebra lineal. Univ. Nac. Tucumán Rev. A5, 147 (1946) 19. Brualdi, R.A., Gibson, P.M.: Convex Polyhedra of Doubly Stochastic Matrices. I. Applications of the Permanent Function. J. Comb. Theory A22, 194 (1977) 20. Słomczy´nski, W.: Subadditivity of Entropy for Stochastic Matrices. Open Sys. Inf. Dyn. 9, 201 (2002) 21. Beck, M., Pixton, D.: The volume of the 10th Birkhoff polytope. arXiv: math.CO/0305322 22. Au-Yeung, Y.-H., Poon, Y.-T.: 3 × 3 Orthostochastic Matrices and the Convexity of Generalized Numerical Ranges. Lin. Alg. Appl. 27, 69 (1979) 23. Nakazato, H.: Sets of 3 × 3 Orthostochastic Matrices. Nikonkai Math. J. 7, 83 (1996) 24. Gadiyar, H.G., Maini, K.M.S., Padma, R., Sharatchandra, H.S.: Entropy and Hadamard matrices. J. Phys. A36, L109 (2003) 25. Jones, K.R.W.: Entropy of random quantum states. J. Phys. A23, L1247 (1990) 26. Au-Yeung, Y.-H., Cheng, C.-M.: Permutation Matrices Whose Convex Combinations Are Orthostochastic. Lin. Alg. Appl. 150, 243 (1991) 27. Hadamard, M.J.: Résolutions d’une question relative aux déterminants. Bull. Sci. Math. 17, 240 (1893) 28. Haagerup, U.: Orthogonal maximal Abelian ∗-subalgebras of the n × n matrices and cyclic n-roots. In: Operator Algebras and Quantum Field Theory, Rome (1996), Cambridge, MA: Internat. Press, 1997 29. Tadej, W.: Unpublished 30. Mehta, M.L.: Random Matrices. II ed., New York: Academic, 1991 ˙ 31. Po´zniak, M., Zyczkowski, K., Ku´s, M.: Composed ensembles of random unitary matrices. J. Phys. A31, 1059 (1998) Communicated by M.B. Ruskai

Commun. Math. Phys. 259, 325–362 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1395-5

Communications in

Mathematical Physics

A New Quantum Deformation of ‘ax + b’ Group W. Pusz, S. L. Woronowicz Department of Mathematical Methods in Physics, Faculty of Physics, University of Warsaw, Ho˙za 74, 00-682 Warszawa, Poland. E-mail: [email protected]; [email protected] Received: 19 April 2004 / Accepted: 5 April 2005 Published online: 15 July 2005 – © Springer-Verlag 2005

Abstract: The paper is devoted to locally compact quantum groups that are related to the classical ‘ax + b’ group. We discuss in detail the quantization of the deformation parameter assumed with no justification in the previous paper. Next we construct (on the C∗ -level) a larger family of quantum deformations of the ‘ax + b’ group corresponding to the deformation parameter q 2 running over an interval in the unit circle. To this end, beside the reflection operator β known from the previous paper we use a new unitary generator w. It commutes with a, b and βwβ = s sgn b w, where s ∈ S 1 is a new deformation parameter related to q 2 . At the end we discuss the groups at roots of unity.

0. Introduction In the last years a lot of effort was devoted to constructing explicit examples of (noncompact) locally compact quantum groups. The present paper inscribes into this line of research. It is devoted to quantum deformations of the group ‘ax + b’ of affine transformations of the real line. Such quantum ‘ax + b’ groups were presented first in [19]. We shall use the adjective old to distinguish them from the new ‘ax + b’ groups constructed in Sect. 4 of present paper. We go back to the subject for the following reasons. At first the quantizations of the deformation parameter introduced in the previous paper were not discussed in detail. π Now we give strong arguments that the values of = 2k+3 are the only ones allowed within the setting considered in [19]. Secondly one of the important formulae in [19] was not proven. We fill this gap. Third, the old quantum ‘ax + b’ admits a large set of automorphisms. In [19] we identified only four of them. Now we show that the group of automorphisms is as large as S 1 . These automorphisms play an important role in constructing the new quantum ‘ax + b’ groups. The new groups do exist for running over an interval in R (no more quantization of deformation parameter).

326

W. Pusz, S.L. Woronowicz

The new quantum groups constructed in the present paper seem to be very important. They will serve as building blocks in the construction of the quantum SL(2, R) group. This is our next target. Let G be the ‘ax + b’ group. On the classical level G consists of all transformations of the form R x −→ ax + b ∈ R,

(0.1)

where a and b are real parameters labeling the elements of the group. We shall assume that a > 0. Assigning to each element of the group the values of the parameters we define two unbounded continuous real functions on G. To denote the functions we shall use the same letters: a, b ∈ C(G). Then the C∗ -algebra C∞ (G) of all continuous functions vanishing at infinity on G is generated by log a and b: uniformly closed linear envelope C∞ (G) = f (log a)g(b) : f, g ∈ C∞ (R) . Functions a and b may be considered as elements affiliated with C∞ (G). Composing two transformations of the form (0.1) with parameters (a1 , b1 ) and (a2 , b2 ) one obtains the transformation with parameters (a1 a2 , a1 b2 + b1 ). This result leads to the following formulae describing the comultiplication: (a) = a ⊗ a, (b) = a ⊗ b + b ⊗ I.

(0.2)

At the moment the elements of G are considered as affine transformations of R. However one may realize them as unitary operators acting on a Hilbert space. To this end, to any transformation of the form (0.1) we assign the unitary operator V(a,b) ∈ B(L2 (R)) introduced by the formula: V(a,b) f (x) = a −1/2 f a −1 (x − b) for any f ∈ L2 (R). Then G may be identified with the set of unitary operators: G = V(a,b) : a, b ∈ R; a > 0 .

(0.3)

This identification preserves the group structure and the topology. More precisely V(1,0) = I and V(a1 ,b1 ) V(a2 ,b2 ) = V(a1 a2 ,a1 b2 +b1 )

(0.4)

for any a1 , a2 ∈ ]0, ∞[ and b1 , b2 ∈ R. Moreover a sequence V(an ,bn ) converges to V(a∞ ,b∞ ) in strong topology if and only if an → a∞ > 0 and bn → b∞ . In particular (0.3) with the strong operator topology is a locally compact space. One can also show that (0.3) is a closed subset of B(L2 (R)) (in strong operator topology). For any Hilbert space H we denote by K(H ) the C∗ -algebra of all compact operators acting on H . According to the general theory [13] the strongly continuous family of unitaries (0.3) is described by a single unitary V ∈ M(K(L2 (R)) ⊗ C∞ (G)). The C∗ -algebra C∞ (G) is generated (in the sense of [13]) by V . Formula (0.4) means that (id ⊗ )V = V12 V13 .

A New Quantum Deformation of ‘ax + b’ Group

327

This way we arrive at the notion of a (quantum) group of unitary operators. Let H be a Hilbert space. We shall consider pairs (A, V ), where A is a C∗ -algebra and V is a unitary element of the multiplier algebra M(K(H ) ⊗ A). If A is generated by V ∈ M(K(H ) ⊗ A) then (A, V ) is called a quantum family of unitary operators. We say that the family is closed with respect to operator multiplication if there exists a morphism ∈ Mor(A, A ⊗ A) such that (id ⊗ )V = V12 V13 .

(0.5)

Then is unique (because A is generated by V ). Finally (A, V ) is said to be a quantum group of unitary operators, if it is closed with respect to the operator multiplication and if (A, ) is a locally quantum group in the sense of Kustermans and Vaes [3]. It should be possible to formulate the last condition directly in terms of (A, V ). However this is not the subject of the present paper. Let us go back to the ‘ax + b’ group. In the quantum setting functions a and b are replaced by selfadjoint elements a = a ∗ > 0 and b = b∗ that no longer commute. Instead they satisfy the relation ab = q 2 ba,

(0.6)

where the deformation parameter q 2 is a number of modulus 1. Unfortunately in our case elements a and b are represented by unbounded operators and the products ab and ba may not be well defined because of the domain problem. For this reason we replace (0.6) by the so-called Zakrzewski relation. It says that for any τ ∈ R: a iτ ba −iτ = eτ b. In this formula is a real constant such that q 2 = e−i . For technical reasons we shall assume that 0 < < π2 . The reader should notice that for τ = −i the above relation reduces to (0.6). The second problem is related to the comultiplication. We would like to keep formulae (0.2). However in general a ⊗ b + b ⊗ I is not selfadjoint and in the best case we may expect that (b) is a selfadjoint extension of a ⊗ b + b ⊗ I : a ⊗ b + b ⊗ I ⊂ (b). To choose the extension in a well defined way we have to use additional operators independent of a and b. For old quantum ‘ax + b’ groups we use a selfadjoint unitary β commuting with a and anticommuting with b. For new groups the situation is even more complicated. It means that the algebra A is no longer generated by log a and b. It is not obvious how to present the quantum ‘ax + b’ group as a quantum group of unitary operators (A, V ). The crucial point is the formula V = V (a, b, . . . ) expressing V in terms of a, b and perhaps some other elements related to A. Equation (0.5) takes the form V (a ⊗ a, [a ⊗ b + b ⊗ I ], . . . ) = V (a ⊗ I, b ⊗ I, . . . )V (I ⊗ a, I ⊗ b, . . . ), where [a ⊗ b + b ⊗ I ] is a suitable selfadjoint extension of a ⊗ b + b ⊗ I . To find solutions of this equation we spent a lot of time making use of our experience in the area of quantum exponential functions and quantum groups (cf. [15, 14, 11, 18, 17, 19, 7, 9]). As a result we got formulae (3.8) and (4.8) that are starting points in our presentation.

328

W. Pusz, S.L. Woronowicz

Let us briefly discuss the content of the paper. Sections 1 and 2 are devoted to mathematical tools used in the paper. In the first one we recall the Zakrzewski commutation relation and related quantum exponential function (with a slightly modified notation). Most of the results presented in that section come from [17]; the essentially new result is contained in Proposition 1.4. The second section deals with the notion of a C∗ -algebra generated by affiliated elements. We prove a number of results used in the main part of the paper. Section 3 is devoted to the quantum ‘ax + b’ groups introduced in [19]. These groups π exist only for special values of deformation parameter q 2 = e−i with = 2k+3 , where k = 0, 1, 2, . . . . This fact was not really shown in [19]. The special values of the deformation parameter were chosen to proceed with some computations. It was not clear that (at the expense of some complications) one is not able to construct the quantum ‘ax + b’ group for a larger set of values of the deformation parameter. Now, presenting the ‘ax + b’ group as a quantum group of unitary operators we obtain the quantization of the deformation parameter as a precise mathematical statement (cf. Theorem 3.3). More precisely for q 2 = e−i we shall construct a C∗ -algebra A with distinguished selfadjoint elements a, b and iβb affiliated with it (the so called reflection operator β is a unitary involution which is not affiliated with A). These elements satisfy (in a well defined sense) the relations ab = q 2 ba, aβ = βa and bβ = −βb. The algebra A is generated by a unitary element V ∈ M(K(L2 (R))⊗A). The pair (A, V ) is defined for all 0 < < π/2. However, the existence of satisfying the condition (0.5) selects a much smaller subset of admissible ’s. We shall prove that exists if and only if is of the form indicated above. Next we derive formulae showing how acts on generators of A. In particular we prove an elegant formula describing the action of the comultiplication on the reflection operator. This formula appeared (with no proof) in the previous paper (cf. formula (4.16) of [19]). At the end of Sect. 3 we find an interesting action of S 1 on the algebra A: for any s ∈ S 1 we have an automorphism φs of A and φss = φs oφs . If π exists (i.e. if = 2k+3 , where k = 0, 1, 2, . . . ) then oφs = (φs ⊗ φs )o. New quantum groups related to the classical ‘ax +b’ group are constructed in Sect. 4. Using one of the automorphism φs described at the end of Sect. 3 we consider the corresponding crossed product. In other words we extend the algebra A by adding a new unitary generator w implementing φs . This enlargement of the algebra opens new possibilities. In particular we obtain new admissible values of the deformation parameter. Now = πp , where p is a number larger than 2 such that −eiπp = s. The latter relation distinguishes a discrete set of possible p. However the quantization of disappears because changing s we may cover the whole interval ∈]0, π2 [. For s = 1 we obtain p = 2k + 3, where k = 0, 1, 2, . . . . In this case the new quantum ‘ax + b’ group reduces to a semidirect product of the old one by S 1 . For s = 1 we get essentially new examples of locally compact quantum groups. In the next section (Sect. 5) we investigate the multiplicative unitaries for the quantum groups constructed in Sect. 4. We prove their modularity and find the unitary antipode and scaling group. In particular the objects constructed in Sect. 4 satisfy all the axioms of Kustermans and Vaes [3] and the ones of Masuda, Nakagami and Woronowicz [4]. At the end of the section we briefly discuss the duals of the new quantum ‘ax + b’ groups. The last section is devoted to new quantum ‘ax + b’ groups with q 2 being a root of unity. In this case we may pass to groups with smaller size. To this end we have to assume that the unitary generator w satisfies the additional relation of the form w N = I . The word size used in the previous paragraph has a precise meaning. It is based on the Stone - von Neumann theorem. Let (A, ) be one of the quantum ‘ax + b’ group

A New Quantum Deformation of ‘ax + b’ Group

329

considered in this paper and π be a representation of A acting on a Hilbert space Hπ such that ker(b) = {0}. Then operators log(π(a)) and log(|π(b)|) satisfy the same commutation relations as position xQM and momentum pQM in quantum mechanics. By the Stone - von Neumann theorem the pair (log(π(a)), log(|π(b)|) is unitarily equivalent to a direct sum of k copies of (xQM , pQM ). The number k will be called the multiplicity of π . We say that the size of the group (A, ) is equal to k if there exists a faithful representation of A with multiplicity k and if k is the smallest number with this property. In the classical situation the algebra of functions A is generated by log a and b. This is not the case when we consider quantum ‘ax + b’ groups: we use additional generators such as β and w. It means that together with the quantum deformation we pass to a sort of extension of the group. The size tells us how large the extension is. The old quantum ‘ax + b’ groups are of size 2. This is the minimal value. One can show that the old quantum ‘ax + b’ groups are the only ones with size 2. The new groups introduced in Sect. 4 are of infinite size. On the other hand the groups at roots of unity considered in the last section are of size 2N, where N is the number appearing in the relation w N = I . Our approach extensively uses the C∗ -algebra language and the theory of selfadjoint operators on Hilbert space. For the basic facts concerning the general C∗ -algebra theory we refer to [1, 6]. The notation used in the paper follows the one explained in [13, 12]. In particular M(A) is the multiplier algebra of a C∗ -algebra A. The affiliation relation in the sense of C∗ -algebra theory is denoted by “η” and Aη is the set of all affiliated elements (“unbounded multipliers”). It is known that M(A) ⊂ Aη . A morphism from A to a C∗ -algebra B is by definition any ∗ -homomorphism π : A −→ M(B) such that π(A)B is dense in B. Let us recall that any such π has the unique extension to a unital ∗-homomorphism π : M(A) −→ M(B) and to a ∗ -preserving map π : Aη −→ B η respectively (both denoted by the same symbol). The set of all morphisms from A to B is denoted by Mor(A, B). With some abuse of notation, the symbol Rep(A) will stay for the “set” of all nondegenerate representations of a C∗ -algebra A. For any π ∈ Rep(A), we denote by Hπ the carrier Hilbert space of π. Then π ∈ Mor(A, K(Hπ )). In the paper we mostly deal with concrete C∗ -algebras. By definition they are norm closed ∗ -subalgebras of the algebra B(H ) of all bounded operators acting on some (separable) Hilbert space H . As a rule, C∗ -algebras we deal with are separable. Non separable ones will appear only as multiplier algebras. In particular B(H ) = M(K(H )). We shall denote by C ∗ (H ) the set of all non-degenerate separable C∗ -algebras of operators acting on a Hilbert space H . We recall that an algebra A ⊂ B(H ) is non-degenerate if AH is dense in H . We shall use functional calculus for strongly commuting selfadjoint operators. If T and β are selfadjoint operators acting on a Hilbert space H and T and β strongly commute then T =

⊕

β=

r dE(r, ),

⊕

dE(r, ),

where dE(r, ) is the common spectral measure supported by the joint spectrum ⊂ R2 of (T , β). Moreover for any measurable complex valued function on we have f (T , β) =

⊕

f (r, )dE(r, ).

330

W. Pusz, S.L. Woronowicz

In this context the characteristic function χ will appear quite often. By definition for any sentence R, we have

0 if R is false, χ (R) = 1 if R is true. Typically R is a formula involving an (in)equality sign. For example χ (r ≤ 0) is equal to 0 for positive r and 1 for r = 0 or negative. Consequently χ (T ≤ 0) is the spectral projection assigned to the negative part of the spectrum of a selfadjoint operator T . The corresponding spectral subspace will be denoted by H (T ≤ 0): H (T ≤ 0) = χ (T ≤ 0)H . Similarly χ(T = λ) is the orthogonal projection on the eigenspace H (T = λ) of T corresponding to the eigenvalue λ ∈ R. We refer to [17] for a more detailed explanation of this notation. Let Np(r) = rχ (r < 0). Then for any selfadjoint T , Np(T ) = T χ (T < 0)

(0.7)

is a selfadjoint operator acting on H . This is the negative part of the operator T . Another function frequently used in the paper is the one that returns the sign of the argument: sgn r = χ (r > 0) − χ (r < 0). Then sgn T is the partial isometry that appears in the polar decomposition: T =(sgn T ) |T |. 1. A Special Function and Selfadjoint Extensions In this section we recall (in a slightly modified version with a certain loss of generality) the basic definitions and statements of [17]. The only essentially new result is contained in formula (1.11). Later on it will help us to prove the formula announced in [19, formula (4.16)]. We start with a modified version of the quantum exponential function introduced in [17]. Let ∈ R and 0 < < π2 . Instead of function F defined on the set R− × {−1, 1} ∪ R+ × {0} we shall use function G defined on R × {−1, 1}. It is related to the function F by the formula G (r, ) = F (r, χ (r < 0))

(1.1)

for any r ∈ R and = ±1. Taking into account the definition [17, formula (1.19)] we obtain  Vθ (log r) for r > 0   G (r, ) = (1.2) π   1 + i|r| Vθ log |r| − π i for r < 0, where θ = 2π and Vθ is the meromorphic function on C such that

∞ 1 dt log(1 + t −θ ) Vθ (x) = exp 2πi 0 t + e−x for all x ∈ C such that |x| < π . In addition G (0, ±1) = 1. Then G (r, ) is a continuous function on R × {−1, 1} and G (r, ) = G (r, ) ⇐⇒ χ (r < 0) = χ (r < 0) . (1.3)

A New Quantum Deformation of ‘ax + b’ Group

331

The asymptotic behavior of G (r, ) for large r is described by the formula

(log |r|)2 G (r, ) ≈ C exp , 2i

(1.4)

where C is a phase factor depending only on sgn r and ρ and ‘≈’means that the difference goes to 0 when r → ±∞ (see Statements 9 and 10 of [17, Theorem 1.1]). It is known that the quantum exponential function assumes values of modulus 1. Therefore if T and β are operators acting on a Hilbert space H , T is selfadjoint and β is unitary selfadjoint commuting with T , then G (T , β) is unitary. Now we recall the concept of selfadjoint extension of a symmetric operator defined by a reflection operator. Let Q be a symmetric operator acting on a Hilbert space H and ρ be a unitary selfadjoint operator (ρ ∗ = ρ, ρ 2 = I ) anticommuting with Q. Then we denote by [Q]ρ the restriction of Q∗ to the domain {x ∈ D(Q∗ ) : (ρ − I )x ∈ D(Q)}. It is known (cf. [17, Prop. 5.1]) that [Q]ρ is a selfadjoint extension of Q. We shall use the following simple Proposition 1.1. Let Q, X and ρ be operators acting on a Hilbert space H such that Q is symmetric, X is selfadjoint, ρ is unitary selfadjoint, ρQ = −Qρ and ρX = −Xρ. Assume that the restrictions of Q and X to H (ρ = −1) coincide: Q|H (ρ=−1) = X|H (ρ=−1) .

(1.5)

Then X = [Q]ρ . Proof. Let H1 = H (ρ = −1) and H2 = H (ρ = 1). Then H = H1 ⊕ H2 and (all) bounded and (some) unbounded operators may be represented by 2 × 2 matrices. In particular −I , 0 ρ= . 0 ,I Remembering that Q and X anticommute with ρ we obtain: 0 , X− 0 , Q− and X = , Q= Q+ , 0 X+ , 0 where Q+ and X+ are operators acting from H1 to H2 and Q− and X− are operators ∗ (X is acting from H2 to H1 . Clearly Q+ ⊂ Q∗− (Q is symmetric) and X− = X+ selfadjoint). Assumption (1.5) means that Q+ = X+ . Therefore 0 , Q∗+ . X= Q+ , 0 On the other hand Q∗ =

0 , Q∗+ Q∗− , 0

.

It shows that X ⊂ Q∗ and D(X) = {x ∈ D(Q∗ ) : (ρ − I )x ∈ D(Q)}.

332

W. Pusz, S.L. Woronowicz

Let ∈ R. We shall use the Zakrzewski relation o (cf. [17]). Let R and S be selfadjoint operators acting on a Hilbert space H with the polar decompositions R = sgn R |R| and S = sgn S |S|. For simplicity we shall assume that one of the operators R and S has trivial kernel. If ker S = {0}, then sgn S is unitary selfadjoint and   sgn S commutes with R   R o S ⇐⇒  and |S|−iλ R |S|iλ = eλ R  . for any λ ∈ R. If ker R = {0}, then sgn R is unitary selfadjoint and   sgn R commutes with S   R o S ⇐⇒  and |R|iλ S |R|−iλ = eλ S  . for any λ ∈ R. If ker R = ker S = {0}, then the two above conditions are equivalent. One can easily show that antiunitary operators reverse the direction of the Zakrzewski relation: R o S and J is an (1.6) ⇒ J SJ o J RJ . antiunitary involution

Let R and S be selfadjoint operators with trivial kernels and R o S. It is known [17, Example 3.1] that in this case, the operators ei /2 S −1 R and ei /2 SR −1 are selfadjoint and sgn ei /2 S −1 R = sgn ei /2 SR −1 = (sgn R)(sgn S). We shall use the following result (cf. [17, Theorem 5.2]): Proposition 1.2. Let R, S and τ be operators acting on a Hilbert space H . Assume that

R and S are selfadjoint with trivial kernels, R o S, and that τ is unitary, selfadjoint anticommuting with R and S. We set T = ei /2 S −1 R. Then T is a selfadjoint operator with trivial kernel, T commutes with τ , R + S is a closed symmetric operator and the selfadjoint extension [R + S]τ = G (T , τ )∗ SG (T , τ ) = G (T −1 , τ )RG (T −1 , τ )∗ .

(1.7)

Remark 1.3. If τ is another unitary, selfadjoint operator anticommuting with R and S and if in addition there exists a unitary selfadjoint operator ρ that commutes with τ, τ and S and anticommutes with R then (1.8) [R + S]τ = [R + S]τ ⇒ τ = τ . Indeed if [R + S]τ = [R + S]τ , then (cf. (1.7)) G (T , τ )∗ SG (T , τ ) = G (T , τ )∗ SG (T , τ ).

A New Quantum Deformation of ‘ax + b’ Group

333

It shows that the unitary operator U = G (T , τ )G (T , τ )∗ commutes with S and hence with |S|. Clearly G (T , τ )∗ = G (T , τ )∗ U.

(1.9)

Moreover T o S due to the Zakrzewski relation R o S and ρ anticommutes with T . As we know τ and τ anticommute with S, hence they commute with |S|. We shall use Proposition 2.4 (see the next section). Setting R1 = R2 = T , ρ1 = τ , ρ2 = τ , U1 = I , U2 = U and replacing S by |S| we have all the assumptions of that proposition satisfied. Therefore (1.9) implies the equality τ Np(T ) = τ Np(T ). It means that τ and τ coincide on H (T < 0). Then τ and τ coincide on ρH (T < 0) for any operator ρ commuting with τ and τ . If ρ commutes with S and anticommutes with R then it anticommutes with T and ρH (T < 0) = H (T > 0). In this case τ and τ coincide on H (T < 0) ⊕ H (T > 0) = H (this is because ker T is trivial). Hence τ = τ . We shall prove a result of the same flavor as (1.7): Proposition 1.4. Let R and S be strictly positive selfadjoint operators acting on a Hil

bert space H such that R o S and let τ , ρ, σ and ξ be unitary selfadjoint operators commuting with R and S. Assume that τ commutes with ξ and anticommutes with ρ and σ and ξ χ (τ = −1) = αρσ χ (τ = −1),

(1.10)

iπ 2 2

where α = i e . We set: T = ei /2 S −1 R. Then T is a positive selfadjoint operator π π with trivial kernel, σ S + ρR is a closed symmetric operator anticommuting with τ and the selfadjoint extension π π π = G (τ T , ξ )∗ σ S G (τ T , ξ ) σ S + ρR −τ (1.11) π −1 −1 ∗ = G (τ T , ξ )ρR G (τ T , ξ ) . Proof. At first we shall prove the first equality of (1.11). Inserting S −1 instead of R and R instead of S in [17, Example 3.1] we see that T is a positive selfadjoint operator with trivial kernel and T ik = e− 2 k S −ik R ik = e i 2

i 2 2 k

R ik S −ik

(1.12)

for any k ∈ R. Denote by X the right-hand side of the first equality in (1.11). We know that G (τ T , ξ ) π is unitary (in what follows we write G (τ T , ξ )−1 instead of G (τ T , ξ )∗ ). Operator S π commutes with σ and τ whereas σ and τ anticommute. Therefore σ S is a selfadjoint operator anticommuting with τ . So is X. π π Let Q = σ S + ρR . Clearly Q is a symmetric operator anticommuting with τ . By virtue of Proposition 1.1 it is sufficient to show that Q|H (τ =1) = X|H (τ =1) . π

(1.13)

Restricting G (τ T , ξ )∗ σ S G (τ T , ξ ) to H (τ = 1) we may replace the second τ by 1 and the first τ by −1 (this is because σ maps H (τ = 1) onto H (τ = −1)): X|H (τ =1) = G (−T , ξ )−1 σ S G (T , ξ )|H (τ =1) π

334

W. Pusz, S.L. Woronowicz

and using (1.2) we obtain −1 π −1 π X|H (τ =1) = 1 + iξ T Vθ log T − πi σ S Vθ (log T )|H (τ =1) . (1.14) π

Now we shall move σ S to the right end of (1.14). It is known (cf. [17, relation (1.30)]) that the function Vθ (x) has no poles and no zeroes in strip = {x ∈ C : 0 ≤ x ≤ π }. Therefore functions Vθ (x) and Vθ (x)−1 are continuous on and holomorphic inside . Moreover (cf. [17, the asymptotic formula (1.37)]), Vθ (x) −→ 1 when x −→ −∞ whereas x stays bounded and using formula (1.32) of [17] one can easily show that for 2 2 any λ > 0, functions e−λx Vθ (x) and e−λx Vθ (x)−1 are bounded on . Furthermore T is a strictly positive selfadjoint operator and T Statement (3) of Theorem 3.1 of [17] we obtain π

o

S. Therefore T

π

π

o

S and using

π

S Vθ (log T ) = Vθ (log T + iπ ) S . Inserting this formula into (1.14) and using in the second step formula (1.28) of [17] we get: π −1 π X|H (τ =1) = 1 + iξ T Vθ (log T − πi)−1 Vθ (log T + π i)σ S |H (τ =1) π −1 2π π = 1 + iξ T 1 + T σ S |H (τ =1) π π = 1 − iξ T S σ |H (τ =1) . On the other hand multiplying both sides of (1.10) by σ from the right we obtain ξ σ χ(τ = 1) = αρχ (τ = 1). Therefore ρ|H (τ =1) = αξ σ |H (τ =1) and π π Q|H (τ =1) = S + αξ R σ |H (τ =1) . To end this part of the proof it is sufficient to show that π π π π S + αξ R = 1 − iξ T S .

(1.15)

We shall use (1.12). It shows that for any x, y ∈ H and any k ∈ R we have i 2 y S ik x − i e 2 k y ξ R ik x = I + iξ T −ik y S ik x . π π π Let x ∈ D S ∩ D R . If y ∈ D(T ) then both sides of the above formula have continuous holomorphic continuation to the strip − π ≤ k ≤ 0. Inserting k = −i π we obtain π π π π y S x + α y ξ R x = I + iξ T y S x . π π π This formula holds for any y in the domain of I +iξ T . Therefore S x ∈ D I − iξ T π π π π and S x + αξ R x = I − iξ T S x. This way we showed that π π π π S + αξ R ⊂ I − iξ T S .

(1.16)

A New Quantum Deformation of ‘ax + b’ Group

335

π π To prove the converse inclusion we use again (1.12). Let x ∈ D S and S x ∈ π D T . Then for any y ∈ H and k ∈ R: e− 2 k

i 2

π π R −ik y S −ik x = y T ik S x .

π

If y ∈ D(R ) then both sides of the above formula have continuous holomorphic continuation to the strip − π ≤ k ≤ 0. Inserting k = −i π we obtain π π π iα R y x = y T S x . π

π

This formula holds for any y ∈ D(R ). Therefore x ∈ D(Rπ). This wayweπ showed the π π π π π inclusion D(T S ) ⊂ D(R ). Consequently D I −iξ T S ⊂D S + αξ R . Combining this result with (1.16) we get (1.15) and (1.13). This way the first equality of (1.11) is shown. The second equality may be shown in the same manner. However it is simpler to use the following trick based on (1.6). Let J be an antiunitary involutive operator acting on H and Rn = J SJ, ρn = J σ J,

Sn = J RJ, σn = JρJ,

τn = J τ J, ξn = J ξ J.

The subscript ‘n’ stands for ‘new’. One can easily show that the new operators satisfy all the assumptions of our theorem. In particular ξn χ (τn = −1) = J ξ χ (τ = −1)J = J αρσ χ(τ = −1)J = ασn ρn χ (τn = −1) = (ασn ρn χ (τn = −1))∗ = αρn σn χ(τn = −1). In the present case Tn = ei /2 Sn−1 Rn = J e−i /2 R −1 SJ = J T −1 J and the first equality of (1.11) takes the form:

π

π

σn Sn + ρn Rn

−τn

π

= G (J τ T −1 J, J ξ J )∗ σn Sn G (J τ T −1 J, J ξ J ) π

= J G (τ T −1 , ξ )ρR G (τ T −1 , ξ )∗ J. A moment of reflection shows that the left-hand side of this formula equals π π J and the second equality of (1.11) follows immediately. J σ S + ρR −τ

π

π

To end the proof we have to show that the operator Q = σ S + ρR is closed. π π Operator ξ T is selfadjoint. Therefore operator I − iξ T is invertible with the inverse π −1 ∈ B(H ). Using this fact one can easily show that the composition I − iξ T π π I − iξ T S σ is a closed operator. Restricting this operator to H (τ = 1) we obtain Q|H (τ =1) . Hence Q|H (τ =1) is closed. Remembering that Q anticommute with τ we conclude that Q is closed.

336

W. Pusz, S.L. Woronowicz

Remark 1.5. According to (1.10) operator αρσ χ (τ =−1) is selfadjoint. Using this fact and remembering that ρ and σ anticommute with τ one can show that ρσ = α 2τ σρ. Conversely let τ, ρ, σ be unitary selfadjoint operators commuting with R, S and let τ anticommute with ρ and σ . If the above relation is satisfied, then using (1.10) to define ξ on H (τ = −1) and extending it in an arbitrary way to a unitary selfadjoint operator defined on the whole space we obtain the quadruple (ρ, σ, τ, ξ ) of operators satisfying the assumptions of Proposition 1.4. We end this section with the reformulation of Theorem 6.1 of [17]. Theorem 1.6. Let (R, S) be a pair of selfadjoint operators acting on a Hilbert space H

such that ker R = ker S = {0} and R o S and let ρ, σ be unitary selfadjoint operators on H . Assume that ρ commutes with R, ρ anticommutes with S, σ commutes with S and σ anticommutes with R. We set: T = ei /2 S −1 R, τ = αρσ χ (S < 0) + ασρχ (S > 0), iπ 2

where α = i e 2 . Then

1. T is selfadjoint, sgn T = (sgn R) (sgn S), T o R and T o S. 2. τ is unitary selfadjoint, τ commutes with T and τ anticommutes with R and S. 3. G satisfies the following exponential function equality: G (R, ρ)G (S, σ ) = G (T , τ )∗ G (S, σ )G (T , τ ) = G ([R + S]τ , σ),

(1.17)

where [R + S]τ is the selfadjoint extension of R + S corresponding to the reflection operator τ and σ = G (T , τ )∗ σ G (T , τ ). Proof. By direct computation one can easily show that τ 2 = I , τ ∗ = τ and τ χ (T < 0) = αρχ (R < 0)σ χ (S < 0) + ασ χ (S < 0)ρχ (R < 0). Now, our theorem follows immediately from [17, Theorem 6.1].

Remark 1.7. In Theorem 1.6, operator τ may be replaced by τ = αρσ χ (R > 0) + ασρχ (R < 0). Operator σ is not affected by this change. Indeed, using the formula sgn T = sgn R sgn S, one can verify that τ χ (T < 0) = τ χ(T < 0). It shows that G (T , τ ) = G (T , τ ).

A New Quantum Deformation of ‘ax + b’ Group

337

2. The Special Functions and Affiliation Relation In this section we shall use the concept of a C∗ -algebra generated by a set of affiliated elements [13, Def. 4.1, p. 501]. Let C, A be C∗ -algebras and V be an element affiliated with C ⊗ A. We say that A is generated by an element V η (C ⊗ A) if and only if for any π ∈ Rep(A) and any B ∈ C ∗ (Hπ ) we have: (id⊗π)V η (C⊗B) ⇒ π ∈ Mor(A, B) . (2.1) In general the above condition is not easy to verify. We shall use the following criterion (cf. [13, Example 10, p. 507]): Proposition 2.1. Let C, A be C∗ -algebras and V be a unitary element of M(C ⊗ A). Assume that there exists a faithful representation φ of C such that: 1. For any φ-normal linear functional ω on C we have (ω ⊗ id)V ∈ A. 2. The smallest ∗ -subalgebra of A containing {(ω ⊗ id)V : ω is φ-normal} is dense in A. Then A is generated by V ∈ M(C ⊗ A). We recall that a linear functional ω on C is said to be φ-normal if there exists a trace-class operator ρ acting on Hφ such that ω(c) = Tr(ρφ(c)) for all c ∈ C. Let be the locally compact space obtained from R × {−1, 1} by gluing points (r, −1) and (r, 1) for all r ≥ 0. Then:

f (r, −1) = f (r, 1) C∞ () = f ∈ C∞ R × {−1, 1} : . for all r ≥ 0 If R, ρ are operators acting on a Hilbert space H , R is selfadjoint, ρ is unitary selfadjoint and ρ commutes with R, then the mapping C∞ () f −→ π(f ) = f (R, ρ) ∈ B(H )

(2.2)

is a representation of C∞ () acting on H . Operators R and ρ Np(R) are determined by π. Indeed R = π(f1 ) and ρ Np(R) = π(f2 ), where f1 , f2 are elements of C∞ ()η = C() introduced by the formulae f1 (r, ) = r,

f2 (r, ) = Np(r)

(2.3)

for any r ∈ R and = ±1. Using [13, Example 2, p. 497] we see that f1 , f2 generate C∞ (). Therefore for any π ∈ Rep (C∞ ()) and any B ∈ C ∗ (Hπ ) we have: π(f1 ), π(f2 ) η B ⇒ π ∈ Mor(C∞ (), B) ⇒ π(f ) η B for any f ∈ C() . In particular for π introduced by (2.2) we obtain the following result: R, ρ Np(R) η B ⇒ f (R, ρ) η B . f ∈ C()

(2.4)

Our special function G is continuous and satisfies the relation G (r, −1) = G (r, 1) for all r ≥ 0. In other words G ∈ C(). For any r ∈ R, = ±1 and t > 0 we set: F (t; r, ) = G (r, )G (tr, ).

(2.5)

Let R+ = {t ∈ R : t > 0}. Then F is a continuous function on R+ × with values of modulus 1 and we may treat F as a unitary element of M (C∞ (R+ ) ⊗ C∞ ()). We shall prove the following

338

W. Pusz, S.L. Woronowicz

Proposition 2.2. The C∗ -algebra C∞ () is generated by F ∈ M (C∞ (R+ ) ⊗ C∞ ()). Proof. We shall use Proposition 2.1 with C = C∞ (R+ ), A = C∞ () and V = F . Let φ be the natural representation of C∞ (R+ ) acting on L2 (R+ ). For any g ∈ C∞ (R+ ), φ(g) is the multiplication by g. Then φ is faithful and a linear functional ω on C∞ (R+ ) is φ-normal if and only if it is of the form ω(g) =

R+

g(t)ϕ(t) dt,

where ϕ ∈ L1 (R+ ). Applying ω ⊗ id to F ∈ M (C∞ (R+ ) ⊗ C∞ ()) we obtain an element of M (C∞ ()), i.e. a bounded continuous function on . Clearly for any r ∈ R and = ±1 we have (ω ⊗ id)F (r, ) =

F (t; r, )ϕ(t) dt G (tr, )ϕ(t) dt. = G (r, ) R+

(2.6)

R+

Taking into account the asymptotic behavior (1.4) and using the Riemann–Lebesgue lemma one can verify that the integral on the right-hand side tends to 0 when r → ±∞. In other words, (ω ⊗ id)F ∈ C∞ (). Using Statement 7 of Theorem 1.1 of [17] one can easily show that lim

t→0+

1 r G (tr, ) − 1 = t 2i sin (/2)

(2.7)

for all r ∈ R and = ±1. Let r, r ∈ R and , = ±1. Assume for the moment that (ω ⊗ id)F (r, ) = (ω ⊗ id)F (r , ) for all φ-normal functionals ω. Then G (r, )G (tr, ) = G (r , ) G (tr , ) for all t > 0. Going to the limit when t → +0 we get G (r, ) = G (r , ). Comparing this formula with the previous one we see that G (tr, ) = G (tr , ) for all t > 0. Formula (2.7) shows now that r = r and by (1.3) χ (r < 0) = χ (r < 0). This way we have shown that the functions (2.6) separate points of . Now, using the Stone - Weierstrass theorem (applied to the one point compactification of ) we conclude that the smallest ∗ -algebra containing all functions (2.6) is dense in C∞ (). The following proposition will be very useful in proving many technical details important in future considerations. Proposition 2.3. Let R, ρ, U , S be operators acting on a Hilbert space H and C ∈ C ∗ (H ). Assume that: 1. 2. 3. 4.

R is selfadjoint and ρ is unitary selfadjoint commuting with R, U is unitary, S is positive selfadjoint, ker S = {0}, S commutes with ρ and U and R o S, Operators R, ρ Np(R), U and log S are affiliated with C.

Then G (R, ρ) ∈ M(C) and

A New Quantum Deformation of ‘ax + b’ Group

339

1. For any φ ∈ Rep(C) and any B ∈ C ∗ (Hφ ) we have: φ(R), φ(ρ Np(R)), φ(U ) φ(log S), φ G (R, ρ)∗ U ⇒ . are affiliated with B are affiliated with B 2. For any φ1 , φ2 ∈ Rep(C) such that Hφ1 = Hφ2 we have:   φ1 (R) = φ2 (R), φ1 (S) = φ2 (S),   ⇒  φ1 (ρ Np(R)) = φ2 (ρ Np(R)),  . φ1 G (R, ρ)∗ U = φ2 G (R, ρ)∗ U φ1 (U ) = φ2 (U ) Proof. Relation G (R, ρ) ∈ M(C) follows immediately from (2.4). Ad 1. Let λ ∈ R. Using the commutation relations satisfied by operators R, ρ, U, S we have: S −iλ G (R, ρ)∗ U S iλ = G (tR, ρ)∗ U, where t = eλ > 0. Applying a representation φ of C to both sides of the above relation we get φ(S)−iλ φ G (R, ρ)∗ U φ(S)iλ = φ G (tR, ρ)∗ U . If φ(log S), φ G (R, ρ)∗ U η B, then all factors on the left-hand side of the above equation belong to M(B) and depend continuously on λ (we use strict topology on M(B)). Therefore φ G (tR, ρ)∗ U ∈ M(B) for any t ∈ R+ and the mapping R+ t −→ φ G (tR, ρ)∗ U ∈ M(B) is strictly continuous. Applying the hermitian conjugation and multiplying from the left by φ G (R, ρ)∗ U ∈ M(B) we see that φ G (R, ρ)∗ G (tR, ρ) = φ F (t; R, ρ) ∈ M(B) and the mapping R+ t −→ φ F (t; R, ρ) ∈ M(B) (2.8) is strictly continuous. In the above relations F is the function introduced by (2.5).According to the general theory [13], strictly continuous bounded mappings from R+ into M(B) correspond to elements of M(C∞ (R+ )⊗B). A moment of reflection shows that the mapping (2.8) corresponds to the element (id ⊗ φ oπ)F , where π is the representation of C∞ () introduced by (2.2). This way we have shown that (id ⊗ φ oπ)F ∈ M(C∞ (R+ ) ⊗ B). Using now Proposition 2.2 we conclude that φ oπ ∈ Mor(C∞ (), B). Therefore φ oπ maps continuous functions on into elements affiliated with B. Applying this rule to functions f1 , f2 (cf. (2.3)) and G we obtain: φ(R), φ(ρ Np(R)) η B and φ(G (R, ρ)) ∈ M(B). Comparing the last relation with the assumed one φ G (R, ρ)∗ U ∈ M(B) we see that φ(U ) ∈ M(B). Statement 1 is shown. Ad 2. Let φ = φ1 ⊕ φ2 . Then Hφ = Hφ1 ⊕ Hφ2 and φ(c) = φ1 (c) ⊕ φ2 (c). In our case Hφ1 = Hφ2 . We set: B = m ⊕ m : m ∈ K(Hφ1 ) . Then B ∈ C ∗ (Hφ ). One can easily verify that for any c η C we have: φ(c) η B ⇐⇒ φ1 (c) = φ2 (c) . Now Statement 2 follows immediately from Statement 1.

340

W. Pusz, S.L. Woronowicz

We shall use a slightly different version of Statement 2 of the above proposition. Proposition 2.4. Let R1 , ρ1 , U1 , R2 , ρ2 , U2 , S be operators acting on a Hilbert space H . Assume that for each k = 1, 2 the operators Rk , ρk , Uk , S satisfy Assumptions 1-3 of the previous proposition. Then   R1 = R2 ,   G (R1 , ρ1 )∗ U1 = G (R2 , ρ2 )∗ U2 ⇒  ρ1 Np(R1 ) = ρ2 Np(R2 ),  . (2.9) U1 = U2 . Proof. Let C = K(H ) ⊕ K(H ) and for any m1 , m2 ∈ K(H ) we set φk (m1 ⊕ m2 ) = mk (k = 1, 2). We use Proposition 2.3 with R, ρ, U and S replaced by R1 ⊕ R2 , ρ1 ⊕ ρ2 , U1 ⊕ U2 and S ⊕ S. Now (2.9) follows immediately from Statement 2 of Proposition 2.3. Proposition 2.5. Let X and Y be selfadjoint operators acting on Hilbert spaces K and H respectively. Assume that the spectral measure of X is absolutely continuous with respect to the Lebesgue measure. Then for any A ∈ C ∗ (H ) we have: iX⊗Y e is affiliated ⇒ Y is affiliated with A . with K(K) ⊗ A Proof. For any normal linear functional ω on B(K) and t ∈ R we set fω (t) = ω eitX . Then fω is a continuous function on R. Remembering that the spectral measure of X is absolutely continuous with respect to the Lebesgue measure and using the RiemannLebesgue lemma one can easily show that fω (t) → 0 when t → ±∞. Therefore fω ∈ C∞ (R). Let t, t ∈ R, t = t . Assume for the moment that fω (t) = fω (t ) for all ω. Then itX = eit X and ei(t−t )X = I . It shows that the spectral measure of X is supported e 2π by the set t−t Z, which is in contradiction with the assumption saying that the spectral measure of X is absolutely continuous with respect to the Lebesgue measure. This way we showed that functions fω separate points of R. By the Stone – Weierstrass theorem, the smallest ∗ -subalgebra of C∞ (R) containing all fω is dense in C∞ (R). By the general theory strongly continuous mappings from R into the set of unitary operators acting on K correspond to unitary multipliers of K(K) ⊗ C∞ (R). Let X ∈ M(K(K) ⊗ C∞ (R)) be the unitary corresponding to the mapping R t −→ eitX ∈ B(K). Then for any normal linear functional ω on B(K) we have (ω ⊗ id)X = fω . Using Proposition 2.1 we see that C∞ (R) is generated by X ∈ M(K(K) ⊗ C∞ (R)). For any f ∈ C∞ (R) we set: π(f ) = f (Y ). Then π is a representation of C∞ (R) acting on the Hilbert space Hπ = H . A moment of reflection shows that (id ⊗ π)X = eiX⊗Y . If eiX⊗Y is affiliated with K(K) ⊗ A then π ∈ Mor(C∞ (R), A) and π maps continuous functions on R into elements affiliated with A. Applying this rule to the coordinate function f (t) = t we obtain Y = π(f ) η A.

A New Quantum Deformation of ‘ax + b’ Group

341

3. Constructions Related to Old Quantum ‘ax + b’ Groups In this section we recall the main results of [19]. The quantum ‘ax +b’ group will be presented as a quantum group of unitary operators. We shall construct a pair (A, V ), where A is a C∗ -algebra and V is a unitary element of M(K(K) ⊗ A), where K is a Hilbert space endowed with a certain structure and K(K) denotes the algebra of all compact operators acting on K. (A, V ) may be treated as a quantum family of unitary operators acting on K ‘labeled by elements’ of quantum space related to the C∗ -algebra A. Our construction will depend on a real parameter . We shall assume that 0 < < π/2. Negative value of leads to the C∗ -algebra anti-isomorphic to that with positive . On the other hand the restriction < π/2 is related to the technical assumption used in the theory of the quantum exponential function [17]. The main result of this section is contained in Theorem 3.2. It states that (A, V ) is a π quantum group if and only if = 2k+3 with k = 0, 1, 2, . . . . To define A we consider three operators a, b and β acting on the Hilbert space L2 (R). Operator a is strictly positive selfadjoint and such that for any τ ∈ R and any x ∈ L2 (R) we have a iτ x (t) = eτ/2 x(eτ t).

In other words a is the analytic generator of the one-parameter group of unitaries corresponding to the homotheties of R. Operator b is the multiplication operator: (bx)(t) = tx(t). By definition domain D(b) consists of all x ∈ L2 (R) such that the right-hand side of the above equation is square integrable. Finally, β is the reflection: for any x ∈ L2 (R) we have: (βx)(t) = x(−t). Clearly β is unitary selfadjoint. One can easily verify that aβ = βa and bβ = −βb. By the last relation ibβ is selfadjoint. Moreover a iτ ba −iτ = eτ b for any τ ∈ R. This relation means that a

o

(3.1)

b.

Theorem 3.1. Let

A=

norm closed

f , f , g ∈ C∞ (R) linear envelope . f1 (b) + βf2 (b) g(log a) : 1 2 f2 (0) = 0

Then: 1. A is a nondegenerate C∗ -algebra of operators acting on L2 (R), 2. log a, b and ibβ are affiliated with A: log a, b, ibβ η A, 3. log a, b and ibβ generate A.

(3.2)

342

W. Pusz, S.L. Woronowicz

Proof. Ad 1. Using the relation bβ = −βb one can easily show that norm closed

f , f ∈ C∞ (R) linear envelope B = f1 (b) + βf2 (b) : 1 2 f2 (0) = 0

(3.3)

is a non-degenerate C∗ -algebra of operators acting on L2 (R). Let C0 (R, B) denote the set of all continuous mappings from R into B with compact support. Then

norm closure it A= f (t)a dt : f ∈ C0 (R, B) . (3.4) R

To prove this formula it is sufficient to notice that for f (t) = f1 (b) + βf2 (b) ϕ(t), where t ∈ R and ϕ ∈ C0 (R) we have f (t)a it dt = f1 (b) + βf2 (b) g(log a),

R

where g(λ) = R ϕ(t)eiλt dt (λ ∈ R) and by the Riemann-Lebesque Lemma, g ∈ C∞ (R). On the other hand (3.1) shows that the unitaries a it (t ∈ R) implement a one parameter group of automorphisms of B. Using now the standard technique of the theory of crossed products (cf. [6, Sect. 7.6]) one can easily show that (3.4) is a non-degenerate C∗ -algebra of operators acting on L2 (R). Statement 1 is proven. Ad 2. We recall (cf. [5, 13]) that a closed operator T is affiliated with a C∗ -algebra A 1 1 if the z-transform zT = T (I + T ∗ T )− 2 ∈ M(A) and if (I + T ∗ T )− 2 A is dense in A. − 1 Inspecting definition (3.2) one can easily show that zlog a = (log a) I + (log a)2 2 − 1 is a right multiplier of A and that A I + (log a)2 2 is dense in A. Passing to adjoint ∗ operators we see that zlog a = zlog a is a left multiplier (hence zlog a ∈ M(A)) and that 1 − I + (log a)2 2 A is dense in A. It shows that log a is affiliated with A. 1

1

For T = b and T = iβb we have zT = b(I + b2 )− 2 and zT = iβb(I + b2 )− 2 1 1 respectively. In both cases (I + T ∗ T )− 2 = (I + b2 )− 2 . Taking into account definition 1 (3.2) one can easily show that (I +T ∗ T )− 2 A is dense in A and that zT is a left multiplier of A. However in both cases zT is selfadjoint. Therefore zT is also a right multiplier and zT ∈ M(A). It shows that b and iβb are affiliated with A. −1 Ad 3. We shall use Theorem 3.3 of [13]. By definition (3.2), (I +b2 )−1 I + (log a)2 ∈ A. To end the proof it is sufficient to show that a, b, iβb separate representations of A. If c ∈ A is of the form c = f1 (b) + βf2 (b) g(log a), (3.5) where f1 , f2 , g ∈ C∞ (R), f2 (0) = 0 and f2 is differentiable at point 0 ∈ R, then f2 (t) = ith(t), where t ∈ R and h ∈ C∞ (R) and (3.6) π(c) = f1 (π(b)) + π(iβb)h(π(b)) g(π(log a)) for any representation π of A. One can easily see that elements of the form (3.5) form a dense subset of A. Formula (3.6) shows now that π is determined uniquely by π(log a), π(b) and π(iβb).

A New Quantum Deformation of ‘ax + b’ Group

343

Now we pass to the description of the Hilbert space K (cf. the first paragraph of this section). The structure of K is determined by a triple of selfadjoint operators (a, b, β) acting on K and having the following properties:

1. a > 0, ker a = ker b = {0} and a o b, 2. β is a unitary involution, β commutes with a and anticommutes with b. One of the possible choices is: K = L2 (R) and (a, b, β) = (a, b, β). However there is another possibility that is even more interesting: (a, b, β) = (|b|−1 , ei /2 b−1 a, αβ),

(3.7)

where α = ±1. The reader easily verifies that these operators possess the required properties.

The Zakrzewski relation a o b implies that the spectral measures of a and b are absolutely continuous with respect to the Lebesgue measure. Moreover Sp(a) = R+ and Sp(b) = R. The latter fact follows from the relation β b = −bβ. Let V = G (b ⊗ b, β ⊗ β)∗ e log a⊗log a . i

(3.8)

This is the basic object considered in this section. We shall prove Theorem 3.2. 1. V is a unitary operator and V ∈ M(K(K) ⊗ A), 2. A is generated by V ∈ M(K(K) ⊗ A). i

Proof. Let R = b ⊗ b, ρ = β ⊗ β, U = e log a⊗log a , S = a −1 ⊗ I and C = K(K) ⊗ A. Then all the assumptions of Proposition 2.3 are satisfied. Clearly V = G (R, ρ)∗ U ∈ M(C) and Statement 1 is proved. Let π ∈ Rep(A) and B ∈ C ∗ (Hπ ). Then id ⊗ π is a representation of C acting on K ⊗ Hπ . The reader should notice that (id ⊗ π)S = a −1 ⊗ I is affiliated with K(K) ⊗ B. Assume that (id ⊗ π)V ∈ M(K(K) ⊗ B). By Statement 1 of Proposition 2.3, operators: i (id⊗π )R = b⊗π(b), (id⊗π )(ρ Np(R)) and (id⊗π )U = e log a⊗π(log a) are affiliated with K(K) ⊗ B. Using now Proposition A.1 of [19] we see that π(b) is affiliated with B. One can easily verify that β ⊗ I commutes with ρ and anticommutes with R. Therefore ρ Np(R) − (β ⊗ I )ρ Np(R)(β ⊗ I ) = ρ(Np(R) − Np(−R)) = ρR, and applying id ⊗ π to both sides we get (id ⊗ π )(ρ Np(R)) − (β ⊗ I )(id ⊗ π )(ρ Np(R))(β ⊗ I ) = (id ⊗ π )(ρR) = −i β b ⊗ π(iβb). The operators β ⊗ I and (id ⊗ π )(ρ Np(R)) appearing on the left-hand side are affiliated with K(K) ⊗ B. Therefore i β b ⊗ π(iβb) η K(K) ⊗ B and using again Proposition A.1 of [19] we see that π(iβb) is affiliated with B. Moreover, remembering that i e log a⊗π (log a) η K(K) ⊗ B and using Proposition 2.5 we see that π(log a) is affiliated with B. According to Statement 3 of Theorem 3.1, b, iβb and log a generate A. Therefore π ∈ Mor(A, B). We showed that (id ⊗ π)V ∈ M(K(K) ⊗ B) implies π ∈ Mor(A, B). It means that A is generated by V ∈ M(K(K) ⊗ A).

344

W. Pusz, S.L. Woronowicz

Now we are able to formulate the main result of this section: Theorem 3.3.   There exists ∈ Mor(A, A ⊗ A) π   such that , k = 0, 1, 2, . . . .   ⇐⇒ = 2k + 3 (id ⊗ )V = V12 V13 Proof. Let iπ 2

α = ie 2 , T = I ⊗ ei /2 b−1 a ⊗ b, τ = (I ⊗ β ⊗ β) αχ (b ⊗ b ⊗ I < 0) + αχ (b ⊗ b ⊗ I > 0) and W = G (T , τ )∗ e

− i I ⊗log|b|⊗log a

.

(3.9)

(3.10)

Clearly W is a unitary operator acting on K ⊗ L2 (R) ⊗ L2 (R). We shall prove that ∗

V12 V13 = W V12 W .

(3.11)

To make our formulae shorter we set Z = e− log|b|⊗log a .

i

i

U = e log a⊗log a , Using the relations a

o

b, a β = β a and a

o

b one can easily verify that

U (b ⊗ I )U ∗ = b ⊗ a, U (β ⊗ I )U ∗ = β ⊗ I, Z(a ⊗ I )Z ∗ = a ⊗ a.

(3.12) (3.13)

With the above notation V = G (b ⊗ b, β ⊗ β)∗ U and V12 V13 = G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ U12 G (b ⊗ I ⊗ b, β ⊗ I ⊗ β)∗ U13 . Using (3.12) we get U12 G (b ⊗ I ⊗ b, β ⊗ I ⊗ β)∗ = G (b ⊗ a ⊗ b, β ⊗ I ⊗ β)∗ U12 and

∗ V12 V13 = G (b ⊗ a ⊗ b, β ⊗ I ⊗ β) G (b ⊗ b ⊗ I, β ⊗ β ⊗ I ) U12 U13 . (3.14)

Let us consider the first factor in (3.14). We apply Theorem 1.6 with R = b ⊗ a ⊗ b, ρ = β ⊗ I ⊗ β, S = b ⊗ b ⊗ I, σ = β ⊗ β ⊗ I. Then T and τ are given by (3.9) and G (b ⊗ a ⊗ b, β ⊗ I ⊗ β) G (b ⊗ b ⊗ I, β ⊗ β ⊗ I ) = G (T , τ )∗ G (b ⊗ b ⊗ I, β ⊗ β ⊗ I ) G (T , τ ).

(3.15)

A New Quantum Deformation of ‘ax + b’ Group

345

Now (3.14) takes the form V12 V13 = G (T , τ )∗ G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ G (T , τ ) U12 U13 .

(3.16)

We shall move G (T , τ ) to the end of the right-hand side of this formula. Performing simple computations and using (3.13) we obtain: i

U12 U13 = e log a⊗log(a⊗a) ∗ . = Z23 U12 Z23

It turns out that log a ⊗ log(a ⊗ a) commutes with T , log a ⊗ log(a ⊗ a) commutes with τ.

(3.17) (3.18)

Indeed the Zakrzewski relation a o b implies b−1 o a. Using both relations we see that a ⊗ a commutes with ei /2 b−1 a ⊗ b. Therefore log(a ⊗ a) commutes with ei /2 b−1 a ⊗ b and log a ⊗ log(a ⊗ a) commutes with T = I ⊗ ei /2 b−1 a ⊗ b. Relation (3.17) is shown.

To prove (3.18) we use Zakrzewski relations a o b and a o b. They show that a commutes with sgn b and a commutes with sgn b. Therefore log a⊗log(a⊗a) commutes with sgn(b ⊗ b ⊗ I ) = sgn b ⊗ sgn b ⊗ I and (3.18) follows. Taking into account (3.17) and (3.18) we see that G (T , τ ) commutes with U12 U13 . Now relation (3.16) takes the form: ∗ V12 V13 = G (T , τ )∗ G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ Z23 U12 Z23 G (T , τ ). (3.19)

Finally b ⊗ I and β ⊗ I commute with log |b| ⊗ log a. Therefore G (b ⊗ b ⊗ I, β ⊗ β ⊗ I ) commutes with Z23 . Clearly G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ U12 = V12 and W = G (T , τ )∗ Z23 . Now (3.11) follows immediately from (3.19). By the Zakrzewski relation a iλ ba −iλ = eλ b for all λ ∈ R. Multiplication by a strictly positive number does not change the sign of an operator. Using this fact one can easily show that τ commutes with a iλ ⊗ I ⊗ I . Consequently τ commutes with a ⊗ I ⊗ I . Since T = I ⊗ ei /2 b−1 a ⊗ b and I ⊗ log |b| ⊗ log a obviously commute with a ⊗ I ⊗ I , we conclude that W commutes with a ⊗ I ⊗ I . Now we are ready to prove the main statement. ⇒ . Let ∈ Mor(A, A ⊗ A) and (id ⊗ )V = V12 V13 . We go back to the notation used in the proof of Theorem 3.2. In particular C = K(K) ⊗ A. For any c ∈ C we set: φ1 (c) = (id ⊗ )(c), φ2 (c) = W (c ⊗ I )W ∗ . Then φ1 and φ2 are representations of C acting on the same Hilbert space K ⊗ L2 (R) ⊗ L2 (R). One can easily verify that φ1 (a ⊗ I ) = a ⊗ I ⊗ I = φ2 (a ⊗ I ). Formula (3.11) shows that φ1 (V ) = φ2 (V ). In our notation (cf. the beginning of the proof of Theorem 3.2), a ⊗ I = S and V = G (R, ρ)∗ U , where in particular R = b ⊗ b. Statement 2 of Theorem 2.3 shows now that φ1 (R) = φ2 (R). It means that ∗

b ⊗ (b) = W (b ⊗ b ⊗ I ) W .

346

W. Pusz, S.L. Woronowicz

Taking into account (3.10) and using Proposition 1.2 we get: b ⊗ (b) = G (T , τ )(b ⊗ b ⊗ I )G (T , τ )∗ = b⊗a⊗b+b⊗b⊗I τ .

(3.20)

We recall that

τ = (I ⊗ β ⊗ β) αχ (b ⊗ b ⊗ I < 0) + αχ (b ⊗ b ⊗ I > 0) .

Inspecting the last two formulae we observe that b is the only operator appearing in the first leg position. We know that b is selfadjoint. Therefore replacing in both sides of (3.20) operator b by a real number λ we obtain a formula that should hold for almost all λ ∈ Sp b. For positive λ we get (b) = a ⊗ b + b ⊗ I , (3.21) τ+

where

τ+ = (β ⊗ β) αχ (b ⊗ I < 0) + αχ (b ⊗ I > 0) .

On the other hand for negative λ we have (b) = a ⊗ b + b ⊗ I where

τ−

,

τ− = (β ⊗ β) αχ (b ⊗ I > 0) + αχ (b ⊗ I < 0) .

(3.22)

(3.23)

(3.24)

Clearly the two expressions for (b) must coincide. Let us notice that the operator I ⊗ β commutes with τ+ , τ− and b ⊗ I and anticommutes with a ⊗ b. Therefore τ+ = τ− by iπ 2

Remark 1.3. Comparing (3.22) and (3.24) we get α = α. Remembering that α = i e 2 π and 0 < < π2 we conclude that = 2k+3 (k = 0, 1, 2, . . . ). π ⇐. Assume that = 2k+3 for some k = 0, 1, 2, . . . . Then formula (3.10) essentially simplifies. In this case α = (−1)k , τ = (−1)k (I ⊗ β ⊗ β) and W = W23 = I ⊗ W , where ∗ i (3.25) W = G ei /2 b−1 a ⊗ b, (−1)k β ⊗ β e− log|b|⊗log a . Formula (3.11) takes the form ∗ V12 V13 = W23 V12 W23 .

(3.26)

(c) = W (c ⊗ I )W ∗ .

(3.27)

For any c ∈ A we set Then is a representation of A acting on L2 (R)⊗L2 (R). We know that V

∈ M(K(K)⊗

A). Formula (3.26) shows that (id ⊗ )V = V12 V13 . Clearly V12 , V13 ∈ M(K(K) ⊗ A ⊗ A). Therefore (id ⊗ )V = V12 V13 ∈ M(K(K) ⊗ A ⊗ A). Remembering that A is generated by V we conclude that ∈ Mor(A, A ⊗ A).

A New Quantum Deformation of ‘ax + b’ Group

347

π Let = 2k+3 (k = 0, 1, 2, . . . ). Then formula (3.27) makes it possible to calculate (c) for any c ∈ A. The same holds for any c affiliated with A. We shall show that

(a) = a ⊗ a, (b) = a ⊗ b + b ⊗ I (−1)k β⊗β , (ib2k+3 β) = a 2k+3 ⊗ ib2k+3 β + ib2k+3 β ⊗ I − sgn(b⊗b) .

(3.28)

Formula for (a) follows immediately from (3.13); the reader should notice that operators ei /2 b−1 a ⊗ b and β ⊗ β commute with a ⊗ a. The formula for (b) was in fact shown in the proof of Theorem 3.3; in the present case τ+ = τ− = (−1)k β ⊗ β and the second formula of (3.28) coincides with (3.21) (and with (3.23) as well). It remains to prove the third formula. We know that |b| commutes with ib2k+3 β. Taking into account (3.25) we obtain (ib2k+3 β) = W (ib2k+3 β ⊗ I )W ∗ ∗ ib2k+3 β⊗I = G ei /2 b−1 a⊗b, (−1)k β⊗β ×G ei /2 b−1 a⊗b, (−1)k β⊗β .

(3.29)

To compute the right-hand side we use Proposition 1.4 with R = a ⊗ |b| ,

S = |b| ⊗ I,

τ = sgn(b ⊗ b),

ξ = (−1)k β ⊗ β,

ρ = I ⊗ i(sgn b)β, σ = i(sgn b)β ⊗ I. Remembering that β 2 = I and β anticommutes with b and hence commutes with |b| one can easily check that these operators fulfill all assumption of Proposition 1.4. In this case we have T = (ei /2 |b|−1 a) ⊗ |b| and τ T = ei /2 b−1 a ⊗ b. According to our assumption π = 2k + 3 is an odd positive integer. Therefore π σ S = i(sgn b)β ⊗ I |b|2k+3 ⊗ I = ib2k+3 β ⊗ I, π ρR = I ⊗ i(sgn b)β a 2k+3 ⊗ |b|2k+3 = a 2k+3 ⊗ ib2k+3 β, and formula (1.11) takes the form ib2k+3 β ⊗ I + a 2k+3 ⊗ ib2k+3 β

− sgn(b⊗b) ∗ 2k+3

ib = G ei /2 b−1 a⊗b, (−1)k β⊗β ×G ei /2 b−1 a⊗b, (−1)k β⊗β .

β⊗I

(3.30)

Comparing (3.29) with (3.30) we get the last formula of (3.28). This formula appeared without proof in [19].

348

W. Pusz, S.L. Woronowicz

Remark 3.4. Let s ∈ S 1 be a number of modulus 1. Replacing in the above computations σ = i(sgn b)β ⊗ I and ρ = i(sgn b)β ⊗ I by σ = s sgn b β ⊗ I and ρ = s sgn b β ⊗ I respectively, one can prove that (s sgn b |b|2k+3 β) = a 2k+3 ⊗ s sgn b |b|2k+3 β + s sgn b |b|2k+3 β ⊗ I (3.31) . − sgn(b⊗b)

If s = i then s sgn b = i sgn b and (3.31) reduces to the previous formula. For s = 1 we get . (3.32) (|b|2k+3 β) = a 2k+3 ⊗ |b|2k+3 β + |b|2k+3 β ⊗ I − sgn(b⊗b)

Assume now that K = L2 (R) and that the operators a, b, β are given by (3.7). Then operator (3.8) coincides with (3.25): V = W . Relation (3.26) takes the form: W23 W12 = W12 W13 W23 . This is the famous pentagon equation of Baaj and Skandalis [2]. It means that W is a multiplicative unitary. It is known that W is modular [8]. This property enables us to introduce the unitary antipode, scaling group and Haar weight (see [8, 16, 19, 10, 20] for details). In [19] we discussed the cyclic group of four elements acting on a quantum ‘ax + b’ group. In fact this action may be extended to an action of S 1 . At the beginning we set no condition for ∈ R. For any s ∈ S 1 and any closed operator c we set φs (c) = ws∗ c ws , where ws is the unitary operator introduced by ws = s χ(b<0) . Obviously ws commutes with a and b. Moreover ws∗ βws = s −χ(b<0) βs χ(b<0) = s −χ(b<0) s χ(b>0) β = s sgn b β. These facts show that the algebra A introduced by (3.2) is invariant under φs (for all real π ). For special values h = 2k+3 the algebra A is equipped with the comultiplication introduced by Theorem 3.3. Using (1.3) one can easily check that the multiplicative unitary (3.25) commutes with ws ⊗ ws . Formula (3.27) shows now that the comultiplication is preserved by the automorphisms φs : (φs (c)) = (φs ⊗ φs )(c) for all s ∈ S 1 and c ∈ A. 4. New Quantum Deformations of ‘ax + b’ Group In this section we shall show how to enlarge the set of admissible values of the deformation parameter beyond the one described in Theorem 3.3. To this end one has to add a new element to the set of generators of the C∗ -algebra A. This new element denoted by w is a unitary operator commuting with a and b such that w ∗ βw = s sgn b β.

(4.1)

In this formula s ∈ S 1 is a new deformation parameter. We shall see later that s is related to .

A New Quantum Deformation of ‘ax + b’ Group

349

To define the new C∗ -algebra A we consider four operators a, b, β and w acting on the Hilbert space L2 (R × S 1 ) introduced in the following way: for any τ ∈ R and any x ∈ L2 (R × S 1 ) we set: iτ a x (t, z) = eτ/2 x(eτ t, z), (bx)(t, z) = tx(t, z), (4.2) (βx)(t, z) = x(−t, z), (wx)(t, z) = s χ(t<0) z x(t, z). As in the previous section a is the analytic generator of the group of unitaries defined by the first formula. Operator b is selfadjoint. Its domain consists of all x such that |tx(t, z)|2 is integrable over R × S 1 . Clearly β and w are unitary and β ∗ = β. By simple computations, aβ = βa, bβ = −βb, aw = wa, bw = wb, w∗ βw = s sgn b β and

a o b. Furthermore βwβ = s sgn b w and βw sgn b β = (βwβ)− sgn b = (s sgn b w)− sgn b = s −1 w − sgn b . Hence βw sgn b = s −1 w − sgn b β. In what follows we shall use operator L introduced by the formula (Lx)(t, z) = z

∂x(t, z) . ∂z

(4.3)

One can easily verify that L is a selfadjoint operator with integer spectrum, it commutes with a, b and β and w ∗ Lw = L + I . Using the last relation we get (I ⊗ w)L⊗I (w ⊗ I )(I ⊗ w)−L⊗I = w ⊗ w.

(4.4)

Using essentially the same method as in the proof of Theorem 3.1 one can easily show Theorem 4.1. Let

f , f , g ∈ C∞ (R) A = (f1 (b) + βf2 (b)) g(log a)w : 1 2 f2 (0) = 0, k ∈ Z k

norm closed linear envelope

.

(4.5)

Then: 1. A is a nondegenerate C ∗ -algebra of operators acting on L2 (R × S 1 ), 2. log a, b, ibβ and w are affiliated with A: log a, b, ibβ, w η A, 3. log a, b, ibβ and w generate A, The reader should notice that the C∗ -algebra A introduced by (4.5) coincides with the crossed product of the C∗ -algebra A considered in the previous section (cf. (3.2)) by the automorphism that leaves a and b invariant and maps β into s sgn b β. Now we pass to the description of the Hilbert space K. The structure of K is determined by a quadruple of selfadjoint operators (a, b, β, L) acting on K and having the following properties: 1. a > 0, ker a = ker b = {0} and a

o

b,

2. β is a unitary involution, β commutes with a and anticommutes with b,

(4.6)

3. L is of integer spectrum, L strongly commutes with a and b, 4. β Lβ = L − sgn b. One of the possible choices is: K = L2 (R × S 1 ) and (a, b, β, L) = (a, b, β, L + χ(b > 0)). However there is another possibility that is even more interesting: (a, b, β, L) = (|b|−1 , ei /2 b−1 a, αwsgn b β, L),

(4.7)

350

W. Pusz, S.L. Woronowicz

where α ∈ S 1 and α 2 = s. The reader easily verifies that these operators possess the properties (4.6). Let V = G (b ⊗ b, β ⊗ β)∗ e log a⊗log a (I ⊗ w)L⊗I . i

(4.8)

This is the basic object considered in this section. We shall prove Theorem 4.2. 1. V is a unitary operator and V ∈ M(K(K) ⊗ A), 2. A is generated by V ∈ M(K(K) ⊗ A). i

Proof. Let R = b ⊗ b, ρ = β ⊗ β, U = e log a⊗log a (I ⊗ w)L⊗I , S = a −1 ⊗ I and C = K(K) ⊗ A. Then all the assumptions of Proposition 2.3 are satisfied. Hence V = G (R, ρ)∗ U ∈ M(C) and Statement 1 is proved. Let π be a representation of A and B ∈ C ∗ (Hπ ). Then id ⊗ π is a representation of C acting on K ⊗ Hπ . Assume that (id ⊗ π)V ∈ M(K(K) ⊗ B). Repeating the reasoning used in the proof of Theorem 3.2 we see that π(b) and π(iβb) are affiliated with B. i Furthermore (id ⊗ π )U = e log a⊗π(log a) (I ⊗ π(w))L⊗I is affiliated with K(K) ⊗ B. We know that a commutes with L. Therefore a respects the decomposition of K into a direct sum of eigenspaces of L. Let K = K(L = ). Then K =

!

K ,

a =

∈Z

! ∈Z

a and L =

!

I.

∈Z

With this notation (id ⊗ π)U =

!

e log a ⊗π(log a) (I ⊗ π(w) ). i

∈Z

Let = 0, 1. Remembering that (id ⊗ π)U is affiliated with K(K) ⊗ B we see that i i e log a0 ⊗π (log a) is affiliated with K(K0 )⊗B and e log a1 ⊗π(log a) (I ⊗π(w)) is affiliated with K(K1 )⊗B. Proposition 2.5 shows now that π(log a) is affiliated with B. Using this fact one can easily show that π(w) is also affiliated with B. According to Statement 3 of Theorem 4.1, b, iβb, log a and w generate A. Therefore π ∈ Mor(A, B). We showed that (id ⊗ π )V ∈ M(K(K) ⊗ B) implies π ∈ Mor(A, B). It means that A is generated by V ∈ M(K(K) ⊗ A). Now we are able to formulate the main result of this section: Theorem 4.3.   There exists ∈ Mor(A, A ⊗ A) p ∈ R, p > 2 π   such that .   ⇐⇒ = , where p and eiπp = −s (id ⊗ )V = V12 V13 Proof. We essentially repeat the proof of Theorem 3.3. Since in large part calculations are very similar, we sketch the main steps only and point out the necessary modifications. We shall use the operator L introduced in (4.3).

A New Quantum Deformation of ‘ax + b’ Group

351

Let iπ 2

α = ie 2 , T = I ⊗ ei /2 b−1 a ⊗ b,

(4.9)

τ = (I ⊗ βw − sgn b ⊗ β) αs −1 χ (b ⊗ b ⊗ I < 0) + αχ (b ⊗ b ⊗ I > 0) and W = G (T , τ )∗ e− I ⊗log|b|⊗log a (I ⊗ I ⊗ w)I ⊗L⊗I . i

(4.10)

Clearly W is a unitary operator acting on K ⊗ L2 (R × S 1 ) ⊗ L2 (R × S 1 ). We shall prove that ∗

V12 V13 = W V12 W .

(4.11)

In order to make our formulae shorter we set i

U = e log a⊗log a (I ⊗ w)L⊗I , Z = e− log|b|⊗log a (I ⊗ w)L⊗I . i

Using the commutation relations, one can easily verify that U (b ⊗ I )U ∗ = b ⊗ a, Z(a ⊗ I )Z ∗ = a ⊗ a,

U (β ⊗ I )U ∗ = (β ⊗ I )(I ⊗ w)− sgn b⊗I , (4.12) Z(w ⊗ I )Z ∗ = w ⊗ w. (4.13)

The last formula follows from (4.4). With the above notation V = G (b ⊗ b, β ⊗ β)∗ U and V12 V13 = G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ U12 G (b ⊗ I ⊗ b, β ⊗ I ⊗ β)∗ U13 . Taking into account (4.12) we get U12 G (b ⊗ I ⊗ b, β ⊗ I ⊗ β)∗ = G (b ⊗ a ⊗ b, (β ⊗ I ⊗ β)(I ⊗ w ⊗ I )− sgn b⊗I ⊗I )∗ U12 ∗ = G b ⊗ a ⊗ b, β ⊗ (I ⊗ β)(w ⊗ I )I ⊗sgn b U12 . The second equality follows from (1.3). Therefore ∗ V12 V13 = G (R, ρ) G (S, σ ) U12 U13 ,

(4.14)

(4.15)

where R = b ⊗ a ⊗ b, ρ = β ⊗ (I ⊗ β)(w ⊗ I )I ⊗sgn b , S = b ⊗ b ⊗ I, σ = β ⊗ β ⊗ I.

(4.16)

One can easily verify that R, S are selfadjoint, ρ, σ are unitary selfadjoint, R commutes with ρ and anticommutes with σ and S anticommutes with ρ and commutes

352

W. Pusz, S.L. Woronowicz

with σ . Operator T = ei /2 S −1 R = I ⊗ ei /2 b−1 a ⊗ b coincides with the operator T introduced by (4.9). Moreover σρ = I ⊗ (β ⊗ β)(w ⊗ I )I ⊗sgn b ≡ I ⊗ βw − sgn b ⊗ β, ρσ = I ⊗ (I ⊗ β)(w ⊗ I )I ⊗sgn b (β ⊗ I ) = I ⊗ (β ⊗ β)(βwβ ⊗ I )I ⊗sgn b = I ⊗ (β ⊗ β)(s sgn b w ⊗ I )I ⊗sgn b ≡ s −1 I ⊗ βw − sgn b ⊗ β, where ‘≡’ denotes the equivalence relation: x ≡ y if and only if xχ (T < 0) = yχ(T < 0). Consequently αρσ χ (S < 0) + ασρχ (S > 0) ≡ τ, where τ is given by (4.9). Theorem 1.6 shows now that G (R, ρ)G (S, σ ) = G (T , τ )∗ G (S, σ ) G (T , τ ) and (4.15) takes the form V12 V13 = G (T , τ )∗ G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ G (T , τ ) U12 U13 .

(4.17)

Performing simple computations and using (4.13) we obtain i

U12 U13 = e log a⊗log(a⊗a) (I ⊗ w ⊗ w)L⊗I ⊗I ∗ . = Z23 U12 Z23

Repeating the arguments used in the proof of Theorem 3.3 we see that G (T , τ ) i commutes with e log a⊗log(a⊗a) . One can easily check that T commute with L ⊗ I ⊗ I , I ⊗ w ⊗ w and τ commutes with L ⊗ I ⊗ I . Moreover (I ⊗ w ⊗ w)∗ τ (I ⊗ w ⊗ w) = τ (I ⊗ s sgn b ⊗ s sgn b ) ≡ τ. Therefore G (T , τ ) commutes with (I ⊗ w ⊗ w)L⊗I ⊗I and in (4.17) we may move G (T , τ ) to the most right position: ∗ G (T , τ ) (4.18) V12 V13 = G (T , τ )∗ G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ Z23 U12 Z23

Finally one easily verifies that b ⊗ I and β ⊗ I commute with log |b| ⊗ log a, L ⊗ I and I ⊗ w. Therefore G (b ⊗ b ⊗ I, β ⊗ β ⊗ I ) commutes with Z23 . Clearly G (b ⊗ b ⊗ I, β ⊗ β ⊗ I )∗ U12 = V12 and W = G (T , τ )∗ Z23 . Now (4.11) follows immediately from (4.18). Also in the present case W commutes with a ⊗ I ⊗ I . The same proof applies. Now we are ready to prove the main statement. ⇒ . Let ∈ Mor(A, A ⊗ A) and (id ⊗ )V = V12 V13 . Repeating the reasoning used in the proof of Theorem 3.3 we easily arrive at the formula (4.19) (b) = a ⊗ b + b ⊗ I τ = a ⊗ b + b ⊗ I τ , +

where

−

τ+ = (βw− sgn b ⊗ β) αs −1 χ (b ⊗ I < 0) + αχ (b ⊗ I > 0) , τ− = (βw− sgn b ⊗ β) αs −1 χ (b ⊗ I > 0) + αχ (b ⊗ I < 0) .

(4.20)

A New Quantum Deformation of ‘ax + b’ Group

353

Clearly the two expressions for (b) must coincide. Let us notice that the operator I ⊗ β commutes with τ+ , τ− and b ⊗ I and anticommutes with a ⊗ b. Therefore τ+ = τ− by iπ 2

Remark 1.3. Using (4.20) we get s = α 2 . Remembering that α = i e 2 and 0 < < π2 we conclude that = πp , where p ∈ R, p > 2 and eiπp = −s. ⇐. Assume that = πp , for some p ∈ R such that p > 2 and eiπp = −s. Then formula (4.10) essentially simplifies. In this case αs −1 = α, τ = I ⊗ αβw − sgn b ⊗ β = I ⊗ αwsgn b β ⊗ β and W = W23 = I ⊗ W , where ∗ i W = G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β e− log|b|⊗log a (I ⊗ w)L⊗I . (4.21) Formula (4.11) takes the form ∗ V12 V13 = W23 V12 W23 .

(4.22)

(c) = W (c ⊗ I )W ∗ .

(4.23)

For any c ∈ A we set

Then is a representation of A acting on L2 (R × S 1 ) ⊗ L2 (R × S 1 ). We know that V ∈ M(K(K) ⊗ A). Formula (4.22) shows that (id ⊗ )V = V12 V13 . Clearly V12 , V13 ∈ M(K(K) ⊗ A ⊗ A). Therefore (id ⊗ )V = V12 V13 ∈ M(K(K) ⊗ A ⊗ A). Remembering that A is generated by V we conclude that ∈ Mor(A, A ⊗ A). Let s = −eiπp and = πp for some p > 2. Formula (4.23) enables us to calculate (c) for any c ∈ A. The same holds for any c affiliated with A. We shall show that (a) = a ⊗ a, (b) = a ⊗ b + b ⊗ I , αwsgn b β⊗β β |b|p = (w ⊗ I )−I ⊗sgn b (a p ⊗ β |b|p ) + β |b|p ⊗ I

− sgn(b⊗b)

,

(4.24)

(w) = w ⊗ w. Repeating the reasoning preceding the formula (4.18) one can show that a ⊗ a and w ⊗ w commute with G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β and formulae for (a) and (w) follow immediately from (4.13). The formula for (b) coincides with (4.19). It remains to prove the third formula. According to (4.21) operator W is the com ∗ i position of two unitaries: G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β and e− log|b|⊗log a (I ⊗ w)L⊗I . Formula (4.23) shows now that = ψ oϕ, where ϕ(c) = e− log|b|⊗log a (I ⊗ w)L⊗I (c ⊗ I ) (I ⊗ w)−L⊗I e log|b|⊗log a , ∗ ψ(d) = G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β d G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β . i

i

One can easily verify that ϕ(b) = b ⊗ I and ϕ(β) = β ⊗ I . Therefore ϕ(β |b|p ) = β |b|p ⊗ I

354

W. Pusz, S.L. Woronowicz

and β |b|p = ψ β |b|p ⊗ I .

(4.25)

To compute the right-hand side we use Proposition 1.4 with R = a ⊗ |b| ,

S = |b| ⊗ I,

τ = sgn(b ⊗ b),

ξ = αw sgn b β ⊗ β,

ρ = (w ⊗ I )−I ⊗sgn b (I ⊗ β),

σ = β ⊗ I.

One can easily check that these operators fulfill all assumptions of Proposition 1.4. In this case we have T = (ei /2 |b|−1 a) ⊗ |b| and τ T = ei /2 b−1 a ⊗ b. According to our assumption π = p. Therefore π

σ S = β |b|p ⊗ I,

π ρR = (w ⊗ I )−I ⊗sgn b a p ⊗ β |b|p , and formula (1.11) takes the form

β |b|p ⊗ I + (w ⊗ I )−I ⊗sgn b a p ⊗ β |b|p

− sgn(b⊗b)

= ψ β|b|p ⊗I . (4.26)

Comparing (4.25) with (4.26) we get the third formula of (4.24).

5. Modularity and All That Now we shall investigate the unitary W introduced by (4.21). We shall prove that W is a modular multiplicative unitary. Throughout this section s = −eiπp and = πp , where p > 2. Let K be the Hilbert space complex conjugate to K. The structure of K is established by an antiunitary mapping K x ←→ x ∈ K. For any closed operator c acting on K, we denote by c the transpose of c. By definition c is an operator acting on K with domain D(c ) = {x : x ∈ D(c∗ )} such that c x = c∗ x 1

for any x ∈ D(c∗ ). In what follows Q = a 2 . Proposition 5.1. Let V be the unitary operator introduced by (4.8) and i = G −b⊗ei /2 ba −1 , −β ⊗αw sgn b β e log a ⊗log a (I ⊗ w)L ⊗I . V

(5.1)

is unitary and for any x, z ∈ K, y ∈ D(Q−1 ), u ∈ D(Q) we have: Then V x ⊗ Q−1 y . (x ⊗ u V z ⊗ y) = z ⊗ Qu V

(5.2)

A New Quantum Deformation of ‘ax + b’ Group

355

Proof. Let us notice that χ (−b⊗ei /2 ba −1 < 0) = χ (b ⊗ b > 0). By virtue of (1.3), we may replace β ⊗ αw sgn b β = β ⊗ αβw − sgn b by α(I ⊗ β) τ , where τ = (β ⊗ I )(I ⊗ w)− sgn b

⊗I

(5.3)

.

Therefore i = G −b⊗ei /2 ba −1 , −α(I ⊗ β) τ e log a ⊗log a (I ⊗ w)L ⊗I . V

(5.4)

We know that β L = (L − sgn b)β. Therefore L β = β (L − sgn b ) and ⊗I

(I ⊗ w)L

⊗I −sgn b ⊗I

(β ⊗ I ) = (β ⊗ I )(I ⊗ w)L

.

It shows that ⊗I

τ (I ⊗ w)L

⊗I

= (I ⊗ w)L

(β ⊗ I ).

(5.5)

We shall follow the proof of Proposition 2.3 of [19]. The reader should notice that in large part that proof is independent of the particular value of . To make our formulae shorter we set: i

U = e log a⊗log a ,

= e i log a ⊗log a , U

=U (I ⊗ w)L ⊗I , U = U (I ⊗ w)L⊗I , U " " " " = "b ⊗ ei /2 ba −1 " . B = "b ⊗ b" , B

(5.6)

We know that sgn b and Q commute. Therefore we may assume that u and y are eigenvectors of sgn b. Similarly we may assume that x and z are common eigenvectors of sgn b. Proceeding in the same way as in [19] we reduce (5.2) to the following three equations (cf. [19, formula (2.23) and the next two]): − π i)U x ⊗ Q−1 y , (5.7) x ⊗ u Vθ (log B)∗ U z ⊗ y = z ⊗ Qu Vθ (log B U x ⊗ Q−1 y , x ⊗ u Vθ (log B − πi)∗ U z ⊗ y = z ⊗ Qu Vθ (log B) (5.8)

x⊗u

∗ π i β ⊗ β B Vθ (log B − πi) U z ⊗ y

π Vθ (log B − π i)U x ⊗ Q−1 y . = z ⊗ Qu − iα(I ⊗ β) τB In these formulae θ = 2π = 2p. The left-hand side of the last formula π LHS of (5.9) = −i βx ⊗ βu B Vθ (log B − π i)∗ U z ⊗ y . Similarly remembering that β commutes with Q we have: π Vθ (log B − π i)U x ⊗ Q−1 y . τB RHS of (5.9) = −iα z ⊗ Qβu

(5.9)

356

W. Pusz, S.L. Woronowicz

We shall move τ to the most right position. Clearly (cf. (5.3)) this operator commutes i with B and U = e log a ⊗log a . Taking into account (5.5) we obtain π Vθ (log B − π i)U βx ⊗ Q−1 y . RHS of (5.9) = −iα z ⊗ Qβu B Replacing βx and βu by x and u respectively we see that (5.9) is equivalent to the equation π x ⊗ u αB Vθ (log B − πi)∗ U z ⊗ y π − πi)U x ⊗ Q−1 y . Vθ (log B = z ⊗ Qu B (5.10) Let us notice that our crucial formulae (5.7), (5.8) and (5.10) fit the same pattern: U x ⊗ Q−1 y , (5.11) (x ⊗ u fi (B)U z ⊗ y) = z ⊗ Qu gi (B) where fi and gi (i = 1, 2, 3) are functions on positive reals: f1 (t) = Vθ (log t),

g1 (t) = Vθ (log t − π i),

f2 (t) = Vθ (log t − π i),

g2 (t) = Vθ (log t),

π

π

f3 (t) = αt Vθ (log t − π i), g3 (t) = t Vθ (log t − π i) for all t > 0. (cf. (5.6)) we obtain a simplified version by U and U Replacing operators U and U of (5.11): U x ⊗ Q−1 y . (5.12) x ⊗ u fi (B)U z ⊗ y = z ⊗ Qu gi (B) It is known that the last equality holds in all three cases i = 1, 2, 3 (cf. [19, proof of Proposition 2.3]). We shall show that (5.11) follows from (5.12). We shall use the expansion (I ⊗ w)L⊗I =

∞ #

χ (L = m) ⊗ w m .

(5.13)

m=−∞

Inserting in (5.12), χ (L = m)x and wm y instead of x and y we obtain: x⊗u fk (B)U χ (L=m)⊗w m z⊗y U χ (L =m)⊗w m x⊗Q−1 y . = z⊗Qu gk (B) Summing over m and using (5.13) we obtain (5.11). The proof is complete.

We recall the basic definitions [2, 16, 8]. Let H be a Hilbert space and W be a unitary operator acting on H ⊗ H . We say that W is multiplicative unitary if it satisfies the pentagonal equation W23 W12 = W12 W13 W23 .

A New Quantum Deformation of ‘ax + b’ Group

357

A multiplicative unitary W is said to be modular if there exist strictly positive selfadjoint acting on H ⊗ H such that operators Q and Q acting on H and a unitary operator W Q ⊗ Q commutes with W and x ⊗ Q−1 y (5.14) (x ⊗ u W z ⊗ y) = z ⊗ Qu W for any x, z ∈ H , u ∈ D(Q) and y ∈ D(Q−1 ). In this definition H is the complex conjugate Hilbert space related to H by an antiunitary mapping H x ←→ x ∈ H . The main result of this section is contained in the following Theorem 5.2. The operator W introduced by (4.21) is a modular multiplicative unitary acting on L2 (R × S 1 ) ⊗ L2 (R × S 1 ). Proof. Assume that K = L2 (R × S 1 ). One can easily verify that operators a = |b|−1 ,

β = αw sgn b β,

b = ei /2 b−1 a, L = L

(5.15)

obey the properties listed in (4.6). In particular β 2 = I , β ∗ = β and β Lβ = wsgn b βLβw − sgn b = w sgn b Lw − sgn b = L − sgn b = L − sgn b. With this choice, the right-hand side of (4.8) coincides with that of (4.21): V = W and relation (4.22) takes the form: W23 W12 = W12 W13 W23 . Hence W is a multiplicative unitary operator. Let Q = a 1/2 and Q = |b|1/2 . Inserting in (5.1) operators (5.15) we obtain a uni satisfying formula (5.14). To end the proof we have to show that W tary operator W commutes with Q ⊗ Q. We know that a commutes with β and w. One can easily check that |b| commutes with αw sgn b β and L. Therefore Q ⊗ Q = |b|1/2 ⊗ a 1/2 commutes with αw sgn b β ⊗ β, L ⊗ I and I ⊗ w. Clearly it commutes with log |b| ⊗ log a. Moreover due to the Zakrzewski

relation a o b, Q ⊗ Q commutes with ei /2 b−1 a ⊗ b. Inspecting formula (4.21) we see that Q ⊗ Q commutes with W . Now we can use the full power of the theory of multiplicative unitaries [2, 16, 8]. Denoting by B(L2 (R × S 1 ))∗ the set of all normal functionals on B(L2 (R × S 1 )) we have: norm closure A = (ω ⊗ id)W : ω ∈ B(L2 (R × S 1 ))∗ . Indeed according to Theorem 1.5 of [16], the set on the right-hand side is a C∗ -algebra generated by W and the above equality follows immediately from Theorem 4.2 (in the present setting V = W ). Formula (4.22) shows that (4.8) is an adapted operator in the sense of [16, Definition 1.3]. Comparing (5.1) with Statement 5 of Theorem 1.6 of [16] one can easily find the unitary antipode R of our quantum group. It acts on a, b, β, w as follows: a R = a −1 ,

β R = −αwsgn b β,

bR = −ei /2 ba −1 , wR = w∗ .

358

W. Pusz, S.L. Woronowicz

The action of the scaling group is described by the formulae: τt (a) = a,

τt (β) = β,

τt (b) = et b, τt (w) = w. In the following, Tr denotes the trace of operators acting on L2 (R × S 1 ) and E0 denotes the orthogonal projection onto kernel of L − χ (b > 0): E0 = χ L − χ (b > 0) = 0 = χ L = 0 and b < 0 + χ L = 1 and b > 0 . The reader should notice that E0 commutes with all operators (5.15). Therefore E0 ⊗ I commutes with the multiplicative unitary W . For any positive c ∈ A we set h(c) = Tr E0 QcQE0 = Tr E0 |b|1/2 c |b|1/2 E0 . Let c = g(log a)f (b), where f, g ∈ C∞ (R). Then c ∈ A. In what follows, dµ(z) denotes the normalized Haar measure on S 1 . One can verify that the operator c QE0 = 1 c |b| 2 E0 is an integral operator: 1 Kc (t , z ; t, z) x(t, z) dt dµ(z) (c |b| 2 E0 x)(t , z ) = R×S 1

with the kernel " "−1/2 g (t /t)f (t) (z /z)χ(t>0) , Kc (t , z ; t, z) = "t " where g () =

1 g(τ )iτ/ dτ 2π R

for > 0 and g () = 0 for < 0. Therefore " " ∗ "Kc (t , z ; t, z)"2 dt dµ(z ) dt dµ(z) h(c c) = R×S 1 ×R×S 1 ∞ 2 d

=

| g ()|

0

=

R

|f (t)|2 dt

1 |g(τ )|2 dτ |f (t)|2 dt < ∞ π R R

for g, f ∈ L2 (R). Let > 0 and c = (I + log2 a)−1 (I + b2 )−1 . Then h(c∗ c ) < ∞ for any > 0. Clearly c → I in strict topology, when → +0. Therefore the left ideal {c ∈ A : h(c∗ c) < ∞} is dense in A. According to the theory developed by Van Daele [10], h is a right Haar weight on our quantum group. See also [20], where the right invariance of h is verified by a straightforward computation. One can easily construct the reduced dual of our quantum group. By definition (see [2, 16]) this is a quantum group (A, ) related to the multiplicative unitary W = W ∗

A New Quantum Deformation of ‘ax + b’ Group

359

( denotes the flip operator acting on the tensor product of a Hilbert space by itself: (x ⊗ y) = y ⊗ x). In particular norm closure . A = (id ⊗ ω)W ∗ : ω ∈ B(L2 (R × S 1 ))∗ Let a, b, β and L be operators introduced by (5.15). One can show that log a, b, β |b| and L are affiliated with A. Furthermore A is generated by these operators. The action of is described by the formula: (c) = W (c ⊗ I )W ∗ = W ∗ (I ⊗ c)W . In particular (a) = a ⊗ a, (b) = b ⊗ a + I ⊗ b , β⊗α wsgn b β β |b|p = (I ⊗ w)− sgn b⊗I β|b|p ⊗ a p + I ⊗ β|b|p

− sgn(b⊗b)

,

(5.16)

(L) = L ⊗ I + I ⊗ L, where w = α −2L = s −L . To derive the second and third formulae one has to use the second versions of formulae (1.7) and (1.11). The details are left to the reader. The last relation in (5.16) shows that (w) = w ⊗ w. It is easy to verify that operators a, b, β and w obey the same commutation relations as a, b, β and w. Using this fact one can show that there exists ψ ∈ Mor(A, A) such that ψ(a) = a, ψ(b) = b, ψ(ibβ) = i bβ and ψ(w) = s −L . Let opp be the comultiplication opposite to : opp = flipo. Comparing formulae (4.24) with (5.16) we see that opp (ψ(c)) = (ψ ⊗ ψ)(c)

(5.17)

for c = a, b, β |b|p , w. Functions of these operators generate A, so (5.17) holds for any c ∈ A. Let us notice that the operator (id ⊗ ψ)W = G (b ⊗ b, β ⊗ β) e log a⊗log a s −L⊗L i

commutes with . There exists an independent proof of (5.17) based on this observation. 6. ‘ax + b’-Groups at Roots of Unity iπ 2

In this section we shall assume that q 2 = e−i is a root of unity. Then s = α 2 = −e is a root of unity. Let N be the smallest natural number such that s N = 1. Formula (4.1) shows now that wN commutes with β. Consequently wN commutes with all elements of A and the set norm closure CN = (w N − I )c : c ∈ A

360

W. Pusz, S.L. Woronowicz

is a two-sided ideal in A. We know that (w) = w ⊗ w. Therefore (w N − I ) = w N ⊗ w N − I ⊗ I = w N ⊗ (w N − I ) + (w N − I ) ⊗ I. It shows that (CN ) ⊂ A ⊗ CN + CN ⊗ A and the comultiplication goes down to the quotient algebra AN = A/CN . More precisely there exists N ∈ Mor(AN , AN ⊗ AN ) such that N (π(c)) = (π ⊗ π )(c). In this formula π ∈ Mor(A, AN ) denotes the canonical epimorphism from A onto AN = A/CN . From now until the end of this section we shall work with the quantum group (AN , N ). To simplify notation we shall omit π and write a, b, β |b| and w instead of π(a), π(b), π(β |b|) and π(w). These operators are affiliated with AN and we have the following commutation relation: a ∗ = a, a > 0, b∗ = b, a

o

b,

β ∗ = β, β 2 = I, βaβ = a, βbβ = −b, w∗ w = ww∗ = I, w∗ aw = a, w∗ bw = b,

(6.1)

w∗ βw = s sgn b β, wN = I . The action of N is described by the formulae identical with (4.24). It is not difficult to describe AN as a concrete C∗ -algebra and find the multiplicative unitary corresponding to (AN , N ). To this end one has to repeat the considerations of Sect. 4 replacing S 1 by the cyclic group of N elements: ZN = s : = 0, 1, . . . , N − 1 = z ∈ S 1 : zN = 1 . In particular elements of AN will be operators acting on L2 (R × ZN ). To define a, b, β and w we shall use the same formulae (4.2) with necessary reinterpretation: now x ∈ L2 (R × ZN ) and z runs over ZN . One can easily verify that a, b, β and w satisfy the relations (6.1). Now the formula (4.5) defines a C∗ -algebra acting on L2 (R × ZN ). This algebra is isomorphic to AN . It is not possible to find a selfadjoint operator L acting on L2 (R × ZN ) such that Sp L ⊂ Z and w∗ Lw = L + I . However there is a replacement for (I ⊗ w)L⊗I . Instead of L we shall use an operator u acting on L2 (R × ZN ) according to the formula: (ux)(t, z) = x(t, s −1 z). One can easily verify that u is a unitary operator commuting with a, b, β such that wuw∗ = su and uN = I . By the last relation Sp u = ZN . Similarly Sp w ⊂ ZN . We shall use the bicharacter describing the selfduality of the group ZN : Ch : ZN × ZN −→ S 1 . By definition Ch(s , s k ) = s −k for any k, ∈ Z. The reader should notice that Ch(s , z) = z− and Ch(sz , z) = z−1Ch(z , z) for any ∈ Z and z, z ∈ ZN . Using the last formula and remembering that wuw∗ = su we obtain (w ⊗ I ) Ch(u ⊗ I, I ⊗ w)(w ∗ ⊗ I ) = Ch(su ⊗ I, I ⊗ w) = (I ⊗ w ∗ ) Ch(u ⊗ I, I ⊗ w). Therefore Ch(u ⊗ I, I ⊗ w)(w ⊗ I ) Ch(u ⊗ I, I ⊗ w)∗ = w ⊗ w.

(6.2)

A New Quantum Deformation of ‘ax + b’ Group

361

One should compare this formula with (4.4). It shows that Ch(u ⊗ I, I ⊗ w) is the right replacement for (I ⊗ w)L⊗I . For the moment we shall use the Hilbert space K and operators a, b, β and L the same as in Sect. 4. Remembering that Ch(s , z) = z− we obtain Ch(s −L ⊗ I, I ⊗ w) = (I ⊗ w)L⊗I and formula (4.8) takes the form: V = G (b ⊗ b, β ⊗ β)∗ e log a⊗log a Ch(w ⊗ I, I ⊗ w), i

(6.3)

where w = s −L . Taking into account (4.6) we see that w is a unitary operator with Sp w ⊂ ZN , it commutes with a and b and β wβ = s −L+sgn b = s sgn b w. We should note that in order to define V ∈ M(K(K) ⊗ AN ) we need only the operators a, b, β, w acting on K. These operators must have the following properties: 1. a, b are selfadjoint, a > 0, ker a = ker b = {0} and a

o

b,

2. β is a unitary involution, β commutes with a and anticommutes with b, 3. w is unitary with Sp w ⊂ ZN , w commutes with a and b,

(6.4)

4. β w β = s sgn b w. In this way the operator L disappears from our setup. Let us notice that operators a, b, β, w satisfy the same commutation relations as a, b, β, w. From the beginning of this section we assumed that the deformation parameters s 2 and are related by the formula s = α 2 = −eiπ / . Repeating (with the necessary modifications indicated above) the considerations of Sect. 4 we obtain the following formulae: ∗ i W = G ei /2 b−1 a ⊗ b, αw sgn b β ⊗ β e− log|b|⊗log a Ch(u ⊗ I, I ⊗ w), (6.5) ∗ V12 V13 = W23 V12 W23 , (6.6) N (c) = W (c ⊗ I )W ∗ (6.7) for any c ∈ AN . Assume that K = L2 (R × ZN ), a = |b|−1 , β = αw sgn b β, b = ei /2 b−1 a, and w = u. One can easily verify that these operators obey the properties listed in (6.4). In particular β 2 = I , β ∗ = β, β w β = wsgn b uw − sgn b = s sgn b u = s sgn b w, where in the second step we used the relation wuw∗ = su. With this choice, operators (6.3) and (6.5) coincide: V = W and (6.6) shows that W is a multiplicative unitary. Using the method described in Sect. 5 one can show that W is modular with Q = a 1/2 and Q = |b|1/2 . This is the multiplicative unitary corresponding to the quantum group (AN , N ). It can be used to determine the action of the unitary antipode and scaling group, the Haar weight and the reduced dual (AN , N ). We know that a, b, β and w satisfy the same commutation relations as a, b, β and w. Furthermore formula (6.3) is symmetric: replacing operators without ‘hats’ by corresponding operators with ‘hats’ we obtain an element of M(A ⊗sym A). Using these facts one can show that the quantum group (AN , N ) is opp isomorphic to (AN , N ). Acknowledgement. The authors acknowledge the financial support of the Polish Committee for Scientific Research (KBN, grants 115/E-343/SPB/6.PRUE/DIE50/2005-2008 and 2 P03A 040 22) and of the Foundation for Polish Science.

362

W. Pusz, S.L. Woronowicz

References 1. Arveson, W.: An invitation to C ∗ -Algebra. New York-Heidelberg-Berlin: Springer-Verlag, 1976 2. Baaj, S., Skandalis, G.: Unitaries multiplicatifs et dualité pour les produits croisé de C ∗ -algèbres. Ann. Sci. Ec. Norm. Sup. 4e série 26, 425–488 (1993) 3. Kustermans, J., Vaes, S.: Locally compact quantum groups. Ann. Sci. de l’Ecole Normale Supérieure 33(6), 837–934 (2000) 4. Masuda, T., Nakagami, Y., Woronowicz, S.L.: A C ∗ -algebraic framework for the quantum groups. Int. J. Math. 14(9), 903–1001 (2003) 5. Napiórkowski, K., Woronowicz, S.L.: Operator theory in the C ∗ -algebra framework. Rep. Math. Phys. 31(3), 353–371 (1992) 6. Pedersen, G.K.: C∗ -algebras and their Automorphism Groups. London-New York-San Francisco: Academic Press, 1979 7. Rowicka, M.: Exponential Equations Related to the Quantum ‘ax +b’ Group. Commun. Math. Phys. 244, 419–453 (2004) 8. Sołtan, P.M., Woronowicz, S.L.: A remark on manageable multiplicative unitaries. Lett. Math. Phys. 57, 239–252 (2001) 9. Sołtan, P.M.: New deformations of the group of affine transformations of the plane. Doctor dissertation (in Polish), University of Warsaw (2003) 10. Van Daele, A.: The Haar measure on some locally compact quantum groups. http://arxiv.org/list/math.OA/0109004v1, 2001 11. Van Daele, A., Woronowicz, S.L.: Duality for the quantum E(2) group. Pa. J. Math. 173, 375–385 (1996) 12. Woronowicz, S.L.: Unbounded elements affiliated with C ∗ -algebras and non-compact quantum groups. Commun. Math. Phys. 136, 399–432 (1991) 13. Woronowicz, S.L.: C∗ -algebras generated by unbounded elements. Rev. Math. Phys. 7(3), 481–521 (1995) 14. Woronowicz, S.L.: Quantum E(2) group and its Pontryagin dual. Lett. Math. Phys. 23, 251–263 (1991) 15. Woronowicz, S.L.: Operator equalities related to quantum E(2) group. Commun. Math. Phys. 144, 417–428 (1992) 16. Woronowicz, S.L.: From multiplicative unitaries to quantum groups. Int. J. Math. 7, 127–149 (1996) 17. Woronowicz, S.L.: Quantum exponential function. Rev. Math. Phys. 12(6), 873–920 (2000) 18. Woronowicz, S.L.: Quantum ‘az + b’ group on complex plane. Int. J. Math. 12, 461–503 (2001) 19. Woronowicz, S.L., Zakrzewski, S.: Quantum ‘ax + b’ group. Rev. Math. Phys. 14(7 & 8), 797–828 (2002) 20. Woronowicz, S.L.: Haar weight on some quantum groups. In: Group 24: Physical and mathematical aspects of symmetries, Proceedings of the 24th International Colloquium on Group Theoretical Methods in Physics Paris, 15 - 20 July 2002, Inst. of Physics, Conference Series Number 173, pp. 763–772 Communicated by A. Connes

Commun. Math. Phys. 259, 363–366 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1362-1

Communications in

Mathematical Physics

Orthomodular Lattices Generated by Graphs of Functions W. Cegła1 , J. Florek2 1 2

Institute of Theoretical Physics, University of Wrocław, pl. Maxa Borna 9, 50–204 Wrocław, Poland. E-mail: [email protected] Institute of Mathematics, University of Economics, ul. Komandorska 118/120, 53–345 Wrocław, Poland. E-mail: [email protected]

Received: 12 July 2004 / Accepted: 8 December 2004 Published online: 19 May 2005 – © Springer-Verlag 2005

Abstract: In a subset Z ⊆ R × M, where R is the real line and M is an arbitrary topological space, an orthogonality relation is constructed from a family of graphs of continuous functions from connected subsets of R to M. It is shown that under two conditions on this family a complete lattice of double orthoclosed sets is orthomodular. 1. Introduction An orthomodularity condition is important in the quantum logic approach to quantum theory [4]. Any complete orthomodular lattice can be constructed using the orthogonality space (Z, ⊥), where Z is a nonempty set and ⊥, called the orthogonality relation, is a symmetric and irreflexive relation on Z, i.e. x ⊥ y ⇒ y ⊥ x and x ⊥ x for all x, y ∈ Z. Having (Z, ⊥) one defines A⊥ := {x ∈ Z; x ⊥ a ∀ a ∈ A}, A⊥⊥ := (A⊥ )⊥ for all A ⊆ Z and one considers the family of double orthoclosed sets ζ (Z, ⊥) := {A ⊆ Z; A = A⊥⊥ }. This family ζ (Z, ⊥) partially ordered by set-theoretical inclusion and equipped with the orthocomplementation A → A⊥ , with l.u.b. and g.l.b. given respectively by the formulas Aj = (∪Aj )⊥⊥ Aj = ∩Aj , forms a complete ortholattice [1], which in general is not orthomodular. To verify the orthomodularity of ζ (Z, ⊥) it is enough to prove one of the equivalent conditions discussed in [3], namely, if A is an orthogonal subset of Z (which means that x ⊥ y for all x, y ∈ A with x = y), then (OM)

x ∈ Z, x ∈ / A⊥ , and x ∈ / A⊥⊥ ⇒ A⊥ ∩ (x ⊥ ∩ A⊥ )⊥ = ∅.

Let R × M be the topological product of the real line R and arbitrary topological space M and p be a canonical projection of R × M on R. Let G be a set of continuous functions of the form g : Dg → M, where the domain Dg of g is a connected subset of R. We shall identify the function with its graph.

364

W. Cegła, J. Florek

We define a space (Z, ≤)G and a space (Z, ⊥)G generated by the family G as follows: Z= g, g∈G

x≤y x⊥y

iff there is g ∈ G such that {x, y} ⊆ g and p(x) ≤ p(y), iff x ≤ y and y ≤ x iff there is no g ∈ G such that {x, y} ⊆ g.

Of course (Z, ≤)G is antisymmetric and reflexive so (Z, ⊥)G is an orthogonality space. Two conditions for the set G listed in Sect. 3 ensure that the resulting lattice ζ (Z, ⊥)G is orthomodular. The first one says the (Z, ≤)G is a partial order space, the second one is purely topological. Section 3 contains the proof of orthomodularity of ζ (Z, ⊥). Three examples are discussed in Sect. 4. 2. Definitions and Symbols Let the space (Z, ≤)G and the orthogonality space (Z, ⊥)G be generated by the family G and p be a canonical projection of R × M on R. For a ∈ Z and A ⊆ Z we define: + a : = {z ∈ Z : a ≤ z} = z ∈ Z : p(z) ≥ p(a) ∧ ∃ (z ∈ g ∧ a ∈ g) , g∈G − a : = {z ∈ Z : z ≤ a} = z ∈ Z : p(z) ≤ p(a) ∧ ∃ (z ∈ g ∧ a ∈ g) , g∈G a ⊥ : = {z ∈ Z : z ⊥ a} = z ∈ Z : ∀ (z ∈ g ⇒ a ∈ / g) , g∈G + + A := a , a∈A

A− : =

a− ,

a∈A ⊥

A :=

⊥

a = z ∈ Z : ∀ (z ∈ g ⇒ g ∩ A = ∅) . g∈G

a∈A

One can check that: (I) a ≤ z iff z ∈ a + iff a ∈ z− , (II) a ⊥ = (a + ∪ a − ) and A⊥ = (A+ ∪ A− ) , where denotes the set complement in Z. For A ⊆ Z and g ∈ G we denote p(g ∩ A) = {p(a) ∈ Dg : a ∈ g ∩ A} . Then it is easy to see that: (III) p(g ∩ ( Ai )) = p(g ∩ Ai ) , i∈I i∈I (IV) p(g ∩ ( Ai )) = p(g ∩ Ai ) . i∈I

i∈I

Because g ⊆ Z so p(g ∩ Z) = Dg . Hence we have (V) p(g ∩ A ) = [p(g ∩ A)] , where A and [p(g ∩ A)] denote the set complement in Z and in Dg respectively.

Orthomodular Lattices Generated by Graphs of Functions

365

3. Basic Results From now on we assume that G satisfies the following conditions: (*)

∀

x,y,z∈Z

(**) ∀

z∈Z

(x ≤ y

∧

y ≤ z ⇒ x ≤ z),

z+ \ {z} and z− \ {z} are open sets in R × M.

Remark 1. The condition (*) is equivalent to the following one:

∀ ∀ ∀ p(y) ∈ p(g ∩ x + ) ∧ p(z) > p(y) => p(z) ∈ p(g ∩ x + ) .

x∈Z g∈G y,z∈g

Lemma 1. Let f ∈ G, A ⊆ Z, f ∩ A = ∅. (i) f ∩ A+ = ∅ ∧ f ⊆ A+ ⇒ f, A+ = (t, ∞) ∩ Df , where t ∈ Df , (ii) f ∩ A− = ∅ ∧ f ⊆ A− ⇒ f, A− = (−∞, s) ∩ Df , where s ∈ Df , (iii) f ∩ A⊥ = ∅ ∧  f ⊆ A⊥ ⇒ for f ∩ A+ = ∅ and f ∩ A− = ∅,  [s, t] ⊥ ⇒ p(f ∩ A ) = (−∞, t] ∩ Df for f ∩ A+ = ∅ and f ∩ A− = ∅,  [s, ∞) ∩ D for f ∩ A+ = ∅ and f ∩ A− = ∅, f + − where t = inff, A and s = supf, A . Proof of Lemma 1. We shall prove (i). Let A = {a}, f ∩ a + = ∅ and f ⊆ a + . Because f is continuous on Df and by (∗∗) a + \ {a} is an open set in Z so p(f ∩ a + ) = p(f ∩ (a + \ {a})) is an open set in Df . Hence using Remark 1, there exists ta ∈ Df such that p(f ∩ a + ) = (ta , ∞) ∩ Df . It is enough to see that by (*) and (III), p(f ∩ A+ ) = p(f ∩ a + ) = (ta , ∞) ∩ Df = (t, ∞) ∩ Df , where t ∈ Df . a∈A

a∈A

The proof in the case (ii) goes in a similar way. We shall prove (iii). If f ∩ A⊥ = ∅ then f ⊆ A+ and f ⊆ A− . If f ⊆ A⊥ then f ∩ A+ = ∅ or f ∩ A− = ∅. By (II), (III), (V) we obtain p(f ∩ A⊥ ) = [p(f ∩ A+ )] ∩ [p(f ∩ A− )] . Hence by (i), (ii) we get (iii). Theorem 1. If (Z, ⊥)G is the orthogonality space generated by the family G satisfying conditions (*) and (**) then ζ (Z, ⊥)G = {A ⊆ Z : A = A⊥⊥ } is an orthomodular lattice. Proof of Theorem 1. We shall prove that (Z, ⊥)G satisfies the (OM) condition. Assume that A ⊆ Z, x ∈ Z, x ∈ / A⊥ and x ∈ / A⊥⊥ . From the assumption x ∈ / A⊥⊥ = {z ∈ ⊥ Z : ∀ (z ∈ g ⇒ g ∩ A = ∅)} it follows that there exists f ∈ G such that x ∈ f g∈G

and f ∩ A⊥ = ∅. From the assumption x ∈ / A⊥ it follows, by (II), that x ∈ A+ or − + x ∈ A . We consider only the case x ∈ A . The proof in the second case proceeds in a similar way. Because f ∩ A⊥ = ∅ and x ∈ f ∩ A+ , so by Lemma 1(i), (iii), there exists y ∈ f ∩ A⊥ such that p(f ∩ A+ ) = (p(y), ∞) ∩ Df and p(y) < p(x). For every g ∈ G such that y ∈ g we have the following: (j) z ∈ g ∧ p(z) ≤ p(y) ⇒ z ∈ x − , (jj) z ∈ g

∧

p(z) > p(y) ⇒ z ∈ A+ ,

(jjj) g ∩ (x ∪ A)⊥ = ∅ .

366

W. Cegła, J. Florek

Case (j). Because z ∈ g, y ∈ g and p(z) ≤ p(y) therefore z ∈ y − . Because y ∈ f , x ∈ f and p(y) < p(x) so y ∈ x − . Hence, by Remark 1, we obtain z ∈ x − . Case (jj). Because y ∈ g, z ∈ g and p(z) > p(y) therefore y ∈ z− . Because y ∈ f ∩z− so p(y) ∈ p(f ∩ z− ). Hence, by (IV) and by Lemma 1(ii), p(f ∩ A+ ∩ z− ) = p(f ∩ A+ ) ∩ p(f ∩ z− ) = (p(y), ∞) ∩ p(f ∩ z− ) is an open interval. Then there exists v ∈ A+ ∩ z− . Hence, by (I), z ∈ v + , v ∈ A+ and, by (*), z ∈ A+ . Case (jjj). Observe that, by (j), (jj) and (II), g ⊆ x − ∪ A+ ⊆ (x ∪ A)− ∪ (x ∪ A)+ = ((x ∪ A)⊥ ) . By (jjj), y ∈ (x∪A)⊥⊥ = {z ∈ Z : ∀ (z ∈ g ⇒ g∩(x∪A)⊥ = ∅)}. But y ∈ A⊥ , so by g∈G

a property of an orthogonality relation, we have y ∈ A⊥ ∩(x∪A)⊥⊥ = A⊥ ∩(x ⊥ ∩A⊥ )⊥ . 4. The Examples Example 1. Let M be a normed vector space and Z be an open set in R × M. Let G be the family of all functions g : Dg ⊆ R → M on connected sets Dg of R with values in M satisfying the sharp Lipschitz condition with constant α > 0, ∀

such that Z =

t,s∈Dg

g(t) − g(s) < α|t − s| ,

g.

g∈G

The family G satisfies the condition (*), which is obvious, and condition (**) follows because Z is an open set in R×M and the balls in the normed vector space M are convex sets. The following example is a special case of Example 1. Example 2. Let Z = R × Rn and G be the family of all translations of graphs of linear functions g : R → Rn satisfying sharp Lipschitz condition with constant α = 1. The orthogonality relation in this case means a space-like or light-like separation. It was shown in [2] that ζ (Z, ⊥)G forms the orthomodular lattice. Example 3. If Z = {0} × M, G = {{(0, m)}; m ∈ M} and g = {(0, m)} ∈ G is a function with domain Dg = {0} such that g(0) = m. Two points (0, m) and (0, n) are orthogonal iff m = n. Of course ζ (Z, ⊥) is isomorphic to the complete, atomic Boolean lattice of all subsets of the set M. Acknowledgements. The authors are very grateful to an anonymous referee for the very detailed remarks and suggestions which improved the paper.

References 1. Birkhoff, G.: Lattice Theory. Amer. Math. Soc. Colloq. Publ. XXV, Providence, RI: Amer. Math. Soc., 1967 2. Cegła, W., Jadczyk, A.Z.: Commun. Math. Phys. 57, 213–217 (1977) 3. Foulis, D.J., Randall, C.H.: Lexicographic Orthogonality. J. Combin. Theory 11, 157–162 (1971) 4. Pták, P., Pulmannová, S.: Orthomodular Structures as Quantum Logics. Number 55 in “Fundamental Theories of Physics”, Dordrecht: Kluwer, 1991 Communicated by M.B. Ruskai

Commun. Math. Phys. 259, 367–389 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1367-9

Communications in

Mathematical Physics

Large n Limit of Gaussian Random Matrices with External Source, Part II Alexander I. Aptekarev1 , Pavel M. Bleher2 , Arno B.J. Kuijlaars3 1

Keldysh Institute of Applied Mathematics, Russian Academy of Sciences, Miusskaya Square 4, Moscow 125047, Russia. E-mail: [email protected] 2 Department of Mathematical Sciences, Indiana University-Purdue University Indianapolis, 402 N. Blackford St., Indianapolis, IN 46202, U.S.A. E-mail: [email protected] 3 Department of Mathematics, Katholieke Universiteit Leuven, Celestijnenlaan 200 B, 3001 Leuven, Belgium. E-mail: [email protected] Received: 24 August 2004 / Accepted: 8 February 2005 Published online: 2 June 2005 – © Springer-Verlag 2005

Abstract: We continue the study of the Hermitian random matrix ensemble with external source 1 −nTr( 1 M 2 −AM) 2 e dM, Zn where A has two distinct eigenvalues ±a of equal multiplicity. This model exhibits a phase transition for the value a = 1, since the eigenvalues of M accumulate on two intervals for a > 1, and on one interval for 0 < a < 1. The case a > 1 was treated in Part I, where it was proved that local eigenvalue correlations have the universal limiting behavior which is known for unitarily invariant random matrices, that is, limiting eigenvalue correlations are expressed in terms of the sine kernel in the bulk of the spectrum, and in terms of the Airy kernel at the edge. In this paper we establish the same results for the case 0 < a < 1. As in Part I we apply the Deift/Zhou steepest descent analysis to a 3 × 3-matrix Riemann-Hilbert problem. Due to the different structure of an underlying Riemann surface, the analysis includes an additional step involving a global opening of lenses, which is a new phenomenon in the steepest descent analysis of Riemann-Hilbert problems. 1. Introduction This paper is a continuation of [6] to which we will frequently refer in this paper. It will be followed by a third part [8], which deals with the critical case. In these papers, we The first and third author are supported in part by INTAS Research Network NeCCA 03-51-6637 and by NATO Collaborative Linkage Grant PST.CLG.979738. The first author is supported in part by RFBR 05-01-00522 and the program “Modern problems of theoretical mathematics” RAS(DMS). The second author is supported in part by the National Science Foundation (NSF) Grant DMS-0354962. The third author is supported in part by FWO-Flanders projects G.0176.02 and G.0455.04 and by K.U.Leuven research grant OT/04/24 and by the European Science Foundation Program Methods of Integrable Systems, Geometry, Applied Mathematics (MISGAM) and the European Network in Geometry, Mathematical Physics and Applications (ENIGMA)

368

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

study the random matrix ensemble with external source A, µn (dM) =

1 −nTr(V (M)−AM) e dM, Zn

(1.1)

defined on n × n Hermitian matrices M, with Gaussian potential V (M) =

1 2 M , 2

(1.2)

and with external source A = diag(a, . . . , a , −a, . . . , −a ). n/2

(1.3)

n/2

In the physics literature, the ensemble (1.1) was studied in a series of papers of Br´ezin and Hikami [9–13], and P. Zinn-Justin [33, 34]. The Gaussian ensemble, V (M) = 21 M 2 , has been solved, in the large n limit, in the papers of Pastur [29] and Br´ezin-Hikami [9– 13], by using spectral methods and a contour integration formula for the determinantal kernel. The contour integration technique has been extended in the recent work of Tracy and Widom [30] to the large n double scaling asymptotics of the determinantal kernel at the critical point. Our aim is to develop a completely different approach to the large n asymptotics of the Gaussian ensemble with external source. Our approach is based on the Riemann-Hilbert problem and it is applicable, in principle, to a general V . We develop the Deift-Zhou nonlinear steepest descent method for Riemann-Hilbert (RH) problems [18], thereby extending the works [3, 4, 16, 17, 25, 26] who treated the unitary invariant case (i.e., A = 0) with RH techniques. While the unitary invariant case is connected with orthogonal polynomials [15, 27], the ensemble (1.1) is connected with multiple orthogonal polynomials [5]. These are characterized by a matrix RH problem [32], and the eigenvalue correlation kernel of (1.1) has a direct expression in terms of the solution of this RH problem, see [5, 14] and also formula (1.14) below. The RH problem for (1.1) has size (r + 1) × (r + 1) if r is the number of distinct eigenvalues of A. So with the choice (1.3), the RH problem is 3 × 3-matrix valued. The asymptotic analysis of RH problems has been mostly restricted to the 2 × 2 case. The analysis of larger size RH problems presents some novel technical features as already demonstrated in [6, 24]. In the present paper another new feature appears, namely at a critical stage in the analysis we perform a global opening of lenses. This global opening of lenses requires a global understanding of an associated Riemann surface, which is explicitly known for the Gaussian case (1.2). This is why we restrict ourselves to (1.2) although in principle our methods are applicable to a more general polynomial V . The Gaussian case has some special relevance in its own right as well. Indeed, first of all we note that for (1.2) we can complete the square in (1.1), and then it follows that M = M0 + A,

(1.4)

where M0 is a GUE matrix. So in the Gaussian case the ensemble (1.1) is an example of a random + deterministic model, see also [9–13]. A second interpretation of the Gaussian model comes from non-intersecting Brownian paths. This can be seen from the joint probability density for the eigenvalues

Large n Limit of Gaussian Random Matrices with External Source, Part II

369

of M, which by the HarishChandra/Itzykson-Zuber formula [19, 27], takes the form 1 ˜ Zn

n 1 n 2 (λj − λk ) det enλj ak j,k=1 e− 2 nλj

(1.5)

j =1

1≤j
for the case (1.2). Here a1 , . . . , an are the eigenvalues of A, which are assumed to be all distinct in (1.5). In the case of coinciding eigenvalues of A we have to take the appropriate limit of (1.5), see formula (3.17) in [5]. Formula (1.5) also arises as the distribution of non-intersecting Brownian paths. Consider n independent Brownian motions (in fact Brownian bridges) on the line, starting at some fixed points s1 < s2 < · · · < sn at time t = 0, ending at some fixed points b1 < b2 < . . . < bn at time t = 1, and conditioned not to intersect for t ∈ (0, 1). Then by a theorem of Karlin and McGregor [21], the joint probability density of the positions of the Brownian bridges at time t ∈ (0, 1) is given by pn (x1 , . . . , xn ) =

1 det(p(sj , xk ; t))nj,k=1 det(p(xj , bk ; 1 − t))nj,k=1, Cn

(1.6)

where p(x, y; t) is the transition kernel of the Brownian motion and Cn is a normalization constant. Let us consider a scaled Brownian motion for which n − n(x−y)2 , 2t (1.7) p(x, y; t) = e 2πt and let us take a limit when all initial points sj converge to the origin. In this case formula (1.6) takes the form nx b n n n j k 2 1 pn (x1 , . . . , xn ) = (xj − xk ) det e 1−t e− 2t (1−t) xj . (1.8) ¯ Cn 1≤j
(1.9)

So at any time t ∈ (0, 1) the positions of n non-intersecting Brownian bridges starting at 0 and ending at specified points are distributed as the eigenvalues of a Gaussian random matrix with external source. The connection between random matrices and non-intersecting random paths is actually well-known, see e.g. the recent works [2, 20, 22, 28] and references cited therein. P. Zinn-Justin showed that the m-point correlation functions for the eigenvalues of M have determinantal form (1.10) Rm (λ1 , . . . , λm ) = det Kn (λj , λk ) 1≤j,k≤m . It was shown in [5] that the average characteristic polynomial P (z) = E [det(zI − M)] is a multiple orthogonal polynomial of type II, which for the Gaussian case (1.2) is a multiple Hermite polynomial, see [1, 7], and that the correlation kernel Kn can be

370

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

expressed in terms of the solution of the RH problem for multiple orthogonal polynomials [32], see also [14]. We state the RH problem here for the Gaussian case (1.2) and for the external source (1.3) where n is even. Then the RH problem asks for a 3 × 3 matrix valued function Y satisfying the following: • Y : C \ R → C3×3 is analytic. • For x ∈ R, there is a jump



 1 w1 (x) w2 (x) 0  Y+ (x) = Y− (x) 0 1 0 0 1

(1.11)

where w1 (x) = e−n(x

2 /2−ax)

,

w2 (x) = e−n(x

2 /2+ax)

,

(1.12)

and Y+ (x) (Y− (x)) denotes the limit of Y (z) as z → x ∈ R from the upper (lower) half-plane. • As z → ∞, we have  n  z 0 0 Y (z) = (I + O(1/z))  0 z−n/2 0  . (1.13) 0 0 z−n/2 The RH problem has a unique solution in terms of multiple Hermite polynomials and their Cauchy transforms [5, 32]. The correlation kernel Kn is expressed in terms of Y as follows:   1 − 41 n(x 2 +y 2 ) e 0 enay e−nay Y −1 (y)Y (x) 0 . Kn (x, y) = (1.14) 2πi(x − y) 0 . Our goal is to analyze the above RH problem in the large n limit and to obtain from this scaling limits of the kernel (1.14) in various regimes. In this paper we consider the case 0 < a < 1. The case a > 1 was considered in [6] and the critical case a = 1 will be considered in [8]. First we describe the limiting mean density of eigenvalues. Theorem 1.1. The limiting mean density of eigenvalues ρ(x) = lim

n→∞

1 Kn (x, x) n

(1.15)

exists for every a > 0. It satisfies ρ(x) =

1 | Im ξ(x)| , π

(1.16)

where ξ = ξ(x) is a solution of the cubic equation, ξ 3 − xξ 2 − (a 2 − 1)ξ + xa 2 = 0.

(1.17)

The support of ρ consists of those x ∈ R for which (1.17) has a non-real solution.

Large n Limit of Gaussian Random Matrices with External Source, Part II

371

(a) For 0 < a < 1, the support of ρ consists of one interval [−z1 , z1 ], and ρ is real analytic and positive on (−z1 , z1 ), and it vanishes like a square root at the edge points ±z1 , i.e., there exists a constant ρ1 > 0 such that ρ1 ρ(x) = |x ∓ z1 |1/2 (1 + o(1)) as x → ±z1 , x ∈ (−z1 , z1 ). (1.18) π (b) For a = 1, the support of ρ consists of one interval [−z1 , z1 ], and ρ is real analytic and positive on (−z1 , 0) ∪ (0, z1 ), it vanishes like a square root at the edge points ±z1 , and it vanishes like a third root at 0, i.e., there exists a constant c > 0 such that ρ(x) = c|x|1/3 (1 + o(1)) ,

as x → 0.

(1.19)

(c) For a > 1, the support of ρ consists of two disjoint intervals [−z1 , −z2 ] ∪ [z2 , z1 ] with 0 < z2 < z1 , ρ is real analytic and positive on (−z1 , −z2 ) ∪ (z2 , z1 ), and it vanishes like a square root at the edge points ±z1 , ±z2 . Remark. Theorem 1.1 is a very special case of a theorem of Pastur [29] on the eigenvalues of a matrix M = M0 + A, where M0 is random and A is deterministic as in (1.4). Since in this paper our interest is in the case 0 < a < 1, we show in Sect. 9 how Theorem 1.1 follows from our methods for this case. See [6] for the case a > 1. Remark. Theorem 1.1 has the following interpretation in terms of non-intersecting Brownian motions starting at 0 and ending at some specified points bj . We suppose n is even, and we let half of the bj ’s coincide with b > 0 and the other half with −b. Then as explained before, at time t ∈ (0, 1) the (rescaled) positions of the Brownian paths coincide with the eigenvalues of the Gaussian random matrix with external source (1.3) where t a=b . (1.20) 1−t The phase transition at a = 1 corresponds to t = tc ≡

1 . 1 + b2

So, by Theorem 1.1, the limiting distribution of the Brownian paths as n → ∞ is supported by one interval when t < tc and by two intervals when t > tc . At the critical time tc the two groups of Brownian paths split, with one group ending at t = 1 at b and the other at −b. As in [6] we formulate our main result in terms of a rescaled version of the kernel Kn , Kˆ n (x, y) = en(h(x)−h(y)) Kn (x, y)

(1.21)

for some function h. The rescaling (1.21) does not affect the correlation functions (1.10). Theorem 1.2. Let 0 < a < 1 and let z1 and ρ be as in Theorem 1.1 (a). Then there is a function h such that the following hold for the rescaled kernel (1.21): (a) For every x0 ∈ (−z1 , z1 ) and u, v ∈ R, we have

1 u v sin π(u − v) ˆ lim Kn x0 + , x0 + = . n→∞ nρ(x0 ) nρ(x0 ) nρ(x0 ) π(u − v)

(1.22)

372

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

(b) For every u, v ∈ R we have

1 u v ˆ n z1 + K , z + 1 n→∞ (ρ1 n)2/3 (ρ1 n)2/3 (ρ1 n)2/3 Ai(u) Ai (v) − Ai (u) Ai(v) , = u−v lim

(1.23)

where Ai is the usual Airy function, and ρ1 is the constant from (1.18). Theorem 1.2 is similar to the main theorems Theorem 1.2 and Theorem 1.3 of [6]. It expresses that the local eigenvalue correlations show the universal behavior as n → ∞, both in the bulk and at the edge, that is well-known from unitary random matrix models. So the result itself is not that surprising. To obtain Theorem 1.2 we use the Deift/Zhou steepest descent method for RH problems and a main tool is the three-sheeted Riemann surface associated with Eq. (1.17) as in [6]. There is however an important technical difference with [6]. For a > 1, the branch points of the Riemann surface are all real, and they correspond to the four edge points ±z1 , ±z2 of the support as described in Theorem 1.1 (c). For a < 1, two branch points are purely imaginary and they have no direct meaning for the problem at hand. The other two branch points are real and they correspond to the edge points ±z1 as in Theorem 1.1 (a). See Fig. 1 for the sheet structure of the Riemann surface. The branch points on the non-physical sheets result in a non-trivial modification of the steepest descent method. As already mentioned before, one of the steps involves a global opening of lenses, and this is the main new technical contribution of this paper. The rest of the paper is devoted to the proof of the theorems with the Deift/Zhou steepest descent method for RH problems. It consists of a sequence of transformations which reduce the original RH problem to a RH problem which is normalized at infinity, and whose jump matrices are uniformly close to the identity as n → ∞. In this paper there are four transformations Y → U → T → S → R. A main role is played by the Riemann surface (1.17) and certain λ-functions defined on it. These are introduced in the next section, and they are used in Sect. 3 to define the first transformation Y → U . This transformation has the effect of normalizing the RH problem at infinity, and in addition, of producing “good” jump matrices that are amenable to subsequent analysis. However, contrary to earlier works, some of the jump matrices for U have entries that are exponentially growing as n → ∞. These exponentially growing entries disappear after the second transformation U → T in Sect. 4 which involves the global opening of lenses. The remaining transformation follows the pattern of [6, 16, 17] and other works. The transformation T → S in Sect. 5 involves a local opening of lenses which turns the remaining oscillating entries into exponentially decaying ones. Then a parametrix for S is built in Sects. 6 and 7. In Sect. 6 a model RH problem is solved which provides the parametrix for S away from the branch points, and in Sect. 7 local parametrices are built around each of the branch points with the aid of Airy functions. Using this parametrix we define the final transformation S → R in Sect. 8. It leads to a RH problem for R which is of the desired type: normalized at infinity and jump matrices tending to the identity as n → ∞. Then R itself tends to the identity matrix as n → ∞, which is then used in the final Sect. 9 to prove Theorems 1.1 and 1.2.

Large n Limit of Gaussian Random Matrices with External Source, Part II

373

2. The Riemann Surface and λ-Functions We start from the cubic equation (1.17) which we write now with the variable z instead of x ξ 3 − zξ 2 + (1 − a 2 )ξ + za 2 = 0.

(2.1)

It defines a Riemann surface that will play a central role in the proof. The inverse mapping is given by the rational function z=

ξ 3 − (a 2 − 1)ξ . ξ 2 − a2

(2.2)

There are four branch points ±z1 , ±iz2 with z1 > z2 > 0, which can be found as the images of the critical points under the inverse mapping. The mapping (2.2) has three inverses, ξj (z), j = 1, 2, 3, that behave near infinity as 1 + O(1/z2 ), z 1 ξ2 (z) = a + + O(1/z2 ), 2z 1 ξ3 (z) = −a + + O(1/z2 ). 2z ξ1 (z) = z −

(2.3)

The sheet structure of the Riemann surface is determined by the way we choose the analytical continuations of the ξj ’s. It may be checked that ξ1 has an analytic continuation to C \ [−z1 , z1 ], which we take as the first sheet. The functions ξ2 and ξ3 have analytic continuations to C \ ([0, z1 ] ∪ [−iz2 , iz2 ]) and C \ ([−z1 , 0] ∪ [−iz2 , iz2 ]), respectively, which we take to be the second and third sheets, respectively. So the second and third sheet are connected along [−iz2 , iz2 ], the first sheet is connected with the second sheet along [0, z1 ], and the first sheet is connected with the third sheet along [−z1 , 0], see Fig. 1.

ξ1

ξ2

ξ3 Fig. 1. The Riemann surface ξ 3 − zξ 2 + (1 − a 2 )ξ + za 2 = 0

374

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

We note the jump relations ξ1∓ = ξ2± ξ1∓ = ξ3± ξ2∓ = ξ3±

on (0, z1 ), on (−z1 , 0), on (−iz2 , iz2 ).

(2.4)

z The λ-functions are primitives of the ξ -functions λj (z) = ξj (s)ds, more precisely z λ1 (z) = ξ1 (s)ds, z1 z ξ2 (s)ds, λ2 (z) = (2.5) z1 z ξ3 (s)ds + λ1− (−z1 ). λ3 (z) = −z1+

The path of integration for λ3 lies in C\((−∞, 0]∪[−iz2 , iz2 ]), and it starts at the point −z1 on the upper side of the cut. All three λ-functions are defined on their respective sheets of the Riemann surface with an additional cut along the negative real axis. Thus λ1 , λ2 , λ3 are defined and analytic on C \ (−∞, z1 ], C \ ((−∞, z1 ] ∪ [−iz2 , iz2 ]), and C \ ((−∞, 0] ∪ [−iz2 , iz2 ]), respectively. Their behavior at infinity is 1 2 z − log z + 1 + O(1/z), 2 1 (2.6) λ2 (z) = az + log z + 2 + O(1/z), 2 1 λ3 (z) = −az + log z + 3 + O(1/z) 2 for certain constants j , j = 1, 2, 3. The λj ’s satisfy the following jump relations: λ1 (z) =

λ1∓ λ1− λ1+ λ2∓ λ2∓ λ1+ λ2+ λ3+

= λ2± = λ3+ = λ3− − πi = λ3± = λ3± − πi = λ1− − 2πi = λ2− + πi = λ3− + πi

on (0, z1 ), on (−z1 , 0), on (−z1 , 0), on (0, iz2 ), on (−iz2 , 0), on (−∞, −z1 ), on (−∞, 0), on (−∞, −z1 ),

(2.7)

where the segment (−iz2 , iz2 ) is oriented upwards. We obtain (2.7) from (2.4), (2.5), and the values of the contour integrals around the cuts in the positive direction ξ2 (s)ds = πi, ξ3 (s)ds = π i, ξ1 (s)ds = −2πi, which follow from (2.3). Remark. We have chosen the segment [−iz2 , iz2 ] as the cut that connects the branch points ±iz2 . We made this choice because of symmetry and ease of notation, but it is not essential. Instead we could have taken an arbitrary smooth curve lying in the region bounded by the four smooth curves in Fig. 2 (see the next section) that connect the points x0 , iz2 , −x0 , and −iz2 . For any such curve, the subsequent analysis would go through without any additional difficulty.

Large n Limit of Gaussian Random Matrices with External Source, Part II

375

3. First Transformation Y → U We define for z ∈ C \ (R ∪ [−iz2 , iz2 ]), U (z) = diag e−n1 , e−n2 , e−n3 Y (z) 1 2 × diag en(λ1 (z)− 2 z ) , en(λ2 (z)−az) , en(λ3 (z)+az) .

(3.1)

This coincides with the first transformation in [6]. Then U solves the following RH problem: • U : C \ (R ∪ [−iz2 , iz2 ]) → C3×3 is analytic. • U satisfies the jumps  n(λ −λ ) n(λ −λ ) n(λ −λ )  e 1+ 1− e 2+ 1− e 3+ 1−  U+ = U−  0 en(λ2+ −λ2− ) 0 0 0 en(λ3+ −λ3− ) and

 1 0 0  0 U+ = U− 0 en(λ2+ −λ2− ) n(λ −λ ) 3+ 3− 0 0 e

• U (z) = I + O(1/z)

on R,

(3.2)



on [−iz2 , iz2 ].

(3.3)

as z → ∞.

The asymptotic condition follows from (1.13), (2.6) and the definition of U . The jump on the real line (3.2) takes on a different form on the four intervals (−∞, −z1 ], [−z1 , 0), (0, z1 ], and [z1 , ∞). Indeed we get from (2.7), (3.2), and the fact that n is even,   1 en(λ2+ −λ1− ) en(λ3+ −λ1− )  on (−∞, −z1 ], U+ = U− 0 (3.4) 1 0 0 0 1  n(λ −λ ) n(λ −λ )  e 1+ 1− e 2+ 1− 1  0 1 0 U+ = U−  on (−z1 , 0), (3.5) n(λ −λ ) 3+ 3− 0 0 e  n(λ −λ )  e 1+ 1− 1 en(λ3 −λ1− )  U+ = U−  (3.6) on (0, z1 ), 0 en(λ2+ −λ2− ) 0 0 0 1   1 en(λ2 −λ1 ) en(λ3 −λ1 )  on [z1 , ∞). U+ = U− 0 (3.7) 1 0 0 0 1 Now to see what has happened it is important to know the sign of Re (λj − λk ) for j = k. Figure 2 shows the curves where Re λj = Re λk . From each of the branch points ±z1 , ±iz2 there are three curves emanating at equal angle of 2π/3. We have Re λ1 = Re λ2 on the interval [0, z1 ] and on two unbounded curves from z1 . Similarly, Re λ1 = Re λ3 on the interval [−z1 , 0] and on two unbounded curves from −z1 . We have Re λ2 = Re λ3 on the curves that emanate from ±iz2 . That is, on the vertical half-lines [iz2 , +i∞) and (−i∞, −iz2 ] and on four other curves, before

376

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars 3 Re λ = Re λ 1 2 Re λ1 = Re λ3 Re λ2 = Re λ3 2

iz2

Im z

1

0

−z

−x

1

z

x

0

1

0

−iz

−1

2

−2

−3 −3

−2

−1

0 Re z

1

2

3

Fig. 2. Curves where Re λ1 = Re λ2 (dashed lines), Re λ1 = Re λ3 (dashed-dotted lines), and Re λ2 = Re λ3 (solid lines). This particular figure is for the value a = 0.4

they intersect the real axis. The points where they intersect the real axis are ±x0 for some x0 ∈ (0, z1 ). After that point we have Re λ1 = Re λ3 for the curves in the right half-plane and Re λ1 = Re λ2 for the curves in the left half-plane. Figure 2 was produced with Matlab for the value a = 0.4. The picture is similar for other values of a ∈ (0, 1). As a → 0+ or a → 1−, the imaginary branch points ±iz2 tend to the origin. Using Fig. 2 and the asymptotic behavior (2.6) we can determine the ordering of Re λj , j = 1, 2, 3 in every domain in the plane. Indeed, in the domain on the right, bounded by the two unbounded curves emanating from z1 , we have Re λ1 > Re λ2 > Re λ3 because of (2.6). Then if we go to a neighboring domain, we pass a curve where Re λ1 = Re λ2 , and so the ordering changes to Re λ2 > Re λ1 > Re λ3 . Continuing in this way, and also taking into account the cuts that we have for the λj ’s, we find the ordering in any domain. Inspecting the jump matrices for U in (3.3)–(3.7), we then find the following: (a) The non-zero off-diagonal entries in the jump matrices in (3.4) and (3.7) are exponentially small, and the jump matrices tend to the identity matrix as n → ∞. (b) The non-constant diagonal entries in the jump matrices in (3.5) and (3.6) have modulus one, and they are rapidly oscillating for large n. (c) The (1, 2)-entry in the jump matrix in (3.5) is exponentially decreasing on (−z1 , −x0 ), but exponentially increasing on (−x0 , 0) as n → ∞. Similarly, the (1, 3)-entry in the jump matrix in (3.6) is exponentially decreasing on (x0 , z1 ), and exponentially increasing on (0, x0 ).

Large n Limit of Gaussian Random Matrices with External Source, Part II

377

(d) The entries in the jump matrix in (3.3) are real. The (2, 2)-entry is exponentially increasing as n → ∞, and the (3, 3)-entry is exponentially decreasing. The exponentially increasing entries observed in items (c) and (d) are undesirable, and this might lead to the impression that the first transformation Y → U was not the right thing to do. However, after the second transformation which we do in the next section, all exponentially increasing entries miraculously disappear. 4. Second Transformation U → T The second transformation involves the global opening of lenses already mentioned in the introduction. It is needed to turn the exponentially increasing entries in the jump matrices into exponentially decreasing ones. Let be a closed curve, consisting of a part in the left half-plane from −iz2 to iz2 , symmetric with respect to the real axis, plus its mirror image in the right half-plane. The part in the left half-plane lies entirely in the region where Re λ2 < Re λ3 and it intersects the negative real axis in a point −x ∗ with x ∗ > z1 , see Fig. 3. So avoids the region bounded by the curves from ±iz2 to ±x0 . In a neighborhood of iz2 we take to be the analytic continuation of the curves where Re λ2 = Re λ3 . As a result, this means that λ2 − λ3

is real on in a neighborhood of iz2 .

(4.1)

3

2

Σ 1

iz

Im z

2

*

0

−x

−z1

z1

*

x

−iz

2

−1

−2

−3

−3

−2

−1

0 Re z

1

2

3

Fig. 3. Contour which is such that Re λ2 < Re λ3 on the part of in the left half-plane and Re λ2 > Re λ3 on the part of in the right half-plane

378

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

This will be convenient for the construction of the local parametrix in Sect. 7. The contour encloses a bounded domain and we make the second transformation in that domain only. So we put T = U outside and inside we put   1 0 0 1 0 for Re z < 0 inside , T = U 0 n(λ −λ ) 2 3 1 0 −e   (4.2) 10 0 T = U 0 1 −en(λ3 −λ2 )  for Re z > 0 inside . 00 1 Then T is defined and analytic outside the contours shown in Fig. 4. Using the jumps for U and the definition (4.2) we calculate the jumps for T on any part of the contour. We get different expressions for six real intervals, for the vertical segment [−iz2 , iz2 ], and for (oriented clockwise) in the left and right half-planes. The result is that T satisfies the following RH problem: • T : C \ (R ∪ [−iz2 , iz2 ] ∪ ) → C3×3 is analytic. • T satisfies the following jump relations on the real line   1 en(λ2+ −λ1− ) en(λ3+ −λ1− )  on (−∞, −x ∗ ], T+ = T− 0 1 0 0 0 1   n(λ −λ ) 1 0 e 3+ 1−  on (−x ∗ , −z1 ], T+ = T− 0 1 0 00 1  n(λ −λ )  e 1+ 1− 0 1  0 1 0 on (−z1 , 0), T+ = T−  0 0 en(λ3+ −λ3− )  n(λ −λ )  e 1+ 1− 1 0 T+ = T−  on (0, z1 ), 0 en(λ2+ −λ2− ) 0 0 0 1

(4.3)

(4.4)

(4.5)

(4.6)

2

Σ

1.5

iz

2

1

•

Im z

0.5

•

0 *

−x

•

•

−z1

z1

• *

x

−0.5 •

−1

−iz

2

−1.5

−2 −4

−3

−2

−1

0 Re z

1

2

Fig. 4. T has jumps on the real line, the interval [−iz2 , iz2 ] and on

3

4

Large n Limit of Gaussian Random Matrices with External Source, Part II



1 en(λ2 −λ1 )  T+ = T− 0 1 0 0  1 en(λ2 −λ1 )  T+ = T− 0 1 0 0

 0 0 1

on [z1 , x ∗ ),

 en(λ3 −λ1 )  0 1

The jump on the vertical segment is   1 0 0  1 T+ = T− 0 0 n(λ −λ ) 3+ 3− 0 −1 e

379

(4.7)

on [z1 , ∞).

(4.8)

on [−iz2 , iz2 ].

(4.9)

The jumps on are 

1 T+ = T− 0 0  1 T+ = T− 0 0

0 1

en(λ2 −λ3 )

 0 0 1 

0 0 1 en(λ3 −λ2 )  0 1

on {z ∈ | Re z < 0},

(4.10)

on {z ∈ | Re z > 0}.

(4.11)

• T (z) = I + O(1/z) as z → ∞. Now the jump matrices are nice. Because of our choice of we have that the jump matrices in (4.10) and (4.11) converge to the identity matrix as n → ∞. Also the jump matrices in (4.3), (4.4), (4.7) and (4.8) converge to the identity matrix as n → ∞. The (3,3)-entry  in the jump matrix in (4.9) is exponentially small, so that this matrix tends 1 0 0 to 0 0 1. 0 −1 0 The jump matrices in (4.6) and (4.7) have oscillatory entries on the diagonal, and they are turned into exponential decaying off-diagonal entries by opening a (local) lens around (−z1 , z1 ). This is the next transformation. 5. Third Transformation T → S We are now going to open up a lens around (z1 , z1 ) as in Fig. 5. There is no need to treat 0 as a special point. The jump matrix on (−z1 , 0), see (4.5), has factorization   n(λ −λ ) e 1 3+ 0 1  0 1 0 T−−1 T+ =  n(λ −λ ) − 1 3 0 0e     1 00 1 00 0 01 0 1 0 , 0 1 0  0 1 0  (5.1) = −λ ) −λ ) n(λ n(λ − + 3 3 1 1 −1 0 0 01 e 01 e

380

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars 2

Σ

1.5

iz2

1

•

Im z

0.5

• •

0

iy*

•

−x*

•

−z

z

1

•

−0.5

1

•

x*

*

−iy

•

−1

−iz2

−1.5

−2 −4

−3

−2

−1

0 Re z

1

2

3

4

Fig. 5. Opening of lens around [−z1 , z1 ]. The new matrix-valued function S has jumps on the real line, the interval [−iz2 , iz2 ], on , and on the upper and lower lips of the lens around [−z1 , z1 ]

and the jump matrix on (0, z1 ), see (4.6), has factorization 

en(λ1 −λ2 )+ T−−1 T+ =  0 0  1 = en(λ1 −λ2 )− 0

1

 0 0 1

en(λ1 −λ2 )− 0    1 00 00 0 10 1 0 −1 0 0 en(λ1 −λ2 )+ 1 0 . 0 01 01 0 01

(5.2)

We open up the lens on [−z1 , z1 ] and we make sure that it stays inside . We assume that the lens is symmetric with respect to the real and imaginary axis. The point where the upper lip intersects the imaginary axis is called iy ∗ . Then we define S = T outside the lens and   1 00 0 1 0 S=T in upper part of the lens in left half-plane, −λ ) n(λ 1 3 −e 01   1 00 0 1 0 S=T in lower part of the lens in left half-plane, n(λ −λ ) 1 3 e 01   (5.3) 1 00 S = T −en(λ1 −λ2 ) 1 0 in upper part of the lens in right half-plane, 0 01   1 00 S = T en(λ1 −λ2 ) 1 0 in lower part of the lens in right half-plane. 0 01 Outside the lens, the jumps for S are as those for T , while on [−z1 , z1 ] and on the upper and lower lips of the lens, the jumps are according to the factorizations (5.1) and (5.2). The result is that S satisfies the following RH problem:

Large n Limit of Gaussian Random Matrices with External Source, Part II

381

• S is analytic outside the real line, the vertical segment [−iz2 , iz2 ], the curve , and the upper and lower lips of the lens around [−z1 , z1 ]. • S satisfies the following jumps on the real line:   1 en(λ2+ −λ1− ) en(λ3+ −λ1− )  S+ = S− 0 on (−∞, −x ∗ ], (5.4) 1 0 0 0 1   1 0 en(λ3+ −λ1− )  on (−x ∗ , −z1 ], S+ = S− 0 1 (5.5) 0 00 1   0 01 S+ = S−  0 1 0 on (−z1 , 0), (5.6) −1 0 0   0 10 S+ = S− −1 0 0 on (0, z1 ), (5.7) 0 01   1 en(λ2 −λ1 ) 0 on [z1 , x ∗ ), S+ = S− 0 (5.8) 1 0 0 0 1   1 en(λ2 −λ1 ) en(λ3 −λ1 )  on [x ∗ , ∞). S+ = S− 0 (5.9) 1 0 0 0 1 S has the following jumps on the segment [−iz2 , iz2 ]:   1 0 0  1 S+ = S− 0 0 on (−iz2 , −iy ∗ ), −λ ) n(λ 0 −1 e 3+ 3−   1 0 0  0 0 1 S+ = S−  on (−iy ∗ , 0), n(λ −λ ) n(λ −λ ) e 1 3− −1 e 3+ 3−   1 0 0  0 0 1 S+ = S−  on (0, iy ∗ ), n(λ −λ ) n(λ −λ ) −e 1 3− −1 e 3+ 3−   1 0 0  1 S+ = S− 0 0 on (iy ∗ , iz2 ). n(λ −λ ) 0 −1 e 3+ 3− The jumps on are



1 S+ = S− 0 0  1 S+ = S− 0 0

0 1

en(λ2 −λ3 )

 0 0 1 

0 0 1 en(λ3 −λ2 )  0 1

(5.10)

(5.11)

(5.12)

(5.13)

on {z ∈ | Re z < 0},

(5.14)

on {z ∈ | Re z > 0}.

(5.15)

382

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

Finally, on the upper and lower lips of the lens, we find jumps   1 00 0 1 0 on the lips of the lens in the left half-plane. S + = S−  en(λ1 −λ3 ) 0 1 (5.16)   1 00 S+ = S− en(λ1 −λ2 ) 1 0 on the lips of the lens in the right half-plane. 0 01 (5.17) • S(z) = I + O(1/z)

as z → ∞.

So now we have 14 different jump matrices (5.4)–(5.17). As n → ∞, all these jumps have limits. Most of the limits are the identity matrix, except for the jumps on (−z1 , z1 ), see (5.6) and (5.7), and on (−iz2 , iz2 ), see (5.10)–(5.13). In the next section we will solve explicitly the limiting model RH problem. The solution to the model problem will be further used in the construction of parametrix away from the branch points. 6. Parametrix Away from Branch Points The model RH problem is the following. Find N such that • N : C \ ([−z1 , z1 ] ∪ [−iz2 , iz2 ]) → C3×3 is analytic. • N satisfies the jumps   0 01 N+ = N−  0 1 0 on [−z1 , 0), −1 0 0   0 10 N+ = N− −1 0 0 on (0, z1 ], 0 01   1 0 0 N+ = N− 0 0 1 on [−iz2 , iz2 ]. 0 −1 0 • N(z) = I + O(1/z)

(6.1)

(6.2)

(6.3)

as z → ∞.

To solve the model RH problem we lift it to the Riemann surface (2.1) with the sheet structure as in Fig. 1, see also [6, 24], where the same technique was used. Consider to that end the range of the functions ξk on the complex plane, k = ξk (C) for k = 1, 2, 3. Then 1 , 2 , 3 give a partition of the complex plane into three regions, see Fig. 6. In this figure q, p and p0 are such that q = ξ1 (z1 ) = ξ2 (z1 ) = −ξ1 (−z1 ) = −ξ3 (−z1 ), ip = −ξ2 (iz2 ) = −ξ3 (iz2 ) = ξ2 (−iz2 ) = ξ3 (−iz2 ), ip0 = ξ1+ (0) = −ξ1− (0).

(6.4)

Large n Limit of Gaussian Random Matrices with External Source, Part II

383

Ω1

ip 0 Γ ip Ω3 −q

Ω2 a

−a

q

−ip

−ip 0 Fig. 6. Partition of the complex ξ -plane

Let be the boundary of 1 . Then we have ξ1± ([−z1 , z1 ]) = ∩ {±Im z ≥ 0}, ξ2− ([−iz2 , 0]) = [ip, ip0 ], ξ2− ([0, iz2 ]) = [−ip0 , −ip], ξ3− ([−iz2 , 0]) = [ip, 0], ξ3− ([0, iz2 ]) = [0, −ip],

(6.5)

and ξ2± (iy) = ξ3∓ (iy) for −z2 ≤ y ≤ z2 . According to our agreement, on the interval −iz2 ≤ y ≤ iz2 the minus side is on the right. We are looking for a solution N in the following form:  N1 (ξ1 (z)) N1 (ξ2 (z)) N1 (ξ3 (z)) N (z) = N2 (ξ1 (z)) N2 (ξ2 (z)) N2 (ξ3 (z)) , N3 (ξ1 (z)) N3 (ξ2 (z)) N3 (ξ3 (z)) 

(6.6)

where N1 (ξ ), N2 (ξ ), N3 (ξ ) are three scalar analytic functions on C \ ( ∪ [−ip0 , ip0 ]). To satisfy the jump conditions on N (z) we need the following jump relations for Nj (ξ ), j = 1, 2, 3: Nj + (ξ ) = Nj − (ξ ), ξ ∈ ( ∩ {Im z ≤ 0}) ∪ [−ip0 , −ip] ∪ [ip, ip0 ], Nj + (ξ ) = −Nj − (ξ ), ξ ∈ ( ∩ {Im z ≥ 0}) ∪ [−ip, ip].

(6.7)

So the Nj ’s are actually analytic across the curve in the lower half-plane and on the segments [ip, ip0 ] and [−ip0 , −ip]. What remains are the curve in the upper halfplane and the segment [−ip, ip], where the functions change sign. Since ξ1 (∞) = ∞, ξ2 (∞) = a, ξ3 (∞) = −a, then to satisfy N (∞) = I we require N1 (∞) = 1, N2 (∞) = 0, N3 (∞) = 0,

N1 (a) = 0, N2 (a) = 1, N3 (a) = 0,

N1 (−a) = 0; N2 (−a) = 0; N3 (−a) = 1.

(6.8)

384

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

Thus, we obtain three scalar RH problems on N1 , N2 , N3 . Equations (6.7)–(6.8) have the following solution: N1 (ξ ) =

ξ 2 − a2 (ξ 2

+ p 2 )(ξ 2

− q 2)

,

N2,3 (ξ ) = c2,3

ξ ±a (ξ 2

+ p 2 )(ξ 2 − q 2 )

, (6.9)

with cuts at ∩ {Im ξ ≥ 0} and [−ip, ip]. The constants c2,3 are determined by the equations N2,3 (±a) = 1. We have that (ξ 2 + p 2 )(ξ 2 − q 2 ) = ξ 4 − (1 + 2a 2 )ξ 2 + (a 2 − 1)a 2 ≡ R(ξ ; a),

(6.10)

− √i

. Thus, the solution to the model RH and as in Sect. 6 of [6], we obtain c2 = c3 = 2 problem is given by   ξ12 (z) − a 2 ξ22 (z)−a 2 ξ32 (z)−a 2 √ √  √ R(ξ2 (z);a) R(ξ3 (z);a)  R(ξ1 (z); a)       ξ1 (z) + a √ ξ2 (z)+a √ ξ3 (z)+a  , (6.11) N(z) =  −i −i −i √   2R(ξ (z);a) 2R(ξ (z);a) 2 3 2R(ξ1 (z); a)      ξ1 (z) − a ξ2 (z)−a ξ3 (z)−a  √ √ −i √ −i 2R(ξ (z);a) −i 2R(ξ (z);a) 2 3 2R(ξ1 (z); a) with cuts on [−z1 , z1 ] and [−iz2 , iz2 ]. 7. Local Parametrices Near the branch points N will not be a good approximation to S. We need a local analysis near each of the branch points. In a small circle around each of the branch points, the parametrix P should have the same jumps as S, and on the boundary of the circle P should match with N in the sense that P (z) = N (z) (I + O(1/n))

(7.1)

uniformly for z on the boundary of the circle. The construction of P near the real branch points ±z1 makes use of Airy functions and it is the same as the one given in [6, Sect. 7] for the case a > 1. The parametrix near the imaginary branch points ±iz2 is also constructed with Airy functions. We give the construction near iz2 . We want an analytic P in a neigborhood of iz2 with jumps   1 0 0 1 0 on left contour, P+ = P− 0 0 en(λ2 −λ3 ) 1   10 0 P+ = P− 0 1 en(λ3 −λ2 )  on right contour, (7.2) 00 1   1 0 0  1 P+ = P− 0 0 on vertical part. −λ ) n(λ 3+ 3− 0 −1 e In addition we need the matching condition (7.1). Except for the matching condition (7.1), the problem is a 2 × 2 problem.

Large n Limit of Gaussian Random Matrices with External Source, Part II

385

Let us consider λ2 − λ3 near the branch point iz2 . We know that (λ2 − λ3 )(iz2 ) = 0, see (2.7) and since ξ2 − ξ3 has square root behavior at iz2 it follows that

z

(λ2 − λ3 )(z) =

(ξ2 (s) − ξ3 (s))ds = (z − iz2 )3/2 h(z)

iz2

with an analytic function h with h(iz2 ) = 0. So we can take a 2/3-power and obtain a conformal map. To be precise, we note that arg((λ2 − λ3 )(iy)) = π/2,

for y > z2 ,

and so we define

2/3 3 f (z) = (λ2 − λ3 )(z) 4

(7.3)

such that arg f (z) = π/3,

for z = iy, y > z2 .

Then s = f (z) is a conformal map, which maps [0, iz2 ] into the ray arg s = − 2π 3 , and which maps the parts of near iz2 in the right and left half-planes into the rays arg s = 0 and arg s = 2π 3 , respectively. [Recall that λ2 − λ3 is real on these contours, see (4.1).] We choose P of the form   0 1 1 0 , 0 P (z) = E(z) n2/3 f (z) 0 e 2 n(λ2 −λ3 ) − 21 n(λ2 −λ3 ) 0 0 e

(7.4)

where E is analytic. In order to satisfy the jump conditions (7.2) we want that is defined and analytic in the complex s-plane cut along the three rays arg s = k 2πi 3 , k = −1, 0, 1, and there it has jumps 

1

+ = − 0 0  1

+ = − 0 0  1

+ = − 0 0

 0 0 1  0 0 0 1 −1 1  00 1 1 01 0 1 1

for arg s = 2π/3,

for arg s = −2π/3,

(7.5)

for arg s = 0.

Put y0 (s) = Ai(s), y1 (s) = ω Ai(ωs), y2 (s) = ω2 Ai(ω2 s) with ω = 2π/3 and Ai the standard Airy function. Then we take as

386

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars



1

= 0 0  1

= 0 0  1

= 0 0

 0 0 y0 −y2  for 0 < arg s < 2π/3, y0 −y2  0 0 y0 y1  for − 2π/3 < arg s < 0, y0 y1  0 0 −y1 −y2  for 2π/3 < arg s < 4π/3. −y1 −y2

(7.6)

This satisfies the jumps (7.5). In order to achieve the matching (7.1) we define the prefactor E as

with

E = N L−1

(7.7)

   1 0 0 1 0 0 1  −1/6 −1/4  0 1 i  , f 0 0n L= √ 2 π 0 1/6 1/4 0 −1 i 0 n f

(7.8)

where f 1/4 has a branch cut along the vertical segment [0, iz2 ] and it is real and positive where f is real and positive. The matching condition (7.1) now follows from the asymptotics of the Airy function and its derivative 2 3/2 1 1 + O s −3/2 , Ai(s) = √ s −1/4 e− 3 s 2 π 2 3/2 1 Ai (s) = − √ s 1/4 e− 3 s 1 + O s −3/2 , 2 π 1/4

as s → ∞, | arg s| < π. On the cut we have f+  1 0 L+ = L− 0 0 0 −1

1/4

= if− . Then (7.8) gives  0 1 , 0

which is the same jump as satisfied by N , see (6.3). This implies that E = N L−1 is analytic in a punctured neighborhood of iz2 . Since the entries of N and L have at most fourth-root singularities, the isolated singularity is removable, and E is analytic. It follows that P defined by (7.4) does indeed satisfy the jumps (7.2) and the matching condition (7.1). A similar construction gives the parametrix in the neighborhood of −iz2 . 8. Fourth Transformation S → R Having constructed N and P , we define the final transformation by R(z) = S(z)N (z)−1 −1

R(z) = S(z)P (z)

away from the branch points, near the branch points.

(8.1)

Large n Limit of Gaussian Random Matrices with External Source, Part II

387

2

Σ

1.5

iz2

1

•

Im z

0.5

• •

0

−x*

iy*

•

•

−z1

z1 •

−0.5

•

x*

*

−iy

•

−1

−iz2

−1.5

−2 −4

−3

−2

−1

0 Re z

1

2

3

4

Fig. 7. R has jumps on this system of contours

Since jumps of S and N coincide on the interval (−z1 , z1 ) and the jumps of S and P coincide inside the disks around the branch points, we obtain that R is analytic outside a system of contours as shown in Fig. 7. On the circles around the branch points there is a jump R+ = R− (I + O(1/n)),

(8.2)

which follows from the matching condition (7.1). On the remaining contours, the jump is R+ = R− (I + O(e−cn ))

(8.3)

for some c > 0. Since we also have the asymptotic condition R(z) = I + O(1/z) as z → ∞, we may conclude as in [5, Sect. 8] that

1 as n → ∞, (8.4) R(z) = I + O n(|z| + 1) uniformly for z ∈ C, see also [15–17, 23]. 9. Proof of Theorems 1.1 and 1.2 We follow the expression for the kernel Kn as we make the transformations Y → U → T → S. From (1.14) and the transformation (3.1) it follows that Kn has the following expression in terms of U , for any x, y ∈ R,   1 2 2 e−nλ1+ (x) e 4 n(x −y ) nλ2+ (y) nλ3+ (y) −1  . (9.1) U+ (y)U+ (x)  Kn (x, y) = e 0 0e 2πi(x − y) 0 Then from (4.2) we obtain for y ≥ 0 inside the contour , and for any x ∈ R,   1 2 2 e−nλ1+ (x) e 4 n(x −y ) nλ2+ (y) −1 , Kn (x, y) = 0 T+ (y)T+ (x)  0 0e 2πi(x − y) 0

(9.2)

388

A.I. Aptekarev, P.M. Bleher, A.B.J. Kuijlaars

and from (5.3), we have when x, y ∈ [0, z1 ),

 −nλ (x)  1 2 2 e 1+ e 4 n(x −y ) nλ1+ (y) nλ2+ (y) −1 Kn (x, y) = e 0 S+ (y)S+ (x) e−nλ2+ (x)  . (9.3) −e 2π i(x − y) 0

Since λ1+ and λ2+ are each others complex conjugates on [0, z1 ), we can rewrite (9.3) for x, y ∈ [0, z1 ) as Kn (x, y) =

en(h(y)−h(x)) ni Im λ1+ (y) −ni Im λ1+ (y) e 0 −e 2πi(x − y)  −ni Im λ (x)  1+ e −1 (x)  , ni Im λ  1+ ×S+ (y)S+ (x) e 0

(9.4)

where 1 h(x) = Re λ1+ (x) − x 2 . 4

(9.5)

Note that (9.4) is exactly the same as Eq. (5.14) in [6]. Therefore we can almost literally follow the proofs in Sect. 9 of [6] to complete the proof of Theorem 1.1 and 1.2. Indeed as in [6] the limiting mean density (1.15) follows from (9.4) and (8.4) in case x > 0, where ρ(x) =

1 Im ξ1+ (x), π

x ∈ R.

(9.6)

The case x < 0 follows in the same way and also by symmetry. Recalling that the choice of the cut [−iz2 , iz2 ] was arbitrary as remarked at the end of Sect. 2, we note that we might as well have done the asymptotic analysis on a contour that does not pass through 0, so that we obtain (1.15) for x = 0 as well. The statement in part (a) on the behavior of ρ follows immediately from (9.6) and the properties of ξ1 as the inverse mapping of (2.2). This completes the proof of Theorem 1.1. The proof of part (a) of Theorem 1.2 for the case x0 > 0 follows from (9.4) and (8.4) exactly as in Sect. 9 of [6]. The case x0 < 0 follows by symmetry, and the case x0 = 0 follows as well, since we might have done the asymptotic analysis on a cut different from [−iz2 , iz2 ], as just noted above. The proof of part (b) follows as in [6] as well. Note however that the proof of part (b) relies on the local parametrix at the branch point z1 , which we have not specified explicitly in Sect. 7. However, the formulas are the same as those in [6] and the proof can be copied. This completes the proof of Theorem 1.2. References 1. Aptekarev, A.I., Branquinho, A., Van Assche, W.: Multiple orthogonal polynomials for classical weights. Trans. Amer. Math. Soc. 355, 3887–3914 (2003) 2. Baik, J.: Random vicious walks and random matrices. Commun. Pure Appl. Math. 53, 1385–1410 (2000) 3. Bleher, P., Its, A.: Semiclassical asymptotics of orthogonal polynomials, Riemann-Hilbert problem, and the universality in the matrix model. Ann. Math. 150, 185–266 (1999) 4. Bleher, P., Its, A.: Double scaling limit in the random matrix model. The Riemann-Hilbert approach. Commun. Pure Appl. Math. 56, 433–516 (2003)

Large n Limit of Gaussian Random Matrices with External Source, Part II

389

5. Bleher, P.M., Kuijlaars, A.B.J.: Random matrices with external source and multiple orthogonal polynomials. Internat. Math. Research Notices 2004, 3, 109–129 (2004) 6. Bleher, P.M., Kuijlaars, A.B.J.: Large n limit of Gaussian random matrices with external source. Part I. Commun. Math. Phys. 252, 43–76 (2004) 7. Bleher, P.M., Kuijlaars, A.B.J.: Integral representations for multiple Hermite and multiple Laguerre polynomials. http://arxiv.org/abs/math.CA/0406616, 2004 to appear in Annales de l’Institut Fourier 8. Bleher, P.M., Kuijlaars, A.B.J.: Large n limit of Gaussian random matrices with external source, Part III: double scaling limit in the critical case. In preparation 9. Br´ezin, E., Hikami, S.: Spectral form factor in a random matrix theory. Phys. Rev. E 55, 4067–4083 (1997) 10. Br´ezin, E., Hikami, S.: Correlations of nearby levels induced by a random potential. Nucl. Phys. B 479, 697–706 (1996) 11. Br´ezin, E., Hikami, S.: Extension of level spacing universality. Phys. Rev. E 56, 264–269 (1997) 12. Br´ezin, E., Hikami, S.: Universal singularity at the closure of a gap in a random matrix theory. Phys. Rev. E 57, 4140–4149 (1998) 13. Br´ezin, E., Hikami, S.: Level spacing of random matrices in an external source. Phys. Rev. E 58, 7176–7185 (1998) 14. Daems, E., Kuijlaars, A.B.J.: A Christoffel-Darboux formula for multiple orthogonal polynomials. J. Approx. Theory 130, 190–202 (2004) 15. Deift, P.: Orthogonal Polynomials and Random Matrices: a Riemann-Hilbert approach. Courant Lecture Notes in Mathematics, Vol. 3, Providence R.I.: Amer. Math. Soc., 1999 16. Deift, P., Kriecherbauer, T., McLaughlin, K.T.-R., Venakides, S., Zhou, X.: Uniform asymptotics of polynomials orthogonal with respect to varying exponential weights and applications to universality questions in random matrix theory. Commun. Pure Appl. Math. 52, 1335–1425 (1999) 17. Deift, P., Kriecherbauer, T., McLaughlin, K.T.-R., Venakides, S., Zhou, X.: Strong asymptotics of orthogonal polynomials with respect to exponential weights. Commun. Pure Appl. Math 52, 1491– 1552 (1999) 18. Deift, P., Zhou, X.: A steepest descent method for oscillatory Riemann-Hilbert problems. Asymptotics for the MKdV equation. Ann. Math. 137, 295–368 (1993) 19. Itzykson, C., Zuber, J.B.: The planar approximation II. J. Math. Phys. 21, 411–421 (1980) 20. Johansson, K.: Non-intersecting paths, random tilings and random matrices. Probab. Theo Related Fields 123, 225–280 (2002) 21. Karlin, S., McGregor, J.: Coincidence probabilities. Pacific J. Math. 9, 1141–1164 (1959) 22. Katori, M., Tanemura, H.: Scaling limit of vicious walks and two-matrix model. Phys. Rev. E 66, Art. No. 011105 (2002) 23. Kuijlaars, A.B.J.: Riemann-Hilbert analysis for orthogonal polynomials. In: Orthogonal Polynomials and Special Functions E. Koelink, W. Van Assche, (eds), Lecture Notes in Mathematics, Vol. 1817, Berlin-Heiderberg-New York, Springer-Verlag, 2003, pp. 167–210 24. Kuijlaars, A.B.J., Van Assche, W., Wielonsky, F.: Quadratic Hermite-Pad´approximation to the exponential function: a Riemann-Hilbert approach. http://Constr.org/list/math.CA/0302357, 2003 Approx. 21, 351–412 (2005) 25. Kuijlaars, A.B.J., Vanlessen, M.: Universality for eigenvalue correlations from the modified Jacobi unitary ensemble. Internat. Math. Research Notices 2002, 1575–1600 (2002) 26. Kuijlaars, A.B.J., Vanlessen, M.: Universality for eigenvalue correlations at the origin of the spectrum. Commun. Math. Phys. 243, 163–191 (2003) 27. Mehta, M.L.: Random Matrices. 2nd edition, Boston: Academic Press, 1991 28. Nagao, T., Forrester, P.J.: Vicious random walkers and a discretization of Gaussian random matrix ensembles. Nuclear Phys. B 620, 551–565 (2002) 29. Pastur, L.A.: The spectrum of random matrices (Russian). Teoret. Mat. Fiz. 10, 102–112 (1972) 30. Tracy, C.A., Widom, H.: The Pearcey process. http://arxiv.org/abs/math.PR/0412005, 2004 31. Van Assche, W., Coussement, E.: Some classical multiple orthogonal polynomials. J. Comput. Appl. Math. 127, 317–347 (2001) 32. Van Assche, W., Geronimo, J.S., Kuijlaars, A.B.J.: Riemann-Hilbert problems for multiple orthogonal polynomials. In: Special Functions 2000: Current Perspectives and Future Directions, J. Bustoz et al., (eds), Dordrecht: Kluwer, 2001, pp. 23–59 33. Zinn-Justin, P.: Random Hermitian matrices in an external field. Nucl. Phys. B 497, 725–732 (1997) 34. Zinn-Justin, P.: Universality of correlation functions of Hermitian random matrices in an external field. Commun. Math. Phys. 194, 631–650 (1998) Communicated by J.L. Lebowitz

Commun. Math. Phys. 259, 391–411 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1348-z

Communications in

Mathematical Physics

Abelianizing Vertex Algebras Haisheng Li1,2, 1 2

Department of Mathematical Sciences, Rutgers University, Camden, NJ 08102, USA Department of Mathematics, Harbin Normal University, Harbin, China

Received: 10 September 2004 / Accepted: 25 November 2004 Published online: 12 April 2005 – © Springer-Verlag 2005

Abstract: To every vertex algebra V we associate a canonical decreasing sequence of subspaces and prove that the associated graded vector space gr(V ) is naturally a vertex Poisson algebra, in particular a commutative vertex algebra. We establish a relation between this decreasing sequence and the sequence Cn introduced by Zhu. By using the (classical) algebra gr(V ), we prove that for any vertex algebra V , C2 -cofiniteness implies Cn -cofiniteness for all n ≥ 2. We further use gr(V ) to study generating subspaces of certain types for lower truncated Z-graded vertex algebras. 1. Introduction Just as with classical (associative or Lie) algebras, abelian or commutative vertex algebras (should be) are the simplest objects in the category of vertex algebras. It was known (see [B]) that commutative vertex algebras exactly amount to differential algebras, namely unital commutative associative algebras equipped with a derivation. Related to the notion of a commutative vertex algebra, is the notion of a vertex Poisson algebra (see [FB]), where a vertex Poisson algebra structure combines a commutative vertex algebra structure, or equivalently, a differential algebra structure, with a vertex Lie algebra structure (see [K, P]). As it was shown in [FB], vertex Poisson algebras can be considered as classical limits of vertex algebras. In the classical theory, a well known method to abelianize an associative algebra is to use a good increasing filtration and then consider the associated graded vector space. A typical example is the universal enveloping algebra U (g) of a Lie algebra g with the filtration {Un }, where for n ≥ 0, Un is linearly spanned by the vectors a1 · · · am for m ≤ n, a1 , . . . , am ∈ g. In this case, the associated graded algebra grU (g) is naturally a Poisson algebra and the well known Poincar´e-Birkhoff-Witt theorem says that the associated graded Poisson algebra grU (g) is canonically isomorphic to the symmetric

Partially supported by an NSA grant

392

H. Li

algebra S(g) which is also a Poisson algebra. This result and the canonical isomorphism have played a very important role in Lie theory. Motivated by this classical result, in [Li2] we introduced and studied a notion of what we called good increasing filtration for a vertex algebra V and we proved that the associated graded vector space grV of V with respect to a good increasing filtration isnaturally a vertex Poisson algebra. Furthermore, for any N-graded vertex algebra V = n∈N V(n) with V(0) = C1, we constructed a canonical good increasing filtration of V . This increasing filtration was essentially used in [KL, GN, Bu1, 2, ABD and NT] in the study on generating subspaces of V with a certain property analogous to the well known Poincar´e-Birkhoff-Witt spanning property. In this paper, we introduce and study “good” decreasing filtrations for vertex algebras. To any vertex algebra V we associate a canonical decreasing sequence E of subspaces En for n ≥ 0 and we prove that the associated graded vector space gr E (V ) is naturally an N-graded vertex Poisson algebra, where for n ∈ Z, En is linearly spanned by the vectors (1)

(r)

u−1−k1 · · · u−1−kr v for r ≥ 1, u(i) , v ∈ V , ki ≥ 0 with k1 + · · · + kr ≥ n. Notice that unlike the increasing filtration which uses the weight grading, this decreasing sequence uses only the vertex algebra structure. For any vertex algebra V , there has been a fairly well known decreasing sequence C = {Cn }n≥2 introduced by Zhu [Z1, 2], where for n ≥ 2, Cn is linearly spanned by the vectors u−n v for u, v ∈ V . The notion of C2 was introduced and used in the fundamental study of Zhu on modular invariance, where the finiteness of dim V /C2 played a crucial role. It was shown in [Z2] that V /C2 has a natural Poisson algebra structure. In this paper, we relate our decreasing sequence E with Zhu’s sequence C. In particular, we show that C2 = E1 and C3 = E2 . We then show that the degree zero subspace E0 /E1 of gr E (V ), which is naturally a Poisson algebra, is exactly Zhu’s Poisson algebra V /C2 . We further show that gr E (V ) as a differential algebra is generated by the degree zero subalgebra V /C2 . As an application, we show that for any vertex algebra V , if V is C2 -cofinite, then V is En -cofinite and Cn+2 -cofinite for all n ≥ 0. Similarly we show that if V is a C2 -cofinite vertex algebra and if W is a C2 -cofinite V -module, then W is Cn -cofinite for all n ≥ 2. Under the assumption that V is an N-graded vertex algebra with dim V(0) = 1, it has been proved before by [GN] (see also [NT, Bu1, 2]) that C2 -cofiniteness implies Cn+2 -cofinite for all n ≥ 0. On the other hand, the original method of [GN] and [KL] used this assumption in an essential way. As we show in this paper, for certain vertex algebras, both sequences E and C are trivial in the sense that En = Cn+2 = V for all n ≥ 0. On the other hand,by using the connection between the two decreasing sequences we prove that if V = n≥t V(n) is a lower truncated Z-graded vertex algebra such as a vertex operator algebra in the sense of [FLM] and [FHL], then for any k, Cn , En ⊂ m≥k V(m) for n sufficiently large. Consequently, ∩n≥0 En = ∩n≥0 Cn+2 = 0. (In this case, both sequences are filtrations.) Furthermore, using this result and gr E (V ) we show that if a graded subspace U of V gives rises to a generating subspace of V /C2 as an algebra, then U generates V with a certain spanning property. Similar results have been obtained before in [KL, GN, Bu1, 2 and NT] under a stronger condition. This paper is organized as follows: In Sect. 2, we define the sequence E and show that the associated graded vector space is an N-graded vertex Poisson algebra. In Sect. 3, we

Abelianizing Vertex Algebras

393

relate the sequences E and C. In Sect. 4, we study generating subspaces of certain types for lower truncated Z-graded vertex algebras. 2. Decreasing Sequence E and the Vertex Poisson Algebra grE (V ) In this section we first recall the definition of a vertex Poisson algebra from [FB] and we then construct a canonical decreasing sequence E for each vertex algebra V and show that the associated graded vector space gr E V is naturally a vertex Poisson algebra. We also show that if V is an N-graded vertex algebra, then the sequence E is indeed a filtration of V . Let V be a vertex algebra. We have Borcherds’ commutator formula and iterate formula: m [um , vn ] = (ui v)m+n−i , (2.1) i i≥0 m (um v)n w = um−i vn+i w − (−1)m vm+n−i ui w (−1)i (2.2) i i≥0

for u, v, w ∈ V , m, n ∈ Z. Define a (canonical) linear operator D on V by D(v) = v−2 1

for v ∈ V .

(2.3)

Then Y (v, x)1 = ex D v

for v ∈ V .

(2.4)

Furthermore, [D, vn ] = (Dv)n = −nvn−1

(2.5)

for v ∈ V , n ∈ Z. If V is a vertex operator algebra in the sense of [FLM] and [FHL], then D = L(−1), a component of the vertex operator Y (ω, x) = n∈Z L(n)x −n−2 associated to the conformal (or Virasoro) vector ω. (See for example [LL] for an exposition of such facts.) A vertex algebra V is called a commutative vertex algebra if [um , vn ] = 0

for u, v ∈ V , m, n ∈ Z.

(2.6)

It is well known (see [B, FHL]) that (2.6) is equivalent to un = 0

for u ∈ V , n ≥ 0.

(2.7)

Remark 2.1. Let A be any unital commutative associative algebra with a derivation d. Then one has a commutative vertex algebra structure on A with Y (a, x)b = (exd a)b for a, b ∈ A and with the identity 1 as the vacuum vector (see [B]). On the other hand, let V be any commutative vertex algebra. Then V is naturally a commutative associative algebra with u · v = u−1 v for u, v ∈ V and with 1 as the identity and with D as a derivation. Furthermore, Y (u, x)v = (ex D u)v for u, v ∈ V . Therefore, a commutative vertex algebra exactly amounts to a unital commutative associative algebra equipped with a derivation, which is often called a differential algebra.

394

H. Li

A vertex algebra V equipped with a Z-grading V = n∈Z V(n) is called a Z-graded vertex algebra if 1 ∈ V(0) and if for u ∈ V(k) with k ∈ Z and for m, n ∈ Z, um V(n) ⊂ V(n+k−m−1) . (2.8) We say that a Z-graded vertex algebra V = n∈Z V(n) is lower truncated if V(n) = 0 for n sufficiently small. In particular, every vertex operator algebra in the sense of [FLM] and [FHL] is a lower truncated Z-graded vertex algebra. An N-graded vertex algebra is defined in the obvious way. We say that a vertex algebra V is Z-gradable (N-gradable) if there exists a Z-grading (N-grading) such that V becomes Z-graded (N-graded) vertex algebra. We see that a commutative Z-graded vertex algebra is naturally a Z-graded differential algebra. The following definition of the notion of vertex Lie algebra is due to [K and P]: Definition 2.2. A vertex Lie algebra is a vector space V equipped with a linear operator D and a linear map Y− : V → Hom (V , x −1 V [x −1 ]), vn x −n−1 v → Y− (v, x) =

(2.9)

n≥0

such that for u, v ∈ V , m, n ∈ N, (Dv)n = −nvn−1 , m 1 um v = (−1)m+i+1 D i vm+i u, i! i=0 m m [um , vn ] = (ui v)m+n−i . i

(2.10) (2.11)

(2.12)

i=0

A module (see [K]) for a vertex Lie algebra V is a vector space W equipped with a linear map Y−W : V → Hom (W, x −1 W [x −1 ]), vn x −n−1 v → Y−W (v, x) =

(2.13)

n≥0

such that (2.10) and (2.12) hold. Recall the following notion of vertex Poisson algebra from [FB] (cf. [DLM]): Definition 2.3. A vertex Poisson algebra is a commutative vertex algebra A, or equivalently, a (unital) commutative associative algebra equipped with a derivation ∂, equipped with a vertex Lie algebra structure (Y− , ∂) such that Y− (a, x) ∈ x −1 (Der A)[[x −1 ]]

for a ∈ A.

(2.14)

A module for a vertex Poisson algebra A is a vector space W equipped with a module structure for A as an associative algebra and a module structure for A as a vertex Lie algebra such that Y−W (u, x)(vw) = (Y−W (u, x)v)w + vY−W (u, x)w for u, v ∈ V , w ∈ W .

(2.15)

Abelianizing Vertex Algebras

395

The following result obtained in [Li2] gives a construction of vertex Poisson algebras from vertex algebras through certain increasing filtrations: Proposition 2.4. Let V be a vertex algebra and let F = {Fn }n∈Z be a good increasing filtration of V in the sense that 1 ∈ F0 , un Fs ⊂ Fr+s

(2.16)

for u ∈ Fr , r, s, n ∈ Z and un Fs ⊂ Fr+s−1

for n ≥ 0. (2.17) Then the associated graded vector space gr F V = n∈Z Fn+1 /Fn is naturally a vertex Poisson algebra with (u + Fm−1 )(v + Fn−1 ) = u−1 v + Fm+n−1 , ∂(u + Fm−1 ) = Du + Fm−1 , (ur v + Fm+n−2 )x −r−1 Y− (u + Fm−1 )(v + Fn−1 ) =

(2.18) (2.19) (2.20)

r≥0

for u ∈ Fm , v ∈ Fn with m, n ∈ Z. Furthermore, the following construction of good increasing filtrations was also given in [Li2]: an N-graded vertex algebra such that V(0) = C1. Theorem 2.5. Let V = n∈N V(n) be Let U be a graded subspace of V+ = n≥1 V(n) such that (1)

(r)

V = span{u−k1 · · · u−kr 1 | r ≥ 0, u(i) ∈ U, ki ≥ 1}. In particular, we can take U = V+ . For any n ≥ 0, denote by FnU the subspace of V linearly spanned by the vectors (1)

(r)

u−k1 · · · u−kr 1 for r ≥ 0, for homogeneous vectors u(1) , . . . , u(r) ∈ U and for k1 , . . . , kr ≥ 1 with wt u(1) + · · · + wt u(r) ≤ n. Then the sequence FU = {FnU } is a good increasing filtration1 of V . Furthermore, FU does not depend on U . Next, we give a construction of vertex Poisson algebras from vertex algebras using decreasing filtrations. First, we formulate the following general result, which is similar to Proposition 2.4 and which is classical in nature: Proposition 2.6. Let V be any vertex algebra and let E = {En }n≥0 be a decreasing sequence of subspaces of V such that 1 ∈ E0 and un v ∈ Er+s−n−1

for u ∈ Er , v ∈ Es , r, s ∈ N, n ∈ Z,

(2.21)

where byconvention Em = V for m < 0. Then the associated graded vector space gr E V = n≥0 En /En+1 is naturally an N-graded vertex algebra with (u + Er+1 )n (v + Es+1 ) = un v + Er+s−n 1

This good increasing filtration was denoted by EU = {EnU } in [Li2].

(2.22)

396

H. Li

for u ∈ Er , v ∈ Es , r, s ∈ N, n ∈ Z and with 1 + E1 ∈ E0 /E1 as the vacuum vector. Furthermore, gr E V is commutative if and only if un v ∈ Er+s−n

for u ∈ Er , v ∈ Es , r, s, n ∈ N.

(2.23)

Assume (2.21) and (2.23). Then the commutative vertex algebra gr E V is a vertex Poisson algebra where ∂(u + Er+1 ) = Du + Er+2 ,

(un v + Er+s−n+1 )x −n−1 Y− (u + Er+1 , x)(v + Es+1 ) =

(2.24) (2.25)

n≥0

for u ∈ Er , v ∈ Es with r, s ∈ N. Proof. Notice that the condition (2.21) guarantees that the operations given in (2.22) are well defined. Just as with any classical algebras, it is straightforward to check that gr E V is an N-graded vertex algebra and it is also clear that gr E V is commutative if and only if (2.23) holds. Assuming (2.21) and (2.23) we have a commutative associative N-graded algebra gr E (V ) with derivation ∂ defined by ∂(u + En+1 ) = (u + En+1 )−2 (1 + E1 ) = u−2 1 + En+2 = Du + En+2 , noticing that by (2.21) we have Du = u−2 1 ∈ En+1 . The condition (2.23) guarantees that the linear map Y− in (2.24) is well defined. It is straightforward to check that gr E (V ) equipped with Y− and ∂ is a vertex Lie algebra. Now, we check the compatibility condition (2.14). Let u ∈ Er , v ∈ Es , w ∈ Ek with r, s, k ∈ N. For m ≥ 0, using the Borcherds’ commutator formula for V we have m um (v−1 w) = v−1 (um w) + (ui v)m−1−i w i i≥0

= v−1 (um w) + (um v)−1 w +

m−1 i=0

m (ui v)m−1−i w. i

(2.26)

For 0 ≤ i ≤ m − 1, using (2.23) (twice) we have (ui v)m−1−i w ∈ Er+s+k−m+1 . Thus um (v−1 w) + Er+s+k−m+1 = v−1 (um w) + (um v)−1 w + Er+s+k−m+1 . This proves Y− (u, x) ∈ algebra.

x −1 (Der(gr

E

(V )))[x −1 ]. Therefore, gr

(2.27)

E (V ) is a vertex Poisson

In the following, for each vertex algebra we construct a canonical decreasing sequence E = {En }n≥0 which satisfies all the conditions assumed in Proposition 2.6. Definition 2.7. Let V be a vertex algebra and let W be a V -module. Define a sequence EW = {En (W )}n∈Z of subspaces of W , where for n ∈ Z, En (W ) is linearly spanned by the vectors (1)

(r)

u−1−k1 · · · u−1−kr w for r ≥ 1, u(1) , . . . , u(r) ∈ V , w ∈ W, k1 , . . . , kr ≥ 0 with k1 + · · · + kr ≥ n.

(2.28)

Abelianizing Vertex Algebras

397

Our main task is to establish the properties (2.21) and (2.23) for the sequence E. The following are some immediate consequences: Lemma 2.8. For any V -module W we have En (W ) ⊃ En+1 (W ) for any n ∈ Z, En (W ) = W for any n ≤ 0, u−1−k En (W ) ⊂ En+k (W ) for u ∈ V , k ≥ 0, n ∈ Z.

(2.29) (2.30) (2.31)

The following gives a stronger spanning property for En (W ): Lemma 2.9. Let W be a V -module. For any n ≥ 1, we have En (W ) = span{u−1−i w | u ∈ V , i ≥ 1, w ∈ En−i (W )}.

(2.32)

Furthermore, for n ≥ 1, En (W ) is linearly spanned by the vectors (1)

(r)

u−k1 −1 · · · u−kr −1 w

(2.33)

for r ≥ 1, u(1) , . . . , u(r) ∈ V , w ∈ W, k1 , . . . , kr ≥ 1 with k1 + · · · + kr ≥ n. Proof. Notice that (2.33) follows from (2.32) and induction. Denote by En (W ) the space on the right-hand side of (2.32). To prove (2.32), we need to prove that each spanning vector of En (W ) in (2.28) lies in En (W ). Now we use induction on r. If r = 1, we have (1) k1 ≥ n ≥ 1 and w ∈ W = En−k1 (W ), so that u−1−k1 w ∈ En (W ). Assume r ≥ 2. If k1 ≥ 1, we have u−1−k1 u−1−k2 · · · u−1−kr w ∈ En (W ) (1)

(2)

(2)

(r)

(r)

because u−1−k2 · · · u−1−kr w ∈ En−k1 (W ) with k2 + · · · + kr ≥ n − k1 . If k1 = 0, (2)

(r)

we have k2 + · · · + kr ≥ n, so that u−1−k2 · · · u−1−kr w ∈ En (W ). By the inductive (2) u−1−k2

hypothesis, we have 1, w ∈ En−k (W ), we have

(r) · · · u−1−kr w

∈ En (W ). Furthermore, for any b ∈ V , k ≥

u−1 b−1−k w = b−1−k u−1 w + (1)

(1)

−1 i≥0

i

(ui b)−2−k−i w . (1)

From the definition we have u−1 w ∈ En−k (W ), so that b−1−k u−1 w ∈ En (W ). On the other hand, for i ≥ 0, we have w ∈ En−k (W ) ⊂ En−k−i−1 (W ), so that (1)

(1)

(ui b)−2−k−i w ∈ En (W ). (1)

Therefore, u−1 b−1−k w ∈ En (W ). This proves that u−1−k1 u−1−k2 · · ·u−1−kr w ∈ En (W ), completing the induction. (1)

(1)

(2)

We have the following special case of (2.21) and (2.23) for E:

(r)

398

H. Li

Lemma 2.10. Let W be any V -module. For a ∈ V , m, n ∈ Z, we have am En (W ) ⊂ En−m−1 (W ).

(2.34)

Furthermore, am En (W ) ⊂ En−m (W )

for m ≥ 0.

(2.35)

Proof. By (2.31), (2.34) holds for m ≤ −1. Assume m ≥ 0. Since En−m (W ) ⊂ En−m−1 (W ) it suffices to prove (2.35). We now prove the assertion by induction on n. If n ≤ 0, we have En−m (W ) = W (because n − m ≤ 0), so that am En (W ) ⊂ W = En−m (W ). Assume n ≥ 1. From (2.32), En (W ) is spanned by the vectors u−1−k w for u ∈ V , k ≥ 1, w ∈ En−k (W ). Let u ∈ V , k ≥ 1, w ∈ En−k (W ). In view of Borcherds’ commutator formula we have m am u−1−k w = u−1−k am w + (ai u)m−k−i−1 w. i i≥0

Since w ∈ En−k (W ) with n − k < n, from the inductive hypothesis we have am w ∈ En−k−m (W ), (ai u)m−k−i−1 w ∈ En−m+i (W ) ⊂ En−m (W )

for i ≥ 0.

Furthermore, using the inductive hypothesis and Lemma 2.8 we have u−1−k am w ∈ u−1−k En−k−m (W ) ⊂ En−m (W ). Therefore, am u−1−k w ∈ En−m (W ). This proves am En (W ) ⊂ En−m (W ), completing the induction and the whole proof. Now we have the following general case: Proposition 2.11. Let W be a V -module and let u ∈ Er (V ), w ∈ Es (W ) with r, s ∈ Z. Then un w ∈ Er+s−n−1 (W )

for n ∈ Z.

(2.36)

un w ∈ Er+s−n (W )

for n ≥ 0.

(2.37)

Furthermore, we have

Proof. We are going to use induction on r. By Lemma 2.10, we have un w ∈ Es−n−1 (W ). If r ≤ 0, we have r +s −n−1 ≤ s −n−1, so that un w ∈ Es−n−1 (W ) ⊂ Er+s−n−1 (W ). Assume r ≥ 0 and u ∈ Er+1 (V ). In view of (2.32) it suffices to consider u = a−2−i b for some a ∈ V , 0 ≤ i ≤ r, b ∈ Er−i (V ). By the iterate formula (2.2) we have j −2 − i a−2−i−j bn+j w − (−1)i bn−2−i−j aj w . (−1) (a−2−i b)n w = j j ≥0

(2.38) If n ≥ 0, using the inductive hypothesis (with b ∈ Er−i (V )) and Lemma 2.10 we have a−2−i−j bn+j w ∈ a−2−i−j Er−i+s−n−j (W ) ⊂ Er+1+s−n (W ), bn−2−i−j aj w ∈ bn−2−i−j Es−j (W ) ⊂ Er+1+s−n (W ),

Abelianizing Vertex Algebras

399

from which we have that (a−2−i b)n w ∈ Er+1+s−n (W ). If n ≤ −1, we have a−2−i−j bn+j w ∈ a−2−i−j Er−i+s−n−j −1 (W ) ⊂ Er+s−n (W ), bn−2−i−j aj w ∈ bn−2−i−j Es−j (W ) ⊂ Er+1+s−n (W ) ⊂ Er+s−n (W ), so that (a−2−i b)n w ∈ Er+s−n (W ) = E(r+1+s)−n−1 (W ). This concludes the proof.

Combining Propositions 2.6 and 2.11 we immediately have: Theorem 2.12. Let V be any vertex algebra and let E = {En (V )} be the decreasing sequence defined in Definition 2.7 for V . Set

En /En+1 . (2.39) gr E (V ) = n≥0

Then gr E (V ) equipped with the multiplication defined by (a + Er+1 )(b + Es+1 ) = a−1 b + Er+s+1

(2.40)

is a commutative and associative N-graded algebra with 1 + E1 ∈ E0 /E1 as identity and with a derivation ∂ defined by ∂(u + En+1 ) = D(u) + En+2

for u ∈ En , n ∈ N.

Furthermore, gr E (V ) is a vertex Poisson algebra where (un v + Er+s−n+1 )x −n−1 Y− (a + Er+1 , x)(b + Es+1 ) =

(2.41)

(2.42)

n≥0

for a ∈ Er , b ∈ Es with r, s ∈ N. Proposition 2.13. Let W be any V -module and EW the decreasing sequence defined in Definition 2.7 for W . Then the associated graded vector space gr E (W ) = n≥0 En (W )/En+1 (W ) is naturally a module for the vertex Poisson algebra gr E (V ) with (v + Er+1 (V )) · (w + Es+1 (W )) = v−1 w + Er+s+1 (W ), (vn w + Er+s−n+1 )x −n−1 Y−W (v + Er+1 )(w + Es+1 (W )) =

(2.43) (2.44)

n≥0

for v ∈ Er (V ), w ∈ Es (W ). Proof. With the properties (2.36) and (2.37) the actions given by (2.43) and (2.44) are well defined. Clearly, 1 + E1 acts on gr E (W ) as identity and we have (u + Er+1 (V )) · ((v + Es+1 (V )) · (w + Ek+1 (W ))) = u−1 v−1 w + Er+s+k+1 (W ). By the iterate formula (2.2) we have (u−1 v)−1 w = (u−1−i v−1+i w + v−2−i ui w) , i≥0

where for i ≥ 1, using (2.37) and (2.36) we have u−1−i v−1+i w ∈ u−i−1 Es+k+1−i (W ) ⊂ Er+s+k+1 (W )

400

H. Li

and for i ≥ 0, similarly we have v−2−i ui w ∈ v−2−i Es+k−i (W ) ⊂ Er+s+k+1 (W ). Thus (u−1 v)−1 w ∈ u−1 v−1 w + Er+s+k+1 (W ). This proves that gr E (W ) is a module for gr E (V ) as an associative algebra. It is straightforward to check that it is a module for the vertex Lie algebra. Other properties are clear from the proof of Proposition 2.6. Notice that so far we have not excluded the possibility that the associated sequence EV is trivial in the sense that En (V ) = V for all n ≥ 0. Indeed, as we shall see in the next section, for some vertex algebras the associated sequence E is trivial. Nevertheless, we have: Lemma 2.14. Let V = n≥0 V(n) be an N-graded vertex algebra and E = {En } be the decreasing sequence defined in Definition 2.7 for V . Then En (V ) ⊂

V(m)

for n ≥ 0.

(2.45)

m≥n

Furthermore, the associated decreasing sequence E = {En } for V is a filtration, i.e., ∩n≥0 En (V ) = 0. Proof. By definition we have E0 = V = spanned by the vectors (1)

n≥0 V(n) . For n

(2.46) ≥ 1, recall that En is linearly

(r)

u−1−k1 · · · u−1−kr v for r ≥ 1, u(1) , . . . , u(r) , v ∈ V , k1 , . . . , kr ≥ 1 with k1 + · · · + kr ≥ n. If the vectors u(1) , . . . , u(r) , v are homogeneous, we have wt

(1)

(r)

u−1−k1 · · · u−1−kr v

= wt u(1) + k1 + · · · + wt u(r) + kr + wt v ≥ k1 + · · · + kr ≥ n. This proves (2.45) for n ≥ 1. Clearly, each subspace En of V is graded. From (2.45) we immediately have (2.46). In the next section we shall generalize Lemma 2.14 from an N-graded vertex algebra to a lower truncated Z-graded vertex algebra by using a relation between the decreasing sequence E and a sequence introduced by Zhu.

Abelianizing Vertex Algebras

401

3. The Relation Between the Sequences E and C In this section we first recall the sequence C introduced by Zhu and we then give a relation between the two decreasing sequences E and C. We show that if V is a lower truncated Z-graded vertex algebra, then both sequences are decreasing filtrations of V . The following definition is (essentially) due to Zhu ([Z1,2]): Definition 3.1. Let V be a vertex algebra and W a V -module. For any n ≥ 2 we define Cn (W ) to be the subspace of W , linearly spanned by the vectors v−n w for v ∈ V , w ∈ W . A V -module W is said to be C n -cofinite if W/Cn (W ) is finite-dimensional. In particular, if V /Cn (V ) is finite-dimensional, we say that the vertex algebra V is C n -cofinite. The following are easy consequences: Lemma 3.2. Let V be any vertex algebra, let W be a V -module and let n ≥ 2. Then Cm (W ) ⊂ Cn (W ) for m ≥ n, (3.1) u−k Cn (W ) ⊂ Cn (W ) for u ∈ V , k ≥ 0, (3.2) u−n v−k w ≡ v−k u−n w mod Cn+k (W ) for u, v ∈ V , w ∈ W, k ≥ 0. (3.3) Proof. For v ∈ V , r ≥ 2 we have v−r−1 = 1r (Dv)−r . From this we immediately have Cr+1 (W ) ⊂ Cr (W ) for r ≥ 2, which implies (3.1). Let u, v ∈ V , w ∈ W, k ≥ 0. Using the commutator formula (2.1) and (3.1) we have −k (ui v)−k−n−i w ∈ Cn (W ), u−k v−n w = v−n u−k w + i i≥0

proving that u−k Cn (W ) ⊂ Cn (W ). We also have −n (ui v)−n−k−i w ∈ Cn+k (W ), u−n v−k w − v−k u−n w = i i≥0

proving (3.3).

We also have the following more technical results: Lemma 3.3. Let V be any vertex algebra, let W be a V -module and let k ≥ 2. Then u−k Ck (W ) ⊂ Ck+1 (W )

for u ∈ V .

Proof. For u, v ∈ V , w ∈ W , in view of the iterate formula (2.2) we have (u−1 v)−2k+1 w = (u−1−i v−2k+1+i w + v−2k−i ui w) .

(3.4)

(3.5)

i≥0

Now we examine each term in (3.5). Notice that (u−1 v)−2k+1 w ∈ Ck+1 (W ) as −2k + 1 ≤ −k −1 and that v−2k−i ui w ∈ Ck+1 (W ) for i ≥ 0 as −2k −i ≤ −k −1. If i ≥ k, we have −1−i ≤ −k −1, so that u−1−i v−2k+1+i w ∈ Ck+1 (W ). For 0 ≤ i ≤ k −2, we have −2k + 1 + i ≤ −k − 1, so that v−2k+1+i w ∈ Ck+1 (W ). Then by Lemma 3.2 we have u−1−i v−2k+1+i w ∈ Ck+1 (W ) for 0 ≤ i ≤ k − 2. Therefore, the only remaining term u−k v−k w in (3.5) must also lie in Ck+1 (W ). This proves u−k Ck (W ) ⊂ Ck+1 (W ).

402

H. Li

Proposition 3.4. Let V be any vertex algebra, let W be a V -module and let n be any nonnegative integer. Then (1)

(r)

u−k1 · · · u−kr w ∈ Cn+2 (W )

(3.6)

for r ≥ 2n , u(1) , . . . , u(r) ∈ V , w ∈ W, k1 , . . . , kr ≥ 2. Proof. Since u−i Cn+2 (W ) ⊂ Cn+2 (W ) for u ∈ V , i ≥ 0 (by Lemma 3.2), it suffices to 1 prove the assertion for r = 2n . Also, since u−k = (k−1)! (Dk−2 u)−2 for u ∈ V , k ≥ 2, it suffices to prove the assertion for k1 = · · · = kr = 2. We are going to use induction on n. If n = 0, by definition we have v−2 w ∈ C2 (W ) for v ∈ V , w ∈ W . Assume the assertion holds for n = p, some nonnegative integer. Assume that r = 2p+1 and set s = 2p . Let u(1) , . . . , u(r) ∈ V , w ∈ W . By inductive hypothesis we have (s+1)

u−2

(r)

· · · u−2 w ∈ Cp+2 (W ),

so that (1)

(r)

(1)

(s)

u−2 · · · u−2 w ∈ u−2 · · · u−2 Cp+2 (W ).

(3.7)

Consider a typical spanning vector a−p−2 w of Cp+2 (W ) for a ∈ V , w ∈ W . Using (3.3) and (3.2) we have u−2 · · · u−2 a−p−2 w ≡ a−p−2 u−2 · · · u−2 w (1)

(s)

(1)

(s)

mod Cp+4 (W ).

(3.8)

Furthermore, by inductive hypothesis, we have u−2 · · · u−2 w ∈ Cp+2 (W ), (1)

(s)

which together with Lemma 3.3 gives a−p−2 u−2 · · · u−2 w ∈ a−p−2 Cp+2 (W ) ⊂ Cp+3 (W ). (1)

(s)

(3.9)

Thus by (3.8) we have u−2 · · · u−2 a−p−2 w ∈ Cp+3 (W ), (1)

(s)

(1)

(s)

proving that u−2 · · · u−2 Cp+2 (W ) ⊂ Cp+3 (W ).

(3.10)

Therefore, by (3.7) we have (1)

(r)

u−2 · · · u−2 w ∈ Cp+3 (W ). This finishes the induction steps and completes the proof.

The relation between the two decreasing sequences {En (W )} and {Cn (W )} is described as follows:

Abelianizing Vertex Algebras

403

Theorem 3.5. Let W be any module for vertex algebra V and let EW = {En (W )} be the associated decreasing sequence. Then for any n ≥ 2, Cn (W ) ⊂ En−1 (W ), Em (W ) ⊂ Cn (W )

(3.11)

whenever m ≥ max{1, (n − 2)2

n−2

}.

(3.12)

Furthermore, ∩n≥0 En (W ) = ∩n≥0 Cn+2 (W ).

(3.13)

Proof. From the definitions of Cn (W ) and En−1 (W ) we immediately have Cn (W ) ⊂ En−1 (W ). Consider a generic spanning element of Em (W ) (with m ≥ 1): (1)

(r)

X = u−1−k1 · · · u−1−kr w, where r ≥ 1, u(1) , . . . , u(r) ∈ V , w ∈ W, k1 , . . . , kr ≥ 1 with k1 + · · · + kr ≥ m. (i) If ki ≥ n − 1 for some i, by (3.1) we have u−1−ki W ⊂ C−1−ki (W ) ⊂ Cn (W ) and then by (3.2) we have X ∈ Cn (W ). If r ≥ 2n−2 , by Proposition 3.4 X ∈ Cn (W ). Since k1 + · · · + kr ≥ m ≥ (n − 2)2n−2 , we have either ki ≥ n − 1 for some i or r ≥ 2n−2 . Therefore, X ∈ Cn (W ) whenever m ≥ (n − 2)2n−2 . This proves (3.12). Combining (3.12) and (3.11) we have (3.13). Corollary 3.6. For any vertex algebra V and any V -module W , we have E1 (W ) = C2 (W ), E2 (W ) = C3 (W ).

(3.14)

Proof. By (3.11) we have C2 (W ) ⊂ E1 (W ) and C3 (W ) ⊂ E2 (W ). On the other hand, by (3.12) with m = 1, n = 2 we have E1 (W ) ⊂ C2 (W ) and by (3.12) with m = 2, n = 3 we have E2 (W ) ⊂ C3 (W ). Recall the following result of Zhu [Z1, 2]: Proposition 3.7. Let V be any vertex algebra. Then V /C2 (V ) is a Poisson algebra with u¯ · v¯ = u−1 v,

[u, ¯ v] ¯ = u0 v

for u, v ∈ V ,

(3.15)

where u¯ = u + C2 (V ), and with 1 + C2 (V ) as the identity element. It is clear that the degree zero subspace E0 /E1 of gr E (V ) is a Poisson algebra where (u + E1 )(v + E1 ) = u−1 v + E1 ,

[u + E1 , v + E1 ] = u0 v + E1

for u, v ∈ V . With E0 (V ) = V and E1 (V ) = C2 (V ), we see that this Poisson algebra is nothing but Zhu’s Poisson algebra V /C2 (V ). Thus we have: Proposition 3.8. Let V be any vertex algebra. The degree zero subspace gr E (V )(0) = E0 (V )/E1 (V ) of the N-graded vertex Poisson algebra gr E (V ) is naturally a Poisson algebra which coincides with Zhu’s Poisson algebra V /C2 (V ) = E0 /E1 . The following result generalizes the result of Lemma 2.14:

404

H. Li

Proposition 3.9. Let V = Then

n≥t

V(n) be a lower truncated Z-graded vertex algebra.

Cn (V ) ⊂

(3.16)

V(k)

k≥2t+n−1

for n ≥ 2. Furthermore, Em (V ) ⊂

V(k)

whenever m ≥ (n − 2)2n−2 ,

(3.17)

k≥2t+n−1

∩n≥0 En (V ) = ∩n≥2 Cn (V ) = 0.

(3.18)

Proof. For homogeneous vectors u, v ∈ V and for any n ≥ 2 we have wt (u−n v) = wt u + wt v + n − 1 ≥ 2t + n − 1. In view of this we have Cn (V ) ⊂

V(k)

k≥2t+n−1

for n ≥ 2. This proves (3.16), from which we immediately have that ∩n≥0 Cn+2 (V ) = 0. Using Theorem 3.5, we obtain (3.17) and (3.18). For the rest of this section, we consider vertex algebras whose associated decreasing sequence E is trivial. First we have: Lemma 3.10. Let V be a vertex algebra and let W be a V -module. If W = C2 (W ), then En (W ) = Cn+2 (W ) = W

for all n ≥ 0.

(3.19)

Proof. Since W = C2 (W ), we have E1 (W ) = C2 (W ) = W . Assume that Ek (W ) = W for some k ≥ 1. Then v−2 W = v−2 Ek (W ) ⊂ Ek+1 (W )

for v ∈ V .

From this we have W = C2 (W ) ⊂ Ek+1 (W ), proving Ek+1 (W ) = W . By induction, we have En (W ) = W for all n ≥ 0. In view of Theorem 3.5 we have Cn (W ) = W for all n ≥ 2. Suppose that V is a vertex algebra such that C2 (V ) = V . By Lemma 3.10 we have Cn+2 (V ) = V for n ≥ 0, so that V = ∩n≥0 Cn+2 (V ). Furthermore, if there exists a lower truncated Z-grading V = n∈Z V(n) with which V becomes an Z-graded vertex algebra, by (3.18) (Proposition 3.9) we have ∩n≥0 Cn+2 (V ) = 0, so that V = ∩n≥0 Cn+2 (V ) = 0. Therefore we have proved: Proposition 3.11. Let V be a nonzero vertex algebra such that C2 (V ) = V . Then there does not exist a lower truncated Z-grading V = n∈Z V(n) with which V becomes an Z-graded vertex algebra.

Abelianizing Vertex Algebras

405

From [B and FLM], associated to any nondegenerate even lattice L of finite rank, we have a vertex algebra VL . Furthermore, VL is a vertex operator algebra if and only if L is positive-definite in the sense that α, α > 0 for 0 = α ∈ L. In this case, VL is N-graded by L(0)-weight (with 1-dimensional weight-zero subspace), so that Lemma 2.14 (and Proposition 3.9) applies to VL . On the other hand, we have: Proposition 3.12. Let L be a finite rank nondegenerate even lattice that is not positivedefinite and let VL be the associated vertex algebra. Then Cn+2 (VL ) = En (VL ) = VL for n ≥ 0. Furthermore, there does not exist a lower truncated Z-grading on VL with which VL becomes a lower truncated Z-graded vertex algebra. Proof. First we show that there exists α ∈ L such that α, α < 0. Since L is not positivedefinite, there exists 0 = β ∈ L such that β, β ≤ 0. If β, β = 0, that is, β, β < 0, then we can simply take α = β. Suppose β, β = 0. Since L is nondegenerate, there exists γ ∈ L such that γ , β = 0. For m ∈ Z, we have

γ + mβ, γ + mβ = γ , γ + 2m γ , β. We see that γ + mβ, γ + mβ < 0 for some m. Then we can take α = γ + mβ with the desired property. Let α ∈ L be such that α, α < 0 and set α, α = −2k with k ≥ 1. Using the explicit expression of the vertex operators in [FLM], we have (eα )−2k−1 e−α = 1, so that 1 ∈ C2k+1 (VL ) ⊂ C2 (VL ). Then v = v−1 1 ∈ C2 (VL ) for v ∈ VL . Thus C2 (VL ) = VL . By Lemma 3.10 we have Cn+2 (VL ) = En (VL ) = VL for n ≥ 0. The last assertion follows immediately from Proposition 3.11. 4. Generating Subspaces of Vertex Algebras In this section we shall use the differential algebra structure on gr E (V ) to study certain kinds of generating subspaces of lower truncated Z-graded vertex algebras. First we prove the following results for classical algebras: Lemma 4.1. Let (A, ∂) be an N-graded (unital) differential algebra such that (∂A)A = A+ , where

A(n) . (4.1) A+ = n≥1

Let S be a generating subspace of A(0) as an algebra. Then A is linearly spanned by the vectors ∂ n1 (a1 ) · · · ∂ nr (ar )

(4.2)

for r ≥ 0, n1 ≥ n2 ≥ · · · ≥ nr ≥ 0, a1 , . . . , ar ∈ S, or equivalently, S generates A as a differential algebra. In particular, A(0) generates A as a differential algebra. Furthermore, A is linearly spanned by the vectors ∂ n1 (a1 ) · · · ∂ nr (ar ) for r ≥ 1, n1 > n2 > · · · > nr ≥ 0, a1 , . . . , ar ∈ A(0) .

(4.3)

406

H. Li

Proof. First, we show that A as a differential algebra is generated by A(0) . Let A be the differential subalgebra of A, generated by A(0) . We are going to show (by induction) that kn=0 A(n) ⊂ A for all k ≥ 0. From the definition, we have A(0) ⊂ A . Assume that kn=0 A(n) ⊂ A for some k ≥ 0. Consider the subspace A(k+1) of A. From our assumption, we have A(k+1) ⊂ A+ = A∂A, so A(k+1) is linearly spanned by the vectors a∂b for a ∈ A(r) , b ∈ A(s) with r + s + 1 = k + 1. For any a ∈ A(r) , b ∈ A(s) with r + s + 1 = k + 1, since r, s ≤ k (with r, s ≥ 0), by the inductive hypothesis, we have a, b ∈ A . Consequently, a∂b ∈ A . Thus A(k+1) ⊂ A . This proves that kn=0 A(n) ⊂ A for all k ≥ 0. Therefore, we have A = A , proving that A as a differential algebra is generated by A(0) . It follows that if S generates A(0) as an algebra, then S generates A as a differential algebra. For a positive integer n, let A(n) be the subspace of A(n) spanned by the vectors ∂ k1 (a1 ) · · · ∂ kr (ar )b

(4.4)

for r ≥ 1, k1 > k2 > · · · > kr ≥ 1, a1 , . . . , ar , b ∈ A(0) with k1 + · · · + kr = n. We must prove A(n) = A(n) for all n ≥ 1. For a positive integer n, denote by Pn the set of partitions of n. We now endow Pn with the reverse order of the lexicographic order on Pn . Set P = ∪n≥1 Pn . For α ∈ Pm , β ∈ Pn , combining α and β together we get a partition of m + n, which we denote by α ∗ β. Clearly, this defines an abelian semigroup structure on P . Furthermore, for α, β ∈ Pn , γ ∈ P , if α > β, then α ∗ γ > β ∗ γ . That is, the order is compatible with the multiplication. For α ∈ Pn , define Aα(n) to be the linear span of the vectors ∂ k1 (a1 ) · · · ∂ kr (ar )b for r ≥ 1, k1 ≥ k2 ≥ · · · ≥ kr ≥ 1, a1 , . . . , ar , b ∈ A(0) with k1 + · · · + kr = n and (k1 , . . . , kr ) ≤ α. Since A(0) generates A as a differential algebra, {Aα(n) } is a (finite) increasing filtration of A(n) . For a, b ∈ A(0) and k ≥ 1, we have ∂ (ab) = 2k

2k 2k i=0

which can be rewritten as 2k k ∂ (a)∂ k (b) k = ∂ (ab) − ∂ (a)b − ∂ (b)a − 2k

2k

2k

i

∂ 2k−i (a)∂ i (b),

k−1 2k i=1

i

∂ 2k−i (a)∂ i (b) + ∂ i (a)∂ 2k−i (b) . (4.5)

We see that (k, k) > (2k), (2k − i, i) for 1 ≤ i ≤ k − 1.

Abelianizing Vertex Algebras

407

Now consider a typical element of Aα(n) , X = ∂ k1 (a1 ) · · · ∂ kr (ar )b for (k1 , . . . , kr ) ∈ Pn , a1 , . . . , ar , b ∈ A(0) with (k1 , . . . , kr ) ≤ α. If all k1 > k2 > · · · > kr , then X ∈ A(n) . Otherwise, using (4.5) we see that X∈

β

A(n) .

β<α

Now it follows immediately from induction that A(n) = A(n) .

Lemma 4.2. Let V be a vertex algebra and let A = gr E (V ) be the vertex Poisson algebra, obtained in Theorem 2.12, which is in particular an N-graded (unital) differential algebra. Then A+ = A∂A. Furthermore, for any V -module W , the associated graded vector space gr E (W ) is an A-module with (u + Em+1 (V )) · (w + En+1 (W )) = u−1 w + Em+n+1 (W )

(4.6)

for u ∈ Em (V ), w ∈ En (W ) with m, n ∈ N, and gr E (W ) as an A-module is generated by E0 (W )/E1 (W ), i.e., gr E (W ) = A(E0 (W )/E1 (W )).

(4.7)

Proof. We have A = n∈N A(n) , where A(n) = En /En+1 for n ∈ N. For n ≥ 1, from Lemma 2.8, En is linearly spanned by the vectors u−2−i v ∈ En , where u ∈ V , v ∈ En−1−i for 0 ≤ i ≤ n − 1, and furthermore, we have 1 (Di+1 u)−1 v + En+1 (i + 1)! 1 (Di+1 u + Ei+2 )(v + En−i ) = (i + 1)! 1 = ∂ i+1 (u + E1 )(v + En−i ) (i + 1)! ∈ A∂A,

u−2−i v + En+1 =

noticing that for any r ∈ Z, DEr ⊂ Er+1 from the definition of D and Lemma 2.11. This proves En /En+1 ⊂ A∂A for n ≥ 1, so that A+ ⊂ A∂A. We also have that A∂A ⊂ AA+ ⊂ A+ . Therefore, A∂A = A+ . For a V -module W , from Proposition 2.13 gr E (W ) is a module for gr E (V ) as an algebra. We must prove that En (W )/En+1 (W ) ⊂ A(E0 (W )/E1 (W )) for n ≥ 1. By Lemma 2.8, En (W ) is linearly spanned by the subspaces u−2−i En−1−i (W ) for u ∈ V , 0 ≤ i ≤ n − 1. For w ∈ En−1−i (W ), we have u−2−i w + En+1 (W ) =

1 ∂ i+1 (u + E1 )(w + En−i (W )). (i + 1)!

Then it follows immediately from induction.

Combining Lemmas 4.1 and 4.2 we immediately have:

408

H. Li

Corollary 4.3. Let V be a vertex algebra and let gr E (V ) be the vertex Poisson algebra obtained in Theorem 2.12. Then gr E (V ) is linearly spanned by the vectors ∂ k1 (v (1) + E1 ) · · · ∂ kr (v (r) + E1 )

(4.8)

for r ≥ 1, v (i) ∈ V , k1 > · · · > kr ≥ 0. In particular, gr E (V ) as a differential algebra is generated by the subspace E0 /E1 (= V /C2 (V )). The following result generalizes a theorem of [GN] (see also [NT]): Proposition 4.4. Let V be any vertex algebra. If V is C2 -cofinite, then V is En -cofinite and Cn+2 -cofinite for any n ≥ 0. Proof. Since dim V /C2 < ∞, it follows from Corollary 4.3 that for each n ≥ 0, the degree n subspace En /En+1 of gr E (V ) is finite dimensional. Consequently, dim V /En = dim E0 /En < ∞ for all n ≥ 0. For any n ≥ 2, by (3.17) we have Em ⊂ Cn for m = (n − 2)2n−2 . Then dim V /Cn ≤ dim V /Em < ∞. Furthermore we have (cf. [Bu1, 2]): Proposition 4.5. Let V be any vertex algebra and W any V -module. If V and W are C2 -cofinite, then W is Cn -cofinite for all n ≥ 2. Proof. In the proof of Proposition 4.4, we showed that gr E (V ) is an N-graded differential algebra with finite-dimensional homogeneous subspaces. Since dim W/C2 (W ) < ∞, it follows from (4.7) that all the homogeneous subspaces of gr E (W ) are finite-dimensional. The same argument of Proposition 4.4 shows that W is Cn -cofinite for all n ≥ 2. Remark 4.6. It has been proved in [Bu1 and NT] that if V is a vertex operator algebra with nonnegative weights and with V(0) = C1 and if V is C2 -cofinite, then any irreducible V -module W is Cn -cofinite for all n ≥ 2. The following result generalizes a theorem of [GN] (cf. [Bu1-2, ABD]): Theorem 4.7. Let V = n≥t V(n) be any lower truncated Z-graded vertex algebra such as a vertex operator algebra in the sense of [FLM and FHL]. Then for any graded subspace U of V , V = U + C2 (V ) if and only if V is linearly spanned by the vectors (1)

(r)

u−n1 · · · u−nr 1

(4.9)

for r ≥ 0, n1 > · · · > nr ≥ 1, u(1) , . . . , u(r) ∈ U . Proof. Assume that V = U + C2 (V ). Denote by A the vertex Poisson algebra gr E (V ) obtained in Theorem 2.12. In particular, A is an N-graded (unital) differential algebra. Recall that A = n∈N A(n) , where A(n) = En /En+1 for n ∈ N. Let K be the subspace of V , spanned by those vectors in (4.9). Clearly, K is a graded subspace. For m ≥ 0, set Km = K ∩ Em . For any linear operator F on a vector space and for any nonnegative integer n, we set F (n) = F n /n!. From Corollary 4.3, for any m ≥ 0, Em /Em+1 is linearly spanned by the vectors ∂ (k1 ) (u(1) + E1 ) · · · ∂ (kr ) (u(r) + E1 )

Abelianizing Vertex Algebras

409

for r ≥ 1, u(i) ∈ U, k1 > k2 > · · · > kr ≥ 0 with k1 + · · · + kr = m. By definition we have ∂ (k1 ) (u(1) + E1 ) · · · ∂ (kr ) (u(r) + E1 ) = (D(k1 ) u(1) + Ek1 +1 ) · · · (D(kr ) u(r) + Ekr +1 ) (1)

(r)

= u−1−k1 · · · u−1−kr 1 + Em+1 . It follows that Em = Km + Em+1 . Then V = E0 = K0 + K1 + · · · + Kn + En+1 ⊂ K + En+1 for any n ≥ 0. Since K and En+1 are graded subspaces and since Em ⊂ for m ≥ (n − 2)2n by (3.17), we must have

k≥2t+n−1 V(k)

V = K = K0 + K1 + K2 + · · · , proving the desired spanning property. Conversely, assume the spanning property. Notice that if r ≥ 2, we have n1 ≥ 2, so (1) (r) (1) (r) that u−n1 · · · u−nr 1 ∈ C2 (V ). If nr ≥ 2, we also have u−n1 · · · u−nr 1 ∈ C2 (V ). Then we get V ⊂ U + C2 (V ), proving V = U + C2 (V ). By slightly modifying the proof of Theorem 4.7 we immediately obtain the following result (cf. [KL]): Theorem 4.8. Let V = n≥t V(n) be a lower truncated Z-graded vertex algebra such as a vertex operator algebra in the sense of [FLM and FHL] and let S be a graded subspace of V such that {u + C2 (V ) | u ∈ S} generates V /C2 (V ) as an algebra. Then V is linearly spanned by the vectors (1)

(r)

u−n1 · · · u−nr 1 for r ≥ 0, u(1) , . . . , u(r) ∈ S, n1 ≥ · · · ≥ nr ≥ 1. Furthermore, if S is linearly ordered, V is linearly spanned by the above vectors with u(i) > u(i+1) when ni = ni+1 . Definition 4.9. Let S be a subset of a vertex algebra V . We say that S is a type 0 generating subset of V if V is the smallest vertex subalgebra containing S, S is a type 1 generating subset of V if V is linearly spanned by the vectors (1)

(r)

u−k1 · · · u−kr 1

(4.10)

for r ≥ 0, u(i) ∈ S, ki ≥ 1. S is called a type 2 generating subset of V if for any linear order on S (if S is a vector space, replace S with a basis), V is linearly spanned by the above vectors with u(i) > u(i+1) when ni = ni+1 . Remark 4.10. A type 0 generating subset is just a generating subset in the usual sense and a type 1 generating subset of V is also called a strong generating subset V in [K]. Theorem 4.11. Let V be a lower truncated Z-graded vertex algebra and let U be a graded subspace. Then the following three statements are equivalent: (a) U is a type 1 generating subspace of V . (b) U is a type 2 generating subspace of V . (c) U/C2 (V ) = {u + C2 (V ) | u ∈ U } generates V /C2 (V ) as an algebra.

410

H. Li

Proof. By definition, (b) implies (a) and by Theorem 4.8, (c) implies both (a) and (b). Now it suffices to prove that (a) implies (c). Assuming (a) we have that V /C2 (V ) is (1) (r) linearly spanned by the vectors u−k1 · · · u−kr 1 + C2 (V ) for r ≥ 0, u(i) ∈ U, ki ≥ 1. If (1)

(r)

ki ≥ 2 for some i, we have u−k1 · · · u−kr 1 ∈ C2 (V ). Then V /C2 (V ) is linearly spanned (1)

(r)

by the vectors u−1 · · · u−1 1 + C2 (V ) for r ≥ 0, u(i) ∈ U . That is, U/C2 (V ) generates V /C2 (V ) as an algebra. With Lemma 4.2, from the proof of Theorem 4.7 we immediately have: Proposition 4.12. Let V be a lower truncated Z-graded vertex algebra and let U be a graded subspace of V such that U generates V /C2 (V ) as an algebra. Let W be a lower truncated Z-graded V -module and let W 0 be a graded subspace of W such that W = W 0 + C2 (W ). Then W is spanned by the vectors (1)

(r)

u−1−k1 · · · u−1−kr w for r ≥ 1, u(1) , . . . , u(r) ∈ U, w ∈ W 0 , k1 > · · · > kr ≥ 0. References [ABD] Abe, T., Buhl, G., Dong, C.: Rationality, regularity and C2 -cofiniteness. http:// arxiv.org/list/math.QA/0204021, 2002 [AN] Abe, T., Nagatomo, K.: Finiteness of conformal blocks over the projective line. In: Vertex Operator Algebras in Mathematics and Physics, Proc. of Workshop at Fields Institute for Research in Mathematical Sciences, 2000, Berman, S., Billig, Y., Huang, Y.-Z., Lepowsky, J. (eds.), Fields Institute Communications 39, Providence, RI: Amer. Math. Soc., 2003 [B] Borcherds, R. E.: Vertex algebras, Kac-Moody algebras, and the Monster. Proc. Natl. Acad. Sci. USA 83, 3068–3071 (1986) [Bu1] Buhl, G.: A spanning set for VOA modules. J. Algebra 254, 125–151 (2002) [Bu2] Buhl, G.: Rationality and C2 -cofiniteness is regularity. Ph.D. thesis, University of California, Santa Cruz, 2003 [DLM] Dong, C., Li, H.-S., Mason, G.: Vertex Lie algebra, vertex Poisson algebras and vertex algebras. In: Recent Developments in Infinite-Dimensional Lie Algebras and Conformal Field Theory, Proceedings of an International Conference at University of Virginia, May 2000, Contemp Math. 297, 69–96 (2002) [FB] Frenkel, E., Ben-Zvi, D.: Vertex Algebras and Algebraic Curves. Mathematical Surveys and Monographs, Vol. 88, Providence, RI: Amer. Math. Soc., 2001 [FHL] Frenkel, I., Huang, Y.-Z., Lepowsky, J.: On axiomatic approaches to vertex operator algebras and modules. Memoirs Amer. Math. Soc. 104, 1993 [FLM] Frenkel, I., Lepowsky, J., Meurman, A.: Vertex Operator Algebras and the Monster. Pure and Appl. Math. Vol. 134, Boston: Academic Press, 1988 [FZ] Frenkel, I., Zhu,Y.: Vertex operator algebras associated to representations of affine and Virasoro algebras. Duke Math. J. 66, 123–168 (1992) [GN] Gaberdiel, M., Neitzke, A.: Rationality, quasirationality and finite W-algebras. Commun. Math. Phys. 238, 305–331 (2003) [K] Kac, V. G.: Vertex Algebras for Beginners. University Lecture Series 10, Providence, RI: Amer. Math. Soc., 1997 [KL] Karel, M., Li, H.-S.: Certain generating subspaces for vertex operator algebras. J. Alg. 217, 393–421 (1999) [LL] Lepowsky, J., Li, H.-S.: Introduction to Vertex Operator Algebras and Their Representations. Progress in Math. 227, Boston, MA: Birkh¨auser, 2003 [Li1] Li, H.-S.: Some finiteness properties of regular vertex operator algebras. J. Algebra 212, 495– 514 (1999) [Li2] Li, H.-S.: Vertex algebras and vertex Poisson algebras. Commun. Contemp. Math. 6, 61–110 (2004) [M] Miyamoto, M.: Modular invariance of vertex operator algebras satisfying C2 -cofiniteness. Duke Math. J. 122, 51–91 (2004)

Abelianizing Vertex Algebras [NT] [P] [Z1] [Z2]

411

Nagatomo, K., Tsuchiya, A.: Conformal field theories associated to regular chiral vertex operator algebras I: theories over the projective line. http://arxiv.org/list/math.QA/0206223, 2002 Primc, M.: Vertex algebras generated by Lie algebras. J. Pure Appl. Alg. 135, 253–293 (1999) Zhu, Y.-C.: Vertex operator algebras, elliptic functions and modular forms. Ph.D. thesis, Yale University, 1990 Zhu, Y.: Modular invariance of characters of vertex operator algebras. J. Amer. Math. Soc. 9, 237–302 (1996)

Communicated by Y. Kawahigashi

Commun. Math. Phys. 259, 413–432 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1340-7

Communications in

Mathematical Physics

The Structure of the Ladder Insertion-Elimination Lie Algebra Igor Mencattini1 , Dirk Kreimer2 1

Boston University, Department of Mathematics and Statistics, Boston University, 111 Cummington Street, Boston, MA 02215, USA. E-mail: igorre@@math.bu.edu 2 CNRS at IHES, 35, route de Chartres, 91440 Bures-sur-Yvette, France. E-mail: [email protected] Received: 17 September 2004 / Accepted: 18 November 2004 Published online: 15 April 2005 – © Springer-Verlag 2005

Abstract: We continue our investigation into the insertion-elimination Lie algebra LL of Feynman graphs in the ladder case, emphasizing the structure of this Lie algebra relevant for future applications in the study of Dyson–Schwinger equations. We work out the relation to the classical infinite dimensional Lie algebra gl + (∞) and we determine the cohomology of LL . 1. Introduction In the last few years perturbative QFT has been shown to have a rich algebraic structure [9] leading to relations with apparently unrelated sectors of mathematics like noncommutative geometry and Riemann-Hilbert like problems [4, 5]. Such extraordinary relations can be summarized, to some extent, by the existence of a commutative, non co-commutative Hopf algebra H defined on the set of Feynman diagrams. We will continue the investigation started in [15] where we discussed first relations of perturbative QFT with the representation theory of Lie algebras. In that paper we introduced the ladder Insertion-Elimination Lie algebra LL and we discussed relations of this Lie algebra with some more classical (infinite dimensonal) Lie algebras. In what follows we describe in greater detail the structure of this Insertion-Elimination Lie algebra. The plan of the paper is as follows: in Section Two we give some motivations for the relevance of the ladder insertion elimination Lie algebra LL for full QFT. In particular we stress the relation of LL to the quantum equations of motion or Dyson-Schwinger equations (DSEs). This section is meant to give the motivations from physics for the mathematical endeavor undertaken in the following sections. In Section Three we recollect some basic fact about the Lie algebra LL taken from [15]. Sections Four and Five are the core of this paper: in Sect. Four we give a structure D.K. supported by CNRS; both authors supported in parts by NSF grant DMS-0401262, Ctr. Math. Phys. at Boston Univ.; BUCMP/04-06.

414

I. Mencattini, D. Kreimer

theorem that stresses the relation of the Lie algebra LL with the classical infinite dimensional Lie algebra gl + (∞). Finally, in Sect. Five we collect some basic results about the cohomology of the Lie algebra LL . 2. The Significance of Zn,m The Lie algebra LL on generators Zn,m is an insertion elimination Lie algebra [15] obtained from these operations applied to a cocommutative and commutative Hopf algebra Hcomm built on generators (ladders) tn , n ≥ 0, (tn ) = nj=0 tj ⊗ tn−j , on which it acts as a derivation Zi,j (tn ) = (n − j )tn−j +i , where (n − j ) is defined as (n − j ) = 1 for n − j ≥ 0, and 0 otherwise. This seems to give just a glimpse of the full insertion-elimination Lie algebra of [6], which acts as a derivation on the full Hopf algebra of Feynman graphs in a renormalizable quantum field theory. Nevertheless, a full understanding of Zn,m goes a long way in understanding the full insertion elimination Lie algebra [13], using the fact that LL acts on elements in the full Hopf algebra which are homogenous in the appropriate grading resulting from the Hochschild cohomology of that very Hopf algebra [10]. There are two strong reasons for that: i) quantum field theory sums over all skeleton graphs in a symmetric fashion, ii) non-linear Dyson–Schwinger equations (DSEs) modify linear DSEs precisely by the anomalies generated by a non-vanishing β-function. The first fact ensures that we can work on homogeneous elements in the Hopf algebra; the second one ensures that there are effective methods available to deal with the operadic aspects of graph insertions. Here, DSEs are introduced combinatorially via a fixpoint equation in the Hochschild cohomology of a connected graded commutative Hopf algebra. Let us summarize the main features which emerged in recent work [11, 10, 12, 13]. Under the Feynman rules the Hochschild 1-cocycles, provided by the Hopf algebra of graphs, map to integral operators provided by the underlying skeletons of the theory. Renormalization conditions are determined by suitable boundary conditions for the integral equations so generated. The DSEs determine the Green functions from this Hochschild cohomology of the Hopf algebra of Feynman graphs, which is itself derived from free quantum field theory and the choice of renormalizable interactions. Indeed, following [12, 10], the identification of these 1-cocycles leads to a combinatorial Dyson–Schwinger equation: r = 1 +

[1] p∈HL res(p)=r

= 1+

∈HL res()=r

α |p| p B (Xp ) Sym(p) + α || . Sym()

(1)

The first sum in (1) is over a finite (or countable) set of Hopf algebra primitives Feynman graphs p, such that: (p) = p ⊗ e + e ⊗ p, p B+

(2)

each p indexing the closed Hochschild 1-cocycles above. The second sum in (1) is over all one-particle irreducible graphs contributing to the desired Green function, all

Insertion-Elimination Lie Algebra

415

weighted by their symmetry factors. Here, Xp is a polynomial in all r , and the superscript r ranges over the finite set (in a renormalizable theory) of superficially divergent Green functions. It indicates the number and type of external legs reflecting the monomials in the underlying Lagrangian. We use res(p) = r to indicate that the external legs of the graph p are of type r. The structure of these equations allows for a proof of locality using Hochschild cohomology [10], which is also evident using a coordinate space approach [2]. These fixpoint equations are solved by the following Ansatz: r = 1 +

∞

r

α k ck .

(3)

k=1

We grant ourselves the freedom to call such an equation a DSE or a combinatorial equation of motion for a simple reason: the DSEs of any renormalizable quantum field theory can be cast into this form. Crucially, in the above it can be shown (see [13], which we follow here) that |p|

Xp = res(p) Xcoupl ,

(4)

where Xcoupl is a connected Green function which maps to an invariant charge under the Feynman rules. This is rather obvious: consider, as an example, the vertex function in quantum electrodynamics. A n loop primitive graph p, contributing to it, provides 2n+1 internal vertices, 2n internal fermion propagators and n internal photon propagators. An invariant charge [8] is provided by a vertex function multiplied by the squareroot of the photon propagator and the fermion propagator. Thus the integral kernel corresponding to p is dressed by 2n invariant charges, and one vertex function. This is a general fact: each integral kernel corresponding to a Green function with external legs r in a renormalizable quantum field theory is dressed by a suitable power of invariant charges proportional to the grading of that kernel, and one additional appearance of r itself. This immediately shows that for a vanishing β-function the DSEs are reduced to a linear set of equations, and that the general case can be most efficiently handled by an expansion in the breaking of conformal symmetry induced by a non-vanishing β-function. Thus, a complete understanding of the linear case goes a long way in understanding the full solution. This emphasizes the crucial role which the insertion-elimination Lie algebra [6] in the ladder case [15] plays in the full theory: it defines an algebra of graphs which provide an underlying field of residues, which is then extended by the contributions resulting from a non-trivial β-function. The resulting scaling anomalies extend the Hopf algebra of graphs to a non-cocommutative one. Moreover, they force the appearance of new transcendental numbers and they result in the appearance of non-trivial representations of the symmetric group in the operad of graph insertions [13]. Here, we study the underlying linear DSEs which would suffice for a vanishing β-function. Indeed, we now define the linear DSE associated to the system above: r

lin = 1 +

[1] p∈HL res(p)=r

p

α |p| r p B ( ). Sym(p) + lin

(5)

The Hochschild closedness of B+ then ensures that we obtain a Hopf algebra isomorphic to the word Hopf algebra based on letters p, which we obtain as the underlying Dyson skeletons in the expansion of r .

416

I. Mencattini, D. Kreimer

The solutions of the linear DSE above are graded by the order in α and by the augmentation degree r

lin = 1 +

∞

∞

α j cj = 1 +

j =1

(6)

dj .

j =1

Here, cj is the sum of all words of order α j , where the degree | w | of a word w is the sum of the degree of its letters, and the degree of a letter is the loop number of the accompanying skeleton graph. These words uniquely correspond to Feynman graphs obtained by inserting primitive graphs into each other, where insertion now happens at a single vertex or edge in accordance with that linear DSE. On the other hand, dj is the sum α |w| w, (7) dj = j

w∈Haug

of all words made out of j letters, and we set | w |aug = j , the augmentation degree. Having defined the associated linear system, the propagator-coupling dualities [3] provide the general solution once the representation theory of the symmetric group has been established, which reflects the operadic nature of graph insertions [13]. But a complete understanding of linear Dyson–Schwinger equations comes first. To this end, it is profitable to study how the insertion elimination Lie algebra acts on the Hopf algebra of graphs in that case. In this paper, we start some groundwork by clarifying the structure of the insertion elimination Lie algebra which relates to the Hopf algebra structure of a linear DSE. The crucial point is always the identification of the Hochschild closed 1-cocycles in p the Hopf algebra of graphs B+ , typically parametrized by primitive elements p of the Hopf algebra of graphs [10]. We first mention that the Hopf algebra of graphs contains, as a corollary of the results in [6], a sub-Hopf algebra Hw of graphs generated by the linear DSE. It is naturally based on graphs which can be regarded as words, with the corresponding insertion-elimination Lie algebra Lw . It acts on the Hopf algebra Hw as w1 v if w = w2 v for some v Zw1 ,w2 (w) = (8) 0, if w has not this form. The Lie bracket in Lw is then [Zw1 ,w2 , Zw3 ,w4 ] = ZZw

1 ,w2 (w3 ),w4

−ZZw

− Zw3 ,Zw

3 ,w4 (w1 ),w2

K Z −δw 2 ,w3 w1 ,w4

2 ,w1 (w4 )

+ Zw1 ,Zw

4 ,w3 (w2 )

K + δw Z . 1 ,w4 w3 ,w2

(9)

See [6] for notation. of the Lie algebra Zn,m comes from the fact that the map B+ = The|p|significance p p α B+ maps the linear DSE to the fundamental DSE (which also underlies the polylog [12]) X = 1 + B+ (X),

(10)

where B+ is of order α. Note that B+ is not homogenous in α (there are primitive graphs of any degree in the coupling), but it is homogenous in the augmentation degree: all terms in its defining sum enhance this degree by one.

Insertion-Elimination Lie Algebra

417

There is a natural inclusion ιH from Hcomm to Hw which sends tn → dn . This induces a map Zw1 ,w2 ιL : L → Lw , Zn,m → , (11) #(m) |w1 |aug =n,|w2 |aug =m

where #(m) is the number of words of degree m, such that ιH (Zn,m (tk ))=ιL (Zn,m )(ι(tk )). It is compatible with the Lie bracket: [ιL (Zn1 ,m1 ), ιL (Zn2 ,m2 )] (ιH (tn )) = ιH [Zn1 ,m1 , Zn2 ,m2 ](tn ) . (12) As long as we study linear DSEs, the ladder insertion elimination Lie algebra on generators Zn,m suffices, where it now acts by increasing and decreasing the augmentation degree. In [13] the reader can find a discussion of the Galois theory, which is missing, to handle the general case. The study of such questions in QFT is a beautiful mathematical problem in its own right. It gives mathematical justification to early ideas [14] of the use of anomalous dimensions and bootstrap equations in QFT to absorb short-distance singularities. Progress along these lines following [12, 13] will be reported in future work. We now continue to treat LL . 3. Generalities about the Ladder Insertion-Elimination Lie Algebra Let us recall some basic definition from [15] to which we refer for the details which are omitted in what follows. Let us introduce the Lie algebra LL via generators and relations: Definition 3.1. LL = spanC Zn,m | n, m ∈ Z≥0 , with: Zn,m , Zl,s = (l − m)Zl−m+n,s − (s − n)Zl,s−n+m −(n − s)Zn−s+l,m + (m − l)Zn,m−l+s −δm,l Zn,s + δn,s Zl,m , where

(13)

(l − m) = 0 if l < m, (l − m) = 1 if l ≥ m,

and where δn,m is the usual Kronecker delta: δn,m = 1 if m = n, δn,m = 0 if n = m. Then we have Corollary 3.2. [15] 1) LL is Z-graded Lie algebra: LL = ⊕i∈Z li , where for each Zn,m ∈ li , deg(Zn,m ) = i = n − m and dimC li = +∞; 2) LL has the following decomposition: LL = L+ ⊕ L0 ⊕ L− , where L+ = ⊕n>0 ln , L− = ⊕n<0 ln and L0 = l0 .

(14)

(15)

418

I. Mencattini, D. Kreimer

Proof. The proof follows from the very definition of a graded Lie algebra and from the formula (13). We recall here the definition of graded Lie algebra: a Lie algebra g is G-graded (where G is any abelian group) if g = ⊕i∈G gi and [gi , gj ] ⊂ gi+j . We conclude this section with the following proposition: Proposition 3.3. Each element Zn,m ∈ LL can be written in the following form: Zn,m = [Zn,0 , Z0,m ] + (n − m)Zn−m,0 + (m − n)Z0,m−n − δn−m,0 Z0,0 . (16) Proof. The statement follows trivially by applying formula (13) to the elements Zn,0 and Z0,m for n > m, n < m and n = m. Remark 3.4. The previous proposition is equivalent to the following (vector space) decomposition of the Lie algebra LL : LL = [D, D] ⊕ D, where we define D = a+ ⊕ a− ⊕ C, and a+ = spanC Zn,0 : n > 0, a− = spanC Z0,n : n > 0 and C is the trivial Lie algebra generated by Z0,0 . In fact a+ and a− are commutative sub algebras of LL and Z0,0 is a central element. 4. Structure of the Lie Algebra LL Let us start this section with two statements whose proofs are collected at the end of the this section. Theorem 4.1. The center of the Lie algebra LL has dimension one and it is generated by the element Z0,0 . Theorem 4.2. l 0 is a maximal abelian sub-algebra of LL . In what follows we will show that the Lie algebra LL is not simple. Let us introduce the following: Definition 4.3. [15] gl + (∞) = spanC Ei,j : Zi,j − Zi+1,j +1 | i, j ∈ Z≥0 . We have: Proposition 4.4. 1) [Ei,j , Er,k ] = Ei,k δj,r − Er,j δk,i ; 2) gl+ (∞) is an ideal in LL .

Insertion-Elimination Lie Algebra

419

Proof. The proof of 1) and 2) is a simple but tedious application of the commutator formula (13). We can now define the quotient Lie algebra: C = LL /gl + (∞) and consider the exact sequence: π

0 → gl + (∞) → LL → C → 0.

(17)

It is now clear that to have a better understanding of the Lie algebra LL we need to study carefully the structure of the Lie algebra C. The crucial ingredient will be the following proposition: Proposition 4.5. gl+ (∞) = [LL , LL ]. Proof. Let us prove the two inclusions. gl + (∞) ⊂ [LL , LL ] since from the definition of gl+ (∞): Ei,j = Zi,j − Zi+1,j +1 = [Zi,0 , Z0,j ] − [Zi+1,0 , Z0,j +1 ], where the second equality follows from the formula (16). To show the other inclusion, i.e [LL , LL ] ⊂ gl + (∞), it suffices to observe that for any two generators, say Zh,p and Zr,q , of LL we have that their commutator is given by the difference between two elements having same degree (see formula (13)), say Zn,m and Zl,s , such that n − m = l − s. Under the hypothesis that k = n − m = l − s > 0 and that s > m (the other cases are completely analogous), we can write their difference as follows: Zn,m − Zl,s = Zm+k,m − Zs+k,s = Zm+k,m − Zm+k+1,m+1 + +Zm+k+1,m+1 − .... − Zs+k−1,s−1 + Zs+k−1,s−1 − Zs+k,s , which expresses the difference between Zn,m and Zl,s as finite linear combination of elements in gl + (∞). In particular we can rephrase the previous proposition as follows: Lemma 4.6. Two generators Zn,m and Zl,s are gl+ (∞)-equivalent if and only if they have the same degree, i.e: Zn,m ∼ Zl,s ⇐⇒ deg(Zm,n ) = deg(Zl,s ). Proof. Using the same argument we used to prove Proposition 4.5, we can conclude that if Zn,m and Zl,s have the same degree then they are equivalent. Suppose now that the difference between Zn,m and Zl,s can be written as a (finite) linear combination of elements in gl+ (∞) and also that n − m = l − s (w.l.o.g. we can assume that n − m > 0 and that l − s > 0). Under these assumptions, and from formula (16), it follows also that: Zn−m,0 − Zl−s,0 = f inite ai Epi ,qi . But this has as a consequence that each of these two elements are finite linear combinations of (homogeneous) elements in gl+ (∞). Accordingly we can write: Zn−m,0 = f inite ci Eri ,ki and Zl−s,0 = f inite ci Eti ,vi .

420

I. Mencattini, D. Kreimer

Rewriting the right-hand side of each of those two equalities in terms of the generators Zn,m , it follows that such equations can not hold. From Proposition 4.5 it follows that C is a (maximal) commutative Lie algebra coming from a quotient of LL . Let us now introduce a set of (natural) generators for C. Since the set Zn,m | n, m ∈ Z≥0 is a basis for LL and since π : LL −→ C is a surjection, it follows that Z n,m = π(Zn,m )| n, m ∈ Z≥0 is a set of generators for C. Moreover from Lemma 4.6 it follows that when n > m, then Zn,m ∼ Zn−m,0 , when m > n, then Zn,m ∼ Z0,m−n and, finally, when n = m, then Zn,m ∼ Z0,0 . So defining Zn = Z n,0 , Z−n = Z 0,n for n > 0 and Z0 = Z 0,0 , we get C = spanC Zn |n ∈ Z. The fact that such elements are also linearly independent (i.e. they form a basis for C) follows easily from Lemma 4.6. We now want to look more closely at the exact sequence (17). In particular we will prove the following result: Theorem 4.7. The exact sequence (17) does not split, i.e the Lie algebra LL is not the semi-direct product of the Lie algebra gl+ (∞) with the (commutative) Lie algebra C. Before addressing the proof of Theorem 4.7, we will introduce some preliminaries to make the paper as self-contained as possible. From the exact sequence (17) and from what we explained above, we can conclude that the Lie algebra LL is a non-abelian extension of the commutative Lie algebra C by the Lie algebra gl+ (∞). Let us explain with some care the meaning of such a statement (for more details we refer to the paper [1]). In what follows, g, h and e will be Lie algebras. Definition 4.8. [1] We will say that the Lie algebra e is an extension of the Lie algebra g by the Lie algebra h, if g, h and e fit into the following exact sequence: π

0 → h → e → g → 0.

(18)

Moreover we will say that two such extensions, e and e , are equivalent, if and only if e and e are isomorphic as Lie algebras. Let Der(h) be the Lie algebra of derivations of h, α , α ∈ HomC (g, Der(h)) and ρ , ρ ∈ HomC (2 g, h). On the set of the couples (α, ρ) introduced above, we define the following equivalence relation: (α, ρ) ∼ (α , ρ ) ⇐⇒ ∃ b ∈ H omC (g, h) such that: α (x).ξ = α(x).ξ + [b(x), ξ ]h , ρ (x, y) = ρ(x, y) + α(x).b(y) − α(y).b(x) − b([x, y]g ) + [b(x), b(y)]h .

Insertion-Elimination Lie Algebra

421

Then we have that Theorem 4.9. [1] 1) The classes of isomorphism of the extensions of the Lie algebra g by the Lie algebra h given by the exact sequence (18), are in one-to-one correspondence with the equivalence classes [(α, ρ)], such that: [α(x), α(y)]Der(h) .ξ − α([x, y]g ).ξ = [ρ(x ∧ y), ξ ]h , α(x).ρ(y, z) − ρ([x, y]g , z) = 0, cyclic

for every x, y, z ∈ g and ξ ∈ h. 2) The Lie algebra structure induced by the datum (α, ρ), on the vector space e = h ⊕ g, is given by [(ξ1 , x1 ), (ξ2 , x2 )]e = ([ξ1 , ξ2 ]h + α(x1 ).ξ2 − α(x2 ).ξ1 + ρ(x1 , x2 ), [x1 , x2 ]g ). (19) We apply this result to our setting, where g = C and h = gl + (∞). The exact sequence (17) tells us that we have: LL gl + (∞) ⊕ C, where such a splitting holds in the category of vector spaces. We first prove that Proposition 4.10. The Lie algebra structure on LL , given by the bracket (13), corresponds to the couple (α, ρ) defined by (En+k,j δi,k − Ei,k δn+k,j ) α(Zn ).(Ei,j ) = (n) k≥0

+(−n)

(Ek,j δk+n,i − Ei,k+n δj,k ) k≥0

for n = 0 and α(Z0 ) ≡ 0, while ρ(Zn , Zm ) = 0 if n, m ≥ 0 or n, m ≤ 0 and ρ(Zn , Z−m ) =

m−1

En−m+k,k

k=0

if n > m, and ρ(Zn , Z−m ) =

m−1

Ek,m−n+k ,

k=0

if n < m. Proof. The proof follows comparing formula (13) with formula (19).

422

I. Mencattini, D. Kreimer

We now remark that Lemma 4.11. [1] Given π

0 → h → e → g → 0,

(20)

as in (18), any splitting s : g −→ e (at the vector space level) of the previous exact sequence, induces a map αs ∈ H omC g, Der(h) , via the following: αs (X).ξ = [s(X), ξ ], for each X ∈ g and each ξ ∈ h.

Proposition 4.12. The map α ∈ HomC C, Der(gl + (∞)) , defined in Proposition 4.10, is induced by the linear map s : C −→ LL , which is defined as follows: s(Zn ) = (n)Zn,0 + (−n)Z0,n − δn,0 Z0,0 .

(21)

Proof. The map s defined in formula (21) is a section of the projection π : LL −→ C defined by the exact sequence (17). In other words, s ∈ H omC (C, LL ) such that s ◦ π = I dC . From Lemma (4.11) we know that such a section s induces a linear map: αs : C −→ Der(gl+ (∞)), defined by: αs (x).ξ = [s(x), ξ ]LL . It is now easy to check that this map is the same defined in Proposition 4.10.

We are now almost ready to prove Theorem 4.7. We only need to remark the following. From Theorem 4.9 we have that a given extension (α, ρ) of the Lie algebra C by the Lie algebra gl+ (∞) will split, i.e. will be equivalent to a semi-direct product of these two Lie algebras, if and only if (α, ρ) ∼ (α , 0), or, in other words, if and only if α is a morphism of Lie algebras. Theorem 4.9 tells us that this is equivalent to ask for the existence of a linear map b : C −→ gl+ (∞), such that s + b : C −→ LL is a morphism of Lie algebras. Moreover, since we are working with the category of graded Lie algebras, the map b has to be grade preserving. In conclusion, to prove Theorem 4.7, we are left to show that such a map b does not exist. Proof of Theorem 4.7. Suppose we can define a linear map b : C −→ gl+ (∞) such that s + b : C −→ LL is a morphism of (graded) Lie algebras. That means that we can N find elements M i=1 ahi Ehi +1,hi ∈ gl+ (∞) and i=1 bkj Ekj ,kj +1 ∈ gl+ (∞) such that M N b(Z1 ) = i=1 ahi Ehi +1,hi , b(Z−1 ) = i=1 bkj Ekj ,kj +1 and furthermore 0 = [(s + b)(Z1 ), (s + b)Z−1 ] = [Z1,0 +

M i=1

ahi Ehi +1,hi , Z0,1 +

N

bkj Ekj ,kj +1 ].

j =1

We can calculate such a commutator by re-writing each of the terms Ei,j in the sums in terms of the generators Zn,m , and applying to such terms the brackets given in formula (13). The result, written in terms of the generators Ei,j , takes the form: −E0,0 +

N j =1

bkj (Ekj +1,kj +1 − Ekj ,kj ) +

M i=1

ahi (1 + bhi )(Ehi +1,hi +1 − Ehi ,hi ) = 0.

Insertion-Elimination Lie Algebra

423

The right-hand side of the previous sum can be reorganized in terms of the summands Ej +1,j +1 − Ej,j as follows: −

L

φj (Ej +1,j +1 − Ej,j ),

i≥0

where L is the biggest between N and M and the φj ’s are coefficients. Then we have that E0,0 =

L

φj (Ej +1,j +1 − Ej,j ) = −φ0 E0,0 +

(φj +1 − φj )Ej,j + φL EL+1,L+1 ,

j ≥0

i≥0

that clearly give us a contradiction.

Proof of Theorems 4.1 and 4.2. We now conclude this section by giving the proofs for Theorems 4.1 and 4.2. We recall from [15] that the Lie algebra LL has an obvious module: Definition 4.13. [15] S=

Ctn = C[t0 , t1 , t2 , t3 .....].

n≥0

We will assign a degree equal to k to the generator tk for each k ≥ 0. LL acts on S via the following: Zn,m tk = 0 if m > k, Zn,m tk = tk−m+n if m ≤ k.

(22)

In what follows we will indicate by Z(LL ) the center of the Lie algebra LL . Proof of Theorem 4.1. It is obvious that CZ0,0 ⊂ Z(LL ). Let us prove the other inclusion. Let us suppose that there is some element α ∈ LL , not proportional to Z0,0 and that belongs to the center of LL . W.l.o.g. we assume α=

k i=1

ai Zni ,mi =

i: ni ,mi =0

bi Zni ,mi +

i: n˜ i =0

ci Zn˜ i ,0 +

di Z0,m˜ i ,

(23)

i: m ˜ i =0

where all the n˜ i ’s (m ˜ i ’s) are different from 0 and n˜ i = n˜ j (m ˜ i = m ˜ j ), if i = j , and (ni , mi ) = (nj , mj ), if i = j . We will prove that α, defined above, is equal to zero by showing that the coefficients bi , ci and di are all equal to zero. We will split the proof of this assertion into two lemmas. Lemma 4.14. If α ∈ Z(LL ), where α is defined as above, then bi = di = 0 for each i.

424

I. Mencattini, D. Kreimer

Proof. Let us consider some element Zn,0 ∈ LL such that 0 < n ≤ mini {mi , m ˜ i }. Then using formula (13), we get: [Zn,0 , α] = bi [Zn,0 , Zni ,mi ] + di [Zn,0 , Z0,m˜ i ] i

=

i

bi (Zni +n,mi − Zni ,mi −n ) +

i

di (Zn,m˜ i − Z0,m˜ i −n ).

i

Note that all the m ˜ i ’s are different (and different from 0), while in the set of the mi ’s (which are also all different from 0) we can have repetitions. . Let us now define the set M = {m1 , ...., mk , m ˜ 1 , ..., m ˜ r }, and let us consider the disjoint union: M = M1 ∪ · · · ∪ Ms . Each Mi corresponds to the set of all indices in M which are equal to some given index li , say. We remark once more that for each i, Mi ∩ {m ˜ 1, · · · , m ˜ r } contains at most one element, since in the set {m ˜ 1 , ..., m ˜ r } we do not have repetitions. Let us now consider p1 = l1 − n (which is ≥ 0, as a consequence of the condition we imposed on n), and let us also consider the corresponding element tp1 ∈ S. Since α belongs to Z(LL ), and since n > 0, we have:

0 = [Zn,0 , α](tp1 ) = − bi tp1 −mi +n+ni + di tp1 −m˜ i +n . (24) i: mi ∈M1

i: m ˜ i ∈M1

Remark 4.15. We observe that all the indices in M1 are equal to l1 and that p1 = l1 − n. Moreover the ni ’s in the first sum of the right-hand side in formula (24) are all different (since by assumption we have that (ni , mi ) = (nj , mj ) unless i = j and in our case all the mi belong to the class M1 ). Finally, we notice that the last sum, if not equal to zero, contains only one term. ˜ 1, · · · , m ˜ r } are both not Let us now suppose that M1 ∩ {m1 , · · · , mk } and M1 ∩ {m empty (the cases where one, or both, of those intersections are empty, are completely analogous). From the previous remark it follows that   0 = [Zn,0 , α](tp1 ) = −  bi tl1 −n−l1 +n+ni + di tl1 −n−l1 +n  =−

i: mi ∈M1

i: m ˜ i ∈M1

bi tni + d1 t0 .

i

Since all the ni in the first sum are different, we have that d1 = 0 and bi = 0 for each i. We can apply the same argument to the sets M2 ,...,Ms , to show that each of the coefficients bi and ci are equal to 0. From Lemma 4.14 we conclude that if α ∈ Z(LL ), α defined as in Eq. (23), then α= ci Zni ,0 . i

Insertion-Elimination Lie Algebra

425

To conclude the proof of Theorem 4.1, we need to show that Lemma 4.16. If α ∈ Z(LL ) and α = i ci Zni ,0 , then ci = 0 for each i. Proof. We first notice that we can suppose all ni = 0 and n1 < n2 .... Let us now consider some element Z0,n , such that n ≥ maxi {ni }. Since we suppose α = i ci Zni ,0 to be in the center of LL , we can write: ci [Zni ,0 , Z0,n ] = ci (Zni ,n − Z0,n−ni ). 0 = [α, Z0,n ] = i

i

By the hypothesis on n and the one on the ni ’s, we conclude that all the ci ’s are equal to zero. Proof of Theorem 4.2. Let us suppose that l 0 is not a maximal abelian sub-algebra of 0 LL , i.e. that there exists LL α ∈ / l , α = ni=1 ai Zni ,mi , such that [α, Zk,k ] = 0, ∀ k > 0. Without loss of generality we can suppose that in each of (ni , mi )’s, ni = mi (if no, α = β + i fi Zni ,ni and [β, Zk,k ] = [α, Zk,k ]). Such an element can be written as bi Zni ,mi + ci Zn˜ i ,0 + di Z0,m˜ i . (25) α= i: mi =0, ni =0

i: n˜ i =0

i: m ˜ i =0

Remark 4.17. We note that in formula (25) all the ni ’s and the mi ’s are different from 0 and also that n˜ i = n˜ j and m ˜ i = m ˜ j for each i = j . We will prove that such an element is identically equal to zero, showing that each of the coefficients in Eq. (25) is equal to zero. We will divide the proof of this statement in two lemmas. Lemma 4.18. Given α ∈ l 0 , defined as in formula (25), we have that ci = di = 0 for all i. Proof. Let us fix integer k, 0 < k ≤ mini {ni , mi , n˜ i , m ˜ i }. Then we get bi [Zni ,mi , Zk,k ] + ci [Zn˜ i ,0 , Zk,k ] + di [Z0,m˜ i , Zk,k ] [α, Zk,k ] = i: mi =0,ni =0

=

ci (Zk+n˜ i ,k − Zn˜ i ,0 ) +

i: n˜ i =0

i: n˜ i =0

i: m ˜ i =0

di (Z0,m˜ i − Zk,k+m˜ i ),

i: m ˜ i =0

since [Zni ,mi , Zk,k ] = 0, ∀ {ni , mi } such that ni ≥ k, mi ≥ k, [Zn˜ i ,0 , Zk,k ] = Zk+n˜ i ,k − Zn˜ i ,0 if 0 < k ≤ n˜ i , and [Z0,m˜ i , Zk,k ] = −Zk,m˜ i +k + Z0,m˜ i if 0 < k ≤ m ˜ i. Since α commutes with all the elements of the sub-algebra l 0 , we have ci (Zk+n˜ i ,k − Zn˜ i ,0 ) + 0= di (Z0,m˜ i − Zk,k+m˜ i ). i: n˜ i =0

i: m ˜ i =0

426

I. Mencattini, D. Kreimer

But in the right-hand side of the previous formula the first sum contains only elements of positive degree while the second sum contains only those of negative degree; thus the sum is equal to zero if and only if separately ci (Zk+n˜ i ,k − Zn˜ i ,0 ) = 0 and di (Z0,m˜ i − Zk,k+m˜ i ) = 0. i: n˜ i =0

i: m ˜ i =0

From this it follows that all ci ’s and di ’s are equal to zero. Indeed, consider the sum containing the ci ’s (the one containing the di ’s can be treated in the same way): ci (Zk+n˜ i ,k − Zn˜ i ,0 ) = 0. i: n˜ i =0

Since k = 0 and since n˜ i = n˜ j , if i = j , all the elements Zk+n˜ i ,k − Zn˜ i ,0 are linearly independent. Summarizing, so far we have proved that if a given element α commutes with each of the elements in l 0 , then: α= bi Zni ,mi . (26)

Lemma 4.19. If α, l

0

i: ni =0,mi =0

= 0, with α defined as in (26), then all the bi ’s are equal to 0.

Proof. Let us decompose the element α in term of elements of positive and negative degree, i.e: α= ai Zni ,mi = bi Zri +sj ,ri + ci Zpi ,pi +tj . i

j

i≥0

j

i≥0

Remark 4.20. We remark that in α elements of the same (negative or positive) degree could be present; as an example of such an element (of positive degree) we can consider: βj = bi Zri +sj ,ri , for a given j i

or the element (of negative degree): γj = ci Zpi ,pi +tj , for a given j . i

From the previous remark let us re-write α as: α= βj + γj , j

j

each βj ∈ L+ and each γj ∈ L− . Let us now consider some element Zk,k ∈ l 0 and let us take the commutator of such an element with α, [α, Zk,k ] = [βj , Zk,k ] + [γj , Zk,k ]. j

j

Since LL is a graded Lie algebra and since deg Zk,k = 0, we have that deg [βj , Zk,k ] = sj , ∀j

Insertion-Elimination Lie Algebra

427

and similarly deg [γj , Zk,k ] = −tj , ∀j. Hence [α, Zk,k ] = 0 ⇐⇒ [βj , Zk,k ] = 0 and [γj , Zk,k ] = 0 , ∀j. We are left to prove that any homogeneous element commuting with all the elements in l 0 can not exist. So, to fix ideas, let us now consider some element of positive degree s, say, β = l i=1 ai Zni +s,ni and let us suppose that [β, Zk,k ] = 0 ∀ k ≥ 1.

(27)

Without loss of generality we can further assume that 0 < n1 < n2 < ... < nk (that β fulfills the hypothesis is constrained by the assumptions given for the element α defined in formula (26), which translates for β into the condition ni = 0). To conclude, it suffices to show that each of the ai ’s of β = li=1 ai Zni +s,ni is equal to 0. So let us consider k = n2 in formula (27). Applying formula (13) to this case, we get: ai [Zni +s,ni , Zn2 ,n2 ] [β, Zk,k ] = a1 [Zn1 +s,n1 , Zn2 ,n2 ] +

i≥2

= a1 Zn2 +s,n2 − (n2 − n1 − s)Zn2 ,n2 −s

−(n1 + s − n2 )Zn1 +s,n1 + δn1 +s,n2 Zn2 ,n1 ,

since i≥2 ai [Zni +s,ni , Zn2 ,n2 ] = 0. By the previous formula and the hypothesis for the ni ’s, we conclude that [β, Zn2 ,n2 ] = 0 ⇐⇒ a1 = 0. Taking k = n3 , n4 , ...., and using the same argument, we can conclude that each of the ai ’s is equal to zero. 5. Cohomology of the Lie Algebra LL In what follows we will describe in some detail the cohomology of the Lie algebra LL . We will start with an explicit calculation for the dimension of the first cohomology group (with trivial coefficients) and we will continue using the general machinery to calculate the higher cohomology groups. Let us first introduce the derivation Y , acting on LL as follows: Y.Zn,m ≡ [Y, Zn,m ] = (n − m)Zn,m .

(28)

Let us now consider the following extension of the Lie algebra LL : Definition 5.1. Lˇ L = spanC Zn,m , Y |n, m ∈ Z≥0 , where the commutator [Zn,m , Zl,s ] is given by formula (13) and [Y, Zn,m ] = (n − m)Zn,m . We have the following: Theorem 5.2. 1. dimC H 1 (Lˇ L , C) = 1; 2. dimC H 1 (LL , C) = +∞. Proof. The elements of H 1 (LL , C) are in one-to-one correspondence with the elements φ ∈ HomC (LL , C) such that

428

I. Mencattini, D. Kreimer

φ([Zn,m , Zl,s ]) = 0, for each Zn,m , Zl,s ∈ LL . As a consequence of Proposition 3.3, Eq. (16), we have that the value of φ on a given element Zn,m , depends only on the degree of such an element. In fact, given: Zn,m = [Zn,0 , Z0,m ] + (n − m)Zn−m,0 + (m − n)Z0,m−n − δn−m,0 Z0,0 , φ(Zn,m ) = (n − m)φ(Zn−m,0 ) + (m − n)φ(Z0,m−n ) − δn−m,0 φ(Z0,0 ). From this remark, it follows that dimC H 1 (LL , C) = +∞. This proves the second assertion. To show that also the first holds, let us observe that, since: 0 = φ([Y, Zn,m ]) = (n − m)φ(Zn,m ), for each Zn,m ∈ LL , then φ(Zn,m ) = 0 for any element Zn,m with degree different from zero, so that we can write: φ(Zn,n ) = cφ (n) ∈ C. On the other hand, since the value of φ on a given element depends only on the degree of such an element, we have that: φ(Zn,m ) = cφ δn−m,0 , or, in other words: H 1 (Lˇ L , C) C. To go to the higher cohomology groups we need to introduce some notation and some (classical) results about the cohomology of Lie algebras. Let us start with the following lemma: Lemma 5.3. Let gl(n) the (Lie) algebra of n × n matrices (with entries in C). Let us define the direct system of (Lie) algebras: ... → gl(n − 1) → gl(n) → gl(n + 1) → ...,

(29)

where the arrows are given by the standard inclusions, i.e A ∈ gl(n) is mapped into A˜ ∈ gl(n + 1) such that a˜ i,j = ai,j , ∀i ≤ n, j ≤ n and a˜ i,j = 0 if i > n or j > n. Then the direct limit of such a direct system is isomorphic to the Lie algebra gl+ (∞) introduced in Definition 4.3. Proof. The proof follows immediately from Definition 4.3.

Let us next quote a result about the cohomology of the Lie algebra of the general linear group gl(n) [7]: Theorem 5.4. 1) The cohomology ring of the Lie algebra gl(n) is an exterior algebra in n generators of degree 1, 3, ..., 2n − 1: H • (gl(n)) = [c1 , c3 , ....., c2n−1 ];

Insertion-Elimination Lie Algebra

429

2) for any given n, the (inclusion) map defined in formula (29): i : gl(n − 1) −→ gl(n) induces a map i ∗ in cohomology: i ∗ : H • (gl(n)) −→ H • (gl(n − 1)), such that i ∗ : H p (gl(n)) −→ H p (gl(n − 1)) is an isomorphism for p < n, and it maps to zero the top degree generator when p = 2n − 1; 3) from 1), 2) and the previous Lemma 5.3, it follows that the cohomology ring of the Lie algebra gl+ (∞) is a (non finitely generated) exterior algebra having generators only in odd degree: H • (gl + (∞)) = [c1 , c3 , .....]. Proof. We refer to the book [7] for the proof of 1) and 2). Part 3) follows from Lemma 5.3, where we have identified the direct limit of the Lie algebras gl(n) with the Lie algebra gl+ (∞). Now let us go back to the exact sequence (17). This induces the following exact sequence in cohomology ([7, 17]): f1

δ

0 → H 1 (C) → H 1 (LL ) → H 1 (gl + (∞))C → H 2 (C) f2

δ

→ H 2 (LL ) → H 2 (gl + (∞))C → · · ·.

(30)

Here, by H i (gl + (∞))C we understand the (sub)-vector space of C-invariant elements in H i (gl + (∞)), i.e the space {a ∈ H i (gl + (∞))|x.a = 0, ∀ x ∈ C}. Using Theorem 5.4 and the previous exact sequence we have one of our main results. Theorem 5.5. The cohomology groups H i (LL ), for i = 1, 2, are infinite dimensional. In particular the Lie algebra LL has infinite many non equivalent central extensions. Proof. Since C is an (infinite dimensional) abelian Lie algebra the groups H n (C) are infinite dimensional. Let us now consider the following segment of the exact sequence (30): δ

f p+1

· · · → H p (gl + (∞))C → H p+1 (C) −→ H p+1 (LL ) → H p+1 (gl + (∞))C → · · ·. (31) Let us suppose first that p is an odd number. From Theorem 5.4, it follows that H p+1 (gl + (∞)) = 0 and H p (gl + (∞)) < ∞, so that f p+1 is surjective and ker( f p+1 ) is finite dimensional. We can argue in an analogous way for p an even number; in this case the map f p+1 is injective as H p (gl + (∞)) = 0. The statement about central extensions follows now from the fact that those are in one-to-one correspondence with the elements of the group H 2 (LL ).

430

I. Mencattini, D. Kreimer

We conclude this section with the following proposition: Proposition 5.6. The map f 1 , defined in the exact sequence (30), is not an isomorphism. The proof of this statement follows from: Claim 5.7. H 1 (gl + (∞))C C. Proof. Let us start by observing that H 1 (gl + (∞)) gl + (∞)/[gl + (∞), gl + (∞)] C, and identifying [gl + (∞), gl + (∞)] with sl+ (∞), i.e. with the Lie algebra of infinite matrices of finite rank, having trace equal to zero. In particular, this implies that the only non trivial class [φ] ∈ H 1 (gl + (∞)) corresponds to a (closed) cochain φ ∈ C 1 (gl + (∞)) whose kernel is sl+ (∞). Let us now define the action of the (abelian) Lie algebra C on H 1 (gl + (∞)): for any φ ∈ C 1 (gl + (∞)) and [Z] ∈ C LL /gl + (∞), define ([Z].φ)(α) = φ([Z + β, α]),

(32)

where Z ∈ LL and β ∈ gl + (∞)). On the other hand, since φ is a cocycle, we have that φ([Z + β, α]) = φ([Z, α]). It is a simple calculation to show that [LL , gl + (∞)] ⊂ sl+ (∞) so that, from the hypothesis on φ, we conclude that φ([Z, α]) = 0, i.e. [Z].φ = 0, or that φ is C-invariant.

Remark 5.8. In this remark we want to compare the Lie algebra gl + (∞) with its finite dimensional analogue, e.g. gl(n). These two Lie algebras are not simple; in fact the Lie algebra of the matrices having trace equal to zero is a non trivial ideal in both cases (sl + (∞) in the infinite dimensional case and sl(n) in the finite dimensional case). Moreover in both cases the quotient is the trivial Lie algebra, e.g. C. While in the finite dimensional case the quotient gl(n)/sl(n) C Z(gl(n)), where Z(gl(n)) is the center of gl(n), in the infinite dimensional case the quotient gl + (∞)/sl + (∞) does not correspond to any ideal in gl + (∞). In particular Z(gl + (∞)) = {0}. Remark 5.9. Because of the (vector space) isomorphism LL gl + (∞)⊕C, and because of the definition of the Lie algebra: gl + (∞) lim gl(n), −→

it is reasonable to ask if we can describe the Lie algebra LL as a direct limit of (finite) dimensional Lie algebras. Such a question becomes even more interesting, if we compare the Lie algebra LL with a Lie algebra that plays an important role in some recent developments of CFT [16]. This Lie algebra, LV in the notation of [16] is actually the central extension of the (infinite) symplectic Lie algebra sp(∞, R), it contains as an ideal the Lie algebra gl + (∞) (a property which is shared with LL ), and it allows for a finite dimensional approximation.

Insertion-Elimination Lie Algebra

431

Even if it is not clear to us if there is any reason to believe that these two Lie algebras are related, the question if it is possible or not to describe LL as direct limit of finite dimensional Lie algebra stands, in our opinion, is an interesting problem per se. At the moment we do not have any definitive answer to this question. Anyhow we can note the following. Let us define An = gl + (∞) ⊕ V (2n − 1), n ≥ 1, where V (2n − 1) is a vector space of dimension 2n − 1 whosegenerators, Z±k with k = {0, 1, . . . , n − 1}, are defined by the following: Zk = j ≥0 Ek+j,j , Z−k = E and Z = E . Then it is easy to show that: j,j +k 0 j,j j ≥0 j ≥0 Theorem 5.10. {An }n≥1 defines a filtration of Lie algebras, and: LL = lim An −→

in the category of Lie algebras. 6. Conclusion and Outlook In this paper we gave results about the structure of the Lie algebra LL . We first discussed its relevance for the structure of quantum field theory. Having motivated its study, we showed that LL is the (non-abelian) extension via gl+ (∞) of a commutative Lie algebra. We also showed that this extension does not split. Furthermore, we described the cohomology of LL and proved that the second cohomology group of this Lie algebra is infinite dimensional, allowing for infinitely many non-equivalent central extensions. It should be very interesting to understand the physical meaning of the central extensions of this Lie algebra in the future, in particular their relations with those DSEs. In future work we will study more closely the representation theory of this Lie algebra as well as the representation and the cohomology of the Lie algebra Lˇ L defined in 5.1, with the hope to shed some light on these problems. Acknowledgements. I.M. thanks the IHES for hospitality during a stay in May of 2004. We thank Takashi Kimura for several useful conversations. I.M wants to thank Pavel Etingof and Victor Kac and Zoran Skoda for stimulating discussions and valuable advice. We thank Claudio Bartocci and Ivan Todorov for carefully reading a preliminary version of the manuscript and for suggesting several improvements.

References 1. Alekseevsky, D., Michor, P.W., Ruppert, W.: Extension of Lie algebras. http://arxiv.org/list/ math.DG/0005042, 2000 2. Bergbauer, C., Kreimer, D.: The Hopf algebra of rooted trees in Epstein-Glaser renormalization. To appear in Ann. Henri Poincar´e, http://arxiv.org/list/hep-th/0403207, 2004 3. Broadhurst, D.J., Kreimer, D.: Exact solutions of Dyson-Schwinger equations for iterated one-loop integrals and propagator-coupling duality. Nucl. Phys. B 600, 403 (2001) 4. Connes, A., Kreimer, D.: Renormalization in quantum field theory and the Riemann Hilbert problem. I. The Hopf algebra structure of graphs and the main theorem. Commun. Math. Phys. 210(1), 249–273 (2000) 5. Connes, A., Kreimer, D.: Renormalization in quantum field theory and the Riemann Hilbert problem. II. The β-function, diffeomorphism and the renormalization group. Commun. Math. Phys. 216(1), 215–241 (2001) 6. Connes, A., Kreimer, D.: Insertion and Elimination: the doubly infinite Lie algebra of Feynmann graphs. Ann. Henri Poincare 3(3), 411–433 (2002)

432

I. Mencattini, D. Kreimer

7. Fuks, D.B.: The Cohomology of Inifinite Dimensional Lie Algebras., Contemporary Soviet Mathematics, New York: Consultant Bureau, 1986 8. Gross, D.: In: Methods in QFT, Les Houches 1975, Amsterdam: North Holland Publishing, 1976 9. Kreimer, D.: On the Hopf algebra structure of perturbative quantum field theory. Adv. Theor. Math. Phys. 2(2), 303–334 (1998) 10. Kreimer, D.: New mathematical structures in renormalizable quantum field theories. Annals Phys. 303, 179 (2003); [Erratum-ibid. 305, 79 (2003)], hep-th/0211136 11. Kreimer, D.: Factorization in quantum field theory: An exercise in Hopf algebras and local singularities. Les Houches Frontiers in Number Theory, Physics and Geometry, France, March 2003, hep-th/0306020 12. Kreimer, D.: The residues of quantum field theory: Numbers we should know. hep-th/0404090 13. Kreimer, D.: What is the trouble with Dyson-Schwinger equations?. Nucl.Phys. Proc. Suppl. 135, 238–242 (2004) 14. Mack, G., Todorov, I.T.: Conformal-Invariant Green Functions without Ultraviolet Divergences. Phy. Rev. D8, 1764 (1973) 15. Mencattini, I., Kreimer, D.: Insertion-Elimination Lie algebra: the Ladder case. Lett. Math. Phys. 64, (2004) 61–74 16. Nikolov, N.M., Stanev,Ya.S., Todorov, I.T.: Four Dimensional CFT Models with Rational Correlation Functions. J. Phys. A 35(12), 2985–3007 (2002) 17. Weibel, C.: Introduction to Homological Algebra. Cambridge Studies in Advanced Mathematics, 38. Cambridge University Press, Cambridge, 1994 Communicated by Y. Kawahigashi

Commun. Math. Phys. 259, 433–450 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1372-z

Communications in

Mathematical Physics

Density of Eigenvalues of Random Normal Matrices Peter Elbau, Giovanni Felder Department of Mathematics, ETH-Zentrum, 8092 Zurich, Switzerland. E-mail: [email protected]; [email protected] Received: 27 September 2004 / Accepted: 7 February 2005 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: The relation between random normal matrices and conformal mappings discovered by Wiegmann and Zabrodin is made rigorous by restricting normal matrices to have spectrum in a bounded set. It is shown that for a suitable class of potentials the asymptotic density of eigenvalues is uniform with support in the interior domain of a simple smooth curve. 1. Introduction In recent work initiated by P. Wiegmann and A. Zabrodin [1–4], a connection between the normal matrix model and conformal mappings was discovered. In this model one considers random normal N × N complex matrices with probability measure −1 PN (M) dM = ZN exp{−(N/t0 )tr(M ∗ M − p(M) − p(M)∗ )} dM,

(1)

where dM is a natural measure on the variety of normal matrices and ZN is the normalization factor. The result is that, as N → ∞, the density of eigenvalues is 1/π t0 times the characteristic function of a bounded domain in the complex plane. This domain is characterized by the fact that its exterior harmonic moments are the coefficients tj of the polynomial p appearing in the measure. Moreover, the Riemann mapping of the exterior of the unit disk onto the exterior of the domain obeys, as a function of the tj , the equations of the integrable dispersionless Toda hierarchy. These fascinating results remain however at the level of formal manipulations of undefined objects, as the integrals diverge, except in the simplest case of a polynomial p of degree 2, where the domain is bounded by an ellipse. The purpose of this note is to give a setting in which the above statements make mathematical sense and to give a proof of these statements. The problem of divergence of the integral over normal matrices is solved in a naive way, by restricting the integral to normal matrices whose eigenvalues lie in a compact

434

P. Elbau, G. Felder

domain D of the complex plane. Then for small t0 the results can be formulated in terms of polynomial curves, i.e., curves in the complex plane admitting a parametrization of the form w → h(w) = rw+ nj=0 aj w −j , |w| = 1. For simple polynomial curves the prob 1 −j dz lem of determining the curve out of its exterior harmonic moments tj = 2πi γ z¯ z and the area π t0 of the interior domain (the interior domain is the bounded connected component of the complement of the curve) has a unique solution for small t0 , as we show in Sect. 5. Our main result is then: Theorem 1.1. Let D ⊂ C be the closure of a bounded open set containing the origin. Let p(z) = t2 z2 + · · · + tn+1 zn+1 be a polynomial such that |z|2 − p(z) − p(z) has a non-degenerate absolute minimum in D at z = 0. Then there exists a δ > 0 so that for all 0 < t0 < δ, (i) there exists a unique simple polynomial curve γ with exterior harmonic moments t1 = 0, t2 , . . . , tn+1 , 0, 0, . . . and area of interior domain π t0 ; (ii) the expectation value of the density of eigenvalues of random normal matrices with spectrum in D and distribution (1) converges as N → ∞ to a uniform distribution with support in the interior domain of γ . The condition on p implies the Hessian condition |t2 | < 21 and it is fulfilled if the Hessian condition holds and t3 , . . . , tn are sufficiently small. It then follows from results of [5, 1] that the curve γ as a function of the tj in this range provides a solution of the integrable dispersionless 2D Toda hierarchy obeying the string equation. The paper is organized as follows: the basic definitions of the random normal matrix model are recalled in Sect. 2. We then introduce the “equilibrium measure” as the unique solution of a variational problem in Sect. 3 and show in Sect. 4 that the density of eigenvalues converges to it. These are either known results or adaptations of results known for hermitian matrices to our case. In Sect. 5 we introduce the notion of polynomial curve and prove Theorem 5.3, which is a stronger form of part (i) of Theorem 1.1. In Sect. 6 we prove part (ii) of Theorem 1.1, see Theorem 6.1, and discuss our results in Sect. 7. 2. Eigenvalues of Random Normal Matrices We consider the probability measure PN (M) dM =

1 −N tr V (M) e dM, ZN

ZN =

NN (D)

PN (M) dM,

defined by a potential V , on the set NN (D) = {M ∈ MatC (N ) | [M, M ∗ ] = 0, σ (M) ⊂ D} of normal N × N complex matrices with spectrum in some compact domain D ⊂ C. The measure dM is the Riemannian volume form on (the smooth part of) NN (D) 2 with respect to the metric induced from the standard metric on the vector space CN of all N × N matrices. In a parametrization by eigenvalues and unitary matrices it is given by [6] dM = dU

|zi − zj |2

1≤i<j ≤N

N i=1

d 2 zi ,

Density of Eigenvalues of Random Normal Matrices

435

∗ where M = U diag (zi )N i=1 U and dU denotes the normalized U(N ) invariant measure on U(N )/U(1)N . This leads to the probability measure N N 1 −N N V (zi ) i=1 d 2 zi = e |zi − zj |2 d 2 zi , PN (zi )N i=1 ZN i=1

1≤i<j ≤N

ZN =

DN

e−N

N

i=1 V (zi )

i=1

|zi − zj |2

1≤i<j ≤N

N

d 2 zi

i=1

on the space of eigenvalues zi ∈ D, 1 ≤ i ≤ N . 3. The Equilibrium Measure We are interested in the behavior of the function PN (z) as N → ∞. Because the probability that two eigenvalues are equal is always zero, we may consider PN as a function on the set D0N = {z ∈ D N | zi = zj ∀ i = j }. Introducing the probability measure δz (A) =

N 1 χA (zi ), N i=1

z = (zi )N i=1 ,

on D (χA shall denote the characteristic function of the set A), we write for z ∈ D0N , 1 2 −1 V (ζ ) dδz (ζ ) + exp −N log |ζ − ξ | dδz (ξ ) dδz (ζ ) . PN (z) = ZN ζ = ξ Letting N → ∞, only the infimum of the coefficient of −N 2 , I0 := inf I (µ), µ∈M(D) I (µ) := V (z) dµ(z) +

z = ζ

log |z − ζ |−1 dµ(ζ ) dµ(z),

will be relevant. Here M(D) denotes the set of all Borel probability measures on D without point masses (a measure with point masses could only arise from measures δz with some zi = zj , but then PN (z) = 0). Therefore, we safely can neglect the restriction on the double integral. The precise sense in which the infimum of I controls the large N behavior of PN will be discussed in the next section. Here we consider this reasoning only as a motivation for introducing the variational problem. χD Because of I ( |D| λ) < ∞ (λ the Lebesgue measure on C and |D| = λ(D) the area of D), I0 is finite. Definition. An equilibrium measure for V on D ⊂ C is a Borel probability measure µ on D without point masses so that I (µ) = I0 . Theorem 3.1. Every continuous function V on a compact subset D has a unique equilibrium measure.

436

P. Elbau, G. Felder

In the remaining part of this section we prove this theorem, which is a known fact from potential theory, see e.g. [7], and give necessary and sufficient conditions for µ to be an equilibrium measure. The constructions are adapted from the corresponding results for Hermitian matrices, see [9]. 3.1. Existence. To show the infimum is achieved, we choose a sequence (µn )∞ n=1 in M(D) with I (µn ) → I0 . Lemma 3.2. The space of all Borel probability measures on D is sequentially compact. Proof. By the theorem of Riesz-Markov each Borel measure µ on D corresponds to exactly one positive, linear functional φµ ∈ C(D)∗ and by the theorem of Alaoglu, the closed unit-sphere in C(D)∗ is weak-*-compact. Therefore, for each sequence (µn )∞ n=1 or Borel probability measures on D, the sequence (φµn )∞ n=1 contains a weak-*-convergent subsequence (φµn(k) )∞ k=1 , i.e. ∃ φ ∈ C(D)∗ : φµn(k) (f ) → φ(f )

∀f ∈ C(D).

Now we find a measure µ on D with φ = φµ . This measure fulfills f dµn(k) → f dµ (k → ∞) ∀f ∈ C(D), and hence is again a Borel probability measure. Because of this lemma there exists a convergent subsequence (µn(k) ) of (µn ) and a Borel probability measure µ with µn(k) → µ. To prove that I (µ) = I0 , we estimate with an arbitrary real constant L, lim I (µn(k) ) = lim V (z) dµn(k) (z) + lim log |z − ζ |−1 dµn(k) (ζ ) dµn(k) (z) k→∞ k→∞ k→∞ ≥ V (z) dµ(z)+ lim min{log |z − ζ |−1 , L} dµn(k) (ζ ) dµn(k) (z). k→∞

Approximating uniformly the second integrand according to the theorem of StoneWeierstraß up to ε > 0 with a polynomial in z, z¯ , ζ and ζ¯ and using Fubini’s theorem, we get a lower bound for the limit: lim I (µn(k) ) ≥ V (z) dµ(z) + min{log |z − ζ |−1 , L} dµ(ζ ) dµ(z) − 2ε. k→∞

Letting first ε → 0 and then L → ∞ shows that µ has no point masses (otherwise the right-hand side would diverge) and I0 = I (µ). 3.2. Uniqueness. Next, we want to show that there is exactly one measure µ ∈ M(D) with I (µ) = I0 . So suppose µ˜ ∈ M(D) also fulfills I (µ) ˜ = I0 . Then we consider the family µt = t µ˜ + (1 − t)µ = µ + t (µ˜ − µ), t ∈ [0, 1], in M(D).

Density of Eigenvalues of Random Normal Matrices

437

Lemma 3.3. Let µ and µ˜ be probability measures such that the function log |z − ζ |−1 is integrable with respect to µ ⊗ µ and to µ˜ ⊗ µ. ˜ Then log |z − ζ |−1 is also integrable with respect to µ ⊗ µ. ˜ Additionally, we have the inequality (2) log |z − ζ |−1 d(µ˜ − µ)(ζ ) d(µ˜ − µ)(z) ≥ 0, with equality if and only if µ = µ. ˜ Proof. We start with the distributional identity log |z|−1 ϕ(z) d2 z = −2π ϕ(0) for any Schwarz function ϕ. Introducing the Fourier transform ϕˆ of ϕ, we find 1 (|k|2 ϕ(k)) ˆ d2 k log |z|−1 ϕ(z) d2 z = − |k|2

i 1 1 ¯ 2 (kz+k z¯ ) − f (k) d 2 z d 2 k = ϕ(z) e 2π |k|2 1 1 i (kz+k¯ z¯ ) 2 = − f (k) d2 k ϕ(z) d2 z, e 2π |k|2 where f denotes a real, continuous function which is one in the vicinity of zero and becomes zero at infinity. So we have for all z ∈ C \ {0} the equation 1 1 i (kz+k¯ z¯ ) −1 2 − f (k) d2 k + C(f ) e log |z| = 2π |k|2 with some real constant C(f ). So, with Tonelli’s theorem, we see that log |z − ζ |−1 d(µ˜ − µ)(ζ ) d(µ˜ − µ)(z) 2 i 1 1 ¯ z¯ ) (kz+ k e2 d(µ˜ − µ)(z) d2 k = 2 2π |k| is non-negative and finite, which immediately implies the integrability of log |z − w|−1 with respect to µ ⊗ µ. ˜ To achieve equality in (2), we need i i ¯ z¯ ) ¯ (kz+ k dµ(z) = e 2 (kz+k z¯ ) dµ(z) ˜ e2 for all k ∈ C, which reads µ = µ. ˜

So we can expand I (µt ) and obtain −1 V (z) + 2 log |z − ζ | dµ(ζ ) d(µ˜ − µ)(z) I (µt ) = I (µ) + t +t 2 log |z − ζ |−1 d(µ˜ − µ)(ζ ) d(µ˜ − µ)(z).

(3)

438

P. Elbau, G. Felder

Lemma 3.3 now states that the coefficient of t 2 is non-negative and so the function t → I (µt ) is convex on [0, 1]. In particular, ˜ + (1 − t)I (µ) = I0 , I (µt ) ≤ tI (µ) which implies I (µt ) ≡ I0 . This requires the last summand in (3) to vanish and so, again with Lemma 3.3, we see that µ = µ. ˜ 3.3. A variational form. To determine if a given measure µ is the equilibrium measure for the potential V on the domain D, we may check a variational principle. Proposition 3.4. The probability measure µ is the equilibrium measure for the potential V on the domain D if and only if the function E(z) = V (z) + 2 log |z − ζ |−1 dµ(ζ ) (4) fulfills the relation E(z) dµ(z) ˜ ≥ E(z) dµ(z) =: E0 for all µ˜ ∈ M(D).

(5)

Additionally, we have the property E(z) ≡ E0 µ-almost everywhere.

(6)

Proof. Let us first assume µ is the equilibrium measure. Then, for an arbitrary measure µ˜ ∈ M(D), the condition d I (µt ) t=0 ≥ 0, µt = t µ˜ + (1 − t)µ, t ∈ [0, 1], dt has to hold. This means V (z) + 2 log |z − ζ |−1 dµ(ζ ) d(µ˜ − µ)(z) ≥ 0, which immediately implies Eq. (5). Assume on the other side µ fulfills condition (5). Then we obtain with the equilibrium measure µ0 , I (µ0 ) = I (µ) + V (z) + 2 log |z − ζ |−1 dµ(ζ ) d(µ0 − µ)(z) + log |z − ζ |−1 d(µ0 − µ)(ζ ) d(µ0 − µ)(z) ≥ I (µ), and so µ = µ0 . To show the additional statement, we consider for the probability measure µ the set B = {z ∈ D | E(z) < E0 }. If µ(B) > 0, the variational principle (5) for the measure µ˜ = χB (z) E0 ≤ E(z) dµ(z) < E0 , µ(B) and so µ(B) has to vanish. Thus, we get condition (6).

χB µ(B) µ

would yield

Instead of verifying condition (5), we prefer to use the following statement:

Density of Eigenvalues of Random Normal Matrices

439

Corollary 3.5. If for a measure µ ∈ M(D) the function E, defined by Eq. (4), fulfills, for some real constant E0 , E(z) ≡ E0 on the support of µ and E(z) ≥ E0 everywhere, then µ is the equilibrium measure. 4. The Eigenvalue Density Definition. The k-point correlation function R (k) is given by (k) RN (zi )ki=1 =

N! (N − k)!

D N −k

N PN (zi )N d 2 zi . i=1 i=k+1

So, the one-point correlation function is up to normalization nothing but the density of the eigenvalues. As indicated in the previous section, all the correlation functions can be calculated in the limit N → ∞ out of the equilibrium measure µ, as in the case of hermitian matrix models, see [8, 9]. Theorem 4.1. For all φ ∈ C(D k ) we have the equality lim

N→∞ D k

k

(k) 2 1 φ (zi )ki=1 RN (zi )ki=1 d zi = k N

I.e. the measure

k (k) 1 R (zi )ki=1 i=1 Nk N

i=1

k φ (zi )ki=1 dµ(zi ). i=1

d2 zi on D k converges weakly to

k

i=1

(7)

dµ(zi ).

Proof. Substituting in the left-hand-side of Eq. (7) the definition of the correlation functions and turning our attention to the highest order in N , we obtain (because PN is invariant under the symmetric group) k (k) 1 (k) 1 φ, k RN = k φ (zi )ki=1 RN (zi )ki=1 d 2 zi N N Dk =

1 Nk

N

k i1 ,... ,ik =1 D

i=1

N φ (zij )kj =1 PN (zi )N d2 zi + o(1). i=1 i=1

Since for large values of N the probability distribution localizes at values z ∈ D0N with I (δz ) ≈ I0 , let us consider the sets AN,η = {z ∈ D0N | I (δz ) ≤ I0 + η},

η > 0.

Lemma 4.2. The probability PN (D N \ AN,η ) drops for N → ∞ exponentially to zero. 1 2 Proof. For an absolutely continuous

equilibrium measure dµ(z) = ψ(z) d z, we get with Jensen’s theorem and I (δz ) dµ(zi ) = I0 + o(1),

ZN ≥

e

N

−N 2 I (δz )−

{z∈D N

i=1 log ψ(zi )

| ψ(zi ) = 0 ∀i}

N

dµ(zi ) ≥ e−N

2 I +o(N 2 ) 0

,

i=1

1 This case suffices for our needs, but the restriction is in fact not necessary. Indeed, we could perform an analogous argument for the measures dµε (z) = ψε (z) d2 z, ψε (z) = 1 2 Bε (z) dµ. In the limit πε ε → 0, where I (µε ) → I0 , this would yield the desired statement.

440

P. Elbau, G. Felder

and therefore, PN (D N \ AN,η ) ≤

DN

eN

2 I +o(N 2 )−N 2 (I +η) 0 0

N

d2 zi = o(e−N

2 η/2

).

i=1

Let the continuous function N1k φ (zij )kj =1 take its maximum on the compact set AN,η at ζ , and set νN,η = δζ . Then, 1 (k) 1 φ, k RN ≤ k N N

N i1 ,... ,ik =1

φ (ζij )kj =1 =

k φ (zi )ki=1 dνN,η (zi ). i=1

Because of Lemma 3.2, we find a convergent subsequence νN(n),η → νη (n → ∞) with lim

N→∞

φ,

k 1 (k) k ≤ φ (z R ) dνη (zi ). i i=1 Nk N i=1

Lemma 4.3. We have νη ∈ M(D) and, in the limit η → 0, I (νη ) → I0 . Proof. Using ζ ∈ AN(n),η , we obtain with the cut-off L ∈ R: N(n) 1 1 I0 + η ≥ V (ζi ) + min{log |ζi − ζj |−1 , L} N (n) N (n)2 i=1 1≤i=j ≤N(n) = V (z) dνN(n),η (z) L + min{log |z − ζ |−1 , L} dνN(n),η (ζ ) dνN(n),η (z) − . N (n)

Sending first n and then L to infinity brings us to νη ∈ M(D) and therefore I0 ≤ I (νη ) ≤ I0 + η. So, letting η → 0, a subsequence of νη converges to the equilibrium measure µ, and thus, k 1 (k) k lim φ, k RN ≤ φ (zi )i=1 dµ(zi ). N→∞ N i=1

Arguing in the same way for the limes inferior concludes the proof.

5. Polynomial Curves Definition. A polynomial curve of degree n is a smooth simple closed curve in the complex plane with a parametrization h : S 1 ⊂ C → C of the form h(w) = rw + a0 + a1 w −1 + · · · + an w −n ,

|w| = 1,

(8)

with r > 0 and an = 0. The standard (counterclockwise) orientation of the circle induces an orientation on the curve. We say that a polynomial curve is positively oriented if this orientation is counterclockwise, i.e., if the tangent vector to the curve makes one full turn in the counterclockwise direction as we go around the unit circle.

Density of Eigenvalues of Random Normal Matrices

441

Proposition 5.1. Let γ be a positively oriented polynomial curve with parametrization h of the form (8). Then h, viewed as a homolorphic map on C× , restricts to a biholomorphic map from the exterior of the unit disk onto the exterior of γ . Proof. We have to show that h (w) = 0 for all w in the complement of the unit disk. Let t denote the tangent vector map w → t (w) = h (w)iw = i(rw − j aj w −j ). Since γ is a simple closed curve, the map w → t (w)/|t (w)| is a map of degree 1 from the unit circle to itself. Therefore we have 1 1= d arg(t (w)) 2π |w|=1 1 t (w) = dw 2πi |w|=1 t (w) 1 t (w) =N+ dw. 2πi |w|=R t (w) Here N ≥ 0 denotes the number of zeros of t (w), counted with multiplicity, in the complement of the unit disk and R is so large that it contains them all. The latter integral is 1 as can be seen by sending R to infinity. Thus N = 0, and h has no zeros in the complement of the unit disk. A simple consequence of this proposition is that a polynomial curve is uniquely parametrized by a map of the form (8) with r > 0. Indeed, any other conformal mapping of the complement differs by an automorphism of the complement of the unit disk. But non-trivial automorphisms are given by fractional linear transformations which do not preserve the conditions. From now on, we will only consider polynomial curves encircling the origin, i.e., such that the origin is contained in their interior domain. This can always be achieved by a translation, i.e., a shift of a0 . Definition. The harmonic moments (tj )∞ j =1 of the exterior domain D− of a polynomial curve (or more generally of an analytic curve) encircling the origin are defined by 1 1 −j 2 tj = − z d z= z¯ z−j dz, πj D− 2πij γ where only the right integral should be taken as a definition for j ≤ 2. Proposition 5.2. Let γ be a positively oriented polynomial curve of degree n encircling the origin. (i) The exterior harmonic moments tj of γ vanish for all j > n + 1. (ii) There exist universal polynomials Pj,k ∈ Z[r, a0 , . . . , ak−j ], 1 ≤ j ≤ k, so that for j = 1, . . . , n + 1, j tj = a¯ j −1 r −j +1 +

n

a¯ k r −k Pj,k (r, a0 , . . . , ak−j ).

(9)

k=j

Moreover, Pj,k is a homogeneous polynomial of degree k − j + 1 and it is also weighted homogeneous of degree k − j + 1 for the assignment deg(aj ) = j + 1, deg(r) = 0.

442

P. Elbau, G. Felder

(iii) The area of the domain enclosed by γ is πt0 , where t0 = r 2 −

n

j |aj |2 .

j =1

Proof. Since γ encircles the origin, h(w) never vanishes for |w| ≥ 1. Hence the contour in the formula for tj may be computed by taking residues at infinity. For j ≥ 1, 1 ¯ −1 )h (w)h(w)−j dw h(w j tj = 2π i |w|=1 −j n n n a¯ k −j k−j −l−1 −l−1 =r r− 1+ lal w al w /r dw. w 2π i k=0

l=1

l=0

The integrals in this sum vanish if k ≤ j − 2. The formula for tj in terms of ak , r is obtained by expanding the geometric series and picking the coefficient of w−1 in the integrand. This proves (i) and the first part of (ii). The homogeneity property is clear. The weighted homogeneity follows by rescaling w in the integral. The same formula ¯ −1 ), which does not contribute can be used to compute t0 , but the first term rw −1 in h(w to the integral and was omitted for j ≥ 1, must be added here. Examples. The terms in tj involving polynomials Pj,k with k ≤ 3 are t1 = a¯ 0 − r −1 a¯ 1 a0 − r −2 a¯ 2 (2 a1 r − a0 2 ) + r −3 a¯ 3 (3 a0 a1 r − 3 a2 r 2 − a0 3 ) + · · · , 2 t2 = r −1 a¯ 1 − 2 r −2 a¯ 2 a0 − 3 r −3 a¯ 3 (a1 r − a0 2 ) + · · · , 3 t3 = r −2 a¯ 2 − 3 r −3 a¯ 3 a0 + · · · . Theorem 5.3. Let t2 , . . . , tn+1 be complex numbers so that |t2 | < 1/2. Then there exists an A0 = A0 (t2 , . . . , tn+1 ) > 0 so that for all A, t1 with 0 < A < A0 and |t1 |2 < A(1/2 − |t2 |), there exists a unique positively oriented polynomial curve of degree ≤ n encircling the origin, with area A and exterior harmonic moments t1 , . . . , tn+1 , 0, 0, . . . . Proof. The idea is to invert the map (r, a0 , . . . , an ) → (t0 , . . . , tn+1 ) for small r and a0 . Set αj = r −j aj , ρ = r 2 and consider instead the polynomial map F : (ρ, α0 , . . . , αn ) → (t0 , . . . , tn+1 ), as a map from R × Cn+1 to itself. The first claim is that this map has a smooth inverse in some neighborhood of any point t ∈ R × Cn+1 such that t0 = t1 = 0 and |t2 | = 1/2. By Prop. 5.2, this map is given by t0 = ρ −

n

ρ j j |αj |2 ,

j =1

j tj = α¯ j −1 +

n

α¯ k Pj,k (r, α0 , rα1 , . . . , r k−j αk−j )

k=j

= α¯ j −1 +

n k=j

α¯ k Pj,k (ρ, α0 , α1 , . . . , αk−j ).

Density of Eigenvalues of Random Normal Matrices

443

From the contour integral representation of the harmonic moments we get the integral Pj,k (ρ, α0 , . . . , αk−j ) −j n n 1 k−j l −l−1 l −l−1 1− 1+ w lαl ρ w αl ρ w dw = 2π i |w|=R l=1

l=0

for any sufficiently large R. By computing the residue at infinity, we can calculate Pj,k and thus tj up to terms of at least second order in α0 , ρ, t0 = ρ (1 − |α1 |2 ) + · · · , j tj = α¯ j −1 − j α¯ j α0 − (j + 1)ρ α¯ j +1 α1 + · · · ,

j ≥ 1.

Hence F (0, 0, 2 t¯2 , . . . , (n + 1) t¯n+1 ) = (0, 0, t2 , . . . , tn+1 ) and the tangent map at this point sends (ρ, ˙ α˙ 0 , . . . , α˙ n ) to (t˙0 , . . . , t˙n ) with t˙0 = (1 − 4|t2 |2 )ρ, ˙ ¯ j t˙j = α˙ j −1 − j (j + 1) tj +1 α˙ 0 − 2j (j + 1)(j + 2) tj +2 t2 ρ, ˙

j ≥ 1.

The tangent map is invertible if |t2 | = 1/2. By the inverse function theorem, F has a smooth inverse on some neighborhood of (0, 0, t2 , . . . , tn+1 ). If |t2 | < 1/2, F preserves the positivity of the first coordinate. In terms of the original variables, this means that given any t = (t0 , t1 , t2 , . . . , tn+1 ) with small t0 > 0, t1 and such that |t2 | = 1/2, there is a curve w → h(w) with h(w) = rw + α0 + rα1 w −1 + · · · + r n αn w −n and αj (j + 1) t¯j +1 . It remains to show that if r > 0 is small enough, h parametrizes a positively oriented simple closed curve containing the origin. We first show that h is an immersion. Since h (w) = r − rα1 w −2 + O(r 2 ) and limr→0 α1 = 2 t¯2 , we see that as long as |t2 | = 1/2, h (w) does not vanish on the unit circle. Similarly, we show that h : S 1 → C is injective: we have

|h(w) − h(w )|2 = r|w − w + 2 t¯2 (w −1 − w −1 )| + O(r 2 ) = r|w − w + 2 t¯2 (w¯ − w¯ )| + O(r 2 ). But the expression in the absolute value can only vanish for w = w if |2 t¯2 | = 1 which is excluded by the hypothesis. Moreover h(w) = rw + t¯1 + 2r t¯2 w −1 + O(r 2 ). Therefore h parametrizes a perturbation of an ellipse centered at t¯1 . The condition on t1 is a sufficient condition for this ellipse to contain the origin. Example. Let k ≥ 3 and let us consider curves with tj = 0 for all j = k. Then aj = 0 for all j = k − 1, so that h(w) = rw + ak−1 w −k+1 . The relation between (t0 , tk ) and (r, ak−1 ) is t0 = r 2 − (k − 1)|ak−1 |2 , k tk = a¯ k−1 r −k+1 . This map is a diffeomorphism from the region 0 < r < (k − 1) |ak−1 |, which is the condition for h(w) to be an embedding of the unit circle, onto the region 0 < t0 < (k(k − 1)|tk |)−2/(k−2) (k − 2)/(k − 1). As t0 approaches the upper bound for given tk , the curve develops cusp singularities.

444

P. Elbau, G. Felder

6. The Equilibrium Measure for a Polynomial Curve In this section we evaluate the equilibrium measure corresponding to potentials 1 V (z) = t0

|z| − 2 Re 2

n+1

tk z

k

.

(10)

k=1

We anticipate the result: 1 Theorem 6.1. For any set (tk )∞ k=1 ⊂ C with t1 = 0, |t2 | < 2 and tk = 0 for k > n + 1 and any compact domain D ⊂ C containing the origin as an interior point and such that t0 V , V given by Eq. (10), is positive on D \ {0}, there exists a δ > 0 so that for all 0 < t0 < δ the equilibrium measure µ for V on D is given by

µ=

1 χD λ, πt0 +

where D+ denotes the interior domain of the polynomial curve γ defined by the harmonic moments (tk )∞ k=0 , and λ is the Lebesgue measure on C. The rest of the section is dedicated to the proof of this theorem.

6.1. The Schwarz reflection. We first need the notion of a reflection on an analytic curve (see e.g. [10]). Definition. The Schwarz function of an analytic curve γ is defined as the analytic continuation (in a neighbourhood of the curve) of the function S(z) = z¯ on γ . The Schwarz reflection ρ for the analytic curve in this domain is the anti-holomorphic map ρ(z) = S(z). Definition. Under the critical radius R of the polynomial curve defined by the parametrization h we understand the value R = max{|w| | h (w) = 0, w ∈ C}, which, by definition of the map h, is less than 1. Lemma 6.2. Let γ be a polynomial curve parametrized by h and R its critical radius. Then the Schwarz function S and the Schwarz reflection ρ of the curve γ restricted to h(B1/R \ B¯ R ), where BR denotes the open disk with radius R around zero, are biholomorphic respectively anti-biholomorphic maps. They are given by S(z) = h¯

1 −1 h (z)

1 and ρ(z) = h . h¯ −1 (¯z)

(11)

Therefore, ρ maps γ identically on itself, h(B1 \ B¯ R ) to h(B1/R \ B¯ 1 ) and vice versa. Also, we have ρ 2 = id.

Density of Eigenvalues of Random Normal Matrices

445

Proof. By definition of the Schwarz function, in a neighborhood of |w| = 1, ¯ −1 ). S(h(w)) = h(w Because for w ∈ C \ B¯ R the function h is biholomorphic, we may write 1 1 ¯ S(z) = h . , ρ(z) = S(z) = h h−1 (z) h¯ −1 (¯z) Taking the derivatives, we find that they do not vanish for R < |h−1 (z)| <

1 R.

Lemma 6.3. In the interior domain D+ of the polynomial curve defined by the parameters tk from (10) as its harmonic moments, the function −1 z 2 log − 1 d2 ζ E(z) = V (z) + πt0 D+ ζ is equal to zero and in the exterior domain D \ D+ , its gradient reads ∂z¯ E(z) =

1 (z − ρ(z)). t0

(12)

Proof. To verify the first statement, we use Green’s theorem and obtain 2 |ζ |2 1 log |z − ζ |−1 ζ¯ + dζ. log |z − ζ |−1 d2 ζ = −|z|2 + Re π D+ 2πi γ ζ −z Integrating by parts of the second integrand yields immediately 2 1 −1 2 2 log |z − ζ | d ζ = −|z| − 2 Re log(ζ − z)ζ¯ dζ, π D+ 2π i γ and expanding the logarithm around z = 0 leads us to E(z) = 0 in D+ . For the proof of the second part we write S = Si + Se , where Si is analytic in D+ and Se in the complement D \ D+ . For the exterior function Se one finds with the Cauchy integral and Stokes formula: ¯ ζ − Si (ζ ) 1 1 1 Se (z) = − dζ = d2 ζ, z ∈ D \ D+ . 2π i γ ζ − z π D+ z − ζ Because we know E to be constant on D+ , n+1 1 1 1 k−1 2 0 = ∂z E(z) = z¯ − ktk z − d ζ t0 π D+ z − ζ k=1

for all z ∈ D+ . So on im γ , and by analytic continuation in the entire domain where S is holomorphic (which includes the exterior domain D \ D+ ), we have k−1 . Si (z) = n+1 k=1 ktk z Therefore, for all z ∈ D \ D+ , 1

1 z − Si (z) − Se (z) = (z − ρ(z)) . ∂z¯ E(z) = t0 t0

446

P. Elbau, G. Felder

Let us remark that this proof shows that the Schwarz function S has, at least around infinity, the form S(z) =

n+1

ktk zk−1 +

k=1

∞

t0 vk z−k−1 , + z k=1

where the vk = π1 D+ zk d2 z denote the harmonic moments of the interior domain D+ . This fact was already used in [2] to establish a connection between the harmonic moments and the coefficients of the parametrization of γ . And as was shown in [2], we find, with Theorems 4.1 and 6.1, that the vk are nothing but the expectation values of t0 k N tr(M ) with respect to the probability measure PN (M) dM in the limit N → ∞ and hence, are completely determined by the harmonic moments of the exterior domain and t0 .

6.2. The Gaussian case. In this case, where the polynomial curve is an ellipse, we are able to calculate the equilibrium measure on C explicitly. Proposition 6.4. The equilibrium measure µ for the potential V (z) =

1 (|z|2 − t2 z2 − t¯2 z¯ 2 ), t0

|t2 | <

1 , 2

on C is µ=

1 χD λ, ab +

where D+ denotes the interior of the ellipse √ √ Re( t2 z)2 Im( t2 z)2 + = |t2 |, a2 b2

a=

1 + 2|t2 | t0 , 1 − 2|t2 |

b=

1 − 2|t2 | t0 . 1 + 2|t2 |

(13)

Proof. As a polynomial curve, the ellipse (13) has the parametrization h(w) = r(w + 2t2 w −1 ),

r=

a+b . 2

(14)

We check that the given measure µ is the equilibrium measure by verifying the conditions of Corollary 3.5. To this end, let us introduce for |w| > 1 the function E(w) = E(h(w)),

E(z) = V (z) +

2 πt0

−1 z log − 1 d2 ζ. ζ D+

Integrating Eq. (12) and its complex conjugate analog we get for E(w) the expression w 1 ¯ −1 )h (w) dw . |h(w)|2 − |h(1)|2 − 2 Re h(w E(w) = t0 1

Density of Eigenvalues of Random Normal Matrices

447

Substituting in this expression relation (14) for h, we obtain t0 E(w) = r 2 |w + 2t2 w −1 |2 − |1 + 2t2 |2 −2r 2 Re(t¯2 w 2 + (1 − 4|t2 |2 ) log w + t2 w −2 − (t2 + t¯2 )) = r 2 |w|2 − 1 − 4|t2 |2 + 4|t2 |2 |w|−2 + (1 − 4|t2 |2 ) log(|w|−2 ) +2r 2 Re(2t2 ww ¯ −1 − t2 (ww ¯ −1 )|w|−2 − t2 (ww ¯ −1 )|w|2 ) = r 2 (|w|2 − 1)(1 − 4|t2 |2 |w|−2 ) + r 2 (1 − 4|t2 |2 ) log(|w|−2 ) −2r 2 (|w|2 − 1)(1 − |w|−2 ) Re(t2 ww ¯ −1 ) = r 2 (|w|2 − 1)(1 − 2|t2 |)(1 + 2|t2 ||w|−2 ) + r 2 (1 − 4|t2 |2 ) log(|w|−2 ) +2r 2 (|w|2 − 1)(1 − |w|−2 )(|t2 | − Re(t2 ww ¯ −1 )). We are now ready to start estimating E(w) for |w| > 1 and |t2 | < 21 . The last bracket we can estimate by ¯ −1 ) ≥ |t2 | − |t2 ww ¯ −1 | = 0. |t2 | − Re(t2 ww It therefore remains to show (|w|2 − 1)(1 + 2|t2 ||w|−2 ) + (1 + 2|t2 |2 ) log(|w|−2 ) ≥ 0, which follows immediately out of the following lemma. Lemma 6.5. For 0 ≤ α ≤ 1 the function f (x) = (x − 1)(1 + αx −1 ) − (1 + α) log x is non-negative on the interval [1, ∞). Proof. For the function f and its derivatives f and f we have 2α 1 ≥ 0. f (1) = 0, f (1) = 0 and f (x) = 2 1 + α − x x

So we showed that E(z) ≥ 0 for all z ∈ C \ D+ . Because of Lemma 6.3, we also know that in the interior domain D+ , E is zero and we therefore can apply Corollary 3.5 to see that µ is indeed the equilibrium measure for the potential V on C. 6.3. The proof of Theorem 6.1. As in the Gaussian case we are going to show that the measure µ given in the theorem fulfills the conditions of Corollary 3.5 and is therefore the uniquely defined equilibrium measure. Because Theorem 6.1 is only valid for interior domains with small area, we are going to consider the asymptotical behaviour t0 → 0, where the harmonic moments (tk )∞ k=1 are kept fixed. To catch the asymptotical behavior of the corresponding polynomial curve, let us parametrize it as in the proof of Theorem 5.3: h(w) = rw +

n

r j αj w −j .

j =0

Then, for r → 0, we have r 2 t0 , αj (j + 1)t¯j +1 , j ≥ 1, and, because we set t1 = 0, α0 r 2 .

448

P. Elbau, G. Felder

Lemma 6.6. The critical radius of h is asymptotically constant for r → 0: R = |α1 | + O(r). Proof. The√roots of the function h (w) = r −rα1 w −2 −· · ·−nr n αn w −n−1 are in zeroth order at ± α1 and (n − 1-times degenerated) at zero. We consider now for z ∈ D respectively for w ∈ h−1 (D \ D+ ) the functions −1 z log − 1 d2 ζ and (15) E(z) = V (z) + ζ D+ w 1 ¯ w˜ −1 )h (w) E(w) = E(h(w)) = |h(w)|2 − |h(1)|2 + 2 Re h( ˜ dw˜ . (16) t0 1 We already showed in Lemma 6.3 that E(z) ≡ 0 in D+ , so the first condition of the corollary is satisfied (this is essentially the way we have chosen our potential V ). Now, also with Lemma 6.3, we see that E ≥ 0 in the vicinity of the curve γ , strictly speaking in the domain h(B1/R \ B¯ 1 ). Indeed, if we look at the connected components of the contour lines of the function E (which are smooth curves in the considered domain because there ∂z¯ E = 0), we see that the gradient vector ∂z¯ E always points outwards, i.e. in the exterior domain of the contour line. Therefore, the value of E on the contour lines is increasing outwards as desired. A bit farther from the curve, i.e. for 1/R ≤ |w| < r −α , 0 < α < 13 , the function E equals asymptotically the one of the corresponding ellipse h(0) (w) = rw + α0 + rα1 w −1 , we denote it by E (0) . Indeed, remarking that for the area of this ellipse we have (0) t0 = t0 + O(r 4 ) and that E (0) (w) = O(r −2α ), we obtain, uniformly in w, 1 (|h(w)|2 − |h(0) (w)|2 − |h(1)|2 + |h(0) (1)|2 ) t0 w 2 + Re (h(w˜ −1 ) − h(0) (w˜ −1 ))h (w) ˜ dw˜ t0 1 w 2 + Re (h (w) ˜ − h(0) (w))h ˜ (0) (w˜ −1 ) dw˜ t0 1 1 (0) t0 − t0 E (0) (w) + t0 = O(r 1−α ) + O(r 1−3α ) + O(r 1−2α ) + O(r 2−2α ) → 0 (r → 0).

E(w) − E (0) (w) =

Because E (0) (w) ≥ C > 0 for all |w| ≥ R1 and r > 0, we may choose r so small that |E(w) − E (0) (w)| < E (0) (w) and so E(w) > 0 for all w ∈ Br −α \ B1/R . The domain remains, where |w| ≥ r −α . For k ≥ 2, we obtain  l k n k (rw)k−l  r j αj w −j  h(w)k − (rw)k = l l=1

j =0

l=1

j =0

 l k n k k k−2l  j −1 = r w r αj w 1−j  = O(r 2 ). l

Density of Eigenvalues of Random Normal Matrices

449

And since t0 V (z) = |z|2 − t2 z2 − t¯2 z¯ 2 + o(|z|2 ) 1

= |z − 2t2 z¯ |2 + (1 − 4|t2 |2 )|z|2 + o(|z|2 ) 2 for z → 0 and V > 0 on D \{0}, we may find a constant C > 0 such that t0 V (z) ≥ C|z|2 for all z in the compact domain D. Therefore, for r → 0, C V (h(w)) = V (rw) + O(1) ≥ |rw|2 + O(1), t0 which tends to infinity at least as r −2α . On the other side, the integral over the logarithm in (15) diverges for r → 0 only as log r. So we have for r small enough E(h(w)) > 0 for all w ∈ h−1 (D \ D+ ) \ Br −α . This proves now E(z) ≥ 0 for all z ∈ D and therefore, with Corollary 3.5, that µ is the equilibrium measure. 6.4. Shifting the origin. As a little generalization, we consider the case where t1 = 0. This corresponds to a shift of the origin. Therefore, we like to define the harmonic moments also for a curve which does not encircle the origin. Definition. Let h(w) = rw +

n

aj w −j

j =0

parametrize a polynomial curve of degree n. Then the harmonic moments (tk )n+1 k=1 are given by the equation system (9). All other harmonic moments are set to zero. Proposition 5.2 tells us that this definition coincides with the previous one if the origin is in the interior domain of the curve. 1 Corollary 6.7. For any set (tk )∞ tk = 0 for k > n + 1 and k=1 ⊂ C with |t2 | < 2 and 2 k any compact domain D ⊂ C such that U (z) = |z| − 2 Re n+1 k=1 tk z has exactly one absolute minimum in the interior of D, there exists a δ > 0 so that for all 0 < t0 < δ the equilibrium measure µ for V = t10 U on D is given by

1 χD λ, πt0 + where D+ denotes the interior domain of the polynomial curve γ defined by the harmonic moments (tk )∞ k=0 . µ=

Proof. Let us first shift the origin by a0 , such that V (z + a0 ) has its absolute minimum in 0. Thereby, the potential gets the form n+1 1 |z|2 − 2 Re V (z + a0 ) = tk zk + V (a0 ), t0 k=2

where the tk are the harmonic moments of the shifted curve γ − a0 . Indeed, the coefficients of V and the harmonic moments depend polynomially on the shift a0 . Because we know them to coincide as long as the origin is inside D+ , they do so for all a0 . Applying now Theorem 6.1 for the shifted potential gives the desired result.

450

P. Elbau, G. Felder

7. Discussion We have shown that under suitable assumptions on the polynomial p appearing in the potential and on the integration range D of the eigenvalues, the asymptotic density is uniform with support on a domain uniquely determined by the coefficients of the polynomial p. It would be interesting to understand what happens at the range of validity of our assumptions. If the potential V has more than one minimum in D then one should expect for small t0 to have an equilibrium measure with disconnected support, so that a description by a polynomial curve cannot be valid. Also as t0 gets bigger, polynomial curves develop singularities and become non-simple. The question is then what happens to the eigenvalues. Finally we note that polynomial curves are (real sections of complex) rational curves. Curves of higher genus should arise by replacing p by more general holomorphic functions. References 1. Wiegmann, P. B., Zabrodin, A.: Conformal Maps and integrable hierarchies. Commun. Math. Phys. 213(3), 523–538 (2000) 2. Kostov, I. K., Krichever, I., Mineev-Weinstein, M., Wiegmann, P. B., Zabrodin, A.: τ -function for analytic curves. In: Random matrix models and their applications, Math. Sci. Res. Inst. Publ. 40, Cambridge: Cambridge Univ. Press, 2001, pp. 285–299 3. Marshakov, A., Wiegmann, P., Zabrodin, A.: Integrable Structure of the Dirichlet Boundary Problem in Two Dimensions. Commun. Math. Phys. 227(1), 131–153 (2002) 4. Krichever, I., Marshakov, A., Zabrodin, A.: Integrable Structure of the Dirichlet Boundary Problem in Multiply-Connected Domains. http://arxiv.org/list/hep-th/0309010, 2003 5. Takasaki, K., Takebe, T.: Integrable Hierarchies and Dispersionless Limit. Rev. Math. Phys. 7(5), 743–808 (1995) 6. Chau, L.-L., Zaboronsky, O.: On the Structure of Correlation Functions in the Normal Matrix Model. Commun. Math. Phys. 196(1), 203–247 (1998) 7. Saff, E. B., Totik, V.: Logarithmic Potentials with External Fields. Grundlehren der mathematischen Wissenschaften 316, Berlin-Heidelberg-New York: Springer, 1997 8. Johansson, K.: On Fluctuations of Eigenvalues of Random Hermitian Matrices. Duke Math J. 91, 151–204 (1998) 9. Deift, P. A.: Orthogonal polynomials and random matrices: a Riemann-Hilbert Approach. Courant Lecture Notes in Mathematics, Vol 3, New York: Courant Institute of Mathematical Sciences, 1999 10. Davis, P. J.: The Schwarz function and its applications. The Carus Mathematical Monographs, No. 17, Washington, DC: The Mathematical Association of America, 1974 Communicated by L. Takhtajan

Commun. Math. Phys. 259, 451–474 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1373-y

Communications in

Mathematical Physics

Traveling Fronts in a Reactive Boussinesq System: Bounds and Stability Brandy Winn Department of Mathematics, University of Chicago, Chicago, IL 60637, USA. E-mail: [email protected] Received: 28 September 2004 / Accepted: 4 January 2005 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: This paper considers a simplified model of active combustion in a fluid flow, with the reaction influencing the flow. The model consists of a reaction-diffusion-advection equation coupled with an incompressible Navier-Stokes system under the Boussinesq approximation in an infinite vertical strip. We prove that for certain ignition nonlinearities, including all that are C 2 , and for any domain width, planar traveling front solutions are nonlinearly and exponentially stable within certain weighted H 2 spaces, provided that the Rayleigh number ρ is small enough. The same result holds for bistable nonlinearities in unweighted H 2 spaces. We also obtain uniform bounds on the Nusselt number, the bulk burning rate, and the average maximum vertical velocity for chemistries that include bistable and ignition nonlinearities.

1. Introduction Transition processes in nature, such as the transition from one equilibrium state to another, are often manifested as traveling waves. Traveling waves are observed in combustion [17], chemical kinetics [4], propagation of dominant genes [3, 7], and propagation of nerve impulses [6], to name a few applications. The use of parabolic equations to reproduce wave propagation dates back to the work of Kolmogorov, Petrovski˘ı, and Piskunov [7] and Fisher [3] in 1937. Since the 1970’s, parabolic systems of reaction diffusion equations have been actively used and studied to model traveling wave phenomena in a wide variety of physical, chemical, and biological problems [14]. Many of these problems of interest take place in fluids and are strongly influenced by the fluid dynamics [9, 17]. Appropriately, numerous studies have investigated hydrodynamical effects on reaction processes. Most of these studies consider passive advection, in which the reactants are carried by the flow, but the flow is unaffected by the reaction. Only a few studies rigorously examine the effects of active advection, in which the fluid flow is influenced by the reaction [2, 8, 10, 11, 13]. The above references are merely a small

452

B. Winn

sampling of the literature; additional references can be found within the recent reviews [1, 16] as well as the above sources. This paper considers a reactive Boussinesq system in an infinite vertical strip = [0, ]x × Rz : ∂t T + u · ∇T = T + f (T ), ∂t u + u · ∇u + ∇P = σ u + σρT eˆz , ∇ · u = 0.

(1.1) (1.2) (1.3)

In this model a reaction-diffusion-advection equation is coupled with an incompressible Navier-Stokes system under the Boussinesq approximation. We regard the system as a simplified model of combustion in a fluid flow, with the reaction influencing the flow. The reaction chemistry is given by f ; of particular interest are ignition, bistable, and KPP nonlinearities. The boundary conditions in the x-direction are periodic, and in the z-direction are given by lim T = 1,

z→−∞

lim T = 0,

z→∞

lim u = 0.

z→±∞

(1.4)

The corresponding one-dimensional problem without x variation on a domain R and with an ignition, bistable, or KPP nonlinearity is well known to have traveling front solutions connecting the lower regions of hot fluid with the higher regions of cold fluid. These fronts have the form T = T (z−ct), u = 0 and are easily seen to solve the full system (1.1)–(1.4) as well. One of the central questions concerning this system is whether or not these planar fronts are stable with respect to small perturbations in the initial data. This issue was analyzed in [2] for ignition, bistable, and KPP nonlinearities. The authors proved that for small aspect ratio < C1 and small Rayleigh number ρ < C2 /3 for some constants C1 and C2 , the only traveling front solutions of (1.1)–(1.4) are the planar fronts corresponding to the one-dimensional problem, and any other solution of the system becomes planar in the sense that ||∇ ⊥ · u(t)||L2 () + ||∂x T (t)||L2 () → 0 as t → ∞. The authors also found that planar traveling fronts of this system with speed two, T = T (z − 2t), u = 0, are linearly unstable if ρ > 2C/σ and is large enough. In this case the planar fronts lose stability due to longwave perturbations that grow exponentially. Note that in general a Boussinesq system might have nonplanar traveling front solutions. This is certainly the case for reactive Boussinesq systems with bistable nonlinearities and large Rayleigh numbers; in [10, 11] such systems are proven to admit nonplanar fronts as solutions. This paper examines the nonlinear stability of planar fronts for ignition and bistable nonlinearities. We prove that for certain ignition nonlinearities, including all that are C 2 , and for any domain width , these planar traveling fronts are nonlinearly and exponentially stable within certain weighted H 2 spaces, provided that the Rayleigh number ρ is small enough to satisfy ρ < δ(, σ ) for some δ with δ(, σ ) → 0 as → ∞ or σ → ∞. The same result holds for bistable nonlinearities in unweighted H 2 spaces provided that the initial data for T decay exponentially at infinity. Our other main result concerns uniform bounds. We obtain bounds on the Nusselt number, the bulk burning rate, and the average maximum vertical fluid velocity, provided that the initial data for the temperature satisfy front-like exponential decay conditions and provided that the chemistry f satisfies certain conditions which include bistable and ignition nonlinearities. Similar bounds were obtained in [2] for the case of concave KPP nonlinearities.

Traveling Fronts in a Reactive Boussinesq System

453

The organization of this paper is as follows. Secttion 2 takes a closer look at the reactive Boussinesq system, noting some basic properties and symmetries of the equations, as well as setting up notation and assumptions to be used throughout this paper. Bounds for the temperature are derived in Sect. 3.1. The temperature bounds are then used to derive bounds for the Nusselt number, the bulk burning rate, and the average maximum vertical velocity in Sect. 3.2. The next section examines the nonlinear stability of the planar waves. First, Sect. 4.1 studies the spectrum of a relevant linear operator L and gives some bounds on the resolvent operator. Then the exponential of L and fractional powers of L are defined and a few needed bounds involving these are stated. Finally, Sect. 4.2 proves a stability result for ignition and bistable nonlinearities. 2. The Reactive Boussinesq System We consider the system ∂t T + u · ∇T = T + f (T ), ∂t u + u · ∇u + ∇P = σ u + σρT eˆz , ∇ ·u = 0

(2.1) (2.2) (2.3)

along with its vorticity ω = ∇ ⊥ · u = ∂x u2 − ∂ξ u1 , ∂t ω + u · ∇ω = σ ω + σρTx

(2.4)

on the domain = [0, ]x × Rz . We interpret T as the temperature, u as the fluid velocity, P as the pressure, and eˆz = (0, 1) as the unit vector in the direction opposite gravity. The positive constants σ and ρ are nondimensional, with σ representing the Prandtl number or normalized fluid viscosity, and ρ representing the Rayleigh number. The chemistry f (T ) is Lipschitz continuous, nontrivial, and satisfies f (0) = f (1) = 0. Additional conditions on f will be introduced as needed; of particular interest are ignition and bistable nonlinearities. For ignition f (T ) = 0 in [0, q∗ ] ∪ {1} and f (T ) > 0 in (q∗ , 1) for some q∗ ∈ (0, 1). For bistable nonlinearities f (T ) < 0 in (0, q∗ ) and f (T ) > 0 in (q∗ , 1) for some q∗ ∈ (0, 1). The boundary conditions are periodic in the x-direction, T (t, x, z) = T (t, x + , z), u(t, x, z) = u(t, x + , z), ω(t, x, z) = ω(t, x + , z), (2.5) and are given by lim T = 1,

z→−∞

lim T = 0,

z→∞

lim u = 0, and

z→±∞

lim ω = 0

z→±∞

(2.6)

in the z-direction. We assume the initial condition T0 (x, z) satisfies 0 ≤ T0 (x, z) ≤ 1. Note that the incompressibility of the system and the boundary conditions at z = ±∞ force the mean of u2 to be independent of z and hence zero: 0= (∂x u1 + ∂z u2 ) dx = ∂z u2 dx, 0

and so

0

0

u2 dx = 0.

454

B. Winn

This system is symmetric with respect to reflections so that if u, T , ω is a solution, then so is ˜˜ x, z) = (−u1 (t, − x, z), u2 (t, − x, z)) , u(t, T˜˜ (t, x, z) = T (t, − x, z), ˜˜ x, z) = −ω(t, − x, z). ω(t,

We assume that the initial data u0 , T0 is invariant with respect to this symmetry. With ˜˜ x, z) are solutions with the same initial data this assumption, T , u and T˜˜ (t, x, z), u(t, and boundary conditions. So this invariance is preserved in time and u1 (t, x, z) = −u1 (t, − x, z), T (t, x, z) = T (t, − x, z),

u2 (t, x, z) = u2 (t, − x, z), ω(t, x, z) = −ω(t, − x, z).

Under these circumstances, u1 and ω are odd, and hence have mean zero: u1 dx = 0, ω dx = 0. 0

0

This symmetry assumption on the initial data will be used only in Sect. 4.2 to obtain Poincar´e inequalities for u and ω, thereby completing the proof of stability. This assumption can be replaced by any other condition that ensures Poincar´e inequalities for u and ω. 3. Some Bounds 3.1. Bounds for T . First note that by the maximum principle, our assumption that 0 ≤ T0 ≤ 1 forces 0 ≤ T ≤ 1 for all t ≥ 0. Proposition 1. Suppose T0 satisfies T0 (x, z) ≤ ke−az for some positive constants k and a and suppose f (T ) ≤ 0 for T ∈ [0, q∗ ] for some q∗ ∈ (0, 1). Choose b ≥ a 2 + f ∞ . Then t u2 (s)∞ ds T (x, z, t) ≤ k exp −az + bt + a (3.1) 0

for all t ≥ 0. Proof. Define t u2 (s)∞ ds . θ+ (z, t) = k exp −az + bt + a 0

Then

So

∂t θ+ + u · ∇θ+ − θ+ = θ+ b + au2 (t)∞ − au2 − a 2 ≥ θ+ f ∞ . ∂t (θ+ − T ) + u · ∇(θ+ − T ) − (θ+ − T ) ≥ f ∞ θ+ − f (T ).

Traveling Fronts in a Reactive Boussinesq System

455

Note that f (T ) < f ∞ T for all T > 0. Define η := min f ∞ T − f (T ) > 0. T ∈[q∗ ,1]

We claim that θ+ − T ≥ 0 for all t ≥ 0. Suppose not. Then we may choose a number µ such that 0 > µ > inf(θ+ − T ) and 0 > µ > − 2fη . Moreover, there exists a point ∞ pµ = (xµ , zµ , tµ ) with (θ+ − T )(pµ ) = µ and inf (θ+ − T ) = µ. Then at pµ we ×[0,tµ ]

have

0 ≥ ∂t (θ+ − T ) + u · ∇(θ+ − T ) − (θ+ − T ) ≥ f ∞ θ+ − f (T ). If T ≤ q∗ , then f ∞ θ+ − f (T ) > 0 automatically. If T > q∗ , then at pµ f θ+ − f (T ) = f (θ+ − T ) + f T − f (T ) ∞ ∞ ∞ ≥ f ∞ µ + η η > . 2

Thus f θ+ − f (T ) > 0 for all T at pµ , a contradiction. ∞

Proposition 2. Suppose T0 satisfies 1 − T0 (x, z) ≤ keaz for some positive constants k and a and suppose f (T ) ≥ 0 for T ∈ [q∗ , 1] for some q∗ ∈ (0, 1). Choose b ≥ a 2 + f ∞ . Then t u2 (s)∞ ds (3.2) 1 − T (x, z, t) ≤ k exp az + bt + a 0

for all t ≥ 0. Proof. Note that ∂t (1 − T ) + u · ∇(1 − T ) = (1 − T ) − f (1 − (1 − T )) . Define T˜ = 1 − T and f˜(T˜ ) = −f (1 − T˜ ). Then ∂t T˜ + u · ∇ T˜ = T˜ + f˜(T˜ ) and f˜(T˜ ) ≤ 0 for T˜ ∈ [0, 1 − q∗ ]. Applying the proof of Proposition 1 to T˜ yields the result.

3.2. Some General Bounds. We assume in this section that f (T ) ≤ 0 for T ∈ [0, q∗1 ] and f (T ) ≥ 0 for T ∈ [q∗2 , 1] for some 0 < q∗1 ≤ q∗2 < 1 so that Propositions 1 and 2 hold. In particular f could be an ignition or bistable nonlinearity. We would like to obtain bounds on the Nusselt number N u, the bulk burning rate V∞ , and the average maximum vertical velocity W (t). These are given by the relations

1 2 ∇T (t)L2 , N u = lim N (t), where N (t) = t→∞ 1 V∞ = lim V (t) , where V (t) = ∂t T (x, z, t) dxdz, t→∞

456

B. Winn

and W (t) = u2 (t)∞ . Here φ(t) =

1 t

t

φ(t) dt 0

denotes the time average. At this point it will be useful to introduce a weighted L2 norm that is independent of the domain width : 1 φ2L2 = |φ|2 dxdz. (3.3) Lemma 1. Suppose the initial data T0 of system (2.1)–(2.3) satisfies (3.1) so that T ≤ k1 e−a1 z+B1 (t) ,

(3.4)

where

t

B1 (t) = b1 t + a1 0

u2 (s)∞ ds = b1 t + a1 tW (t).

Then V (t) ≤ W (t) +

b1

+ a1 t

(3.5)

for all t ≥ 0, where is a constant depending on the initial data. Proof. V (t) = = = ≤ = =

1 t 1 ∂t T (x, z, t) dxdz dt t 0 11 [T (x, z, t) − T0 (x, z)] dxdz t B1 (t)/a1 ∞ 0 11 (T − T0 )dz+ (T − T0 )dz+ (T − T0 )dz dx t 0 0 −∞ B1 (t)/a1 B1 (t)/a1 ∞ 0 11 −a1 z+B1 (t) (1 − T0 ) dz + 1 dz+ k1 e dz dx t 0 0 −∞ B1 (t)/a1

1 B1 (t) k1

0 + + t a1 a1

0 + k1 /a1 b1 + + W (t), t a1

where 0 is a constant depending only on the initial data T0 .

Traveling Fronts in a Reactive Boussinesq System

457

Lemma 2. Suppose the initial data T0 of system (2.1)-(2.3) satisfies (3.2) so that 1 − T ≤ k2 ea2 z+B2 (t) , where

t

B2 (t) = b2 t + a2 0

(3.6)

u2 (s)∞ ds = b2 t + a2 tW (t).

Then 1 b2 + , a2 t where is a constant depending on the initial data. N (t) ≤ V (t) + W (t) +

(3.7)

Proof. First note that by Eq. (2.1),

1 f (T ) dxdz. V (t) = Let ϕ(T ) = T (1 − T ). Making use of Eq. (2.1) we find (∂t + u · ∇)ϕ(T ) = ϕ (T ) [∂t T + u · ∇T ] = ϕ (T ) [T + f (T )] .

(3.8)

Integrating over and using integration by parts gives d

2 ϕ(T )dxdz = − ϕ (T )|∇T | dxdz + ϕ (T )f (T ) dxdz dt ≥2 |∇T |2 dxdz − f (T ) dxdz

= 2∇T 2L2 − V (t). Taking the time average and dividing by 2 yields 1 1 N(t) ≤ V (t) + [ϕ(T ) − ϕ(T0 )] dxdz 2 2t 1 1 1 = V (t) + (T − T0 )dxdz + (T0 − T )(T0 + T )dxdz 2 2t 2t 1 = V (t) + (T0 − T )(T0 + T )dxdz. (3.9) 2t The last integral in (3.9) is equivalent to 0 −B2 (t)/a2 (T0 + T ) (T0 + T ) 1 dz + (T0 − T ) dz (T0 − T ) t 0 2 2 −∞ −B2 (t)/a2 ∞ (T0 + T ) + dz dx (T0 − T ) 2 0 −B2 (t)/a2 0 ∞ 1 ≤ (1 − T ) dz + 1 dz + T0 dz dx t 0 −∞ −B2 (t)/a2 0 ∞ −B2 (t)/a2 1 B2 (t) ≤ k2 exp{a2 z + B2 (t)} dz + + T0 dz dx t 0 a2 −∞ 0 b2 11 ∞ k2 + + W (t) + T0 dzdx. = a2 t a2 t 0 0

458

B. Winn

Inserting this last relation into (3.9) reveals that b2 1 k2 1 ∞ N(t) ≤ V (t) + W (t) + + + T0 dzdx . a2 t a2 0 0

Lemma 3. Any solution of system (2.1)–(2.3) satisfies 1 W (t) ≤ C max{2 , } ρ N (t) + √ ω0 L2 σt

(3.10)

for some constant C. Here ω0 is the initial data of the vorticity ω. Proof. We will use bar notation to denote averages in x: φ(z, t) := 1 0 φ(x, z, t)dx. Recall that u2 = 0. Now any function φ satisfying φ = 0 also satisfies 2 |φ|2 ≤ 0 |∂x φ|dx ≤ 0 |∂x φ|2 dx. In particular

|u2 |2 ≤

|∂x u2 |2 dx,

(3.11)

0

|∂x u2 | ≤ 2

0

and

|∂z u2 |2 ≤

|∂x2 u2 |2 dx,

(3.12)

|∂x ∂z u2 |2 dx.

(3.13)

0

Treating u2 as a function in z alone, the Sobolev inequalities tell us that ∞ ∞ |u2 |2 dz + |∂z u2 |2 dz . |u2 |2 ≤ C −∞

−∞

(3.14)

Inserting the inequalities (3.11)–(3.13) into the right-hand side of (3.14) yields ∞ ∞ |u2 |2 ≤ C 3 |∂x2 u2 |2 dxdz + |∂x ∂z u2 |2 dxdz . (3.15) −∞ 0

−∞ 0

We now apply integration by parts to continue transforming the inequality: ∞ ∞ |u2 |2 ≤ C 3 |∂x2 u2 |2 dxdz + (∂x2 u2 )(∂z2 u2 )dxdz −∞ 0 −∞ 0 |u2 |2 dxdz ≤ C max{3 , } 3 |∂x ω|2 dxdz = C max{ , }

≤ C max{4 , 2 }∇ω2L2 .

It now follows that

W (t) = u2 (t)∞ ≤ C max{2 , } ∇ωL2 .

Traveling Fronts in a Reactive Boussinesq System

459

Applying Cauchy-Schwartz in time, we obtain 1/2 W (t) ≤ C max{2 , } ∇ω2L2 .

(3.16)

Multiplying the vorticity equation (2.4) by ω, integrating over , and applying integration by parts yields 1 d ω(t)2L2 + σ ∇ω(t)2L2 2 dt = σρ

ω∂x T dxdz

= −σρ (∂x ω)(T − T ) dxdz σ ≤ |∂x ω|2 + ρ 2 |T − T |2 dxdz 2 σ σρ 2 2 ≤ ∇ω(t)2L2 + ∇T 2L2 , by Sobolev’s inequality. 2 2 Taking the time average and dividing by , transforms the above inequality into 1 1 ω(t)2L2 + σ ∇ω(t)2L2 ≤ σρ 2 2 N (t) + ω(0)2L2 . (3.17) t t Combining (3.16) and (3.17) gives the desired bound.

Theorem 1. If a solution of system (2.1)–(2.3) has front-like initial data obeying (3.4) and (3.6) then 1 b1 b2 1+ + + , W (t) ≤ 2C 2 max{6 , 4 }ρ 2 + 2C max{2 , } √ ω0 L2 + 2a1 2a2 2 t σt (3.18) 1 2b 2b +

1 2 N(t) ≤ 4C 2 max{6 , 4 }ρ 2 + 4C max{2 , } √ ω0 L2 + + +2 a1 a2 t σt (3.19) and 1 3b1 b2 1 + 3 V (t) ≤ 2C 2 max{6 , 4 }ρ 2 +2C max{2 , } √ ω0 L2 + , + + 2a1 2a2 2 t σt (3.20) where and are constants depending on the initial data T0 , and C is the Sobolev constant from (3.14). Proof. Applying Young’s inequality to (3.10) yields W (t) ≤

1 1 N (t) + C 2 max{6 , 4 }ρ 2 + C max{2 , } √ ω0 L2 . 4 σt

(3.21)

Combining the relations (3.7), (3.5), and (3.21) in a straightforward way yields the desired results.

460

B. Winn

Corollary 1. If a solution of system (2.1)-(2.3) has front-like initial data obeying (3.4) and (3.6) then 2b1 2b2 2 7 5 2 N u ≤ 4C max{ , }ρ + (3.22) + a1 a2 and V∞ ≤ 2C 2 max{6 , 4 }ρ 2 +

3b1 b2 + . 2a1 2a2

(3.23)

4. Stability of Planar Traveling Waves We wish to investigate the stability of the special family of traveling wave solutions u = 0, T = U (z − c0 t + γ ), γ ∈ R, where U solves Uzz + c0 Uz + f (U ) = 0,

(4.1)

lim U = 1, and lim U = 0.

(4.2)

z→∞

z→−∞

The constant c0 is unique for ignition and bistable nonlinearities and can assume all values in the interval [2 f (0), ∞) for KPP nonlinearities. In all three cases U < 0. To obtain stability, we will switch to a set of moving coordinates ξ = z − c0 t + εc1 (t, ε), ε > 0, that move approximately with the traveling waves. The extra degree of freedom from allowing the shift γ = εc1 (t, ε) to vary in time will prove helpful. After performing the switch ∂t T − (c0 + ε∂t c1 )∂ξ T + u · ∇T ∂t u − (c0 + ε∂t c1 )∂ξ u + u · ∇u + ∇p ∇ ·u ∂t ω − (c0 + ε∂t c1 )∂ξ ω + u · ∇ω lim T = 0,

ξ →∞

lim T = 1

ξ →−∞

= T + f (T ), = σ u + σρT eˆξ , = 0, = σ ω + σρTx ,

lim u = 0, and

ξ →±∞

(4.3) (4.4) (4.5) (4.6)

lim ω = 0.

ξ →±∞

We now perturb about the special solution 0, U (ξ ) by writing T (t, x, ξ ) = U (ξ ) + ετ (t, x, ξ ), u(t, x, ξ ) = 0 + ερv(t, x, ξ ), ω(t, x, ξ ) = 0 + ερ ω(t, ˜ x, ξ ). Then τ , v, and ω˜ satisfy ∂t τ = −Lτ + F, 1 ∂t v = σ v + (c0 + ε∂t c1 )∂ξ v − ερv · ∇v − ∇ p˜ + σ τ eˆξ + σ U eˆξ , ε ∇ · v = 0, ∂t ω˜ = σ ω˜ + (c0 + ε∂t c1 )∂ξ ω˜ − ερv · ∇ ω˜ + σ ∂x τ,

(4.7) (4.8) (4.9) (4.10)

Traveling Fronts in a Reactive Boussinesq System

lim τ = 0,

ξ →±∞

461

lim v = 0, and

ξ →±∞

lim ω˜ = 0,

ξ →±∞

where −L = + c0 ∂ξ + f (U ), F = ε∂t c1 ∂ξ τ − ερv · ∇τ + (∂t c1 − ρv2 )Uξ + εN, 1 N = 2 [f (U + ετ ) − f (U ) − f (U )ετ ]. ε

(4.11) (4.12) (4.13)

The decay of τ and hence the stability of the system is determined to a great degree by the spectrum of L. By differentiating Eq. (4.1), we see that LUξ = 0 so that zero may be an eigenvalue of L. Clearly Uξ does not decay in time; so the zero-eigenspace could potentially be a problem. In the case of a zero eigenvalue, the pursuit of stability requires one to restrict to the orthogonal complement of the zero-eigenspace. To obtain stability, it will then suffice for the remaining spectrum of −L to lie strictly in the left half-plane with a gap from the imaginary axis. For some nonlinearities, such as the bistable nonlinearity, a spectral gap occurs automatically in H 2 (); however for other nonlinearities such as ignition and KPP, it is necessary to introduce weights to create a gap. Ultimately, the existence of a gap and the need for weights is determined by the behavior of f at the zeros of f , since these points are equilibria of the system.

4.1. Properties of the Linear Operator L. Consider the linear operator L given by −L = + c0 ∂ξ + f (U ),

(4.14)

where f is either an ignition chemistry with the properties f (T ) = 0 for T ∈ [0, q∗ ] ∪ {1}, 0 < q∗ < 1, f (T ) > 0 for T ∈ (q∗ , 1), f (T ) is Lipschitz continuous,

(4.15)

or a bistable chemistry with the properties f (0) = f (q∗ ) = f (1) = 0, 0 < q∗ < 1, f (T ) < 0 for T ∈ (0, q∗ ), f (T ) > 0 for T ∈ (q∗ , 1), f (T ) is Lipschitz continuous,

(4.16)

and c0 > 0 is the constant uniquely determined by (4.1)–(4.2). Consider for a moment the case of ignition. Note that f (U (−∞)) = f (1) < 0 and f (U (∞)) = f (0) = 0, so that 1 is a stable solution of the problem ∂t T = f (T ), but 0 is not. To obtain a spectral gap it is necessary to reweight L near ξ = ∞. We take the domain of L to be a weighted Sobolev space H 2 (w, ) on = [0, ]x × Rξ with weight w(ξ ) = (1 + eξ c0 /2 )2 . The corresponding weighted L2 space will be denoted by L2 (w, ) or L2 (w) when is clear, and its norm will be denoted by −L2 (w,) =

1/2 | − |2 w(ξ ) d

.

462

B. Winn

The corresponding inner product will be written as g, hw, =

1/2 ghw d

.

On the other hand, for bistable nonlinearities f (0) < 0 and f (1) < 0. So 0 and 1 are both stable solutions of the problem ∂t T = f (T ), and it is not necessary to work in a weighted space to obtain a spectral gap. To investigate the properties of the operator L, it is helpful to first consider the one dimensional operator −Lξ = ∂ξ2 + c0 ∂ξ + f (U )

(4.17)

on H 2 (w, ˜ R), where w˜ = w for ignition and w˜ = 1 for bistable. In Sattinger [12] the spectrum of −Lξ as an operator on weighted L∞ spaces is analyzed for a wide variety of nonlinearities, including ignition, bistable, and KPP. The methods and proofs in [12] can be modified in a straightforward way to apply to −Lξ as an operator on weighted H 2 spaces as well; although some of the exact details are different, the main ideas are identical [15]. The results can then be extended to the full operator −L by using a Fourier representation of τ . In this way, we obtain a theorem trapping the nonzero spectrum of −L in a parabola that lies strictly in the left half-plane and explicitly bounding the resolvent on regions exterior to this parabola, [15]. Before stating the theorem, we make a few definitions. First define λ1 by c02 2 2

. λ1 := inf λ ∈ [−c0 /4, 0) : (λ, 0) contains no eigenvalues of M = ∂ξ + f (U ) − 4 Next define the regions P ± by     2 c c 0 , P ± = λ : Re λ − f (U (±∞)) + 0 >  4 2 and define the region P by P=

P − , if f is ignition, P − ∩ P + , if f is bistable.

Note that P ± is the region exterior to a parabola that lies strictly in the left half-plane and intersects the real axis at f (U (±∞)). We will also be interested in shifts of these regions by an amount . Define P± = λ + : λ ∈ P ± . Theorem 2. Assume f is either an ignition nonlinearity with the properties (4.15) or a bistable nonlinearity with the properties (4.16) and f ∈ C 2 on neighborhoods of 0 and 1. Define the weight w˜ to be w(ξ ) = (1 + eξ c0 /2 )2 if f is ignition and 1 if f is bistable.

Traveling Fronts in a Reactive Boussinesq System

463

Then zero is a simple isolated eigenvalue of −L as an operator on H 2 (w, ˜ ) with eigenfunction Uξ , and the rest of the spectrum lies in a parabolic region strictly in the left half plane. More specifically the set {λ = 0} ∩ {| arg(λ − λ1 )| < π} ∩ P ∩ {| arg(λ + (2π/)2 )| < π }

(4.18)

is in the resolvent of −L. Furthermore, for all > 0, φ ∈ ( π2 , π), 0 < β < 1, and λ ∈ {λ = 0} ∩ {| arg(λ − λ1 )| < φ} ∩ P ∩ {Re λ ≥ −β(2π/)2 } ∪ {| arg(λ)| ≤ φ} we have the bounds (λ + L)−1 ψ

L2 (w,) ˜

(λ + L)−1 ψ

−1

(λ + L)

ψ

H 2 (w,) ˜

≤

C(, φ, β) ψL2 (w,) , ˜ |λ|

(4.19)

≤

C(, φ, β) ψL2 (w,) , ˜ F(|λ|)

(4.20)

H 1 (w,) ˜

≤ C(, φ, β)(1 +

(λ + L)−1 ∂ξ ψ

L2 (w,) ˜

≤

−2

1 ψL2 (w,) ) 1+ , (4.21) ˜ F(|λ|)

C(, φ, β) ψL2 (w,) , ˜ F(|λ|)

(4.22)

and (λ + L)−1 ψ

H 1 (w,) ˜

≤

where F(|λ|) = {|λ| if |λ| ≤ 1;

C(, φ, β) ||ψ||H 1 (w,) , if f

∞ < ∞, ˜ 2 F(|λ|)

(4.23)

√ |λ| if |λ| ≥ 1}.

Note that for all n ∈ Z, −(2π n/)2 is an eigenvalue of −L with eigenfunction ξ . So the spectral gap of −L depends on , and for large the spectral gap shrinks. In fact ≤ (2π/)2 , and for large , = (2π/)2 . Consider now the adjoint operator −L∗ of −L in H 2 (w, ˜ ), given by ei(2πn/)x U

1 [ − c0 ∂ξ + f (U (ξ ))]w˜ w ˜    + 1 − √2 c ∂ + f (U (ξ )) − 0 ξ w =   − c0 ∂ξ + f (U (ξ )) for bistable.

−L∗ =

c0

c02 e 2 2w

ξ

for ignition,

Zero is a simple eigenvalue of −L∗ with eigenfunction Uξ∗ = C w1˜ ec0 ξ Uξ , where the constant C is chosen so that Uξ , Uξ∗ = 1. w, ˜

464

B. Winn

Corollary 2. Assume f is either an ignition nonlinearity with the properties (4.15) or a bistable nonlinearity with the properties (4.16) and f ∈ C 2 on neighborhoods of 0 and 1. Define the weight w˜ to be w(ξ ) = (1 + eξ c0 /2 )2 if f is ignition and 1 if f is bistable. Then zero is a simple isolated eigenvalue of −L∗ as an operator on H 2 (w, ˜ ), and the rest of the spectrum lies in a parabolic region strictly in the left half plane. More specifically the set {λ = 0} ∩ {| arg(λ − λ1 )| < π} ∩ P ∩ {| arg(λ + (2π/)2 )| < π } is in the resolvent of −L∗ . Furthermore, for all > 0, φ ∈ ( π2 , π), 0 < β < 1, and λ ∈ {λ = 0} ∩ {| arg(λ − λ1 )| < φ} ∩ P ∩ {Re λ ≥ −β(2π/)2 } ∪ {λ : | arg(λ)| ≤ φ} we have the bounds C(, φ, β) ψL2 (w,) ≤ , (λ + L∗ )−1 ψ 2 ˜ L (w,) ˜ |λ| (λ + L∗ )−1 ψ ∗ −1

(λ + L ) and

ψ

H 2 (w,) ˜

H 1 (w,) ˜

≤

C(, φ, β) ψL2 (w,) , ˜ F(|λ|)

≤ C(, φ, β)(1 +

(λ + L∗ )−1 ∂ξ ψ

L2 (w,) ˜

where F(|λ|) = {|λ| if |λ| ≤ 1;

≤

−2

1 ψL2 (w,) ) 1+ , ˜ F(|λ|)

C(, φ, β) ψL2 (w,) , ˜ F(|λ|)

√ |λ| if |λ| ≥ 1}.

Assume from here on that f is an ignition or bistable nonlinearity as in (4.15) or (4.16) and f ∈ C 2 on neighborhoods of 0 and 1. We now take a moment to consider the restriction of −L to the orthogonal complement of its zero-eigenspace. Define P1 to be the projection operator onto the zero-eigenspace of −L, and let ∗ ∗ P2 = I −P1 . Then P1 ψ = ψ, Uξ Uξ . Note that LP1 ψ = ψ, Uξ LUξ = 0 and w, ˜ w, ˜ P1 Lψ = Lψ, Uξ∗ Uξ = ψ, L∗ Uξ∗ Uξ = 0, so that L commutes with P1 and w, ˜

w, ˜

hence also with P2 . Finally, define ϒ to be the orthogonal complement of the zero-eigenspace of −L in H 2 (w, ˜ ): 2 2 ∗ ϒ = ϒH 2 (w,) = P2 H (w, ˜ ) = ψ ∈ H (w, ˜ ) : ψ, Uξ = 0 . (4.24) ˜ w, ˜

Denote by L|ϒ the restriction of L to ϒ. Since zero is an isolated point of the spectrum of −L, it is in the resolvent of L|ϒ . So (λ + L|ϒ )−1 not only satisfies the bounds (4.19)–(4.23), but is also bounded near λ = 0. Thus L|ϒ (and L|∗ϒ ) is a sectorial operator [5], and −L|ϒ is the infinitesimal generator of an analytic semigroup {e−tL|ϒ }t≥0 , where 1 e−tL|ϒ = (λ + L|ϒ )−1 eλt dλ, 2πi C and C is a contour in the resolvent of −L|ϒ that

Traveling Fronts in a Reactive Boussinesq System

465

1. lies strictly in the left half-plane 2. stays outside a sector containing the spectrum of −L|ϒ 3. tends to infinity along rays with arguments ±γ for some γ ∈ (π/2, π ). Since L|ϒ is sectorial and Re σ (L|ϒ ) > 0, fractional powers of L|ϒ may be defined for any µ > 0 as ∞ 1 −µ L|ϒ = t µ−1 e−tL|ϒ dt, (µ) 0 µ −µ −1 , L|ϒ = L|ϒ where

(µ) =

∞

t µ−1 e−t dt.

0

For 0 < µ < 1, this definition is equivalent to sin(π µ) ∞ −µ −µ L|ϒ = λ (λ + L|ϒ )−1 dλ. π 0 The bounds (4.19)–(4.23) on the resolvent operator and the definitions of the exponential of −L|ϒ and fractional powers of L|ϒ combine in a straightforward way to yield the following bounds which will be needed in our proof of stability. Similar bounds hold for the adjoint operator L|∗ϒ . Proposition 3. Assume f is either an ignition nonlinearity with the properties (4.15) or a bistable nonlinearity with the properties (4.16) and f ∈ C 2 on neighborhoods of 0 and 1. Define the weight w˜ to be w(ξ ) = (1 + eξ c0 /2 )2 if f is ignition and 1 if f is bistable. Fix α with 0 < α < min{(2π/)2 , −λ1 , k(f )} ≤ , where k(f ) = −f (1) for ignition, and k(f ) = min{−f (0), −f (1)} for bistable. Then the exponential satisfies the bounds e−tL|ϒ ψ e−tL|ϒ ψ

H 2 (w,) ˜ H 1 (w,) ˜

e−αt ψL2 (w,) , ˜ t e−αt ≤ C(α, ) √ ψL2 (w,) , ˜ t ≤ C(α, )

e−tL|ϒ ψ 1 ≤ C(α, )e−αt ||ψ||H 1 (w,) , if f

∞ < ∞, ˜ H (w,) ˜ −tL|ϒ ψ 2 ≤ C(α, )e−αt ψL2 (w,) . e ˜ ˜ L (w,)

For

1 2

< µ < 1 the fractional powers of L|ϒ satisfy the bound −µ

L|ϒ ψ

H 1 (w,) ˜

Moreover, for µ ≥ 0 we have the bound µ −tL|ϒ L|ϒ e 2 2

≤ C(µ) ψL2 (w,) . ˜

L (w,)→L ˜ (w,) ˜

≤ C(µ, α, )t −µ e−αt .

466

B. Winn

4.2. Stability of the Traveling Waves. We return now to investigating the stability of the family of planar traveling wave solutions. Recall that τ , v, and ω˜ satisfy ∂t τ = −Lτ + F,

(4.25)

1 ∂t v = σ v + (c0 + ε∂t c1 )∂ξ v − ερv · ∇v − ∇ p˜ + σ τ eˆξ + σ U eˆξ , ε ∇ · v = 0, ∂t ω˜ = σ ω˜ + (c0 + ε∂t c1 )∂ξ ω˜ − ερv · ∇ ω˜ + σ ∂x τ, lim τ = 0,

ξ →±∞

lim v = 0, and

ξ →±∞

(4.26) (4.27) (4.28)

lim ω˜ = 0,

ξ →±∞

where −L = + c0 ∂ξ + f (U ), F = ε∂t c1 ∂ξ τ − ερv · ∇τ + (∂t c1 − ρv2 )Uξ + εN, 1 N = 2 [f (U + ετ ) − f (U ) − f (U )ετ ]. ε

(4.29) (4.30) (4.31)

Again we assume f is an ignition or bistable nonlinearity as in (4.15) or (4.16) and f ∈ C 2 on neighborhoods of 0 and 1. Our method follows closely many of the ideas of Xin and Malham in [8], which examines a slightly different coupled Boussinesq system and establishes neutral stability of nonplanar fronts generated by a shear flow. Let us attempt to simplify the situation and to deal with any zero-eigenspace by applying the projections P1 , P2 to τ . We begin by examining the projection P1 τ which has the equation ∂t P1 τ = P1 F = ε∂t c1 ∂ξ τ − ερv · ∇τ + (∂t c1 − ρv2 )Uξ + εN, Uξ∗ Uξ w, ˜

+ −ερv · ∇τ − ρv2 Uξ + εN, Uξ∗ = ∂t c1 1 + ε ∂ξ τ, Uξ∗ w, ˜

w, ˜

Let us simplify this equation by choosing ∂t c1 so that P1 F = 0: − −ερv · ∇τ − ρv2 Uξ + εN, Uξ∗ w, ˜ ∂t c1 = . ∗ 1 + ε ∂ξ τ, Uξ

Uξ .

(4.32)

w, ˜

This choice of ∂t c1 is clearly possible for small ε and small time, so that the denominator stays nonzero. We will later show that for ε sufficiently small, our choice of ∂t c1 will stay bounded so that it is valid for all time. Then P1 τ (t) = P1 τ0 = τ0 , Uξ∗ Uξ w, ˜ 1 [T0 (x, ξ ) − U (ξ )]Cec0 ξ Uξ (ξ ) dξ dx = Uξ ε # " C = Uξ T 0 (ξ ) − U (ξ ) ec0 ξ Uξ (ξ ) dξ, ε R

Traveling Fronts in a Reactive Boussinesq System

467

where T 0 (ξ ) = 1 0 T0 (x, ξ ) dx. Assume T0 decays exponentially at infinity, so that ˜ , ) for some 0 < c˜ < c . For ignition, this is a reasonable assumption, T0 ∈ L2 (ecξ 0 2 ˜ , ). For bistable nonlinearities, this is an additional restricsince L (w, ) ⊂ L2 (ecξ tion. Since U is monotonic and T 0 −U → 0 as ξ → ±∞, there must be a γ ∈ R such that # " T 0 (ξ ) − U (ξ + γ ) ec0 ξ Uξ (ξ + γ ) dξ = 0. R

To see this, define

# U (ξ + γ ) − T 0 (ξ ) ec0 ξ |Uξ (ξ + γ )| dξ R " # −c0 γ =e U (y) − T 0 (y − γ ) ec0 y |Uξ (y)| dy. "

G(γ ) =

R

We begin by showing that G is negative for large γ . Let p1 = U −1 (1/4) so that U (y) < 1/4 for y > p1 . Choose b ≥ p∗ large enough so that ∞ 1. q∗ ec0 p∗ b [U (y) − T (y)]dy ≤ 1. ec0 p1 1 2. Uz ∞ + 1 < inf |ec0 y Uz (y)| · (b − p1 ). c0 2 y≥p1 Since T 0 (z) → 1 as z → −∞, there exists a γ1 ≥ 0 such that T 0 (z) ≥ 3/4 for z ≤ b−γ and γ ≥ γ1 . Then for γ ≥ γ1 we have p1 b ∞ # " ec0 γ G(γ ) = + + U (y) − T 0 (y − γ ) ec0 y |Uξ (y)| dy −∞ p1 b

c p 0 1 e 1 3 ≤ Uz ∞ + − inf ec0 y Uz (y) · (b − p1 ) c0 4 4 y≥p1 ∞ [U (y) − T (y − γ )]dy +q∗ ec0 p∗ b

ec0 p1 1 = Uz ∞ − inf ec0 y Uz (y) · (b − p1 ) − q∗ ec0 p∗ c0 2 y≥p1 ∞ c0 p∗ +q∗ e [U (y) − T (y)]dy

b

T (y)dy b−γ

b

ec0 p1 1 ≤ Uz ∞ − inf ec0 y Uz (y) · (b − p1 ) + 1 c0 2 y≥p1 < 0. We now show that G is positive for γ negative enough. −c0 γ c0 y G(γ ) = e U (y)e |Uξ (y)| dy − T 0 (ξ )ec0 ξ |Uξ (ξ + γ )| dξ ≥ e−c0 γ

R R

R

U (y)ec0 y |Uξ (y)| dy − 1/2

R

˜ |T 0 (ξ )|2 ecξ dξ

1/2

˜ |Uξ (ξ + γ )|2 e(2c0 −c)ξ dξ R c˜ = e−c0 γ U (y)ec0 y |Uξ (y)| dy − e−(c0 − 2 )γ T 0 L2 (ecξ˜ ,R) Uξ L2 (e(2c0 −c)ξ ˜ ,R) ,

R

468

B. Winn

which is positive for γ negative enough. So the continuity of G implies that it has a zero somewhere. Without loss of generality we may assume that U is chosen so that γ = 0. Then P1 τ = 0. So the zero-eigenspace is not a problem. Let us now examine the projection P2 τ . Since P1 τ = 0, we have τ = P2 τ and τ0 = P2 τ0 . We also know P1 F = 0, and hence P2 F = F . So the solution τ satisfies ∂t τ = −L|ϒ τ + P2 F and τ (t, x, ξ ) = e

−tL|ϒ

t

τ0 (x, ξ ) +

e−(t−s)L|ϒ P2 F (s) ds.

(4.33)

0

Theorem 3. Assume f is an ignition nonlinearity with the (4.15) or a bistable properties nonlinearity with the properties (4.16), f ∈ C 1 [0, 1], f

∞ < ∞, and f ∈ C 2 on neighborhoods of 0 and 1. In addition, assume T0 decays exponentially at infinity and assume the initial data u0 and T0 are invariant with respect to the symmetries of the equations, as mentioned in Sect. 2. Define the weight w˜ to be w(ξ ) = (1 + eξ c0 /2 )2 if f is ignition and 1 if f is bistable. Choose c1 (t, ε) to satisfy (4.32). Write M(t) = ||τ (t)||H 1 (w,) + ˜

√

ρ ||v(t)||H 1 () + |∂t c1 (t, ε)|.

Then for any domain width , there exist constants K = K() > 0 and δ = δ(, σ, M(0)) > 0, so that if ε, ρ < δ(, σ ), then M(t) ≤ KM(0)e−βt ,

(4.34)

where β = β() is any constant satisfying 0 < β < 41 min{4π 2 −2 , σ −2 , −f (1), −λ1 }. Moreover, δ = O((1 + σ )−2 ) and δ → 0 as → ∞. Proof. Aside from the weights, the proofs for ignition and bistable nonlinearities are identical; so we only provide details here for the slightly more complicated ignition case. The plan is to use a supersolution argument to obtain stability. First, we need to derive an inequality for M. Let us begin by examining |∂t c1 |. Recall that ∂t c1 was chosen so that ∂t c1 = −ε∂t c1 ∂ξ τ + ερv · ∇τ + ρv2 Uξ − εN, Uξ∗

w,

.

Using Taylor’s Theorem we have 1 |f (U + ετ ) − f (U ) − f (U )ετ | ε2 1 1 |f (U + µετ ) − f (U )| dµ = |τ | ε 0 1

1 f |µετ | dµ ≤ |τ | ∞ ε 0 1 = f

∞ |τ |2 . 2

|N | =

(4.35)

Traveling Fronts in a Reactive Boussinesq System

469

Consequently

|∂t c1 | ≤ ε|∂t c1 | ∂ξ τ L2 (w,) Uξ∗

√ ∗ ∇τ + ερ||v|| 2 2 L () L (w,) Uξ w L2 (w,) ∞

f ∞ τ 2L2 (w,) Uξ∗ +ρ||v2 ||L2 () ||Uξ Uξ∗ w||L2 () + ε ∞ 2 " ≤ C ε|∂t c1 | ||τ ||H 1 (w,) + ερ ||v||H 1 () ||τ ||H 1 (w,) + ρ ||v||H 1 () +ε ||τ ||2H 1 (w,) . (4.36)

We now examine ||v||H 1 () . Since v has mean zero in x, it satisfies the Poincar´e inequality ||v||L2 () ≤ ||∂x v||L2 () . So ||v||H 1 () ≤

1 + 2 ||∇v||L2 () = 1 + 2 ||ω|| ˜ L2 () .

Multiplying the vorticity equation (4.28) by ω˜ and integrating over yields 1 d ˜ 2L2 () + σ ||ω|| ˜ L2 () ||τ ||H 1 () . ||ω|| ˜ 2L2 () ≤ −σ ||∇ ω|| 2 dt Since ω˜ has mean zero in x, it also satisfies a Poincar´e inequality: ||ω|| ˜ L2 () ≤ ||∂x ω|| ˜ L2 () ≤ ||∇ ω|| ˜ L2 () . Hence d σ ||ω|| ˜ L2 () ≤ − 2 ||ω|| ˜ L2 () + σ ||τ ||H 1 (w,) . dt Applying Gronwall’s inequality, we obtain t 2 −(σ/2 )t ||ω(t)|| ˜ ≤ || ω(0)|| ˜ e + σ e−(σ/ )(t−s) ||τ (s)||H 1 (w,) ds. 2 2 L () L () 0

Thus ||v(t)||H 1 () t 2 −(σ/2 )t 2 ≤ 1 + 2 ||ω(0)|| ˜ e + σ 1 + e−(σ/ )(t−s) ||τ (s)||H 1 (w,) ds. L2 () 0

(4.37) Finally, let’s examine the term ||τ ||H 1 (w,) . Recalling the equality (4.33), we have ||τ (t)||H 1 (w,)

t

e−(t−s)L|ϒ P2 F (s) 1 ds H (w,) t ≤ C()e−αt ||τ0 ||H 1 (w,) + ερ e−(t−s)L|ϒ P2 (v · ∇τ )(s)

≤ e

−tL|ϒ

τ0

H 1 (w,)

+

0

0

t e−α(t−s) +C() {ε|∂t c1 | ∂ξ τ (s)L2 (w,) √ t −s √0 +ρ Uξ w ∞ ||v2 (s)||L2 () + ε N (s)L2 (w,) } ds,

H 1 (w,)

ds

470

B. Winn

where 0 < α < min{(2π/)2 , −f (1), −λ1 } ≤ . By the equality (4.35) 2 1 f

∞ τ L2 (w,) 2 √ 2 1 ≤ f

∞ τ w L4 () 2 √ √ ≤ C||τ w||L2 () ||∇(τ w)||L2 () , by Gagliardo-Nirenberg $ c % 0 τ L2 (w,) ||τ ||H 1 (w,) . ≤ C max 1, 2

N L2 (w,) ≤

We proceed now to bound the term e−(t−s)L|ϒ P2 (v · ∇τ )(s) 1 2

with < µ1 < 1, and then choose µ2 with 0 < µ2 < and µ2 < 21 . So e−(t−s)L|ϒ P2 (v · ∇τ )(s) −µ

. Choose µ1

H 1 (w,) 1−µ1 . Note that 21 < µ1 +µ2

H 1 (w,)

= L|ϒ 1 L|ϒ1 e−(t−s)L|ϒ P2 (v · ∇τ )(s) 1 H (w,) µ1 −(t−s)L|ϒ P2 (v · ∇τ )(s) 2 ≤ C(µ1 ) L|ϒ e L (w,) µ1 +µ2 −(t−s)L|ϒ −µ2 = C(µ1 ) L|ϒ e L|ϒ P2 (v · ∇τ )(s) 2 L (w,) −µ2 µ1 +µ2 −(t−s)L|ϒ ≤ C(µ1 ) L|ϒ e P (v · ∇τ )(s) 2 L| 2 ϒ 2 µ

L (w,)→L (w,)

≤ C(µ1 , µ2 , )

e−α(t−s) (t − s)µ1 +µ2

−µ2 L|ϒ P2 (v · ∇τ )(s)

2 , −f (1), −λ } ≤ . where 0 < α < min{(2π/) 1 −µ2 Examining the term L|ϒ P2 (v · ∇τ )(s) 2

L (w,)

−µ

<1

L|ϒ 2 P2 (v · ∇τ ), g

w,

L2 (w,)

,

we see that

−µ = v · ∇τ, P2 (L|ϒ 2 )∗ g w, & ' = v · ∇τ, (L|∗ϒ )−µ2 P2 g w, √ ≤ ||v||Lp () ∇τ L2 (w,) w(L|∗ϒ )−µ2 P2 g √ ≤ ||v||H 1 () ||τ ||H 1 (w,) w(L|∗ϒ )−µ2 P2 g

where 21 + p1 + q1 = 1. √ It remains to bound w(L|∗ϒ )−µ2 P2 g

Lq ()

√ w(L|∗ϒ )−µ2 P2 g Lq () ∞ √ −t (L|ϒ )∗ 1 ≤ t µ2 −1 we P2 g (µ2 ) 0 ∞ √ −t (L|ϒ )∗ 1 t µ2 −1 we P2 g ≤ (µ2 ) 0

L2 (w,)

Lq () Lq ()

,

,

Lq () θ H 2 ()

dt √ −t (L|ϒ )∗ we P2 g

1−θ L2 ()

dt

Traveling Fronts in a Reactive Boussinesq System

by Gagliardo-Nirenberg for 0 ≤

1 2

−

1 q

471

≤ θ ≤ 1,

∞ 1−θ θ 1 ∗ −t (L|ϒ )∗ t µ2 −1 e−t (L|ϒ ) P2 g 2 P2 g 2 dt e H (w,) L (w,) (µ2 ) 0 ∞ C() t µ2 −1−θ e−αt dt P2 gL2 (w,) , ≤ (µ2 ) 0 ≤ C(, µ2 ) gL2 (w,) ,

=

for µ2 > θ. Hence for µ = µ1 + µ2 ,

1 2

e−(t−s)L|ϒ P2 (v · ∇τ )(s)

< µ < 1, we have the bound

H 1 (w,)

≤ C(, µ)

e−α(t−s) ||v||H 1 () ||τ ||H 1 (w,) . (t − s)µ

Then ||τ (t)||H 1 (w,) ≤ C()e−αt ||τ0 ||H 1 (w,) + C(, µ) +C() 0

t

t 0

e−α(t−s) ερ ||τ ||H 1 (w,) ||v||H 1 () ds (t − s)µ

e−α(t−s) {ρ ||v(s)||H 1 () + ε|∂t c1 | ||τ (s)||H 1 (w,) √ t −s

+ε ||τ (s)||2H 1 (w,) } ds.

(4.38)

Combining the bounds (4.36), (4.37), and (4.38) we obtain M(t) ≤

2 √ −αt ||τ0 ||H 1 (w,) ρ 1 + 2 e−(σ/ )t ||ω(0)|| ˜ L2 () + C()e √ √ 2 2 +C[εM (t) + ε ρM (t) + ρM(t)] t 2 √ 2 +σ ρ 1 + e−(σ/ )(t−s) M(s) ds 0 t 1 1 √ +C() +√ e−α(t−s) [ε ρM 2 (s) + εM 2 (s) µ (t − s) t −s 0 √ + ρM(s)] ds.

Let us simplify things a bit by choosing η = min{α, σ/2 } so that we only have one type of exponential, and let us assume that ε, ρ < δ, where 0 < δ < 1 is yet to be determined. Then √ M(t) ≤ C()e−ηt M(0) + C δ[M 2 (t) + M(t)] t √ 1 1+ e−η(t−s) [M 2 (s) + M(s)] ds. +[C() + ] δ(1 + σ ) (t − s)µ 0 Remember the plan is to use a supersolution argument; so we want to eliminate the integrals and get back to a differential equation. Choosing p > 2/(1 − µ) > 4 and

472

B. Winn

applying H¨older’s inequality with p and p/2 yields √ M(t) ≤ C()e−ηt M(0) + C δ M 2 (t) + M(t) √ +[C() + ] δ(1 + σ ) p1 t p2 t −η(t−s) p −η(t−s) p . e M(s) ds + e M(s) ds 0

0

Let us now eliminate the integrals by defining G(t) =

t

eηs M(s)p ds.

(4.39)

0

Then G (t) = eηt M(t)p , #1/p " M(t) = e−ηt/p G (t) , and G satisfies the equation " #1/p e−ηt/p G (t)

√ ≤ C()e−ηt M(0) + [C() + ] δ(1 + σ ) e−2ηt/p G (t)2/p + e−ηt/p G (t)1/p

+e−ηt/p G(t)1/p + e−2ηt/p G(t)2/p .

Multiplying each side by e2ηt/p and then raising to the power p, we obtain the more manageable expression eηt G (t) ≤ C1 ()e−(p−2)ηt M(0)p

+[C() + ]δ 2 (1 + σ p ) G (t)2 + eηt G (t) + eηt G(t) + G(t)2 . (4.40)

Recall that we want M(t) ≤ Ce−βt for some β > 0. So we need G (t) ≤ Ce(η−βp)t ˜ with η˜ < η. To get this, we will fix η˜ with 0 < η˜ < η and choose B large =: Ceηt enough so that B η˜ > [1 + C1 ()]M(0)p . We will now show that ˜ G(t) := Beηt

is a supersolution of (4.40) for δ small enough.

Traveling Fronts in a Reactive Boussinesq System

473

Inserting G(t) into the right-hand side of (4.40) we get for t ≥ 0, C1 ()e−(p−2)ηt M(0)p + [C() + ]δ 2 (1 + σ p ) G (t)2 +eηt G (t)+eηt G(t) + G(t)2 = C1 ()e−(p−2)ηt M(0)p + [C() + ]δ 2 (1 + σ p ) ˜ ˜ ˜ ˜ + B ηe ˜ (η+η)t + Be(η+η)t + B 2 e2ηt B 2 η˜ 2 e2ηt $ % ˜ ≤ C1 ()M(0)p + [C() + ]δ 2 (1 + σ p ) B 2 η˜ 2 + B η˜ + B + B 2 e(η+η)t ˜ ≤ B ηe ˜ (η+η)t ˜

= eηt G (t),

(4.41)

provided that δ is small enough, which is possible given our choice of B. Subtracting the inequalities (4.40) and (4.41) for G and G then leads to the relation eηt [G (t) − G (t)] $ " # ≥ C(, σ )δ 2 G (t)2 − G (t)2 + eηt G (t) − G (t) % +eηt [G(t) − G(t)] + G(t)2 − G(t)2

(4.42)

for t ≥ 0. Note now that if G (t∗ ) = G (t∗ ) for some t∗ ≥ 0, then (4.42) forces G(t∗ ) ≤ G(t∗ ). By our choice of B, G(0) > G(0) and G (0) > G (0); consequently G(t) > G(t) and G (t) > G (t) for all t ≥ 0. Hence #1/p " M(t) = e−ηt G (t) ˜ ≤ [B η] ˜ 1/p e−(η−η)t/p

for all t ≥ 0.

Acknowledgements. I would like to extend a special thanks to Peter Constantin and Lenya Ryzhik for all their help and advice during my investigations, as well as for suggesting this problem. This work was partially supported by the ASCI Flash center at the University of Chicago.

References 1. Berestycki, H.: The influence of advection on the propagation of fronts in reaction-diffusion equations. In: Nonlinear PDEs in Condensed Matter and Reactive Flows, NATO Science Series C, 569, Berestycki, H., Pomeau, Y. (eds.), Doordrecht, NL: Kluwer Acad. Publ., 2003, pp. 11–48 2. Constantin, P., Kiselev, A., Ryzhik, L.: Fronts in reactive convection: bounds, stability and instability. Commun. Pure and Appl. Math. 56, 1781–1803 (2003) 3. Fisher, R. A.: The wave of advance of advantageous genes. Ann. Eugenics 7, 355–369 (1937) 4. Frank-Kamenetski˘ı, D. A.: Diffusion and Heat Transfer in Chemical Kinetics, New York: Plenum Press, 1969 5. Henry, D.: Geometric Theory of Semilinear Parabolic Equations. New York: Springer-Verlag, 1981 6. Hodgkin, A. L., Huxley, A. F.: A quantitative description of membrane current and its application to conduction and excitation in nerves. J. Physiol. 117, 500–544 (1952) ´ 7. Kolmogorov, A. N., Petrovski˘ı, I. G., Piskunov, N. S.: Etude de l’´equation de la chaleur avec croissance de la quantit´e de mati`ere et son application a` un probl`eme biologique. Bull. Moskov. Gos. Univ Mat. Mekh. 1(6), 1–25 (1937) 8. Malham, S., Xin, J.: Global solutions to a reactive Boussinesq system with front data on an infinite domain. Comm. Math. Phys. 193, 287–316 (1998)

474

B. Winn

9. Peters, N.: Turbulent Combustion. Cambridge, UK: Cambridge University Press, 2000 10. Texier-Picard, R., Volpert, V.: Probl`ems de r´eaction-diffusion-convection dans des cylindres non born´es. C. R. Acad. Sci. Paris Sr. I Math. 333, 1077–1082 (2001) 11. Texier-Picard, R., Volpert, V.: Reaction-diffusion-convection problems in unbounded cylinders. Revista Matematica Complutense 16(1), 233–276 (2003) 12. Sattinger, D. H.: On the Stability of Waves of Nonlinear Parabolic Systems. Adv. in Math. 22, 312–355 (1976) 13. Vladimirova, N., Rosner, R.: Model flames in the Boussinesq limit: the effects of feedback. Phys. Rev. E. 67, 066305 (2003) 14. Volpert, A., Volpert, V., Volpert, V.: Traveling Wave Solutions of Parabolic Systems. Translations of Mathematical Monographs 140, Providence, RI: Amer. Math. Soc., 1994 15. Winn, B.: Doctoral Thesis. Chicago: The University of Chicago Press, 2005, to appear 16. Xin, J.: Front propagation on heterogeneous media. SIAM Rev. 42, 161–230 (2000) 17. Zel’dovich,Ya. B., Barenblatt, G. I., Librovich, V. B., Makhviladze, G. M.: The Mathematical Theory of Combustion and Explosions. New York: Consultants Bureau, 1985 Communicated by P. Constantin

Commun. Math. Phys. 259, 475–509 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1375-9

Communications in

Mathematical Physics

Dispersive Estimates for Schr¨odinger Equations with Threshold Resonance and Eigenvalue K. Yajima Department of Mathematics, Gakushuin University, 1-5-1 Mejiro, Toshima-ku, Tokyo 171-8588, Japan. E-mail: [email protected] Received: 15 October 2004 / Accepted: 28 December 2004 Published online: 28 June 2005 – © Springer-Verlag 2005

Abstract: Let H = − + V (x) be a three dimensional Schr¨odinger operator. We study the time decay in Lp spaces of scattering solutions e−itH Pc u, where Pc is the orthogonal projection onto the continuous spectral subspace of L2 (R3 ) for H . Under suitable decay assumptions on V (x) it is shown that they satisfy the so-called Lp -Lq estimates e−itH Pc up ≤ (4π|t|)−3(1/2−1/p) uq for all 1 ≤ q ≤ 2 ≤ p ≤ ∞ with 1/p + 1/q = 1 if H has no threshold resonance and eigenvalue; and for all 3/2 < q ≤ 2 ≤ p < 3 if otherwise.

1. Introduction The present paper is concerned with the time decay in Lp spaces of solutions of three dimensional Schr¨odinger equations, i∂t u = (− + V (x))u,

x ∈ R3 .

(1.1)

Throughout the paper we assume that potentials V (x) are real valued and decay at infinity at least as rapidly as |V (x)| ≤ Cx−β , 1

for some β > 5/2,

(1.2)

where x = (1 + |x|2 ) 2 . Under this condition, the operator H = − + V is selfadjoint in the Hilbert space H = L2 (R3 ) with domain D(H ) = H 2 (R3 ), the Sobolev space of order 2, and the solution in H of (1.1) which satisfies the initial condition u(0) = ϕ ∈ H is uniquely given by u(t) = e−itH ϕ in terms of the unitary operator e−itH defined by the functional calculus. The spectrum of H consists of a finite number of non-positive eigenvalues of finite multiplicities and the absolutely continuous part [0, ∞). If ϕ is an eigenfunction

476

K. Yajima

of H , u(t) = e−itH ϕ is a stationary solution and never decays in time in any sense; however, if ϕ ∈ L2c (H ), the continuous spectral subspace for H , it is a scattering solution in the sense that for a unique ϕ± ∈ H, u(t) − e−itH0 ϕ± 2 → 0 as t → ±∞

(1.3)

(cf. [13, 23, 24]), where H0 = − is the free Schr¨odinger operator. For the free Schr¨odinger equation it has long been known (see e.g. [15]) that, although e−itH0 is unitary in L2 , the solution e−itH0 u decays as t → ±∞ in Lp if p > 2 and it satisfies

e−itH0 up ≤ (4π|t|)

−3

1 1 2−p

uq ,

u ∈ L2 ∩Lq (R3 ),

(1.4)

where 1 ≤ q < 2 is the dual exponent of p: 1/p + 1/q = 1 and Lp is the Lebesgue Lp space with the norm up . This decay estimate is known as an Lp -Lq estimate and it has been a very useful and important tool for studying linear and nonlinear Schr¨odinger equations (see e.g. [16]). In view of the relation (1.3), it is natural to expect that scattering solutions of (1.1) also decay in Lp if p > 2. Indeed, under the condition that V satisfies (1.2) with β > 3 and that H is of generic type, viz. H satisfies a spectral condition at the threshold 0 (see Definition 1.1 below), estimate (1.4) with e−itH Pc in place of e−itH0 , Pc being the orthogonal projection onto L2c (H ),

e

−itH

Pc up ≤ Cp t

−3

1 1 2−p

uq ,

u ∈ L2 ∩ Lq ,

(1.5)

has recently been proved by Goldberg-Schlag ([8], see [12, 2, 30, 30, 31, 28, 25, 27] for earlier and related works). It is also known that (1.5) cannot hold for all 2 ≤ p ≤ ∞ if H is of exceptional type as it would contradict the local decay estimate of Jensen-Kato[10] or Murata[19]. In this paper, we show, when H is of exceptional type, how (1.5) is violated and propose a new estimate which replaces (1.5); when H is of generic type, we prove that (1.5) is satisfied under the assumption (1.2), relaxing the decay condition of Goldberg and Schlag [8] (see, however, the note at the end of the introduction). To state the main results of the paper we introduce some notation and recall some known facts (see also the beginning of Sects. 3 and 4). For 1 ≤ p, q ≤ ∞, Lp,q is the Lorentz space with the norm up,q ([3, 21]). For γ ∈ R, Hγ = L2 (R3 , x2γ dx) is the weighted L2 space. The spaces H−γ and Hγ are duals of each other with respect to the coupling u, v = u(x)v(x)dx. R3

z)−1

We write R0 (z) = (H0 − and R(z) = (H − z)−1 for the resolvents of H0 and H respectively. We define for λ ∈ C, iλ|x−y| e 1 G0 (λ)u(x) = u(y)dy. (1.6) 4π |x − y| We have R0 (λ2 ) = G0 (λ) for λ > 0. The integral kernel of G0 (λ) is an entire function of λ ∈ C and, using its derivatives at λ = 0, we define 1 Dj u(x) = |x − y|j −1 u(y)dy, j = 0, 1, . . . , (1.7) 4πj ! so that G0 (λ) = D0 + iλD1 + (iλ)2 D2 + · · · at least formally.

Dispersive Estimates for Schr¨odinger Equations

477

For any 1/2 < γ < β − 1/2, the operator D0 V is of Hilbert-Schmidt type in H−γ and we denote the null space of 1 + D0 V by M: 1 V (y)φ(y) M = φ ∈ H−γ : φ(x) + dy = 0 . (1.8) 4π |x − y| The space M is finite dimensional and is independent of 1/2 < γ < β − 1/2. All φ ∈ M satisfy the stationary Schr¨odinger equation −φ(x) + V (x)φ(x) = 0

(1.9)

and, conversely, any function φ ∈ H− 3 which satisfies (1.9) belongs to M. The eigen2 space E of H with eigenvalue 0 is therefore a subspace of M. The function φ ∈ M is in E if and only if V , φ = 0 and codimM E ≤ 1. The sesquilinear form −(u, V v) is an inner product in M. Definition 1.1. We say H or V is of generic type if M = {0} and is of exceptional type otherwise. H is of exceptional type of the first kind if M = {0} and E = 0; of the second kind if E = M = {0}; and of the third kind if {0} ⊂ E ⊂ M with strict inclusions. A function φ ∈ M \ E is called a resonance of H . Note that most V are of generic type: If V is of exceptional type, then λV is of generic type for all λ = 1 near λ = 1 because D0 V is compact. It is easy to see from (1.8) that the resonance φ(x) satisfies φ(x) − C|x|−1 ∈ H for some constant C = 0 and that the eigenfunctions φ ∈ E may decay as |x| → ∞ as slowly as Cx−2 in contrast to the ones with negative eigenvalues, which generally decay exponentially. We write P0 for the orthogonal projection in H onto E. As φ ∈ E satisfy |φ(x)| ≤ Cx−2 , P0 defined on L2 ∩ Lq can be extended to a bounded operator from Lq to Lp for all 1 ≤ q < 3 and 3/2 < p ≤ ∞. We abuse notation and denote such extensions also by P0 . When H is of exceptional type of the third kind, we let φ1 ∈ M be a (uniquely determined) resonance such that V , φ1 > 0, −φ1 , V φ1 = 1 and −φ1 , V φj = 0 for all φj ∈ E and define the canonical resonance ([10]) by ϕ(x) = φ1 (x) + P0 V D2 V φ1 (x).

(1.10)

Using ϕ(x), we define a constant a and a function ζ (t, x) by x2

a = 4πi|V , ϕ|−2 ,

ζ (t, x) = ei 4t ϕ(x).

(1.11)

We define a function µ(t, x), which plays a special role in what follows, by i µ(t, x) = |x|

1

(e

i|x|2 4t

−e

iθ 2 |x|2 4t

)dθ ;

(1.12)

0

µ(t) is multiplication with µ(t, x). We use the notation |f g| interchangeably with f ⊗ g to denote the rank one operator defined by the integral kernel f (x)g(y) (not f (x)g(y)).

478

K. Yajima

Definition 1.2. We define the operators R(t) and S(t) respectively by 3π

ae−i 4 R(t) = √ ζ (t, ·) ⊗ ζ (t, ·), πt

(1.13)

3π

e−i 4 S(t) = √ (−iP0 V D3 V P0 + µ(t)D2 V P0 + P0 V D2 µ(t)) . πt

(1.14)

When H is of exceptional type of the first or the second kind, we use the same notation, setting, of course, S(t) = 0 or R(t) = 0 respectively. We remark that for a constant C > 0, 1 1 |x| |ζ (t, x) − ϕ(x)| + |µ(t, x)| ≤ C min √ , , . (1.15) t |x| |t| As remarked above, eigenfunctions φ ∈ E satisfy V (x)φ(x)dx = 0. It follows that (D2 V φ)(x) are bounded and, if {φ2 , . . . , φd } is an orthonormal basis of E and wj (t, x) = µ(t, x)(D2 V φj )(x), j = 2, . . . , d, then wj (t, x) are bounded by (1.15) and S(t) may be written in the form   iπ d d e4  aj k φj ⊗ φ k + (wj (t) ⊗ φj + φj ⊗ wj (t)) . √ π t j,k=2 j =2 Theorem 1.3. (1) Let V satisfy |V (x)| ≤ Cx−β for some β > 5/2. Suppose that H is of generic type. Then, for any 1 ≤ q ≤ 2 ≤ p ≤ ∞ such that 1/p + 1/q = 1, e−itH Pc up ≤ Cp t

−3

1 1 2−p

uq ,

u ∈ L 2 ∩ Lq .

(1.16)

(2) Let V satisfy |V (x)| ≤ Cx−β for some β > 11/2. Suppose that H is of exceptional type. Then the following statements are satisfied: (i) Estimate (1.16) holds when p and q are restricted to 3/2 < q ≤ 2 ≤ p < 3 and 1/p + 1/q = 1. 3 (ii) Estimate (1.16) holds when p = 3 and q = 3/2 provided that L3 and L 2 are 3 respectively replaced by Lorentz spaces L3,∞ and L 2 ,1 . (iii) When 3 < p ≤ ∞ and 1 ≤ q < 3/2 are such that 1/p + 1/q = 1, there exists a constant Cpq such that for any u ∈ L2 ∩ Lq , −3 21 − p1 −itH Pc − R(t) − S(t) u ≤ Cpq t uq . e p

(1.17)

If H is of exceptional type of the first kind, statement (2) holds under a weaker decay condition |V (x)| ≤ Cx−β with β > 9/2. 1

We remark that (R(t) + S(t))up ≤ C|t|− 2 uq for p, q such that 3 < p ≤ ∞ 1 and 1 ≤ q < 3/2 and that (R(t) + S(t))u3,∞ ≤ C|t|− 2 u3/2,1 ; however, R(t) is not bounded from Lq to Lp for any other pairs and that Pc is, although an orthogonal projection in H, bounded in Lp only for 3/2 < p < 3 in general. Combining Theorem 1.3 and the estimate (1.15), we immediately obtain the following theorem.

Dispersive Estimates for Schr¨odinger Equations

479

Theorem 1.4. Let V satisfy |V (x)| ≤ Cx−β for some β > 11/2. Suppose that H is of exceptional type. Then, for 3 < p ≤ ∞ and 1 ≤ q < 3/2 such that 1/p + 1/q = 1, there exists a constant C such that e−itH Pc up ≤ Ct

−3( 21 − p1 )

6

(uq + x q

−5

(1.18)

u1 ) 6

−5

for any u ∈ L2 ∩ Lq which satisfies φ, u = 0 for all φ ∈ M and x q u ∈ L1 . If H is of exceptional type of the first kind, the same statement holds under the weaker decay condition |V (x)| ≤ Cx−β with β > 9/2. We display here the plan of the paper, explaining the idea of the proof of Theorem 1.3 using a slightly sloppy argument. We refer the readers to the text for a more rigorous treatment. We say that a family of operator {T (t) : t ∈ R} is regularly dispersive if it is a strongly continuous family of bounded operators in H and, in addition, it satisfies the estimate (1.16) for all 1 ≤ q ≤ 2 ≤ p ≤ ∞ such that 1/p + 1/q = 1. In Sect. 2, we collect some results, well known as the limiting absorption principle (LAP for short), on the behavior of resolvents R0 (z) and R(z) near the reals. We state them for G0 (λ) and G(λ) which is defined by G(λ) = R(λ2 ) on the upper half plane λ > 0. We also record some results on certain integrals. Lemma 2.4 and Lemma 2.7 are the main tools and are frequently used in the paper. We prove the first statement of Theorem 1.3 for the generic case in Sect. 3, following basically the argument of [25] and [8] but more concisely. We use the well known representation formula of the propagator: 1 2 −itH e Pc = lim e−itλ G(λ)λdλ. (1.19) δ↓0 iπ |λ|>δ Here the principle value is taken to remove the contribution from P0 . We write as G(λ) = (1 + G0 (λ)V )−1 G0 (λ) and expand (1 + G0 (λ)V )−1 : G(λ) =

2

(−1)n G0 (λ)(V G0 (λ))n − G0 (λ)V G(λ)V G0 (λ)V G0 (λ).

n=0

Then e−itH P

c = 0 (t)− 1 (t)+ 2 (t)+W3 (t). An explicit computation using Lemma 2.4 shows that the integral kernel of n (t) is given by n √ A2 A j π j =1 V (xj ) i 4tj dx1 , . . . , dxn e n (t, x, y) = √ 3 n+1 2 it 2 R3j j =1 |xj − xj −1 |

with x0 = x and xn+1 = y and Aj =

n+1

j =1 |xj

− xj −1 |. As is shown by [25], 3

| n (t, x, y)| ≤ C|t|− 2

and n (t) is regularly dispersive. We write N (λ) = G(λ)V G0 (λ) and apply integration by parts with respect to λ, which gives 1 2 W3 (t) = e−itλ (G0 (λ)V N (λ)V G0 (λ)) dλ. 2πt R Out of three integrals produced after differentiation, we explain here how to treat the one with G0 (λ)V N (λ)V G0 (λ) as a prototype, which we denote by W31 (t). It is important to notice that, if we denote the integral kernel of L(λ) = xσ V N (λ)V xσ by

480

K. Yajima

L(λ, z2 , z1 ), then that of W31 (t) may be given by using the solution of the one-dimensional free Schr¨odinger equation by W31 (t, x, y) = √

1 2πt

R

ˇ · , z2 , z1 ))(A) 1 (eit L( dz1 dz2 . 2 σ 16π z2 |x − z2 |z1 σ |z1 − y|

(1.20)

Here A = |x − z2 | + |z1 − y|, is the one dimensional Laplacian acting on the variable denoted by · and Lˇ is the inverse Fourier transform of L with respect to the variable λ. We have ˇ · , z2 , z1 ))(A)| ≤ Ct − 2 L(·, z2 , z1 )H s , |(eit L( 1

(1.21)

provided s > 1/2. The LAP stated in Sect. 2 implies that L(λ, z2 , z1 ) is indeed an L2 (Rz62 ,z1 )-valued H s (Rλ ) function of λ for some σ > 1/2 and s > 1/2. Applying the 3

Schwarz inequality to (1.20) and using (1.21), we then obtain |W31 (t, x, y)| ≤ C|t|− 2 . 3 Other integrals may be estimated similarly and we obtain |W3 (t, x, y)| ≤ C|t|− 2 . This proves statement (1) of Theorem 1.3 by the help of interpolation theory. We study exceptional cases in Sect. 4. When H is of exceptional type, we break up (1.19) into two parts, e−itH Pc = Wh (t) + Wl (t), the high and the low energy parts, by inserting a partition of unity χl (λ) + χh (λ) = 1 into the integrand, where χl ∈ C0∞ (R) is even and χl (λ) = 1 for |λ| < λ0 /2 and χl (λ) = 0 for |λ| > λ0 for a small positive constant λ0 . The argument of Sect. 3 for the generic case shows that the high energy part Wh (t) which contains χh is regularly dispersive. For the low energy part Wl (t) we write G(λ) = G0 (λ) − G0 (λ)V (1 + G0 (λ)V )−1 G0 (λ) in the integrand. The integral which contains χl (λ)G0 (λ) may be treated as in the generic case and it is regularly dispersive. We are left with −1 2 Wl0 (t) = lim χl (λ)e−itλ G0 (λ)V (1 + G0 (λ)V )−1 G0 (λ)λdλ. (1.22) δ↓0 iπ |λ|>δ

We study Wl0 (t) by examining the behavior of (1+G0 (λ)V )−1 as λ → 0. After some preparation, we study it when H is of exceptional type of the first kind in Subsect. 4.3, the second kind in Subsect. 4.4 and, synthesizing the results of previous two subsections, the third kind in Subsect. 4.5. If H is of exceptional type of the first kind we have (see Theorem 4.8) (1 + G0 (λ)V )−1 = I + K(λ) − aλ−1 |φV φ|, where V K(λ) satisfies the property similar to that of L(λ) in (1.20) and a is the constant defined in (1.11). Integral (1.22) with I + K(λ) in place of (1 + G0 (λ)V )−1 can then be studied by the method of Sect. 3 for W31 (t) and it produces a regularly dispersive family of operators. On the other hand −aλ−1 |φV φ| produces a 2 Wl (t) = χl (λ)e−itλ G0 (λ)|V φV φ|G0 (λ)dλ, (1.23) πi R and its integral kernel may be computed explicitly: c(t, A)V (z1 )V (z2 )φ(z1 )φ(z2 ) Wl (t, x, y) = a dz1 dz2 , 16π 2 |x − z2 ||z1 − y| R6

(1.24)

Dispersive Estimates for Schr¨odinger Equations

481

where A = |x − z2 | + |z1 − y| and c(t, A) is given by 1 c(t, A) = πi

χl (λ)e

−itλ2 +iλA

dλ =

e−

R

2 iA2 is A e 4t . F e 4t χˇ l √ 2t πt i3π 4

(1.25)

Here F is the Fourier transform. This is except for a normalization constant the well known formula for solutions of the one dimensional free Schr¨odinger equation. Since 1 1 |c(t, A)| ≤ Ct − 2 , we have |Wl (t, x, y)| ≤ Ct − 2 x−1 y−1 . Since x−1 ∈ L3,∞ , H¨older’s inequality in Lorentz spaces implies 1

Wl (t)u3,∞ ≤ Ct − 2 u3/2,1 .

(1.26)

We have shown above that e−itH Pc − Wl (t) is regularly dispersive and it also satisfies (1.26). Hence 1

e−itH Pc u3,∞ ≤ Ct − 2 u3/2,1 ,

(1.27)

and statement (2)(ii) of Theorem 1.3 follows for this case. By virtue of the interpolation theorem for Lorentz spaces, (1.27) and the obvious L2 bound e−itH Pc u2 ≤ Cu2 imply statement (2) (i). To prove statement (2)(iii), we first note that (1.27) and the bound |ϕ(x)| ≤ Cx−1 imply 1

(e−itH Pc − R(t))u3,∞ ≤ Ct − 2 u 3 ,1 . 2

If we replace in the right of (1.25) first e e

2 +|y|2 4t

i |x|

is 2 4t

by 1, then χl (A/2t) by 1 and finally e

, we obtain |y|2 |x|2 c(t, A) − (π t)− 21 e−i 3π4 ei 4t ei 4t ≤ Ct − 23 xyz1 2 z2 2 .

(1.28) iA2 4t

by

(1.29)

We insert (1.29) into (1.24) and recall that φ(x) = −D0 V φ. This produces 3π 3 ae−i 4 ζ (t, x)ζ (t, y) ≤ Ct − 2 . Wl (t, x, y) − √ πt Since e−itH Pc − Wl (t) is regularly dispersive, it then follows that 3

(e−itH Pc − R(t))u∞ ≤ Ct − 2 u1 .

(1.30)

Interpolating (1.28) and (1.30), we obtain statement (2)(iii) of the theorem. If H is of exceptional type of the second or the third kind, which will be discussed in Subsects. 4.2. and 4.3 respectively, (1 + G0 (λ)V )−1 contains singularities also of order λ−2 and the argument becomes a bit more involved. However, basically the same idea works. We refer to the text for the details. We use the following notation and conventions. For s, σ ∈ R, H s (Rd ) is the Sobolev space of order s on Rd and Hσs (R3 ) = {u : xσ u ∈ H s (R3 )} is the weighted Sobolev space. For Hilbert spaces X and Y, B(X , Y) is the Banach space of bounded operators from X to Y, B(X ) = B(X , X ) and B2 (X ) is the Hilbert space of Hilbert-Schmidt

482

K. Yajima

operators in X . We denote by C the complex plane, C+ = {z ∈ C : z > 0} is the + + upper half plane and C is the closed upper half plane: C = {z : z ≥ 0}. For a ∈ R, a− (resp. a+) denotes any number smaller (resp. larger) than a. In what follows we always assume that V at least satisfies (1.2) although some statements hold under less stringent conditions, and after Sect. 3 we shall assume much stronger decay conditions. We occasionally use the physics notation |v and u|v to denote vectors and the inner product. After submission of this paper we were informed that Theorem 1.3 (1) for the generic case has recently been proved by Goldberg [7] for more general potentials V ∈ Lr (R3 ) ∩ Ls (R3 ), r < 3/2 < s, and that a result similar to statement (2) of Theorem 1.3 has been obtained by Erdoˇgan and Schlag [6] under a slightly stronger decay condition on the potentials. We thank Professor Piero D’Ancona and the anonymous referee for bringing this to our attention. 2. Preliminaries In this section we collect some results on the resolvents, G0 (λ) and G(λ), and estimates on the integrals which will often appear in the sequel. 2.1. Resolvents. We recall that for λ ∈ C, 1 G0 (λ)u(x) = 4π

eiλ|x−y| u(y)dy. |x − y|

(2.1)

For λ > 0, G0 (λ) is a B(H)-valued analytic function and R0 (λ2 ) = G0 (λ). Lemma 2.1. (1) Let σ, τ > 1/2 and σ +τ > 2. Then, x−σ G0 (λ)x−τ is a B2 (H)-val+ ued C ρ function of λ ∈ C for any ρ such that ρ < min(τ +σ −2, τ −1/2, σ −1/2). If ρ = j + κ, j = 0, 1, . . . , and 0 ≤ κ < 1, we have x−σ (G0 (λ) − G0 (µ))x−τ B2 ≤ C. |λ − µ|κ λ =µ

sup x−σ G0 (λ)x−τ B2 + sup (j )

λ∈C

+

(j )

(j )

We have G0 (λ)∗ = G0 (−λ) when λ ∈ R. 1 (2) Let σ > 1/2. Then, x−σ G0 (λ)x−σ is a B(H)-valued C σ − 2 function of λ ∈ + C \ {0}. For j = 0, 1, . . . , we have x−σ −j ∂λ G0 (λ)x−σ −j B(H) ≤ Cj |λ|−1 , j

|λ| ≥ 1.

(2.2)

Proof. (1) Write m = min(τ + σ − 2, τ − 1/2, σ − 1/2). We may assume τ ≤ σ . Suppose first that 0 < m ≤ 1. Then, τ ≤ 3/2 and without losing generality we may assume τ < 3/2. We then have, with xˆ = x/|x|, dxdy 1 x−σ G0 (λ)x−τ 2B2 = 2σ 2 16π R6 x |x − y|2 y2τ dx dy ≤ 2τ −1 |xˆ − y|2 |y|2τ 3 x2σ R3 |x| R Cdx <∞ ≤ 2σ 2τ −1 R3 x |x|

Dispersive Estimates for Schr¨odinger Equations

483

and x−σ G0 (λ)x−τ B2 is uniformly bounded. Here we changed variables y to |x|y and used |x|y ≥ |x||y| in the second step, 2 + 2τ > 3 in the third and 2σ + 2t − 1 > 3 in the last. Since 0 < ρ < 1, we have |eia − eib | ≤ 2ρ |a − b|ρ and we may likewise estimate as follows: |eiλ|x−y| − eiµ|x−y| |2 −σ −τ 2 x (G0 (λ) − G0 (µ))x B2 (H) = dxdy 2σ 2τ 2 2 R6 16π x |x − y| y C|λ − µ|2ρ dxdy C1 |λ − µ|2ρ dx ≤ ≤ C2 |λ − µ|2ρ . ≤ 2σ 2σ 2τ −2ρ−1 2−2ρ y2τ R6 x |x − y| R3 x x Here we used 2τ − 2ρ + 2 > 3 in the second step and 2τ + 2σ − 2ρ − 1 > 3 in the last step. This proves (1) when 0 < m ≤ 1. If j < m ≤ j + 1, j = 1, 2, . . . , we have (j ) m = τ −1/2. Write ρ = j +κ, 0 ≤ κ < 1. The j th derivative G0 (λ) has integral kernel (j ) (4π )−1 i j eiλ|x−y| |x − y|j −1 and xσ G0 (λ)xτ B2 ≤ C follows entirely similarly as above. As 1 < 2(τ − ρ) < 3 and σ ≥ τ > 3/2, we have x−σ (G0 (λ) − G0 (µ))x−τ 2B2 (H) ≤ C (j )

≤ C1

(j )

|λ − µ|2κ (|x|2ρ R3

+ |y|2ρ )dxdy

x2σ |x − y|2 y2τ

R6

|λ − µ|2κ |x − y|2(ρ−1) dxdy x2σ y2τ

≤ C2 |λ − µ|2κ .

Statement (1) follows. Statement (2) is well known (see [1] and [11]).

Recall that we are assuming (1.2). The following is an obvious consequence of Lemma 2.1. Corollary 2.2. Let 1/2 < γ < β − 1/2. Then, x−γ G0 (λ)V x+γ is a B2 (H)-valued + C ρ function of λ ∈ C for any ρ < min(β − 2, γ − 21 , β − γ − 21 ). The operator valued function x+γ V G0 (λ)x−γ satisfies the same property. Under condition (1.2), it is well known (see [13]) that H = − + V has no positive eigenvalues and the point spectral subspace Hp (H ) for H is finite dimensional. Thus R(λ2 ) = (H − λ2 )−1 is a B(H)-valued meromorphic function of λ ∈ C+ with possible poles iκ1 , . . . , iκn on the imaginary axis such that −κ12 , . . . , −κn2 are eigenvalues of H . The resolvent equation implies that outside those poles in the upper half plane R(λ2 ) = G0 (λ)(1 + V G0 (λ))−1 = (1 + G0 (λ)V )−1 G0 (λ). Here V G0 (λ) (resp.G0 (λ)V ) is a B2 (Hγ )-valued (resp. B2 (H−γ )-valued) continuous + function of λ ∈ C if 1/2 < γ < β − 1/2 by virtue of Corollary 2.2 and −1 ∈ σ (V G0 (λ)) (resp. −1 ∈ σ (G0 (λ)V )) if and only if λ2 is an eigenvalue of H (see [1]). Since positive eigenvalues are absent from H as mentioned above, R(λ2 ) considered as a B(Hγ , H−γ ) valued function is continuous up to the boundary R of C+ except possibly at λ = 0. We set for λ ∈ R \ {0}, G(λ) = G0 (λ)(1 + V G0 (λ))−1 = (1 + G0 (λ)V )−1 G0 (λ).

(2.3)

484

K. Yajima

Lemma 2.3. For 1/2 < σ, τ < β − 1/2 such that σ + τ > 2, x−σ G(λ)x−τ , as a B2 (H)-valued or B(H)-valued function of λ ∈ {λ ∈ R : |λ| > ε}, ε > 0, satisfies the same smoothness and decay properties as x−σ G0 (λ)x−τ as stated in Lemma 2.1. This is true on the whole line λ ∈ R, if 1 + V G0 (0) or 1 + G0 (0)V is invertible respectively in Hγ or H−γ for some, and therefore for all, 1/2 < γ < β − 1/2. Proof. We use the same notation as in the proof of Lemma 2.1. Let 0 < m ≤ 1 first. By virtue of (2.3), Lemma 2.1 and Corollary 2.2 we have x−σ G(λ)x−τ B2 ≤ C. By telescoping the difference, we may estimate as follows: x−σ (G(λ) − G(µ))x−τ B2 ≤ x−σ (1 + G0 (λ)V )−1 xσ B x−σ (G0 (λ) − G0 (µ))x−τ B2 +x−σ (1 + G0 (λ)V )−1 xσ B x−σ (G0 (λ) − G0 (µ))x−τ B2 ×xτ V (1 + G0 (µ)V )−1 xβ−τ B xτ −β G0 (µ)x−τ B ≤ C|λ − µ|ρ , and the lemma follows for this case. When 1 < m ≤ 2, we differentiate (2.3) and use the resolvent equation. We obtain G (λ) = (1 − G(λ)V )G0 (λ)(1 − V G(λ)). We then repeat the argument above using the previous result for 0 < m ≤ 1. We omit repetitious details also for general m. By the functional calculus for selfadjoint operators, the propagator e−itH may be expressed in terms of G(λ) in the following form: 1 2 e−itH Pc = lim e−itλ G(λ)λdλ. (2.4) δ↓0 iπ |λ|>δ Equation (2.4) is the starting point for the proof of the main theorem. 2.2. Integrals. We collect here some formulae and estimates on integrals which will be of frequent use in what follows. We begin with the following lemma on the Gauss integral: Lemma 2.4. Let s > 1/2. Then, there exists a constant Cs depending only on s such that for any χ ∈ H s (R), A ∈ R, t > 0 and L > 0, e−itλ2 +iλA χ (λ/L)dλ ≤ Cs χ H s t − 21 . (2.5) R

As L → ∞ we have e R

−itλ2 +iλA

χ (λ/L)dλ → e

−i π4

e

iA2 4t

π χ (0). t

Suppose in addition that χ is even and λχ (λ) ∈ H s (R) then e−itλ2 +iλA χ (λ/L)λdλ ≤ Cs (χ H s + λχ H s )|A|t − 23 . R

(2.6)

(2.7)

Dispersive Estimates for Schr¨odinger Equations

485

As L → ∞ we have

Ae−i 4 e χ (λ/L)λdλ → 2 π

e

−itλ2 +iλA

R

iA2 4t

π χ (0). t3

(2.8)

Proof. We first prove estimate (2.5). If we write χˇ (κ) = χˆ (−κ) √ for the conjugate Fourier transform of χ , the integral on the left in (2.5) is equal to 2π (eit χˇ L )(A), where is the one dimensional Laplacian, and by virtue of the well known formula for the kernel of the propagator eit , π iA2 2 √ e−i 4 e 4t −i Ar +i r it 2π (e χˇ L )(A) = √ e 2tL 4tL2 χˇ (r)dr. 2t

(2.9)

If s > 1/2, this is bounded in modulus by 1

1

1

(2t)− 2 χˇ 1 ≤ Cs (2t)− 2 rs χˇ 2 = Cs (2t)− 2 χ H s , and (2.5) follows. Taking the limit L → ∞ in (2.9) we obtain (2.6). 2 2 Since λe−itλ = 2ti (d/dλ)e−itλ , integration by parts shows that the integral in the left of (2.7) is equal to A 1 2 −itλ2 +iλA e χ (λ/L)dλ + e−itλ +iλA χ (λ/L)L−1 dλ. 2t R 2it R The argument of the first part shows that the first summand satisfies (2.7) and it converges to the right-hand side of (2.8) as L → ∞. Since χ (λ/L) is odd, the second summand may be written in the form 1 dλ A 1 2 2 e−itλ (eiλA − e−iλA )χ (λ/L) e−itλ +iλθA ζ (λ/L)dλ dθ, = 4it R L 4t −1 R where ζ (λ) = χ (λ)λ. Applying again (2.5) and (2.9) to the λ-integral, we see that the 3 second summand is bounded in modulus by Cσ |A|(2t)− 2 ζ H s and converges to zero as L → ∞. This completes the proof. We recall the Kato norm:

V K = sup

a∈R3 R3

|V (z)|dz . |z − a|

Lemma 2.5. Let xn+1 = x and x0 = y. Then, for n = 1, 2, . . . , n (4V K )n j =1 |V (xj )| dx1 , . . . , dxn ≤ . n+1 |x − y| R3 j =1 |xj − xj −1 | Proof. By induction, it suffices to show the case n = 1: |V (z)|dz 4V K ≤ . |x − y| R3 |x − z||z − y|

(2.10)

486

K. Yajima

Change variables z to z + y and write w = x − y. We have 2 |V (z + y)|dz 2 |V (z + y)|dz ≤ ≤ V K . |w| |w − z| |w| |z|≥|w|/2 |w − z||z| If |z| < |w|/2, then |w − z| ≥ |w|/2 and |V (z + y)|dz 2 |V (z + y)|dz 2 ≤ ≤ V K . |w| |z| |w| |z|<|w|/2 |w − z||z| The lemma follows.

Following is a result of the celebrated Kato smoothness theorem ([15]): Lemma 2.6. Let T (λ), λ ∈ R, be a weakly measurable family of bounded operators in H such that xσ T (λ)xσ B(H) ≤ C for some σ > 1. Then, for t ∈ R, the weak integral ∞ 2 e−itλ G0 (λ)T (λ)G0 (λ)λdλ U (t) = −∞

converges in H and defines a bounded operator in H. The family {U (t) : t ∈ R} is strongly continuous and uniformly bounded in B(H). Proof. When σ > 1, the multiplication operator by x−σ is H0 -smooth ([15]): ∞ −σ G (λ)u2 |λ|dλ ≤ Cu2 . It follows by the Schwarz inequality that U (t) 0 2 2 −∞ x is uniformly bounded in H. It also follows by the Schwarz inequality that ∞ 2 2 |e−itλ − e−isλ |2 x−σ G0 (λ)u2 |λ|dλ (U (t) − U (s))u2 ≤ C −∞

and Lebesgue’s dominated convergence theorem implies the lemma. Lemma 2.7. Let s, σ > 1/2 and let R λ → Gσ (λ) ≡ valued H s (R) function of λ. Define 2 N (t) = e−itλ G0 (λ)N (λ)G0 (λ)dλ,

xσ N (λ)xσ

be a B2 (H)-

t = 0.

R

Then N (t) has a bounded continuous integral kernel N (t, x, y) and it satisfies 1

|N (t, x, y)| ≤ Cs |t|− 2 Gσ H s (R,B2 (H)) .

(2.11)

If σ > 3/2, then N (t, x, y) satisfies the stronger estimate, 1

|N (t, x, y)| ≤ Cs |t|− 2 x−1 y−1 Gσ H s (R,B2 (H)) . xσ +1 N (λ)xσ

If Gσ 1 (λ) = H s (R), then for any t = 0,

(resp. Gσ 2 (λ) =

N1 (t) =

R

(resp. N2 (t) = R

xσ N (λ)xσ +1 )

(2.12) is B2 (H)-valued

e−itλ G0 (λ)N (λ)G0 (λ)dλ 2

e−itλ G0 (λ)N (λ)G0 (λ)dλ) 2

has a continuous integral kernel N1 (t, x, y) (resp. N2 (t, x, y)) and it satisfies (2.11) with obvious modifications.

Dispersive Estimates for Schr¨odinger Equations

487

Proof. We take χ ∈ C0∞ (R) such that χ (λ) = 1 for |λ| ≤ 1 and define 2 e−itλ χ (λ/L)G0 (λ)N (λ)G0 (λ)dλ. NL (t) = R

If γ > 23 , G0 (λ)N (λ)G0 (λ)B(Hγ ,H−γ ) ≤ Cλ−2 Gσ (λ)B2 by virtue of Lemma 2.1(3) and NL (t) − N (t)B(Hγ ,H−γ ) → 0 as L → ∞. Denote the integral kernel of Gσ (λ) by Gσ (λ, x, y) and A = |x − z2 | + |z1 − y|. Then, z2 −σ z1 −σ 2 eitλ +iλA χ (λ/L) Gσ (λ, z2 , z1 ) dz1 dz2 dλ. NL (t, x, y) = |x − z2 | |z1 − y| R R6 For almost all (z1 , z2 ), (χ (λ/L) − 1)Gσ (λ, z2 , z1 )H s (Rλ ) → 0 as L → ∞ and (2.5) implies that 2 2 eitλ +iλA χ (λ/L)Gσ (λ, z2 , z1 )dλ → eitλ +iλA Gσ (λ, z2 , z1 )dλ R

R − 21

and that the left side is bounded by C|t| Gσ (·, z2 , z1 )H s uniformly with respect to L ≥ 1. By the Schwarz inequality, z1 −σ z2 −σ Gσ (·, z2 , z1 )H s dz1 dz2 |x − z2 | |z1 − y| z2 −σ z1 −σ ≤ (2.13) |z − x| 2 |z − y| 2 Gσ H s (R,B2 (H)) . 2 1 Lz Lz 2

1

√ It follows that |NL (t, x, y)| ≤ C/ t for all x, y ∈ R3 and, by Lebesgue’s dominated convergence theorem, that NL (t, x, y) converges to the integral kernel N (t, x, y) of N (t) as L → ∞: z2 −σ z1 −σ 2 eitλ +iλA Gσ (λ, z2 , z1 )dλ dz1 dz2 . N (t, x, y) = |x − z2 | |z1 − y| R6 Here,

2 +iλA

eitλ

Gσ (λ, z2 , z1 )dλ is an L2 (Rz61 ,z2 )-valued continuous function of 1

(t, x, y), t = 0, since it is bounded in modulus by C|t|− 2 Gσ (·, z2 , z1 )H s and, for almost all (z1 , z2 ), it is continuous with respect to (t, x, y), t = 0, as can be seen from (2.9). Then, since z1 −σ z2 −σ /|x − z2 ||y − z1 | is also a continuous function of (x, y) with values in L2 (Rz61 ,z2 ), N (t, x, y) is continuous with respect to (t, x, y) if t = 0. By virtue of (2.13), N (t, x, y) satisfies the estimate (2.11). If σ > 3/2, the right side of (2.13) is bounded by Cx−1 y−1 Gσ H s (R,B2 (H)) and (2.12) is satisfied. This proves the lemma for N (t). Modifications necessary for the proof for N1 (t) and N2 (t) are obvious and we omit the details. 3. The Case of Generic Type In this section, we prove statement (1) of Theorem 1.3. Thus we assume that |V (x)| ≤ Cx−β with β > 5/2 and that H is of generic type. We recall that D0 , D1 , . . . , are selfadjoint integral operators defined by

488

K. Yajima

Dj u(x) =

1 4πj !

|x − y|j −1 u(y)dy.

(3.1)

If j is odd, Dj is of finite rank. We have a formal expansion G0 (λ) = D0 + (iλ)D1 + (iλ)2 D2 + · · · .

(3.2)

We denote the null spaces of 1 + V D0 and 1 + D0 V considered respectively as operators in H−γ or in Hγ by M = N (1 + D0 V ),

N = N (1 + V D0 ).

Since D0 V and V D0 are compact and D0 V = (V D0 )∗ , dim M = dim N < ∞. Moreover, M and N are independent of 1/2 < γ < β − 1/2 because M (resp. N ) decreases (resp. increases) with γ (see [10]). As H is of generic type, λ → G(λ) ∈ B(Hγ , H−γ ) is continuous on R for γ > 1 by Lemma 2.3 and, by the spectral theorem, 1 1 2 −itH −itλ2 Pc = e G(λ)λdλ = lim e−itλ χL (λ)G(λ)λdλ (3.3) e L→∞ iπ R iπ R as strong convergence in H, where χL (λ) = χ (λ/L) and χ ∈ C0∞ (R) is even, χ (λ) = 1 for |λ| ≤ 1 and χ (λ) = 0 for |λ| ≥ 2. Iterating the resolvent equation G(λ) = G0 (λ) − G0 (λ)V G(λ), we insert in the right of (3.3), G(λ) =

2

(−1)n G0 (λ)(V G0 (λ))n − G0 (λ)V G0 (λ)V G(λ)V G0 (λ).

n=0

The result is e−itH Pc = 0 (t) − 1 (t) + 2 (t) + W3 (t), where for n = 0, 1, 2, 1 2 e−itλ χL (λ)G0 (λ)(V G0 (λ))n λdλ. (3.4) n (t) = lim L→∞ iπ R We have 0 (t) = e−itH0 . Lemma 2.1 and Lemma 2.6 imply sup n (t)B(H) ≤ C,

n = 0, 1, 2.

(3.5)

t∈R

Lemma 3.1. There exists a constant C > 0 such that 3 V K n u0 1 , n (t)u∞ ≤ C(n + 1)|t|− 2 4π

n = 0, 1, 2, . . . .

(3.6)

Proof. We follow the argument due to Rodnianski-Schlag [25]. The integral kernel of the operator defined by the integral on the right side of (3.4) is given with C1 = 1/4π , A = n+1 j =1 |xj −1 − xj | and dx1 , . . . , dxn = dX by n 1 j =1 V (xj ) n −itλ2 +iλA . (3.7) e λχL (λ)dλ n+1 C1 dX iπ R j =1 |xj −1 − xj | Note that the integrand is absolutely convergent by virtue of (2.10): 2n V nK |(integrand of (3.7))|dXdλ ≤ λχL 1 |x − y| R3n R

Dispersive Estimates for Schr¨odinger Equations

489

and by the help of the Fubini theorem the computation (3.7) is legitimate. Moreover, with x = xn+1 and y = x0 ,

A

n

j =1 |V (xj )|dX

j

|xj −1 − xj |

=

n+1

n

k=1

j =1 |V (xj )|dX

j =k

|xj −1 − xj |

≤ (n + 1)V nK .

Hence, Lemma 2.4 implies that (3.7) converges as L → ∞ to 2 A nj=1 V (xj ) C1n−1 i A4t dX, e n+1 3 (4iπ t) 2 j =1 |xj −1 − xj | 3

which is bounded by C(n + 1)|t|− 2

V K 4π

n . This implies the lemma.

Define N (λ) = V G0 (λ)V G(λ)V . If 0 < ε < 1/2, by virtue of Lemma 2.1, we have x1+ε N (λ)x1+ε B(H) ≤ Cλ−2 . It follows by virtue of Lemma 2.6 that 1 2 W3 (t) = − e−itλ G0 (λ)N (λ)G0 (λ)λdλ (3.8) iπ R is a strongly continuous family of uniformly bounded operators in H. By integration by parts, we may write 1 2 W3 (t) = e−itλ {G0 (λ)N (λ)G0 (λ)} dλ. (3.9) 2tπ R Differentiation in the right side produces three integrals which respectively contain G0 (λ)N(λ)G0 (λ), G0 (λ)N (λ)G0 (λ), and G0 (λ)N (λ)G0 (λ). Thus, in view of Lemma 2.7, Theorem 1.3(1) is a consequence of the following lemma and the interpolation theorem for Lp spaces. Lemma 3.2. Let |V (x)| ≤ Cx−β , β > 5/2. Then, for some σ, s > 1/2, x1+σ N (λ)xσ ,

xσ N (λ)x1+σ ,

xσ N (λ)xσ

are B2 (H)-valued H s functions of λ ∈ R. Proof. We estimate the operators by using Lemma 2.1 and Lemma 2.3. We first deal with x1+σ N (λ)xσ . If σ > 1/2 is sufficiently close to 1/2, x1+σ N (λ)xσ B2 ≤ x1+σ V G0 (λ)x−σ −1 B2 x1+σ V G(λ)V xσ B ≤ Cλ−1 . (3.10) We show for some s > 1/2 that for λ and µ ∈ R such that |λ − µ| ≤ 1, x1+σ (N (λ) − N (µ))xσ B2 ≤ Cλ−1 |λ − µ|s .

(3.11)

By reducing s by an arbitrarily small amount, two estimates (3.10) and (3.11) will imply that x1+σ N (λ)xσ ∈ H s (R, B2 (H)) for some s > 1/2 (cf. [18], Theorem 10.2).

490

K. Yajima

In what follows in the proof we choose and fix parameters σ , τ and the exponent s in such a way that 3 1 1 3 < τ < σ + 1 < 2, τ + σ < β − , < s < min{β − σ − , τ − 1} 2 2 2 2 hence, β − σ > 2 and β − τ > 1. We write N (λ) − N (µ) in the form V (G0 (λ) − G0 (µ))V G(λ)V + V G0 (µ)V (G(λ) − G(µ))V . Since β − σ > 2 and β − τ > 1, Lemma 2.3 implies xτ V G(λ)V xσ B ≤ Cλ−1 ,

x1+σ V G0 (µ)x−1−σ B ≤ Cλ−1 .

It follows by the choice of s that x1+σ V (G0 (λ) − G0 (µ))V G(λ)V xσ B2 (H) ≤ x1+σ V (G0 (λ) − G0 (µ))x−τ B2 xτ V G(λ)V xσ B ≤ C|λ − µ|s λ−1 .

(3.12)

As τ < β − σ < β − 1/2 and G(λ) and G0 (λ) satisfy similar regularity and decay properties, x1+σ V G0 (µ)V (G(λ) − G(µ))V xσ B2 ≤ x1+σ V G0 (µ)x−1−σ B · x1+σ V (G(λ) − G(µ))V xσ B2 ≤ C|λ − µ|s λ−1 .

(3.13)

The two estimates (3.12) and (3.13) imply (3.11). The operator xσ N (λ)x1+σ satisfies estimates corresponding to (3.10) and (3.11) because it is obtained from xσ +1 N (λ)xσ by taking the adjoint after replacing G0 (λ) and G(λ) respectively by G(−λ) and G0 (−λ). Finally we deal with xσ N (λ)xσ which may be written as xσ V G0 (λ)V G(λ)V xσ + xσ V G0 (λ)V G (λ)V xσ . Since β − σ > 2 and β − τ > 1, we have xσ V G0 (λ)V G(λ)V xσ B2 ≤ xσ V G0 (λ)x−τ B2 xτ V G(λ)V xσ B ≤ Cλ−1 . Replacing G0 (λ) and G(λ) and taking the adjoint in the estimate above yield xσ V G0 (λ)V G (λ)V xσ B2 ≤ Cλ−1 . It follows that xσ N (λ)xσ ≤ C.

(3.14)

Since s < β − σ − 3/2 < β − τ − 1/2 and min(τ, β − σ ) > 3/2, we have xσ V (G0 (λ) − G0 (µ))V G (λ)V xσ B2 ≤ xσ V (G0 (λ) − G0 (µ))xτ V B2 x−τ G (λ)V xσ B ≤ C|λ − µ|s λ−1 .

(3.15)

Dispersive Estimates for Schr¨odinger Equations

491

Likewise, since s < β − σ − 3/2 and |λ − µ| < 1, xσ V G0 (µ)V (G (λ) − G (µ))V xσ B2 ≤ xσ V G0 (µ)x−σ B xσ V (G (λ) − G (µ))V xσ B2 ≤ Cλ−1 |λ − µ|s .

(3.16)

Symmetrically we have xσ V G0 (λ)V (G(λ) − G(µ))V xσ B2 ≤ Cλ−1 |λ − µ|s , xσ V (G0 (λ) − G0 (µ))V G(µ)V xσ B2 ≤ Cλ−1 |λ − µ|s .

(3.17)

The combination of (3.15), (3.16) and (3.17) yields xσ (N (λ) − N (µ))xσ B2 ≤ Cλ−1 |λ − µ|s . xσ V N (λ)V xσ

The estimates (3.14) and (3.18) imply that s > 1/2. This completes the proof of the lemma.

∈

H s (R, B2 )

(3.18) for some

4. The Cases of Exceptional Type In this section we prove statement (2) of Theorem 1.3 for the case that H is of exceptional type. We first reduce the proof to the analysis of a simpler operator W0l (t) to be defined by (4.1) below. Then, because of the reasons stated in the introduction, we study it according to the type of exceptionality of H separately in Subsects. 4.3, 4.4 and 4.5. 4.1. Reduction to low energy analysis. For an even function χl ∈ C0∞ (R) such that χl (λ) = 1 near λ = 0 we define −1 2 W0l (t) = lim e−itλ χl (λ)G0 (λ)V G(λ)λdλ. (4.1) δ↓0 iπ |λ|>δ Recall that a family {T (t) : t ∈ R} of bounded operators in H is said to be regularly dispersive if it is strongly continuous and, in addition, it satisfies T (t)up ≤ Ct

−3

1 1 2−p

uq ,

u ∈ L2 ∩ Lq

(4.2)

for all 1 ≤ q ≤ 2 ≤ p ≤ ∞ such that 1/p + 1/q = 1. In this case we shall often say simply that T (t) is regularly dispersive. Lemma 4.1. The operator (t) = e−itH Pc − W0l (t) is regularly dispersive. Proof. As in the generic case, we decompose e−itH Pc in the form e−itH Pc =

2

(−1)n n (t) + W3 (t).

n=0

Recall the definition (3.8) of W3 (t). As was shown in (3.5) and (3.6), n (t) are regularly dispersive. We define the low and the high energy parts Wh (t) and Wl (t) of W3 (t) = Wh (t) + Wl (t) by −1 2 Wh,l (t) = lim e−itλ χh,l (λ)G0 (λ)N (λ)G0 (λ)λdλ, (4.3) δ↓0 iπ |λ|>δ

492

K. Yajima

where χh (λ) = 1 − χl (λ) and N (λ) = V G0 (λ)V G(λ)V . Since G(λ) has no singularities on the support of χh , it follows, by virtue of Lemma 2.7 and Lemma 3.2, and in view of the argument in Sect. 3 for the generic case, that Wh (t) is also regularly dispersive. Using the resolvent equation, we write G0 (λ)N (λ)G0 (λ) = G0 (λ)V G(λ) +

2

(−1)j (G0 (λ)V )j G0 (λ)

j =1

in (4.3) and further decompose Wl (t) = W0l (t) − W1l (t) + W2l (t): −1 2 e−itλ χl (λ)(G0 (λ)V )n G0 (λ)λdλ, 1 ≤ n ≤ 2. Wnl (t) = iπ R

(4.4)

The operator Wnl (t) is the same as the one defined by the integral in the right of (3.4) with −χl replacing χL and the proof of Lemma 3.1 implies 3

Wnl (t)u0 ∞ ≤ C|t|− 2 V K u0 1 , j

1 ≤ n ≤ 2.

(4.5)

Lemma 2.6 clearly implies that Wnl (t) are strongly continuous families of uniformly bounded operators in H. Thus, W1l (t) and W2l are regularly dispersive and so is (t) = 2 n n=0 (−1) n (t) + Wh (t) − W1l (t) + W2l (t). This proves the lemma. 4.2. Low energy resolvent analysis. Preliminary. In the following subsections we study W0l (t) separately according to the kind of exceptionality. In each case, we need to investigate the behavior of G(λ) near λ = 0. We do it mostly following Jensen-Kato [10] and we collect here some preliminary information. The following two lemmas collect Lemmas 2.4, 2.5, 2.6, 3.1, 3.2 and 3.3 of [10]. We recall the operators Dj , j = 0, 1, . . . , are defined by (3.1) and M = N (1 + D0 V ),

N = N (1 + V D0 ).

Lemma 4.2. (1) If v ∈ Hγ , 21 < γ ≤ 25 , and v, 1 = 0, D0 v ∈ Hγ2−2 (R3 ). (2) For u, v ∈ H 5 +0 such that u, 1 = v, 1 = 0, D2 u, v = −D0 u, D0 v. 2

Lemma 4.3. Let

1 2

< γ < β − 21 . Then the following statements hold:

(1) M ⊂ H 2 1 (R3 ) and (H0 + V )M = {0}. If −2−

1 2

< γ < 23 , N (H0 + V ) = M as an

2 . operator from H−γ (2) H0 and V are isomorphisms M → N . D0 is an isomorphism N → M. (3) For u ∈ M, u ∈ H if and only if u, V = 0. In this case, u ∈ H 12 (R3 ). 2−

(4) For v ∈ N , D0 v ∈ H if and only if 1, v = 0. In this case, v ∈ Hβ+ 1 − . 2

In what follows γ is always assumed to satisfy 1/2 < γ < β − 1/2. Notice that (u, D0 v) is a strictly positive quadratic form on Hγ and that V D0 is real and formally selfadjoint with respect to this form. It follows that all eigenvalues λ of V D0 are real and the eigenspaces are semi-simple: N (V D0 − λ) = N ((V D0 − λ)2 ). By the duality, the same is true for D0 V .

Dispersive Estimates for Schr¨odinger Equations

493

Lemma 4.4. There exist operators Q and K which are bounded in H−γ for any 1/2 < γ < β − 1/2 such that Q2 = Q, QK = KQ = 0 and (1 + D0 V )Q = Q(1 + D0 V ) = 0, (1 + D0 V )K = K(1 + D0 V ) = 1 − Q. (1) The projector Q is of finite rank and K − I ∈ B2 (H−γ ). (2) We have the identities V K = K ∗ V , KD0 = D0 K ∗ . Proof. The first statement is a result of the separation of the spectrum theorem ([14], p. 178). By the same theorem 1 + D0 V + Q is invertible and (I + D0 V + Q)−1 − I = −(D0 V + Q)(1 + D0 V + Q)−1 ∈ B2 (H−γ ). Since K = (1 + D0 V + Q)−1 (1 − Q), K − I ∈ B2 (H−γ ). Statement (2) may be found in Lemma 3.5 of [10]. If u = D0 u˜ and v = D0 v, ˜ u, ˜ v˜ ∈ N , −(V u, v) = (D0 u, ˜ v). ˜ It follows that −(V u, v) defines an inner product in M and the spectral projection 1 Q=− (D0 V − z)−1 dz 2πi |z+1|=δ satisfies Q∗ V = V Q. The next lemma follows. Note that D0 V and V D0 are real operators and we may choose a real basis of M and N . Lemma 4.5. Let {φ1 , . . . , φd } be an orthonormal basis of M with respect to the inner product −(V u, v). Define ψj = −V φj . Then {ψ1 , . . . , ψd } is the dual basis of N with natural coupling φj , ψk = δj k and, simultaneously, is orthonomal with respect to the inner product (D0 u, v). With these bases Q=

d j =1

|φj ψj |,

∗

Q =

d

|ψj φj |

j =1

and Q∗ is the spectral projection onto N with respect to 1 + V D0 . We have the identity QD0 = D0 Q∗ . By virtue of Lemma 4.3 (3), the 0 eigenspace E of H = − + V is a subspace of M of codimension at most one. We write Q = 1 − Q. If we define closed subspaces X−γ = QH−γ and Y−γ = ˙ −γ {u, v} → u + v ∈ H−γ is an isomorphism between QH−γ , the map X−γ +Y ˙ −γ , Banach spaces. In the direct sum decomposition H−γ = X−γ +Y M(λ) = 1 + G0 (λ)V may be written in the matrix form: M00 (λ) QM(λ)Q QM(λ)Q M(λ) = ≡ M10 (λ) QM(λ)Q QM(λ)Q

M01 (λ) . M11 (λ)

(4.6)

We often consider operators Mj k (λ) and etc. also as operators in H−γ by extending them to the complementary subspaces as zero operators.

494

K. Yajima

Lemma 4.6. There exists λ0 such that M00 (λ) : X−γ → X−γ is invertible for |λ| < λ0 and M00 (λ)−1 −I ∈ B2 (X−γ ). As a B2 (X−γ )-valued function of |λ| < λ0 , M00 (λ)−1 −I is of class C δ for δ < min(β − γ − 21 , γ − 21 , β − 2). Proof. By virtue of Lemma 2.1, M00 (λ) − 1 is a B2 (X−γ )–valued C δ function of λ and M00 (0) = Q(1 + D0 V )Q is invertible by Lemma 4.4. The lemma follows by a Neumann series expansion. The following well known lemma is very useful. ˙ 1 be a direct sum decomposition of a vector space X . Lemma 4.7. Let X = X0 +X Suppose that a linear operator L in X is written in the form L00 L01 L= L10 L11 in this decomposition and that L−1 00 exists. Set C = L11 − L10 L−1 00 L01 . Then, L−1 exists if and only if C −1 exists. In this case −1 L00 + L−1 L01 C −1 L10 L−1 −L−1 L01 C −1 −1 00 00 00 L = . −C −1 L10 L−1 C −1 00

(4.7)

4.3. Exceptional type of the first kind. In this subsection we prove Theorem 1.3 (2) when H is of exceptional type of the first kind. In this case dim M = 1 and nontrivial φ ∈ M satisfies c φ(x) − ∈ H, φ ∈ H−2 1 − (4.8) |x| 2 for a constant c = 0. We take a uniquely determined φ ∈ M such that −φ, V φ = 1 and V , φ > 0 so that Q = −|φV φ|. Theorem 4.8. Let |V (x)| ≤ Cx−β with β > 9/2. Assume H is of exceptional type of the first kind. Let φ ∈ M be as above. Then, in a small punctured neighbourhood 0 < |λ| < λ0 of zero, (1 + G0 (λ)V )−1 may be written in the form (1 + G0 (λ)V )−1 = I + K(λ) + aλ−1 Q,

a=

4π i , |V , φ|2

(4.9)

where x1+σ V K(λ)x1+σ is a B2 (H)–valued C 1+ρ function of λ ∈ (−λ0 , λ0 ) for some σ > 1/2 and ρ > 1/2. Proof. We may assume 9/2 < β < 5 without losing the generality. We have β − 3 < (β − 1)/2. We apply Lemma 4.7 to (4.6). We need to study C(λ) ≡ M11 (λ) − −1 (λ)M01 (λ) first. Recall D1 = (1/4π)(1 ⊗ 1). We define the operator J (λ) M10 (λ)M00 by the equation λ2 J (λ) = M(λ) − (1 + D0 V + iλD1 V )

λ = 0.

Dispersive Estimates for Schr¨odinger Equations

495 1

We have (M(λ)−(1+D0 V +iλD1 V ))φ ∈ C γ − 2 − (R, H−γ ) for any 3/2 < γ < β − 21 by virtue of (4.8) and Lemma 2.1 and it vanishes at λ = 0 along with its derivative. 5 Hence, J (λ)φ and λJ (λ)φ are, as H−γ -valued functions, respectively of class C γ − 2 − 3 and C γ − 2 − including λ = 0. It follows by choosing γ < β − 1/2 arbitrarily close to β − 1/2 that λ|V φ|1|2 M11 (λ) = (4.10) − λ2 V φ|J (λ)φ Q = −λc0 (λ)Q, 4iπ where λc0 (λ), c0 (λ) and V φ|J (λ)φ are functions respectively of class C β−1− , C β−2− and C β−3− on R. Likewise we have ˜ ψ(λ) ≡ M(λ)φ = (iD1 V + λJ (λ))φ ∈ C γ − 2 − (R, H−γ ), 3

˜ ∗ ≡ M(λ)∗ V φ = V (−iD1 V + λJ (−λ))φ ∈ C γ − 2 − (R, Hβ−γ ) (4.11) ψ(λ) 3

for any 3/2 < γ < β − 1/2. Using these functions, we may write ˜ + c0 (λ)φ) ⊗ V φ, M01 (λ) = −λ(ψ(λ) M10 (λ) = −λφ ⊗ (ψ˜ ∗ (λ) + c0 (λ)V φ)

(4.12)

−1 (λ)M01 (λ) = λ2 c1 (λ)Q, where and −M10 (λ)M00 −1 ˜ (λ)(ψ(λ) + c0 (λ)φ). c1 (λ) = ψ˜ ∗ (λ) + c0 (λ)V φ, M00

(4.13)

Then, (4.11) and Lemma 4.6 for M00 (λ)−1 to (4.13) imply that c1 (λ) ∈ C β−3− . Combining this with (4.10), we have λ|V φ|1|2 C(λ) = + λ2 c2 (λ) Q with c2 of class C β−3− , (4.14) 4iπ and C(λ)−1 exists for small 0 < |λ| < λ0 . Moreover, C −1 (λ) =

a λ

+ d(λ) Q,

d(λ) ∈ C β−3− , a =

4π i . |V , φ|2

(4.15)

It follows from Lemma 4.7 that M(λ)−1 may be written in the form (4.7) with obvious modifications. Using (4.12) and (4.15), we write −1 M01 C −1 = −(a + λd(λ))|ξ1 (λ)V φ|, −M00 −1 −C −1 M10 M00 = −(a + λd(λ))|φξ2 (λ)|, −1 −1 M00 M01 C −1 M10 M00 = −(a + λd(λ))|ξ1 (λ)ξ2 (λ)|

(4.16)

˜ + c0 (λ)φ) and ξ2 (λ) = M00 (λ)∗−1 (ψ˜ ∗ (λ) + c0 (λ)V φ) and, with ξ1 = M00 (λ)−1 (ψ(λ) by virtue of (4.11) and Lemma 4.6, x1+σ V (x)ξ1 (λ) and x1+σ ξ2 (λ) are H-valued H 1+ρ functions of |λ| < λ0 for σ and ρ such that 1 + σ, 1 + ρ < β − 3. Thus, putting the operators in (4.16), (d(λ) − 1)Q and M00 (λ)−1 − Q into K(λ), we obtain the theorem.

496

K. Yajima

We are ready to study W0l (t) when H is of exceptional type of the first kind. We choose χl ∈ C0∞ (R) such that χl is even, χ1 (λ) = 1 when |λ| < λ0 /2 and χl (λ) = 0 when |λ| ≥ λ0 . We write, using (4.9), G0 (λ)V G(λ) = G0 (λ)V (1 + G0 (λ)V )−1 G0 (λ) = G0 (λ)V G0 (λ) + G0 (λ)V K(λ)G0 (λ) + aλ−1 G0 (λ)V QG0 (λ), and insert this in the right of (4.1) to obtain W0l (t) = W1l (t) + Z( t) + Z2 (t).

(4.17)

We know that W1l (t) is regularly dispersive from the proof of Lemma 4.1. Next we consider −1 2 e−itλ χl (λ)G0 (λ)V K(λ)G0 (λ)λdλ. (4.18) Z1 (t) = iπ R Lemma 4.9. Assume β > 9/2. Then, Z1 (t) is regularly dispersive. Proof. Denote K1 (λ) = χl (λ)K(λ). Take σ, ρ > 1/2 as in Theorem 4.8. Then, x1+σ V K1 (λ)x1+σ B(H) ≤ C and, by virtue of Lemma 2.6, Z1 (t) is strongly continuous and uniformly bounded in B(H). It is also obvious that G0 (λ)V K1 (λ)G0 (λ) is C 1 as a B2 (H−σ )-valued function and, after integration by parts we obtain 1 2 e−itλ {G0 (λ)V K1 (λ)G0 (λ)} dλ. (4.19) Z1 (t) = πt R 3

Lemma 2.7 then implies Z1 (t)u∞ ≤ C|t|− 2 u1 and the lemma follows by interpolation. Finally we study the contribution from the singular part of (4.9): −a 2 e−itλ χl (λ)G0 (λ)V QG0 (λ)dλ. Z2 (t) = iπ R

(4.20)

Lemma 4.10. Let β > 9/2. Then, Z2 (t) is a strongly continuous family of uniformly bounded operators in H and its integral kernel Z2 (t, x, y) satisfies 3π 1 3 ae−i 4 i (x 2 +y 2 ) e 4t φ(x)φ(y) ≤ C min(t − 2 x−1 y−1 , t − 2 ) (4.21) Z2 (t, x, y) − √ πt for a constant C > 0. In particular, Z2 (t) satisfies 1

Z2 (t)u3,∞ ≤ Ct − 2 u 3 ,1 , 2

3

u ∈ L2 ∩ L 2 ,1 .

(4.22)

Proof. Since Z2 (t) = e−itH Pc − (t) − Z1 (t), Lemma 4.1 and Lemma 4.9 implies the first statement. The integral kernel Z2 (t, x, y) is given by V (z2 )φ(z2 )V (z1 )φ(z1 ) c(t, A) (4.23) dz1 dz2 , Z2 (t, x, y) = a 16π 2 |x − z2 ||z1 − y| R6

Dispersive Estimates for Schr¨odinger Equations

497

where A = |x − z2 | + |z1 − y| and 1 c(t, A) = iπ

3π

e

−itλ2 +iλA

R

e−i 4 ei χl (λ)dλ = √ πt

A2 4t

F e

is 2 4t

χˇ l

A 2t

.

(4.24)

1

We have |c(t, A)| ≤ χˇ l 1 (π t)− 2 , hence C |Z2 (t, x, y)| ≤ √ t

R6

1

|V (z2 )φ(z2 )| |V (z1 )φ(z1 )| Ct − 2 dz1 dz2 ≤ . |x − z2 | |z1 − y| xy

(4.25)

is 2

Estimate (4.25) implies (4.22). We prove (4.21). Since |e 4t −1| ≤ |s 2 |/4t and |χl (A/t)− 1| ≤ C|A/t|, we have 2 A ≤ Ct −1 (s 2 χˇ l L1 + |A|). F e is4t χˇ l − 1 2t If we set B = 2(|x − z2 ||z2 | + |z1 − y||z1 |) + |z1 |2 + |z2 |2 , it is easy to see that 2 2 2 |eiA /4t − ei(x +y )/4t | ≤ B/4t. It follows that 2 2 i x4t i y4t −i 3π 4 c(t, A) − e √e e ≤ C(1 + A + B)t − 23 . (4.26) πt Combine (4.23) and (4.26) and use the relation (1 + D0 V )φ = 0 and (1 + A + B)|V (z2 )φ(z2 )V (z1 )φ(z1 )| dz1 dz2 < ∞ sup |x − z2 ||z1 − y| x,y R6 which follows from |V (x)φ(x)| ≤ Cx−β−1 with β > 9/2. We see that the left side of 3 (4.21) is bounded by Ct − 2 . Estimate (4.25) and the bound |φ(x)| ≤ Cx−1 show it is 1 also bounded by Ct − 2 x−1 y−1 . We are done. Proof of Theorem 1.3 when H is exceptional type of the first kind. We recall (t) of ˜ ˜ Lemma 4.1 and define (t) = (t) + W1l (t) + Z1 (t) so that e−itH Pc = (t) + Z2 (t). − 21 ˜ ˜ By virtue of Lemma 4.9, (t) is regularly dispersive and (t)u3 ≤ Ct u 3 , in 3

2

3

particular. Since L 2 ,1 ⊂ L 2 and L3 ⊂ L3,∞ , this and (4.22) imply 1

e−itH Pc u3,∞ ≤ Ct − 2 u 3 ,1 .

(4.27)

2

We interpolate (4.27) with the L2 -bound: e−itH Pc u2,2 ≤ u2,2 . If we set 2 θ 1 = (1 − θ ) + , q 3 2

1 1 θ = (1 − θ ) + , p 3 2

0 < θ < 1,

then 2/3 < q < 2 < p < 3 with 1/p + 1/q = 1 and, using also Lp,q ⊂ Lp,p = Lp , 3 we have [L 2 ,1 , L2 ]θ,q = Lq , [L3,∞ , L2,2 ]θ,q = Lp,q ⊂ Lp (see [3], Theorem 5.3.1) and the desired estimate for this case:

1

e−itH Pc up ≤ Ct − 2 (1−θ) uq = Ct

−3

1 1 2−p

uq .

(4.28)

498

K. Yajima

We next show the estimate corresponding to (1.17): −3 21 − p1 −itH Pc − R(t) u ≤ Ct uq . e

(4.29)

p

Estimates (4.27) and |φ(x)| ≤ Cx−1 imply 1

(e−itH Pc − R(t))u3,∞ ≤ Ct − 2 u 3 ,1 .

(4.30)

2

3

By virtue of (4.21), we have (Z2 (t) − R(t))u∞ ≤ Ct − 2 u1 . Combining this with ˜ the fact that (t) = e−itH Pc − Z2 (t) is regularly dispersive, we obtain 3

(e−itH Pc − R(t))u∞ ≤ Ct − 2 u1 .

(4.31)

We interpolate (4.30) and (4.31). This time we set 1 2 θ = (1 − θ ) + , q 3 1

1 1 θ = (1 − θ ) + , p 3 ∞

0 < θ < 1,

so that 1 < q < 2/3, 3 < p < ∞ and 1/p + 1/q = 1. Then, again using Lp,q ⊂ Lp , 3 we have [L 2 ,1 , L1 ]θ,q = Lq , [L3,∞ , L∞ ]θ,q = Lp,q ⊂ Lp and

(e

−itH

Pc − R(t))up ≤ Ct

− 21 (1−θ)− 23 θ

uq = Ct

−3

1 1 2−p

uq ,

(4.32)

which is (4.29). This completes the proof of Theorem 1.3 when H is exceptional type of the first kind.

4.4. Exceptional type of the second kind. In this subsection we prove Theorem 1.3 (2) when H is of exceptional type of the second kind. In view of Lemma 4.1, we need to study W0l (t) only. As previously we begin by studying the resolvent G(λ) near λ = 0. In this case M coincides with the 0 eigenspace E of H and all φ ∈ E satisfy V , φ = 0, |φ(x)| ≤ Cx−2 , hence φ ∈ H 1 − . 2

(4.33)

Theorem 4.11. Let |V (x)| ≤ Cx−β for some β > 11/2. Assume that H is of exceptional type of the second kind and let P0 be the orthogonal projection in H onto the 0 eigenspace of H = − + V . Then there exists a constant λ0 > 0 such that for 0 < |λ| < λ0 , (1 + G0 (λ)V )−1 = I + K(λ) + λ−2 P0 V + iλ−1 P0 V D3 V P0 V ,

(4.34)

where x1+σ V K(λ)x1+σ is a B2 (H)-valued C 1+ρ function of −λ0 < λ < λ0 (including λ = 0) for some σ > 1/2 and ρ > 1/2.

Dispersive Estimates for Schr¨odinger Equations

499

Proof. Without losing generality we assume 11/2 < β < 6, which implies β − 4 < (β − 1)/2. We again apply Lemma 4.7 to (4.6) and the argument is parallel to that of the proof of Theorem 4.8. We define E2 (λ) = (iλ)−2 (M(λ) − (1 + D0 V + iλD1 V )), J4 (λ) = (iλ)−4 (M(λ) − (1 + D0 V + · · · + (iλ)3 D3 V )).

(4.35)

It follows from (4.33) (see Lemma 2.2 (2) of [9]) that 5 1 <γ <β+ ; 2 2 5 1 ∗ γ − 25 − E2 (λ) V φ ∈ C (R, Hβ−γ +1 ), <γ <β+ ; 2 2 9 1 γ − 29 − J4 (λ)φ ∈ C (R, H−γ +1 ), <γ <β+ . 2 2 5

E2 (λ)φ ∈ C γ − 2 − (R, H−γ +1 ),

(4.36)

Since (1 + D0 V + iλD1 V )Q = Q(1 + D0 V + iλD1 V ) = 0, we have M01 (λ) = (iλ)2 QE2 (λ)Q, M10 (λ) = (iλ)2 QE2 (λ)Q, M11 (λ) = (iλ)2 QE2 (λ)Q, E2 (λ) = D2 V + iλD3 V + (iλ)2 J4 (λ).

(4.37)

Take an orthonormal basis {φj } of M and its dual basis {−V φj }. Then, QJ4 (λ)Q = j,k aj k (λ)(φj ⊗ V φk ) with aj k (λ) = V φj , J4 (λ)φk and, by choosing γ arbitrarily close to β + 1/2 in the last relation of (4.36), we see that aj k (λ) are of class C β−4− . By virtue of Lemma 4.2 (2), V φj , D2 V φk |φj V φk | QD2 V Q = =− D0 V φj , D0 V φk |φj V φk | = − φj , φk |φj φk |V . 1 The matrix A = (φj , φk ) is positive definite and, if we define B = A− 2 and φ˜ k = ˜ ˜ j Bj k φj , then {φ1 , . . . , φd } becomes an orthonormal basis of M with respect to the 2 standard L inner product, and Bj2k |φj φk |V = − |φ˜ j φ˜ k |V = −P0 V . (4.38) (QD2 V Q)−1 = −

Since φ˜ j |V φk = −Bj k , we have P0 V Q = P0 V . It follows by a Neumann series expansion that M11 (λ)−1 = λ−2 P0 V (I − iλQD3 V P0 V + λ2 QJ4 (λ)P0 V )−1 Q = λ−2 P0 V + iλ−1 P0 V D3 V P0 V + QE3 (λ)Q.

(4.39)

C β−4− , Here E3 (λ) collects all remaining terms in the expansion and, as J4 (λ) is of class β−4− if we write as QE3 (λ)Q = bj k (λ)φj ⊗ V φk , bj k (λ) are also of class C . We have −1 −1 (λ)M01 (λ) = λ4 QE2 (λ)QM00 (λ)QE2 (λ)Q. M10 (λ)M00

(4.40)

500

K. Yajima

−1 Since E2 (λ)φk and E2 (λ)∗ V φj satisfy the property (4.36) and QM00 (λ)Q is a B(H−r )– δ valued C function of λ for 1/2 < δ < min(β − γ − 1/2, γ − 1/2, β − 2) by virtue of Lemma 4.6, the matrix elements

E2 (λ)∗ V φj , QM00 (λ)−1 QE2 (λ)φk of (4.40) with respect to these bases are of class C β−4 . We obtain, combining this with (4.39), that −1 (λ)M01 (λ)M11 (λ)−1 = λ2 QE5 (λ)Q M10 (λ)M00

with E5 (λ) which has C β−4− matrix elements. It follows that −1 (λ)M01 (λ) = (I − λ2 QE5 (λ)Q)M11 (λ) C(λ) = M11 (λ) − M10 (λ)M00

is invertible for λ = 0 and C −1 (λ) = M11 (λ)−1 (I − λ2 QE5 (λ)Q)−1 = λ−2 P0 V + iλ−1 P0 V D3 V P0 V + QE6 (λ)Q

(4.41)

with C β−4− function E6 (λ). From (4.36) and Lemma 4.6, it also follows that 5

5

x−β+ 2 − QM00 (λ)−1 QE2 (λ)Q, QE2 (λ)QM00 Qx 2 + ∈ C β−4− (R, B2 (H)). Then, by virtue of (4.41), we see that the operators −1 (λ)M01 (λ)C −1 (λ), −M00

−1 −C −1 (λ)M01 (λ)M00 (λ),

−1 −1 M00 (λ)M01 (λ)C −1 (λ)M01 (λ)M00 (λ) 5

(4.42)

5

are, when sandwiched by x−β+ 2 − and x 2 − from the left and the right respectively, all B2 (H)-valued C β−4− functions. Since β > 11/2, putting the operators in (4.42), QE6 (λ)Q and M00 − I into K(λ), we obtain the theorem. Now we are ready to study W0l (t): −1 2 e−itλ χl (λ)G0 (λ)V G(λ)λdλ W0l (t) = lim δ↓0 iπ |λ|>δ

(4.43)

in the case when H is an exceptional type of the second kind. We may choose the cut off function χl (λ) such that χ (λ) = 0 for λ > λ0 and χl (λ) = 1 for |λ| < λ0 /2 as previously. By virtue of (4.34), we have G0 (λ)V G(λ) = G0 (λ)V G0 (λ) + G0 (λ)V K(λ)G0 (λ) +λ−2 G0 (λ)V P0 V G0 (λ) + iλ−1 G0 (λ)V P0 V D3 V P0 V G0 (λ). (4.44) The contribution of G0 (λ)V G0 (λ) to W0l (t) is equal to W1l (t) and it is regularly dispersive. We denote the contribution from G0 (λ)V K(λ)G0 (λ) by X1 (t), which corresponds to Z1 (t) in the first case. By virtue of Theorem 4.11 xσ +1 V K(λ)xσ +1 is a B2 (H)-val3 ued H 2 + function of |λ| < λ0 . It follows by the argument used for studying Z1 (t) of the

Dispersive Estimates for Schr¨odinger Equations

501

previous subsection that X1 (t) is regularly dispersive. Let X2 (t) and X3 (t) respectively be the contributions from the fourth and the third summands: −1 2 X2 (t) = e−itλ χl (λ)G0 (λ)V P0 V D3 V P0 V G0 (λ)dλ, (4.45) π R −1 2 X3 (t) = e−itλ λ−1 χl (λ)G0 (λ)V P0 V G0 (λ)dλ. (4.46) lim iπ δ↓0 |λ|>δ A priori we know that X2 (t) + X3 (t) is a strongly continuous family of uniformly bounded operators in H: (X2 (t) + X3 (t))u2 ≤ Cu2 ,

t ∈ R,

(4.47)

as it may be written as a sum of operators which satisfy this property. Lemma 4.12. There exists C such that 1

3

X2 (t)u3,∞ ≤ Ct − 2 u 3 ,1 , u ∈ L2 ∩ L 2 ,1 , 2 −i 3π 4 3 e X2 (t)u + i √ P0 V D3 P0 u ≤ Ct − 2 u1 , u ∈ L2 ∩ L1 . πt

(4.48) (4.49)

∞

Proof. We let {φ˜ j } be an orthonomal basis of E with respect to the L2 -norm. With cj k = φ˜ j , V D3 V φ˜ k we write P 0 V D3 V P0 =

d

cj k |φ˜ j φ˜ k |,

cj k = φ˜ j |V D3 V |φ˜ k .

j,k=1

We define Wj k (t) =

−1 π

e−itλ χl (λ)G0 (λ)V |φ˜ j φ˜ k |V G0 (λ)dλ. 2

R

Notice that Wj k (t) is exactly of the same form as Z2 (t) except that a is replaced by −i and the resonance φ by the eigenfunctions φ˜ j and φ˜ k . It follows by the argument which led to (4.25) that the integral kernel Wj k (t, x, y) of Wj k (t) satisfies Wj k (t, x, y) ≤ C|t|− 21 x−1 y−1 , (4.50) which implies (4.48). It also implies 3π 3 e−i 4 i (x 2 +y 2 ) Wj k (t, x, y) + i √ e 4t φ˜ j (x)φ˜ k (y) ≤ C|t|− 2 . πt

(4.51)

−2 ˜ Here, however, as eigenfunctions decay faster than resonances and |φj (x)| ≤ Cx , i x2 we may estimate (e 4t − 1)φ˜ j (t, x) ≤ Ct −1 . It follows that

3π e−i 4 Wj k (t)u + i √ (u, φ˜ k )φ˜ j πt

3

≤ Ct − 2 u1 .

∞

Summing up (4.52) with respect to j, k, we obtain the lemma.

(4.52)

502

K. Yajima

Lemma 4.13. For φ ∈ E, a zero eigenfunction of H , define 1 dλ 2 w(t, ˜ x) = lim e−itλ χl (λ)(G0 (λ) − D0 )V φ(x) . δ↓0 iπ |λ|>δ λ

(4.53)

Then w(t, ˜ x) satisfies the following properties: 1

|w(t, ˜ x) − e

−i 3π 4

|w(t, ˜ x)| ≤ Ct − 2 x−1 , 3 µ(t, x) (D2 V φ)(x)| ≤ Ct − 2 , √ πt

where µ(t, x) is the function defined by (1.12): 1 i|x|2 iθ 2 |x|2 i µ(t, x) = (e 4t − e 4t )dθ. |x| 0 Proof. Since φ satisfies 1, V φ = 0, we may write iλ|x−y| − 1 eiλ|x| − 1 1 e (G0 (λ) − D0 )V φ(x) = − V (y)φ(y)dy. 4π |x − y| |x|

(4.54) (4.55)

(4.56)

(4.57)

We write the function inside the parenthesis under the integral sign in the form 1 iλ (eiλ(θ|x−y|+(1−θ)|x|) − ei|x−y|λθ )dθ. (4.58) (|x − y| − |x|) |x| 0 After rewriting (G0 (λ) − D0 )V φ(x) in this way, we compute the right-hand side of (4.53) by first performing the λ integral as always. If we set A = θ |x − y| + (1 − θ)|x| and B = θ |x − y|, we have 1 2 e−itλ (eiλA − eiλB )χl (λ)dλ = c(t, A) − c(t, B), iπ R where c(t, X) is defined by (4.24): 2 3π is X e−i 4 iX2 , c(t, X) = √ e 4t F e 4t χˇ l 2t πt and w(t, ˜ x) may now be written in the form 1 i (|x − y| − |x|)(c(t, A) − c(t, B))V (y)φ(y)dy dθ. 4π|x| 0

(4.59)

1

Since |c(t, X)| ≤ Ct − 2 and ||x − y| − |x|| ≤ |y|, (4.59) clearly implies 1

|w(t, ˜ x)| ≤ C|x|−1 t − 2 . However, the choice of origin is arbitrary and we obtain (4.54). Since |A2 − |x|2 | = θ |(|x − y| − |x|)(θ (|x − y| − |x|) + 2|x|)| ≤ 2|y|(|x| + |y|), the argument which leads to (4.26) implies uniformly with respect to θ , 3π 3 3 e−i 4 i|x|2 c(t, A) − √ e 4t ≤ C(|A| + s 2 χˇ 1 + |y|(|x| + |y|))t − 2 ≤ Cxy2 t − 2 . πt

Dispersive Estimates for Schr¨odinger Equations

503

Likewise, we have ||x − y|2 − |x|2 | ≤ 2|y|(|x| + |y|) and 3π 3 e−i 4 iθ 2 |x|2 c(t, B) − √ e 4t ≤ Cxy2 t − 2 . πt Note that |y3 V (y)φ(y)| ≤ Cy−β+1 is integrable by the assumption β > 11/2. It follows that w(t, ˜ x) differs from 3π

ie−i 4 √ π t|x|

1

(e

i|x|2 4t

−e

iθ 2 |x|2 4t

)dθ

0

1 4π

(|x − y| − |x|)V (y)φ(y)dy

3

by a function bounded by Ct − 2 . Here the function in the second parenthesis is equal to (D2 V φ)(x) because V , φ = 0. We have obtained (4.55). Lemma 4.14. Let µ(t) be the multiplication by µ(t, x). Then, there exists C such that 1

X3 (t)u3,∞ ≤ Ct − 2 u3/2,1 , 3π e−i 4 (µ(t)D2 V P0 + P0 V D2 µ(t)) u X3 (t)u − √ πt

(4.60) 3

≤ Ct − 2 u1 .

(4.61)

∞

Proof. Using D0 V P0 = −P0 and P0 V D0 = −P0 , which follows since the 0 eigenfunctions φ of H satisfy D0 V φ = −φ, we may write G0 (λ)V P0 V G0 (λ) = (G0 (λ) − D0 )V P0 V (G0 (λ) − D0 ) −(G0 (λ) − D0 )V P0 − P0 V (G0 (λ) − D0 ) + P0 . This produces X3 (t) = X31 (t) + X32 (t) + X33 (t) where i dλ 2 X31 (t) = lim e−itλ χl (λ)(G0 (λ) − D0 )V P0 V (G0 (λ) − D0 ) , π δ↓0 |λ|>δ λ dλ 1 2 X32 (t) = lim e−itλ χl (λ)(G0 (λ) − D0 )V P0 , iπ δ↓0 |λ|>δ λ 1 dλ 2 X33 (t) = e−itλ χl (λ)P0 V (G0 (λ) − D0 ) . lim iπ δ↓0 |λ|>δ λ

(4.62) (4.63) (4.64)

Here the contribution from P0 vanishes because e−itλ λ−1 χl (λ) is an odd function of λ. We take an orthonormal basis {φ1 , . . . , φd } of E with respect to the L2 inner product and let w˜ j (t, x) be the w(t, ˜ x) of Lemma 4.13 corresponding to φj , j = 1, . . . , d. Then the integral kernels of X32 (t) and X33 (t) are given respectively by 2

X32 (t, x, y) =

d

w˜ j (t, x)φj (y),

j =1

X33 (t, x, y) =

d

φj (x)w˜ j (t, y),

j =1

and, by virtue of Lemma 4.13, the lemma follows if we prove 1

X31 (t)u3,∞ ≤ Ct − 2 u3/2,1 ,

3

X31 (t)u∞ ≤ Ct − 2 u1 .

(4.65)

504

K. Yajima

By using (4.57) and (4.58), we write the integral kernel of X31 (t) in the following form. We define i 2 a(t, A) = e−itλ +iλA λχl (λ)dλ π R and use the short-hand notation L(x, y) = |x − y| − |x|,

ψj (x) = −V (x)φj (x), j = 1, . . . , d.

Note that |L(x, y)| ≤ |y|. If we define Ykj (t, x, y, θ, θ ) for k = 1, . . . , 4 and j = 1, . . . , d by −1 Ykj = L(x, z2 )L(y, z1 )ψj (z2 )ψj (z1 )a(t, Ak )dz1 dz2 , 16π 2 |x||y| R6 where the variables A1 , . . . , A4 inside a(t, Ak ) are respectively given by A1 = θ |x − z2 | + θ |y − z1 | + (1 − θ )|y|, A2 = θ |x − z2 | + θ |y − z1 |, A3 = θ |x − z2 | + (1 − θ )|x| + θ |y − z1 |, A4 = θ |x − z2 | + (1 − θ )|x| + θ |y − z1 | + (1 − θ )|y|, then, the integral kernel of X31 (t) may be written in the form X31 (t, x, y) =

d 4

(−1)

1 1

k 0

k=1 j =1

Ykj (t, x, y, θ, θ )dθ dθ .

(4.66)

0

Clearly |Ak | ≤ (x + z2 + z1 + y), k = 1, . . . , 4 and 1

|a(t, A)| ≤ Ct − 2 ,

3

|a(t, A)| ≤ Ct − 2 |A|,

(4.67)

by virtue of (2.5) and (2.7). It follows that

1 3 t − 2 t − 2 (x + y) |X31 (t, x, y)| ≤ C min , . |x||y| |x||y|

(4.68)

Here again the choice of the origin of coordinates is irrelevant for the estimate and we 1 1 3 3 may replace t − 2 (1/|x||y|) by Ct − 2 (1/xy) and t − 2 (x + y/|x||y|) by t − 2 (x + y/xy) in (4.68) and (4.65) follows. This completes the proof of the lemma. Proof of Theorem 1.3 when H is exceptional type of the second kind. We have shown that e−itH Pc − (X2 (t) + X3 (t)) is regular dispersive. It follows by virtue of Lemma 4.12 and Lemma 4.14, 1

e−itH Pc u3,∞ ≤ Ct − 2 u 3 ,1 , 2

1

(e−itH Pc − R(t))u3,∞ ≤ Ct − 2 u 3 ,1 , 2

(e−itH Pc − R(t))u∞ ≤ Ct

− 23

u1 .

(4.69) (4.70) (4.71)

We interpolate (4.69) with the L2 bound e−itH Pc u2 ≤ u2 and (4.70) with (4.71). The argument is virtually a repetition of the corresponding part of the previous subsection and we omit the details.

Dispersive Estimates for Schr¨odinger Equations

505

4.5. Exceptional case of the third kind. We finally consider the case when H is of exceptional type of the third kind. As usual we begin by studying M(λ)−1 = (1 + G0 (λ)V )−1 near λ = 0. We take the orthonormal (with respect to the inner product −(V u, v)) basis {φ1 , . . . , φd } of M of Lemma 4.5 in such a way that {φ2 , . . . , φd } is a basis of P0 H and such that φ1 , V > 0. The last condition determines φ1 uniquely. Define the orthogonal projections π1 onto {φ1 } and π2 onto P0 H with respect to this inner product, viz. π1 = −|φ1 V φ1 | and π2 = − dj =2 |φj V φj |, and Q0 = Q = 1 − Q, Q1 = Qπ1 Q, Q2 = Qπ2 Q. We have Q = Q1 + Q2 . As previously we write ψj = −V φj : j = 1, . . . , d. {ψj } is the basis of N = M∗ which is dual to {φj }. Lemma 4.15. As identities in H−γ , we have the following: Qj Qk (1 + D0 V )Q1 Q2 D1 V Q0 Q0 D1 V Q2

= = = =

δj k (j, k = 0, 1, 2) and Q0 + Q1 + Q2 = I, (1 + D0 V )Q2 = 0, 0, Q2 D1 V Q1 = 0, Q2 D1 V Q2 = 0, 0, Q1 D1 V Q2 = 0.

(4.72) (4.73) (4.74) (4.75)

Proof. Equations (4.72) and (4.73) are obvious. Since D1 = (1/4π )|11|, (4.74) and (4.75) follow from Q2 |1 = 0 and 1|V Q2 = 0. We first study [QM(λ)Q]−1 by using Lemma 4.7. We write QM(λ)Q in matrix form ˙ Q2 M: with respect to the decomposition M = Q1 M + Q1 M(λ)Q1 Q1 M(λ)Q2 M11 (λ) M12 (λ) QM(λ)Q = ≡ . (4.76) Q2 M(λ)Q1 Q2 M(λ)Q2 M21 (λ) M22 (λ) In what follows we assume 11/2 < β < 6 so that β − 4 < 21 (β − 1) and irrespectively denote by E(λ) various finite dimensional operator valued functions of λ which are of class C β−4− in a neighborhood of λ = 0. The function V φ1 |G0 (λ)|V φ1 is of class C β−1− because V φ1 ∈ Hβ− 1 − . Since φ1 2 satisfies (1 + D0 V )φ1 = 0 and V , φ1 = 0, it follows as in the case of the first type that with c1 ∈ C β−3− , M11 (λ) = c(λ)Q1 with c(λ) = (4πi)−1 λ|V , φ1 |2 + λ2 c1 (λ). Hence M11 (λ) is invertible for 0 < |λ| < λ0 for sufficiently small λ0 > 0 and, with a = 4π i|V , φ1 |−2 as previously, −1 M11 (λ) = (λ−1 a + d(λ))Q1 ,

d ∈ C β−3− .

(4.77)

Likewise M12 (λ) and M21 (λ) are of C β−1− and, as Q2 D1 V = D1 V Q2 = 0, M12 (λ) = −λ2 Q1 (D2 V + λE(λ))Q2 , M21 (λ) = −λ2 Q2 (D2 V + λE(λ))Q1 , −1 M21 (λ)M11 (λ)M12 (λ) = λ3 Q2 (aD2 V Q1 D2 V + λE(λ))Q2 .

(4.78)

Since V φj (x) ∈ Hβ+ 1 − for 2 ≤ j ≤ d, M22 (λ) is of class C β− and 2

M22 (λ) = −λ2 Q2 (D2 V + iλD3 V − λ2 E(λ))Q2 .

(4.79)

506

K. Yajima

Notice that M22 (λ) is what corresponds to M11 (λ) of the previous Subsect. 4.4. Hence (4.38) and (4.39) imply, with P0 being the orthogonal projection in H onto E that (Q2 D2 V Q2 )−1 = −P0 V , P0 V Q2 = P0 V and that M22 (λ)−1 = λ−2 P0 V + iλ−1 P0 V D3 V P0 V + P0 V E(λ)Q2 .

(4.80)

It follows by a Neumann series expansion that −1 C22 (λ) = M22 (λ) − M21 (λ)M11 (λ)M12 (λ) −1 = M22 (λ)(1 − M22 (λ)−1 M21 (λ)M11 (λ)M12 (λ))

is invertible and −1 (λ) = λ−2 P0 V + iλ−1 P0 V D3 V P0 V C22

+ aλ−1 P0 V D2 V Q1 D2 V P0 V + P0 V E(λ)P0 V .

(4.81)

If we set φ˜ 1 = P0 V D2 V φ1 ∈ P0 H, then P0 V D2 V Q1 D2 V P0 V = −|φ˜ 1 φ˜ 1 |V and the right side of (4.81) may be written in the form λ−2 P0 V + iλ−1 P0 V D3 V P0 V − λ−1 a|φ˜ 1 φ˜ 1 |V + P0 V E(λ)P0 V .

(4.82)

Using (4.77), (4.78), (4.81) and the definition of φ˜ 1 , we may write −1 −1 (λ)M12 (λ)C22 (λ) = −aλ−1 |φ1 φ˜ 1 |V + E(λ), −M11 −1 −1 −C22 (λ)M21 (λ)M11 (λ) = −aλ−1 |φ˜ 1 φ1 |V + E(λ), −1 −1 −1 M11 (λ)M12 (λ)C22 (λ)M21 (λ)M11 (λ) = E(λ).

(4.83)

Combining (4.77), (4.82) and (4.83) by means of Lemma 4.7, we see that (QM(λ)Q)−1 is in matrix form given modulo an E(λ) by −aλ−1 |φ1 V φ1 | −aλ−1 |φ1 V φ˜ 1 | (4.84) −aλ−1 |φ˜ 1 V φ1 | λ−2 P0 V + iλ−1 P0 V D3 V P0 V − λ−1 a|φ˜ 1 V φ˜ 1 | and, therefore, if we define the canonical resonance ϕ = φ1 − φ˜ 1 as in (1.10), ϕ still satisfies ϕ ∈ M and ϕ, V = 1, and we obtain (QM(λ)Q)−1 =

a P0 V iP0 V D3 V P0 V − |ϕϕ|V + E(λ). + λ2 λ λ

(4.85)

For studying M(λ)−1 we repeat a similar argument. We write M(λ) in the matrix ˙ M: form with respect to the decomposition H−γ = QH−γ + M(λ) =

L00 (λ) L01 (λ) QM(λ)Q QM(λ)Q ≡ , L10 (λ) L11 (λ) QM(λ)Q QM(λ)Q

where the right-hand side is the definition. By virtue of Lemma 4.6, for any 1/2 < γ < β − 1/2, A(λ) ≡ L00 (λ)−1 exists in QH−γ and of class C δ for any δ < min(β − γ − 1/2, γ −1/2, β −2) and A(λ)−Q is of Hilbert-Schmidt class. By virtue of (4.73), (4.74)

Dispersive Estimates for Schr¨odinger Equations

507

and (4.75), with respect to the decomposition Q = Q1 + Q2 , L10 (λ)L−1 00 (λ)L01 (λ) = QM(λ)A(λ)M(λ)Q may be written as

Q1 M(λ)A(λ)M(λ)Q1 Q1 M(λ)A(λ)M(λ)Q1

Q1 M(λ)A(λ)M(λ)Q2 Q2 M(λ)A(λ)M(λ)Q2

λ2 E11 (λ) λ3 E12 (λ) , λ3 E21 (λ) λ4 E22 (λ)

=

−1 is of the form where Eij are of class C β−4− . Since L−1 11 (λ) = (QM(λ)Q)

L−1 11 (λ)

λ−1 E(λ) = −1 λ E(λ)

λ−1 E(λ) λ−2 E(λ)

(4.86)

˙ 2 M, by virtue of (4.84), in the decomposition in M = Q1 M+Q −1 N (λ) ≡ L−1 11 (λ)L10 (λ)L00 (λ)L01 (λ) =

λE(λ) λE(λ)

λ2 E(λ) . λ2 E(λ)

It follows that C(λ) = L11 (λ)−L10 (λ)L−1 00 (λ)L01 (λ) = L11 (λ)(1−N (λ)) is invertible for 0 < |λ| < λ0 , −1 −1 C −1 (λ) = L−1 11 (λ) + (1 − N (λ)) N (λ)L11 (λ)

(4.87)

and (1 − N (λ))−1 N (λ)L−1 11 (λ) is of the form

λE(λ) λE(λ)

−1 λ E(λ) λ−1 E(λ)

λ2 E(λ) λ2 E(λ)

λ−1 E(λ) = E(λ). λ−2 E(λ)

(4.88)

We have L01 (λ) = QM(λ)Q = λQF1 (λ)Q1 + λ2 QF2 (λ)Q2 with F1 (λ) = λ−1 G0 (λ)V Q1 ,

F2 (λ) = λ−2 G0 (λ)V Q2 (λ).

Here λ−1 G0 (λ)V φ1 is an H−γ -valued C γ −3/2− function of λ for any 3/2 < γ < β−1/2 and, as in (4.36), λ−2 G0 (λ)V φj , 2 ≤ j ≤ d, are H−γ +1 –valued C γ −5/2− functions for any 5/2 < γ < β + 1/2. It follows by applying Lemma 4.6 for L00 (λ) respectively with γ = β − 2 − ε and with γ = β − 1 − ε with 0 < ε < 21 that A(λ)QF1 (λ)φ1 and A(λ)QF1 (λ)φj , 2 ≤ j ≤ d are B(M, H−β+2+ε )-valued C β−4 functions of λ (recall that A(λ) = L−1 00 (λ)). Combining this with (4.86), (4.87) and (4.88), we conclude that λ−1 E(λ) A(λ)L01 (λ)C −1 (λ) = λA(λ)QF1 (λ)Q1 λ2 A(λ)QF2 (λ)Q2 λ−1 E(λ)

λ−1 E(λ) λ−2 E(λ)

is a B(M, H−β+2+ε )–valued C β−4− function of λ near λ = 0. By an argument dual to the previous one, we see that β−4− C −1 (λ)L10 (λ)L−1 00 (λ) is also of class C

as a B(H−2−ε , M)–valued function of λ near the origin. Summarizing the results by using Lemma 4.7, we have shown the following theorem:

508

K. Yajima

Theorem 4.16. Suppose V satisfies |V (x)| ≤ Cx−β with 11 2 < β and H is of exceptional type of the third kind. Let ϕ be the canonical resonance and a = 4π i|V , ϕ|−2 . Then, (I + G0 (λ)V )−1 − I =

P0 V iP0 V D3 V P0 V a + − |ϕϕ|V + K(λ), (4.89) λ2 λ λ

where K(λ) is such that x1+σ V K(λ)x1+σ is a B2 (H)–valued C 1+s function of λ in a neighbourhood of λ for some σ, s > 1/2. Once Theorem 4.16 is obtained, the proof of Theorem 1.3 for the case H is an exceptional type of the third kind completed by combining the arguments in the preceding two subsections. We may safely omit the repetitious proof. 4.6. Dispersive estimates. Finally we prove Theorem 1.4. We may assume H is an 1 exceptional type of third kind. We have |ζ (t, x) − ϕ(x)| + |µ(t, x)| ≤ C min |x| t , |x| . Hence, |ζ (t, x) − ϕ(x)| + |µ(t, x)| ≤ Ct

2− q3

6

|x| q

−5

,

1 ≤ q ≤ 3/2.

Thus, if u, φ = 0 for all φ ∈ M, then, for any p > 3,

|a| −3 R(t)up ≤ √ ϕp |ζ (t) − ϕ, u| ≤ Ct πt

1 1 q −2

6

|x| q

−5

u1 .

(4.90)

For φ ∈ E, we have |D2 V φ(x)| ≤ C. It follows, since φ, V D2 µ(t)u = D2 V φ, µ(t)u, that |φ, V D2 µ(t)u| ≤ Cµ(t)u1 ≤ Ct

2− q3

6

|x| q

−5

u1 ,

φ ∈ E.

Since φ ∈ E belong to Lp for p > 3, we also have S(t)up ≤ Ct

−3

1 1 q −2

6

|x| q

−5

u1 .

(4.91)

We choose p > 3 as the dual exponent of 1 ≤ q < 3/2 and combine (4.90) and (4.91) with (1.17). We obtain (1.18). This completes the proof. References 1. Agmon, S.: Spectral properties of Schr¨odinger operators and scattering theory. Ann. Scuola Norm. Sup. Pisa Cl. Sci. 2(4), 151–218 (1975) 2. Artbazar, G., Yajima, K.: The Lp -continuity of wave operators for one dimensional Schr¨odinger operators. J. Math. Sci. Univ. Tokyo 7, 221–240 (2000) 3. Bergh, J., L¨ofstr¨om, J.: Interpolation spaces, an introduction. Berlin-Heidelberg-New York: Springer-Verlag, 1976 4. Cuccagna, S.: Stabilization of solutions to nonlinear Schr¨odinger equations. Commun. Pure Appl. Math. 54, 1110–1145 (2001) 5. Cycon, H.L., Froese, R.G., Kirsch, W., Simon, B.: Schr¨odinger operators with application to quantum mechanics and global geometry. Berlin: Springer-Verlag, 1987 6. Erdoˇgan, M.B., Schlag, W.: Dispersive estimates for Schr¨odinger operators in the presence of a resonance and/or an eigenvalue at zero energy in dimension three I. Dynamics of PDE, 1(4), 359 (2004)

Dispersive Estimates for Schr¨odinger Equations

509

7. Goldberg, M.: Dispersive bounds for the three-dimensional Schr¨odinger equation with almost critical potentials. To appear in Geom. and Funct. Anal 8. Goldberg, M., Schlag, W.: Dispersive estimates for Schr¨odinger operators in dimensions one and three. Commun. Math. Phys. 251, 157–178 (2004) 9. Galtbayar, A., Jensen, A., Yajima, K.: Local time-decay of solutions to Schr¨odinger equation with time-periodic potentials. J. Stat. Phys. 116, 231–282 (2004) 10. Jensen, A., Kato, T.: Spectral properties of Schr¨odinger operators and time-decay of the wave functions. Duke Math. J. 46, 583–611 (1979) 11. Jensen, A., Mourre, E., Perry, P.: Multiple commutator estimates and resolvent smoothness in quantum scattering theory. Ann. Inst. H. Poincar´e Phys. Th´eor. 41, 207–225 (1984) 12. Journ´e, J.-L., Soffer, A., Sogge, C.D.: Decay estimates for Schr¨odinger operators. Commun. Pure Appl. Math. 40, 573–604 (1991) 13. Kato, T.: Growth properties of solutions of the reduced wave equation with a variable coefficient. Commun. Pure Appl. Math. 12, 403–425 (1959) 14. Kato, T.: Perturbation theory for linear operators. New York: Springer Verlag, 1966 15. Kato, T.: Wave operators and similarity for some non-selfadjoint operators. Ann. Math. 162, 258–279 (1966) 16. Kato, T.: On nonlinear Schr¨odinger equations. Ann. Inst. H. Poincar´e Phys. Th´eor. 46, 113–129 (1987) 17. Keel, M., Tao, T.: End point Strichartz estimates. Am. J. Math. 120, 955–980 (1998) 18. Lions, J.L., Magenes, E.: Probl´emes aux limites non homog´enes et applications I. Paris: Dunod, 1968 19. Murata, M.: Asymptotic expansions in time for solutions of Schr¨odinger-type equations. J. Funct. Anal. 49, 10–56 (1982) 20. Nier, F., Soffer, A.: Dispersion and Strichartz estimates for some finite rank perturbations of the Laplace operator. J. Funct. Anal. 198, 511–535 (2003) 21. O’Neil, R.: Convolution operators and L(p, q) spaces. Duke Math. J. 30, 129–142 (1963) 22. Reed, M., Simon, B.: Methods of moderm mathemtical physics. Vol II, Fourier analysis, selfadjointness. New York-San Francisco-London: Academic Press, 1975 23. Reed, M., Simon, B.: Methods of moderm mathemtical physics. Vol III, Scattering theory. New York-San Francisco-London: Academic Press, 1979 24. Reed, M., Simon, B.: Methods of moderm mathemtical physics. Vol IV, Analysis of Operators. New York-San Francisco-London: Academic Press, 1978 25. Rodnianski, I., Schlag, W.: Time decay for solutions of Schr¨odinger equations with rough and time dependent potentials. Invent. Math. 155(3), 451–513 (2004) 26. Rodnianski, I., Schlag, W., Soffer, A.: Dispersive analysis of charge transfer models. To appear in CPAM 27. Schlag, W.: Dispersive estimates for Schr¨odinger operators in dimension two. Commun. Math. Phys. 257, 87–117 (2005) 28. Weder, R.: Lp -Lp estimates for the Schr¨odinger equations on the line and inverse scattering for the nonliner Schr¨odinger equation with a potential. J. Funct. Anal. 170, 37–68 (2000) 29. Yajima, K.: The W k,p -continuity of wave operators for Schr¨odinger operators. J. Math. Soc. Japan 47, 551–581 (1995) 30. Yajima, K.: The W k,p -continuity of wave operators for Schr¨odinger operators III. J. Math. Sci. Univ. Tokyo 2, 311–346 (1995) 31. Yajima, K.: Lp -boundedness of wave operators for two dimensional Schr¨odinger operators. Commun. Math. Phys. 208, 125–152 (1999) 32. Yajima, K.: Time Periodic Schr¨odinger equations, in Topics in the theory of Schr¨odinger operators, ed. H. Anaki and H. Ezawa, 9–69, World Scientific (2004) Communicated by B. Simon

Commun. Math. Phys. 259, 511–543 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1389-3

Communications in

Mathematical Physics

Spin Gromov-Witten Invariants Tyler J. Jarvis1, , Takashi Kimura2, , Arkady Vaintrob3, 1

Department of Mathematics, Brigham Young University, Provo, UT 84602, USA. E-mail: [email protected] 2 Department of Mathematics, Boston University, 111 Cummington Street, Boston, MA 02215, USA. E-mail: [email protected] 3 Department of Mathematics, University of Oregon, Eugene, OR 974003, USA. E-mail: [email protected] Received: 17 October 2003 / Accepted: 21 March 2005 Published online: 2 August 2005 – © Springer-Verlag 2005

Abstract: We define and study r-spin Gromov-Witten invariants and r-spin quantum cohomology of a projective variety V , where r ≥ 2 is an integer. The main element of the 1/r construction is the space Mg,n (V ) of r-spin maps, the stable maps into a variety V from n-pointed algebraic curves of genus g with the additional data of an r-spin structure on 1/r the curve. We prove that Mg,n (V ) is a Deligne-Mumford stack and use it to define the r-spin Gromov-Witten classes of V . We show that these classes yield a cohomological field theory (CohFT) which is isomorphic to the tensor product of the CohFT associated to the usual Gromov-Witten invariants of V and the r-spin CohFT. Restricting to genus zero, we obtain the notion of an r-spin quantum cohomology of V , whose Frobenius structure is isomorphic to the tensor product of the Frobenius manifolds corresponding to the quantum cohomology of V and the r th Gelfand-Dickey hierarchy (or, equivalently, the Ar−1 singularity). We also prove a generalization of the descent property which, in particular, explains the appearance of the ψ classes in the definition of gravitational descendants. 0. Introduction In this paper, we present a generalization of the theory of quantum cohomology and Gromov-Witten invariants arising from algebraic curves with higher spin structures. Recall that the construction of the ordinary Gromov-Witten invariants of a projective variety V is based on the moduli spaces Mg,n (V ) of stable maps to V . The space Mg,n (V ) is a Deligne-Mumford stack compactifying the space of holomorphic maps to V from Riemann surfaces of genus g with n marked points. In particular, the moduli space of stable maps to a point coincides with the moduli of stable curves Mg,n .

Research of the first author was partially supported by NSA grant number MDA904-99-1-0039 Research of the second author was partially supported by NSF grant number DMS-9803427 Research of the third author was partially supported by NSF grant DMS-0104397

512

T. J. Jarvis, T. Kimura, A. Vaintrob

Although the space Mg,n (V ) is not smooth in general, it has a virtual fundamental class [Mg,n (V )]virt which plays the role of the usual fundamental class in intersection theory. It gives rise to the collection of Gromov-Witten classes ∗ Vg,n ∈ H • (Mg,n ) ⊗ H • (V )⊗n , defined as Vg,n (γ1 , . . . , γn ) := st∗ ev1∗ γ1 . . . evn∗ γn ∩ [Mg,n (V )]virt , where γj ∈ H • (V ), st : Mg,n (V ) → Mg,n is the stabilization (forgetting the target) map, and evj : Mg,n (V ) → V is the evaluation of the stable map at the j th marked point. These classes behave nicely when restricted to the boundary strata of Mg,n . This allows one to define a collection of multilinear operations on the space H • (V ), parametrized by elements of H• (Mg,n ). These operations satisfy the axioms of a cohomological field theory (CohFT) in the sense of Kontsevich-Manin [19]. In particular, their restriction to stable maps of genus zero endows H • (V ) with the structure of a (formal) Frobenius manifold [7, 10, 22], called the quantum cohomology of V , whose multiplication is a deformation of the usual cup product in H • (V ). The diagonal map Mg,n → Mg,n × Mg,n induces an operation of a tensor product in the category of CohFTs. Behrend [4] proved that despite the fact that the space Mg,n (V × V ) is not isomorphic to the product Mg,n (V ) ×Mg,n Mg,n (V ), the tensor

product of the CohFTs associated to Mg,n (V ) and Mg,n (V ) is the CohFT associated to Mg,n (V × V ). Restricting to genus zero, this gives a K¨unneth formula for quantum cohomology. In [16], we introduced a new of CohFTs, one for each integer r ≥ 2, based class1/r,m 1/r on the moduli space Mg,n = Mg,n of higher spin curves, constructed in [11]. m 1/r,m

Recall that for m = (m1 , . . . , mn ), with mi ∈ Z, the moduli space Mg,n is a compactification of the space of Riemann surfaces of genus g with n marked points p1 , . . . , pn and an r th root of the twisted canonical line bundle ω ⊗ O(− i mi pi ). This CohFT has rank r − 1 and is called an r-spin CohFT. The construction of an r-spin CohFT in [16] is based on a choice of a special cohomology class c1/r (called a spin vir1/r tual class) in H • (Mg,n ), satisfying certain axioms. These axioms are similar to the Behrend-Manin axioms [5] for the virtual fundamental class. As in the case of the CohFT based on ordinary stable maps, the r-spin CohFT a priori may depend on a choice of the spin virtual class c1/r . Currently, two different constructions of a spin virtual class on 1/r Mg,n are known: an algebro-geometric construction of [27], resembling the algebraic construction of the virtual fundamental class, and the analytic construction of [24] based on Witten’s original idea [29]. While it is not known yet whether these constructions give the same class for all g and r, they agree when g = 0 (and any r) or r = 2 (and any g). This r-spin CohFT is related to the work of Witten [29], who conjectured that a 1/r,m generating function of certain intersection numbers on Mg,n is a τ function of the r th Gelfand-Dickey (or KdVr ) hierarchy. When r = 2, this conjecture reduces to an earlier conjecture of Witten’s on the intersection numbers of Mg,n , which was proved

Spin Gromov-Witten Invariants

513

by Kontsevich [18]. In [16], following Witten’s ideas, we constructed the spin virtual class and proved the conjecture in the cases g = 0 (for all r) and r = 2 (for all g). 1/r In [17], it was proven that the tensor product of the CohFTs associated to Mg,n and 1/r

Mg,n is realized by the moduli stack of curves endowed with both an r and an r spin structure. More generally, moduli stacks of stable curves endowed with multiple spin structures provide an intersection-theoretic realization of the tensor products of spin CohFTs. The goal of this paper is to complete this picture by introducing and studying moduli spaces that give an intersection-theoretic realization of the tensor product of the Gromov-Witten CohFT and the r-spin CohFT. 1/r We construct Mg,n (V ), the stack of stable r-spin maps into a projective variety V — objects which combine both the data of a stable map and an r-spin structure. We prove 1/r that Mg,n (V ) is a Deligne-Mumford stack and a ramified cover of Mg,n (V ). Similar to the case of ordinary stable maps, stable r-spin maps to a point are just stable r-spin curves. 1/r We introduce the spin virtual class c˜1/r ∈ H • (Mg,n (V )), which is an analog of a 1/r

class c1/r on Mg,n . Using the class c˜1/r and the virtual fundamental class of Mg,n (V ), we define the r-spin Gromov-Witten classes ∗ 1/r ,r) • (V ,r) ⊗n (V ∈ H (M (V )) ⊗ H , g,n g,n where H(V ,r) = H • (V ) ⊗ H(r) , and H(r) is the state space of the r-spin CohFT. We prove that these spin Gromov-Witten classes give rise to a CohFT with the state space H(V ,r) , which is isomorphic to the tensor product of the Gromov-Witten CohFT and the r-spin CohFT. As with Behrend’s theorem, this result is not trivial because the space 1/r 1/r Mg,n (V ) is not isomorphic to the fiber product Mg,n (V ) ×Mg,n Mg,n . Restricting to genus zero, we obtain that the r-spin quantum cohomology of V is the tensor product of its ordinary quantum cohomology with the Frobenius manifold associated to KdVr (or equivalently, to the Ar−1 singularity). It is worth observing that our spin Gromov-Witten invariants have a physical interpretation. They may be regarded as the correlators in a theory of topological gravity coupled to topological matter, where the matter sector of the theory is the topological sigma model with target space V coupled with a certain type of gauged SU (2)r−2 /U (1) Wess-Zumino-Witten model. It would be very interesting to find an enumerative interpretation of these invariants similar to the interpretation of the ordinary Gromov-Witten invariants. Structure of the paper. We will now give a more detailed description of the structure of the paper and of our results. After a brief review in the first section of the ideas of r-spin structures (in order to set notation that will be necessary thereafter), we begin in the second section by setting up the geometric framework for the rest of the paper. We introduce stable spin maps and 1/r the stack Mg,n (V ) of such maps. We prove that it is a Deligne-Mumford stack, and we establish important properties of its associated morphisms. The proof of Theorem 2.2.1, that the stabilization map is truly a morphism of stacks, is rather involved and concludes the second section.

514

T. J. Jarvis, T. Kimura, A. Vaintrob 1/r

In the third section, we introduce cohomology classes on Mg,n (V ), especially the 1/r

spin virtual class c˜1/r on Mg,n (V ). The class c˜1/r is defined by pulling back c1/r from 1/r

Mg,n when 2g − 2 + n > 0, and it is defined by a direct construction for other values of g and n. We establish some properties of the spin virtual class and relate it, in genus zero, to the top Chern class of the top cohomology R 1 π∗ Er of the r-spin structure bundle. At the end of this section we prove a key theorem (Theorem 3.3.1) on the decomposition of 1/r,m classes pushed down from Mg,n (V , β) to Mg,n . This theorem is the main ingredient in proving Theorem 4.3.2, that the CohFT arising from r-spin maps is a tensor product. In the fourth section, using the class c˜1/r and the virtual fundamental class of (V ,r) Mg,n (V ), we define the r-spin Gromov-Witten classes g,n . We show that these classes give rise to a CohFT which has state space H(V ,r) , and which is isomorphic (in the stable range) to the tensor product of the Gromov-Witten CohFT and the r-spin CohFT. We prove that these classes satisfy properties analogous to Gromov-Witten classes, and we show that even in the unstable range, at least for the small phase space, all the correlators of the stable-spin-maps CohFT are determined by the usual r-spin CohFT and the Gromov-Witten invariants. Finally, we verify that the descent axiom of [15] holds for these correlators in the genus zero case. This gives a new and interesting geometric description of gravitational descendants not only in the r-spin theory, but also in the case of usual Gromov-Witten invariants, since these correspond to the special case of r = 2, as described in Sect. 5. In the fifth and last section, we discuss a number of special cases. We first prove that when r = 2, the stable spin maps CohFT reduces to the usual Gromov-Witten invariants of V . We then examine the case of genus zero and degree zero, and conclude with the 1/3 calculation of the small phase space potential function associated to M0,n (P1 ). 1. Review of Spin Structures For the remainder of the paper we fix an integer r ≥ 2. For the reader’s convenience and to establish notation, we briefly review the definitions of an r-spin structure given in [11]. Our notation here is somewhat improved over that of [11].

1.1. Overview. Although the concept of an r-spin structure is intuitively simple, its formal definition is somewhat technical. For that reason we first give a brief overview of the ideas involved. Intuitively, an r-spin structure on a smooth, n-pointed curve (X, p1 , . . . , pn ) is just a choice of a line bundle L on X, together with an isomorphism mi pi b : L⊗r - ωX − to the canonical dualizing sheaf of X with zeros of order m = (m1 , . . . , mn ), for some n-tuple of non-negative integers m. For degree reasons, an r-spin structure of type m exists on a genus g curve X only if 2g − 2 − mi is divisible by r. If we want to compactify the spaces involved by considering stable maps and prestable curves, the preceding, intuitive definition of an r-spin structure is insufficient. In particular, we must replace line bundles by rank-one torsion-free sheaves and allow the

Spin Gromov-Witten Invariants

515

homomorphism b : L⊗r → ωX (− mi pi ) to have non-trivial cokernel at the nodes of the curve. Alternatively, one may continue to use invertible sheaves, but allow the source curves to be stacks (“twisted” nodal curves, or orbicurves) as in [1]. The two approaches are completely equivalent and give isomorphic compactifications. Although the orbicurves approach is more attractive notationally, the approach based on torsion-free sheaves is more explicit and is closer to the original physically motivated constructions of [29]. Also, it is better suited to the treatment of the descent axiom, a generalization of which we prove in Subsect. 4.6. This generalized descent property gives a nice geometric explanation for the appearance of ψ classes in the usual (non-spin) Gromov-Witten theory. Therefore, we feel that the approach based on torsion-free sheaves is more suitable for the purposes of this paper. There are two very different types of behavior of the torsion-free sheaf L near a node q ∈ X. When it is still locally free, the sheaf L is said to be Ramond at the node q. If the sheaf L is not locally free at q, it is called Neveu-Schwarz. (In the twisted curve formulation, the Ramond case corresponds to a trivial stack structure at the point in question, while the Neveu-Schwarz case corresponds to a non-trivial stack structure at that point.) Although in the Ramond case, the homomorphism b remains an isomorphism near the node q, in the Neveu-Schwarz case it cannot be an isomorphism. The local structure of the sheaf L near a Neveu-Schwarz node can be described as follows. Near the node q, the structure sheaf OX is generated by two functions x and y, such dy that xy = 0. The sheaf ωX (− mi pi ) is locally generated by dx x = − y . Near q the sheaf L is generated by two elements + and − , supported on the x and y branches respectively (that is, x − = y + = may be chosen so that the 0). The two generators m+ +1 ( dx ) = x m+ dx and ⊗r homomorphism b : L⊗r → ωX (− mi pi ) takes ⊗r to x + − x m− dy, where (m + 1) + (m + 1) = r. to y m− +1 ( dy ) = y + − y Definition 1.1.1. We call m+ (respectively m− ) the order of the spin structure along the x-branch (respectively y-branch). 1.2. Formal definitions. Definition 1.2.1. For any integer r > 1 and for any n-tuple of integers m = (m1 , . . . , mn ) such that r divides 2g − 2 − mi , an r-spin structure of type m on a family X/T of n-pointed prestable curves is a coherent net of r th roots of ωX/T of type m. Recall that, by Definition 2.3.4 of [11], a coherent net of r th roots of ωX/T of type m is a set of rank-one, torsion-free OX -modules {Ed } for every positive d|r, and a ⊗d/d collection of OX -module homomorphisms {cd,d : Ed → Ed }, defined for every positive d |d|r, such that for each geometric fiber Xt of X/T , the sheaves {Ed } and homomorphisms {cd,d } induce a coherent net of r th roots of ωXt of type m, and each homomorphism cd,d is an isomorphism on the locus in X/T , where Ed is locally free. That is, • E1 = ωXt , and cd,d = 1d is the identity map, for every positive d dividing r. • For each divisor d of r and each divisor d of d, we require: – For every point p ∈ Xt , where Ed is not free, the length of the cokernel of cd,d at p is (d/d ) − 1.

516

T. J. Jarvis, T. Kimura, A. Vaintrob

– d · deg Ed = d · (deg Ed − mi ), where m = (m1 , . . . , mn ) is the reduction of m modulo d/d (i.e. 0 ≤ mi < d/d and mi ≡ mi (mod d/d )). • The homomorphisms {cd,d } are compatible. That is, the diagram

commutes for every d |d |d|r. Finally, recall that these sheaves and homomorphisms must have a special type of local structure. The details of these conditions, while rather technical, are important for the proof of Theorem 2.2.1. We review them briefly here, for the reader’s convenience and for purposes of fixing notation. For a node q ∈ Xt in a fiber of X/T over a geometric point t ∈ T , we denote by md,+ and md,− the orders of the d th root map cd,1 : Ed⊗d → ω − mi pi on the branches of the normalization of Xt at q. We define. ud := (md,+ + 1)/ d

and

vd := (md,− + 1)/ d ,

where d := gcd(md,+ + 1, md,− + 1). If cd,1 is an isomorphism at q, we set ud = vd = 0. The first requirement on the local structure of a net of coherent roots on a family X/T is the existence of a special local coordinate system near any node q where cr,1 is not an isomorphism (i.e., Er is Neveu-Schwarz at q). This local coordinate system consists of an e´ tale neighborhood T of t with an element τ ∈ OT ,t , and an e´ tale neighborhood U of q in X ×T T with sections x, y ∈ OU , such that for s := ur + vr we have • xy = τ s . • The ideal generated by x and y has the singular locus of X/T as its associated closed subscheme. • The homomorphism OT ,t [x, y]/(xy − τ s ) → OU,q induces an isomorphism of the completions ∼- ˆ Oˆ T ,t [[x, y]]/(xy − τ s ) OU,q . The second requirement on the local structure is that the sheaves Ed must have a special presentation in terms of this special coordinate system. In particular, any rankone, torsion-free sheaf F always has a presentation of the form F∼ = ζ1 , ζ2 |eζ1 = xζ2 , yζ1 = hζ2

Spin Gromov-Witten Invariants

517

for some e and h in OT ,t , such that eh = τ s ; but for sheaves in the net we require that if Ed is not locally free at the node q, then Ed must have such a presentation with e = τ (r/d)(vd d ) and h = τ (r/d)(ud d ) . In other words, Ed is isomorphic near the node q to the sheaf Ed := ζ1 , ζ2 |τ (r/d)(vd d ) ζ1 = xζ2 , yζ1 = τ (r/d)(ud d ) ζ2 . If Ed is locally free at q, then for uniformity of notation we will use the unusual presentation Ed ∼ = Ed := ζ1 , ζ2 |ζ1 = ζ2 . Finally, each homomorphism ⊗d cdj,j : Edj

- Ej

in the net must be a so-called power map, in the sense of Definition 2.3.1 of [11]. This means that, if we use the local presentations Edj = ξ1 , ξ2 |τ (r/(dj ))(vdj dj ) ξ1 = xξ2 , yξ1 = τ (r/(dj ))(udj dj ) ξ2 , and Ej = ζ1 , ζ2 |τ (r/j )(vj j ) ζ1 = xζ2 , yζ1 = τ (r/j )(uj j ) ζ2 , of the sheaves Edj and Ej , then the map Symd (Edj ) → Ej ,

(1)

induced by the homomorphism cdj,j , acts on the generators ξ1d−i ξ2i of Symd (Edj ) as

ξ1d−i ξ2i

x u −i τ iv ζ1

→ y v −d+i τ (d−i)v ζ2

if 0 ≤ i ≤ u if u < i ≤ d.

(2)

Here we require that uj ≡ udj d (mod s) and vj ≡ vdj d (mod s), and we define u := (udj d − uj )/s and v := (vdj d − vj )/s. If Edj is locally free at q, then the existence of a good presentation is automatically satisfied, and we have no additional power map requirement except that the map (1) be an isomorphism.

2. Stable Spin Maps In this section we introduce stable r-spin maps and begin to study their moduli stack.

2.1. Definitions. Definition 2.1.1. Let r ≥ 2 be an integer, and let n and g be non-negative integers. Let V be an algebraic variety, and let β be a class in H2 (V , Z). Finally, let m = (m1 , m2 , . . . , mn ) be an n-tuple of integers such that r divides 2g − 2 − mi . A family

518

T. J. Jarvis, T. Kimura, A. Vaintrob

of stable, n-pointed, r-spin maps into V of genus g, type m, and class β is a pair (f, ({Ed }, {cd,d })), consisting of a family of stable n-pointed genus g maps f : X/T → V of class β, and an r-spin structure ({Ed }, {cd,d }) of type m on X/T . Example 2.1.2. If V is a point, then any stable, n-pointed, r-spin map into V is just a stable r-spin curve. f Definition 2.1.3. An isomorphism from an r-spin map (X V , p1 , . . . , f pn , ({Ed }, {cd,d })) to another (X V , p1 , . . . , pn , ({Ed }, {cd,d })) of the same type m consists of an isomorphism τ of n-pointed, stable maps

τ - X

X

f

f

? ? V ======= V ∼ and a set of OX -module isomorphisms {θd : τ ∗ Ed - Ed }, with θ1 being the canonical ∼ isomorphism τ ∗ ωX (− i mi p i ) - ωX (− mi pi ), and such that the homomor phisms θd are compatible with all the maps cd,d and τ ∗ cd,d .

Definition 2.1.4. Let V be an algebraic variety over C, and β an element of H2 (V , Z). The stack of stable r-spin maps to V (n-pointed, of genus g, and class β) is the disjoint union 1/r 1/r,m Mg,n (V , β) := Mg,n (V , β) m 0≤mi
of stacks Mg,n (V , β) of (families of) stable n-pointed r-spin maps to V of genus g, type m = (m1 , . . . , mn ), and class β. 1/r,m

1/r

We will see in Sect. 2.3 that Mg,n (V , β) (and, therefore, Mg,n (V , β)) is a Deligne-Mumford stack whenever Mg,n (V , β) is. As in the special case of V = pt, no information is lost by restricting m to the range 0 ≤ mi ≤ r − 1. 1/r,m

Proposition 2.1.5. If m ≡ m (mod r), then Mg,n (V , β) is canonically isomorphic 1/r,m

to Mg,n

(V , β).

Proof. When m ≡ m (mod r), every r-spin structure of type m naturally gives an r-spin structure of type m simply by Er → Er ⊗ O

mi − m i pi . r

Spin Gromov-Witten Invariants

519 1/r,m

2.2. Fundamental morphisms of stacks of stable spin maps. The stack Mg,n (V , β) has a natural projection 1/r,m

p˜ : Mg,n (V , β)

- Mg,n (V , β)

(3)

which forgets the spin structure. The usual evaluation maps evi : Mg,n (V , β) → V , which send a point [X ation maps

f-

V , p1 . . . pn ] ∈ Mg,n (V , β) to f (pi ) ∈ V , induce evalu1/r,m

ev ˜ i = evi ◦ p˜ : Mg,n (V , β)

- V.

Less obvious is the fact that for any morphism s : V → V taking β to β , we have a stabilization morphism 1/r,m

st˜ : Mg,n (V , β)

- M1/r,m (V , β ) g,n

(4)

which takes f to f := s ◦ f and contracts components of the source curve that are unstable with respect to f . Theorem 2.2.1. For any morphism V → V , taking β to β , the stabilization map (4) is a morphism of stacks. The proof of Theorem 2.2.1, which is rather intricate, will be given in Subsect. 2.4. The various canonical maps introduced above are shown in the following commutative diagram.

We will use the notation of this diagram throughout the remainder of the paper, and we will denote the composition q2 ◦ q1 by q. 1/r,m 1/r,m The universal curves Cg,n → Mg,n and Cg,n → Mg,n (V , β) will be denoted by π.

520

T. J. Jarvis, T. Kimura, A. Vaintrob 1/r,m

Remark 2.2.2. The stack Mg,n (V , β) is not isomorphic to the fibered product 1/r,m

Mg,n

×Mg,n Mg,n (V , β), although on the smooth locus the map 1/r,m

q1 : Mg,n (V , β)

- M1/r,m g,n ×Mg,n Mg,n (V , β)

is an isomorphism when g and n are in the stable range (2g − 2 + n > 0). The isomorphism for the smooth locus is straightforward: If X/T is a smooth family of curves, then a stable map f : X → V and an r-spin structure ({Ed }, {cd,d }) are precisely the data necessary to construct an r-spin map, i.e., there is a canonical morphism 1/r,m

j : Mg,n

1/r,m

×Mg,n Mg,n (V , β) → Mg,n (V , β)

which is clearly the inverse of the morphism q1 . But when the curve X is not stable, this morphism j no longer exists. For example, let X be a prestable curve that has two irreducible components C and E, where C is a smooth curve of genus g, and E is a smooth, rational curve, without marked points, joined to C at a single node q. Let f : X → V be an embedding of X in V . An r-spin structure ({Ed }, {cd,d }) on X is equivalent to a pair of r-spin structures ({Ed }, {cd,d }) on C and ({Ed }, {cd,d }) on E of orders 0 and r −2, respectively, at q. Thus the automorphism group of the r-spin map (f, ({Ed }, {cd,d })) is µr × µr , corresponding to multiplication of Er and Er by r th roots of unity. But the stabilization map st˜ takes (f, ({Ed }, {cd,d })) to the spin map (f |C , {Ed }, {cd,d }) on C, and the automorphism group of ˜ p(f, ˜ ({Ed }, {cd,d })) × st(f, ({Ed }, {cd,d })) = (f |C , ({Ed }), {cd,d })

is simply µr , since C is irreducible and Er is invertible on C. Thus the morphism q1 is not an isomorphism. Proposition 2.2.3. The morphism q1 is flat and proper. Proof. Flatness follows from the valuative criterion of flatness [8, 11.8.1], which states that it is enough to check flatness of q1 over each R-valued point 1/r,m

Spec R → Mg,n

×Mg,n Mg,n (V , β),

where R is a discrete valuation ring. Since the completion Rˆ of R is faithfully flat over R, it suffices to check this for each complete discrete valuation ring. But in this case, the results of [11] show that the universal deformation (relative to the universal stable map f : C → V ) of a spin structure over the central fiber of Spec R corresponds to the ring homomorphism R → R[t]/(t d − s), for some positive d dividing r, and where s ∈ R is a uniformizing parameter for R. In particular, R[t]/(t d − s) is a free R-module, and thus is flat over R. Since the universal deformation is faithfully flat (actually, e´ tale) over 1/r,m Mg,n (V , β), this shows that q1 is also flat. Properness also follows by the valuative criterion in exactly the same manner as was proved in [12] for spin structures on stable curves. Nothing in that proof required the underlying curves to be stable—only prestable.

Spin Gromov-Witten Invariants

521

2.3. The algebraic nature of the stack of stable spin maps. A useful notion in dealing with stacks is the idea of a Deligne-Mumford morphism, or morphism of Deligne-Mumford type. This is analogous to the concept of a representable morphism. Definition 2.3.1. A morphism of stacks f : S → T is called Deligne-Mumford (or of Deligne-Mumford type) if for every representable U and every U -valued point U → T , the fibered product S ×T U is a Deligne-Mumford stack. The most useful fact about these morphisms is that if S → T is a Deligne-Mumford morphism, and if T is a Deligne-Mumford stack, then S is a Deligne-Mumford stack (see [14, Prop. 3.1.3]). Theorem 2.3.2. For all V and β, the forgetful morphism (Eq. (3)) is a finite (meaning proper and quasi-finite, but not necessarily representable) Deligne-Mumford mor1/r,m phism of stacks. In particular, Mg,n (V , β) is a Deligne-Mumford stack whenever Mg,n (V , β) is. Proof. Given a T -valued point T → Mg,n (V , β) for a representable T , we must show that the stack 1/r,m

R(X/T ) := Mg,n (V , β) ×Mg,n (V ,β) T

of coherent nets of r th roots of ωX (− mi pi ) on the associated family X/T of prestable curves is a Deligne-Mumford stack, finite over T . In particular, we need to construct a smooth cover of R(X/T ) and show that the diagonal

: R(X/T ) ×T R(X/T )

- R(X/T )

is representable, unramified, and proper. These facts are all straightforward generalizations of their counterparts over the stack Mg,n of stable curves as described in [11]. The only real difference is that we are now working with a specific family of prestable curves over T , as opposed to working with the universal family of stable curves (over Mg,n ), but that changes nothing of substance in the proof. - T is also an easy generalization of the The proof of properness of R(X/T ) case of stable r-spin curves, and the morphism is obviously quasi-finite. 2.4. Proof that stabilization is a morphism. We now turn to the proof of Theorem 2.2.1, that for any morphism s : V → V , taking β ∈ H2 (V , Z) to β ∈ H2 (V , Z), the stabilization map st˜ (4) is a morphism of stacks. It is straightforward to check that the stabilization of the underlying curves preserves r-spin structures on each individual fiber, but we must also show that the stabilization morphism on the underlying curves preserves the r-spin structure in families. Theorem 2.2.1 obviously follows from the following lemma. ˜ Lemma 2.4.1. Let st : X/T → X/T be a morphism taking a family of n-pointed pre˜ ˜ and let ({E˜d }, {c˜d,d }) stable curves X/T to an n-pointed partial stabilization X of X, ˜ be an r-spin structure of type m = (m1 , . . . , mn ) on X, with 0 ≤ mi ≤ r − 1 for every i. In this case, the sheaf R 1 st∗ E˜d is zero for every d|r, and the push-forward ({st∗ E˜d }, {st∗ c˜d,d }) is an r-spin structure of type m on X.

522

T. J. Jarvis, T. Kimura, A. Vaintrob

Proof. As mentioned above, it is straightforward to check that the maps st∗ c˜d,d and the sheaves st∗ E˜d are T -flat and produce an r-spin structure of type m on each fiber of X/T (this will also follow from the computations below). Thus we only need to verify that R 1 st∗ E˜ = 0 (which implies that this construction commutes with base change), and that the maps and sheaves meet the local conditions outlined in Subsect. 1.2 for being a coherent net on the family of curves X/T , provided the original sheaves {E˜d } and maps ˜ . {c˜d,d } form a coherent net on the family X/T Let us fix a point p of a geometric fiber Xt of X/T . There are three cases to consider. First is the case when the point p is not the image of a contracted component (i.e., st −1 (p) is a single point). Second is the case when p is a smooth point of the fiber Xt , ˜ ; that is, st but p is the image of a whole irreducible component of the fiber X˜ t of X/T contracts a −1-curve to the point p. Third is the case that p is a node of the fiber Xt containing it, and it is the image of a contracted component of X˜ t ; that is, p is the image ˜ of a −2-curve E. Case 1. The first case is easy, since when st −1 (p) is a single point, then st is an isomorphism in a neighborhood of p (or of st −1 (p)). In particular, st∗ is an isomorphism, R 1 st∗ E˜d = 0, and cd,d = st∗ (c˜d,d ) is a d/d th power map near p. The second and third cases are more involved. Before we attack them, we note that the conditions we must verify are local (and analytic) on the base T , so it suffices to check the result when T is affine and is the spectrum of a complete local ring R. Moreover, the conditions are analytic on X; that is, the conditions are all determined by restricting to the completion of the local ring of X near the point p. To simplify, we will make the calculations in the case of d = r, but all other values of d (dividing r) are similar. Case 2. In the second case (st contracts a −1-curve of X˜ to the point p) we will show that the induced sheaves st∗ E˜d are locally free at p, and the maps cd,d are all isomorphisms; thus the local coordinate and power map conditions are automatically fulfilled. The fiber X˜ t over Xt has one irreducible component C lying over p, and C contains at most one marked point p , labeled with an integer m, where 0 ≤ m ≤ r − 1. This is indicated in Fig. 1. On C, the sheaf (E˜r /torsion)⊗r is isomorphic to ωC (−m+ q + − mp), where q + is the point of C which maps to the node q attaching C to the rest of X˜ t . Moreover, r must divide 2gC − 2 − m+ − m, so either m = r − 1, which implies that E˜r is locally free (Ramond) near q, or r − 2 = m+ + m, which implies that E˜r is not locally free (it is Neveu-Schwarz) at q. In either case, E˜r |C has degree −1 and thus has no global sections. Also R 1 st∗ E˜r = 0, since this is true on each fiber. Now, in the Neveu-Schwarz case, the sheaf st∗ E˜r |Xt is simply the sheaf E˜r restricted ˜ ˜ ˜ ˜ (modulo torsion) to the rest of the prestable fiber Y = (Xt − C). But Er /torsion on Y is an r th root of ωY˜ (−m− q − − pi =p mi pi ), where q − is the other side of the node defined by q + . The actual value of m− is determined by the relation m+ + m− = r − 2, which implies that m− = m. In the Ramond case, the vanishing of the global sections of E˜r |C implies that st∗ (E˜r |X˜ t ) ˜ is Er |Y˜ ⊗ E(−q − ), so it is an r th root of ωXt (−(r − 1)p). In both the Ramond and Neveu-Schwarz cases, the new marked point p = st (q − ) of Xt is labeled with m, just as the old marked point p was labeled with m on X˜ t . If no point was marked on C, then the point p remains unmarked (and m− = 0). Finally, st∗ E˜d is T -flat and R 1 st∗ E˜r vanishes, so we have that st∗ E˜r commutes with base change, and the calculations above on the fibers all hold globally on the family

Spin Gromov-Witten Invariants

523

C p’ ~ = ~ Normalization of X C Y t q+ ~ Y

C qp’

normalization

~ ~ Xt = C Y ~ Y

q

st ~~ Xt = Y p Fig. 1. A depiction of Case 2 of Lemma 2.4.1: fibers X˜ t and Xt , the stabilization map st : X˜ t → Xt , and the normalization of X˜ t . The morphism st contracts the unstable component C to the point p and induces an isomorphism from the rest of the curve Y˜ to Xt

X/T . Thus st∗ Er is invertible near p, and st∗ cr,1 is an isomorphism near p. In particular, st∗ cr,1 is an r-power map. A similar argument holds for each E˜d and each c˜d,d near p. ˜ Case 3. The third case is that of a point p ∈ X which is the image of a −2-curve C˜ of X. Just as in Case 2, it is easy to see that on the unstable (contracted) −2-curve, the degree of the bundle is −1. Also, we have R 1 st∗ E˜r = 0; the sheaf st∗ E˜r is T -flat and commutes with base change; and on the fibers, the induced collection of sheaves and bundles forms an r-spin structure of type m. We still must check that the induced sheaves have the necessary family structure for spin curves (existence of a local coordinate of suitable type, with respect to which the sheaves have the standard presentation—see Definition 1.2.1), and that the induced maps are power maps, as described in Eq. (1). For simplicity we will assume that the orders m+ , m− , m+ , and m− of the r-spin map c˜r,1 along the two nodes q and q , where the −2-curve intersects the rest of the fiber have the property that gcd(m+ + 1, m− + 1) = 1 = gcd(m+ + 1, m− + 1). The case with common divisors larger than 1 is similar. It is shown in [12, §3.1] that X˜ is locally isomorphic to ProjA A[µ, ν]/(xν − er µ, hr ν − µy), where A = Oˆ X,x ∼ = R[[x, y]]/xy − π r , and e, h and π are elements of the maximal ideal mR of R with eh = π. This shows the existence of the special local coordinate.

524

T. J. Jarvis, T. Kimura, A. Vaintrob

We next show that st∗ Ed has a presentation of the form st∗ Ed ∼ = ζ1 , ζ2 |π (r/d)(vd d ) ζ1 = xζ2 , yζ1 = π (r/d)(ud d ) ζ2 . If we let µ/ν = s and ν/µ = z, then near the exceptional −2-curve C˜ the curve X˜ is covered by two open sets, U = {µ = 0} ∼ = Spec A[z]/(xz − er , y − hr z) and V = {ν = 0} ∼ = Spec A[s]/(x − er s, ys − hr ). ∼ Since ({E˜d }, {cd,d }) is an r-spin structure, we can describe E˜r on U by E˜r |U = v u u v u v ∼ ˜ EU (e , e ) := ζ1 , ζ2 |zζ2 = e ζ1 , xζ1 = e ζ2 , and on V by Er |V = EV (h , h ) := ξ1 , ξ2 |sξ2 = hu ξ1 , yξ1 = hv ξ2 , where u + v = u + v = r. On the exceptional curve C˜ ∼ = P1 , the sheaf (E˜r /torsion)⊗r is isomorphic to ωP1 ((1− u) + (1 − v )), and degree considerations show that u + v = r, so u = u and v = v . ˜ if Di is the image of the i th section pi : T → X, the Moreover, in a neighborhood of C, invertible sheaf ωX˜ (− mi Di ) is trivial and is generated by the element w = dx x = dy ds th power map c˜ − dz = = − . The r is an isomorphism away from the nodes r,1 z s y ∼ EU (ev , eu ) and ˜ and since it is a power map (changing the isomorphisms E˜r |U = of X, E˜r |V ∼ = EV (hu , hv ), if necessary), it maps the generators ζi and ξi as follows: ζ1r → zu w,

ζ2r → x v w

ξ1r → s v w,

ξ2r → y u w.

and

Since c˜r,1 is an isomorphism away from the nodes, we have ζ1r = zr ξ1r , or ζ1 = zθξ1 , for some r th root of unity θ. Changing the isomorphism E˜r |V ∼ = EV (hu , hv ) by θ, we may assume ζ1 = zξ1 . On U ∩ V we also have ζ2 = sev ζ1 = ev ξ1 and ξ2 = zhu ξ1 = hu ζ1 . So global sections of E˜ are of the form (E˜r ) = ((fU ζ1 + fU ζ2 ), (fV ξ1 + fV ξ2 )) ∈ EU ⊕ EV | fU ζ1 + fU ζ2 = fV ξ1 + fV ξ2 on U ∩ V . We claim that the A-module E(π u , π v ) := η1 , η2 |xη2 = π u η1 , yη1 = π v η2 is isomorphic to (E˜r ) via η1 → (ζ2 , ev ξ1 ) and η2 → (hu ζ1 , ξ2 ).

Spin Gromov-Witten Invariants

525

The map is clearly an A-module homomorphism. Moreover, for any section ((fU ζ1 + fU ζ2 ), (fV ξ1 + fV ξ2 )) ∈ (E˜r ) we may assume that fU ∈ R[z] and fU ∈ R[[x]]. Likewise, we may assume that fV ∈ R[s] and fV ∈ R[[y]]. Consequently, we have zfU (z) + ev fU (x) − fV (s) − zq u fV (y) = 0, or zfU (z) + ev fU (ser ) − fV (s) − zhu fV (zhr ) = 0. Thus fU and fV are completely determined by fU = hu fV (y) and fV = ev fU (x). We may, therefore, map (E˜r ) to E(π u , π v ) via (hu fV (y)ζ1 + fU (x)ζ2 ), (ev fU (x)ξ1 + fV (y)ξ2 ) → fU (x)η1 + fV (y)η2 , and it is easy to check that this homomorphism is the inverse of the first. An identical argument shows that (E˜d ) is isomorphic to E(π u , π v ), where u ≡ u (mod d) and v ≡ v (mod d). This shows the existence of the desired presentation for st∗ E˜d . It remains to show that the maps st∗ c˜d,d are power maps (2). Again, since the arguments are essentially identical for each pair d and d , it suffices to prove this in the case of c˜r,σ for some σ dividing r. As above, we have u+v = r. Let σ be a divisor of r, and d = r/σ . Let u be the smallest non-negative integer congruent to ud modulo r and v be the smallest non-negative integer congruent to vd modulo r. Define integers u and v as u =

du − u dv − v and v = . r r

The module (E˜r ) ∼ = E(π u , π v ) is generated by η1 , and η2 with η1 = (ζ2 , ev ξ1 ) and u η2 = (h ζ1 , ξ2 ). Further, E˜σ may be defined on U by φ1 , φ2 |zφ2 = eu φ1 , xφ1 = ev φ2 and on V by ψ1 , ψ2 |sψ2 = hv ψ1 , yψ1 = eu ψ2 , so we may describe st∗ E˜σ as above: u v the module (E˜σ ) is isomorphic to E(π , π ), and is generated by γ1 = (φ2 , ev ψ1 ) and γ2 = (hu φ1 , ψ2 ). We must show that η1d−i η2i maps, via st∗ (c˜r,σ ), to π ui x v −i γ1 for 0 ≤ i ≤ u and to π v(d−i) y u −(d−i) γ2 for u ≤ i ≤ d. We will do the first case—the second case is similar. The element η1d−i η2i is of the form. η1d−i η2i = (ζ2 , ev ξ1 )d−i (hu ζ1 , ξ2 )i = (hui ζ1i ζ2d−i , ev(d−i) ξ1d−i ξ2i ), so on U , this element η1d−i η2i maps as hui ζ1i ζ2d−i → x v

−i

eui hui φ2 = π ui x v

−i

φ2 .

On V , the element η1d−i η2i maps as e(d−i)v ξ1d−i ξ2i → s v

−i

hiu e(d−i)v ψ1 .

It is straightforward to check that these are the same on U ∩ V . But this is exactly the canonical d th power map (2) for E(π v , π u )⊗d → E(π v , π u ), as desired.

526

T. J. Jarvis, T. Kimura, A. Vaintrob

Remarks 2.4.2. (1) It is important to note that if any of the mi is greater than r − 1, Lemma 2.4.1 is no longer true. In particular, the sheaf R 1 st∗ E˜r no longer vanishes in case 2 of the proof, and the subsequent fiber-to-family transitions are not valid. (2) As was mentioned in the Introduction, the entire theory including the above proof can be reformulated in the language of twisted curves (orbicurves). The proof in the orbicurve formulation requires the use of the Abramovich-Vistoli stabilization of twisted stable maps [2, Prop 9.1.1] instead of the usual Behrend-Manin stabilization that we use here, but the cases and conditions that need to be checked are essentially the same in both approaches. For this paper we chose the torsion-free sheaf formulation because it is more concrete, closer to the physical origin of the theory, 1/r and consistent with the papers [27, 26, 24] where the virtual class on Mg,n is constructed. It is also better suited to the treatment of the generalized descent property of Subsect. 4.6, which provides an important link between ordinary (non-spin) Gromov-Witten invariants with descendants and spin Gromov-Witten invariants without descendants. 3. Cohomology Classes 1/r,m

Here we introduce and study various cohomology classes in H • (Mg,n (V , β), Q) necessary for constructing spin Gromov-Witten invariants and the corresponding CohFT. 3.1. Tautological classes. There are many natural cohomology classes in 1/r,m H • (Mg,n (V , β), Q). Of special interest are the tautological classes induced by the universal sections 1/r,m

1/r,m

pi : Mg,n (V , β) → Cg,n

1/r,m

1/r,m

corresponding to the marked points of the universal curve π : Cg,n These are classes

→ Mg,n (V , β).

ψ˜ i := c1 (p∗i (Er ))

(6)

ψi := c1 (p∗i (ωπ )) and

(d) (and also classes ψ˜ i for each divisor d of r). In [16] it is proved that these classes are closely related:

r ψ˜ i = (mi + 1)ψi .

(7)

Although they will not be used in this paper, it is worth noting that the boundary classes, which are also of interest, have a combinatorial structure that is nicely described in terms of decorated graphs in a straightforward generalization of the methods of [16]. 3.2. Spin virtual class. Recall from [16, §4.1] that an r-spin virtual class on the stack of stable, r-spin curves gives, among other things, a cohomology class 1/r

1/r,m

cg,n (m) ∈ H 2D (Mg,n , Q) for every stable g, n, and r, (i.e., for 2g − 2 + n > 0). Here, the dimension D is

(8)

Spin Gromov-Witten Invariants

527

n 1 D= mi . (r − 2)(g − 1) + r

(9)

i=1

1/r

The collection of classes cg,n (m) is required to satisfy the axioms of convexity, cutting edges, vanishing, and forgetting tails. Currently two different constructions of a spin virtual class c1/r are known, an algebro-geometric [27] and an analytic [24]. We can use the choice of an r-spin virtual class for stable r-spin curves to produce a similar r-spin class for all stacks of stable spin maps in the stable range of (g, n). 1/r,m

1/r

Definition 3.2.1. Given an r-spin virtual class {cg,n (m) ∈ H 2D (Mg,n , Q)} satisfying the axioms of [16, §4.1], then for each V , and for each stable pair (g, n), we define 1/r,m the r-spin virtual class on Mg,n (V , β) by ∗ 1/r

1/r

1/r,m

c˜g,n (m) = st˜ cg,n (m) ∈ H 2D (Mg,n (V , β), Q).

(10)

In the case that (g, n) is not a stable pair (i.e., 2g − 2 + n ≤ 0), we define the r-spin virtual class directly. We do this first in genus zero. 1/r

Definition 3.2.2. If g = 0 and n < 3 then we define c˜0,n (m) to be the top Chern class of the dual of the first cohomology of the r-th root bundle Er ; namely, 1/r

c˜0,n (m) = cD (−R 1 π∗ Er ),

(11)

where Er is the r th root of the universal spin structure ({Ed }, {cd,d }) on the universal 1/r,m

curve π : C → M0,n (V , β). 1/r

In the case that g = 1 and n = 0, the moduli space M1,0 (V , β) decomposes into the disjoint union of d substacks, where d is the number of positive divisors of r (including 1 and r); these components correspond to the fact that (on the smooth locus) r-spin structures are in one-to-one correspondence with r-torsion points of the Jacobian of the underlying curve. No deformation of the underlying curve can take a point of order i to a point of order j unless i = j , so the moduli space breaks up into disjoint substacks 1/r

M1,0 (V , β) =

1/r,(i)

M1,0

(V , β).

i|r 1≤i≤r

We call i the index of the substack if the r th root is a point of exact order i in the Jacobian of the underlying curve. 1/r

Definition 3.2.3. If g = 1 and n = 0, define the r-spin virtual class c˜1,0 (V , β) to be the following 0-dimensional class −(r − 1) if the index is 1 1/r c˜1,0 = (12) 1 otherwise.

528

T. J. Jarvis, T. Kimura, A. Vaintrob

Proposition 3.2.4. If g = 0 and n < 3, and if no marking mi in m is equal to r − 1, the 1/r r-spin virtual class c˜0,n (m) has dimension zero; and thus we have 1/r c˜0,n (m)

=

0 if any mi = r − 1 1 otherwise.

Proof. The degree of the sheaf Er is an integer and is given by deg Er = (2g − 2 − mi )/r, hence when g = 0 we have

mi ≡ −2

(mod r).

1/r

The dimension D of c˜0,n is D = ((2 − r) +

mi )/r.

If n = 0 we have mi = 0, which implies r = 2, and we immediately have D = 0. If 2 ≥ n ≥ 1, then since 0 ≤ mi ≤ r − 2, we have 0 ≤ mi ≤ 2(r − 2); and hence mi = r − 2 is the only solution to the congruence mi ≡ −2 (mod r). Consequently, D = (2 − r + mi )/r = 0. If any of the mi are equal to r − 1, then the argument in the proof of Axiom 4 in [16, Theorem 4.1] shows that c˜1/r must be zero. 1/r Theorem 3.2.5. If g = 0, then c˜0,n (m) is the top Chern class cD (−R 1 π˜ ∗ E˜r ) of the bundle whose fiber is the dual of the first cohomology of the r th root E˜r on the universal 1/r,m curve π˜ : C˜ → M0,n (V , β).

Proof. For n < 3, this is true by definition. 1/r,m 1/r In the case that n ≥ 3, since g = 0, the r-spin virtual class c0,n ∈ H 2D (M0,n , Q) is the top Chern class cD (−R 1 π∗ Er ) of the first cohomology of the r th root Er on the 1/r,m universal curve π : C → M0,n , by the convexity axiom of [16, §4.1]. We have the following commutative diagram. C˜

φ C× @ @ π˜ @

@ R

1/r,m

1/r,m

M0,n

M0,n (V , β) p2 ?

1/r,m

M0,n (V , β)

p1 C

st˜ -

π ? 1/r

M0,n

˜ If E˜r is the r th root on C, ˜ Here φ is the natural map induced by π˜ and stabilization of C. then by Lemma 2.4.1 and the universality of the sheaves involved, φ˜ ∗ E˜r is isomorphic

Spin Gromov-Witten Invariants

529

to the pullback p1∗ Er of the r th root Er from C, and R 1 φ∗ E˜r = 0. By the Leray spectral sequence we have R 1 π˜ ∗ E˜r = R 1 p2∗ (p1∗ Er ). Even though the morphism st˜ is not flat, the natural map ∗ st˜ R 1 π∗ Er

- R 1 p2∗ (p ∗ Er )) 1

(13)

is an isomorphism. Indeed, since the morphism p2 has relative dimension 1, for any sheaf F we have R 2 p2∗ F = 0. This implies that the functor R 1 p2∗ ◦ p1∗ is right exact and therefore by [9, III.12.5] the map (13) is an isomorphism. Thus we have ∗

∗ 1/r

cD (−R 1 p2∗ (p1∗ Er )) = st˜ cD (−R 1 π∗ Er ) = st˜ c0,n = c˜0,n (m). 1/r

Remark 3.2.6. The proof of Theorem 3.2.5 depends upon the fact that the markings mi in m lie in the range 0 ≤ mi ≤ r − 1. In particular, when an mi lies outside that range, Lemma 2.4.1 fails. We shall also see (in Remark 4.6.2) that Theorem 3.2.5 is false in the case that any mi is larger than r − 1. 1/r

Definition 3.2.7. We define [Mg,n (V , β)]virt to be the pullback 1/r

[Mg,n (V , β)]virt := p˜ ∗ [Mg,n (V , β)]virt of the usual virtual fundamental class [Mg,n (V , β)]virt of Mg,n (V , β) via 1/r

p˜ : Mg,n (V , β) → Mg,n (V , β). 3.3. Decomposition of classes. Using the notation of the commutative diagram (5), since ev ˜ i = evi ◦ p, ˜ for any γ1 , . . . , γn ∈ H • (V , Q) we have the equality ev ˜ ∗1 γ1 ∪ ev ˜ ∗2 γ2 ∪ · · · ∪ ev ˜ ∗n γn = p˜ ∗ (ev1∗ γ1 ∪ · · · ∪ evn∗ γn ). We also have the following important relation on pushforwards of classes, which is the crucial step in proving that the CohFT defined by stable r-spin maps is the tensor product of the CohFTs of r-spin curves and stable maps (Theorem 4.3.2). Theorem 3.3.1 (Decomposition). Given any set {γ1 , . . . , γn } of classes in A∗ (V ) (or 1/r,m H • (V )), and given the r-spin virtual class c˜1/r on Mg,n (V , β) defined by Eqs. (10), (11) and (12), the relation

n 1/r ∗ 1/r virt q∗ c˜ ∪ ev ˜ i (γi ) ∩ [Mg,n (V , β)] i=1

= p∗ c

1/r

∪ st∗

n i=1

holds.

evi∗ (γ ) ∩ [Mg,n (V , β)]virt

(14)

530

T. J. Jarvis, T. Kimura, A. Vaintrob

Proof. We will give the proof on the level of (operational) Chow groups A∗ with notation as in [22, V §8]. From [22, VI §2] it will follow then that such results also hold for H • (V ). 1/r To begin, let us fix some notation. We denote the identity maps on Mg,n , Mg,n , 1/r

1/r

Mg,n (V , β), Mg,n ×Mg,n Mg,n (V , β), and Mg,n (V , β) by I, Ir , IV , I× , and Ir,V , 1/r 1/r 1/r respectively. We have c1/r ∈ A∗ (Mg,n ) := A¯ ∗ (Ir : Mg,n → Mg,n ), and c˜1/r =

∗ st˜ (c1/r ) ∈ A∗ (Mg,n (V , β)). We take γi in A∗ (V ), so that ev ˜ ∗i (γi ) is in A∗ (Mg,n (V , β)). Also, we have [Mg,n (V , β)]virt ∈ A∗ (Mg,n (V , β)). Finally, by 1/r

1/r

c˜1/r ∪

n

ev ˜ ∗i (γi ) ∩ [Mg,n (V , β)]virt 1/r

i=1

we mean

c˜

1/r

∪

n i=1

1/r

ev ˜ i (γi )

∩ [Mg,n (V , β)]virt . Ir,V

As in [22, V §8.9], for any morphism Y → X, we define f ∗ : A∗ (X) → A∗ (Y ) to be f ∗ (δ)h ∩ y := δf ◦h ∩ y,

(15)

where δ ∈ A∗ (X) and h : L → Y is an arbitrary morphism, and y ∈ A∗ (L). We also define, for any proper, flat morphism f : Y → X of Deligne-Mumford stacks X and Y , the proper flat pushforward f• : A∗ (Y ) → A∗ (X) to be f• αg ∩ c := f∗ (αfY ∩ f ∗ (c)),

(16)

where g : L → X is an arbitrary morphism, α is an element of A∗ (Y ), and c is an element of A∗ (X). Remark 3.3.2. Note that part (ii) of Manin’s definition in [22, V §8.9] of the operational Chow ring A∗ (M) for the identity morphism I : M → M states that elements of A∗ (M) only need to commute with pullback along representable, flat morphisms of DM-stacks, despite the fact that standard definitions of general operational Chow rings require that these elements commute with pullback along all flat morphisms of DM-stacks (see Vistoli [28, 5.1.i] and Manin [22, V.8.1.i]). In what we do below, we will need the definition of A∗ (M) that requires commutativity with all flat pullbacks; that is, we require the following. Let f : X → Y be a flat morphism of Deligne-Mumford stacks, which is not necessarily representable, and let h : Y → Z be an arbitrary morphism of Deligne-Mumford stacks. For any σ ∈ A∗ (Z) and y ∈ A∗ (Y ), we have σh◦f ∩ f ∗ (y) = f ∗ (σh ∩ y).

(17)

This seemingly minor difference in the definition of A∗ allows us to prove a projection formula for non-representable morphisms.

Spin Gromov-Witten Invariants

531

Lemma 3.3.3. Let f : X → Y be a proper, flat morphism of Deligne-Mumford stacks (which is not necessarily representable). 1. For an arbitrary morphism h : L → Y of Deligne-Mumford stacks we have h∗ f• = fL• h∗X .

(18)

2. (Projection formula for f• ) For any σ ∈ A∗ (X) and β ∈ A∗ (Y ) we have f• (σf ∗ (β)) = f• (σ )β.

(19)

Proof. For Part 1 of the lemma, the same proof as given by Manin for this equation [22, V.8.30] works exactly for our case, too; nowhere is the representability of f used in Manin’s proof. For Part 2, again Manin’s proof of the projection formula [22, V.8.29] works for nonrepresentable morphisms, the only change needed is that [22, V.8.22] (commutativity with flat, representable pullbacks) must be replaced by our Eq. (17) for non-representable, flat pullbacks. One more fact we will need in the proof of Theorem 3.3.1 is the commutativity with proper pushforwards required by the definition of A∗ (cf. [22, V.8.21]); namely, if p : P → L is proper, and h : L → M is an arbitrary morphism, then by definition of A∗ (M), for any σ ∈ A∗ (M) and for any y ∈ A∗ (P ) we have σh ∩ p∗ (y) = p∗ (σhp ∩ y).

(20)

Now we may proceed with the proof of Theorem 3.3.1. We will refer throughout the proof to the notation of the commutative diagram (5). Since q1 is a birational map, it is a splitting morphism (i.e., q1• q1∗ = I× , as a map on A∗ ). Moreover, the morphism st is proper,p is flat [11, Theorem 2.2] and proper [11, Theorem 2.3], and q1 is flat and proper by Proposition 2.2.3. We have the following relations:

c˜

q∗

1/r



∪

n

(dfn. of q1• )

∗

ev ˜ i (γi ) ∩ p˜ [Mg,n (V , β)]

i=1

= q2∗ q1∗ q1∗ pr1∗ c1/r ∪ pr2∗ 

n

= q2∗ q1• q1∗ pr1∗ c1/r ∪ pr2∗

n



∩ q1∗ pr2∗ [Mg,n (V , β)]virt 

evi∗ (γi )

i−1

virt

Ir,V

∩ pr2∗ [Mg,n (V , β)]virt 

evi∗ (γi ) I

i−1

×

pr1∗ c1/r ∪ pr2∗ evi∗ (γi ) (q1 is splitting) = q2∗ ∩ pr2∗ [Mg,n (V , β)]virt I×

= st∗ pr2∗ pr1∗ c1/r ∪ pr2∗ evi∗ (γi ) ∩ pr2∗ [Mg,n (V , β)]virt

I×



532

T. J. Jarvis, T. Kimura, A. Vaintrob

evi∗ (γi ) = st∗ pr2• pr1∗ c1/r ∪ pr2∗ ∩ [Mg,n (V , β)]virt IV

pr2• (pr1∗ c1/r ) ∪ (prj. fmla. for pr2• ) = st∗ evi∗ (γi ) ∩ [Mg,n (V , β)]virt IV

(st ∗ p• c1/r ) ∪ (Equation 18) = st∗ evi∗ (γi ) ∩ [Mg,n (V , β)]virt IV ∗ 1/r ∗ = st∗ (st p• c )IV ∩ ( evi (γi ))IV ∩ [Mg,n (V , β)]virt 1/r (dfn. of st ∗ ) = st∗ (p• c )st ∩ ( evi∗ (γi ))IV ∩ [Mg,n (V , β)]virt 1/r (Equation 20) = (p• c )I ∩ st∗ ( evi∗ (γi ))IV ∩ [Mg,n (V , β)]virt . (dfn. of pr2• )

This completes the proof of Theorem 3.3.1.

4. Gromov-Witten Invariants and Tensor Products of CohFTs 4.1. Standard Gromov-Witten invariants. Let V be a smooth projective variety. The 1/r moduli space of stable r-spin maps Mg,n (V ) gives rise to a set of correlators satisfying axioms analogous to those satisfied by Gromov-Witten invariants. This will follow from 1/r Theorem 4.3.2, which states that the CohFT associated to Mg,n (V ) is the tensor product of the Gromov-Witten CohFT with the r-spin CohFT. (V ) We recall that the Gromov-Witten invariants g,n,β : H • (V , C) → H • (Mg,n , C), defined as n (V ) g,n,β (γ1 , . . . , γn ) = st∗ ( evi∗ γi ) ∩ [Mg,n (V , β)]virt , i=1

can be combined in formal power series as follows: Definition 4.1.1. Let R denote the ring consisting of formal sums of expressions q β with complex coefficients, where β ∈ H2 (V , Z) belongs to the semigroup B(V ) of numerical equivalence classes such that β · L ≥ 0 for all ample divisor classes L in V . Further(V ) more, we impose on R the relations q β1 +β2 = q β1 q β2 . We define g,n : H • (V )⊗n → H • (Mg,n , R) as (V ) ) (V q β g,n,β . g,n := β

Let (V ) denote the collection {g,n } and 1 denote the unit in H • (V ). Let η be the Poincar´e pairing on H • (V ) and let ηµν := η(eµ , eν ) be the coefficients of its matrix with respect to a basis {eµ } for H • (V ). Denote by (ηµν ) the inverse matrix of (ηµν ). (V )

Recall that a fundamental property of the Gromov-Witten invariants (V ) is that they define a CohFT on (H • (V ), η) with flat identity over R [19]. We refer the reader to [19, 16] for further details about CohFTs.

Spin Gromov-Witten Invariants

533

4.2. Spin CohFT. Like ordinary Gromov-Witten invariants defined by means of stable maps, the spin Gromov-Witten invariants also form CohFTs. Definition 4.2.1. Let r ≥ 2 be an integer and let (H(r) , η(r) ) be the (r − 1)-dimensional C vector space with basis {e0 , . . . , er−2 } together with a metric η(r) m1 ,m2 := η(r) (em1 , em2 ) = δm1 +m2 ,r−2 . 1/r

Let c1/r be an r-spin virtual class on Mg,n satisfying the axioms from [16, §4.1]. Let (r) (r) g,n : H

⊗n

→ H • (Mg,n )

be defined by 1/r,m

1−g (r) p∗ cg,n g,n (em1 , . . . , emn ) := r

(21) 1/r,m

for all nonnegative numbers g, n such that 2g − 2 + n > 0 where p : Mg,n Finally, let

(r)

denote the collection

→ Mg,n .

(r) {g,n }.

Remark 4.2.2. As in the case of the CohFT based on ordinary stable maps, the clas(r) ses {g,n } a priori may depend on the choice of the spin virtual class c1/r . Currently, 1/r

there exist two different constructions of a candidate for such class on Mg,n : an algebro-geometric construction of [27], resembling algebraic constructions of the virtual fundamental class, and an analytic construction of [24] developing Witten’s original idea [29]. While it is not known yet whether these constructions give the same class for all g and r, they agree when g = 0 (and any r) or r = 2 (and any g). In these cases, any class satisfying the axioms must be equal to the class constructed in [16] and therefore (r) the resulting classes {g,n } and the corresponding correlators do not depend on this choice. Theorem 4.2.3 ([16, Theorem 3.8]). For each integer r ≥ 2, the triple (H(r) , η(r) , (r) ) forms a CohFT with flat identity e0 . It is called the r-spin CohFT. 1/r

Since the space Mg,n is associated to the r-spin CohFT, and the space Mg,n (V ) is associated to Gromov-Witten theory, it is natural to ask if there is a natural CohFT 1/r associated to the space Mg,n (V , β). The answer is yes, and this CohFT is the tensor product of the other two. 4.3. Tensor products of CohFTs. The category of cohomological field theories has a canonical tensor product operation (see [21]). This reflects the fact that the diagonal map Mg,n → Mg,n × Mg,n is a coproduct with respect to the composition maps of the modular operad {H• (Mg,n )}. In the case of Gromov-Witten invariants, Behrend [4] proved that the CohFT arising from Mg,n (V ×V ) is the tensor product of that arising from Mg,n (V ) and Mg,n (V ). Restricting to genus zero, one can regard this as a deformation of the K¨unneth theorem. Similarly, it was shown in [17] that the tensor product of an r-spin CohFT and an r -spin CohFT can be geometrically realized by means of the moduli space of (r, r )-spin curves. To complete this picture, we need to provide an intersection-theoretical description of the tensor product of the Gromov-Witten theory with the r-spin CohFT.

534

T. J. Jarvis, T. Kimura, A. Vaintrob

Definition 4.3.1. Let (H • (V , C), ηP ) denote the cohomology of V together with its Poincar´e pairing ηP . Let (H(V ,r) , η) denote the tensor product of (H • (V ), ηP ) with (H(r) , η(r) ). For each stable pair (g, n) and β ∈ H2 (V , Z), define the (cohomological) correlators (or the spin Gromov-Witten invariants) to be linear maps g,n,β : H(V ,r) → H • (Mg,n , C) (V ,r)

given by (V ,r) g,n,β (γ1

1/r,m (c˜g,n

⊗ em1 , . . . , γn ⊗ emn ) = Q∗

n

1/r evi∗ γi ) ∩ [Mg,n (V , β)]virt

, (22)

i=1 1/r

where Q : Mg,n (V ) → Mg,n is the morphism that forgets both the stable map and 1/r

1/r

the r-spin structure, [Mg,n (V , β)]virt is the virtual fundamental class of Mg,n (V ), and γi ⊗ emi ∈ H(V ,r) . The following theorem holds. (V ,r)

Theorem 4.3.2. Let g,n : H(V ,r)

⊗n

→ H • (Mg,n , R), where

,r) (V g,n :=

(V ,r)

q β g,n,β .

β

Let (V ,r) denote the collection {g,n }. The collection (H • (V , R), η, ) forms a CohFT (over the ground ring R) with flat identity 1 ⊗ e0 and is the tensor product of the CohFTs (H • (V , R), η, (V ) ) and (H(r) , η(r) , (r) ). (V ,r)

Proof. This is an immediate consequence of Theorem 3.3.1. The r-spin CohFTs behave as though the elements of H(r) were cohomology classes of fractional dimension, similar to the orbifold cohomology classes of Chen and Ruan [6]. Since r-spin CohFTs correspond to the case of r-spin maps into a point, the elements of B(V ) in that theory are all trivial. However, the theory associated to r-spin maps into a general target V does satisfy axioms analogous to those of Gromov-Witten theory. In particular, this theory, like the Gromov-Witten theory, is of qc-type in the sense of [22, 23]. (V ,r)

4.4. Spin Gromov-Witten invariants. The classes g,n,β have properties analogous to those of Gromov-Witten invariants. (V ,r)

Theorem 4.4.1. Let (g, n) be a stable pair of integers. The collection {g,n,β } satisfies the following properties: (V ,r)

1. (Effectivity) g,n,β = 0 if β ∈ / B(V ). (V ,r)

2. (Sn -Equivariance) Each map g,n,β is Sn -equivariant. 3. (Degeneration Axioms) Given a basis {eµ } for H(V ,r) , let η(V ,r) µν := η(V ,r) (eµ , eν ) µν and (η(V ,r) ) denote the inverse matrix.

Spin Gromov-Witten Invariants

535

(a) Let - Mg,n

ρtree : Mk,j +1 × Mg−k,n−j +1

be the gluing map corresponding to the stable graph i1

ij+1

tree =

k

ij

.

g-k in

(V ,r)

The forms g,nβ satisfy the composition property: ρ∗ tree g,n,β (γ1 , γ2 , . . . , γn ) = (V ,r)

µν

(V ,r)

β1 +β2 =β

(V ,r)

k,j +1,β1 (γi1 , . . . , γij , eµ )η(V ,r) ⊗ g−k,n−j +1,β2 (eν , γij +1 , . . . , γin )

for all γi ∈ H(V ,r) . (b) Let - Mg,n

ρloop : Mg−1,n+2

be the gluing map corresponding to the stable graph i

loop =

i

1 2

g-1

.

i n

(V ,r)

The forms g,nβ satisfy the composition property: ρ∗ loop g,n,β (γ1 , γ2 , . . . , γn ) = g−1,n+2,β (γ1 , γ2 , . . . , γn , eµ , eν )η(V ,r) (V ,r)

(V ,r)

µν

for all γi ∈ H(V ,r) . 4. (Identity Axiom) Let 1 := 1 ⊗ e0 , where 1 is the unit in H • (V ) and e0 the unit of H(r) . We have g,n+1,β (γ1 , . . . , γn , 1) = π ∗ g,n,β (γ1 , . . . , γn ) (V ,r)

(V ,r)

for all γi ∈ H(V ,r) , where π : Mg,n+1 (V ) → Mg,n is the forgetful morphism. (V ,r) 5. (Dimension Axiom) Let KV denote the canonical class on V . The map g,n,β of Z-graded modules must be homogeneous of degree 2 (V ,r) g,n,β = 2 KV + 2(g − 2) dimC V + (r − 2)(g − 1). r β 6. (Divisor Axiom) Let α ⊗ e0 belong to H 2 (V ) ⊗ H(r) . We have (V ,r) π∗ g,n+1,β (γ1 , . . .

, γn , α ⊗ e0 ) =

(V ,r) g,n,β (γ1 , . . .

, γn )

α, β

for all γi ∈ H(V ,r) , where π : Mg,n+1 (V ) → Mg,n is the forgetful morphism.

536

T. J. Jarvis, T. Kimura, A. Vaintrob

7. (Mapping to a Point Axiom) ,r) (V g,n (γ1

⊗ em1 , . . . , γn ⊗ emn ) = p2∗

p1∗ (

n

1/r,m

γi ) ∪ cd (T V L) ∪ p∗ cg,n

i=1

for all γi ∈ H • (V ), where p1 : V × Mg,n → V and p2 : V × Mg,n → Mg,n are the canonical projections, T V is the tangent bundle, L = R 1 π∗ OCg,n where OCg,n is the structure sheaf on the universal curve π : Cg,n → Mg,n , and d = g dimC V 1/r

(the rank of T V ⊗ L). Finally, p : Mg,n → Mg,n is the morphism forgetting the spin structure and m = (m1 , . . . , mn ). Proof. All axioms follow immediately from Theorem 3.3.1 and the corresponding properties of usual Gromov-Witten invariants [19]. 4.5. Potential functions. Recall the potential functions associated to Mg,n (V ). Definition 4.5.1. Consider the correlation functions τa1 (γ1 ) . . . τan (γn ) g,β := [Mg,n

(V ,β)]virt

n

(ψiai evi∗ γi )

i=1

H • (V ).

for all integers a1 , . . . , an ≥ 0 and γ1 , . . . , γn in Correlation functions such that some of the ai are nonzero are called gravitational descendants. The large phase space potential (function) associated to Mg,n (V ) is ) −2 2 α (V ) (t) := λ2g−2 (V g (t) ∈ λ R[[λ ]][[ta ]], g≥0

where

) (V g (t) :=

exp(t · τ ) g,β q β

β∈B(V )

and t · τ :=

taα τa (εα ),

a≥0 α

relative to a basis {εα } for H • (V ) such that ε0 is the identity. The small phase space potential (function), (V ) (x) where x = (x 1 , . . . , x n ) are coordinates on H • (V ) relative to the basis {εα }, is obtained from (V ) (t) by setting x α := t0α and taα := 0 for all a ≥ 1 and all α. 1/r

There are analogous potential functions associated to Mg,n (V ). Definition 4.5.2. Consider the correlation functions τa1 (γ1 ⊗ em1 ) . . . τan (γn ⊗ emn ) g,β := 1/r,m

[Mg,n (V ,β)]virt

r 1−g c˜1/r (m)

n (ψiai evi∗ γi ) i=1

Spin Gromov-Witten Invariants

537

for integers a1 , . . . , an ≥ 0, γ1 , . . . , γn ∈ H • (V ), and em1 , . . . , emn ∈ H(r) . Correlation functions such that some of the ai are nonzero are called gravitational descendants. 1/r The large phase space potential (function) associated to Mg,n (V ) is ,r) (V ,r) (u) := λ2g−2 (V (u) ∈ λ−2 R[[λ2 ]][[H(V ,r) ]], g g≥0

where ,r) (V (u) := g

exp(u · τ ) g,β q β

β∈B(V )

and u · τ :=

uα,m a τa (εα ⊗ em ),

a≥0 α,m

relative to the basis {εα ⊗ em } for H(V ,r) . The small phase space potential (function), (V ,r) (y) where y consists of coordinates {y α,m } on H • (V ) relative to the basis {εα ⊗ em }, is obtained from (V ,r) (u) by setting y α,m := uα,m and uα,m := 0 for all a ≥ 1 and all α, m. a 0 Theorem 4.5.3. The small phase space potential function (V ,r) (y) is completely deter(V ) (r) mined by the potential (V ) (x), the cohomological correlators {g,n }, and {g,n }. Proof. Theorem 3.3.1 shows that the intersection numbers γ1 ⊗ em1 · · · γn ⊗ emn g,n (V ) (r) are completely determined by the classes {g,n } and {g,n } if (g, n) is stable. We must still address the unstable cases—when (g, n) ∈ {(0, 0), (0, 1), (0, 2), (1, 0)}. But by Proposition 3.2.4 and Definition 3.2.3, these are always of dimension zero. 1/r,m 1/r,m,(i) 1/r,m,(i) Let Mg,n := i Mg,n , where Mg,n are the connected components of 1/r,m

1/r,m,(i)

Mg,n , and let p˜ (i) : Mg,n

(V , β) → Mg,n (V , β) be the morphisms forgetting 1/r,m,(i)

the r-spin structure. Furthermore, let c˜1/r,m,(i) be c˜1/r restricted to Mg,n and let 1/r,m,(i) us assume that c˜ is zero dimensional. For all γ ⊗ e := γ1 ⊗ em1 · · · γn ⊗ emn in H(V ,r) , we have 1/r,m 1−g ˜ ∗ γ ∪ c˜1/r ) ∩ [Mg,n (V , β)]virt (ev γ ⊗ e g,β = r 1/r,m,(i) 1/r,m ˜ ∗ γ ∩ [Mg,n (V , β)]virt = c˜g,n r 1−g ev i

=

1/r,m,(i) 1−g c˜g,n r

i

=

1/r,m,(i) 1−g c˜g,n r

i

=

1/r,m,(i) 1−g

c˜g,n

r

∗ ∗ p˜ (i) ev∗ γ ∩ p˜ (i) [Mg,n (V , β)]virt ∗ (ev∗ γ )p˜(i) ∩ p˜ (i) [Mg,n (V , β)]virt

∗ p˜ (i) ev∗ γ ∩ [Mg,n (V , β)]virt

i

=

i

1/r,m,(i) 1−g

c˜g,n

r

deg(p˜ (i) )

ev∗ γ ∩ [Mg,n (V , β)]virt

538

T. J. Jarvis, T. Kimura, A. Vaintrob

=

1/r,m,(i) 1−g

c˜g,n

r

deg(p˜ (i) )γ g,β ,

i

where deg denotes the (orbifold) degree of p˜ (i) . This completes the proof.

4.6. The descent property. In this subsection, we show that when g = 0, our construc1/r tions on M0,n (V , β) satisfy a generalization of the so-called descent property (introduced in [15]). This property of r-spin invariants gives a geometric origin for the ψ classes (at least in genus zero) in the definition of the usual Gromov-Witten invariants of V . 1/r It may seem curious that Mg,n (V , β) is defined to be the disjoint union of 1/r,m

Mg,n (V , β), where the n-tuple of nonnegative integers m = (m1 , . . . , mn ) is required to satisfy mi ≤ r − 1 for all i = 1, . . . , n. The latter restriction, however, is reasonable because of the isomorphism 1/r, m

Mg,n (V , β)

m+rδ i - M1/r, (V , β) g,n

from Proposition 2.1.5, where i = 1, . . . , n, δ i is the n-tuple whose i-th component is 1 and the rest are zero, and m := (m ˜ 1, . . . , m ˜ n ) is any n-tuple of nonnegative integers. On the other hand, in genus zero the classes c1/r ( m) change under this identification in the following manner. Theorem 4.6.1. (The descent property) Let m = (m ˜ 1, . . . , m ˜ n ) be an n-tuple of nonnegative integers and let m = (m1 , . . . , mn ) be the reduction of m (mod r) (i.e., m ≡m (mod r) and 0 ≤ mi ≤ r − 1 for i = 1, . . . , n). 1/r,m m) be the top Chern class of the vector bundle R 1 π∗ E( m)∗ on M0,n . Let c˜1/r ( 1/r,m

The following equation is satisfied on M0,n for all i = 1, . . . , n, where δ i is the n-tuple whose i-th component is 1 and the rest are zero: r c˜1/r ( m + rδ i ) = −(m ˜ i + 1)ψi c˜1/r ( m).

(23)

1/r

Proof. The proof is identical to the case of Mg,n in [15]. It follows from the short exact sequence 0

- Er ( m + rδ i )

- Er ( m)

- σ ∗ Er ( m) i

- 0

and the fact that m)) = (mi + 1)ψi r ψ˜ i := rc1 (σi∗ Er ( for all i = 1, . . . , n, which follows from an immediate generalization of Proposition 2.2 from [16]. 1/r

1/r

Remark 4.6.2. The descent property holds on both M0,n and M0,n (V , β), but the ψ 1/r

1/r

classes on M0,n (V , β) are not pullbacks of the corresponding ψ classes on Mg,n — just as in the case of stable maps, they differ by divisors that are collapsed under the stabilization map (see [20, 22]). This illustrates the fact, alluded to in Remarks 2.4.2 ∗ m) is not equal to the and 3.2.6, that when any m ˜ i is larger than r − 1, the class st˜ c1/r ( 1/r class c˜ ( m).

Spin Gromov-Witten Invariants

539

The previous theorem motivates the following generalization of the small phase space potential function in genus zero. Definition 4.6.3. Let the n-tuples m = (m ˜ 1, . . . , m ˜ n ) and m and the class c˜1/r ( m) on 1/r,m M0,n be the same as in the previous theorem. Define the correlation functions n 1/r τ˜0 (γ1 ⊗ em˜ 1 ) . . . τ˜0 (γn ⊗ em˜ n ) 0,β := r c ˜ ( m ) ev ˜ ∗i γi . 1/r,m [M0,n (V ,β)]virt

i=1

Consider the analog of the genus zero small phase space potential (V ,r) (t˜ ) ∈ R[[λ2 ]][[t˜α,m˜ ]], 0 where (V ,r) (t˜ ) := 0

exp(t˜ · τ ) 0,β q β ,

β∈B(V )

and t˜ · τ :=

t˜α,m˜ τ˜0 (εα ⊗ em˜ ),

α,m ˜

where the last sum runs over all α and all nonnegative integers m. ˜ (V ,r) (t˜ ) and (V ,r) (u) Corollary 4.6.4. Let r ≥ 2 be an integer. The potential functions 0 0 are equal after making the assignment: t˜α,(ar+m) :=

(−1)a r a uα,m , [r(a − 1) + m + 1]r a

where a and m are nonnegative integers such that m ≤ r − 1 and [r(a − i) + m + 1]r :=

a

(r(a − i) + m + 1).

i=1

5. Examples and Special Cases 5.1. The case of r = 2. In [16, 29], the virtual class c1/r (m) when r = 2 was constructed for all genera and n-tuples m = (m1 , . . . , mn ) with 0 ≤ mi ≤ 1. It was shown that the r = 2 case reduced to the Gromov-Witten invariants of a point. A similar result is true for all 2-spin Gromov-Witten invariants. Theorem 5.1.1. For a pair of nonnegative integers (g, n) and β ∈ H2 (V , Z) let p˜ : 1/2 Mg,n (V , β) → Mg,n (V , β) be the map forgetting the spin structure. For i = 1, . . . , n, let γi ⊗ e0 belong to H(V ,r) , then

n n 1/2 ∗ 1−g 1/2 virt 2 p˜ ∗ c (0) (ev = ˜ i γi ) ∩ [Mg,n (V , β)] (evi∗ γi ) ∩ [Mg,n (V , β)]virt . i=1

i=1

Consequently, the large phase space potential functions (V ,2) (u) and V (t) agree after (α,0) = taα . setting ua

540

T. J. Jarvis, T. Kimura, A. Vaintrob

Proof. This was proved in the case where V is a point in [16]. The same proof goes through here using the definition of c˜1/r (which is now defined in the unstable range) 1/2 and the fact that [Mg,n (V , β)]virt = p˜ ∗ [Mg,n (V , β)]virt . 5.2. The case of g = 0 and β = 0. Genus zero Gromov-Witten invariants of V give rise to the quantum cohomology of V , which is a certain deformation of the cup product on H • (V ). The cup product itself appears as the β = 0 part of the genus zero potential 1/r function. Similarly, the Frobenius structure associated to Mg,n (V ) can be regarded as a deformation of the following commutative, associative product on H(V ,r) . Proposition 5.2.1. Let V be a smooth projective variety and n ≥ 3 be an integer. Let γ1 , . . . , γn belong to H • (V ) and e0 , . . . , er−2 be the standard basis in H(r) , then 1/r γ1 ⊗ em1 · · · γn ⊗ emn g,β=0 = c (m) γ1 ∪ . . . ∪ γ n . 1/r,m M0,n

V

Proof. This follows from the Mapping to a Point property.

5.3. The case of g = 0, r = 3, and V = P1 . Throughout this section let r = 3 and V = P1 . We will now compute its genus zero small phase potential function, denoted by (P1 ,3)

χ (t) := 0

(t),

where t is a set of coordinates t α,m associated to the basis {τα,m := εα ⊗ em } (where 1 α = 0, 1 and m = 0, 1) for H(P ,3) . Here ε0 is the identity element in H • (P1 ) and ε1 is 2 1 the element in H (P ) Poincar´e dual to a point. The metric in this basis is η(α1 ,m1 ),(α2 ,m2 ) := η(εα1 ⊗ em1 , εα2 ⊗ em2 ) = δα1 +α2 ,1 δm1 +m2 ,1 . The potential function can be broken into two pieces: χ (t) = χβ=0 (t) + (t), where χβ=0 (t) consists of only those terms corresponding to the moduli spaces M0,n (P1 , 0); while (t) contains the contributions (“instanton corrections”) from M0,n (P1 , β), where β = 0. Corollary 5.2.1 implies that χβ=0 (t) =

1 1,1 0,0 2 1 t (t ) + t 0,0 t 0,1 t 1,0 + t 1,1 (t 0,1 )3 . 2 18

(24)

Theorem 4.4.1 implies that (t) =

β≥1 n1 ,n2 ≥0

qβ

(t 0,1 )n1 (t 1,0 )n2 (t 1,1 )6β+2n1 −5 n1 n2 6β+2n1 −5 τ0,1 τ1,0 τ1,1 β . n1 !n2 !(6β + 2n1 − 5)!

(25)

Furthermore, Theorem 4.4.1 implies that the potential function must satisfy the WDVV equation ∂ 3 χ (t) ∂ 3 χ (t) (α+ ,m+ ),(α− ,m− ) η = ∂t α1 ,m1 ∂t α2 ,m2 ∂t α+ ,m+ ∂t α− ,m− ∂t α3 ,m3 ∂t α4 ,m4

Spin Gromov-Witten Invariants

541

∂ 3 χ (t) ∂ 3 χ (t) (α+ ,m+ ),(α− ,m− ) η ∂t α3 ,m3 ∂t α2 ,m2 ∂t α+ ,m+ ∂t α− ,m− ∂t α1 ,m1 ∂t α4 ,m4 for all mi , αi = 0, 1 and i = 1, . . . , 4, and where the summation convention has been used. Setting (α1 , m1 ) = (1, 0), (α2 , m2 ) = (0, 1), and (α3 , m3 ) = (α4 , m4 ) = (1, 1) in the WDVV equation and plugging in Eq. (24), we obtain 3 2 2 ∂1,1 = − ∂0,1 ∂1,0 ∂1,0 ∂1,1 2 2 ∂0,1 ∂1,1 − ∂0,1 ∂1,0 1 2 + t 0,1 ∂0,1 ∂1,1 3 2 2 + ∂0,1 ∂1,1 ∂1,0 ∂1,1

+ (∂0,1 ∂1,0 ∂1,1 )2 , where we have used the shorthand notation

n ∂ n ∂α,m = . ∂t α,m Together with the Divisor Axiom in Theorem 4.4.1, we obtain the recursion relations for β = 1 correlators 3 1 = τ0,1 τ1,1

1 2 τ τ1,1 1 , 3 1,0

and, for all n1 ≥ 2, n1 2n1 +1 τ1,1 1 = τ0,1

n1 n1 −1 2n1 −1 τ 1 . τ 3 0,1 1,1

These collectively imply that for all n1 ≥ 1, n1 2n1 +1 τ1,1 1 = τ0,1

n1 ! 2 τ τ1,1 1 . 3n1 1,0

Furthermore, the tensor product property implies that 2 τ1,1 1 = 1. τ1,0

Together with the Divisor Axiom, this determines all of the β = 1 correlators. If β ≥ 2 then we obtain the following recursion relation for all n1 ≥ 0:

n1 β 2 n1 −1 6β+2n1 −7 n1 6β+2n1 −5 τ0,1 τ0,1 τ1,1 τ1,1 = β β 3

n 6β +2n −5 − 8 n 6β + 2n n +2 6β +2n1 −1 1 1 1 1 τ 1τ + −β β τ τ 0,1 1,1 β 1,0 1,1 β n1 6β + 2n1 − 1

n1 6β + 2n1 − 8 n +1 6β +2n1 −3 n +1 6β +2n1 −3 −(β )2 τ1,01 τ1,1 τ0,11 τ1,1 β β n1 6β + 2n1 − 3

542

T. J. Jarvis, T. Kimura, A. Vaintrob

n1 6β + 2n1 − 8 n1 +2 6β +2n1 −1 n1 6β +2n1 −5 τ τ +(β )2 τ0,1 τ1,1 β 1,0 1,1 β n1 6β + 2n1 − 2

n +1 6β +2n −3

n1 6β + 2n1 − 8 n1 +1 6β +2n1 −3 1 , τ 1 τ1,1 τ +β β τ 0,1 1,1 β 1,0 β n1 6β + 2n1 − 4 where the summation is over β , β ≥ 1 such that β = β + β , and over n1 , n1 ≥ 0 such that n1 = n1 + n1 . Furthermore, we have defined −1 τ1,1 β := 0. τ0,1 6β−7

Together with the Divisor Axiom, these recursion relations completely determine all of the n-point correlators of the theory where n ≥ 3. Finally, the 0, 1 and 2 point correlators (those in the unstable range) are determined as a special case of Theorem 4.5.3. The only nonvanishing correlators of these types are τ1,1 1 = τ1,0 τ1,1 1 = 1. Acknowledgements. Parts of the paper were written while T.K. was visiting the ´ Universit´e de Bourgogne and A.V. was visiting Institut des Hautes Etudes Scientifiques. We would like to thank these institutions for their hospitality and support. References 1. Abramovich D., Jarvis T.: Moduli of twisted spin curves. Proc. Amer. Math. Soc. 131, no. 3, 685–699 (2002) 2. Abramovich D., Vistoli A.: Compactifying the space of stable maps. J. Amer. Math. Soc. 15, no. 1. 27–75 (2001) 3. Behrend K.: Gromov-Witten invariants in algebraic geometry. Invent. Math. 127, 601–617 (1997) 4. Behrend K.: The product formula for Gromov-Witten invariants. J. Alg. Geom. 8, 529–541 (1999) 5. Behrend K., Manin Yu., Stacks of stable maps and Gromov-Witten invariants. Duke Math. J. 85, 1–60 (1996) 6. Chen W., Ruan Y.: A new cohomology theory for orbifold. Commun. Math. Phys. 248, no. 1, 1–31 (2004) 7. Dubrovin B.: Geometry of 2D topological field theories. In: “Integrable Systems and Quantum Groups,” Lecture Notes in Math. 1620, Berlin: Springer-Verlag, 1996 ´ ements de G´eom´etrie Alg´ebrique IV: Etude ´ 8. Grothendieck A., Dieudonn´e J.: El´ Locale des Sch´emas et des Morphismes de Sch´emas. Volume 28. Paris: Publications Math´ematiques IHES, 1966 9. Hartshorne R.: Algebraic Geometry. New York: Springer-Verlag, 1977 10. Hitchin N.: Frobenius manifolds. In: “Gauge Theory and Symplectic Geometry (Montreal, 1995),” J. Hurtubise et al. (eds.), NATO Adv. Sci. Inst. Series C 488, Dordrecht: Kluwer Publ., 1997, pp. 69–112. 11. Jarvis T. J.: Geometry of the moduli of higher spin curves. Internat. J. of Math. 11, 637–663 (2000) 12. Jarvis T. J.: Torsion-free sheaves and moduli of generalized spin curves. Compositio Math. 110, 291–333 (1998) 13. Jarvis T. J.: Picard group of the moduli of higher spin curves. New York J. Math. 7, 23–47 (2001) 14. Jarvis T. J.: Compactification of the universal Picard over the moduli of stable curves. Math. Zeitschrift 235, 123–149 (2000) 15. Jarvis T., Kimura T., Vaintrob A.: Gravitational descendants and the moduli space of higher spin curves. In: E. Previato (ed.), Advances in Algebraic Geometry Motivated by Physics (Lowell, MA, 2000), Contemporary Mathematics 276, Providence, RI: AMS, 2001, pp. 167–177 16. Jarvis T., Kimura T., Vaintrob A.: Moduli spaces of higher spin curves and integrable hierarchies. Compositio Math. 126, no. 2, 157–212 (2001) 17. Jarvis T., Kimura T., Vaintrob A.: Tensor products of Frobenius manifolds and moduli spaces of higher spin curves. In: “Confer´ence de Mosh´e Flato 1999, Vol. 2,” G. Dito, D. Sternheimer (eds.), Dordrecht: Kluwer, 2000, pp. 145–166

Spin Gromov-Witten Invariants

543

18. Kontsevich M.: Intersection theory on the moduli space of curves and the matrix Airy function. Commun. Math. Phys. 147, 1–23 (1992) 19. Kontsevich M., ManinYu. I.: Gromov-Witten classes, quantum cohomology, and enumerative geometry. Commun. Math. Phys. 164, 525–562 (1994) 20. Kontsevich M., Manin Yu. I.: Relations between the correlators of the topological sigma-model coupled to gravity. Comm. Math. Phys. 196, no. 2, 385–398 (1998) 21. Kontsevich M., Manin Yu. I.: (with Appendix by R. Kaufmann): Quantum cohomology of a product. Invent. Math. 124, 313–340 (1996) 22. Manin Yu. I.: “Frobenius manifolds, quantum cohomology, and moduli spaces.” Providence, RI: Amer. Math. Soc. 1999 23. Manin Yu. I.: Three constructions of Frobenius manifolds: a comparative study. Asian J. Math. 3, 179–220 (1999) 24. Mochizuki T.: The virtual class of the moduli stack of r-spin curves. Preprint, December 2001 25. Mumford D.: Towards an enumerative geometry of the moduli space of curves. In: “Arithmetic and Geometry,” eds. M. Artin, J. Tate, Part II, Progress in Math., Vol. 36, Birkh¨auser, 1983, pp. 271–328 26. Polishchuk A.: Witten’s top Chern class on the moduli space of higher spin curves. Frobenius manifolds, Aspects Math., Vieweg, Wiesbaden, E36, 253–264 (2004) 27. Polishchuk A., Vaintrob A.: Algebraic construction of Witten’s top Chern class. In: E. Previato (ed.), Advances in Algebraic Geometry Motivated by Physics (Lowell, MA, 2000), Contemporary Mathematics 276, Providence, RI: AMS, 2001, pp. 229–249 28. Vistoli A.: Intersection theory on algebraic stacks and on their moduli spaces. Invent. Math. 97, 613–670, (1989) 29. Witten E.: Algebraic geometry associated with matrix models of two dimensional gravity. In: Topological Methods in Modern Mathematics (Stony Brook, NY, 1991), Houston: Publish or Perish, 1993, pp. 235–269 Communicated by A. Connes

Commun. Math. Phys. 259, 545–559 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1370-1

Communications in

Mathematical Physics

Rigidity of Asymptotically Hyperbolic Manifolds Yuguang Shi1, , Gang Tian1,2,, 1 2

Key Laboratory of Pure and Applied Mathematics, School of Mathematics Science, Peking University, Beijing, 100871, P.R. China. E-mail: [email protected] Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. E-mail: [email protected]

Received: 19 September 2004 / Accepted: 9 February 2005 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: In this paper, we prove a rigidity theorem of asymptotically hyperbolic manifolds only under the assumptions on curvature. Its proof is based on analyzing asymptotic structures of such manifolds at infinity and a volume comparison theorem. 1. Introduction In this paper, we study the rigidity problem for asymptotically hyperbolic manifolds. Much progress has been made on this problem. In [8], using the Dirac operator, Min-Oo proved that a spin manifold of dimension n must be a hyperbolic space if it is asymptotic to hyperbolic space in a strong sense and its scalar curvature is not less than −n(n − 1). His argument was refined and new exciting results were obtained by Andersson and Dahl [2] and X.D. Wang [11]. For even dimensional manifolds, Leung proved in [7] that any conformally compact Einstein manifold (Bn , g) which is asymptotically hyperbolic of order greater than 2 must be hyperbolic. By exploring properties of positive eigenfunctions, J.Qing proved that a conformally compact Einstein manifold with round sphere as its conformal infinity has to be a hyperbolic space when the dimension is not greater than 7 (cf. [9]). He did not need to assume that the manifold considered is spin. However, his approach relies on the positive mass theorem for asymptotically flat manifolds. In all above results, one needs to assume that there are nice coordinates at infinity and in such coordinates, the metrics tensor behaves well. In view of geometry, it would be natural to ask whether such an assumption can be replaced by an intrinsic geometric condition. In this paper, we will show a rigidity theorem of this type only under the assumption on curvature. Let (Xn+1 , g) be a complete noncompact Riemannian manifold; we call it an asymptotically locally hyperbolic manifold, which we abbreviate as ALH in the following, of

The first author’s research is partially supported by NSF grant of China. The second author’s research is partially supported by an NSF grant and a Simon fund. Current address: Department of Mathematics, Princeton University, Princeton, NJ 08544, USA

546

Y. Shi, G. Tian

order α if |K(x) + 1| = O(e−αρ(x) ), where K(x) is the sectional curvature of g at point x in any direction and ρ(x) = distg (x, o). Recall that a Riemannian manifold X has a pole o if the exponential map expo : To X → X is a diffeomorphism. Without loss of generality, in our case, we may assume that the sectional curvature is negative outside a unit ball of (X, g). We have: Theorem 1.1. Suppose that (Xn+1 , g) n ≥ 2 and n = 3 is an ALH manifold of order α with a pole and there is a ρ > 1 such that the geodesic sphere with radius ρ and center at the pole is convex. If we further have α > 2 and Ric(g) ≥ −ng, then (X n+1 , g) is isometric to Hn+1 . As a corollary, we have: Corollary 1.2. Suppose that (X n+1 , g) n ≥ 2 and n = 3 is a simply connected ALH manifold of order α (α > 2), K ≤ 0 and Ric(g) ≥ −ng, then (Xn+1 , g) is isometric to Hn+1 . Let Rm0 denote the traceless part of the curvature tensor1 ,Rm0 denote the norm of the tensor for (X, g), then for n = 3, we have: Theorem 1.3. Suppose that (X 4 , g) is an ALH manifold of order α > 2 with a pole and there is a ρ > 1 such that the geodesic sphere with radius ρ and center at the pole is convex. If we further have Rm0 ∈ L1 (X) and Ric(g) ≥ −3g, then (X 4 , g) is isometric to H4 . We will use the volume comparison theorem to prove the above theorem. In order to use the volume comparison, we need to estimate the volume growth of geodesic spheres at infinity. We will carry this out in several steps. First, we show that by changing the metric conformally, we can compactify (X, g) in an appropriate way. Next, we will show that the boundary of the compactified Riemannian manifold is isometric to the standard sphere; in this step, we first verify that the boundary is conformal to the standard sphere. It follows from the assumption on curvature that the boundary is diffeomorphic to the standard sphere, hence, it suffices to show that the boundary is locally conformally flat. By a direct computation, we can show that the Weyl tensor of the boundary vanishes; if the induced metric on the boundary is sufficiently smooth we know that it is locally conformally flat. However, since the metric on the compactified boundary is not necessarily smooth enough, we have to check what the locally conformal flatness of the boundary means in our current case. We will prove a generalization of Weyl’s theorem: A W 2,p metric on a manifold of dimension n is conformally flat if n > 3 and its Weyl tensor vanishes, or n = 3 and the Schouton tensor is closed (cf. Theorem 2.6). Under the assumption on Ricci curvatures in the above theorem, we observe that the scalar curvature and volume of the boundary of compactified manifolds is less than or equal to those of the standard sphere. It follows that the scalar curvature of the boundary is actually equal to that of the standard sphere, hence, if n = 2, we see that the boundary is isometric to the standard sphere; if n ≥ 3, then by Obata’s theorem, we know that the boundary is also isometric to the standard sphere. Finally, we can show that the volume of geodesic spheres of (X, g) is equal to that of the corresponding geodesic spheres in Hn+1 with the same radius; then, by the volume comparison theorem, we prove the main theorem. 1 The metric g is of constant sectional curvature iff Rm0 vanishes. This property determines Rm0 uniquely.

Rigidity of Asymptotically Hyperbolic Manifolds

547

This assumption α > 2 should be optimal, since there are many asymptotically hyperbolic Einstein metrics on B4 with α = 2; we refer the readers to Theorem C and Appendix in [1] for details. In the case of n = 3, in order to show locally conformal flatness of the boundary, one has to check that a certain linear combination of covariant derivatives of Schouten tensor vanishes; for the time being, we do not know how to deduce this one from the assumption α > 2. This is the reason why we need the extra assumption Rm0 g ∈ L1 (X), we doubt its necessity. We also think that the assumption on the existence of a pole is unnecessary. In order to remove the assumption on a pole, one may study asymptotics of certain eigenfunctions at infinity and use appropriate power of them to scale metrics as we do in the next section. We will discuss this in a future paper. Also one can generalize arguments to study rigidity of asymptotic symmetric spaces. One particularly interesting case is for asymptotic complex hyperbolic K¨ahler manifolds. We expect that a similar result can be proved for K¨ahler manifolds by assuming that bisectional curvature tends to −1 at a sufficiently fast rate. The organization of this paper is as follows: In Sect. 2, we discuss the compactification and conformal structure of (X, g) at infinity; in Sect. 3, we show that the boundary of the compactified manifold is isometric to the standard sphere and then use it to deduce the main theorem. 2. Compactfication and Conformal Structure at Infinity In this section, we give a compactfication of (X, g) at infinity and study the induced conformal structure at infinity. This compactification is crucial in the proof of our main theorem. Let ρ be the geodesic sphere in (X, g) with radius ρ and a fixed center o. Define g¯ to be sinh−2 ρg, then we have: Theorem 2.1. There is a subsequence of (ρ , g¯ ρ ) which converges to a W 2,p ∩ C 1,α Riemannian manifold (∞ , g¯ ∞ ) in the weakly W 2,p -topology, where p ∈ (1, ∞) and α ∈ (0, 1) are arbitrary. Here by a W 2,p ∩ C 1,α structure on (∞ , g¯ ρ ), we mean that there is a covering {Ui } of ∞ by coordinates φi : Ui → Rn such that the transition functions φi · φj−1 and the metric tensors φi−1∗ g are in W 2,p ∩ C 1,α . Furthermore, (∞ , g¯ ∞ ) is conformally equivalent to the standard sphere. Here g¯ ρ denotes the restriction of g¯ to ρ . By the compactness theorem proved in [5], in order to have the convergence property of (ρ , g¯ ρ ), we only need to show the following Lemma 2.2. There exists a constant C such that |Rm(g¯ ρ )| ≤ C, V ol(ρ , g¯ ρ ) ≥ C −1 , diam(ρ , g¯ ρ ) ≤ C, where Rm(g) ¯ denotes the curvature tensor of g. ¯ Actually, arguments in this section are valid on more general ALH manifolds (X, g), namely, there is a compact convex hypersurface which is diffeomorphic to Sn in (X, g), and sectional curvature of (X, g) is nonpositive outside this surface. Thus, Theorem 2.1 and Lemma 2.2 is also true for those manifolds. Let us first recall some basic formulae. For the time being, we assume g¯ = u2 g. Let {ωa }1≤a≤n+1 be a local orthonormal coframe of g such that ωn+1 = dρ and {ωi }1≤i≤n is tangent to ρ . For convenience, we also denote g = dρ 2 + gij (ρ, θ )dθi dθj . Then we have structure equations

548

Y. Shi, G. Tian

dωa = n+1 ωab ∧ ωb , ωab + ωba = 0, b=1 n+1 dωab = c=1 ωac ∧ ωcb − 21 n+1 c,d=1 Rabcd ωc ∧ ωd ,

(2.1)

where Rabcd denote components of the curvature tensor. The second fundamental form, denoted by A = (hij )1≤i,j ≤n , of ρ with respect to g is given by ωn+1i | =

n

hij ωj ,

hij = hj i ,

j =1

where (·)| denotes the restriction of an 1-form to ρ . The corresponding mean curvature is given by H = ni=1 hii . Let ηa = uωa (1 ≤ a ≤ n + 1), then {ηa }1≤a≤n+1 is an orthonormal coframe for the metric g, ¯ and dηa = n+1 ηab ∧ ηb , ηab + ηba = 0, b=1 (2.2) n+1 ¯ dηab = c=1 ηac ∧ ηcb − 21 n+1 c,d=1 Rabcd ηc ∧ ηd , where R¯ abcd are components of the curvature tensor of (X, g) ¯ in the coframe {ηa }1≤a≤n+1 . By a direct computation, we see that ηab = ωab − (log u)b ωa + (log u)a ωb . Here for any smooth function f on X, fa is defined by df = n+1 a=1 fa ωa . Thus, we get ηn+1,i |ρ = (hij +

∂ (log u)δij )u−1 ηj . ∂ρ

It follows that the second fundamental form of ρ with respect to g¯ and {ηi }1≤i≤n is given by ∂ h¯ ij = (hij + (log u)δij )u−1 . ∂ρ

(2.3)

On the other hand, we can deduce the Riccati equations from the structure equations for curvatures ∂hij hik hkj = −Rn+1in+1j . + ∂ρ n

(2.4)

k=1

In order to estimate hij , we need the following Lemma 2.3. Suppose that f is a smooth function and for any ρ > 0, we have |f (ρ) − 1| ≤ Ke−αρ for some α > 2 and 41 ≤ f (ρ). If y is a solution of the equation y + y 2 = f (ρ) and y(0) > 0. Then there is a constant C > 0, which depends only on K and y(0), such that |y − 1| ≤ Ce−2ρ . Proof. We will prove this lemma in the following steps.

Rigidity of Asymptotically Hyperbolic Manifolds

549

Claim 1. 0 < y(ρ) ≤ ρ + C1 for any ρ > 0. Here and in the sequel, Ci always denotes a constant which depends only on y(0) and K. Clearly, y(ρ) ≤ ρ + C1 . To see that y(ρ) > 0, we first observe that f ≥ 41 , hence, by using the equation, y (ρ) > 0 whenever y(ρ) < 21 . It follows that y increases in the region where y < 21 . Then the claim follows from y(0) > 0. Claim 2. |y − 1| ≤ C2 e−ρ . Set v = y − 1. We have ρ + C1 − 1 ≥ v ≥ −1 and −1 ≤ v(0) ≤ y(0). Choose β = 1 + α2 < α. Then |v| ≤ C3 e(α−β)ρ , consequently, using the equation for y, we can deduce (v 2 ) + 2v 2 ≤ (v 2 ) + (4 + 2v)v 2 ≤ C4 e−βρ ,

2 < β < α.

It follows (v 2 e2ρ ) ≤ C4 e(2−β)ρ . Integrating this inequality, we get v 2 ≤ (v 2 (0) +

2C4 −2ρ ≤ C5 e−2ρ . )e α−2

Claim 2 follows. Now we can finish the proof of this lemma. By Claim 2, we have |v| ≤ C5 e−ρ . Using this and the equation for y, we have (v 2 ) + (4 − 2|v|)v 2 ≤ 2Ke−αρ |v|. Suppose that we have proved v 2 ≤ e−2βk ρ for some βk ≥ 1. From Claim 2 we know that this inequality is true for β0 = 1, then it follows from the above (v 2 e4ρ ) ≤ C6 (e(4−α−βk )ρ + e(4−3βk )ρ ). Integrating this, we get v 2 ≤ C7 (e−4ρ + e− min{3βk ,α+βk }ρ ). If min{3βk , α + βk } ≥ 4, we are done, otherwise, then we take βk+1 = 21 min{3βk , α + βk } ≥ βk + α2 − 1 and repeat the above process. Then the lemma follows after finitely many iterations. Lemma 2.4. Let A = ij hij ωi ⊗ ωj be the second fundamental form of ρ in (X, g) and write hij = δij + Tij e−2ρ , then T g ≤ C < +∞, where T = ij Tij ωi ⊗ ωj . Remark 2.5. If hˆ ij denotes components of the second fundamental form of ρ in (X, g) in the coordinate frame { ∂θ∂ i }, then we have

Write ωi =

j

hˆ ij = gij + pij e−2ρ . bij dθ j ; we have (gij ) = (bij )T · (bij ) and (pij ) = (bij )T · (Tij ) · (bij ).

550

Y. Shi, G. Tian

Proof. Let λmax and λmin be the largest and smallest eigenvalue of matrix (hij ), then they are Lipschitz, and we claim that d λmax + λ2max = 1 + O(e−αρ ), dρ

(2.5)

d λmin + λ2min = 1 + O(e−αρ ). dρ

(2.6)

In fact, for any ρ = ρ0 , let V be the unit eigenvector of λmax , then V T (hij )V |ρ=ρ0 = λmax (ρ0 ), and V T (hij )V ≤ λmax (ρ) for any ρ, thus, d T d λmax |ρ=ρ0 = V (hij )V |ρ=ρ0 , dρ dρ hence, d λmax |ρ=ρ0 + λ2max |ρ=ρ0 = 1 + O(e−αρ ), dρ which implies (2.5) is true; by the same reason, (2.6) is true too. On the other hand, when ρ is sufficiently large the eigenvalue of matrix (Rn+1in+1j ) is less than − 41 , and note that there is a convex geodesic sphere with sufficiently large radius. Hence, we may assume the initial data of Eq. (2.5), (2.6) is positive, then by Lemma 2.3, we see Lemma 2.4 is true. Due to Lemma 2.4, we have supρ |Tij | ≤ C < +∞ for any ρ ≥ 1. Proof of Lemma 2.2. By a direct computation, we have R¯ abcd = u−2 Rabcd − u−2 ((log u)bm − (log u)m (log u)b )(δac δdm − δad δmc ) −u−2 ((log u)am − (log u)m (log u)a )(δmc δdb − δmd δbc ) −u−2 |∇ log u|2 (δac δbd − δad δbc ),

(2.7)

¯ in the coframe where R¯ abcd denote the components of the curvature tensor of (X, g) {ηa }. By our assumption on asymptotic hyperbolicity, we may write Rabcd = (δbc δad − δac δbd ) + E¯ abcd , where |Eabcd | = O(e−αρ ). Now let u = sinh−1 ρ. Noticing that for any 1 ≤ a, b ≤ n, (log u)ab = (log u)ρ (δab + Tab e−2ρ ), we can deduce from the above and Lemma 2.42 1 R¯ abcd = (Tbd δac − Tbc δad − Tad δbc + Tac δbd ) + E¯ abcd , 4 2

Without loss of generality, we may assume that α ≤ 4.

1 ≤ a, b, c, d ≤ n. (2.8)

Rigidity of Asymptotically Hyperbolic Manifolds

551

Here, |E¯ abcd | = O(e(2−α)ρ ) as ρ tends to infinity, R¯ n+1bcd = O(e(2−α)ρ ), 1 R¯ n+1bn+1d = Tbd − δbd + O(e(2−α)ρ ). 2 Let h¯ ij be the components of the second fundamental form of ρ ⊂ X with respect to the metric g. ¯ It follows from (2.3) that: h¯ ij = O(e−ρ ).

(2.9)

Now let us estimate the volume and diameter of (ρ , g). ¯ We can write g in the form dρ 2 + gij (ρ, θ )dθ i dθ j , then we have ∂ gij = 2hˆ ij . ∂ρ Using the facts that hˆ ij = gij +pij e−2ρ and −c(gij ) ≤ (hˆ ij ) ≤ c(gij ) for some constant c, we can show that there exists a constant independent of ρ such that

−1 e2ρ (δij ) ≤ (gij ) ≤ e2ρ (δij ).

(2.10)

It follows that diam(ρ , g) ¯ ≤ C2 and V ol(ρ , g) ¯ ≥ δ0 > 0. The proof of Lemma 2.2 is complete. By using (2.8), (2.9) and the Gauss equations, we see that the sectional curvature of (ρ , g) ¯ is uniformly bounded. Then it follows from Lemma 2.2 and [5] that there exists a sequence of (ρi , g¯ ρi ), which will be denoted by (i , g¯ i ), converges to (∞ , g¯ ∞ ) in q ∈ ∞ , there is a the sense of weak topology of W 2,p for any p < ∞, and for any coordinate chart (Bq , θ i ) in which the components of g¯ ∞ are C 1,α W 2,p , ∀p < +∞ and the curvature of (∞ , g¯ ∞ ) is bounded. Let Rˆ ij kl be the components of the curvature tensor of (ρ , g¯ ρ ) under the orthonormal coframe ηi (1 ≤ i ≤ n), then the Weyl tensor is: 1 (Rˆ ik δj l − Rˆ j k δil + Rˆ j l δik − Rˆ il δj k ) n−2 Rˆ + (δik δj l − δj k δil ). (n − 1)(n − 2)

Wˆ ij kl = Rˆ ij kl −

Combined with (2.8) and (2.9), we see that Wˆ = o(1) as ρ tends to ∞, and the Ricci tensor is of the form 1 Rˆ ij = [(n − 2)Tij + trg¯ T δij ] + Eij , 4 where Eij = g¯ kl Eikj l and |Eij | = o(1) as ρ tends to infinity. Recall that the Schouten tensor of g is ˆ 1 R Sˆij = 2Rˆ ij − δij . n−2 n−1

(2.11)

552

Y. Shi, G. Tian

Therefore, the Weyl tensor of (∞ , g¯ ∞ ) vanishes in the Lp -sense. Together with Gauss equations and Codazzi equations and (2.11), we deduce 1 2 ∇¯ k Sˆij − ∇¯ j Sˆik = e2ρ sinh ρRn+1kij + (∇¯ k Eij − ∇¯ j Eik ) 2 n−2 1 − (∇¯ k Eδij − ∇¯ j Eδik ). (n − 1)(n − 2)

(2.12)

If n = 3, by the assumption that Rm0 g ∈ L1 (X), we see that there are ρi which tend to infinity such that ||Rm0 ||g sinh3 ρi → 0; ρi

in particular, we have |R4ij k |e3ρi → 0. ρi

It follows that for any φ ∈ C ∞ (ρi ), φR4ij k e2ρi sinh ρi → 0.

(2.13)

ρi

Without loss of generality, we may assume (ρi , g¯ ρi ) converges to (∞ , g¯ ∞ ), for simplicity, in the sequel, (∞ , g¯ ∞ ) will be denoted by (, g) ¯ and the components of its curvature tensor will be simply denoted by R¯ ij kl . Then if n = 3, we see that the Schouton tensor of (, g) ¯ satisfies the following equations in the sense of distribution: ∇¯ k S¯ij − ∇¯ j S¯ik = 0,

(2.14)

that is, for any φ ∈ C ∞ (), we have S¯ij ∇¯ k φ − S¯ik ∇¯ j φ = 0,

where ∇¯ k are covariant derivatives of (, g) ¯ with respect to an orthonormal basis {ei }1≤i≤n . Now, we are in the position to show the following: Theorem 2.6. Suppose that (, g) is an n-dimensional Riemannian manifold and the metric g is in W 2,p for any 1 < p < ∞. If its curvature tensor is bounded and the Weyl tensor W = 0 if n > 3, and (2.14) is true if n = 3, then g is locally conformally flat, i.e., for any point q ∈ , there is a neighborhood U , such that in U , we have a positive W 2,p function f with g = f geuc , where geuc denotes a flat metric on U . Clearly, it is a local result, hence, we need to consider only the problem in a local coordinate chart, i.e., we assume that is a ball B n ⊂ Rn and g = gij dxi dxj , where x1 , . . . , xn are euclidean coordinates of Rn . By our assumption, gij are W 2,p functions on B n for any 1 < p < ∞. It follows that the the curvature tensor Rij kl and the

Rigidity of Asymptotically Hyperbolic Manifolds

553

Christoffel symbol ji k are in Lp and W 1,p respectively. Hence, we can define covariant derivatives of Rij kl,h in the sense of distribution, that is, for any φ ∈ C0∞ (B n ), we have: √ ∂ Rij kl,h φ det(g)dx = − Rij kl h (φ det(g))dx + Rm ∗ φ det(g)dx, ∂x Bn Bn Bn where det(g) = det (gij ) and Rm ∗ refers to a bilinear form of Rij kl and ji k . Since √ Rij kl are in Lp and ∂x∂ h (φ det(g)) is in C α for some α > 0, the right hand side of the above equation is well defined. Similarly, we can define Rij kl,hm in the sense of distribution, that is, for any φ ∈ C0∞ (B n ), we have: Rij kl,hm φ det(g)dx Bn ∂2 ∂ = Rij kl h m (φ det(g))dx − Rm ∗ m (φ det(g))dx ∂x ∂x ∂x Bn Bn ∂ Rm ∗ m (φ det(g))dx + Rm ∗ ∗ φ det(g)dx. − ∂x Bn Bn Now we have: Lemma 2.7. Suppose that g ∈ W 2,p for some p > 1, then in the distributional sense, we have the second Bianchi identity Rij kl,h + Rij lh,k + Rij hk,l = 0

(2.15)

Rik,lt = Rik,tl + Ric ∗ Rm.

(2.16)

and

That is, for any φ ∈ C0∞ (B n ), we have (Rij kl,h + Rij lh,k + Rij hk,l )φ det(g)dx = 0 Bn

and

Bn

(Rik,lt − Rik,tl − Ric ∗ Rm)φ det(g)dx = 0.

Here Rij is the Ricci tensor of g and Ric ∗ Rm denotes a bilinear form of Ricci tensor and curvature tensor. Proof. By the assumption, we may take a sequence of smooth metrics gi which converges to g in W 2,p . Since (2.15) and (2.16) hold for the curvature tensor of gi and curvature tensors of gi converge to that of g in Lp , we see that (2.15) and (2.16) hold for g, too. Next we construct harmonic coordinates around any point of manifold. Without loss of generality, we only need to show Lemma 2.8. Suppose that gij are in W 2,p on B n for any 1 < p < ∞, then there are harmonic coordinates (z1 , · · · , zn ) around o ∈ B n with zi in W 3,p .

554

Y. Shi, G. Tian

Proof. Let ji k denote the Christoffel symbols of g in euclidean coordinates x 1 , · · · , x n . Define yi by x i = y i − ji k (o)y j y k (1 ≤ i ≤ n), by the Inverse Theorem, we see that y i are smooth functions of (x 1 , · · · , x n ) around o and form coordinates. Let ¯ ji k be the Christoffel symbols of g in coordinates (y 1 , · · · , y n ), then by direct computations, we see ¯ ijk

∂x s ∂ 2xs ∂x l ∂x m s = + . ∂y k ∂y i ∂y j ∂y j ∂y i lm

It follows that ¯ ijk (o) = 0, consequently, y i = 0 at o. This implies that ||y i ||L∞ (B (o)) tends to 0 as goes to zero. Consider the following boundary value problem on B (o) zi = 0, i z |∂B = y i |∂B . Then, using standard estimates for elliptic equations, we get ||zi − y i ||C 1,α ≤ C||y i ||L∞ (B ) . Therefore, z1 , · · · , zn form local coordinates on B (0) when is sufficiently small. Clearly, zi are in W 3,p (B ) and harmonic with respect to g. The lemma is proved. By a direct computation and Lemma 2.8, we see that the metric tensor of g in coordinates z1 , · · · , zn is also in W 2,p . In the following, we will consider the problem in these harmonic coordinates, and the metric components will be still denoted by gij . Lemma 2.9. Let R be the scalar curvature of g and bounded, then when is sufficiently small, the following equation: n−2 u − 4(n−1) Ru= 0 u|∂B = 1|∂B has a positive solution in W 2,p (B ). Proof. We note that when is sufficiently small, the first Dirichlet eigenvalue can be arbitrarily large, and R is bounded. Hence, the corresponding homogenous equation has only trivial solution, and this implies the above equation has nonnegative solution. Then by Lemma 3.4 in [4] (p.34), we see that the solution has to be positive. This finishes the proof of the lemma. In order to show Theorem 2.6, we need the following lemma (see Theorem 17.2.7, [6], p.18 for its proof). Lemma 2.10. Let aij (x) be Lipschitz continuous in an open set ⊂ Rn , and assume that the matrix (aij ) is positive definite and u ∈ L2loc (). Then

∂ ∂u (aj k k ) = f, j ∂x ∂x

1,2 2,2 −1 () if f ∈ Hloc (), moreover, if f ∈ L2loc (), then u ∈ Wloc (). implies u ∈ Wloc 1,2 −1 Here Hloc () is the dual space of W0 ().

Rigidity of Asymptotically Hyperbolic Manifolds

555

Now we can finish the proof of Theorem 2.6. Since the scalar curvature of (, g) is bounded, by Lemma 2.9, we may choose a sufficiently small neighborhood of q such that there is a positive W 2,p function u on this neighborhood such that the scalar curvature 4 of g¯ = u n−2 g vanishes. It is easy to show that g¯ is also in W 2,p for any 1 < p < ∞, moreover, its Weyl tensor also vanishes if n ≥ 4 and (2.14) still holds if n = 3. By Lemma 2.8, we can choose harmonic coordinates of the metric g¯ with metric tensor g¯ ij in W 2,p . It suffices to show that the corresponding Ricci tensor is smooth in these harmonic coordinates. In the sequel, we will do everything in these coordinates. Since the Weyl tensor and the scalar curvature vanish, we have 1 (2.17) (R¯ ik g¯ j l − R¯ j k g¯ il + R¯ j l g¯ ik − R¯ il g¯ j k ). n−2 On the other hand, by Lemma 2.7, we have the second Bianchi identity for g, ¯ hence, by a direct computation, we deduce R¯ ij kl =

g¯ j h R¯ j k,h = 0.

(2.18)

If n ≥ 4, using (2.17), (2.18) and the Bianchi identity, we can also derive R¯ il,k − R¯ ik,l = 0.

(2.19)

When n = 3, since the scalar curvature vanishes, the above equation is nothing but (2.14). It follows g¯ kt R¯ il,kt − g¯ kt R¯ ik,lt = 0, and because of (2.16) in Lemma 2.7, we have ¯ ∗ Rm. ¯ g¯ kt R¯ ik,lt = g¯ kt R¯ ik,tl + g¯ ∗ Ric Note that g¯ kt R¯ ik,tl = (g¯ kt R¯ ik,t ),l = 0, so we have ¯ ∗ Rm. ¯ g¯ kt R¯ il,kt = g¯ ∗ Ric Since g¯ is in W 2,p , the above equation can be written as ∂ R¯ il ∂ ¯ + g¯ ∗ Ric ¯ ∗ Rm. ¯ (g¯ kt k ) = ∂ g¯ ∗ ∂ Ric (2.20) t ∂x ∂x ¯ ∈ Lp for any p > 1, we see that the right hand side of Noticing that g¯ ∈ W 2,p and Rm −1 (2.20) is in H , which is dual to W 1,2 . Then it follows from Lemma 2.10 that R¯ ij are 0

1,2 ; in turn, this implies that the right side of (2.20) is in L2loc . Then again actually in Wloc 2,2 by Lemma 2.10, we see that R¯ ij are in Wloc ; then it follows from the standard theory for 2,α elliptic equations that R¯ ij are actually Cloc . Therefore, g¯ is smooth, and consequently, by the classical Weyl Theorem, it is locally conformal flat. Theorem 2.6 is proved. Now, we can prove Theorem 2.1.

Proof of Theorem 2.1. It only remains to show that (, g) ¯ is conformally equivalent to the standard sphere. By the assumption of Theorem 2.1, we see that is diffeomorphic ¯ is a locally conformally to Sn . On the other hand, by Theorem 2.6, we know that (, g) flat manifold, so is conformally equivalent to Sn . Theorem 2.1 is proved.

556

Y. Shi, G. Tian

3. Proof of Main Theorems To prove Theorem 1.1 and Theorem 1.3, we need to compare both the volume and the scalar curvature of (, g) ¯ which is the boundary of the Riemannian manifold (X, g) ¯ with those corresponding quantities of the standard sphere. Using this, we are able to show that (, g) ¯ is actually isometric to the standard sphere. Then by the Volume Comparison theorem, we can conclude that the original manifold (X, g) is isometric to Hn+1 . Lemma 3.1. Let ωn denote the volume of Sn and R¯ be the scalar curvature of (, g), ¯ then we have Vol(, g) ¯ ≤ ωn and R¯ ≤ n(n − 1). Proof. Recall that Rˆ is the scalar curvature of (ρ , g¯ ρ ), then by the computations in the last section, we have n n − 1 ij g pij + o(1), as ρ → ∞ Rˆ = 2 i,j =1

and n

H = n + e−2ρ

g ij pij ,

i,j =1

where H denotes the mean curvature of ρ in (X, g). On the other hand, because of Ric(g) ≥ −ng, we can use the Laplacian Comparison Theorem to get H |ρ = g ρ|ρ ≤ n coth ρ.

(3.1)

It follows n

2ne2ρ , e2ρ − 1

g ij pij ≤

i,j =1

and consequently, Rˆ ≤ n(n − 1) + o(1); letting ρ go to ∞, we get R¯ ≤ n(n − 1). To show Vol(, g) ¯ ≤ ωn , we only need to prove for any ρ > 0, Vol(ρ , g) ≤ (sinh ρ)n ωn . For any δ > 0, integrating (3.1), we obtain g ρdVg ≤ Bτ +δ \Bτ

which is equivalent to Vol(τ +δ − Vol(τ ) n ≤ δ δ Let δ → 0, we have

Bτ +δ \Bτ

(3.2)

n coth ρdVg ,

τ +δ

coth ρVol(ρ )dρ. τ

(log (sinh τ )−n Vol(τ ) ) ≤ 0.

Hence (sinh τ )−n Vol(τ ) is non-increasing with τ . Since limτ →0 (sinh τ )−n Vol(ρ ) = ωn , we see from the above that (3.2) is true. This implies that Vol(, g) ¯ ≤ ωn . Thus Lemma 3.1 is proved.

Rigidity of Asymptotically Hyperbolic Manifolds

557

Our next goal is to establish Lemma 3.2. The limit space (, g) ¯ is isometric to the standard sphere (Sn , g0 ). Proof. If n = 2, we only need to show R¯ = 2. Suppose not, we have R¯ < 2 and Vol(, g¯ ) ≤ 4π. This is in contradiction with Gauss-Bonnet formula. If n ≥ 3, it suffices to prove that R¯ = n(n − 1). In fact, we can write 4

g¯ = u n−2 g0

(3.3)

for some u > 0 which belongs to W 2,p for any p < ∞. If R¯ = n(n − 1), u satisfies a semi-linear elliptic equation and the standard regularity theory implies that u is smooth. Then Lemma 3.2 follows from the Obata theorem. Let d V¯ and dV0 be the volume elements of (, g) ¯ and (Sn , g0 ), respectively, then 2n by (3.3), d V¯ = u n−2 dV0 . The following equation is well-known n+2 4(n − 1) R¯ = u 2−n (n(n − 1)u − Sn u). n−2

It follows

Sn

2n

¯ n−2 dV0 = Ru

Sn

((n − 1)nu2 +

4(n − 1) |∇Sn u|2 )dV0 , n−2

since R¯ ≤ n(n − 1), we get 4(n−1) 2 2 2n 2 n ((n − 1)nu + n−2 |∇Sn u| )dV0 n(n − 1)( u n−2 dV0 ) n ≥ S . 2n n−2 Sn ( Sn u n−2 dV0 ) n 2n

Using the fact that d V¯ = u n−2 dV0 and V ol(, g) ¯ ≤ ωn , we see that: 4(n−1) 2 2 2 n ((n − 1)nu + n−2 |∇Sn u| )dV0 n(n − 1)ωn n ≥ S . 2n n−2 ( Sn u n−2 dV0 ) n 4

2

2−n

(3.4)

2n

2 , where ψ(x) = ( 1+|x| ) 2 , then dV = ψ n−2 dx, where dx Write g0 = ψ n−2 dsR n 0 2 is the volume element of Rn , we have: 4(n − 1) Rn |∇Rn (uψ)|2 dx . (3.5) The RHS of (3.4) = 2n n−2 n − 2 ( n (uψ) n−2 dx) n

R

On the other hand, by a direct computation, we have: 2 4(n − 1) Rn |∇Rn ψ|2 dx . n(n − 1)ωn n = 2n n−2 n − 2 ( n ψ n−2 dx) n

(3.6)

R

Putting (3.4), (3.5) and (3.6) together, we obtain 2 2 Rn |∇Rn ψ| dx Rn |∇Rn uψ| dx ≥ . 2n 2n n−2 n−2 ( Rn ψ n−2 dx) n ( Rn (uψ) n−2 dx) n 2

2−n

(3.7)

2 ; we know that the LHS of (3.7) is the best Sobolev constant Note that ψ = ( 1+|x| 2 ) for Rn , hence, the equality in (3.7) holds, so R¯ = n(n − 1). Thus we see that (, g) ¯ is nothing but (Sn , g0 ). Lemma 3.1 is proved.

558

Y. Shi, G. Tian

Proof of Theorem 1.1. By Lemma 3.2 and the fact that (ρ , g¯ ρ ) subconverges to (, g) ¯ in the Cheeger-Gromov topology, we get lim (sinh ρ)−n Vol(ρ , g) = ωn ,

ρ→∞

while in the proof of Lemma 3.1, we have shown that (sinh ρ)−n Vol(ρ , g) is nonincreasing, and hence, we get for all ρ > 0, ωn ≤ (sinh ρ)−n Vol(ρ , g) ≤ ωn . Thus, we obtain that for any ρ > 0, (sinh ρ)−n Vol(ρ , g) = ωn . Now we claim that g ρ = H |ρ = n coth ρ, ∀ρ > 0. If it is false, there is a point p ∈ ρ such that g ρ|p < n coth ρ|p , so g τ dVg < n coth τ dVg , Bρ+δ (o)\Bρ (o)

Bρ+δ (o)\Bρ (o)

or equivalently

ρ+δ

Vol(ρ+δ ) − Vol(ρ ) < n

coth τ Area(τ )dτ. ρ

This contradicts that Vol(τ ) = (sinh τ )n ωn . Hence for any ρ > 0, H |ρ = n coth ρ and consequently ∂H H2 + = n. ∂ρ n However, from (2.4) and assumption Ric(g) ≥ −ng, we see that ∂H H2 + ≤ n; ∂ρ n moreover, the equality holds if and only if hij = coth ρgij . On the other hand, a direct computation shows that ∂gij = 2hij = 2 coth ρgij , ∂ρ and lim ρ −2 gij = (g0 )ij .

ρ→0

Hence, gij = (sinh ρ)2 (g0 )ij , where g0 is the standard metric on Sn . Therefore, we see g = dρ 2 + (sinh ρ)2 (g0 )ij dθ i dθ j , that is, (X, g) is isometric to Hn+1 . Theorem 1.1 is proved. Acknowledgements. Part of this work was done during the first author’s visit at the Department of Mathematics of MIT. He would like to thank colleagues there for providing an excellent research environment. Especially, he wants to thank Dr. X.D. Wang for stimulating discussions during this visit.

Rigidity of Asymptotically Hyperbolic Manifolds

559

References 1. Anderson, M.T.: Einstein metrics with prescribed conformal infinity on 4-manifolds. http://arxiv.org/list/math.DG/0105243, v1, 29 May 2001 2. Andersson, L., Dahl, M.: Scalar curvature rigidity for asymptotically locally hyperbolic manifolds. Ann. Global Anal.Geom. 16, 1–27 (1998) 3. Aubin, T.: Non-linear analysis on Manifolds, Monge-Amp´ere equations. NewYork: Springer-Verlag, 1982 4. Gilbarg, D., Trudinger, N.S.: Elliptic Partial Differential Equations of Second Order. BerlinHeidelberg-New York: Springer, 1998 5. Green, R.E., Wu, H.: Lipshitz Convergence of Riemannian Manifolds. Pacific J. Math. 131(1), 119–141 (1988) 6. H¨ormander, L.: The Analysis of Linear Partial Differential Operators III. NewYork: Springer-Verlag, 1984 7. Leung, M.C.: Pinching theorem on asymptotically hyperbolic spaces. Internat.J. Math. 4(5), 841–857 (1993) 8. Min-Oo, M.: Scalar curvature rigidity of asymptotically hyperbolic spin manifolds. Math.Ann. 285, 527–539 (1989) 9. Qing, J.: On the Rigidity for Conformally Compact Einstein Manifolds. http://arxiv.org/list/ math/0305084 10. Schoen, R., Yau, S.-T.: Lectures on Differential Geometry. Cambridge, MA: International Press, 1984 11. Wang, X.: The Mass of Asymptotically Hyperbolic Manifolds. J. Diff. Geom. 57, 273–299 (2001) Communicated by P. Sarnak

Commun. Math. Phys. 259, 561–576 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1374-x

Communications in

Mathematical Physics

Heteroclinic Connections Between Periodic Orbits in Planar Restricted Circular Three Body Problem. Part II Daniel Wilczak1, , Piotr Zgliczynski ´ 2 1

WSB – NLU, Faculty of Computer Science, Department of Computational Mathematics, Zielona 27, 33-300 Nowy S¸acz, Poland. E-mail: [email protected] 2 Jagiellonian University, Institute of Computer Science, Nawojki 11, 30-072 Krakow, Poland. E-mail: [email protected] Received: 1 October 2004 / Accepted: 5 January 2005 Published online: 21 June 2005 – © Springer-Verlag 2005

Abstract: We present a method for proving the existence of symmetric periodic, heteroclinic or homoclinic orbits in dynamical systems with the reversing symmetry. As an application we show that the Planar Restricted Circular Three Body Problem (PCR3BP) corresponding to the Sun-Jupiter-Oterma system possesses an infinite number of symmetric periodic orbits and homoclinic orbits to the Lyapunov orbits. Moreover, we show the existence of symbolic dynamics on six symbols for PCR3BP and the possibility of resonance transitions of the comet. This extends earlier results by Wilczak and Zgliczynski [12]. Electronic Supplementary Material: Supplementary material is available in the online version of this article at http://dx.doi.org/10.1007/s00220-005-1374-x.

1. Introduction The Planar Restricted Circular Three Body Problem (PCR3BP) has attracted much attention of scientists. In particular, some transport properties of this system may be applied in space mission design (see [4] and references given there). The problem has been studied by Koon, Lo, Marsden and Roos in [4], where the numerical evidence of the resonance transitions for the PCR3BP for the parameter values corresponding to the Sun-Jupiter-Oterma system is presented. The rigorous proof of some facts discovered in [4] was given by Wilczak and Zgliczynski in [12] (see also the Stoffer and Kirchgraber paper [8]). In the present paper, as in [12], we restrict our attention to the following parameter values for PCR3BP C = 3.03, µ = 0.0009537 - the parameter values for the Oterma comet in the Sun-Jupiter system (see [4]). For these parameter values there are two hyperbolic periodic orbits L∗1 and L∗2 , the Lyapunov orbits, around the libration points L1

Research supported by Polish State Committee for Scientific Research grant 2 P03A 041 24

562

D. Wilczak, P. Zgliczy´nski

and L2 , respectively. In [12] and [8] it was proven that there exist homoclinic solutions to both L∗1 and L∗2 periodic orbits and a pair of heteroclinic connections between them in both directions. In this paper we present the proof of the following facts: • The PCR3BP possesses two pairs of homoclinic orbits both to L∗1 and L∗2 . These homoclinic orbits are geometrically different. Informally speaking they are close to different resonances, namely 3:2, 5:3 for the orbits homoclinic to L∗1 and 1:2, 2:3 for the orbits homoclinic to L∗2 . Moreover, it is possible for a comet to move between these four resonances in arbitrary order. • The PCR3BP possesses an infinite number of geometrically different symmetric periodic and homoclinic orbits. Let us describe now what constitutes a new element in the present paper. While the numerical evidence of three of the above mentioned homoclinic orbits is given in [4], the 2:3 homoclinic orbit appears to be a new one. The technique of the proof of the existence of these homoclinic orbits and the symbolic dynamics is the same as in [12], i.e. combines topological tools (covering relations) with rigorous numerics. The main novelty of the present paper, when compared to [4, 12], is the existence of an infinite number of geometrically different symmetric periodic and homoclinic orbits. While the numerical evidence of the simplest symmetric homoclinic orbits is given in [4] the numerical method used there cannot yield the existence of an infinite number of them even with the help of validated numerics. In this paper we use the topological method introduced recently by Wilczak in [10] and developed later by both authors in [13]. For the purpose of this introduction we briefly describe the main points of method for symmetric periodic points. The method is based on two observations: • to detect symmetric periodic orbits for map P (a Poincar´e map) with a reversing symmetry R (in the PCR3BP case composition of a suitable reflection and the time inversion) it is enough to look for intersections of Fix(R) = {x | R(x) = x} with P k (Fix(R)). This is the Fixed Set Iteration method [5, 6] (also known as DeVogelaere method [3]). Any point from such intersection gives rise to a 2k-periodic point. • covering relations give some control of pieces of P k (Fix(R)), which make it possible to prove that P k (Fix(S)) ∩ Fix(R) is nonempty for k sufficiently large and that the period of the periodic point is indeed equal to 2k. The paper is organized as follows. In Sect. 2 we recall the PCR3BP and its properties. In Sect. 3 the proof of the existence of a new pair of homoclinic orbits both in exterior and interior regions to the Lyapunov orbits L∗1 , L∗2 is presented. In Sect. 4 the symbolic dynamics on six symbols is established. We also discuss the resonance transitions there. In Sect. 5 the existence of symmetric periodic and homoclinic orbits is proven. Throughout the paper we will use the definitions and notations from [12]. 2. Short Description of the System We follow papers [4, 12] and use the notation introduced there. Let S and J be two bodies called Sun and Jupiter, of masses ms = 1−µ and mj = µ, µ ∈ (0, 1), respectively. They rotate in the plane in circles counter clockwise about their common center and with angular velocity normalized as one. Choose a rotating coordinate system, so that the origin is at the center of mass and the Sun and Jupiter are fixed on the x-axis at (−µ, 0) and (1 − µ, 0) respectively. In this coordinate frame the

Heteroclinic Connections Between Periodic Orbits. Part II

563

equations of motion of a massless particle called the comet or the spacecraft under the gravitational action of Sun and Jupiter are (see [4] and references given there) x¨ − 2y˙ = x (x, y),

y¨ + 2x˙ = y (x, y),

(1)

where x2 + y2 µ µ(1 − µ) 1−µ (x, y) = + + + , 2 r1 r2 2 r2 = (x − 1 + µ)2 + y 2 . r1 = (x + µ)2 + y 2 , Equations (1) are called the equations of the planar circular restricted three-body problem (PCR3BP). They have a first integral called the Jacobi integral, which is given by C(x, y, x, ˙ y) ˙ = −(x˙ 2 + y˙ 2 ) + 2(x, y).

(2)

We consider PCR3BP on the hypersurface M(µ, C) = {(x, y, x, ˙ y) ˙ | C(x, y, x, ˙ y) ˙ = C}, and we restrict our attention to the following parameter values C = 3.03, µ = 0.0009537 - the parameter values for the Oterma comet in the Sun-Jupiter system (see [4]). The projection of M(µ, C) onto position space is called a Hill’s region and gives the region in the (x, y)-plane, where the comet is free to move. The Hill’s region for the parameter considered in this paper is shown on Fig. 1 in white, the forbidden region is dark. The Hill’s region consists of three regions: an interior (Sun) region, an exterior region and Jupiter region. As was mentioned in the Introduction we restrict our attention to the following parameter values C = 3.03, µ = 0.0009537 - the parameter values for Oterma comet in the Sun-Jupiter system (see [4]). Since we work with fixed parameter values we usually drop the dependence of various objects defined throughout the paper on µ and C, so for example M = M(µ, C). −2

− 1.5

−1

− 0.5

0

0.5

1

1.5

1

0.5

y

0 − 0.5 −1 − 1.5

x

Fig. 1. 3:2 homoclinic orbit to L∗1 Lyapunov orbit (interior region) and 1:2 homoclinic orbit to L∗2 Lyapunov orbit (exterior region)

564

D. Wilczak, P. Zgliczy´nski

2.1. Poincar´e maps. We consider Poincar´e sections: = {(x, y, x, ˙ y) ˙ ∈ M | y = 0}, + = ∩ {y˙ > 0}, − = ∩ {y˙ < 0}. On ± we can express y˙ in terms of x and x˙ as follows: y˙ = ± 2(x, 0) − x˙ 2 − C. Hence the sections ± can be parameterized by two coordinates (x, x) ˙ and we will use this identification throughout the paper. More formally, we have the transformation T± : R2 → ± given by the following formula: ˙ = (x, 0, x, ˙ ± 2(x, 0) − x˙ 2 − C ). T± (x, x) The domain of T± is given by an inequality 2(x, 0) − x˙ 2 − C ≥ 0. Let πx˙ : ± −→ R and πx : ± −→ R denote the projection onto x˙ and x coordinate, respectively. We have πx˙ (x0 , x˙0 ) = x˙0 and πx (x0 , x˙0 ) = x0 . We will say that (x, x) ˙ ∈ ± , meaning that (x, x) ˙ represents two-dimensional coordinates of a point on ± . Analogously we give a meaning to the statement M ⊂ ± for a set M ⊂ R2 . We define the following Poincar´e maps between sections: P+ : + → + , P− : − → − , P 1 ,+ : + → − , 2

P 1 ,− : − → + . 2

As a rule the sign + or − tells that the domain of the maps P± or P 1 ,± is contained in 2 ± (the same sign). Observe that P+ (x) = P 1 ,− ◦ P 1 ,+ (x), 2

2

P− (x) = P 1 ,+ ◦ P 1 ,− (x) 2

2

whenever P+ (x) and P− (x) are defined. These identities express the following simple fact: to return to + we need to cross with negative y˙ (this is P 1 ,+ first and then we 2 return to with y˙ > 0 (this is P 1 ,− ). 2 Sometimes we will drop signs in P± and P 1 ,± , hence P (z) = P+ (z) if z ∈ + and 2 P (z) = P− (z) if z ∈ − , a similar convention will be applied to P 1 . 2

2.2. Symmetry properties of PCR3BP. Notice that PCR3BP has the following symmetry: R(x, y, x, ˙ y, ˙ t) = (x, −y, −x, ˙ y, ˙ −t), which expresses the following fact, if (x(t), y(t)) is a trajectory for PCR3BP, then (x(−t), −y(−t)) is also a trajectory for PCR3BP. From this it follows immediately that if

if P± (x0 , x˙0 ) = (x1 , x˙1 ) P 1 ,± (x0 , x˙0 ) = (x1 , x˙1 ) 2

then P± (x1 , −x˙1 ) = (x0 , −x˙0 ), then P 1 ,∓ (x1 , −x˙1 ) = (x0 , −x˙0 ). 2

(3)

˙ = (x, −x) ˙ for (x, x) ˙ ∈ We will denote also by R the map R : ± → ± , R(x, x) ± . Now Eq. (3) can be written as if P± (x0 ) = x1 if P 1 ,± (x0 ) = x1 2

then P± (R(x1 )) = R(x0 ), then P 1 ,∓ (R(x1 )) = R(x0 ). 2

Heteroclinic Connections Between Periodic Orbits. Part II

565

3. The Existence of New Homoclinic Orbits The goal of this section is to present the proof of the existence of new homoclinic orbits with different resonances. The notion of the resonance. We rewrite here an informal definition of the resonance from [4, Sect. 5.1]. Recall that the PCR3BP is a perturbation of the two-body problem. Hence, outside a small neighborhood of Jupiter, the trajectory of a comet follows essentially a two-body orbit around the Sun. In the heliocentric inertial frame, the orbit is nearly elliptical. The mean motion resonance of the comet with respect to Jupiter is equal to a −3/2 , where a is the semi-major axis of this elliptical orbit. Recall that the Sun-Jupiter distance is normalized to be 1 in the PCR3BP. The comet is said to be in p : q resonance with Jupiter if a −3/2 ≈ p/q, where p and q are small integers. In the heliocentric inertial frame, the comet makes roughly p revolutions around the Sun in q Jupiter periods. Observe that this definition of the resonance also make sense for the orbits, which are non-periodic (for example orbits homoclinic to L∗1 or L∗2 ), we just have to compute the semi-major axis for the piece of orbit away from Jupiter. A heuristic approach, which allows to read the resonance of an orbit from the trajectory in the rotating frame is described in Appendix. In [12] the following theorem was proved. Theorem 3.1. [12, Thm.6.5,Thm.6.7]. Consider PCR3BP with C = 3.03, µ = 0.0009537. Then • there exists a homoclinic orbit to the L∗1 orbit (in Sun region). This orbit is close to the 3 : 2 resonance. • there exists a homoclinic orbit to the L∗2 orbit (in exterior region). This orbit is close to the 1 : 2 resonance. These orbits are presented in Fig. 1. In this section we establish the existence of new homoclinic connections both in exterior and interior regions. The new homoclinic orbit in the exterior region is close to the 2:3 resonance. As was mentioned in the Introduction this orbit has been found numerically in√[4], see Fig. 5.4, and the intersection of the stable and unstable manifolds of L∗2 at L = a ≈ 1.26. The other new homoclinic orbit in the interior region is close to the 5:3 resonance and it appears to be a new one. 3.1. The existence of the 2 : 3 homoclinic orbit in the exterior region. We define the following h-sets Gi = t (ci , ui , si ), for i = 0, . . . , 4, where: c0 c1 c2 c3 c4

= = = = =

(−1.12327231155833984, 0), (1.093337837571255552, −0.02510094170679043584), (1.047131544421841024, −0.001056187943513949696), (1.08194053721089792, −2.521361165903333888 · 10−5 ), (1.04682616720451456, −9.169345277545603072 · 10−7 ),

and s0 s1 s2 s3 s4

= = = = =

(−1 · 10−8 , 4 · 10−7 ), (1 · 10−7 , 21 · 10−8 ), (−1 · 10−7 , 35 · 10−8 ), (−1 · 10−7 , 23 · 10−8 ), (−1 · 10−7 , 35 · 10−8 ),

u0 u1 u2 u3 u4

= = = = =

−R(s0 ), −R(s1 )/10, −R(s2 )/10, −R(s3 )/10, −R(s4 )/4.

566

D. Wilczak, P. Zgliczy´nski −1

−0.5

0

0.5

1 1.5

1

0.5

y

0

−0.5

−1

−1.5 x

Fig. 2. 5:3 homoclinic orbit to L∗1 Lyapunov orbit (interior region) and 2:3 homoclinic orbit to L∗2 Lyapunov orbit (exterior region)

We assume that G0 , G2 , G4 ⊂ + and G1 , G3 ⊂ − . With a computer assistance we proved the following: Lemma 3.2. The maps P 1 ,+ : G0 ∪ G2 ∪ G4 → − , 2

P 1 ,− : G1 ∪ G3 → + 2

are well defined and continuous. Moreover, the following covering relations hold: P1/2,+

P1/2,−

P1/2,+

P1/2,−

P1/2,+

G0 ⇒ G1 ⇒ G2 ⇒ G3 ⇒ G4 ⇒ H22 . Theorem 3.3. For PCR3BP with C = 3.03 and µ = 0.0009537 there exists an orbit homoclinic to L∗2 close to the 2 : 3 resonance. Proof. From Lemma 3.2 and [12, Lemma 5.6] it follows that P1/2,+

P1/2,−

P1/2,+

P1/2,−

P1/2,+

P−

P−

G0 ⇒ G1 ⇒ G2 ⇒ G3 ⇒ G4 ⇒ H22 ⇒ H2 ⇒ H2 . Note that the h-set G0 is R-symmetric by its definition. Therefore P−

P−

P1/2,−

P1/2,+

H2 = R(H2 ) ⇐ R(H2 ) ⇐ R(H22 ) ⇐ R(G4 ) ⇐ R(G3 ) P1/2,−

P1/2,+

P1/2,−

R(G3 ) ⇐ R(G2 ) ⇐ R(G1 ) ⇐ R(G0 ) = G0 . Since P− is hyperbolic on |H2 | ([12, Lemma 5.5]) the assertion is a consequence of [2, Theorem 4].

Heteroclinic Connections Between Periodic Orbits. Part II

567

3.2. The existence of the 5 : 3 homoclinic orbit in the interior region. As in the previous section we construct a chain of covering relations in order to prove the existence of the homoclinic orbit to L∗1 orbit. We define h-sets Vi = t (ci , ui , si ), for i = 0, . . . , 4, where c0 c1 c2 c3 c4

= (0.5217056203008400006, 0), = (−0.5822638014577352639, −0.2793408708392046136), = (0.919204446847046941, 0.004093829363524479834), = (0.9522506335647477061, 0.0001333182992547130779), = (0.9208022956271231241, 2.918364277340028028 · 10−6 ),

and s0 s1 s2 s3 s4

= = = = =

(−1 · 10−7 , 2 · 10−7 ), (2 · 10−8 , 4 · 10−7 ), (−4 · 10−7 , 102 · 10−8 ), (−1 · 10−7 , 365 · 10−9 ), (−1 · 10−7 , 25733011 · 10−14 ),

u0 u1 u2 u3 u4

= = = = =

−R(s0 ), (3 · 10−8 , 0), −R(s2 )/5, −R(s3 )/10, −R(s4 )/2.

We assume that V0 , V2 , V4 ⊂ + and V1 , V3 ⊂ − . With a computer assistance we proved the following: Lemma 3.4. The maps P 1 ,+ : V0 ∪ V2 ∪ V4 → − , 2

P 1 ,− : V1 ∪ V3 → + 2

are well defined and continuous. Moreover, we have the following chain of covering relations P1/2,+

P1/2,−

P1/2,+

P1/2,−

P+

V0 ⇒ V1 ⇒ V2 ⇒ V3 ⇒ V4 ⇒ H12 . Theorem 3.5. For PCR3BP with C = 3.03 and µ = 0.0009537 there exists an orbit homoclinic to L∗1 close to the 5 : 3 resonance. Proof. Since the sets H1 and H12 are R-symmetric [12, Lemma 5.6] and [12, Corollary 3.14] imply that P+

P+

H12 = R(H12 ) ⇐ R(H1 ) = H1 ⇒ H1 . After combining the above with Lemma 3.4 we obtain P1/2,+

P1/2,−

P1/2,+

P1/2,−

P+

P+

P+

V0 ⇒ V1 ⇒ V2 ⇒ V3 ⇒ V4 ⇒ H12 ⇐ H1 ⇒ H1 . Note that the h-set V0 is R-symmetric by its definition. Therefore P+

P+

P1/2,+

P1/2,−

P1/2,+

P1/2,−

H1 ⇒ H12 ⇐ R(V4 ) ⇐ R(V3 ) ⇐ R(V2 ) ⇐ R(V1 ) ⇐ R(V0 ) = V0 . Since P+ is hyperbolic on |H1 | ([12, Lemma 5.5]) the assertion is a consequence of [2, Theorem 4].

568

D. Wilczak, P. Zgliczy´nski

4. Symbolic Dynamics on Six Symbols and Resonance Transitions As a consequence of theorems proved in [12] and in the previous section we obtain the existence of symbolic dynamics on six symbols. Let L1 , L2 denote the Lyapunov orbits regions (see [4]) , S and I denote two parts of the Sun region corresponding to suitable vicinities of two homoclinic orbits to L∗1 . Let X and E denote two parts of the exterior region corresponding to suitable vicinities of two homoclinic orbits to the L∗2 orbit. Schematically this situation is shown in Fig. 3. In [12] the symbolic dynamics on four symbols, i.e. {L1 , L2 , X, S} was established. The new homoclinic orbits allow us to include more symbols in it. We state this result more precisely. Let α, β ∈ {L1 , L2 , X, E, I, S} be such that there is an arrow from α to β on the graph presented in Fig. 3. We define the function

fβ,α

 P+ ,      P− ,    4  P  − ◦ (P1/2,+ ◦ P1/2,− ) ◦ P1/2,+ ◦ P+ ,     P+ ◦ P1/2,− ◦ (P1/2,+ ◦ P1/2,− )4 ◦ P− ,     P+ ◦ (P1/2,− ◦ P1/2,+ )2 ◦ P1/2,−    P ◦ P 2 + 1/2,− ◦ (P1/2,+ ◦ P1/2,− ) = P−2 ◦ P1/2,+ ◦ (P1/2,− ◦ P1/2,+ )2     (P 1 ,− ◦ P1/2,+ )2 ◦ P1/2,− ◦ P−2   2    ◦ (P1/2,+ ◦ P1/2,− )2 ◦ P1/2,+ , P  −     P1/2,− ◦ (P1/2,+ ◦ P1/2,− )2 ◦ P− ,     P 2 ◦ (P1/2,− ◦ P1/2,+ )2 ,    + (P1/2,− ◦ P1/2,+ )2 ◦ P+2 ,

if (α, β) = (L1 , L1 ) if (α, β) = (L2 , L2 ) if (α, β) = (L1 , L2 ) if (α, β) = (L2 , L1 ) if (α, β) = (S, L1 ) if (α, β) = (L1 , S) . if (α, β) = (X, L2 ) if (α, β) = (L2 , X)

(4)

if (α, β) = (E, L2 ) if (α, β) = (L2 , E) if (α, β) = (I, L1 ) if (α, β) = (L1 , I )

For each symbol α ∈ {L1 , L2 , X, E, I, S} we define the h-set Qα , where QL1 = H1 , QL2 = H2 , QS = E0 , QX = F0 , QI = V0 , QE = G0 . Definition 4.1. The bi-infinite sequence (αi )i∈Z is called admissible if for every i ∈ Z there is an arrow from αi to αi+1 on the graph presented in Fig. 3. The finite sequence (α0 , α1 , . . . , αn ) is called admissible if for every i = 0, 1, . . . , n − 1 there is an arrow from αi to αi+1 on the graph presented in Fig. 3. Let be the set of all admissible sequences (αi )i∈Z ∈ {L1 , L2 , X, E, I, S}Z .

I

S

L1

J

E L2 X

Fig. 3. The graph of symbolic dynamics on six symbols

Heteroclinic Connections Between Periodic Orbits. Part II

569

Theorem 4.2. For every (αi )i∈Z ∈ there exists a sequence (xi )i∈Z satisfying 1. xi ∈ |Qαi | for i ∈ Z, 2. fαi+1 ,αi (xi ) = xi+1 , for i ∈ Z. Moreover, we have periodic orbits: if the sequence (αi )i∈Z is periodic with the principal period k, then the trajectory (xi )i∈Z may be chosen so that xk = x0 , hence its trajectory is periodic; homo- and heteroclinic orbits: if the sequence (αi )i∈Z is such that αk = Li− for k ≤ k− and αk = Li+ for k ≥ k+ , where i− , i+ ∈ {1, 2}, then lim xk = L∗i− ,

lim xk = L∗i+ .

k→−∞

Proof. The same as [12, Theorem 7.1].

k→∞

4.1. Resonance transitions. Theorem 4.2 implies the possibility for a comet to move between various resonances. If we interpret staying close to L∗1 or L∗2 periodic orbits as the 1 : 1 resonance, then Theorem 4.2 says that the comet can travel between exterior and Sun regions in both directions and can move between 5 : 3, 3 : 2, 1 : 2, 2 : 3 and 1 : 1 resonances in an arbitrary order. 5. Symmetric Periodic and Homoclinic orbits In Sect. 2.2 the symmetry property of PCR3BP and the associated Poincar´e maps are described. In this section we give the proof of the existence of an infinite number of symmetric periodic and homoclinic orbits. Definition 5.1. Let I t → u(t) ∈ R4 be a solution of PCR3BP, where I is the maximal interval of the existence of the solution. An orbit t → u(t) is called R-symmetric iff Image(u) = {u(t) | t ∈ I } = {R(u(t)) | t ∈ I } = R(Image(u)). In this section we apply the method for finding symmetric periodic, homo and heteroclinic orbits first introduced in [10, 11] for the planar case and later developed in [13] in the multidimensional situation. We recall here the basic definitions. Definition 5.2. Let N be a h-set with one unstable and one stable direction and let γ : [a, b] → R2 be a continuous curve. We say that γ is a horizontal curve in N if the following conditions hold: 1. γ ((a, b)) ⊂ int(|N|), 2. either γ (a) ∈ N le and γ (b) ∈ N re , or γ (a) ∈ N re and γ (b) ∈ N le . The geometry of this concept is shown in Fig. 4. Definition 5.3. Let N be a h-set with one unstable and one stable direction and let γ : [a, b] → R2 be a continuous curve. We say that γ is a vertical curve in N if the following conditions hold: 1. γ ((a, b)) ⊂ int(|N |),

570

D. Wilczak, P. Zgliczy´nski

s γ (b) ∈ N re

γ (a) ∈ N le s

Fig. 4. An h-set N and a horizontal curve γ in N

2. either γ (a) ∈ N te and γ (b) ∈ N be , or γ (a) ∈ N te and γ (b) ∈ N be . The following theorem is a special case of [13, Thm.3]. Theorem 5.4. Assume N0 , N1 , . . . , Nk are h-sets with one unstable and one stable direction and f0

f1

fk−1

N0 ⇐⇒ N1 ⇐⇒ · · · ⇐⇒ Nk . ¯ → R2 is a vertical curve If γ : [a, b] → R2 is a horizontal curve in N0 and γ¯ : [a, ¯ b] in Nk , then there exists t0 ∈ (a, b) such that (fm ◦ · · · ◦ f0 ◦ γ )(t0 ) ∈ int(|Nm+1 |),

(5)

for m = 0, . . . , k − 1 and ¯ (fk−1 ◦ · · · ◦ f0 ◦ γ )(t0 ) ∈ γ¯ ((a, ¯ b)).

(6)

Theorem 5.4 was first proven in [10] for a planar case and direct covering relations. The generalization to a higher dimension with one unstable direction and the direct (forward) covering is presented in [11]. The proof of a general situation (i.e. direct and backward covering in the multidimensional case) requires more sophisticated techniques and is presented in [13]. 5.1. Symmetric periodic orbits. In this section we will use Theorem 5.4 in order to prove the existence of an infinite number of geometrically different symmetric periodic orbits. Before we state the main result in this section we introduce some notation. Let (α, β) ∈ {L1 , L2 , X, E, I, S}2 be an admissible sequence of symbols. Let the maps fβ,α be defined as in (4). fβ,α

Notation. By Qα ⇐⇒ Qβ we will denote the chain of covering relations associated with the sequence (α, β), i.e. P

P

P

Pk−1

P

k 1 2 3 Qα ⇐⇒ V1 ⇐⇒ V2 ⇐⇒ · · · ⇐⇒ Vk−1 ⇐⇒ Qβ ,

where fβ,α = Pk ◦ . . . ◦ P1 and Vi , i = 1, . . . , k − 1 are suitable h-sets.

Heteroclinic Connections Between Periodic Orbits. Part II

571

Definition 5.5. Let f : X → X. By Fix(f ) we will denote the set of fixed points of f , i.e. Fix(f ) = {y ∈ X | f (y) = y}. Theorem 5.6. Let φ : R × R4 −→ ◦ R4 denote the local flow induced by the PCR3BP with C = 3.03 and µ = 0.0009537. Assume (α0 , α1 , . . . , αn ) ∈ {S, I, X, E, L1 , L2 }n , n > 0 is an admissible sequence of symbols. Then there exists a point x0 ∈ |Qα0 | such that (fαm ,αm−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ |Qαm |, (fα−1 ◦ · · · ◦ fα−1 )(x0 ) ∈ |Qαm |, m−1 ,αm 0 ,α1

(7)

for m = 1, . . . , n, i.e., the trajectory of x0 is coded by the periodic sequence of symbols (αn , . . . , α1 , α0 , α1 , . . . , αn ).

(8)

Moreover, x0 is periodic and its orbit is R-symmetric. Proof. From the definitions of the h-sets used in the proof of homo- and heteroclinic chains it follows that the sets H1 , H2 , V0 , G0 , E0 , F0 are R-symmetric. Therefore Fix(R) may be parameterized both as a horizontal and as a vertical curve in each of these sets (see ¯ → |Qαn | be Fig. 5). Let γ : [a, b] → |Qα0 | be the horizontal curve in Qα0 and γ¯ : [a, ¯ b] ¯ ¯ b]) ⊂ Fix(R). Now, Theorem 5.4 the vertical curve in Qαn , such that γ ([a, b]) ∪ γ¯ ([a, applied to the sequence fα

,α

fα

,α

fα

fαn ,αn−1

,α

1 0 2 1 3 2 Qα0 ⇐⇒ Qα1 ⇐⇒ Qα2 ⇐⇒ · · · ⇐⇒ Qαn

implies that there exists a point x0 = γ (t0 ) ∈ |Qα0 | ∩ Fix(R) such that (fαm ,αm−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ |Qαm |, (fαn ,αn−1

m = 1, . . . , n, ¯ ⊂ Fix(R). ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ γ¯ ((a, ¯ b)) for

right edge N re γ ([a,b])

left edge N le

Fig. 5. A symmetric h-set. The γ curve is both horizontal and vertical in N

572

D. Wilczak, P. Zgliczy´nski

From the definition of fβ,α (see Eq.(4)) as a composition of suitable Poincar´e maps, it follows that there exists T > 0 such that φ(T , x0 ) = (fαn ,αn−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ Fix(R). Since R is the reversing symmetry of φ we obtain φ(T , x0 ) = R(φ(T , x0 )) = φ(−T , R(x0 )) = φ(−T , x0 ), which proves x0 is periodic and its orbit is R-symmetric. There remains to prove that the trajectory of x0 is coded by the sequence (8), i.e. (7) is satisfied. We formulate this as a separate lemma. Lemma 5.7. Assume (α0 , . . . , αn ) is an admissible sequence of symbols. If x ∈ dom(fαn ,αn−1 ◦ · · · ◦ fα1 .α0 ) then R(x) ∈ dom(fα−1 ◦ · · · ◦ fα−1 ) and n−1 ,αn 0 ,α1 (R ◦ fαm ,αm−1 ◦ · · · ◦ fα1 ,α0 )(x) = (fα−1 ◦ · · · ◦ fα−1 ◦ R)(x), m−1 ,αm 0 ,α1 for m = 1, . . . , n. Moreover, if x = R(x) then ◦ · · · ◦ fα−1 )(x) ∈ |Qαm | (fα−1 m−1 ,αm 0 ,α1 for m = 1, . . . , n. Proof. One observes that if (α, β) is admissible, then (β, α) is admissible, too. More−1 over, from the definition of fβ,α (Eq. (4)) it follows that R ◦ fβ,α = fα,β ◦ R. Let x ∈ dom(fαk ,αk−1 ◦ · · · ◦ fα1 .α0 ). Then (R ◦ fαm ,αm−1 ◦ · · · ◦ fα1 ,α0 )(x) = (fα−1 ◦ R ◦ fαm−1 ,αm−2 ◦ · · · ◦ fα1 ,α0 )(x) = m−1 ,αm · · · = (fα−1 ◦ · · · ◦ fα−1 ◦ R)(x), m−1 ,αm 0 ,α1 for m = 1, . . . , n. If in addition x = R(x) then x ∈ dom(fα−1 ◦ · · · ◦ fα−1 ) and m−1 ,αm 0 ,α1 (fα−1 ◦ · · · ◦ fα−1 )(x) ∈ R(|Qαm |) = |Qαm |. m−1 ,αm 0 ,α1

Remark 5.8. Theorem 5.6 implies that there exist infinitely many geometrically different symmetric periodic orbits. This follows immediately from the fact that there exists an infinite number of admissible chains satisfying the assumptions of Theorem 5.6.

5.2. Symmetric homoclinic orbits. In this section we apply Theorem 5.4 in order to prove the existence of infinitely many geometrically different symmetric homoclinic orbits to L∗1 and L∗2 Lyapunov orbits. The following theorem shows how to use the method of covering relations in order to prove the existence of symmetric homoclinic or heteroclinic orbits. Later we will apply it to Poincar´e maps for PCR3BP.

Heteroclinic Connections Between Periodic Orbits. Part II

573

Theorem 5.9. Let N0 , N1 , . . . , Nk be h-sets, such that f0

f1

fk−1

fk

N0 ⇐⇒ N1 ⇐⇒ · · · ⇐⇒ Nk ⇐⇒ Nk and let γ : [a, b] → R2 be a horizontal curve in N0 . If fk is hyperbolic (see [2, Def. 1]) on Nk , then there exists a point x0 ∈ γ ((a, b)) such that (fm ◦ · · · ◦ f0 )(x0 ) ∈ int(|Nm+1 |), for m = 0, 1, . . . , k − 1, (fkn ◦ fk−1 ◦ · · · ◦ f0 )(x0 ) ∈ int(|Nk |), for n > 0. Moreover, lim (fkn ◦ fk−1 ◦ · · · ◦ f0 )(x0 ) = x∗ ,

n→∞

where x∗ is a unique fixed point of fk in |Nk |. Proof. From Theorem 5.4 it follows that for every n > 0 there exists tn ∈ [a, b] such that (fm ◦ · · · ◦ f0 )(γ (tn )) ∈ int(|Nm+1 |), for m = 0, 1, . . . , k − 1, (fkn ◦ fk−1 ◦ · · · ◦ f0 )(γ (tn )) ∈ int(|Nk |). Since γ ([a, b]) is compact we can find t∗ ∈ [a, b] such that (fm ◦ · · · ◦ f0 )(γ (t∗ )) ∈ int(|Nm+1 |), for m = 0, 1, . . . , k − 1, (fkn ◦ fk−1 ◦ · · · ◦ f0 )(γ (t∗ )) ∈ int(|Nk |), for n > 0. Since neither f (γ (a)) ∈ / N1 nor f (γ (b)) ∈ / N1 we get t∗ ∈ (a, b). Now, fk is hyperbolic on Nk . Therefore by Theorem 3 in [2], lim (fkn ◦ fk−1 ◦ · · · ◦ f0 )(x0 ) = x∗ ,

n→∞

where x0 := γ (t∗ ).

Now we can state the basic result in this section. Theorem 5.10. Assume (α0 , α1 , . . . , αn ) is an admissible nonconstant chain of symbols {S, I, X, E, L1 , L2 }, such that αn ∈ {L1 , L2 }. Then there exists a symmetric homoclinic orbit associated with the sequence of symbols (. . . , αn , αn , αn−1 , . . . , α1 , α0 , α1 , . . . , αn−1 , αn , αn , . . . ).

(9)

Proof. Let γ : [a, b] → |Qα0 | be a horizontal curve in Qα0 such that γ ([a, b]) ⊂ Fix(R). From Lemma 5.5 in [12] it follows that P+ is hyperbolic on |H1 | = QL1 and P− is hyperbolic on |H2 | = QL2 . Since αn ∈ {L1 , L2 } Theorem 5.9 there exists x0 ∈ γ ((a, b)) such that (fαm ,αm−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ int(|Qαm |), (fαkn ,αn

for m = 1, . . . , n,

◦ fαn ,αn−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) ∈ int(|Qαn |), lim

k→∞

(fαkn ,αn

for k > 0,

◦ fαn ,αn−1 ◦ · · · ◦ fα1 ,α0 )(x0 ) = L,

574

D. Wilczak, P. Zgliczy´nski

where L = L∗1 or L = L∗2 is a unique fixed point in |Qαn |. Since x0 = R(x0 ), Lemma 5.7 implies that (fα−1 ◦ · · · ◦ fα−1 )(x0 ) ∈ int(|Qαm |), m−1 ,αm 0 ,α1 (fα−k n ,αn

for m = 1, . . . , n,

◦ fα−1 ◦ · · · ◦ fα−1 )(x0 ) ∈ int(|Qαn |), n−1 ,αn 0 ,α1 lim (fα−k ◦ fα−1 ◦ · · · ◦ fα−1 )(x0 ) = n ,αn n−1 ,αn 0 ,α1 k→∞

for k > 0, R(L) = L,

which proves that the trajectory of x0 is a symmetric homoclinic orbit coded by the sequence of symbols (9). Remark 5.11. Theorem 5.10 implies that there exist infinitely many symmetric homoclinic orbits which are geometrically different. This follows immediately from the fact that there exists an infinite number of admissible chains satisfying the assumptions of Theorem 5.10. 6. Technical Data The computer assisted proofs of Lemma 3.2 and Lemma 3.4 will be not discussed here. All ideas involved in such proof were presented in [12]. The C++ sources containing the rigorous numerical proof of lemmas from [12], Lemma 3.2 and Lemma 3.4 is available at [9]. The program uses the interval arithmetic and set algebra package developed at Jagiellonian University by the CAPD group [1]. The whole proof took 14 minutes on the Pentium IV 2.4GHz processor (the gcc-3.3.1 compiler, PLD linux distribution). The reader should note that the computation time reported here is considerably shorter than the one from [12] (40 minutes on 1.1GHz) despite the fact that here we have more conditions to check. This is a result of improved numerical algorithms and a faster computer. For comparison purposes we had also run our program on a 1.1GHz machine and the resulting computation time was 34 minutes. The gain (6 minutes) was mainly due to various optimizations in the algorithms. The main one was the use of Evaluation 5 instead of Evaluation 3 in the C 1 -Lohner algorithm. 7. Appendix. Reading Resonances from the Trajectory in Rotating Frame We describe the heuristic approach, which allows us to read the resonance from the inspection of the trajectory in the rotating coordinate frame. We assume that Jupiter and the comet move in the heliocentric inertial frame in the counterclockwise direction and the distance comet-Sun has well discernible maxima or minima along the trajectory. This means that an approximate ellipse on which the comet is moving has nonzero eccentricity. Let R denote the resonance. Let T be an approximate period of the comet in the heliocentric frame. Let us recall that the period of the Jupiter is equal to 1. Hence R=

1 . T

(10)

Then in the heliocentric inertial frame the average angular velocity of the comet is and that of the Jupiter is equal to 2π.

2π T

Heteroclinic Connections Between Periodic Orbits. Part II

575

For an approximate periodic trajectory of a comet in the rotating frame we introduce the following notation: • θ is the number of full turns around the Sun during the whole period. This number is positive for trajectories in the interior region and negative in the exterior region. • M - the number of maxima (or minima) of the distance between the Sun and the comet. Since the distance Sun-comet reaches the maximum (or minimum) only when the comet is at the aphelion (or perihelion), hence consecutive maxima (minima) occur with the period T . In the rotating frame the difference between the angular variables of the comet and Jupiter is equal to 2πθ/M. Observe that this difference is the same in both reference frames, the inertial one and the rotating one. Hence 2πθ 2π = − 2π T , M T θ = 1 − T, M T =

M −θ . M

R=

M . M −θ

Hence finally (11)

Let us apply (11) to Figs. 1 and 2. For interior homoclinics we count the maxima and for exterior homoclinic we count the minima. We have • • • •

the interior homoclinic orbit in Fig. 1: θ = 1, M = 3. Hence R = 23 = 3 : 2. the interior homoclinic orbit in Fig. 2: θ = 2, M = 5. Hence R = 53 = 5 : 3. the exterior homoclinic orbit in Fig. 1: θ = −1, M = 1. Hence R = 21 = 1 : 2. the exterior homoclinic orbit in Fig. 2: θ = −1, M = 2. Hence R = 23 = 2 : 3.

References 1. CAPD - Computer Assisted Proofs in Dynamics, a package for rigorous numeric. http://capd.wsb-nlu.edu.pl. 2. Galias, Z., Zgliczynski, P.: Abundance of homoclinic and heteroclinic orbits and rigorous bounds for the topological entropy for the Henon map. Nonlinearity 14, 909–932 (2001) 3. DeVogelaere, R.: On the structure of symmetric periodic solutions of conservative systems. In: Contribution to the theory of nonlinear oscillations, Vol. 4, Princeton, NJ: Princeton University Press, 1958 4. Koon, W. S., Lo, M. W., Marsden, J. E., Ross, S. D. Heteroclinic Connections between Periodic Orbits and Resonance Transitions in Celestial Mechanics. Chaos 10(2), 427–469 (2000) 5. Lamb, J.S.W.: Reversing symmetries in dynamical systems. J. Phys. A:Math. Gen. 25, 925–937 (1992) 6. Lamb, J.S.W.: Reversing symmetries in dynamical systems. PhD Thesis, Amsterdam University, 1994 7. Moser, J.: On the generalization of a theorem of Liapunov. Comm. Pure Appl. Math. 11, 257–271 (1958) 8. Stoffer, D., Kirchgraber, U.: Possible chaotic motion of comets in the Sun Jupiter system - an efficient computer-assisted approach. Nonlinearity 17, 281–300 (2004)

576

D. Wilczak, P. Zgliczy´nski

9. Wilczak, D.: http://www.wsb-nlu.edu.pl/˜dwilczak 10. Wilczak, D.: Chaos in the Kuramoto–Sivashinsky equations – a computer assisted proof. J. Diff. Eq, 194, 433–459 (2003) 11. Wilczak, D.: Symmetric heteroclinic connections in the Michelson system – a computer assisted proof. To appear in SIAM J. App. Math (2005) 12. Wilczak, D., Zgliczy´nski, P.: Heteroclinic Connections between Periodic Orbits in Planar Restricted Circular Three Body Problem - A Computer Assisted Proof. Commun. Math. Phys. 234, 37–75 (2003) 13. Wilczak, D., Zgliczy´nski, P.: Topological method for symmetric periodic orbits for maps with a reversing symmetry. Submitted, available at http://www.wsb-nlu.edu.pl/˜dwilczak Communicated by G. Gallavotti

Commun. Math. Phys. 259, 577–613 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1376-8

Communications in

Mathematical Physics

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories Alan L. Carey1 , Stuart Johnson2 , Michael K. Murray2 , Danny Stevenson3 , Bai-Ling Wang4 1

Mathematical Sciences Institute, Australian National University, Canberra ACT 0200, Australia. E-mail: [email protected] Department of Pure Mathematics, University of Adelaide, Adelaide, SA 5005 Australia. E-mail: [email protected]; [email protected] 3 Department of Mathematics, 202 Surge Building, University of California at Riverside, Riverside, CA 92521-0135, USA. E-mail: [email protected] 4 Institut f¨ ur Mathematik, Universit¨at Z¨urich, Winterthurerstrasse 190, 8057 Z¨urich, Switzerland. E-mail: [email protected]

2

Received: 25 October 2004 / Accepted: 11 January 2005 Published online: 14 June 2005 – © Springer-Verlag 2005

Abstract: We develop the theory of Chern-Simons bundle 2-gerbes and multiplicative bundle gerbes associated to any principal G-bundle with connection and a class in H 4 (BG, Z) for a compact semi-simple Lie group G. The Chern-Simons bundle 2-gerbe realises differential geometrically the Cheeger-Simons invariant. We apply these notions to refine the Dijkgraaf-Witten correspondence between three dimensional Chern-Simons functionals and Wess-Zumino-Witten models associated to the group G. We do this by introducing a lifting to the level of bundle gerbes of the natural map from H 4 (BG, Z) to H 3 (G, Z). The notion of a multiplicative bundle gerbe accounts geometrically for the subtleties in this correspondence for non-simply connected Lie groups. The implications for Wess-Zumino-Witten models are also discussed.

Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . 2. Deligne Characteristic Classes for Principal G-Bundles 3. From Chern-Simons to Wess-Zumino-Witten . . . . . 4. Bundle 2-Gerbes . . . . . . . . . . . . . . . . . . . . 5. Multiplicative Bundle Gerbes . . . . . . . . . . . . . . 6. The Chern-Simons Bundle 2-Gerbe . . . . . . . . . . . 7. Multiplicative Wess-Zumino-Witten Models . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

578 581 586 592 597 602 609 612

The authors acknowledge the support of the Australian Research Council. ALC thanks MPI f¨ ur Mathematik in Bonn and ESI in Vienna and BLW thanks CMA of Australian National University for their hospitality during part of the writing of this paper.

578

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

1. Introduction In [42] Quillen introduced the determinant line bundle of Cauchy-Riemann operators on a Hermitian vector bundle coupled to unitary connections over a Riemann surface. This work influenced the development of many lines of investigation including the study of Wess-Zumino-Witten actions on Riemann surfaces. Note that Quillen’s determinant line bundle also plays an essential role in the construction of the universal bundle gerbe in [15], see also [8]. The relevance of Chern-Simons gauge theory has been noted by many authors, starting with Ramadas-Singer-Weitsman [43] and recently Dupont-Johansen [20], who used gauge covariance of the Chern-Simons functional to give a geometric construction of Quillen line bundles. The curvatures of these line bundles in an analytical set-up were studied extensively by Bismut-Freed [3] and in dimension two, went back to the AtiyahBott work on the Yang-Mills equations over Riemann surfaces. [2]. A new element was introduced into this picture by Freed [25] and [24] (a related line of thinking was started by some of the present authors [14]) through the introduction of higher algebraic structures (2-categories) to study Chern-Simons functionals on 3-manifolds with boundary and corners. For closed 3-manifolds one needs to study the behaviour of the Chern-Simons action under gluing formulae (that is topological quantum field theories) generalising the corresponding picture for Wess-Zumino-Witten. Heuristically, there is a Chern-Simons line bundle as in [43], such that for a 3-manifold with boundary, the Chern-Simons action is a section of the Chern-Simons line bundle associated to the boundary Riemann surface. For a codimension two submanifold, a closed circle, the Chern-Simons action takes values in a U (1)-gerbe or an abelian group-like 2-category. Gerbes first began to enter the picture with J-L Brylinski [6] and Breen [5]. The latter developed the notion of a 2-gerbe as a sheaf of bicategories extending Giraud’s [29] definition of a gerbe as a sheaf of groupoids. J-L Brylinski used Giraud’s gerbes to study the central extensions of loop groups, string structures and the relation to Deligne cohomology. With McLaughlin, Brylinski developed a 2-gerbe over a manifold M to realise degree 4 integral cohomology on M in [10] and introduced an expression of the 2-gerbe holonomy as a Cheeger-Simons differential character on any manifold with a triangulation. This is the starting point for Gomi [31, 32] who developed a local theory of the Chern-Simons functional along the lines of Freed’s suggestion. A different approach to some of these matters using simplicial manifolds has been found by Dupont and Kamber [21]. Our contribution is to develop a global differential geometric realization of ChernSimons functionals using a Chern-Simons bundle 2-gerbe and to apply this to the question raised by Dijkgraaf and Witten about the relation between Chern-Simons and WessZumino-Witten models. Our approach provides a unifying perspective on all of this previous work in a fashion that can be directly related to the physics literature on ChernSimons field theory (thought of as a path integral defined in terms of the Chern-Simons functional). In [23] it is shown that three dimensional Chern-Simons gauge theories with gauge group G can be classified by the integer cohomology group H 4 (BG, Z), and conformally invariant sigma models in two dimension with target space a compact Lie group (Wess-Zumino-Witten models) can be classified by H 3 (G, Z). It is also established that the correspondence between three dimensional Chern-Simons gauge theories and Wess-Zumino-Witten models is related to the transgression map τ : H 4 (BG, Z) → H 3 (G, Z),

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

579

which explains the subtleties in this correspondence for compact, semi-simple non-simply connected Lie groups ([36]). In the present work we introduce Chern-Simons bundle 2-gerbes and the notion of multiplicative bundle gerbes, and apply them to explore the geometry of the DijkgraafWitten correspondence. To this end, we will assume throughout that G is a compact semi-simple Lie group. The role of Deligne cohomology as an ingredient in topological field theories goes back to [27] and we add a new feature in Sect. 2 by using Deligne cohomology valued characteristic classes for principal G-bundles with connection. Briefly speaking, a degree p Deligne characteristic class for principal G-bundles with connection is an assignment to any principal G-bundle with connection over M of a class in the degree p Deligne cohomology group H p (M, Dp ) satisfying a certain functorial property. Deligne cohomology valued characteristic classes refine the characteristic classes for principal G-bundles. We will define three dimensional Chern-Simons gauge theories CS(G) as degree 3 Deligne cohomology valued characteristic classes for principal G-bundles with connection, but will later show that there is a global differential geometric structure, the Chern-Simons bundle 2-gerbe, associated to each Chern-Simons gauge theory. We will interpret a Wess-Zumino-Witten model as arising from the curving of a bundle gerbe associated to a degree 2 Deligne cohomology class on the Lie group G as in [12] and [28]. We then use a certain canonical G bundle defined on S 1 × G to construct a transgression map between classical Chern-Simons gauge theories CS(G) and classical Wess-Zumino-Witten models W ZW (G) in Sect. 3, which is a lift of the transgression map H 4 (BG, Z) → H 3 (G, Z). The resulting correspondence : CS(G) −→ W ZW (G) refines the Dijkgraaf-Witten correspondence between three dimensional Chern-Simons gauge theories and Wess-Zumino-Witten models associated to a compact Lie group G. On Deligne cohomology groups, our correspondence induces a transgression map H 3 (BG, D3 ) −→ H 2 (G, D2 ), and refines the natural transgression map τ : H 4 (BG, Z) → H 3 (G, Z) (cf. Proposition 3.4). See [9] for a related transgression of Deligne cohomology in a different set-up. For any integral cohomology class in H 3 (G, Z), there is a unique stable equivalence class of bundle gerbe ([37, 38]) whose Dixmier-Douady class is the given degree 3 integral cohomology class. Geometrically H 4 (BG, Z) can be regarded as stable equivalence classes of bundle 2-gerbes over BG, whose induced bundle gerbe over G has a certain multiplicative structure. To study the geometry of the correspondence , we revisit the bundle 2-gerbe theory developed in [44] and [34] in Sect. 4. Note that transformations between stable isomorphisms provide 2-morphisms making the category BGrbM of bundle gerbes over M and stable isomorphisms between bundle gerbes into a bi-category (cf. [44]). For a smooth surjective submersion π : X → M, consider the face operators πi : X [n] → X [n−1] on the simplicial manifold X• = {Xn = X [n+1] }. Then a bundle 2-gerbe on M consists of the data of a smooth surjective submersion π : X → M together with 1. An object (Q, Y, X[2] ) in BGrbX[2] . 2. A stable isomorphism m: π1∗ Q ⊗ π3∗ Q → π2∗ Q in BGrbX[3] defining the bundle 2-gerbe product which is associative up to a 2-morphism φ in BGrbX[4] .

580

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

3. The 2-morphism φ satisfies a natural coherency condition in BGrbX[5] . We then develop a multiplicative bundle gerbe theory over G in Sect. 5 as a simplicial bundle gerbe on the simplicial manifold associated to BG. We say a bundle gerbe G over G is transgressive if the Deligne class of G, written d(G) is in the image of the correspondence map : CS(G) → W ZW (G) = H 2 (G, G 2 ). The main results of this paper are the following two theorems (Theorem 5.8 and Theorem 5.9): 1. The Dixmier-Douady class of a bundle gerbe G over G lies in the image of the transgression map τ : H 4 (BG, Z) → H 3 (G, Z) if and only if G is multiplicative. 2. Let G be a bundle gerbe over G with connection and curving, whose Deligne class d(G) is in H 2 (G, D2 ). Then G is transgressive if and only if G is multiplicative. Let φ be an element in H 4 (BG, Z). The corresponding G-invariant polynomial on the Lie algebra under the universal Chern-Weil homomorphism is denoted by . For any connection A on the universal bundle EG → BG with the curvature form FA , (φ, (

i FA )) ∈ H 4 (BG, Z) ×H 4 (BG,R) 4cl,0 (BG) 2π

(where 4cl,0 (BG) is the space of closed 4-forms on BG with periods in Z), defines a unique degree 3 Deligne class in H 3 (BG, D3 ). Here we fix a smooth infinite dimensional model of EG → BG by embedding G into U (N ) and letting EG be the Stiefel manifold of N orthonormal vectors in a separable complex Hilbert space. We will show that H 3 (BG, D3 ) classifies the stable equivalence classes of bundle 2-gerbes with curving on BG, (we already know that the second Deligne cohomology classifies the stable equivalence classes of bundle gerbes with curving). These are the universal Chern-Simons bundle 2-gerbes Qφ in Sect. 6 (cf. Proposition 6.4) giving a i geometric realisation of the degree 3 Deligne class determined by (φ, ( FA )). 2π We show that for any principal G-bundle P with connection A over M, the associated Chern-Simons bundle 2-gerbe Qφ (P , A) over M is obtained by the pull-back of the universal Chern-Simons bundle 2-gerbe Qφ via a classifying map. The bundle 2-gerbe i curvature of Qφ (P , A) is given by ( FA ), and the bundle 2-gerbe curving is given 2π by the Chern-Simons form associated to (P , A) and φ. Under the canonical isomorphism between Deligne cohomology and CheegerSimons cohomology, there is a canonical holonomy map for any degree p Deligne class from the group of smooth p-cocycles to U (1). This holonomy is known as the CheegerSimons differential character associated to the Deligne class. The bundle 2-gerbe holonomy for this Chern-Simons bundle 2-gerbe Qφ (P , A) over M as given by the Cheeger-Simons differential character is used in the integrand for the path integral for the Chern-Simons quantum field theory. In the SU (N ) Chern-Simons theory, is chosen to be the second Chern polynomial. For a smooth map σ : Y → M, under a fixed trivialisation of σ ∗ (P , A) over Y , the corresponding holonomy of σ is given by e2π iCS(σ,A) , where CS(σ, A) can be written as the following well-known ChernSimons form: k 1 T rσ ∗ (A ∧ dA + A ∧ A ∧ A). 8π 2 Y 3 Here k ∈ Z is the level determined by φ ∈ H 4 (BSU (N ), Z) ∼ = Z.

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

581

We will establish in Theorem 6.7 that the Chern-Simons bundle 2-gerbe Qφ (P , A) over M is equivalent in Deligne cohomology to the Cheeger-Simons invariant associated to the principal G-bundle P with a connection A and a class φ ∈ H 4 (BG, Z). In the concluding section we show that the Wess-Zumino-Witten models in the image of this correspondence satisfy a quite interesting multiplicative property that is associated with the group multiplication on G. This multiplicative property is a feature of the holonomy of every multiplicative bundle gerbe. It implies that for the transgressive Wess-Zumino-Witten models, the so-called B field satisfies a certain integrality condition. Using our multiplicative bundle gerbe theory, we can give a very satisfying explanation why, for non-simply connected groups, multiplicative bundle gerbes only exist for Dixmier Douady classes that are certain particular multiples of the generator in H 3 (G, Z) (we call this the ‘level’). For a non-simply connected Lie group, there exists a subtlety in the construction of positive energy representations of its loop group, see [41, 45], where the level is defined in terms of its Lie algebra. While this paper was in preparation, Aschieri and Jurˇco in [1] proposed a similar construction of Chern-Simons 2-gerbes in terms of Deligne classes developed in [34] to study M5-brane anomalies and E8 gauge theory. Their discussions have some overlaps with our local descriptions of bundle 2-gerbes and holonomy of 2-gerbes. 2. Deligne Characteristic Classes for Principal G-Bundles In this section, we first review briefly Deligne and Cheeger-Simons cohomology, and then define a Deligne cohomology valued characteristic class for any principal G-bundle with connection over a smooth manifold M with G a compact semi-simple Lie group. Let H p (M, Dp ) be the p th Deligne cohomology group, which is the hypercohomology group of the complex of sheaves on M: dlog

d

d

p

U (1) → 1M → · · · → M , p

where U (1) is the sheaf of smooth U (1)-valued functions on M, M is the sheaf of imaginary-valued differential p-forms on M. Take any degree p Deligne class ξ = [g, ω1 , · · · ωp ], ˇ then with respect to a good cover of M, {gi0 i1 ···ip } represents a U (1)-valued Cech p-cocycle on M, and hence defines an element in H p (M, U (1)) ∼ = H p+1 (M, Z). The corresponding element in H p+1 (M, Z) is denoted by c(ξ ), and referred to as the characteristic class of ξ . Moreover, dωp is a globally defined closed p + 1 form on M with periods in 2π iZ called the curvature of ξ and denoted by curv(ξ ). Without causing any confusion, we often identify curv(ξ ) with curv(ξ )/2π i whose periods are in Z. We have indexed the Deligne cohomology group so that a degree p Deligne class has holonomy (to be discussed later in this section) over p dimensional sub-manifolds and a characteristic class in H p+1 (M, Z). For example, H 1 (M, D1 ) is the space of equivalence classes of line bundles with connection, whose holonomy is defined for any smooth path and whose characteristic class in H 2 (M, Z) is given by the first Chern class of the underlying line bundle. Next in the hierarchy, H 2 (M, D2 ) is the space of stable isomorphism classes of bundle gerbes with connection and curving, whose holonomy

582

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

is defined for any 2-dimensional closed sub-manifold and whose characteristic class in H 3 (M, Z) is given by the Dixmier-Douady class of the underlying bundle gerbe. The Deligne cohomology group H p (M, Dp ) is part of the following two exact sequences (cf. [7]): c

p

0 → cl,0 (M) → p (M) → H p (M, Dp ) → H p+1 (M, Z) → 0,

(2.1)

p

where cl,0 (M) is the subspace of closed p-forms on M with periods in Z, in the space of p-forms p (M), and c is the characteristic class map; and curv

p+1

0 → H p (M, R/Z) → H p (M, Dp ) → cl,0 (M) → 0,

(2.2)

where the map curv is the curvature map on H p (M, Dp ). We remark that for a Deligne class ξ ∈ H p (M, Dp ), curv(ξ ) the curvature of ξ defines a cohomology class in H p+1 (M, R) which agrees with the image of c(ξ ) under the map H p+1 (M, Z) → H p+1 (M, R) sending an integral class to a real class. Recall that the Cheeger-Simons group of differential characters of degree p in [16], Hˇ p (M, U (1)), is defined to be the space of pairs, (χ , ω) consisting of a homomorphism χ : Zp (M, Z) → U (1), where Zp (M, Z) is the group of smooth p-cycles, and an imaginary-valued closed (p + 1)-form ω on M with periods in 2πiZ such that for any smooth (p + 1) chain σ χ (∂σ ) = exp( ω). σ

The Cheeger-Simons group Hˇ p (M, U (1)) enjoys the same exact sequences (2.1) and (2.2) as the Deligne cohomology group H p (M, Dp ). In fact, the holonomy and the curvature of a Deligne class define a canonical isomorphism (hol, curv) : H p (M, Dp ) −→ Hˇ p (M, U (1)).

(2.3)

Here the holonomy of a Deligne class ξ is defined as follows. For a smooth p-cycle given by a triangulation of a smooth map X → M, pull back ξ to X to obtain a Deligne class on X. Lift this class to an element α in p (X) from the exact sequence (2.1) as H p+1 (X, Z) = 0 and then hol(ξ ) = exp( α) X

is independent of the choice of α; this again follows from (2.1). For a general smooth p-cycle σ = k nk σk , we choose a local representative (g, ω1 , · · · ωp ) of ξ under a good cover {Ui } of M such that for each smooth p-simplex fk : σk → M, with possible subdivisions, fk (σk ) is contained in some open set Uik for which fk has a smooth extension. We can define (see [27] for p = 2, and [13, 31] for p = 2, 3) p p−1 A(σk ) = exp fk∗ ωik + fk∗ ωiτ ik + · · · ·

σk

τ(1)

τ(p) ∈τ(p−1) ⊂···⊂τ(1) ⊂σk

τ(1) ⊂σk

gik iτ

(1)

(1)

···iτ(p) ,

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

583

where {τ(j ) } is the set of codimension j faces of σk , with τ(j ) ⊂ Uiτ(j ) and induced orientation from σk . It is routine to show that hol(ξ ) = A(σk )nk k

is independent of local representative of ξ and subdivisions because k nk σk is a cycle. With the understanding of (2.3), we often identify a Deligne class with the corresponding Cheeger-Simons differential character. In [33], Hopkins and Singer develop a cochain model for the Cheeger-Simons cohomology where certain integrations are well-defined, see also [22]. In this paper, we need to define an integral on the Deligne cohomology group: : H 3 (S 1 × G, D3 ) → H 2 (G, D2 ). (2.4) S1

While this makes sense under the map (hol, curv) and Hopkins-Singer’s integration for the cochain model for the Cheeger-Simons differential characters we will instead apply the exact sequences (2.1) and (2.2) to uniquely define the integration map (2.4) via the following two commutative diagrams: curv

4 3 1 H 3 (S 1 ×G, R/Z) → H 3 (S 1 × G, D ) → cl,0 (S × G) ↓ S1 ↓ S1 ↓ S1 curv 3 2 2 2 H (G, R/Z) → H (G, D ) → cl,0 (G),

(2.5)

where the integration map (2.4) is well-defined modulo the image of H 2 (G, R/Z) → H 2 (G, D2 ); and 3 (S 1 × G) c → H 3 (S 1 × G, D3 ) → H 4 (S 1 × G, Z) 3cl,0 (S1 × G) ↓ S1 ↓ S1 ↓ S1 2 (G) c → H 2 (G, D2 ) → H 3 (G, Z), 2cl,0 (G)

(2.6)

where the integration map (2.4) is well-defined modulo the image of 2 (G) → H 2 (G, D2 ). To see how these two exact sequences may be used to uniquely specify the integration map we let r : H p (M, Z) → H p (M, R) be the map that sends an integral class to a real class. Cheeger and Simons [16] define p

R p (M, Z) = {(ω, u) ∈ cl,0 (M) ⊕ H p (M, Z) | r(u) = [ω]},

(2.7)

where [ω] is the real cohomology class of the differential form ω. This enables them to combine the map of a Deligne class to its characteristic class and the map to its curvature into one map from the Deligne cohomology group into R p+1 . There is a short exact sequence: 0→

H p (M, R) (curv,c) → H p (M, Dp ) −→ R p+1 (M, Z) → 0. r(H p (M, Z))

(2.8)

584

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

Then the following induced diagram is commutative: H 3 (S 1 × G, R) (curv,c) → H 3 (S 1 × G, D3 ) −→ R 4 (S 1 × G, Z) 3 1 r(H (S × G, Z)) ↓ S1 ↓ S1 ↓ S1 0

→

H 2 (G, D2 )

(curv,c)

−→

R 3 (G, Z)

→ 0.

This commutative diagram and the fact that H 2 (G, R) = 0 for any compact semi-simple Lie group G is the reason the integration map (2.4) is well defined. With these preparations we may now define Deligne characteristic classes for principal G-bundles with connection. Recall that a characteristic class c for principal Gbundles is an assignment of a class c(P ) ∈ H ∗ (M, Z) to every isomorphism class of principal G-bundles P → M. Of course we could do Q, R, etc. instead of Z. This assignment is required to be ‘functorial’ in the following sense: if f : N → M is a smooth map we require that c(f ∗ (P )) = f ∗ (c(P )), where f ∗ P is the pull-back principal G-bundle over N. It is a standard fact that characteristic classes are in bijective correspondence with elements of H ∗ (BG, Z). The proof is: given a characteristic class c, we have, of course, c(EG) ∈ H ∗ (BG, Z) and conversely if ξ ∈ H ∗ (BG, Z) is given, then defining cξ (P ) = f ∗ (ξ ) for any classifying map f : M → BG gives rise to a characteristic class for the isomorphism class of principal G-bundles defined by the classifying map f . This uses the fact that any two classifying maps f and g are homotopy equivalent so that f ∗ = g ∗ , and hence f ∗ (ξ ) = g ∗ (ξ ). This definition motivates our definition of Deligne characteristic classes for principal G-bundles with connection. Note that the characteristic classes only depend on the underlying topological principal G-bundle, in order to define a Deligne cohomology valued characteristic class, we will restrict ourselves to differentiable principal G-bundles. Definition 2.1. A Deligne characteristic class d (of degree p) for principal G-bundles with connection is an assignment to any principal G-bundle P with connection A over M of a class d(P , A) ∈ H p (M, Dp ) which is functorial in the sense that if f : N → M then d(f ∗ (P ), f ∗ (A)) = f ∗ (d(P , A)), where f ∗ (P ) is the pull-back principal G-bundle with the pull-back connection f ∗ (A). Note that if we add two Deligne characteristic classes using the group structure in H p (M, Dp ) the result is another Deligne characteristic class. Denote by Dp (G) the group of all Deligne characteristic classes of degree p for principal G-bundles. If d ∈ Dp (G) is a Deligne characteristic class for principal G-bundles and P → M is a principal G-bundle, we can choose a connection A on P , then d(P , A) ∈ H p (M, Dp ). Composing with the characteristic class map for Deligne cohomology c : H p (M, Dp ) → H p+1 (M, Z), we get c(d(P , A)) = c ◦ d(P , A) ∈ H p+1 (M, Z).

(2.9)

Lemma 2.2. The above map (2.9) defines a homomorphism Dp (G) → H p+1 (BG, Z).

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

585

Proof. If we can show that (2.9) is independent of the choice of connections then we have defined a characteristic class for P , which corresponds to an element in H p+1 (BG, Z). Here we approximate BG by finite dimensional smooth models (see [40]), and use the fact that H ∗ (BG, Z) is the inductive limit of the cohomology of these models. To see that c ◦ d(P , A) doesn’t depend of the choice of connections, let A0 and A1 be two connections on P and consider the connection A on Pˆ = P × R → M × R given by A = (1 − t)A0 + tA1 . Let ιt : M → M × R be the inclusion map ιt (m) = (m, t). It is well known that the induced maps on cohomology ι∗t : H p (M × R, Z) → H p (M, Z) are all equal and isomorphisms. Moreover ι∗0 (Pˆ , A) = (P , A0 ) and ι∗1 (Pˆ , A) = (P , A1 ) which imply c ◦ d(P , A0 ) = c ◦ d(ι∗0 (Pˆ , A)) = ι∗0 c ◦ d(Pˆ , A) = ι∗1 c ◦ d(Pˆ , A) = c ◦ d(ι∗1 (Pˆ , A)) = c ◦ d(P , A1 ) ∈ H p+1 (M, Z). Hence we have defined a homomorphism Dp (G) → H p+1 (BG, Z). Define I k (G) to be the ring of invariant polynomials on the Lie algebra of G. Then we have the Chern-Weil homomorphism: cw : I k (G) → H 2k (BG, R). If G is compact then this is an isomorphism. Define A2k (G, Z) = {(, φ) ∈ I k (G) × H 2k (BG, Z) | cw() = r(φ)}. In [16] Cheeger and Simons show that each (, φ) ∈ A2k (G, Z) defines a differential character valued characteristic class of degree 2k − 1, whose value on a principal G-bundle P over M with connection A is denoted by S,φ (P , A) ∈ Hˇ 2k−1 (M, U (1)) (cf. Remark 6.1). Let c,φ (P , A) be the element in H 2k−1 (M, D2k−1 ) such that under the natural isomorphism Hˇ 2k−1 (M, U (1)) → H 2k−1 (M, D2k−1 ), S,φ (P , A) → c,φ (P , A). For the category of principal G-bundles with connection whose morphisms are connection preserving bundle morphisms, then c,φ (P , A) is a Deligne characteristic class, which is a functorial lifting of ((

i 2k FA ), φ(P )) ∈ 2k cl,0 (M) × H (M, Z), 2π

where FA is the curvature 2-form of the connection A and φ(P ) is the corresponding characteristic class of P . This defines a map: A2k (G, Z) → D2k−1 (G).

586

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

In particular, if G is compact and φ ∈ H 2k (BG, Z) then = cw−1 (r(φ)) ∈ I k (G) satisfies cw() = r(φ), which means (, φ) ∈ A2k (G, Z). As is determined by φ in this case we write cφ ≡ c,φ . So we have a composed map H 2k (BG, Z) → A2k (G, Z) → D2k−1 (G) which sends φ to cφ . Proposition 2.3. For a compact Lie group G, each element φ in H 2k (BG, Z) defines a degree 2k − 1 Deligne characteristic class cφ in D2k−1 (G) such that cφ → φ under the homomorphism in Lemma 2.2. Proof. From the above discussion and the definition of the Deligne characteristic class, we obtain that, given a principal G-bundle P with a connection A over M, cφ (P , A) = c,φ (P , A) is a functorial lifting of i 2k FA ), φ(P )) ∈ 2k cl,0 (M) × H (M, Z). 2π This implies that the corresponding characteristic class of P is φ(P ). From the bijective correspondence between degree 2k characteristic classes of principal G-bundles and elements of H 2k (BG, Z), we know that cφ → φ under the homomorphism in Lemma 2.2. ((

Fixing a smooth infinite dimensional model of EG → BG by embedding the compact semi-simple Lie group G into U (N ) and letting EG be the Stiefel manifold of N orthonormal vectors in a separable complex Hilbert space, we know that the Deligne cohomology group H 2k−1 (BG, D2k−1 ) is well-defined. Let A be a universal connection on EG, φ ∈ H 2k (BG, Z) defines a degree 2k − 1 Deligne characteristic class c,φ (EG, A) ∈ H 2k−1 (BG, D2k−1 ), where ∈ I k (G) satisfies cw() = r(φ). Then the following commutative diagram H 2k−1 (BG, D2k−1 )

/ H 2k (BG, Z)

2k (BG) cl,0

/ H 2k (BG, R)

shows that the map φ → cφ (EG, A) refines the Chern-Weil homomorphism. 3. From Chern-Simons to Wess-Zumino-Witten Let G be a compact, connected, semi-simple Lie group. In [23] Dijkgraaf and Witten discuss a correspondence map between three dimensional Chern-Simons gauge theories and Wess-Zumino-Witten models associated to the compact Lie group G from the topological actions viewpoint, which naturally involves the transgression map τ : H 4 (BG, Z) → H 3 (G, Z).

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

587

(To be precise, τ is actually the inverse of the transgression in Borel’s study of topology of Lie groups and characteristic classes but this is of no real importance.) We recall the definition of τ . We take a class φ ∈ H 4 (BG, Z) and pull its representative φ back to π ∗ (φ), a four-cocycle on EG. As EG is contractible we have that φ = dτφ for a three-cocycle τφ on EG. Restricting τφ to a fibre which we identify with G it is easy to show that the result is a closed cocycle defining an element of H 3 (G, Z) and that moreover this cohomology class is independent of all choices made. It is shown in [23] that three dimensional Chern-Simons gauge theories with gauge group G can be classified by the integer cohomology group H 4 (BG, Z), and conformally invariant sigma models in two dimension with target space a compact Lie group (Wess-Zumino-Witten models) can be classified by H 3 (G, Z). Recall the commutative diagram H 3 (BG, D3 )

/ H 4 (BG, Z)

4cl,0 (BG)

/ H 4 (BG, R).

To classify the exponentiated Chern-Simons action in three dimensional Chern-Simons gauge theories, we propose the following mathematical definitions of a three dimensional Chern-Simons gauge theory and a Wess-Zumino-Witten model. Definition 3.1. We make the following definitions: 1. A three dimensional Chern-Simons gauge theory with gauge group G is defined to be a Deligne characteristic class of degree 3 for a principal G-bundle with connection. We denote the group of all three dimensional Chern-Simons gauge theories with gauge group G by CS(G). 2. A Wess-Zumino-Witten model on G is defined to be a Deligne class on G of degree 2 and we denote the group of all such by W ZW (G). In brief, CS(G) = D3 (G) and W ZW (G) = H 2 (G, D2 ). With these preliminaries taken care of we can now explain our refined geometric definition of the Dijkgraaf-Witten map and discuss its image. Firstly, we give a more geometric definition of the transgression map τ , which has the advantage that it can be lifted to define a correspondence map from CS(G) to W ZW (G). We do this by constructing a canonical G-bundle P with connection A on the manifold S 1 × G. It follows that if d ∈ CS(G) then d(P, A) ∈ H 3 (S 1 × G, D3 ). We can integrate along S 1 with the Deligne characteristic class in H 3 (S 1 × G, D3 ) to get the required map S1

d(P, A) ∈ H 2 (G, D2 ) = W ZW (G).

We want to show that for any G there is a natural G bundle over S 1 × G with connection. To do this it is convenient to work with pre-G-bundles. Definition 3.2. A pre-G-bundle is a pair (Y, g), ˆ where π : Y → M is a surjective submersion and gˆ : Y [2] → G such that

588

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

g(y ˆ 1 , y3 ) = g(y ˆ 1 , y2 )g(y ˆ 2 , y3 ) for any y1 , y2 , y3 all in the same fibre of π : Y → M. Here we denote by Y [p] the p-fold fibre product of π : Y → M. If P → M is a principal G bundle, there is a canonical map gˆ : P [2] → G defined by p1 g(p ˆ 1 , p2 ) = p2 . Then (P , g) ˆ is a pre-G-bundle. Conversely if (Y, g) ˆ is a pre-G-bundle over M, we can construct a principal G-bundle P → M as follows. Take Y × G and define (y1 , h1 ) ∼ (y2 , h2 ) if π(y1 ) = π(y2 ) and h1 g(y ˆ 1 , y2 ) = h2 . The space of equivalence classes P = Y × G/ ∼ is a principal G-bundle over M with right G-action on equivalence classes [y, g] given by [y, h] · g = [y, hg]. Two pre-G-bundles (Y, gˆ 1 ) and (X, gˆ 2 ) give rise to isomorphic principal G bundles if and only if there is an hˆ : Y ×π X → G such that ˆ 1 , x1 )gˆ 1 (y1 , y2 ) = gˆ 2 (x1 , x2 )h(y ˆ 2 , x2 ) h(y for any collection of points y1 , y2 ∈ Y , x1 , x2 ∈ X mapping to the same point in M. A pre-G-bundle (Y, g) ˆ is trivial if there is an hˆ : Y → G such that ˆ 1 )−1 h(y ˆ 2) g(y ˆ 1 , y2 ) = h(y for every (y1 , y2 ) ∈ Y [2] . Given a pre-G-bundle (Y,g) ˆ over M, we denote by gˆ −1d gˆ the pull-back by g: ˆ Y [2] → G −1 of the Maurer-Cartan form. Then gˆ d gˆ is a g (the Lie algebra of G)-valued . Let A be a g-valued one-form on Y . We say that A is a connection for the pre-G-bundle (Y, g) ˆ if π1∗ (A) = ad(g)π ˆ 2∗ (A) − gˆ −1 d g, ˆ where π1 , π2 : Y [2] → Y are the projections and we denote the adjoint action of G on its Lie algebra by ad(g). ˆ It is easy to check that there is a one-to-one correspondence between connections on a pre-G-bundle and connections on the associated principal G bundle. We wish to define a G bundle on S 1 × G with connection. From the previous discussion it suffices to define a pre-G-bundle with connection. Let A be all smooth maps h from R to G with h−1 dh periodic and h(0) = 1. Define π : A → G by π(h) = h(1). Notice that if π(g) = π(h) then g = hγ , where γ is a smooth map from [0, 1] to G with γ −1 dγ periodic and γ (1) = 1 = γ (0). Such a γ is actually a smooth based loop in the based loop group G. We can identify A with the space of G-connections on the circle S 1 = R/Z. Then A is contractible and π : A → G is the holonomy map. Hence, π : A → G is a universal G-bundle, and G is a classifying space BG of G. Let Y = A × S 1 → G × S 1 . Define gˆ : Y [2] → G by g(h ˆ 1 , h2 , θ) = h1 (θ )−1 h2 (θ ). Then the pair (Y, g) ˆ is a pre-G-bundle over G × S 1 . Let Yˆ = R × A and the projection ˆ ˆ h) = h(t). hˆ −1 d hˆ being Y → Y induced by R → S 1 . Define hˆ : Yˆ → G by h(t, periodic descends to a g-valued one-form on Y . It is straightforward to check that this defines a connection A for the pre-G-bundle (Y, g) ˆ over G × S 1 .

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

S1

589

The principal G-bundle over G × S 1 corresponding to the pre-G-bundle Y = A × → G × S 1 can be obtained as follows (cf. [11] and [39]). Denote by P=

A × S1 × G , G

the quotient space of G-action on A × S 1 × G, where the G-action is given by, for γ ∈ G and (h, θ, g) ∈ A × S 1 × G, γ · (h, θ, g) = (hγ , θ, γ (θ )−1 g).

(3.1)

Notice that P admits a natural free G-action from the right multiplication on G-factor. The connection A on the pre-G-bundle (Y, g) ˆ defines a natural connection A on P. Definition 3.3. The canonical principal G-bundle over G×S 1 is given by P with connection A. The correspondence map from three dimensional Chern-Simons gauge theories CS(G) to Wess-Zumino-Witten models W ZW (G) is defined to be CS(G) = D3 (G) d → d(P, A) ∈ H 2 (G, D2 ) = W ZW (G). S1

Denote this map by : CS(G) −→ W ZW (G). The next proposition shows that the map descends to the natural transgression map from H 4 (BG, Z) to H 3 (G, Z) and hence refines the Dijkgraaf-Witten correspondence. Proposition 3.4. The correspondence map from CS(G) to W ZW (G) induces the natural transgression map τ : H 4 (BG, Z) → H 3 (G, Z). Proof. We first give another construction of τ . Let EG → BG be the universal G-bundle, then (as is well known) the G bundle π˜ : EG → BG formed by applying the based loop functor to EG → BG gives another model of the universal G-bundle. In particular we have a homotopy equivalence BG → G which lifts to an G-equivariant homotopy equivalence EG → A. This leads to the isomorphism: H 3 (BG, Z) → H 3 (G, Z). On the other hand, the natural evaluation map: ev : BG × S 1 −→ BG defines a pull-back map ev ∗ : H 4 (BG, Z) → H 4 (BG × S 1 , Z), from which the integration along S 1 gives rise to another construction of the transgression map: ◦ ev ∗ : H 4 (BG, Z) → H 3 (BG, Z) ∼ (3.2) = H 3 (G, Z).

590

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

From the homotopy equivalence between the two universal G-bundles: π : A → G and π˜ : EG → BG, we get a homotopy equivalence of two principal G-bundles: P=

EG × S 1 × G A × S1 × G ∼ . G G

(3.3)

Here the G action on EG × S 1 × G is given by the similar action on A × S 1 × G 1 ×G is a principal G-bundle over BG × S 1 . as in (3.1). EG×S G Now we show that the pull-back of the universal G-bundle: EG → BG via the evaluation map ev, which is

(3.4) ev ∗ (EG) = BG × S 1 ×BG EG is isomorphic to

EG×S 1 ×G G

as principal G-bundles. The isomorphism map

EG × S 1 × G → BG × S 1 ×BG EG G

(3.5)

is given by [(γ˜ , θ, g)] → [(π( ˜ γ˜ ), θ, γ˜ (θ) · g)]. Here (γ˜ , θ, g) ∈ EG × S 1 × G, π˜ is the map EG → BG, γ˜ (θ) is the image of the evaluation map on EG × S 1 and the action of g on γ˜ (θ) is induced from the right G-action on the universal G: EG → BG. It is easy to check that (3.5) is a well-defined G-bundle isomorphism by direct calculation: ˜ γ˜ · γ ), θ, (γ˜ γ )(θ ) · (γ (θ )−1 g))] [(γ˜ · γ , θ, γ (θ )−1 g)] → [(π( = [(π( ˜ γ˜ ), θ, γ˜ (θ ) · g)], and for g ∈ G, [(γ˜ , θ, gg )] → [(π( ˜ γ˜ ), θ, γ˜ (θ) · gg )]. Then from (3.3), (3.4) and (3.5), we obtain a homotopy equivalence of two principal G-bundles: P ev ∗ (EG) ↓ ∼ ↓ S 1 × G S 1 × BG.

(3.6)

Hence, with the definition of the integration map (2.4) given by (2.5) and (2.6), we see that the transgression map τ = ◦ev ∗ : H 4 (BG, Z) → H 3 (G, Z) agrees with the map induced by our correspondence : CS(G) → W ZW (G). For a compact Lie group, Proposition 2.3 tells us that there exists a one-to-one map CS(G) → H 3 (BG, D3 ) and the exact sequence (2.1) implies the exact sequence 0 → 2 (G)/ 2cl,0 (G) → W ZW (G)) → H 3 (G, Z) → 0.

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

591

As in general, the map τ : H 4 (BG, Z) → H 3 (G, Z) is not surjective, the map : CS(G) → W ZW (G) is not surjective either, for a general compact semi-simple Lie group. We will see that the Wess-Zumino-Witten models from the image of exhibit some special properties by exploiting bundle gerbe theory. We give a summary of how various bundle gerbes enter. First it is now well understood how, given a WZW model, we can define an associated bundle gerbe over the group G, as W ZW (G) = H 2 (G, D2 ) is the space of stable isomorphism classes of bundle gerbes with connection and curving [12, 38]. We will see in Sect. 4 that H 4 (BG, Z) is the space of stable equivalence classes of bundle 2-gerbes on BG. Thus an element in H 4 (BG, Z) defines a class of bundle 2-gerbes on M associated to a principal G-bundle over M using the pullback construction of the classifying map. The corresponding transgressed element in H 3 (G, Z) defines a bundle gerbe over G. In fact, as H 3 (BG, R) = 0, we know that the third Deligne cohomology group H 3 (BG, D3 ) is determined by the following commutative diagram: c

H 3 (BG, D3 ) −→ H 4 (BG, Z) ↓ ↓curv 4 cl,0 (BG) −→ H 4 (BG, R). In the last section we showed that given an element φ ∈ H 4 (BG, Z), we can take a G-invariant polynomial ∈ I 2 (G) corresponding to φ, then for a connection A on the universal bundle EG over BG, (φ, (

i FA )) ∈ H 4 (BG, Z) ×H 4 (BG,R) 4cl,0 (BG) 2π

represents a degree 3 Deligne class cφ (EG, A) ∈ H 3 (BG, D3 ). In a following section we will define a universal Chern-Simons bundle 2-gerbe determined by cφ (EG, A). We will then pull it back to define, for a principal G-bundle P with connection A over M, a Chern-Simons bundle 2-gerbe over M, with 2-curving given by the Chern-Simons form associated to (P , A). Remark 3.5. Brylinski [7] defines a generalisation of H 3 (G, Z), called the differentiable 3 (G, U (1)) for which there is an isomorphism H 3 (G, U (1)) cohomology, denoted Hdiff diff ∼ = H 4 (BG, Z). In low dimensions these differentiable cohomology classes have the following interpretations [7]: 1 (G, U (1)) Hdiff 2 (G, U (1)) Hdiff 3 (G, U (1)) Hdiff

∼ = ∼ = ∼ =

smooth homomorphisms G → U (1), isomorphism classes of central extensions of G by U (1), equivalence classes of multiplicative U (1)-gerbes on G.

His multiplicative U (1)-gerbes motivated us to define multiplicative bundle gerbes.

592

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

4. Bundle 2-Gerbes Bundle 2-gerbe theory on M is developed in [44]. A bundle 2-gerbe with connection and curving defines a degree 3 Deligne class in H 3 (M, D3 ). In [34] it is shown that the group of stable equivalence classes of bundle 2-gerbes with connection and curving is isomorphic to H 3 (M, D3 ). We review the results in [44] and [34], then we define multiplicative bundle gerbes on compact Lie group. We begin with the definition of a simplicial bundle gerbe as in [44] on a simplicial manifold X• = {Xn }n≥0 with face operators di : Xn+1 → Xn (i = 0, 1, · · · , n + 1). We remark that the simplicial manifolds we use in this paper are not required to have degeneracy operators (see [19]). Definition 4.1 (cf. [44]). A simplicial bundle gerbe on a simplicial manifold X• consists of the following data. 1. A bundle gerbe G over X1 , 2. A bundle gerbe stable isomorphism m : d0∗ G ⊗ d2∗ G → d1∗ G over X2 , where di∗ G is the pull-back bundle gerbe over X2 , 3. The bundle gerbe stable isomorphism m is associative up to a natural transformation, called an associator, φ : d2∗ m ◦ (d0∗ m ⊗ I d) → d1∗ m ◦ (I d ⊗ d3∗ m) between the induced stable isomorphisms of bundle gerbes over X3 . The line bundle Lφ over X3 induced by φ admits a trivialisation section s such that δ(s) agrees with the canonical trivialisation of d0∗ Lφ ⊗ d1∗ L∗φ ⊗ d2∗ Lφ ⊗ d3∗ L∗φ ⊗ d4∗ Lφ . If in addition the bundle gerbe G is equipped with a connection and a curving, and m is a stable isomorphism of bundle gerbes with connection and curving, we call it a simplicial bundle gerbe with connection and curving on X• . Remark 4.2. For those unfamiliar with 2-gerbes we offer the following amplification: 1. A bundle gerbe stable isomorphism m in Definition 4.1 is a fixed trivialisation of the bundle gerbe δ(G) = d0∗ G ⊗ d1∗ G ∗ ⊗ d2∗ G over X2 , where d1∗ G ∗ is the dual bundle gerbe of d1∗ G. (See [37, 38] for various operations on bundle gerbes and the definition of a bundle gerbe stable isomorphism.) 2. With the understanding of m in Definition 4.1 as a fixed trivialisation of the bundle gerbe d0∗ G ⊗ d1∗ G ∗ ⊗ d2∗ G over X2 , we can see that d2∗ m ◦ (d0∗ m ⊗ I d) and d1∗ m ◦ (I d ⊗ d3∗ m) represent two trivialisations of the bundle gerbe over X3 . This induces a line bundle over X3 (cf. [37]), called the φ-induced or associator line bundle. A simplicial bundle gerbe G requires that this associator line bundle is trivial and the trivialisation section s satisfies the natural cocycle condition. 3. A simplicial bundle gerbe with connection and curving has in its definition, a restrictive condition, as it requires that the bundle gerbe stable isomorphism m preserves connections and curvings. This implies, d0∗ (curv(G)) − d1∗ (curv(G)) + d2∗ (curv(G)) = 0.

(4.1)

For the simplicial bundle gerbe constructed in this paper, the underlying bundle gerbe is often equipped with a connection and curving, but we shall not require that the bundle gerbe stable isomorphism m preserves the connection and curving. In Sect. 7, instead of (4.1), we have

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

593

d0∗ (curv(G)) − d1∗ (curv(G)) + d2∗ (curv(G)) = dB, for some 2-form B on X2 , which is not necessarily obtained from a 2-form on X1 by the δ-map d0∗ − d1∗ + d2∗ . For a smooth submersion π : X → M, there is a natural associated simplicial manifold X• = {Xn } (which one might well think of as the ‘nerve’ associated to π : X → M) with Xn given by Xn = X [n+1] = X ×M X ×M · · · ×M X, the (n + 1)-fold fiber product of π, and face operators di = πi+1 : Xn → Xn−1 (i = 0, 1, · · · , n) are given by the natural projections from X [n+1] to X [n] by omitting the entry in i position for πi . For an exception, we denote by EG[•] the associated simplicial manifold {EG[n] } for the universal bundle π : EG → BG. Now we recall the definition of bundle 2-gerbe on a smooth manifold M from [44] and [34]. Definition 4.3. A bundle 2-gerbe on M consists of a quadruple of smooth manifolds (Q, Y ; X, M), where π : X → M is a smooth, surjective submersion, and (Q, Y ; X[2] ) is a simplicial bundle gerbe on the simplicial manifold X• = {Xn = X[n+1] } associated to π : X → M. It is sometimes convenient to describe bundle 2-gerbes using the language of 2-categories (see for example [35]). One first observes that transformations between stable isomorphisms provide 2-morphisms making the category BGrbM of bundle gerbes over M and stable isomorphisms between bundle gerbes into a weak 2-category or bi-category (cf. [44]). Note that the space of 2-morphisms between two stable isomorphisms is one-to-one corresponding to the space of line bundles over M. Consider the face operators πi : X[n] → X [n−1] on the simplicial manifold X• = {Xn = X[n+1] }. We can define a bifunctor πi∗ : BGrbX[n−1] −→ BGrbX[n] sending objects, stable isomorphisms and 2-morphisms to the pull-backs by πi (i = 1, . . . , n). One can then use this language to describe the data of a bundle 2-gerbe as follows. A bundle 2-gerbe on M consists of the data of a smooth surjective submersion π : X → M together with 1. An object (Q, Y, X[2] ) in BGrbX[2] . 2. A stable isomorphism m: π1∗ Q ⊗ π3∗ Q → π2∗ Q in BGrbX[3] defining the bundle 2-gerbe product which is associative up to a 2-morphism φ in BGrbX[4] . 3. The 2-morphism φ satisfies a natural coherency condition in BGrbX[5] . We now briefly pause to describe some new notation which provides a good way to encode the simplicial bundle gerbe data (cf. Definition 4.1 and Remark 4.2). We define maps πij : X [n] → X[2] for n > 2 which send a point (x1 , . . . , xn ) of X[n] to the point (xi , xj ) ∈ X[2] . It is clear that these maps can be written (non-uniquely) in terms of the πi (the non-uniqueness stems from the simplicial identities satisfied by the face maps πi ’s). Let us write Qij for πij∗ Q. For example, the bundle gerbe Q12 over X [3] is the pull-back π3∗ Q of Q. Returning to the definition of bundle 2-gerbe, the next part of the definition requires that there is a stable isomorphism m : Q23 ⊗ Q12 → Q13 of bundle gerbes over X [3] together with a natural transformation called an associator

594

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

φ : π3∗ m ◦ (π1∗ m ⊗ I d) → π2∗ m ◦ (I d ⊗ π4∗ m), which is a 2-morphism in the bi-category BGrbX[4] , between the induced stable isomorphisms of bundle gerbes over X[4] making the following diagram commute up to an associator φ (represented by a 2-arrow in the diagram): I d⊗π4∗ m

/ Q34 ⊗ Q13 m m m m φ mm m π1∗ m⊗I d π2∗ m m mm ∗ m m rz π3 m / Q14 Q24 ⊗ Q12

Q34 ⊗ Q23 ⊗ Q12

(4.2)

Here we write π1∗ m as a stable isomorphism Q34 ⊗ Q23 → Q24 over X[4] , similarly for π2∗ m, π3∗ m and π4∗ m. Hence, the associator φ as a 2-morphism in BGrbX[4] defines a line bundle Lφ over X [4] , which is required to have a trivialisation section. In order to write efficiently the coherence condition satisfied by the natural transformation φ, we need one last piece of new notation. Let us write Qij k = Qj k ⊗ Qij , Qij kl = Qkl ⊗ Qj k ⊗ Qij and so on. So for example, Q123 = Q23 ⊗ Q12 , Q1234 = Q34 ⊗ Q23 ⊗ Q12 and the diagram (4.2) in BGrbX[4] can be written as Q1234

ww wwww w www

Q124

/ Q134 w

φ wwwwww

/ Q14

which is commutative if and only if φ is the identity 2-morphism denoted by I d. The coherency condition satisfied by the natural transformation φ can then be viewed from the following two equivalent diagrams in BGrbX[5] , calculating the associator (2morphism) between the two induced stable isomorphisms from Q12345 to Q15 (one is Q12345 → Q1345 → Q145 → Q15 , and the other is Q12345 → Q1235 → Q125 → Q15 ): / Q1345 Q12345I GG I u ∗ v GG II π5 φ v vvv uu v GG v I u v II GG uu vvvv I u v v G# I v ∗ v zuu $ ~ v π1 φ /Q Q1245 Q1235Iks g g g ggg ww 145 II u g ∗φ g g u g π g II u w g3gggg II uu ww II uu ggggggggg ww u w $ zuow ggg {w / Q15 Q125 Q12345 f/ Q fffff vv 1345GGG u f f f u f f GG u v fIfdfff GG uu vv GG uu fffffffff vv u v # zuow fff zv π2∗ φ / Q135 ks Q145 Q1235I H HH II uuu ww π4∗ φ u HH II uuuu ww H II u w u H u w HH II w uu $ v~ uuu $ {ww / Q125 Q15

(4.3)

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

595

which implies the canonical isomorphism of two trivial line bundles over X [5] , π ∗ Lφ ⊗ π ∗ Lφ ⊗ π ∗ Lφ ∼ = π ∗ Lφ ⊗ π ∗ Lφ . 1

3

5

2

4

Remark 4.4. The first example of a bundle 2-gerbe is the tautological bundle 2-gerbe constructed in [14] over a 3-connected manifold M with a closed 4-form ∈ 4cl,0 (M), see [44] for a detailed proof and more examples. Definition 4.5. Let (Q, Y ; X, M) be bundle 2-gerbe on M. A bundle 2-gerbe connection on Q is a pair (∇, B) where ∇ is a bundle gerbe connection on the bundle gerbe (Q, Y ; X[2] ) and B is a curving for the bundle gerbe with connection (Q, ∇), whose bundle gerbe curvature ω on X [2] satisfies δ(ω) = 0, where δ = π1∗ − π2∗ + π3∗ : ∗ (X [2] ) → ∗ (X [3] ). Then we can solve the equation ω = (π1∗ − π2∗ )(C) for a three form C on X, such a choice of C is called a 2-curving for the bundle 2-gerbe (Q, Y ; X, M), or simply the bundle 2-gerbe curving. Then dC = π ∗ () for a closed four form on M, which is called the bundle 2-gerbe curvature. Locally, as in [44] and [34], a bundle 2-gerbe on M with connection and curving is determined by a degree 3 Deligne class [(gij kl , Aij k , Bij , Ci )] ∈ H 3 (M, D3 )

(4.4)

for a good cover {Ui } of M, over which there are local sections si : Ui → X. Then Ci = si∗ C. Using (si , sj ), we can pull-back the bundle gerbe (Q, Y, X [2] ) to Uij = Ui ∩ Uj , such that Qij = (si , sj )∗ Q is trivial. Then the bundle 2-gerbe product gives rise to the following stable isomorphism of bundle gerbes with connection and curving: Qij ⊗ Qj k → Qik ⊗ δ(Gij k ) for a bundle gerbe Gij k with connection and curving over Uij k = Ui ∩ Uj ∩ Uk , hence the curving Bij = (si , sj )∗ B satisfies dBij = Ci − Cj ,

Bij + Bj k + Bki = dAij k

for a connection 1-form Aij k on Gij k . Moreover, the associator φ defines a U (1)-valued ˇ function gij kl over Uij kl such that gij kl satisfies the Cech 3-cocycle condition −1 gij kl gij−1km gij lm giklm gj klm = 1

and Aij k − Aij l + Aikl − Aj kl = gij−1kl dgij kl . Definition 4.6. A bundle 2-gerbe (Q, Y ; X, M) is called trivial if (Q, Y ; X[2] ) is isomorphic in BGrbX[2] to δ(G) = π2∗ (G ∗ ) ⊗ π1∗ (G) for a bundle gerbe G over X together with compatible conditions on bundle 2-gerbe products in BGrbX[3] and associator natural transformations in BGrbX[4] (see [44] for more details). A bundle 2-gerbe (Q1 , Y1 ; X, M) is called stably isomorphic to a bundle 2-gerbe (Q2 , Y2 ; X, M) if and only if Q1 is isomorphic to Q2 ⊗ δ(G) for a bundle gerbe G over X together with extra conditions involving the associator natural isomorphisms for Q1 and Q2 .

596

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

Lemma 4.7. Let (P, X; Y, M) be a bundle 2-gerbe with connection and curving. Suppose there exists a stable isomorphism of bundle gerbes (P, X) ∼ = (Q, Z) over Y [2] . Then there exists a bundle 2-gerbe structure (Q, Z, Y, M) with induced connection and curving which has the same Deligne class in H 3 (M, D3 ) as the original bundle 2-gerbe (P, X; Y, M). Proof. First we must show that (Q, Z; Y, M) admits a bundle 2-gerbe product. We use the bundle 2-gerbe product on (P, X; Y, M) to define it. Recall that this product is a stable isomorphism of bundle gerbes over Y [3] , π ∗P ⊗ π ∗P ∼ = π ∗ P. 1

3

2

It is convenient here to realise the stable isomorphism as a trivial bundle gerbe by expressing the product as a bundle gerbe isomorphism π1∗ P ⊗ π2∗ P ∗ ⊗ π3∗ P = δ(J ). Similarly we have an isomorphism P = Q ⊗ δ(L) representing the stable isomorphism of bundle gerbes over Y [2] . Thus we have π1∗ (Q ⊗ δ(L)) ⊗ π2∗ (Q ⊗ δ(L))∗ ⊗ π3∗ (Q ⊗ δ(L)) = δ(J ) and so π1∗ Q ⊗ π2∗ Q∗ ⊗ π3∗ Q = δ(J ) ⊗ π1∗ δ(L)∗ ⊗ π2∗ δ(L) ⊗ π3∗ δ(L)∗ , where we use the fact that πi∗ δ(L) ⊗ πi∗ δ(L)∗ is canonically trivial. Since the pullback of a trivial bundle gerbe must itself be trivial and a tensor product of trivial bundle gerbes is trivial then the right hand side is trivial and thus we potentially have a bundle 2-gerbe product for Q. To confirm that it does define a bundle 2-gerbe product we must check the associativity conditions. Note that it is helpful now to look at diagram (4.2) and diagram (4.3) to understand the following arguments. Recall that there is a bundle called the associator bundle on X[4] which is the obstruction to the bundle 2-gerbe product being associative. It can be defined by considering the product π1−1 δ(J ) ⊗ π2−1 δ(J )∗ ⊗ π3−1 δ(J ) ⊗ π4−1 δ(J )∗ , where πi : Y [4] → Y [3] are the face maps in the simplicial complex. This product defines the associator line bundle on Y [4] (cf. Remark 4.2 and Diagram (4.2)). Changing to the trivialisation representing the bundle gerbe product for Q, we find that the extra terms involving δ(J ) all cancel (in the sense of having canonical trivialisations), hence the associator line bundles for P and Q are the same, so the bundle 2-gerbe product for Q is well defined. With the induced connection and curving, it is straightforward to show that the Deligne class is cohomologous to the Deligne class for (P, X; Y, M). From Lemma 4.7, we know that two stably isomorphic bundle 2-gerbes with connection and curving have the same Deligne class. Given a representative of a Deligne class as in (4.4), we can construct a local bundle 2-gerbe with connection and curving over M as in [44] and [34]. Analogous to the fact that H 2 (M, D2 ) classifies stable equivalence classes of bundle gerbes with connection and curving, we have the following proposition, whose complete proof can be found in [34]. Proposition 4.8 (Cf. [34]). The group of stable isomorphism classes of bundle 2-gerbes with connection and curving over M is isomorphic to H 3 (M, D3 ).

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

597

5. Multiplicative Bundle Gerbes The simplicial manifold BG• associated to the classifying space of G is constructed in [19], where the total space of the universal G-bundle EG also has a simplicial manifold structure. The simplicial manifold BG• = {BGn = G × · · · × G (n copies)} (where n = 0, 1, 2, · · ·), is endowed with face operators di 0, 1, · · · , n + 1)   (g1 , . . . , gn ), di (g0 , . . . , gn ) = (g1 , . . . , gi−1 gi , gi+1 , . . . , gn ),  (g , . . . , g ), 0 n−1

: Gn+1 → Gn , (i = i = 0, 1 ≤ i ≤ n, i = n + 1.

Definition 5.1. A multiplicative bundle gerbe over a compact Lie group G is defined to be a simplicial bundle gerbe on the simplicial manifold BG• associated to the classifying space of G. For a compact, simply connected, simple Lie group G, the tautological bundle over G associated to any class in H 3 (G, Z) is a simplicial bundle gerbe as shown in [44], hence, a multiplicative bundle gerbe. Proposition 5.2. Let G be a compact, connected Lie group. Then there is an isomorphism between H 4 (BG; Z) and the space of isomorphism classes of multiplicative bundle gerbes on G. First of all, it is not very hard to see that H 4 (BG; Z) corresponds to isomorphism classes of simplicial bundle gerbes on the simplicial manifold EG[•] . This is because a simplicial bundle gerbe on EG[•] is the same thing as a bundle 2-gerbe on EG → BG. Here we say that two simplicial bundle gerbes G and Q on EG[•] are isomorphic if there is a stable isomorphism G ∼ = Q which is compatible with all the multiplicative structures on G and Q. On a general simplicial manifold X• the notion of isomorphism of simplicial bundle gerbes is more complicated, involving bundle gerbes on X0 ; however because EG is contractible we may use this simpler notion of isomorphism without any loss of generality. ˇ Recall from [7] and [10] the definition of the simplicial Cech cohomology groups ∗ H (X• ; A) for a simplicial manifold X• and some topological abelian group A. To define these one first needs the notion of a covering of the simplicial manifold X• . By definition this is a family of covers U n = {Uαn } of the manifolds Xn which are compatible with the face and degeneracy operators for X• . Brylinski and McLaughlin in [10] explain how one may inductively construct such a family of coverings by first starting with an arbitrary cover U 0 of X0 and then choosing a common refinement U 1 of the induced covers d0−1 (U 0 ) and d1−1 (U 0 ) of X1 . U 1 then induces three covers d0−1 (U 1 ), d1−1 (U 1 ) and d2−1 (U 1 ) of X2 . One then chooses a common refinement U 2 and repeats this process. In particular, the covering U • may be chosen so that each U n is a good cover of Xn . ˇ The simplicial Cech cohomology H ∗ (U • , A) of X• for the covering U • is by definiˇ tion the cohomology of the double complex C p (U q , A) where C ∗ (U p , A) is the Cech complex for the covering U p of the manifold Xp . The differential for the complex C p (U ∗ , A) is induced in the usual way from the face operators di on X• . The groups

598

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

H ∗ (X• , A) are then defined by taking a direct limit of the coverings U • . If the covering U • is good in the sense that each U n is a good cover then H ∗ (X• ; A) ∼ = H ∗ (U • ; A). Note that there is a spectral sequence converging to a graded quotient of H ∗ (X• ; A) with p,q

E1

= H p (Xq ; A).

The following proposition is a straightforward extension of Theorem 5.7 in Part I of [10] to the language of bundle gerbes. Proposition 5.3. Let X• be a simplicial manifold. Then we have that isomorphism clasˇ ses of simplicial bundle gerbes on X• are classified by the simplicial Cech cohomology 3 group H (X•≥1 ; U (1)). Here X•≥1 denotes the truncation of X• through degrees ≥ 1. We sketch a proof of this proposition below. Let us first be clear about what we mean by the group H 3 (X•≥1 ; C∗ ). By this we mean that if U • is a good covering of X• , so that H ∗ (X• , U (1)) = H ∗ (U • , U (1)), then H ∗ (X•≥1 , U (1)) is the cohomology of the double complex C p (U q ; U (1)) with q in degrees ≥ 1. Also, by an isomorphism of simplicial bundle gerbes on X• we mean a stable isomorphism G ∼ = Q compatible with all the product structures on G and Q. We do not require that G ∼ = Q ⊗ δ(T ) for some bundle gerbe T on X0 . As noted above this is a restrictive definition but is sufficient for the cases we are interested in, such as the contractible space EG. ˇ Given a simplicial bundle gerbe G on X• , we associate to G a simplicial Cech cohomol3 • ogy class in H (X•≥1 , U (1)) as follows. For the good covering U on X• , let g = (gαβγ ) ˇ be a Cech cocycle representative for the Dixmier-Douady class of G. Then it is easy to see that the 2-cocycle d0∗ (g) d1∗ (g −1 )d2∗ (g) = δ(h) ˇ for some 1-cochain h = (hαβ ) on the covering U 2 . Then the simplicial Cech 1-cochain d0∗ (h)d1∗ (h−1 )d2∗ (h)d3∗ (h−1 ) ˇ on the cover U 3 is a 1-cocycle: it is a Cech representative for the first Chern class of the associator line bundle on X3 (the line bundle induced by the associator). Consequently, we must have d0∗ (h) d1∗ (h−1 )d2∗ (h) d3∗ (h−1 ) = δ(k) ˇ for some simplicial Cech 0-cochain k = (kα ) on U 3 . The cocycle condition for the associator section shows that we must have d0∗ (k) d1∗ (k −1 )d2∗ (k) d3∗ (k −1 )d4∗ (k) = 1 ˇ on U 4 . The triple (g, h, k) is a simplicial Cech cocycle in the truncated double complex p q≥1 C (U , U (1)) representing a class in H 3 (X•≥1 , U (1)). Conversely, such a simplicial ˇ Cech cocycle (d, h, k) in the truncated double complex C p (U q≥1 , U (1)) determines a unique isomorphism class of simplicial bundle gerbes on X• . If the cocycle (d, h, k) associated to a simplicial bundle gerbe G is trivial in H 3 (X•≥1 , U (1)), then there is a

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

599

stable isomorphism between G and the trivial simplicial bundle gerbe, consisting of a trivial bundle gerbe on X1 equipped with the trivial product structures. As an immediate consequence of Proposition 5.3, we see that isomorphism classes of ˇ simplicial bundle gerbes on BG• are classified by the simplicial Cech cohomology group H 3 (BG• , U (1)). Since the simplicial map EG[•] → BG• is a homotopy equivalence in each degree, we see from the spectral sequence above that it induces an isomorphism H 3 (BG• , U (1)) ∼ = H 3 (EG[•] ; U (1)). As we have already noted, isomorphism classes of simplicial bundle gerbes on EG[•] correspond exactly to isomorphism classes of bundle 2-gerbes on EG → BG and hence to H 4 (BG; Z). Given a principal G-bundle π : P → M, there exists a natural map gˆ : P [2] −→ G given by p1 · g(p ˆ 1 , p2 ) = p2 such that P [2] → P ×G sending (p1 , p2 ) to (p1 , g(p ˆ 1 , p2 ) is a diffeomorphism. Lemma 5.4. Given a principal G-bundle P → M and a multiplicative bundle gerbe G over G then there exists a bundle 2-gerbe over M of the form (Q, X; P , M) such that (Q, X, P [2] ) is the pull-back bundle gerbe gˆ ∗ G. Proof. Note that there exists a diffeomorphism P × Gn −→ P [n+1] given by

(p, g1 , g2 , · · · , gn ) → p, p · g1 , p · g1 g2 , · · · , p · (g1 g2 · · · gn ) .

The inverse of this map, composing with the projection to Gn , defines a simplicial map gˆ • : P• = {Pn = P [n+1] } −→ BG• = {BGn = Gn }, which can be used to pull back the simplicial bundle gerbe over BG• corresponding to G to a simplicial bundle gerbe Q over P• . This defines a bundle 2-gerbe (Q, Y ; P , M) over M of the required form. Given a multiplicative bundle gerbe G over G, applying the above Lemma 5.4 to the universal bundle π : EG → BG, we obtain a bundle 2-gerbe over BG of the form (gˆ ∗ G; EG, BG) with gˆ ∗ G a bundle gerbe over EG[2] obtained by the pull-back of G via gˆ : EG[2] → G. Conversely, we will show that every bundle 2-gerbe over BG is stably isomorphic to a bundle 2-gerbe of this form. Lemma 5.5. Every bundle 2-gerbe on BG is stably isomorphic to a bundle 2-gerbe of the form (Q, X; EG, BG), where (Q, X) is a bundle gerbe over EG[2] . Proof. We use the classifying theory of bundle 2-gerbes. It is well known that H 4 (M, Z) ∼ = [M; K(Z, 4)]. We use the iterated classifying space BBBU (1) (or B 3 U (1)) as a model for K(Z, 4) with a differential space structure constructed in [26]. In [26] Theorem H, it is shown that for a smooth manifold M, the group H 4 (M, Z) is isomorphic to the group of isomorphism classes of smooth principal B 2 U (1)-bundles over M.

600

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

We can transgress to the degree 4 class in H 4 (B 3 U (1), Z) to get a degree 3 class in 3 H (B 2 U (1), Z) which determines a multiplicative bundle gerbe over B 2 U (1). Then we [2]

apply the canonical map gˆ : EB 2 U (1) → B 2 U (1) to pull-back the corresponding [2] multiplicative bundle gerbe over B 2 U (1) to EB 2 U (1) . This gives rise to the universal bundle 2-gerbe Q˜ over B 3 U (1). The classifying bundle 2-gerbe then has the form Q˜ ⇓ [2] EB 2 U (1) ⇒ EB 2 U (1) ↓ B 3 U (1). As B 3 U (1) is 3-connected, and H 4 (B 3 U (1), Z) ∼ = Z, the tautological bundle 2-gerbe developed in [14] can be adapted to give another construction of such a classifying bundle 2-gerbe over B 3 U (1) associated to any integral class in H 4 (B 3 U (1), Z). Any bundle 2-gerbe on BG is defined by pulling back the universal bundle 2-gerbe by a classifying map ψ : BG → B 3 U (1). Consider the map π ◦ ψ : EG → B 3 U (1), where π is the projection in the universal G-bundle. By using the homotopy lifting ˆ property and the contractibility of EG we can always find a lift ψ, ψˆ

EG → EB 2 U (1) ↓ ↓ ψ

BG → B 3 U (1). Thus we can pull back the universal bundle 2-gerbe to get a bundle 2-gerbe of the form (Q, X; EG, BG), with (Q, X) a bundle gerbe over EG[2] . Lemma 5.6. Any bundle gerbe over EG[2] is stably isomorphic to gˆ ∗ G, where gˆ : EG[2] → G is the map satisfying e2 = e1 g(e ˆ 1 , e2 ) for (e1 , e2 ) ∈ EG[2] and G is some bundle gerbe over G. Proof. We may identify EG[2] with EG×G via the map (e1 , e2 ) → (e1 , g(e ˆ 1 , e2 )). Thus stable isomorphism classes of bundle gerbes over EG[2] are classified by H 3 (EG×G, Z) which, since EG is contractible, is equal to H 3 (G; Z). Under the identification we see that the Dixmier-Douady class of the bundle gerbe on EG[2] must be obtained from a class in H 3 (G, Z) by the map g. ˆ Proposition 5.7. Every bundle 2-gerbe on BG is stably isomorphic to a bundle 2-gerbe of the form (gˆ ∗ G, X; EG, BG) for a multiplicative bundle gerbe G over G. Proof. We start with any bundle 2-gerbe on BG. By Lemma 5.5 we may assume without loss of generality that it is of the form (Q, X; EG, BG). Next we use Lemma 5.6 to replace the bundle gerbe (Q, X, EG[2] ) with (gˆ ∗ G, gˆ ∗ Y, EG[2] ), where (G, Y, G) is now a bundle gerbe over G. Note that (Q, X; EG[2] ) and (gˆ ∗ G, gˆ ∗ Y, EG[2] ) is stably isomorphic. Using Lemma 4.7, we know that there exists a bundle 2-gerbe structure on (gˆ ∗ G, gˆ ∗ Y ; , EG, BG) which does not change the stable isomorphism class of the original bundle 2-gerbe on BG. For the bundle 2-gerbe (gˆ ∗ G, gˆ ∗ Y ; EG, BG) to be defined, gˆ ∗ G must be a simplicial bundle gerbe over EG• = {EGn = EG[n+1] }. Fix a point p ∈ EG, then pˆ : Gn → EG[n+1] with

p(g ˆ 1 , g2 , · · · , gn ) = p, p · g1 , p · g1 g2 , · · · , p · (g1 g2 · · · gn )

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

601

is a simplicial map between BG• = {BGn = Gn } and EG• . Then G ∼ = pˆ ∗ ◦ gˆ ∗ G implies that G is a simplicial bundle gerbe over BG• . Hence, G is a multiplicative bundle gerbe over G. Theorem 5.8. Given a bundle gerbe G over G, G is multiplicative if and only if its Dixmier-Douady class is transgressive, i.e., in the image of the transgression map τ : H 4 (BG, Z) → H 3 (G, Z). Proof. Given a multiplicative bundle gerbe G over G, applying Lemma 5.4 to the universal bundle π : EG → BG, we obtain a bundle 2-gerbe Q over BG of the form (gˆ ∗ G; EG, BG) with gˆ ∗ G a bundle gerbe over EG[2] obtained by the pull-back of G via gˆ : EG[2] → G. The stable isomorphism class of (gˆ ∗ G; EG, BG) defines a class φ ∈ H 4 (BG, Z). We now show that the transgression of φ under the map τ : H 4 (BG, Z) → H 3 (G, Z) is the Dixmier-Douady class of G, hence transgressive. The class φ defines a homotopy class in [BG, K(Z, 4)] such that the bundle 2-gerbe Q is stably isomorphic to a bundle 2-gerbe obtained by pulling back the universal bundle 2-gerbe Q˜ over B 3 U (1) = K(Z, 4) via a classifying map ψ : BG → K(Z, 4) of φ and the commutative diagram ψˆ

EG → EK(Z, 3) ↓ ↓ ψ

BG → K(Z, 4). This commutative diagram gives rise to a homotopy class of maps ψˆ : G → K(Z, 3) which determines a class cψ ∈ H 3 (G, Z) ∼ = [G, K(Z, 3)]. Using the long exact sequence for homotopy groups for the K(Z, 3)-fibration EK(Z, 3) → BK(Z, 3) = K(Z, 4) and the Hurewicz theorem for 3-connected spaces, we know that the transgression map H 4 (BK(Z, 3), Z) → H 3 (K(Z, 3), Z) sends the generator of H 4 (BK(Z, 3), Z) to the generator of H 3 (K(Z, 3), Z). Therefore, we obtain that cψ = τ (φ), as φ ∈ H 4 (BG, Z) ∼ = [BG, K(Z, 4)] is defined by pulling back the generator of H 4 (BK(Z, 3), Z) via the classifying map ψ. As bundle gerbes over EG[2] , Q = gˆ ∗ G is stably isomorphic to the pull-back bundle gerbe (ψˆ [2] )∗ Q˜ via the map ψˆ [2] : EG[2] → EK(Z, 3)[2] . This implies that the Dixmier-Douady class of gˆ ∗ G is given by the homotopy class of the map EG[2]

ψˆ [2]

/ EK(Z, 3)[2]

/ K(Z, 3).

As gˆ ∗ induces an isomorphism H 3 (G, Z) → H 3 (EG[2] , Z), putting all these together, we know that the Dixmier-Douady class of G is given by cψ = τ (φ), hence, transgressive. Conversely, suppose G is a bundle gerbe over G whose Dixmier-Douady class is a transgressive class τ (c) ∈ H 3 (G, Z) for a class c ∈ H 4 (BG, Z). By Proposition 5.7 we may, without loss of generality, realise c by a bundle 2-gerbe (Qc , X; EG, BG) over

602

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

BG from a multiplicative bundle gerbe Gc over G such that Qc = gˆ ∗ Gc . The above argument shows that the Dixmier-Douady class of Gc is also given by τ (c). Therefore, the bundle gerbe G is stably isomorphic to the multiplicative bundle gerbe Gc . Then the multiplicative structure on Gc , mc : d0∗ Gc ⊗ d2∗ Gc → d1∗ Gc , induces a multiplicative structure on G m : d0∗ G ⊗ d2∗ G → d1∗ G, as a stable isomorphism in the bi-category BGrbG2 . The associator for (G, mG ) (see (4.2)) φ : d2∗ m ◦ (d0∗ m ⊗ I d) → d1∗ m ◦ (I d ⊗ d3∗ m) is also induced by the corresponding associator φc for (Gc , mc ) in the bi-category BGrbG3 . The coherent condition for φ (see (4.3)) follows from the corresponding coherent condition for φc in the bi-category BGrbG4 . Hence, the bundle gerbe G over G, whose Dixmier-Douady class is transgressive, is multiplicative. Note that there exists a simpler proof of Theorem 5.8 by applying Proposition 5.2 and an observation that H 4 (BG, Z) classifies the isomorphism classes of simplicial bundle gerbe over BG• . The constructive proof for Theorem 5.8 is given so that the proof can be adopted to establish the following theorem, which is the refinement of Theorem 5.8 for the correspondence from three dimensional Chern-Simons gauge theories with gauge group G to Wess-Zumino-Witten models on the group manifold G. The proof will be postponed until after we discuss Chern-Simons bundle 2-gerbes. Theorem 5.9. Denote by : CS(G) −→ W ZW (G) ∼ = H 2 (G, D2 ) the correspondence map and let G be a bundle gerbe over G with connection and curving, whose Deligne class d(G) is in H 2 (G, D2 ). Then d(G) ∈ I m(), if and only if G is multiplicative. Remark 5.10. Note that the transgression map H 4 (BSO(3), Z) → H 3 (SO(3), Z) sends the generator of H 4 (BSO(3), Z) to twice of the generator of H 3 (SO(3), Z). Hence, only those bundle gerbes G2k over SO(3), with even Dixmier-Douady classes in H 3 (SO(3),Z) ∼ = Z are multiplicative. 6. The Chern-Simons Bundle 2-Gerbe In this section, we will construct a Chern-Simons bundle 2-gerbe over BG such that the proof of the main theorem (Theorem 5.9) follows. Given a principal G-bundle P with a connection A over M, a Chern-Simons gauge theory c ∈ CS(G) defines a degree 3 Deligne characteristic class c(P , A) ∈ H 3 (M, D3 ). We will construct a bundle 2-gerbe with connection and curving over M corresponding to the Deligne class c(P , A). We shall call this a Chern-Simons bundle 2-gerbe. We define it in terms of a universal Deligne characteristic class represented by the universal Chern-Simons bundle 2-gerbe.

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

603

In order to use differential forms on BG we need to fix a smooth infinite dimensional model of BG by embedding G into U (N ) and letting EG be the Stiefel manifold of N orthonormal vectors in a separable complex Hilbert space. Alternatively, we could choose a smooth finite dimensional n-connected approximation Bn of the classifying space BG as in [40]. Given a principal G-bundle P with a connection A over M and an integer n > max{5, dimM}, there is a choice of n-connected finite dimensional princi¯ is a classifying space of (P , A). pal G-bundle En with a connection A¯ such that (En , A) k k ∼ In particular H (B, Z) = H (BG, Z) for k ≤ n. For convenience, we suppress this latter detail and work directly on the infinite dimensional smooth model of EG → BG. Note that there exists a connection A on the universal bundle EG over BG and a classifying map f for (P , A) such that (P , A) = f ∗ (EG, A), which implies c(P , A) = f ∗ c(EG, A) with c(EG, A) ∈ H 3 (BG, D3 ) is the Deligne characteristic class for (EG, A). From the commutative diagram: H 3 (BG, D3 ) −→ H 2 (G, D2 ) ↓c ↓c 4 3 H (BG, Z) → H (G, Z), where the vertical maps are the characteristic class maps on Deligne cohomology groups, we see that Theorem 5.9 refines the result in Theorem 5.8. Given a class φ ∈ H 4 (BG, Z), denote by the corresponding G-invariant polynomial on the Lie algebra of G. Then associated to a connection A on a principal G-bundle (with curvature FA ) there is a closed 4-form (

i FA ) ∈ 4cl,0 (M) 2π

i i FA ) = f ∗ ( FA ), and 2π 2π

i φ, ( FA ) ∈ H 4 (BG, Z) × 4cl,0 (BG) 2π determines a unique Deligne class with integer periods. In fact, (

cφ (EG, A) ∈ H 3 (BG, D3 ). Hence cφ (P , A) = f ∗ (cφ (EG, A)) ∈ H 3 (M, D3 ). So we can say that a class φ ∈ H 4 (BG, Z) defines a canonical Chern-Simons gauge theory cφ with gauge group G. From the exact sequence for H 3 (BG, D3 ), 0 → 3 (BG)/ 3cl,0 (BG) → H 3 (BG, D3 ) → H 4 (BG, Z) → 0, any Chern-Simons gauge theory with the same characteristic class φ in H 4 (BG, Z) differs from cφ by a Deligne class [1, 0, 0, C] for a 3-form C on BG. Note that [1, 0, 0, C] defines a trivial bundle 2-gerbe over BG, so in this section, we only construct the universal Chern-Simons bundle 2-gerbe over BG corresponding to the canonical ChernSimons gauge theory cφ with gauge group G. Notice that for two different connections A1 and A2 on the universal bundle EG, cφ (EG, A1 ) − cφ (EG, A2 ) = [(1, 0, 0, CSφ (A1 , A2 ))],

604

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

where CSφ (A1 , A2 ) is the Chern-Simons form on BG associated to a pair of connections A1 and A2 on EG (cf. [17]). Remark 6.1. Recall [16] that associated with each principal G-bundle P with connection A Cheeger and Simons constructed a differential character S,φ (P , A) ∈ Hˇ 3 (M, U (1)), where ∈ I 2 (G) is a G-invariant polynomial on its Lie algebra and φ ∈ H 4 (BG, Z) is a characteristic class corresponding to under the Chern-Weil homomorphism. This differential character is uniquely defined when it satisfies the following: 1. The image of S,φ (P , A) under the curvature map Hˇ 3 (M, U (1)) → 4cl,0 (M, R) i FA ), where FA is the curvature form of A. 2π 2. The image of S,φ (P , A) under the characteristic class map Hˇ 3 (M, U (1)) → H 4 (M, Z) is φ(P ), the characteristic class of P associated to φ. 3. The assignment of S,φ (P , A) to (P , A) is natural with respect to morphisms of principal G-bundles with connection. is (

Since differential characters and bundle 2-gerbes with connection and curving on M are both classified by the Deligne cohomology group H 3 (M, D3 ), our Chern-Simons bundle 2-gerbe is a bundle gerbe version of the Cheeger-Simons invariant described above. Given a connection A on π : EG → BG, the canonical Chern-Simons gauge theory cφ associated to a class φ ∈ H 4 (BG, Z) defines a universal Deligne characteristic class i cφ (EG, A) from the pair (φ, ( FA )). Then there is a Chern-Simons form CSφ (A) 2π associated to cφ and (EG, A), satisfying

i dCSφ (A) = π ∗ ( FA ) , 2π

(6.1)

and the restriction of CSφ (A) to a fiber of EG → BG determines a left-invariant closed 3-form ωφ on G. This universal Chern-Simons form CSφ (A) can be constructed as in [24] via the pull-back principal G-bundle π ∗ EG → EG, which admits a section e → (e, e). This trivialisation defines a trivial connection A0 on π ∗ EG. Then π ∗ A and A0 defines a path of connections At = tπ ∗ A + (1 − t)A0 for 0 ≤ t ≤ 1, which can be thought of as a connection on [0, 1]×π ∗ EG → [0, 1]×EG. Define i ( FAt ). (6.2) CSφ (A) = 2π [0,1] Then the relation (6.1) follows from Stokes’ theorem for the projection [0, 1] × EG → i EG and ( 2π FA0 ) = 0 for A0 a trivial connection.

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

605

Remark 6.2. The corresponding left-invariant closed 3-form ωφ on G is an integer multiple of the standard 3-form < θ, [θ, θ ] > where θ is the left-invariant Maurer-Cartan form on G and < , > is the symmetric bilinear form on the Lie algebra of G defined by ∈ I 2 (G). Given the fibration EG → BG, we introduce the natural map gˆ : EG[2] → G defined by e2 = e1 · g(e ˆ 1 , e2 ), where (e1 , e2 ) ∈ EG[2] . Definition 6.3. The universal Chern-Simons bundle 2-gerbe Qφ associated to φ ∈ H 4 (BG, Z) is a bundle 2-gerbe (Qφ , EG[2] ; EG, BG) illustrated by the following diagram: G (G, ωφ ) o7 gˆ oooo ooo ooo π1 // (EG, CS (A)) φ EG[2] Qφ

π2

π

(BG, (

i FA )), 2π

where the bundle gerbe Qφ over EG[2] is obtained from the pull-back of a multiplicative bundle gerbe G over G associated to τ (φ) ∈ H 3 (G, Z). Given a connection A on EG, the bundle gerbe Qφ over EG[2] is equipped with a connection whose bundle gerbe curvature is given by (π2∗ − π1∗ )CSφ (A), with CSφ (A) the Chern-Simons form (6.2) on i EG associated to φ and A, and the bundle 2-gerbe curvature is given by ( FA ). 2π Moreover the bundle gerbe G over G is equipped with a connection and curving with the bundle gerbe curvature given by ωφ . Proposition 6.4. Given a class φ ∈ H 4 (BG, Z), there exists a universal Chern-Simons bundle 2-gerbe over BG associated to a connection A on EG. Proof. From the proof of Lemma 5.5, Proposition 5.7 and Theorem 5.8, we can represent a class φ ∈ H 4 (BG, Z) by a bundle 2-gerbe (Qφ , X; EG, BG) over BG which is associated to the universal G-bundle EG → BG and to a multiplicative bundle gerbe G over G, and is such that Qφ is stably isomorphic to gˆ ∗ G for gˆ : EG[2] → G. To complete the proof we have to equip (Qφ , X; EG, BG) with a bundle 2-gerbe connection and curving. Let g : P → G be a gauge transformation on P . Denote by Ag the connection on EG such that Ag (e) = A(e · g). Then by direct calculation, we know that CSφ (Ag ) = CSφ (A) + g ∗ ωφ − d(A, dg · g −1 ),

606

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

where CSφ (A) is the universal Chern-Simons form (6.2) associated to φ and (EG, A), i and the left-invariant closed 3-form ωφ on G (the transgression of ( FA )). Then 2π under the map π2∗ − π1∗ , CSφ (A) is mapped to a closed 3-form on EG[2] with periods in Z. To see this note that (π2∗ − π1∗ )CSφ (A) is closed, as d(π2∗ − π1∗ )CSφ (A) = (π2∗ − π1∗ ) ◦ π ∗ (

i FA ) = 0, 2π

and to confirm that (π2∗ − π1∗ )CSφ (A) has its periods in Z, we take a 3-cycle σ in EG[2] , and form a 4-cycle in EG given by gluing two chains in EG with boundi ary π2 (σ ) and −π1 (σ ) as EG is contractible. Then ( FA ) ∈ 4cl,0 (BG) implies 2π that (π2∗ − π1∗ )CSφ (A) ∈ 3cl,0 (EG[2] ). Actually, we have an explicit expression for (π2∗ − π1∗ )CSφ (A), for (e1 , e2 ) ∈ EG[2] : (π2∗ − π1∗ )CSφ (A)(e1 , e2 ) = CSφ (A)(e2 ) − CSφ (A)(e1 ) = CSφ (A)(e1 · g(e ˆ 1 , e2 )) − CSφ (A)(e1 ) = gˆ ∗ ωφ − d(A, d gˆ · gˆ −1 ), from which we can see that ωφ is a closed 3-form ωφ on G with periods in Z. Hence, we can choose a bundle gerbe connection and curving on the multiplicative bundle gerbe G such that the bundle gerbe curvature is given by ωφ . Moreover, we can choose a bundle gerbe connection and curving on (Q, X, EG[2] ) whose bundle gerbe curvature is given by (π2∗ − π1∗ )CSφ (A), hence the bundle 2-gerbe curving is given by the universal Chern-Simons form CSφ (A). On the Deligne cohomology level, we obtain a degree 2 Deligne class in H 2 (G, D2 ) associated to the multiplicative bundle gerbe G with connection and curving over G and i to the degree 3 Deligne class in H 3 (BG, D3 ) determined by (φ, ( FA )), the Deligne 2π class for the bundle 2-gerbe Q over BG with connection and curving. This completes the proof of the existence of a universal Chern-Simons bundle 2-gerbe associated to φ and (EG, A). The upshot of all this is that the universal Chern-Simons bundle 2-gerbe over BG gives a geometric realization of the correspondence between three dimensional ChernSimons gauge theories and Wess-Zumino-Witten models associated to G, from which the proof of Theorem 5.9 immediately follows. Let P → M be a principal G-bundle with connection A. Let f : M → BG be a classifying map for this bundle with connection. This means that f ∗ (EG, BG) ∼ = (P , M) and there exists a connection A on EG such that f ∗ A = A. Definition 6.5. For the Chern-Simons gauge theory canonically defined by a class φ ∈ H 4 (BG, Z), the Chern-Simons bundle 2-gerbe Qφ (P , A) associated with the principal G-bundle P with connection A over M is defined to be the pullback of the universal Chern-Simons bundle 2-gerbe by the classifying map f of (P , A). For a principal G-bundle P → M with a connection A, the corresponding ChernSimons form for (P , A) corresponding to the class φ ∈ H 4 (BG, Z) is CSφ (A) = f ∗ CSφ (A) ∈ 3 (P ),

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

607

such that dCSφ (A) = π ∗ (

i FA ) ∈ 4cl,0 (M). 2π

Hence, the curvature of the Chern-Simons bundle 2-gerbe Qφ (P , A) associated to (P , A) i over M is given by ( FA ) and its bundle 2-gerbe curving is given by the Chern2π Simons form CSφ (A). Remark 6.6. We can connect our approach to the familiar Chern-Simons action in the physics literature when G = SU (N ). We have a Deligne class constructed using the pullback construction from the universal Chern-Simons gauge theory cφ (P , A) ∈ H 3 (M, D3 )

(hol,curv)

−→

Hˇ (M, U (1)).

Therefore for any smooth map σ from a closed 3-dimensional manifold Y to M, the holonomy of cφ (P , A) associated to σ is given by e2πiCSφ (σ ;A) = hol(cφ (P , A))(σ ).

(6.3)

See also [24] for similar constructions. Now the Chern-Simons functional CSφ (σ, A) will be the familiar formula when = cw−1 (r(φ)) is the second Chern polynomial for SU (N ). Fix a trivialisation of (σ ∗ P , σ ∗ A) over Y , we can write the level k Chern-Simons functional as k 1 CS(σ, A) = T r(σ ∗ A ∧ σ ∗ dA + σ ∗ A ∧ σ ∗ A ∧ σ ∗ A). 2 8π Y 3 Theorem 6.7. With the canonical isomorphism between the Deligne cohomology and Cheeger-Simons cohomology, the Chern-Simons bundle 2-gerbe Qφ (P , A) is equivalent in Deligne cohomology to the Cheeger-Simons invariant S,φ (P , A) described in Remark 6.1. Proof. It is well known that Cheeger-Simons differential characters are classified by Deligne cohomology (see, for example, [7]). Stable isomorphism classes of bundle 2gerbes with connection and curving are also classified by Deligne cohomology (Proposition 4.8). The theorem of Cheeger and Simons given above defines a unique differential character satisfying certain conditions, thus it uniquely defines a class in Deligne cohomology and an equivalence class of bundle 2-gerbes with connection and curving, so we must show that the corresponding Deligne class associated to the Chern-Simons bundle 2-gerbe satisfies the required conditions. 1. The image under the map H 3 (M, D3 ) → 40 (M, R) is the curvature 4-form of the Chern-Simons bundle 2-gerbe. The curvature of the universal CS bundle 2-gerbe is i i given by ( FA ). Under pullback this becomes ( FA ). 2π 2π 2. The image of the map H 3 (M, D3 ) → H 4 (M, Z) is the characteristic 4-class of a bundle 2-gerbe. For the Chern-Simons bundle 2-gerbe, this shall be the pull-back of the 4-class of the universal Chern-Simons bundle 2-gerbe by the classifying map, which is by construction φ ∈ H 4 (BG, Z).

608

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

3. If a principal G-bundle P1 with connection A1 , is related to another principal Gbundle with connection (P2 , A2 ), via a bundle morphism ψ then the corresponding classifying maps are related by fP1 ,A1 = ψ ◦ fP2 ,A2 , so using both sides to pull back the universal CS bundle 2-gerbe we see that their corresponding Deligne classes behave as their Cheeger-Simons invariants S,φ (P1 , A1 ) = ψ ∗ S,φ (P2 , A2 ). Recall that for a connection on a line bundle over M, a gauge transformation is given by a smooth function M → U (1), and an extended gauge transformation for a bundle gerbe with connection and curving is given by a line bundle with connection over M. We can discuss extended gauge transformations for a bundle 2-gerbe with connection and bundle 2-gerbe curving over M. The Chern-Simons bundle 2-gerbe Qφ (P , A) is equipped with a bundle 2-gerbe connection and curving such that the 2-curving is given by the Chern-Simons form CSφ (A) with dCSφ (A) = π ∗ (

i FA ) ∈ 4cl,0 (M). 2π

Choose a covering {Ui } of M, such that over Ui , π : P → M admits a section si ; then ˇ we obtain a Cech representative of the Deligne class corresponding to Qφ (P , A) cφ (P , A) = [(gij kl , Aij k , Bij , Ci )] with Ci = si∗ (CSφ (A)). These local 3-forms {Ci } are called the ‘C-field’ in string theory. We can say that our Chern-Simons bundle 2-gerbe Qφ (P , A) carries the ChernSimons form as the ‘C-field’. As the bundle 2-gerbe curving (‘C-field’) is not uniquely determined, different choices are related by an extended gauge transformation. Suppose that (gij kl , Aij k , Bij , Ci ) represents the Deligne class of Qφ (P , A), then adding a term (1, 0, 0, ω) with a closed 3-form of integer period ω ∈ 3cl,0 (M) doesn’t change the Deligne class, following from the exact sequence (2.1). If M is 2-connected, we know that ω ∈ 3cl,0 (M) canonically defines a bundle gerbe with connection and curving over M whose bundle gerbe curvature is given by ω. We call this bundle gerbe an extended gauge transformation of the Chern-Simons bundle 2-gerbe. Note that given another connection A on π : P → M, the Chern-Simons bundle 2-gerbe Qφ (P , A ) is stably isomorphic to Qφ (P , A) as bundle 2-gerbes over M ( they have the same characteristic class determined by φ). Qφ (P , A) and Qφ (P , A ) have different bundle 2-gerbe curving, on the level of Deligne cohomology, the difference is given by cφ (P , A) − cφ (P , A ) = [(1, 0, 0, CSφ (A, A ))], where CSφ (A, A ) is the well-defined Chern-Simons 3-form on M associated to a straight line path of connections on P connecting A and A , (we recall that CSφ (A) is well-defined only on P in general). It is natural (cf.[18]) to define the so-called C-field on M to be a pair (A, c), where A is a connection on P and c ∈ 3 (M). So the space of C-fields is AP × 3 (M), where AP is the space of connections on π : P → M. A C-field (A, c) canonically defines a degree 3 Deligne class cφ (P , A) + [(1, 0, 0, c)] ∈ H 3 (M, D3 )

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

609

through the Deligne class cφ (P , A) of the Chern-Simons bundle 2-gerbe Qφ (P , A). The gauge transformation group for the space of C-fields is defined to be 1 (M, adP ) × H 2 (M, D2 ), with the action on C-fields given by

(α, D) · (A, c) = A + α, c + CSφ (A, A + α) + curv(D) ,

(6.4)

where (α, D) ∈ 1 (M, adP ) × H 2 (M, D2 ), and curv(D) ∈ 3cl,0 (M) is the curvature of the degree 2 Deligne class D (or the corresponding bundle gerbe with connection and curving) on M. Then it is easy to see that two C-fields that are gauge equivalent under (6.4) define the same degree 3 Deligne class on M through the Chern-Simons bundle 2-gerbe, hence, the same Cheeger-Simons differential character on M. Remark 6.8. Denote by G(P group of P which acts on AP × ) the gauge transformation

3 (M) via g · (A, c) = Ag , c + CSφ (A, Ag ) . Due to the fact that CSφ (A, Ag1 g2 ) − CSφ (A, Ag2 ) − CSφ (Ag2 , Ag1 g2 ) depends on the choice of A, this G(P )-action is not a group action. It is observed in [18] that if one interprets the space of C-fields with gauge group action (6.4) as an action groupoid, then G(P )-action is a sub-groupoid action. 7. Multiplicative Wess-Zumino-Witten Models In this section, we study the Wess-Zumino-Witten models in the image of the correspondence from CS(G) to W ZW (G). Let G be a bundle gerbe with connection and curving over a compact Lie group G, whose Deligne class is in H 2 (G, D2 ). With the identification between the Deligne cohomology and the Cheeger-Simons cohomology (2.3), (hol, curv) : H 2 (G, D2 ) → Hˇ 2 (G, U (1)), where hol and curv are the holonomy and the curvature maps for the Deligne cohomology, we will define the bundle gerbe holonomy for stable equivalence classes of bundle gerbes with connection and curving. Let σ : → G be a smooth map from a closed 2-dimensional surface to G. σ represents a smooth 2-cocycle in Z2 (G, Z). Define the holonomy of the bundle gerbe G over G to be the holonomy of the corresponding Deligne class in H 2 (G, D2 ), denoted by holG , then holG (σ ) ∈ U (1), is called the bundle gerbe holonomy holG (·), evaluated on σ : → G. We point out that as H 2 (G, D2 ) classifies stable isomorphism classes of bundle gerbes over G with connection and curving, our bundle gerbe holonomy holG (σ ) depends only on the stable isomorphism class of G.

610

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

Proposition 7.1. For a multiplicative bundle gerbe G with connection and curving over G, the bundle gerbe holonomy satisfies the following multiplicative property: holG (σ1 · σ2 ) = holG (σ1 ) · holG (σ2 )

(7.1)

for any pair of smooth maps (σ1 , σ2 ) from any closed surface to G. Here σ1 · σ2 denotes the smooth map from to G obtained from the pointwise multiplication of σ1 and σ2 with respect to the group multiplication on G. Proof. Recall our correspondence map in Definition 3.3 and our integration map (2.4). We constructed from a canonical G-bundle over S 1 × G in Definition 3.3 such that the bundle gerbe holonomy for the multiplicative bundle gerbe G with connection and curving over G, being in the image of , corresponds to the bundle 2-gerbe holonomy for the Chern-Simons bundle 2-gerbe Q over S 1 × G as follows. Given a smooth map σ : → G, I d × σ defines a smooth map S 1 × → S 1 × G, and holG (σ ) = H olQ (I d × σ ),

(7.2)

where H olQ denotes the bundle 2-gerbe holonomy for the Chern-Simons bundle 2-gerbe over S 1 × G. Given a pair of smooth maps σ1 and σ2 from any closed surface to G, denote by 0,3 a fixed sphere with three holes. We can construct a flat G-bundle over 0,3 × with boundary orientation given in such a way that the usual holonomies for flat G-bundle are σ1 , σ2 and σ1 · σ2 respectively. The Chern-Simons bundle 2-gerbe associated to this flat G-bundle is a flat bundle 2-gerbe in the sense that the bundle 2-gerbe holonomy is a homotopy invariant (as follows from the exact sequence (2.2)). This implies that the bundle 2-gerbe holonomies for this flat Chern-Simons bundle 2-gerbe satisfy H olQ (I d × σ1 · σ2 ) = H olQ (I d × σ1 ) · H olQ (I d × σ2 ).

(7.3)

Combining (7.2) and (7.3), we obtain the multiplicative property for the bundle gerbe holonomy of G: holG (σ1 · σ2 ) = holG (σ1 ) · holG (σ2 ) for any pair of smooth maps (σ1 , σ2 ) from any closed surface to G.

Recall that di (i = 0, 1, 2) are the face maps from G×G → G such that d0 (g1 , g2 ) = g2 , d1 (g1 , g2 ) = g1 g2 and d2 (g1 , g2 ) = g1 for (g1 , g2 ) ∈ G. Let G be a bundle gerbe with connection and curving. Let curv(G) be the bundle gerbe curvature of G. We can consider the pair (σ1 , σ2 ) as a map into G × G. Hence we can define the holonomy of (σ1 , σ2 ) with respect to the bundle gerbe connection and curving on δ(G) = d0∗ (G) ⊗ d1∗ (G ∗ ) ⊗ d2∗ (G), whose curvature is given by d0∗ (curv(G)) − d1∗ (curv(G)) + d2∗ (curv(G)) = dB, for a 2-form B on G × G. Because we have a trivialisation for δ(G), we can calculate this holonomy as exp (σ1 , σ2 )∗ B.

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

611

On the other hand we can compose (σ1 , σ2 ) with the three maps into G. This gives σ1 , σ1 · σ2 and σ2 where the second of these maps is the result of pointwise multiplying in G. Because the bundle gerbe connection and curving on δ(G) = d0∗ (G) ⊗ d1∗ (G ∗ ) ⊗ d2∗ (G) are δ of those on G, then the bundle gerbe holonomy can also be calculated as holG (σ1 )holG (σ1 σ2 )−1 holG (σ2 ) so that we have

holG (σ1 )holG (σ2 ) = exp (σ1 , σ2 )∗ B holG (σ1 · σ2 ).

The following proposition gives a necessary condition for a bundle gerbe with connection and curving to be multiplicative, and can be proved by direct calculation. Proposition 7.2. If a bundle gerbe G with connection and curving on G is multiplicative, then d0∗ G ⊗ d2∗ G is stably isomorphic to d1∗ G as bundle gerbes over G × G, and there exist a imaginary valued 2-form B on G × G such that d0∗ (curv(G)) − d1∗ (curv(G)) + d2∗ (curv(G)) = dB, (σ1 , σ2 )∗ B ∈ 2πiZ,

for any pair of smooth maps from any closed surface to G. For a Wess-Zumino-Witten model with group manifold G in the image of the correspondence map : CS(G) → W ZW (G), we know that the bundle gerbe G with connection and curving on G is multiplicative. The Wess-Zumino-Witten action regarded as a function on the space of smooth maps {σ : → G} exponentiates to the bundle gerbe holonomy of G, that is, for a smooth map σ , we have

exp Swzw (σ ) = holG (σ ). From Proposition 7.1, we know that the Wess-Zumino-Witten action for a multiplicative Wess-Zumino-Witten model satisfies the following property:

exp Swzw (σ1 · σ2 ) = exp Swzw (σ1 ) · exp Swzw (σ2 ) , for a pair of smooth maps σ1 and σ2 from any closed surface to G. From the commutative diagram: H 3 (BG, D3 ) −→ H 2 (G, D2 ) ↓c ↓c 4 3 H (BG, Z) → H (G, Z) we see that for a general compact semi-simple Lie group G, H 3 (BG, D3 ) → H 2 (G, D2 ) is not surjective. In particular, for non-simply connected compact semi-simple Lie group G, we know that the Wess-Zumino-Witten model on G is only multiplicative at certain levels. For example, the Wess-Zumino-Witten model on SO(3) is multiplicative if and only if the Dixmier-Douady class of the corresponding bundle gerbe is an even class in H 3 (SO(3), Z) ∼ = Z.

612

A.L. Carey, S. Johnson, M.K. Murray, D. Stevenson, B.-L. Wang

References 1. Aschieri, P., Jurˇco, B.: Gerbes. M5-brane anomalies and E8 Gauge theory. JHEP 0410, 068 (2004) 2. Atiyah, M., Bott, R.: The Yang-Mills equations over Riemann surfaces. Philos. Trans. Roy. Soc. London Ser. A 308(1505), 523–615 (1983) 3. Bismut, J-M., Freed, D.: The analysis of elliptic families. I. Metrics and connections on determinant bundles. Commun. Math. Phys. 106(1), 159–176 (1986). The analysis of elliptic families. II. Dirac operators, eta invariants, and the holonomy theorem. Commun. Math. Phys. 107(1), 103–163 (1986) 4. Bouwknegt, P., Carey, A., Mathai, V., Murray, M., Stevenson, D.: Twisted K-theory and K-theory of bundle gerbes. Commun. Math. Phys. 228(1), 17–45 (2002) 5. Breen, L.: On the classification of 2-gerbes and 2-stacks. Asterisque No. 225, 1994 6. Brylinski, J-L.: Loop spaces, Characteristic Classes and Geometric Quantization. Boston, MA: Birkhuser Boston, Inc., 1993 7. Brylinski, J-L.: Differentiable Cohomology of Gauge Groups. http://arxiv.org/list/math.DG/ 0011069, 2000 8. Brylinski, J-L.: Geometric construction of Quillen line bundles. In: Advances in Geometry, Progr. Math. 172, Basel-Boston: Birkh¨auser, 1999, pp. 107–146 9. Brylinski, J-L., McLaughlin, D. A.: The converse of the Segal-Witten reciprocity law. Internat. Math. Res. Notices, 8, 371–380 (1996) 10. Brylinski, J-L., McLaughlin, D. A.: The geometry of degree-four characteristic classes and of line bundles on loop spaces. I. Duke Math. J. 75(3), 603–638 (1994). The geometry of degree-4 characteristic classes and of line bundles on loop spaces. II. Duke Math. J. 83(1), 105–139 (1996) 11. Carey, A. L., Mickelsson, J.: The universal gerbe, Dixmier-Douady class, and gauge theory. Lett. Math. Phys. 59(1), 47–60 (2002) 12. Carey, A.L., Mickelsson, J., Murray, M.K.: Bundle Gerbes Applied to Quantum Field Theory. Rev.Math.Phys. 12, 65–90 (2000) 13. Carey, A.L., Murray, M.K., Johnson, S.: Holonomy on D-Branes. http://arxiv.org/list/ hep-th/0204199, 2002 14. Carey, A., Murray, M.K., Wang, B.L.: Higher bundle gerbes and cohomology classes in gauge theory. J. Geom. Phys. 21, 183–197 (1997) 15. Carey, A.L., Wang, B.L.: The Universal Gerbe and Local Family Index Theory. http://arxiv.org/list/ math.DG/0407243, 2004 16. Cheeger, J., Simons, J.: Characteristic forms and geometric invariants. In: Geometry and topology. Lecture Notes in Mathematics 1167, Berlin-Heidelberg-New York: Springer, 1985 17. Chern, S. S., Simons, J.: Characteristic forms and geometric invariants. Ann. of Math. 99, 48–69 (1974) 18. Diaconescu, E., Moore, G., Freed, D.: The M-theory 3-form and E8 gauge theory. http://arxiv.org/list/hep-th/0312069, 2003 19. Dupont, J. L.: Curvature and characteristic classes. Lecture Notes in Mathematics, Vol. 640. Lecture Notes in Math., Berlin: Springer, 1978 20. Dupont, J., Johansen, F. L.: Remarks on determinant line bundles, Chern-Simons forms and invariants. Math. Scand. 91(1), 5–26 (2002) 21. Dupont, J., Kamber, F.: Gerbes, simplicial forms and invariants for families of foliated bundles. Commun. Math. Phys. 253, 253–282 (2004) 22. Dupont, J. L., Ljungmann, R.: Integration of simplicial forms and Deligne cohomology. http:// arxiv.org/list/math.DG/0402059, 2004 23. Dijkgraaf, R., Witten, E.: Topological Gauge Theories and Group Cohomology. Commun. Math. Phys. 129, 393–429 (1990) 24. Freed, D.: Classical Chern-Simons theory, Part 1. Adv. Math. 113, 237–303 (1995); Classical ChernSimons theory, Part 2. Special issue for S. S. Chern. Houston J. Math. 28(2), 293–310 (2002) 25. Freed, D.: Higher algebraic structures and quantization. Commun. Math. Phys. 159(2), 343–398 (1994) 26. Gajer, P.: Geometry of Deligne cohomology. Invent. Math. 127, 155–207 (1997) 27. Gawedzki, K.: Topological actions in two-dimensional quantum field theory . In: Nonperturbative Quantum Field Theories, ’t Hooft, G., Jaffe, A., Mack, G., Mitter, P. K., Stora, R. (eds.) NATO Series Vol. 185, London-New York: Plenum Press 1988, pp. 101–142 28. Gawedzki, K., Reis, N.: WZW branes and gerbes. Rev. Math. Phys. 14(12), 1281–1334 (2002) 29. Giraud, J.: Cohomologie non ablienne. Berlin-New York: Springer-Verlag, 1971 30. Gomi, K.: Connections and curvings on lifting bundle gerbes. J. London Math. Soc. (2) 67(2), 510–526 (2003) 31. Gomi, K.: The formulation of the Chern-Simons action for general compact Lie groups using Deligne cohomology. J. Math. Sci. Univ. Tokyo 8(2), 223–242 (2001)

Bundle Gerbes for Chern-Simons and Wess-Zumino-Witten Theories

613

32. Gomi, K.: Gerbes in classical Chern-Simons theory. http://arxiv.org/list/hep-th/0105072, 2001 33. Hopkins, M.J., Singer, I.M.: Quadratic functions in geometry, topology,and M-theory. http:// arxiv.org/list/math.AT/0211216, 2002 34. Johnson, S. : Constructions with Bundle Gerbes. Thesis, University of Adelaide. http://arxiv.org/ list/math.DG/0312175, 2003 35. Kelly, G. M., Street, R.: Review of the elements of 2-categories. In: Category Seminar (Proc. Sem., Sydney 1972/73), Lecture Notes in Mathematics, Vol. 420, Berlin: Springer, 1974, pp. 75–103 36. Moore, G., Seiberg, N.: Taming the conformal Zoo. Phys. Lett. B 220(3), 422–430 (1989) 37. Murray, M. K.: Bundle gerbes. J. London Math. Soc. (2) 54(2), 403–416 (1996) 38. Murray, M. K., Stevenson, D.: Bundle gerbes: Stable isomorphsim and local theory. J. London Math. Soc. (2). 62(3), 925–937 (2002) 39. Murray, M. K., Stevenson, D.: Higgs fields, bundle gerbes and string structures. Commun. Math. Phys. 243(3), 541–555 (2003) 40. Narasinhan, H.S., Ramananan, S.: Existence of universal connections. Am. J. Math. 83, 563–572 (1961); 85, 223–231 (1963) 41. Pressley, A., Segal, G.: Loop groups. Oxford: Oxford University Press, 1988 42. Quillen, D.: Determinants of Cauchy-Riemann operators on Riemann surfaces. Funct. Anal. Appl. 19(1), 31–34 (1985) 43. Ramadas, T. R., Singer, I. M., Weitsman, J.: Some comments on Chern-Simons gauge theory. Commun. Math. Phys. 126(2), 409–420 (1989) 44. Stevenson, D.: Bundle 2-gerbes. Proc. Lond. Math. Soc. (3) 88, 405–435 (2004) 45. Toledano-Laredo, V.: Positive energy representations of the loop groups of non-simply connected Lie groups. Commun. Math. Phys. 207(2), 307–339 (1999) Communicated by A. Connes

Commun. Math. Phys. 259, 615–637 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1379-5

Communications in

Mathematical Physics

Hypercontractivity in Non-Commutative Holomorphic Spaces Todd Kemp Cornell University, Malott Hall, Ithaca, NY 14853-4201, USA. E-mail: [email protected] Received: 4 November 2004 / Accepted: 11 January 2005 Published online: 21 June 2005 – © Springer-Verlag 2005

Abstract: We prove an analog of Janson’s strong hypercontractivity inequality in a class of non-commutative “holomorphic” algebras. Our setting is the q-Gaussian algebras q associated to the q-Fock spaces of Bozejko, K¨ummerer and Speicher, for q ∈ [−1, 1]. We construct subalgebras Hq ⊂ q , a q-Segal-Bargmann transform, and prove Janson’s strong hypercontractivity L2 (Hq ) → Lr (Hq ) for r an even integer. 1. Introduction As part of the work in the 1960s and 1970s to construct a mathematically consistent theory of interacting quantum fields, Nelson proved his famous hypercontractivity inequality in its initial form [N1]; by 1973 it evolved into the following statement, which may be found in [N2]. Theorem 1 (Nelson, 1973). Let Aγ be the Dirichlet form operator for Gauss measure 2 dγ (x) = (2π )−n/2 e−|x| /2 dx on Rn . For 1 < p ≤ r < ∞ and f ∈ Lp (Rn , γ ), e−tAγ f r ≤ f p ,

for t ≥ tN (p, r) =

1 r −1 log . 2 p−1

(1.1)

For t < tN (p, r), e−tAγ is not bounded from Lp to Lr . (If p < 2, one must first extend e−tAγ to Lp ; this can be done, uniquely, and Theorem 1 should be interpreted as such in this case. The same comment applies to all of the following.) It is worth noting that tN , the least time to contraction, does not depend on the dimension n of the underlying space Rn . The initial purpose of such hypercontractive inequalities was to prove the semiboundedness of Hamiltonians in the theory of Boson quantum fields. (See, for example, [Gli, N1, Se2].) In [G1], Gross used this inequality (through an appropriate cut-off approximation) to show that the Boson energy operator in a model of 2-dimensional Euclidean

616

T. Kemp

quantum field theory has a unique ground state. In that paper he also showed that if one represents the Fock space for Fermions as the L2 -space of a Clifford algebra (as in [Se1]), then inequalities similar to 1.1 also hold. He developed this further in [G3]. Over the subsequent three decades, Nelson’s hypercontractivity inequality (and its equivalent form, the logarithmic Sobolev inequality, invented by Gross in [G2]) found myriad applications in analysis, probability theory, differential geometry, statistical mechanics, and other areas of mathematics and physics. See, for example, the recent survey [G5]. The Fermion hypercontractivity inequality in [G3] remained unproven in its sharp form until the early 1990s. Lindsay [L] and Meyer [LM] proved that it holds L2 → Lr for r = 2, 4, 6, . . . (and in the dual cases Lr → L2 as well). Soon after, Carlen and Lieb [CL] were able to complete Gross’ original argument with some clever non-commutative integration inequalities, thus proving the full result. (Precisely: they showed that the Clifford algebra analogs of the inequalities 1.1 hold with exactly the same constants.) Then, in 1997, Biane [B1] extended Carlen and Lieb’s work beyond the Fermionic (Clifford algebra) setting to the q-Gaussian von Neumann algebras q of Bozejko, K¨ummerer, and Speicher [BKS]. His theorem may be stated as follows. Theorem 2 (Biane, 1997). Let −1 < q < 1, let Nq denote the number operator associated to q , and let · p be the non-commutative Lp -norm associated to the vacuum expectation state τq on q . Then for 1 < p ≤ r < ∞, e−tNq f r ≤ f p for all f ∈ Lp (q , τq ) iff t ≥ tN (p, r). Of particular interest is the case q = 0 which corresponds to free probability. Biane proved the full result (for −1 < q < 1) by first extending Carlen and Lieb’s work to the case of a system of mixed spins (in a von Neumann algebra generated by elements which satisfy some commutation and some anti-commutation relations), and then applying a central limit theorem due to Speicher [S]. The case q = −1 is Carlen and Lieb’s adaptation of Gross’ work, while the q = 1 case is Nelson’s original hypercontractive estimate (Theorem 1). Concurrent to the work on non-commutative hypercontractivity, a different sort of extension of Nelson’s theorem was being developed. In 1983 Janson, [J], discovered that if one restricts the semigroup e−tAγ in Theorem 1 to holomorphic functions on R2n ∼ = Cn then the contractivity of Eq. 1.1 is attained in a shorter time than tN . Writing HLp = Lp (R2n , γ ) ∩ Hol(Cn ), Janson’s strong hypercontractivity may be stated thus. Theorem 3 (Janson, 1983). Let 0 < p ≤ r < ∞, and let f ∈ HLp . Then e−tAγ f r ≤ f p ,

for t ≥ tJ (p, r) =

r 1 log . 2 p

(1.2)

For t < tJ (p, r), e−tAγ is not bounded from HLp to HLr . Note that the least time tJ to contraction is shorter than the time tN (if 1 < p < r < ∞). Moreover, Janson’s result holds as p → 0, in a regime where the semigroup e−tAγ is not even well-defined in the full Lp -space. These results have been further generalized by Gross in [G4] to the case of complex manifolds. In this paper, non-commutative algebras Hq will be introduced, which are q-deformations of the algebra of holomorphic functions. The special cases q = ±1 and q = 0

Hypercontractivity in Non-Commutative Holomorphic Spaces

617

are already known; H−1 is defined in [BSZ], while H0 is isomorphic to the free SegalBargmann space of [B2]. We will construct a unitary isomorphism Sq from L2 (q ) to L2 (Hq ), which is a q-analog of the Segal-Bargmann transform. Hq itself will be constructed as a subalgebra of q , and so inherits its p-norms as well as its number operator Nq . In the context of these q-deformed Segal-Bargmann spaces, the following theorem is our main result. Theorem 4. For −1 ≤ q < 1 and r an even integer, e−tNq f r ≤ f 2 for all f ∈ L2 (Hq , τq ) iff t ≥ tJ (2, r). It is interesting that the least time to contraction, tJ , is independent of both the dimension of the underlying space and the parameter q. We fully expect the same results to hold Lp (Hq ) → Lr (Hq ) for 2 ≤ p ≤ r < ∞, but standard interpolation techniques fail to work in the holomorphic algebras we consider. (In particular, the dual results that Lindsay and Meyer achieved in the full Clifford algebra do not follow in this holomorphic setting.) This paper is organized as follows. We begin with a summary of the q-Fock spaces Fq and the von Neumann algebras q associated to them. We will also define the holomorphic subalgebras Hq and construct a q-Segal-Bargmann transform. In the subsequent section, we prove the appropriate strong hypercontractivity estimates for algebras with arbitrary mixed spins (mixed commutation and anti-commutation relations), much in the spirit of Biane’s approach [B1]. We then proceed to review Speicher’s central limit theorem, and apply it to prove Theorem 4. 2. The q-Fock Space and Associated Algebras We begin by briefly reviewing the q-Fock spaces of Bozejko, K¨ummerer and Speicher, relevant aspects of the von Neumann algebras q (which are related to the creation and annihilation operators on Fq ), and the number operators on them. We then proceed to define the Banach algebra Hq which corresponds to the classical Segal-Bargmann space, and exhibit a ∗-isomorphism between H0 and the free Segal-Bargmann space Chol defined in [B2]. We finally construct a generalized q-Segal-Bargmann transform, which is a unitary isomorphism L2 (q ) → L2 (Hq ) that respects the action of the number operator. 2.1. The q-Fock space Fq and the algebra q . Our development closely follows that found in [B1]; the details may be found in [BKS]. Let H be a real Hilbert space with complexification HC . Let be a unit vector in a 1-dimensional complex Hilbert space (disjoint from HC ). We refer to as the vacuum, and by convention define HC⊗0 ≡ C. The algebraic Fock space F(H ) is defined as F(H ) ≡

∞

HC⊗n ,

n=0

where the direct sum and tensor product are algebraic. For any q ∈ [−1, 1], we then define a Hermitian form (·, ·)q to be the conjugate-linear extension of (, )q = 1, (f1 ⊗ · · · ⊗ fj , g1 ⊗ · · · ⊗ gk )q = δj k

π∈Sk

q ι(π) (f1 , gπ1 ) · · · (fk , gπk ),

618

T. Kemp

for fi , gi ∈ HC , where Sk is the symmetric group on k symbols, and ι(π ) counts the number of inversions in π ; that is ι(π) = #{(i, j ) ; 1 ≤ i < j ≤ k, π i > πj }. The reader may readily verify that (−1)ι(π ) = parity(π ) for any permutation π . Hence, the form (·, ·)−1 reduces to the standard Hermitian form associated to the Fermion Fock space. Similarly, the form (·, ·)1 yields the standard Hermitian form on the Boson Fock space. In each of these cases the form is degenerate, thus requiring that we take a quotient of F(H ) before completing to form the Fermion or Boson Fock space. It is somewhat remarkable that, for −1 < q < 1, the form (·, ·)q is already non-degenerate on F(H ). Proposition 1 ([BKS]). The Hermitian form (·, ·)q is positive semi-definite on F(H ). Moreover, it is an inner product on F(H ) for −1 < q < 1. For −1 < q < 1, the q-Fock space Fq (H ) is defined as the completion of F(H ) with respect to the inner-product (·, ·)q . (It should be noted that, in the case q = 0, the definition form (·, ·)0 requires the convention that 00 = 1. It follows that F0 (H ) ∞of the ⊗n is just n=0 HC with the Hilbert space tensor product and direct sum.) These spaces interpolate between the classical Boson and Fermion Fock spaces F±1 (H ), which are constructed by first taking the quotient of F(H ) by the kernel of (·, ·)±1 and then completing. As in the classical theory, the spaces Fq come equipped with creation and annihilation operators. For any vector f ∈ H ⊂ HC , define the creation operator cq (f ) on Fq (H ) to extend cq (f ) = f, cq (f )f1 ⊗ · · · ⊗ fk = f ⊗ f1 ⊗ · · · ⊗ fk . The annihilation operator cq∗ (f ) is its adjoint, which the reader may compute satisfies cq∗ (f ) = 0, cq∗ (f )f1 ⊗ · · · ⊗ fk =

k

q j −1 (fj , f )f1 ⊗ · · · ⊗ fj −1 ⊗ fj +1 ⊗ · · · ⊗ fk .

j =1

These are similar to the definitions of the creation and annihilation operators in the Fermion and Boson cases, where appropriate (anti)symmetrization must also be applied. One notable difference is that, in the Boson (q = 1) case, the operators are unbounded. For q < 1, the creation and annihilation operators are always bounded, and hence we may discuss the von Neumann algebra they generate without difficulties. The operators cq , cq∗ satisfy the q-commutation relations, which interpolate between the canonical commutation relations (CCR) and canonical anticommutation relations (CAR) usually associated to the Boson and Fermion Fock spaces. Over the q-Fock space, we have cq∗ (g)cq (f ) − qcq (f )cq∗ (g) = (f, g)idFq (H ) for f, g ∈ H .

(2.1)

It is worth pausing at this point to note one significant difference between the q = ±1 cases and the −1 < q < 1 cases. For both Bosons and Fermions, the operators c, c∗ also satisfy additional (anti)commutation relations. In the Boson case, for example, c(f ) and

Hypercontractivity in Non-Commutative Holomorphic Spaces

619

c(g) commute for any choices of f and g. It is a fact, however, that if q = ±1 there are no relations between cq (f ) and cq (g) if (f, g) = 0. It is a well-known theorem that the creation and annihilation operators in the Boson and Fermion cases are irreducible; that is, they have no non-trivial invariant subspaces. That theorem is also true for the operators cq for −1 < q < 1, although a published proof does not seem to exist. We prove it here for completeness. The q = 0 case will be used in Proposition 4 below. Theorem 5. For −1 < q < 1, the von Neumann algebra generated by {cq (h) ; h ∈ H } is B(Fq (H )). Proof. Denote by Wq the von Neumann algebra generated by the cq ’s. We consider the q = 0 case first. Let {e1 , e2 , . . . } be an orthonormal basis for H , and consider the operator P =

∞

c(ej ) c∗ (ej )

j =1

(where c(h) = c0 (h)), which is in W0 since W0 is weakly closed. It is easy to calculate that P (ei1 ⊗ · · · ⊗ ein ) = ei1 ⊗ · · · ⊗ ein , while P = 0. Thus, P is the projection onto the orthogonal complement of the vacuum. So W0 1 − P = P , the projection onto the vacuum. Therefore W0 contains the operator c(ei1 ) · · · c(ein )P c∗ (ej1 ) · · · c∗ (ejm ), which is the rank-1 operator with image spanned by ei1 ⊗ · · · ⊗ ein and kernel orthogonal to ej1 ⊗ · · · ⊗ ejm . It follows that W0 contains all finite rank operators, and hence is the full algebra B(F0 (H )). For q = 0, it is proved in [DN] that there is a unitary map Uq : F0 → Fq , which preserves the vacuum and satisfies Uq C0 Uq∗ ⊆ Cq , where Cq is the C ∗ -algebra generated by {cq (h) ; h ∈ H }. As Wq is the weak closure of Cq , it follows easily that B(Fq (H )) = Uq B(F0 (H ))Uq∗ = Uq W0 Uq∗ ⊆ Wq as well, and this completes the proof. For q < 1 and for each f ∈ H , define the self-adjoint operator Xq (f ) on Fq (H ) by Xq (f ) = cq (f ) + cq∗ (f ). These operators are in Wq = B(Fq (H )), but they do not generate it. The von Neumann algebra they do generate is defined to be q (H ), the q-Gaussian algebra over H . (In the q = 1 case, 1 (H ) is the von Neumann algebra generated by the operators ϕ(X(f )) for ϕ ∈ L∞ (R).) The notation q is chosen to be consistent with the second quantization functor from constructive quantum field theory (see [BSZ]), which assigns to each real Hilbert space H a von Neumann algebra (H ) and to each contraction T : H → K a unital positivity-preserving map (T ) : (H ) → (K ). Indeed, q can be construed as such a functor as well. The isomorphism classes of the von Neumann algebras q (H ) for q ∈ / {±1, 0} are ´ not yet understood. (For some partial results, however, see [R] and [Sn].) The ±1 cases have been understood since antiquity: 1 (H ) = L∞ (M, γ ) for a certain measure space M with a Gaussian measure γ , while −1 (H ) is a Clifford algebra modeled on H . These facts rely upon the additional commutation relations between c(f ) and c(g) that

620

T. Kemp

hold in those cases. (Indeed, in the Boson case X(f ) and X(g) commute, resulting in a commutative von Neumann algebra (H ). It is primarily for this reason that it is customary to begin with a real Hilbert space and complexify – if c(f ) were defined for all f ∈ HC , then c(f ) and c(g) would no longer commute even in the Boson case. While there are no commutation relations between cq (f ) and cq (g), it is still advantageous for us to have the real subspace H ⊂ HC in order to define the holomorphic subalgebra in Sect. 2.2.) 0 (H ) was shown (in [V]) to be isomorphic to the group von Neumann algebra of a free group with countably many generators. One known fact about the algebras q (H ) for −1 < q < 1 is that they are all type I I1 factors. This is a consequence (in the dim H = ∞ case) of the following theorem, which was proved in [BSp]. Proposition 2 (Bozejko, Speicher). Let −1 < q < 1. The vacuum expectation state τq (A) = (A, )q on B(Fq (H )) restricts to a faithful, normal, finite trace on q (H ). The reader may wish to verify that τq (cq∗ cq ) = 1, while τq (cq cq∗ ) = 0; hence, τq is certainly not a trace on all of B(Fq (H )). The algebra q can actually be included as a dense subspace of Fq . The map A → A is one-to-one from q into Fq . The precise action of this map will be important to us, q and so it bears mentioning. The q-Hermite polynomials Hn are one-variable real polyq q nomials defined so that H0 (x) = 1, H1 (x) = x, and satisfying the following recurrence relation: q

q

xHn (x) = Hn+1 (x) +

qn − 1 q H (x), q − 1 n−1

(2.2)

where (q n − 1)/(q − 1) is to be interpreted as n when q = 1. In this case, the generated polynomials Hn1 are precisely the Hermite polynomials that play an important role in the Boson theory. When q = 0, the polynomials Hn0 are the Tchebyshev polynomials, and play an analogous role in the theory of semi-circular systems (see [V]). We can express q the action of the above map A → A succinctly in terms of the polynomials Hn . The following proposition is proved in [BKS]. Proposition 3. The map A → A from q to Fq is one-to-one, and extends to a unitary isomorphism L2 (q , τq ) → Fq . If {ej } are orthonormal vectors in H and j = j +1 for 1 ≤ ≤ k − 1, then k 1 Hn1 (Xq (ej1 )) · · · Hnk (Xq (ejk )) = ej⊗n ⊗ · · · ⊗ ej⊗n . 1 k

q

q

(2.3)

The algebraic Fock space F(H ) carries a number operator N , whose action is given by N = 0, N (f1 ⊗ · · · ⊗ fn ) = nf1 ⊗ · · · ⊗ fn . This operator extends to a densely-defined, essentially self-adjoint operator Nq on Fq (H ). The algebra q then inherits the action of Nq , via the map in Proposition 3. The reader may readily check that if {ej } are orthonormal vectors in H and j = j +1 for 1 ≤ ≤ k − 1, then the element q

q

Hn1 (Xq (ej1 )) · · · Hnk (Xq (ejk ))

Hypercontractivity in Non-Commutative Holomorphic Spaces

621

is an eigenvector of Nq with eigenvalue n1 + · · · + nk . In the case H = Rd , this is a precise analogy to the action of the number operator for Bosons. The algebra 1 (Rd ) is isomorphic to L∞ (Rd , γ ), and the operators X1 (ej ) (for the standard basis vectors ej ) are multiplication by the coordinate functions xj . The Boson number operator Aγ has Hn11 (x1 ) · · · Hn1k (xk ) as an eigenvector, with eigenvalue n1 + · · · + nk . The number operator generates a contraction semigroup e−tNq on L2 (q , τq ), which is known to restrict for p > 2, and extend for 1 ≤ p < 2, to a contraction semigroup on Lp (q , τq ). Biane’s hypercontractivity theorem, Theorem 2, is an extension of these results.

2.2. The holomorphic algebra, and the q-Segal-Bargmann transform. Let q < 1. We wish to define a Banach algebra of “holomorphic" elements in B(Fq ). To that end, we follow a similar procedure to the formal construction of holomorphic polynomials. We begin by doubling the number of variables, and so we consider the algebra q (H ⊕H ). This algebra contains two independent copies of the variable X(h) ∈ q (H ): X(h, 0) and X(0, h). (Here, (h, 0) denotes a pair in H ⊕ H , not the inner product of h with 0. Whenever this ambiguity in notation may be confusing, we will clarify by denoting the inner product as (·, ·)K for the appropriate Hilbert space K .) We then introduce a new variable Z(h), 1 Z(h) = √ (X(h, 0) + iX(0, h)). 2

(2.4)

In the case H = R √ and q = 1, this precisely corresponds to the holomorphic variable z = (x + iy)/ 2, the normalization chosen so that z is a unit vector in HL2 (γ ). We define the q-holomorphic algebra Hq (HC ) as the Banach algebra generated by {Z(h) ; h ∈ H }. In [B2], Biane introduced a Banach algebra Chol in the q = 0 case which is also an analog of the algebra of holomorphic functions. His algebra is not contained in 0 (H ⊕ H ), so it is less natural to consider an action of N0 on it. We introduce it here (with slightly changed notation to avoid inconsistencies) to show that it is isomorphic to H0 , and so the work presented here indeed generalizes Biane’s results. Consider the von Neumann algebra B(F0 (H ⊕ H )); it contains all the operators c0 (h, g) and their adjoints, for h, g ∈ H . Define the operator B(h) = c0 (h, 0) + c0∗ (0, h). Let C (HC ) be the von Neumann algebra generated by {B(h) ; h ∈ H }, and Chol (HC ) the Banach algebra so generated. The expectation state τ0 (A) = (A, ) restricts to a faithful, normal, finite trace on C (HC ), and the map h → B(h) is a circular system with respect to τ0 (see [V]). Although Biane’s algebra Chol (HC ) is not contained in 0 (H ⊕ H ), it is in fact isomorphic to our algebra H0 (HC ), in the following strong sense. Proposition 4. There is a ∗-automorphism of B(F0 (H ⊕H )) which maps 0 (H ⊕H ) onto C (HC ). In particular, it maps Z(h) to B(h), and so sends H0 (HC ) to Chol (HC ).

622

T. Kemp

Proof. By Theorem 5, we can define an endomorphism of B(F0 (H ⊕ H )) on the generators c0 (h, g) by 1 α(c0 (h, g)) = √ (c0 (h, h) + ic0 (−g, g)), 2 1 α(c0∗ (h, g)) = √ (c0∗ (h, h) − ic0∗ (−g, g)). 2 A straightforward computation verifies that the operators α(c0 (h, g)) satisfy the 0-commutation relations of Eq. 2.1. Hence, α extends to a ∗-homomorphism. It can also easily be checked that α has an inverse of the form 1 α −1 (c0 (h, g)) = √ (c0 (h + g, 0) + ic0 (0, h − g)), 2 1 α −1 (c0∗ (h, g)) = √ (c0∗ (h + g, 0) − ic0∗ (0, h − g)), 2 which also extends to a ∗-homomorphism. Hence, α is a ∗-automorphism. Finally, one can calculate that α(Z(h)) = B(h). Whence, α maps H0 onto Chol , and so maps W ∗ (H0 (HC )) = 0 (H ⊕ H ) onto C (HC ). It should be noted that Proposition 4 generalizes automatically to q = 0; however, we are only concerned with the q = 0 case for Chol . Corollary 1. The map α from 0 (H ⊕ H ) to C (HC ) extends to an isometric isomorphism Lp (0 , τ0 ) → Lp (C , τ0 ) for 0 < p ≤ ∞. Proof. Since α is a ∗-automorphism of the full von Neumann algebra B(F0 ), by Wigner’s theorem it is induced (through conjugation) by either a unitary or an anti-unitary on F0 . Suppose it is a unitary, U , so that α(A) = U ∗ AU for each A ∈ B(F0 ). Recall that τ0 is known to restrict to a tracial state on both W ∗ (H0 ) and C . Hence, for A ∈ H0 and p > 0, α(A)p = τ0 (|α(A)|p ) = τ0 (α(|A|p )) = τ0 (U ∗ |A|p U ) = τ0 (|A|p ) = Ap . p

p

It follows that α extends to an isometric isomorphism from Lp (H0 , τ0 ) onto Lp (Chol , τ0 ). The anti-unitary case is similar. Hence, the algebraic map which sends Z(h) to B(h) preserves all Lp topology (even for p < 1), and so the analyses of the spaces H0 and Chol are very much the same. In the commutative context, one of the most powerful tools in this area is the SegalBargmann transform S , which is a unitary isomorphism S : L2 (Rd , γ ) → HL2 (Cd , γ ). Here, γ denotes the measure whose density with respect to Lebesgue measure is (a constant multiple of) the Gaussian exp(−|z|2 ), rather than exp(−|x|2 /2) as in γ . The Hermite polynomials Hn11 (x1 ) · · · Hn1d (xd ), appropriately normalized, form an orthonormal basis of L2 (Rd , γ ), and the action of S on this basis is simple: S : Hn11 (x1 ) · · · Hn1d (xd ) → z1n1 · · · zdnd . So S maps the Hermite polynomials to the holomorphic monomials.

(2.5)

Hypercontractivity in Non-Commutative Holomorphic Spaces

623

(A note on normalization. Instead of changing the measure γ → γ√ , we could redefine S : L2 (Rd , γ ) → HL2 (Cd , γ ) by setting S f (z) = S f (z/ 2). This map is, of course, a unitary isomorphism. It is this point of view that we take while generalizing √ the Segal-Bargmann transform. After all, we have already built the factor 1/ 2 into the variable Z(h).) In [B2], a free Segal-Bargmann transform is introduced, which is a unitary isomorphism L2 (0 , τ0 ) → L2 (Chol , τ0 ). We will modify this transform and extend it to all q ∈ [−1, 1], and further show that besides generalizing the classical transform S it respects the action of the number operator. First, we will need to understand the embedding of Hq (HC ) in Fq (H ⊕H ) (it is injected via the map A → A, which is one-to-one on all of q (H ⊕H ) by Proposition 3). Consider the diagonal mapping δ : HC → HC ⊕HC defined δ(h) = 2−1/2 (h, ih). Since δ is isometric, it extends to an isometric embedding δq : Fq (H ) → Fq (H ⊕ H ) (that is, δq (h1 ⊗ h2 ⊗ · · · ) = δ(h1 ) ⊗ δ(h2 ) ⊗ · · · ). Proposition 5. The map A → A injecting Hq (HC ) → Fq (H ⊕ H ) extends to a unitary isomorphism L2 (Hq (HC ), τq ) → δq Fq (H ). If {ej } are orthonormal vectors in H , then Zq (e1 )n1 · · · Zq (ek )nk = δq (e1⊗n1 ⊗ · · · ⊗ ek⊗nk ).

(2.6)

Proof. Let φ = h1 ⊗ · · · ⊗ hn ∈ F(H ), and consider Zq (h)δ(φ). We may compute Xq (h, 0)δ(φ) = (cq (h, 0) + cq∗ (h, 0))δ(φ) = (h, 0) ⊗ δ(φ) +

n

q j −1 2−1/2 (hj , ihj ), (h, 0)

j =1

H ⊕H

δ(φˆ j )

1 j −1 = (h, 0) ⊗ δ(φ) + √ q (hj , h)H δ(φˆ j ), 2 j =1 n

where φˆ j = h1 ⊗ · · · ⊗ hj −1 ⊗ hj +1 ⊗ · · · ⊗ hn . A similar calculation shows that 1 j −1 q (ihj , h)H δ(φˆ j ), Xq (0, h)δ(φ) = (0, h) ⊗ δ(φ) + √ 2 j =1 n

and so in the sum Xq (h, 0) + iXq (0, h) the cq∗ terms cancel. (Note, we have assumed as is standard that the complexified inner product (h, g) is linear in h and conjugate-linear in g.) Thus, we have Zq (h)δ(φ) = 2−1/2 (h, ih) ⊗ δ(φ) = δ(h ⊗ φ). Equation 2.6 now follows by induction, and the theorem follows since such vectors are dense in δq Fq (H ). We may now define the q-Segal-Bargmann transform as follows. Propositions 3 and 5 give (up to the map δq ) unitary equivalences between the Fock space Fq (H ) and both L2 (q (H ), τq ) and L2 (Hq (HC ), τq ). The q-Segal-Bargmann transform Sq is the composition of these unitary isomorphisms. That is, Sq is the unitary map which makes the following diagram commute: Fq (H ) O

δq

A→A

L2 (q (H ), τq )

/ Fq (H ⊕ H ) O A→A

Sq

/ L2 (Hq (HC ), τq ).

624

T. Kemp

By Eqs. 2.3 and 2.6, the action of Sq can be expressed in terms of the q-Hermite polynomials. If {ej } are orthonormal vectors in H and j = j +1 for 1 ≤ ≤ k − 1, then q

q

Sq : Hn1 (Xq (ej1 )) · · · Hnk (Xq (ejk )) → Zq (ej1 )n1 · · · Zq (ejk )nk .

(2.7)

Comparing Eqs. 2.5 and 2.7, we see that Sq is a natural extension of the classical Segal-Bargmann transform. Since Hq (HC ) is contained in q (H ⊕ H ), it inherits the number operator Nq from it, induced by the inclusion of q (H ⊕ H ) into Fq (H ⊕ H ) via the map A → A. From Eq. 2.6, we see then that Zq (e1 )n1 · · · Zq (ek )nk is an eigenvector of Nq with eigenvalue n1 + · · · + nk . This precisely matches the conjugated action Sq Nq Sq∗ of the number operator Nq on q (H ), as can be seen from Proposition 3. Hence, we have Sq Nq = Nq Sq , just as in the commutative case. Finally, we define Lp (Hq (HC ), τq ) to be the completion of Hq (HC ) in the p L (q (H ⊕ H ), τq )-norm. For p ≥ 2 (the case of interest for our main theorem), it is equal to the intersection of L2 (Hq (HC ), τq ) with Lp (q (H ⊕H ), τq ). The class of Banach spaces Lp (Hq , τq ) is a non-commutative generalization of the spaces HLp (Cd , γ ) that occur in Janson’s Theorem 3. Since the algebra Hq is not a von Neumann algebra, this family is not known to be complex interpolation scale. For example, in the q = 1 case, the family is not complex interpolation scale when H is infinite-dimensional (this is almost proven in [JPR]). Hence, once we have proved Theorem 4, it is not an easy matter to generalize to the case p > 2, r = 2, 4, 6, . . . .

3. Mixed Spin and Strong Hypercontractivity We will consider the mixed-spin algebras C (I, σ ) introduced in [B1] which represent systems with some commutation and some anti-commutation relations. Such systems may be viewed as approximations to the q-commutation relations, in a manner which will be made precise in Sect. 4. We introduce a holomorphic subalgebra H(I, σ ), and give a combinatorial proof of a strong hypercontractivity theorem like Theorem 4 for it.

3.1. The mixed-spin algebra C (I, σ ). Let I be a finite totally ordered set (with cardinality denoted by |I |), and let σ be a function I × I → {−1, 1} which is symmetric, σ (i, j ) = σ (j, i), and constantly −1 on the diagonal, σ (i, i) = −1. Let C (I, σ ) denote the unital C-algebra with generators {xi ; i ∈ I } and relations xi xj − σ (i, j )xj xi = 2δij

for

i, j ∈ I.

(3.1)

(The requirement σ (i, i) = −1 forces xi2 = 1, and guarantees that C (I, σ ) is finitedimensional.) In the special case σ ≡ −1, this is precisely the complex Clifford algebra C|I | , hence our choice of notation. In the case σ (i, j ) = 1 for i = j (i.e. when different generators commute), the generators of C (I, σ ) may be modeled by |I | i.i.d. Bernoulli random variables, and so we reproduce the toy Fock space considered in [M]. In the general case, C (I, σ ) has, as a vector space, a basis consisting of all xA with A = (i1 , . . . , ik ) increasing multi-indices in I k , where xA = xi1 . . . xik , and x∅ denotes

Hypercontractivity in Non-Commutative Holomorphic Spaces

625

the identity 1 ∈ C (I, σ ). Thus, dim C (I, σ ) = 2|I | . Moreover, C (I, σ ) has a natural decomposition C (I, σ ) =

|I |

Cn (I, σ ),

n=0

where Cn = span{xA ; |A| = n} is the “n-particle space.” Of some importance to us will be the natural grading of the algebra, C (I, σ ) = C+ (I, σ ) ⊕ C− (I, σ ), where C+ = {Cn ; n is even}, and C− is the corresponding odd subspace. The reader may readily verify that this decomposition is a grading – i.e. Cα · Cβ ⊆ Cαβ , where α, β ∈ {+, −} and their product is to be interpreted in the obvious fashion. We equip C (I, σ ) with an involution ∗, which is defined to be the conjugatelinear extension of the map xA∗ = xA∗ , where (i1 , . . . , ik )∗ is the reversed multiindex (ik , . . . , i1 ). In particular, the generators xi = xi∗ are self-adjoint, and in general xA∗ = ±xA . We also define a tracial state τσ by τσ (xA ) = δA∅ ; that is, τσ (1) = 1 while τσ (xA ) = 0 for all other basis elements. It is easy to check that τσ (ab) = τσ (ba). This allows us to define an inner product on C (I, σ ) by (a, b)σ = τσ (b∗ a). The basis {xA } is orthonormal with respect to (·, ·)σ . Following the GNS construction, we see that the action of C (I, σ ) on the Hilbert space (C (I, σ ), (·, ·)σ ) by left-multiplication is continuous, and yields an injection of C (I, σ ) into the von Neumann algebra of bounded operators on the Hilbert space. In this way, C (I, σ ) gains a von Neumann algebra structure. We denote by Lp (C (I, σ ), τσ ) the non-commutative Lp space of this von Neumann algebra with its trace τσ . (So, in particular, L2 (C (I, σ ), τσ ) is naturally isomorphic to the Hilbert space (C (I, σ ), (·, ·)σ ).) The Lp (C (I, σ ), τσ )-norm is, in fact, just the (normalized) Schatten Lp -norm on the matrix algebra. This can be seen from the following proposition. Proposition 6. Let tr denote the normalized trace on the finite-dimensional algebra B(L2 (C (I, σ ), τσ )). Then for any x ∈ C (I, σ ), τσ (x) = tr(x). Proof. Using the orthonormal basis {xA } for L2 (C (I, σ ), τσ ), we compute that 0, if A = ∅ −|I | (xA xB , xB )σ = −|I | tr(xA ) = 2 , 2 (x , x ) = 1, if A = ∅ B B σ B B

where the sums are taken over all increasing multi-indices B. So tr(xA ) = δA∅ = τσ (xA ). Note that the trace τσ may be expressed in terms of the inner product as the pure state τσ (x) = (x1, 1)σ . This formula extends to all of B(L2 (C (I, σ ), τσ )), giving the pure state β → (β1, 1)σ . However, this state does not equal tr for all bounded operators β. We will see examples in Sect. 4 showing that it is not tracial in general.

626

T. Kemp

The algebra C (I, σ ) comes equipped with a number operator Nσ which has Cn (I, σ ) as an eigenspace with eigenvalue n. That is, Nσ xA = |A|xA . This is a generalization of the action of the operator N−1 on the Clifford algebra C|I | = −1 (R|I | ). Nσ is a positive semi-definite operator on L2 (C (I, σ ), τσ ), and so generates a contraction semigroup e−tNσ . It is to the study of this semigroup, restricted to a holomorphic subspace, that we devote the remainder of this section. 3.2. The mixed-spin holomorphic algebra H(I, σ ). Following our construction of Hq , we will begin by doubling the number of variables. We extend σ to the set I × {0, 1} by setting σ ((i, ζ ), (j, ζ )) = σ (i, j ), and then consider the algebra C (I ×{0, 1}, σ ). If we relabel x(i,0) → xi and x(i,1) → yi , then this is tantamount to constructing the unital C-algebra with relations  xi xj − σ (i, j )xj xi = 2δij  yi yj − σ (i, j )yj yi = 2δij for i, j ∈ I. (3.2) xi yj − σ (i, j )yj xi = 0  Note, C (I, σ ) is ∗-isomorphically embedded in C (I × {0, 1}, σ ) via the inclusion xi → x(i,0) . Hence, this relabeling should not be confusing. We define elements zj ∈ C (I × {0, 1}, σ ) by zj = 2−1/2 (xj + iyj ) = 2−1/2 (x(j,0) + ix(j,1) ),

for j ∈ I. (3.3) √ (To avoid confusion, we point out that in Eq. 3.3, i refers to −1 ∈ C.) The operator zj is an analog of the operators Zq (ej ) in Hq . The normalization is again chosen so that zj is a unit vector in L2 (C , τσ ). For the calculations in the foregoing, however, it will be convenient to have the variables normalized in L∞ (C , τσ ). Therefore, we also introduce zˆ j = 2−1/2 zj =

1 (xj + iyj ) 2

for

j ∈ I.

The reader may readily verify that |ˆzj |2 = zˆ j∗ zˆ j is a nonzero idempotent, and hence ˆzj ∞ = 1. Define the mixed spin holomorphic algebra H(I, σ ) as the C-algebra generated by {z1 , . . . , z|I | }. This is just the polynomial algebra in the variables z1 , . . . , z|I | — the adjoints are not included. Indeed, 2ˆzj∗ = xj −iyj , so xj = zˆ j + zˆ j∗ and yj = i(ˆzj∗ − zˆ j ). Thus, the ∗-algebra generated by z1 , . . . , z|I | is all of C (I × {0, 1}, σ ). Observe that 2zj2 = xj2 − yj2 + i(xj yj + yj xj ) = 0 since σ (j, j ) = −1. In general, we may compute that zi zj − σ (i, j )zj zi = 0,

(3.4)

and the same relations (of course) hold for the zˆ j . The operators zˆ j , zˆ j∗ also satisfy the joint relations zˆ i∗ zˆ j − σ (i, j )ˆzj zˆ i∗ = δij

for

i, j ∈ I.

(3.5)

Hypercontractivity in Non-Commutative Holomorphic Spaces

627

Eq. 3.5 looks much like the q-commutation relations of Eq. 2.1. It is, in fact, possible to think of zˆ j , zˆ j∗ as creation and annihilation operators. That is, there is a faithful representation of zˆ j , zˆ j∗ in B(L2 (C (I, σ ), τσ )), which sends zˆ j and zˆ j∗ to the creation and annihilation operators βj , βj∗ on L2 (C (I, σ ), τσ ) discussed in Sect. 4. (By our definition, the operators zˆ j , zˆ j∗ are a priori in the doubled space B(L2 (C (I × {0, 1}, σ ), τσ ).) This representation, the spin-chain representation, is discussed in [CL] in detail in the case σ ≡ −1, and is generalized in [B1]. The problem with this point of view is that the pure state β → (β1, 1)σ on B(L2 (C (I, σ ), τσ )) does not correspond to the trace τσ under the representation. So, we prefer not to think of zˆ j , zˆ j∗ as creation and annihilation operators. A simple calculation shows that if |A| = n then zA ∈ Cn (I × {0, 1}, σ ), and so Nσ zA = |A|zA . Thus, H(I, σ ) is a reducing subspace for the (self-adjoint) operator Nσ on L2 (C (I × {0, 1}, σ ), τσ ). Note also that the action of Nσ on zA mirrors that of Nσ on xA . In fact, this can be stated in terms of a σ -Segal-Bargmann transform: the map Sσ : xA → zA is a unitary isomorphism of L2 (C (I, σ ), τσ ) onto L2 (H(I, σ ), τσ ), and Sσ Nσ = Nσ Sσ . The main part of the proof of Theorem 4 is the following strong hypercontractivity result regarding the semigroup e−tNσ acting on H(I, σ ). Theorem 6. For p = 2 and r an even integer, e−tNσ ar ≤ ap for all a ∈ H(I, σ ) iff t ≥ tJ (p, r) =

1 r log . 2 p

We expect the theorem holds for 2 ≤ p ≤ r < ∞. (The case p < 2 may be somewhat different from the commutative case; in a communication from L. Gross, a calculation showed that in 1-dimension the least time to contraction seems to be larger than the Janson time for some p, r < 2.) If I = ∅, it is easy to see that the Janson time cannot be improved for any p, r > 0, again by calculation in the 1-dimensional case. Proof (of the ‘only if’ direction of Theorem 6). Let a() = 1 + zˆ ∈ H(I, σ ) where zˆ = zˆ j for some j ∈ I . Then |a()|2 = (1 + zˆ ∗ )(1 + zˆ ) = 1 + (ˆz + zˆ ∗ ) + 2 |ˆz|2 = 1 + x + 2 |ˆz|2 , where x = xj . Hence, |a()|2p = (1 + (x + |ˆz|2 ))p

p(p − 1) 2 (x + |ˆz|2 )2 + o( 2 ) = 1 + p(x + |ˆz|2 ) + 2 p(p − 1) 2 = 1 + (px) + 2 p|ˆz|2 + x + o( 2 ). 2

Now, |ˆz|2 = (1/2)(1 + ixy), where y = yj , and so τσ |ˆz|2 = 1/2. Also x 2 = 1, and τσ x = 0. Therefore, a()2p =

1/2p

τσ (|a()|2p )

1/2p 1 p(p − 1) 2 + o( 2 ) = 1+ p· + 2 2 p 2 = 1 + + o( 2 ). 4

628

T. Kemp

So a()p = 1 + (p/8) 2 + o( 2 ). Now, e−tNσ a() = 1 + e−t zˆ = a(e−t ). Thus, in order for e−tNσ a()r ≤ a()p , we must have 1 1 1 + e−2t r 2 + o( 2 ) ≤ 1 + p 2 + o( 2 ), 8 8 and so as → 0, it follows that e−2t ≤ p/r — or t ≥ tJ (p, r).

Hence, the necessity condition holds for all r ≥ p > 0. For the sufficiency, however, the tools available to us are extremely limited (due to the fact that H(I, σ ) is not a ∗-algebra). We are forced to give a combinatorial proof, which cannot reach beyond the cases when p = 2 and r is even. The remainder of the ‘if’ direction of Theorem 6 is the main subject of Sect. 3.3. 3.3. Strong hypercontractivity for H(I, σ ). We will prove Theorem 6 by induction on |I |. Note, in the case |I | = 0, the algebra H(I, σ ) is just C. Since the action of e−tNσ on C is trivial, and since all · p norms are equal to the complex modulus | · |, the sufficiency condition follows automatically in this case. Now, suppose the strong hypercontractivity result of Theorem 6 holds for the algebras H(I , σ ) with |I | ≤ d. Let I be a set of size d + 1, and σ a spin-assignment on I . Select any fixed element i ∈ I . Any element a ∈ H(I, σ ) can be uniquely decomposed as a = b + zˆ i c,

b, c ∈ H(I − {i}, σ |I −{i} ).

(3.6)

For convenience, throughout we will refer to I − {i} as J , and to zˆ i as zˆ , xi as x, and so forth. Since |J | = d, the inductive hypothesis is that H(J, σ |J ) satisfies the strong hypercontractivity estimate of Theorem 6. The quantity |ˆz|2 will often come up in calculations, and so we give it a name: ξ = |ˆz|2 = zˆ ∗ zˆ . We will also encounter zˆ zˆ ∗ , but by Eq. 3.5, zˆ zˆ ∗ = 1 − ξ . The following lemma records some of the important properties of the operators ξ , zˆ , and zˆ ∗ . All of the statements may be verified by trivial calculation. Lemma 1. The following properties hold for ξ , zˆ , and zˆ ∗ . 1. ξ p = ξ for p > 0. 2. ξ is independent of C (J × {0, 1}, σ |J ) — that is, for each u ∈ C (J × {0, 1}, σ |J ), ξ u = uξ and τσ (ξ u) = τσ (ξ )τσ (u) = 21 τσ (u). 3. Let u ∈ C (J × {0, 1}, σ |J ), let h ∈ {ˆz, zˆ ∗ , ξ, 1 − ξ }, and let p > 0. Then hup = 2−1/p up . 4. ξ zˆ = zˆ ∗ ξ = 0, zˆ ξ = zˆ , and ξ zˆ ∗ = zˆ ∗ . The commutativity in item 2 above follows in large part from the fact that ξ = 21 (1 + ixy) ∈ C+ (I × {0, 1}, σ ). The grading plays an important role in the combinatorics to follow. In fact, the grading of C ({i} × {0, 1}, σ |{i} ) induces a grading on the full algebra C (I × {0, 1}, σ ). We refer to this grading by C = C+i ⊕ C−i ,

i Cαi · Cβi ⊆ Cαβ .

So, for example, the element yj ξ (i = j ) is in C+i (I × {0, 1}, σ ), even though it is in C− (I × {0, 1}, σ ). Note that

C−i (I × {0, 1}, σ ) = zˆ u + zˆ ∗ v ; u, v ∈ C (J × {0, 1}, σ |J ) .

Hypercontractivity in Non-Commutative Holomorphic Spaces

629

For any such u, τσ (ˆzu) = (ˆz, u∗ )σ = 0, and similarly τσ (ˆz∗ u) = 0. It follows that τσ |C i = 0. Using the graded structure, this leads to the following important lemma, − which aids in the calculation of moments. Lemma 2. Let v 0 ∈ C+i (I × {0, 1}, σ ) and v 1 ∈ C−i (I × {0, 1}, σ ). Let η be the {0, 1}sequence of length n, and denote by |η| the sum η1 + · · · + ηn of its entries (i.e. the number of 1s). Then the element v η = v η1 · · · v ηn has τσ (v η ) = 0 if |η| is odd. Now, we proceed to expand the moments of |a|2 . Using the decomposition in 3.6, we have |a|2 = (b + zˆ c)∗ (b + zˆ c) = |b|2 + b∗ zˆ c + c∗ zˆ ∗ b + c∗ |ˆz|2 c. That is, |a|2 = (|b|2 + ξ |c|2 ) + (b∗ zˆ c + c∗ zˆ ∗ b) = v 0 + v 1 .

(3.7)

Eq. 3.7 decomposes |a|2 into its C+i and C−i parts, v 0 = |b|2 + ξ |c|2 and v 1 = b∗ zˆ c + c∗ zˆ ∗ b. It follows immediately that 1 a22 = τσ (v 0 ) = τσ (|b|2 + ξ |c|2 ) = b22 + c22 . 2

(3.8)

The factor of 1/2 (unusual in Pythagoras’ formula) is due to our choice to normalize zˆ in L∞ and not in L2 . More generally, for the nth moment of |a|2 , 2n 0 1 n a2n τσ (v η ), 2n = τσ (|a| ) = τσ [(v + v ) ] = η∈2n

where 2n denotes the set of all {0, 1}-sequences of length n. Using Lemma 2, we have a2n 2n =

τσ (v η ) =

n/2

τσ (v η ).

k=0 |η|=2k

|η| even η∈2n

Now, the term v η is a product of n terms, each of which is either |b|2 + ξ |c|2 or b∗ zˆ c + c∗ zˆ ∗ b. Define v 00 = |b|2 , v 10 = b∗ zˆ c,

v 01 = ξ |c|2 , v 11 = c∗ zˆ ∗ b.

Then we may write v η as vη =

ν∈2n

v ην =

v η1 ν1 · · · v ηn νn .

ν∈2n

It should be noted that many of the terms in this sum are in fact 0. For example, consider (v 10 )2 = b∗ zˆ cb∗ zˆ c. In general, for any u ∈ C (J × {0, 1}, σ |J ), there is a u˜ ∈ C (J × {0, 1}, σ |J ) such that zˆ u = uˆ ˜ z. Hence the term (v 10 )2 contains zˆ 2 = 0, and 10 so is 0. More generally, a term like v v 01 v 10 is also 0: the zˆ in v 10 can be commuted past all terms except ξ , at which point the product is either 0 or zˆ (by Lemma 1), so the term is 0. On the other hand, the term v 11 v 01 v 10 is nonzero, since (once commuting past the C (J × {0, 1}, σ |J )-terms) we have zˆ ∗ ξ zˆ = (ˆz∗ zˆ )2 = ξ = 0. Let η, ν ∈ 2n . Denote by 1 (η) ⊆ {1, . . . , n} the set of j such that ηj = 1. Then say that ν is η-alternating, ν ∈ A(η), if the subsequence {(νj ) ; j ∈11(η)} is alternating. For example, let η = (1, 1, 0, 1). Then the sequences (0, 1, 0, 0) and (0, 1, 1, 0) are both in A(η), while the sequence (0, 0, 0, 0) is not. Note that v 10 and v 11 are the terms

630

T. Kemp

containing zˆ and zˆ ∗ . Hence, the v ην with ν ∈ A(η) are precisely those terms in which zˆ and zˆ ∗ alternate when they occur. By the considerations in the preceding paragraph, these are the only nonzero terms in the expansion of v η . Thus, vη = v ην . ν∈A(η)

In any term in the above sum, let |η| = 2k and let |ν| = m. Since 11(η) is a set of 2k indices, and since ν ∈ A(η), νj = 0 for k of these indices j , and νj = 1 for the other k. Thus ν contains at least k 1s and at least k 0s, and so k ≤ m ≤ n − k. It follows that the full expansion for the nth moment is a2n 2n =

n/2 n−k

k=0 m=k |η|=2k

τσ (v ην ).

ν∈A(η)

|ν|=m

It will be useful to consider the cases k = 0 and m = 0 separately, and so we rewrite this moment as 00 n a2n 2n = τσ [(v ) ] +

n m=1 |ν|=m

τσ (v 0ν ) +

n/2 n−k

k=1 m=k |η|=2k

τσ (v ην ).

(3.9)

ν∈A(η)

|ν|=m

(Note, if η ≡ 0 then the condition ν ∈ A(η) is vacuously satisfied for all ν ∈ 2n .) Each of the v ην in Eq. 3.9 is a product of terms, each of which contains some elements of C (J × {0, 1}, σ |J ) and some factors of zˆ , zˆ ∗ , or ξ . (Observe the only term which has no factors from C ({i} × {0, 1}, σ |{i} ) is the first one (v 00 )n .) To estimate such terms, we introduce the following tool. Lemma 3. Let u1 , . . . , us ∈ C (J × {0, 1}, σ |J ). Let U be a product including all of the elements u1 , . . . , un together with some non-zero number of terms from {ˆz, zˆ ∗ , ξ }. Then 1 τσ (U ) ≤ u1 s · · · us s . (3.10) 2 Proof. First note that τσ (U ) is invariant under cyclic permutations of U . U may then be written in the form h1 U1 h2 U2 · · · h U , where each Uj is a product of some of the u1 , . . . , us , and each hj is a product of the terms zˆ ,ˆz∗ , and ξ . Let sj be the number of terms in Uj ; then s1 + · · · + s = s. So s1 /s + · · · + s /s = 1, and when we apply H¨older’s inequality, we find τσ (U ) ≤ h1 U1 s/s1 · · · h U s/s .

(3.11)

By Lemma 1 part 4, any product of terms in {ˆz, zˆ ∗ , ξ } is either zˆ , zˆ ∗ , ξ , 1 − ξ , or 0. Thus, using Lemma 1 part 3, we have hj Uj s/sj ≤ 2−sj /s Uj s/sj .

(3.12)

Now, since Uj is a product of sj terms, say uk1 , . . . , uksj , applying H¨older’s inequality again (using 1/sj + · · · + 1/sj = 1/(s/sj )) we have Uj s/sj ≤ uk1 s · · · uksj s . Combining this with Eqs. 3.11 and 3.12, we get τσ (U ) ≤ 2−s1 /s · · · 2−s /s u1 s · · · us s , and since s1 + · · · + s = s, this reduces to Eq. 3.10.

Hypercontractivity in Non-Commutative Holomorphic Spaces

631

We now apply Lemma 3 to estimate the three terms in Eq. 3.9. The first term is merely τσ (|b|2n ) = b2n 2n . In the first sum n

τσ (v 0ν ),

m=1 |ν|=m

the term v 0ν , with |ν| = m, is a product containing m factors of v 01 = ξ c∗ c and n − m factors of v 00 = b∗ b. So there are a total of 2n factors from the set {b, b∗ , c, c∗ } ⊂ C (J × {0, 1}, σ |J ). Since u2n = u∗ 2n for each u ∈ C (J × {0, 1}, σ |J ), Lemma 3 then implies that τσ (v 0ν ) ≤

1 (b2n )2(n−m) (c2n )2m . 2

Hence, n

n 1 τσ (v ) ≤ (b22n )n−m (c22n )m 2 m=1 |ν|=m m=1 |ν|=m n 1 n (b22n )n−m (c22n )m . = m 2 0ν

(3.13)

m=1

Now we consider the second sum n/2 n−k

k=1 m=k |η|=2k

τσ (v ην ).

ν∈A(η)

|ν|=m

In each term v ην , since |η| = 2k and ν ∈ A(η), we know that k of the terms are v 10 and k of the terms are v 11 . So k of the 1s in ν have been accounted for with the v 11 terms, and since |ν| = m precisely m − k terms must be v 01 . As the total number of terms must be n, this means the remaining v 00 terms are n − (2k + m − k) = n − m − k in number. So, there are – – – –

k factors of v 10 = b∗ zˆ c, so k factors each of b∗ and c, k factors of v 11 = c∗ zˆ b, so k factors each of b and c∗ , m − k factors of v 01 = ξ c∗ c, so m − k factors each of c and c∗ , and n − m − k factors of v 00 = b∗ b, so n − m − k factors each of b and b∗ .

In total, then, v ην contains 2k + 2(n − m − k) = 2(n − m) factors of b or b∗ , and 2k + 2(m − k) = 2m factors of c or c∗ . Applying Lemma 3 again, n/2 n−k

k=1 m=k |η|=2k

ν∈A(η)

|ν|=m

τσ (v ην ) ≤

n/2 n−k 1 (b2n )2(n−m) (c2n )2m . 2 k=1 m=k |η|=2k

ν∈A(η)

|ν|=m

We must now n count the number of pairs (η, ν) with |η| = 2k, ν ∈ A(η) and |ν| = m. There are 2k such η. We know that ν is alternating on 11(η), and so the corresponding subsequence must be either 0101 . . . 01 or 1010 . . . 10, giving two choices, and exhausting k of the m 1s in ν. Finally, since |11(η)| = 2k, there are n − 2k 0s in η, and ν is

632

T. Kemp

unconstrained there; hence, there are n−2k m−k choices. Whence, the number of pairs (η, ν) in the sum is n n − 2k 2 . 2k m−k This gives the estimate n/2 n−k

k=1 m=k |η|=2k

≤

n/2 n−k k=1 m=k

n 2k

τσ (v ην )

ν∈A(η)

|ν|=m

n − 2k (b22n )n−m (c22n )m m−k

(3.14)

for the final sum in Eq. 3.9. It will be convenient to reorder the terms in Eq. 3.14 so that m occurs first. Since the sum (for each k) ranges from m = k to m = n − k, the pairs (k, m) in the sum are those with 1 ≤ k ≤ n/2 and k ≤ m ≤ n − k. The second condition gives two inequalities: k ≤ m and k ≤ n − m. Note, if both of these are satisfied then, summing, 2k ≤ n – the first condition is automatically satisfied. The sum can therefore be rewritten as n m∧(n−m) n n − 2k (b22n )n−m (c22n )m . (3.15) 2k m−k m=1

k=1

So, combining Eqs. 3.9, 3.13, and 3.15, we have the estimate 2n a2n 2n ≤ b2n +

n

χm (b22n )n−m (c22n )m ,

(3.16)

m=1

where the coefficient χm is given by m∧(n−m) n n − 2k 1 n . + χm = 2k m−k 2 m k=1

The following proposition shows that χm is optimally bounded to yield the necessary strong hypercontractive estimate. We state it without proof; the reader may do the necessary calculations. Proposition 7. The coefficients χm satisfy n n m χm ≤ . 2 m This inequality is an equality in the case m = 1. Applying Proposition 7 to Eq. 3.16, we have n n n m 2n a2n ≤ b + (b22n )n−m (c22n )m . 2n 2n m 2 m=1

We now complete the proof of Theorem 6.

(3.17)

Hypercontractivity in Non-Commutative Holomorphic Spaces

633

Proof (of the ‘if’ direction of Theorem 6). For a = b + zˆ c, we have at = bt + e−t zˆ ct , where at = e−tNσ a and so forth. Using the estimate in Eq. 3.17, we have n n n m −2mt 2n 2n e (bt 22n )n−m (ct 22n )m . at 2n ≤ bt 2n + m 2 m=1

Now, suppose t ≥ tJ (2, 2n) = 21 log n. Then e−2mt ≤ n−m . Since b, c ∈ H(J, σ |J ) it follows from the inductive hypothesis that bt 2n ≤ b2 and ct 2n ≤ c2 . Thus, n n n m −m 2n 2n n (b22 )n−m (c22 )m at 2n ≤ b2 + m 2 m=1 n 1 = b22 + c22 , 2 and from Eq. 3.8, this equals a2n 2 . This proves the theorem.

4. Speicher’s Stochastic Interpolation In this final section, we consider creation and annihilation operators βj , βj∗ on L2 (C , τσ ) which bear the same relation to the generators xj in C as the creation and annihilation operators cq , cq∗ bear to the q-Gaussian variables Xq ∈ q . We use these operators, together with a non-commutative central limit theorem of Speicher, to approximate the Lp (Hq , τq )-norm by the norm on Lp (H, τσ ), and thus transfer Theorem 6 from the context of the mixed spin holomorphic algebras to the arena of the q-holomorphic algebras, proving Theorem 4. All of the techniques in this section are analogs of Biane’s ideas in [B1]. 4.1. Creation and Annihilation operators on L2 (C , τσ ). Define operators βj on L2 (C (I, σ ), τσ ) by xj xA , if j ∈ /A βj (xA ) = . 0, if j ∈ A One may readily verify that the adjoint of βj is given by 0, if j ∈ /A ∗ βj (xA ) = . xj xA , if j ∈ A In the case σ (i, j ) = 1 for i = j , these are the B´eb´e Fock operators on the toy Fock space of [M]. In general, βj and βj∗ mimic the creation and annihilation operators. It is easy to see from dimension considerations that the ∗-algebra they generate is all of B(L2 (C (I, σ ), τσ )). We also have βj + βj∗ = xj ,

(4.1)

as a left-multiplication operator on C (I, σ ). One can readily compute that these operators σ -commute – i.e. βi βj = σ (i, j )βj βj if i = j . They also satisfy the σ -relations βi∗ βj − σ (i, j )βj βi∗ = δij ,

634

T. Kemp

just like the operators zˆ j ∈ C (I × {0, 1}, σ ). In fact, the map zˆ j → βj induces a ∗isomorphism from C (I × {0, 1}, σ ) onto B(L2 (C (I, σ ))). (In the case σ ≡ −1, this reduces to the well known isomorphism from the complex Clifford algebra C2n onto the full matrix algebra M2n (C).) Beware, however: this isomorphism does not send τσ to the normalized trace tr on B(L2 (C (I, σ ))), as pointed out in Sect. 3.2. The operators βj , βj∗ demonstrate concretely that the pure state β → (β1, 1)σ , the extension of τσ to L2 (C (I, σ ), τσ ), is not tracial. Indeed, it is easy to calculate that (βj βj∗ 1, 1)σ = 0 while (βj∗ βj 1, 1)σ = 1. These are, however, the same covariance relations that the operators cq and cq∗ satisfy with respect to the pure state A → (A, )q on B(Fq ). It is additionally true that (βj 1, 1)σ = (βj∗ 1, 1)σ = 0, also in line with the operators cq and cq∗ . The following lemma shows that the state (·1, 1)σ factors over naturally ordered products of the operators βj and βj∗ . It is proved in [B1]. Lemma 4. For each j ∈ I , let αj be in the ∗-algebra generated by βj . Let j1 , . . . , js be s distinct elements in I . then (αj1 · · · αjs 1, 1)σ = (αj1 1, 1)σ · · · (αjs 1, 1)σ . 4.2. Speicher’s central limit theorem. Fix q ∈ [−1, 1]. We consider the family of random matrices Sq , consisting of all those infinite symmetric random matrices σ : N∗ × N∗ → {−1, 1} constantly −1 on the diagonal, for which {σ (i, j ) ; i < j } are i.i.d. with P(σ = 1) = (1 + q)/2. Note, then, P(σ = −1) = (1 − q)/2, and so E(σ (i, j )) =

1+q 1−q ·1+ · −1 = q. 2 2

This family of random matrices features prominently in the main theorem of [S], which we will use to prove Theorem 4. Let In denote the set {1, . . . , n}, and let σ ∈ Sq . For convenience, we denote the algebra C (In × {0, 1}, σ |In ) as C (n, σ ). The creation operators on C (n, σ ) are labeled by pairs (j, ζ ) where j ∈ In and ζ ∈ {0, 1}; to avoid confusion, we also index them as σ to keep track of the dependence on σ . Let d be a positive integer, and define new βj,ζ σ,n σ,n variables β1,ζ , . . . , βd,ζ , which act on C (nd, σ ), by σ,n βk+1,ζ

n(k+1) 1 σ =√ β ,ζ , n

0 ≤ k ≤ d − 1.

=nk+1

These operators are constructed to approximate the operators cq . The intuition is: due σ,n to the expectation of the matrix σ ∈ Sq , for large n the βj,ζ satisfy commutation relations close to the q-commutation relations of Eq. 2.1. Speicher’s central limit theorem (Theorem 2 in [S]) makes this statement precise, but requires that the matrix of spins for the different variables have independent (upper triangular) entries. In our case, since for each pair i, j the entries σ ((i, ζ ), (j, ζ )) are the same for all choices of ζ, ζ ∈ {0, 1}, the matrix is only block-independent (with blocks of size 2 × 2). Nevertheless, as with the classical central limit theorem, a straightforward modification of Speicher’s proof generalizes the theorem to this case. We thus have the following theorem.

Hypercontractivity in Non-Commutative Holomorphic Spaces

635

Theorem 7. Let e1 , . . . , ed be an orthonormal basis for Rd . Among the operators q q cq (ej , ek ) on Fq (Rd ⊕ Rd ), denote cq (ej , 0) as cj,0 , and denote cq (0, ej ) as cj,1 . Let Q be a polynomial in 4d non-commuting variables. For almost every σ ∈ Sq , σ,n σ,n σ,n ∗ σ,n ∗ , . . . , βd,1 , (β1,0 ) , . . . , (βd,1 ) )1, 1)σ lim (Q(β1,0

n→∞

= (Q(c1,0 , . . . , cd,1 , (c1,0 )∗ , . . . , (cd,1 )∗ ), )q . q

q

q

q

Proof. This follows from Speicher’s central limit theorem. The required covariance σ were verified above, and the factorization of naturallyconditions for the operators βj,ζ ordered products is the content of Lemma 4. An immediate corollary is that the moments of elements in Hq (Cd ) can be approximated σ,n σ,n ∗ by the corresponding elements in C (nd, σ ). To be precise: let xjσ,n = βj,0 + (βj,0 ) σ,n σ,n σ,n ∗ and let yj = βj,1 + (βj,1 ) . By Eq. 4.1, xjσ,n +1

n(j +1) 1 σ =√ x , n =nj +1

yjσ,n +1

n(j +1) 1 σ =√ y . n =nj +1

Let zjσ,n = 2−1/2 (xjσ,n + iyjσ,n ), which is in H(Ind , σ |Ind ). q

Proposition 8. Denote Zq (ej ) as Zj . Let r be an even integer, and let P be a polynomial in d non-commuting variables. For almost every σ ∈ Sq , q

q

lim P (z1σ,n , . . . , zdσ,n )Lr (H,τσ ) = P (Z1 , . . . , Zd )Lr (Hq ,τq ) .

n→∞

Proof. Let Q be the polynomial in 4d non-commuting variables defined by σ,n σ,n σ,n ∗ σ,n ∗ Q(β1,0 , . . . , βd,1 , (β1,0 ) , . . . , (βd,1 ) ) = P (z1σ,n , . . . , zdσ,n )∗ P (z1σ,n , . . . , zdσ,n ). σ,n σ,n Such a polynomial exists because the variable zjσ,n is a (linear) polynomial in βj,0 , βj,1 , and their adjoints. By definition, the same polynomial yields

Q(c1,0 , . . . , cd,1 , (c1,0 )∗ , . . . , (cd,1 )∗ ) = P (Z1 , . . . , Zd )∗ P (Z1 , . . . , Zd ). q

q

q

q

q

q

q

q

Applying Theorem 7 to the polynomial Qm , we have q

q

lim τσ |P (z1σ,n , . . . , zdσ,n )|2m = τq |P (Z1 , . . . , Zd )|2m

n→∞

a.s.[σ ],

where we have used the fact that (·1, 1)σ reduces to τσ when applied to elements of C (nd, σ ). We will also need to know that the semigroup e−tNσ approximates e−tNq . Proposition 9. Let r be an even integer, and let P be a polynomial in d non-commuting variables. For t > 0, and for almost every σ ∈ Sq , lim e−tNσ P (z1n,σ , . . . , zdn,σ )r = e−tNq P (Z1 , . . . , Zd )r . q

n→∞

q

636

T. Kemp q

q

q

q

Proof. We can expand P (Z1 , . . . , Zd ) as a linear combination of monomials Zi1 · · · Zi . Each such monomial is an eigenvector of e−tNq with eigenvalue e− t . So it is easy to see that there is a unique polynomial Pt such that Pt (Z1 , . . . , Zd ) = e−tNq P (Z1 , . . . , Zd ). q

q

q

q

· · · ziσ,n . Since ziσ,n is a linear combination of z1σ , . . . , Now, consider the polynomials ziσ,n 1 σ , this polynomial may be expanded as a linear combination of monomials zσ · · · zσ znd j1 j with 1 ≤ j1 , . . . , j ≤ nd. From Eq. 3.4, if any two indices are equal, then zjσ1 · · · zjσ = 0; otherwise, it is of degree . Hence e−tNσ (zjσ1 · · · zjσ ) = e− t (zjσ1 · · · zjσ ). It follows that e−tNσ (ziσ,n · · · ziσ,n ) = e− t (zjσ1 · · · zjσ ). Thus, we see that 1 Pt (z1σ,n , . . . , zdσ,n ) = e−tNσ P (z1σ,n , . . . , zdσ,n ). The theorem now follows by applying Proposition 8 to the polynomial Pt .

It should be noted that this elementary argument fails in the full algebra C (nd, σ ); for example, (x1σ )2 = 1 is of degree 0, while Xq (e1 )2 is of degree 2 if q > −1. The relevant statement is still true in that case, but a much more delicate argument (which can be found in [B1]) is necessary to prove it. We now conclude with the end of the proof of Theorem 4. Proof (of Theorem 4). First note that the sharpness of the Janson time tJ (p, r) for any p, r > 0 can be confirmed by an argument identical to the one in the proof of Theorem 6. For sufficiency, by standard arguments it is enough to prove the theorem for the finite dimensional Hilbert space H = Rd , and moreover it suffices to prove it for elements q q f ∈ L2 (Hq , τq ) that are polynomials f = P (Z1 , . . . , Zd ) of the generators. Let r be an even integer, and let t ≥ tJ (2, r). By Proposition 9, e−tNq f r = lim e−tNσ P (z1σ,n , . . . , zdσ,n )r n→∞

a.s.[σ ].

By Theorem 6 applied to the algebra H(Ind , σ |Ind ), e−tNσ P (z1σ,n , . . . , zdσ,n )r ≤ P (z1σ,n , . . . , zdσ,n )2 . Finally, applying Proposition 8, we have lim P (z1σ,n , . . . , zdσ,n )2 = f 2

n→∞

This completes the proof.

a.s.[σ ].

Acknowledgement. I would like to thank Len Gross for much insight, and for suggesting a thesis problem which led to this work. I would also like to thank Philippe Biane, Claus Koestler, and Roland Speicher for useful conversations.

Hypercontractivity in Non-Commutative Holomorphic Spaces

637

References [B1] [B2] [BSZ] [BSp] [BKS] [CL] [DN] [G1] [G2] [G3] [G4] [G5]

[Gli] [J] [JPR] [K] [L] [LM] [M] [N1] [N2] [PX] [R] [S] [Se1] [Se2] ´ [Sn] [V]

Biane, P.: Free hypercontractivity. Commun. Math. Phys. 184, 457–474 (1997) Biane, P.: Segal-Bargmann transform, functional calculus on matrix spaces and the theory of semi-circular and circular systems. J. Funct. Anal. 144, 232–286 (1997) Baez, J.C., Segal, I.E., Zhou, Z.: Introduction to algebraic and constructive quantum field theory. Princeton Series in Physics. Princeton, Princeton University Press, 1992 Bozejko, M., Speicher, R.: An example of a generalized Brownian motion. Commun. Math. Phys. 137, 519–531 (1991) Bozejko, M., K¨ummerer, B., Speicher, R.: q-Gaussian processes: non-commutative and classical aspects. Commun. Math. Phys. 185, 129–154 (1997) Carlen, E., Lieb, E.: Optimal hypercontractivity for Fermi fields and related non-commutative integration inequalities. Commun. Math. Phys. 155, 26–46 (1993) Dykema, K.; Nica, A.: On the Fock representation of the q-commutation relations. J. Reine Angew. Math. 440, 201–212 (1993) Gross, L.: Existence and uniqueness of physical ground states. J. Funct. Anal. 10, 52–109 (1972) Gross, L.: Logarithmic Sobolev inequalities. Amer. J. Math. 97, 1061–1083 (1975) Gross, L.: Hypercontractivity and logarithmic Sobolev inequalities for the Clifford-Dirichlet form. Duke Math. J. 42, 383–396 (1975) Gross, L.: Hypercontractivity over complex manifolds. Acta. Math. 182, 159–206 (1999) Gross, L.: Hypercontractivity, logarithmic Sobolev inequalities and applications: a survey of surveys. To appear in Diffusion, quantum theory, and radically elementary mathematics: a celebration of Edward Nelson’s contributions to science. Edited by William G. Faris. Princeton University Press, 2006 Glimm, J.: Boson fields with nonlinear self-interaction in two dimensions. Commun. Math. Phys. 8, 12–25 (1968) Janson, S.: On hypercontractivity for multipliers on orthogonal polynomials. Ark. Math. 21, 97–110 (1983) Janson, S., Peetre, J., Rochberg, R.: Hankel forms and the Fock space. Rev. Mat. Iberoamericana 3(1), 61–138 (1987) Krolak, I.: Contractivity properties of the Ornsetein-Uhlenbeck semigroup for general commutation realtions. To appear in Mathematische Zeitschrift Lindsay, J.M.: Gaussian hypercontractivity revisited. J. Funct. Anal. 92, 313–324 (1990) Lindsay, J.M., Meyer, P.A.: Fermionic hypercontractivity. In: Quantum probability VII, Singapore: World Scientific, 1992, pp. 211–220 Meyer, P.A.: Quantum Probability for Probabilists. Second edition, Lecture Notes in Mathematics, Vol. 1538, Berlin–Heidelberg–New York: Springer, 1995 Nelson, E.: A quartic interaction in two dimensions. In: Mathematical Theory of Elementary Particles, Cambridge, MA: M.I.T. Press, 1965, pp. 69–73 Nelson, E.: The free Markov field. J. Funct. Anal. 12, 211–227 (1973) Pisier, G., Xu, Q.: Non-commutative Lp -spaces. In: Handbook of the geometry of Banach spaces, Vol. 2, Amsterdam: North-Holland, 2003, pp. 1459–1517 Ricard, E.: Factoriality of q-Gaussian von Neumann algebras. Commun. Math. Phys. 257, 659– 665 (2005) Speicher, R.: A non-commutative central limit theorem. Math. Zeit. 209, 55–66 (1992) Segal, I.: Tensor algebras over Hilbert spaces, II. Ann. of Math. 63, 160–175 (1956) Segal, I.: Construction of non-linear local quantum processes, I. Ann. of Math. 92, 462–481 (1970) ´ Sniady, P.: Gaussian random matrix models for q-deformed Gaussian variables. Commun. Math. Phys. 216, 515–537 (2001) Voiculescu, D.V.: Symmetries of some reduced free product C ∗ algebras. In: Operator Algebras and their Connection with Topology and Ergodic Theory, Lecture Notes in Mathematics, Vol. 1132, Berlin-Heidelberg-New York: Springer, 1985, pp. 566–588

Communicated by Y. Kawahigashi

Commun. Math. Phys. 259, 639–677 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1380-z

Communications in

Mathematical Physics

Smoothness of Invariant Manifolds for Nonautonomous Equations Luis Barreira, Claudia Valls Departamento de Matem´atica, Instituto Superior T´ecnico, 1049-001 Lisboa, Portugal. E-mail: [email protected]; [email protected] Received: 8 November 2004 / Accepted: 8 February 2005 Published online: 28 June 2005 – © Springer-Verlag 2005

Abstract: For semiflows generated by ordinary differential equations v = A(t)v admitting a nonuniform exponential dichotomy, we show that for any sufficiently small perturbation f there exist smooth stable and unstable manifolds for the perturbed equation v = A(t)v + f (t, v). As an application, we establish the existence of invariant manifolds for the nonuniformly hyperbolic trajectories of a semiflow. In particular, we obtain smooth invariant manifolds for a class of vector fields that need not be C 1+α for any α ∈ (0, 1). To the best of our knowledge no similar statement was obtained before in the nonuniformly hyperbolic setting. We emphasize that we do not need to assume the existence of an exponential dichotomy, but only the existence of a nonuniform exponential dichotomy, with sufficiently small nonuniformity when compared to the Lyapunov exponents of the original linear equation. Furthermore, for example in the case of stable manifolds, we only need to assume that there exist negative Lyapunov exponents, while we also allow zero exponents. Our proof of the smoothness of the invariant manifolds is based on the construction of an invariant family of cones.

1. Introduction 1.1. Exponential dichotomies and invariant manifolds. We want to establish the existence of smooth stable and unstable invariant manifolds for ordinary differential equations v = A(t)v + f (t, v), assuming the existence of a nonuniform exponential dichotomy for the linear equation v = A(t)v.

(1)

Supported by the Center for Mathematical Analysis, Geometry, and Dynamical Systems, and through Funda¸ca˜ o para a Ciˆencia e a Tecnologia by Program POCTI/FEDER, Program POSI, and the grant SFRH/BPD/14404/2003.

640

L. Barreira, C. Valls

In a certain sense, this is the weakest possible setting in which one can construct the above invariant manifolds (full details are given below). We also want to consider semiflows instead of only flows. The classical notion of exponential dichotomy, that we call here uniform exponential dichotomy, demands considerably from the dynamics and it is of interest to look for more general types of hyperbolic behavior. These generalizations can be much more typical than the notion of uniform exponential dichotomy. This is precisely what happens with the notion of nonuniform exponential dichotomy: essentially any linear equation as in (1) admits such a dichotomy (full details are given below). We refer to [4] for a related detailed discussion (see also the text after Theorem 1 below and Sect. 2.2). On the other hand, we emphasize that there exist large classes of linear differential equations possessing uniform exponential dichotomies. Furthermore, the corresponding theory and its applications are widely developed. We refer to the books [5–7, 9, 15] for details and further references related to uniform exponential dichotomies. In order to formulate our stable manifold theorem, we first briefly describe the setup. Consider a C 1 function t → A(t) such that A(t) is a n × n real matrix for each t ≥ 0. We assume that all solutions of (1) are global in the future, i.e., are defined for every t ≥ 0. Let T (t, s) be the evolution operator associated with Eq. (1). This is the operator satisfying T (t, s)v(s) = v(t) for every solution v(t) of (1) and every t ≥ s. We assume that with respect to some invariant decomposition Rn = E × F (which does not depend on time) the evolution operator T (t, s) can be written in the form T (t, s) = (U (t, s), V (t, s)). We say that Eq. (1) admits a nonuniform exponential dichotomy if there exist constants a≤a<0≤b≤b

and θ, K > 0

(2)

such that for every t ≥ s ≥ 0, U (t, s) ≤ Kea(t−s)+θs ,

U (t, s)−1 ≤ Ke−a(t−s)+θt ,

V (t, s) ≤ Keb(t−s)+θs ,

V (t, s)−1 ≤ Ke−b(t−s)+θt .

(3)

The first four constants in (2) play the role of Lyapunov exponents, while θ measures the nonuniformity of the dichotomy. The assumption a < 0 means that there is at least one negative Lyapunov exponent. We could also consider an analogous version with a ≤ a ≤ 0 < b ≤ b, that would correspond to the existence of at least one positive Lyapunov exponent. We now consider the perturbed equation v = A(t)v + f (t, v),

(4)

where f (t, v) is a C 1 function defined for t ≥ 0 and v ∈ Rn , such that f (t, 0) = 0 for every t ≥ 0 (and thus the origin is a solution of (4)). We can now formulate our stable manifold theorem. We will use the notation R+ for (0, +∞). Theorem 1. Assume that Eq. (1) admits a nonuniform exponential dichotomy, and that there exist c > 0 and q > 1 such that f (t, u) − f (t, v) ≤ cu − v(uq + vq ) for every t ≥ 0 and u, v ∈ Rn . If the conditions ¯ (2 − q)θ } and a + θ < b q a¯ + 4θ < min{a¯ − b, hold, then there is a

C1

manifold V ⊂

R × Rn

with the following properties:

(5)

Smoothness of Invariant Manifolds

641

1. R+ × {0} ⊂ V and T(t,0) V = R × E for every t > 0; 2. V is forward invariant under the semiflow τ on R+ × Rn generated by the system t = 1,

v = A(t)v + f (t, v);

(6)

3. there exists D > 0 such that for every (s, u), (s, v) ∈ V and τ ≥ 0, we have τ (s, u) − τ (s, v) ≤ Deaτ +θs u − v. See Fig. 1 for an illustration. We can also formulate an analogous statement concerning the existence of unstable manifolds, essentially by reversing time in the above setting (see Sect. 6). Theorem 1 is an immediate consequence of Theorems 2 and 3 below. We refer to Sect. 4 for a detailed formulation of the theorem (and in particular for information on the size of the manifold V). We note that the second inequality in (5) is always satisfied when θ is sufficiently small. The existence of a Lipschitz manifold with the properties in Theorem 1 (with the exception of the identity T(t,0) V = R × E for every t > 0) was established in [4] under somewhat weaker assumptions: namely, with q > 1 replaced by q > 0, with the first inequality in (5) replaced by q a¯ + (q + z)θ < 0, and without requiring the C 1 regularity of the perturbation f (see Sect. 3). The proof in the present paper of the regularity of the Lipschitz manifold V is of very different nature from the existence arguments in [4]. Also taking into account the length of the proofs we decided to separate the two. We refer the reader to [3, 4] for more details on the notion of nonuniform exponential dichotomy. In particular, we showed in [3] that any equation v = A(t)v with A(t) as above, at least with one negative Lyapunov exponent, admits a nonuniform exponential dichotomy. We also showed in [3] that the smallness of the nonuniformity is a rather common phenomenon from the point of view of ergodic theory: almost all linear variational equations obtained from a measure-preserving flow on a smooth Riemannian manifold admit a nonuniform exponential dichotomy with arbitrarily small nonuniformity.

1.2. Relation with the nonuniform hyperbolicity theory. We now discuss the relation and novelty of our work with respect to the theory of nonuniformly hyperbolic dynamics (we refer to [1] and to the supplement in [10] for detailed expositions of parts of the theory and to the survey [2] for a detailed description of its contemporary status). We note that our definition of nonuniform exponential dichotomy is inspired not only in the notion of uniform exponential dichotomy but also in the notion of nonuniformly hyperbolic trajectory. Smooth invariant manifolds were first obtained for nonuniformly hyperbolic trajectories by Pesin in [12] (see [1, 2] for details). Nonuniform contractions. The simpler case when the linear variational equation in (1) generates a nonuniform contraction, or equivalently when all the Lyapunov exponents of (1) are negative, is considered in [1]: it is shown that the zero solution of the perturbed equation in (4) is asymptotically (exponentially) stable provided that the perturbation f and the nonuniformity θ of the nonuniform contraction (see (3)) are sufficiently small. In [4] we generalized the approach in [1] to the more general situation when not all the Lyapunov exponents are negative. In particular, we established the existence of stable Lipschitz manifolds, corresponding to the space of vectors with negative Lyapunov exponents. This means that the zero solution of the nonlinear equation in (4) is asymptotically (exponentially) stable along these invariant sets. Our work requires a substantial

642

L. Barreira, C. Valls

elaboration of the approach in [1]. In particular, it was necessary to consider two fixedpoint problems—one to obtain an a priori estimate for the speed of decay of the stable component of solutions along a given graph, and the other to obtain the graph which is the stable manifold—while in the case of contractions we only need the simpler a priori estimate, since in this case the stable manifold is the whole space. Nonuniformly hyperbolic trajectories. Consider a C 1+α flow on a compact manifold without fixed points. It is well known in the nonuniform hyperbolicity theory that any such flow generates a nonautonomous linear differential equation—the linear variational equation—admitting a nonuniform exponential dichotomy along any of its nonuniformly hyperbolic trajectories, i.e., those with at least one negative Lyapunov exponent. Furthermore, there exists a stable manifold theorem (as well as an unstable manifold theorem), due to Pesin (we refer to [1] for full details). We emphasize that the methods used in this setting (as described for example in [1]) cannot be used in our work, at least without further changes. Namely: 1. We want to consider semiflows and not only flows. In particular, it is thus in general impossible to introduce the same adapted Lyapunov norms as in the case of flows. On the other hand, we still require some appropriate device that can play a similar role in the case of semiflows. This caused several difficulties in our approach (see also the related discussion in Sect. 1.3 and the last paragraphs of Sects. 4.1 and 5.6). 2. In the nonuniform hyperbolicity theory the C 1+α hypothesis plays a crucial role, in particular for the existence of invariant manifolds (see Sect. 4.2), not to mention for the absolute continuity and the study of the ergodic properties of the dynamics. On the other hand, as an application of Theorem 1, we are able to establish the existence of invariant manifolds for some C 1 flows (and semiflows) which need not be C 1+α for any α ∈ (0, 1) (see Sect. 4.2). Furthermore, our proof of the stable manifold theorem is new even for C 1+α flows. Infinite-dimensional setting. The first related results in Hilbert spaces were established by Ruelle in [14]. The case of transformations in Banach spaces under some compactness assumptions was considered by Ma˜ne´ in [11] (including the case of differentiable maps with compact derivative at each point). These results were extended in [16] to a class of transformations satisfying a certain asymptotic compactness. There are also results in the literature for partial differential equations and functional differential equations. However, to the best of our knowledge, all these results consider only the case of uniform exponential dichotomies. For details and references one can consult the books mentioned in Sect. 1.1 and also [8]. We are also interested in infinite-dimensional systems. In particular, we considered in [4] differential equations in Banach spaces and we established the existence of Lipschitz stable manifolds when Rn is replaced by a Banach space X, and A(t) is a bounded linear operator on X for each t ≥ 0. The present paper concerns primarily the regularity of the invariant manifolds. We could of course proceed in a similar manner to that in [4] to establish the regularity through a fixed-point problem, obtained from the original equation essentially by formally taking derivatives, although in a slightly more elaborated manner. However, this approach would require an amount of regularity for the vector field which is not optimal. One of our objectives is precisely to describe how an appropriate elaboration of the method of invariant families of cones can be used to show, now in the case of nonuniformly hyperbolic dynamics, that to establish the smoothness of the invariant manifolds

Smoothness of Invariant Manifolds

643

we do not require any additional regularity for the vector field. On the other hand, this approach causes some complications when we consider infinite-dimensional systems. These difficulties are carefully explained at the end of Sects. 3 and 5 when we can already refer to the appropriate places in the proofs. But the main difficulty is that we use in a decisive manner the compactness of the unit ball in Rn . 1.3. Method of proof. The proof of the smoothness of the invariant manifolds is based on the construction of an invariant family of cones along each orbit. These invariant families allow us to obtain an invariant distribution that is shown to coincide with the tangent space of the invariant manifold. This procedure also allows us to discuss the continuity of the distribution, and thus of the continuity of the tangent spaces, that corresponds to the smoothness of the invariant manifold. A feature of our approach is that we deal directly with the semiflows instead of first considering time-one maps as it is sometimes customary in the theory of hyperbolic dynamics (both uniform and nonuniform). A considerable difficulty in our work is that, since we are dealing in general with semiflows instead of flows, it is impossible to introduce the adapted Lyapunov norms which are standard in the case of flows. However, to study the regularity of the invariant manifolds in terms of an invariant family of cones we still must show the invariance of the whole family, which is somewhat delicate for small time, i.e., before sufficient time has passed so that the contraction given by a in (3) overcomes the nonuniformity (which depends on the initial time s). This prevents us from using time-one maps, at least without some appropriate preliminary preparation. We prefer to deal from the beginning with the original semiflow and the above difficulties are overcome by introducing what can still be seen as families of adapted Lyapunov norms, although with a new procedure developed here for the case of semiflows (see Sect. 5.2). The structure of the paper is the following. We describe our setup and the notion of nonuniform exponential dichotomy in Sect. 2. The existence results from [4] are briefly recalled in Sect. 3, thus making the paper self-contained. Our stable manifold theorem is formulated in Sect. 4 and proven in Sect. 5. An application of the stable manifold theorem to nonuniformly hyperbolic trajectories is also included in Sect. 4. The case of unstable manifolds is considered in Sect. 6, and is obtained as a consequence of the former results by reversing time. In Sect. 7 we apply our results to perturbations of linear equations with nonzero Lyapunov exponents (we note that any such equation admits nonuniform exponential dichotomies). 2. Preliminaries + 2.1. Setup. Let A : R+ 0 → Mn (R) be a continuous function, where R0 = [0, +∞) and Mn (R) is the set of n × n real matrices. Consider the initial value problem

v = A(t)v,

v(s) = vs ,

(7)

with s ≥ 0 and vs ∈ Rn . We assume that each solution of (7) is defined

for every t ≥ s.

(8)

We want to study nonlinear perturbations of Eq. (7). Namely, consider a continuous n n function f : R+ 0 × R → R such that f (t, 0) = 0

for every t ≥ 0.

(9)

644

L. Barreira, C. Valls

We assume that there exist c > 0 and q > 0 such that f (t, u) − f (t, v) ≤ cu − v(uq + vq ) for every t ≥ 0 and u, v ∈

Rn .

(10)

Consider the initial value problem

v = A(t)v + f (t, v),

v(s) = vs ,

(11)

with s ≥ 0 and vs ∈ note that v(t) ≡ 0 is a solution of (11). We assume that there is a decomposition Rn = E ×F (independent of t), with respect to which A(t) has the block form B(t) 0 A(t) = . (12) 0 C(t) Rn . We

The blocks B(t) and C(t) will correspond respectively to the stable and center–unstable components of A(t) (see Sect. 2.2). Due to the block form in (12), the unique solution of (7) can be written in the form v(t) = (U (t, s)ξ, V (t, s)η)

for t ≥ s,

(13)

with vs = (ξ, η) ∈ E × F , where U (t, s) and V (t, s) are the evolution operators associated respectively with the blocks B(t) and C(t). We also write f = (g, h) with values in E × F . Clearly, g(t, 0) = 0 and h(t, 0) = 0

for every t ≥ 0,

(14)

and g(t, u) − g(t, v) ≤ cu − v(uq + vq ), h(t, u) − h(t, v) ≤ cu − v(uq + vq )

(15)

for every t ≥ 0 and u, v ∈ Rn . We note that since all norms in R2 are equivalent, assuming that q > 1, the q-norm (uq + vq )1/q is equivalent to the 1-norm u + v. In this case, one can thus replace the factor uq +vq by (u+v)q in each inequality in (15), up to a multiplicative constant. We now write v = (x, y) ∈ E × F . Due to the block form in (12), given s ≥ 0 and vs = (ξ, η) ∈ E × F the problem (11) is equivalent to the system x = B(t)x + g(t, x, y),

y = C(t)y + h(t, x, y),

with (x(s), y(s)) = (ξ, η). We denote by (x(t), y(t)) = (x(t, s, ξ, η), y(t, s, ξ, η)) the unique solution of this problem or, equivalently, of the problem ρ x(ρ) = U (ρ, s)ξ + U (ρ, r)g(r, x(r), y(r)) dr, s ρ y(ρ) = V (ρ, s)η + V (ρ, r)h(r, x(r), y(r)) dr

(16)

(17)

s

for ρ ≥ s. For each τ ≥ 0, we write τ (s, ξ, η) = (s + τ, x(s + τ, s, ξ, η), y(s + τ, s, ξ, η)). This is the semiflow generated by the equation in (11).

(18)

Smoothness of Invariant Manifolds

645

2.2. Nonuniform exponential dichotomies. We now present our hyperbolicity assumptions, namely the existence of a nonuniform exponential dichotomy. This is the weakest hypothesis under which we are able to establish the existence of a stable manifold. We emphasize that this assumption is much weaker than the existence of an exponential dichotomy. We write the unique solution of the initial value problem in (7) in the form v(t) = T (t, s)v(s), where T (t, s) is the associated evolution operator. Consider constants a≤a<0≤b≤b

and a, b ≥ 0.

(19)

Following [4], we say that the linear equation v = A(t)v admits a nonuniform exponential dichotomy if there is a continuous function P : R+ 0 → Mn (R) such that P (t) is a projection for each t ≥ 0 with P (t)T (t, s) = T (t, s)P (s) for every t ≥ s ≥ 0, and there exist constants a, a, a, b, b, b as in (19) and D1 , D2 ≥ 1 such that for every t ≥ s ≥ 0, T (t, s)P (s) ≤ D1 ea(t−s)+as ,

T (t, s)−1 P (t) ≤ D1 e−a(t−s)+at ,

T (t, s)Q(s) ≤ D2 eb(t−s)+bs ,

T (t, s)−1 Q(t) ≤ D2 e−b(t−s)+bt ,

where Q(t) = Id −P (t) is the complementary projection for each t ≥ 0. We refer to [4] for a detailed discussion of this notion. When A(t) has the block form in (12), we can rephrase the notion of nonuniform exponential dichotomy in the following (equivalent) manner: the evolution operators U (t, s) and V (t, s) define a nonuniform exponential dichotomy if there exist constants as in (19) and D1 , D2 ≥ 1 such that for every t ≥ s ≥ 0, U (t, s) ≤ D1 ea(t−s)+as ,

U (t, s)−1 ≤ D1 e−a(t−s)+at ,

V (t, s) ≤ D2 eb(t−s)+bs ,

V (t, s)−1 ≤ D2 e−b(t−s)+bt .

(20)

We note that in comparison with (3) we have introduced more constants in the definition, in order to understand better the effect of each of them in our results. This will also allow us to formulate in a sharper manner certain assumptions that involve the constants, such as the conditions in (5). We showed in [3] that for any equation v = A(t)v, having A(t) the block form in (12) for every t ≥ 0, if there is at least one negative Lyapunov exponent, then the evolution operators U (t, s) and V (t, s) define a nonuniform exponential dichotomy. 3. Existence of Lipschitz Stable Manifolds The existence of a Lipschitz stable manifold W for the equation in (11) was established in [4]. Our main aim in this paper is to show that W is a smooth manifold. In this section we give a brief self-contained presentation of the necessary material from [4]. We emphasize that there is no need to consult that paper in order to understand either the statements or the proofs in the present paper. We make the following assumptions: A1. the function A : R+ 0 → Mn (R) is continuous and satisfies (8) and (12) for every t ≥ 0;

646

L. Barreira, C. Valls

n n A2. the function f : R+ 0 × R → R is continuous and satisfies (9) and (10) for some c > 0 and q > 0.

Furthermore, we assume that the equation v = A(t)v admits a nonuniform exponential dichotomy. The set W is obtained as the graph of a Lipschitz function. We first describe the class of Lipschitz functions that will be considered. We denote by R(δ) ⊂ E the open ball of radius δ > 0 centered at zero. Fix now δ > 0 and κ > 0, and let β = a(1 + 1/q) + b/q,

(21)

with a and b as in (19). The number β specifies the size of the neighborhood R(δe−βs ) in which we take the initial condition at time s. We consider the set Zβ = {(s, ξ ) : s ≥ 0 and ξ ∈ R(δe−βs )} ⊂ R+ 0 × E, and we denote by Xβ the space of continuous functions ϕ : Zβ → F such that for each s ≥ 0, ϕ(s, 0) = 0 and ϕ(s, x) − ϕ(s, y) ≤ κx − y

for every x, y ∈ R(δe−βs ).

(22)

Given a function ϕ ∈ Xβ we consider the graph of ϕ, n W = {(s, ξ, ϕ(s, ξ )) : (s, ξ ) ∈ Zβ } ⊂ R+ 0 ×R .

(23)

We refer to the set W as a Lipschitz manifold. We note that W contains the line R+ 0 × {0} and the Lipschitz graph Ws = {(s, ξ, ϕ(s, ξ )) : ξ ∈ R(δe−βs )} for each fixed s ≥ 0. See Fig. 1.

Ws+τ Ws

s+τ W

p

s E

(s, ξ, ϕ(s, ξ ))

F Fig. 1. A local stable manifold W of the origin. In order that W is invariant under the semiflow τ we must have p = τ (s, ξ, ϕ(s, ξ ))

Smoothness of Invariant Manifolds

647

We also consider the constant ω = β + a = a(2 + 1/q) + b/q,

(24)

and the corresponding sets Zω and Xω , defined as before simply replacing the constant β by ω everywhere. We showed in [4] that for each sufficiently small δ > 0 there exists a function ϕ ∈ Xβ , with β as in (21), such that for each initial condition (s, ξ ) ∈ Zω ⊂ Zβ the corresponding solution of (11) with vs = (ξ, ϕ(s, ξ )) is entirely contained in W (see Theorem 2 below). In other words, for this particular ϕ the set W is forward invariant under the semiflow τ in (18). We consider the conditions qa + a + (a + b)/q < 0

and

a + b < b.

(25)

Note that both inequalities in (25) are automatically satisfied when a and b are sufficiently small (in view of (19)). We now present our result from [4] on the existence of stable manifolds. Theorem 2 ([4]). Assume that A1 and A2 hold. If the equation v = A(t)v admits a nonuniform exponential dichotomy and the conditions in (25) hold, then there exist δ > 0 and a unique function ϕ ∈ Xβ such that the set W in (23) is forward invariant under the semiflow τ , i.e., if (s, ξ ) ∈ Zω then τ (s, ξ, ϕ(s, ξ )) ∈ W for every τ ≥ 0. Furthermore: 1. for every (s, ξ ) ∈ Zω we have +∞ V (τ, s)−1 h(τ −s (s, ξ, ϕ(s, ξ ))) dτ ; ϕ(s, ξ ) = −

(26)

s

2. there exists D > 0 such that for every s ≥ 0, ξ , ξ ∈ R(δe−ωs ), and τ ≥ 0 we have τ (s, ξ, ϕ(s, ξ )) − τ (s, ξ , ϕ(s, ξ )) ≤ Deaτ +as ξ − ξ .

(27)

We call W a local stable manifold or simply a stable manifold of the origin. In particular, setting ξ = 0 in (27) we see that any solution of the initial value problem in (11) starting in W, i.e., with v(s) = (ξ, ϕ(s, ξ )) for some ξ ∈ R(δe−ωs ), approaches the zero solution with exponential speed a (which in particular is independent of ξ ). It also follows from Theorem 2 that if ξ ∈ R(δe−ωs ) then y(ρ, t (s, ξ, ϕ(s, ξ ))) = ϕ(ρ, x(ρ, t (s, ξ, ϕ(s, ξ ))))

(28)

for every ρ ≥ s + t and t ≥ 0. The fact that the initial condition ξ must be taken in a neighborhood of exponentially decreasing size R(δe−ωs ), with respect to the initial time s, is a manifestation of the extra exponential terms involving a and b in the norm bounds in (20) for the operators U (t, s) and V (t, s). We observe that only the first and the last inequalities in (20) in the definition of nonuniform exponential dichotomy are used in the proof of Theorem 2 in [4]. In the proof of the regularity of the stable manifold (see Theorem 3 below) we will need, in addition, the third inequality in (20). We also considered in [4] the case of differential equations in infinite-dimensional spaces. In particular, the statement in Theorem 2 is valid when Rn is replaced by a

648

L. Barreira, C. Valls

Banach space X, and A(t) is a bounded linear operator on X for each t ≥ 0. On the other hand, the arguments used in the present paper to establish the regularity of the Lipschitz manifold in Theorem 2 cannot be applied in the infinite-dimensional setting, at least without further changes (namely, the arguments in Sect. 5.4 below cannot be used; see the proof of Lemma 8). As such we have chosen in the present section to write the whole exposition for Rn . Nevertheless, our approach still offers in particular the possibility of immediate application to the context of perturbations of linear differential equations with nonzero Lyapunov exponents in finite-dimensional spaces (see Sect. 7). On the other hand, in the infinite-dimensional setting, these equations—which are one of our main motivations—still lack today a proper development of the classical Lyapunov–Perron theory which would be crucial for corresponding applications. 4. Smoothness of the Stable Manifolds 4.1. Main result. We now study the regularity of the Lipschitz manifolds in Theorem 2. The setup will be the same as that in Sects. 2 and 3, although we will require slightly more restrictive assumptions on the constants in (19). The assumptions of the type considered here in fact occur even in the case of uniform exponential dichotomies when one studies the regularity of stable and unstable invariant manifolds. Set ϑ = max{a, b}. We consider the conditions ¯ (2 − q)ϑ} qa + 4ϑ < min{a¯ − b,

and

a + ϑ < b,

(29)

which clearly imply the conditions in (25). The second inequality in (29) is slightly stronger than the second inequality in (25), although it can be considered essentially the same. On the other hand, the first inequality in (29) is of different type. It does imply the first inequality in (25) but it also requires a certain “spectral gap”. We note that the second inequality in (29) is always satisfied when ϑ is sufficiently small. For technical reasons we need to slightly reduce the size of the neighborhood R(δe−ωs ), with ω as in (24). Namely, we fix the new exponent ω + 3ϑ/q ≥ ω and for each > 0, we consider the subset V ⊂ W given by V = {(s, ξ, ϕ(s, ξ )) : (s, ξ ) ∈ Zω+3ϑ/q with s > } ⊂ R+ × Rn .

(30)

We also replace Conditions A1 and A2 in Sect. 3 by the new conditions: 1 B1. the function A : R+ 0 → Mn (R) is of class C and satisfies (8) and (12) for every t ≥ 0; n n 1 B2. the function f : R+ 0 × R → R is of class C and satisfies (9) and (10) for some c > 0 and q > 1.

The following is our main result. It establishes the C 1 regularity of the Lipschitz manifold V. Theorem 3. Assume that B1 and B2 hold. If the equation v = A(t)v admits a nonuniform exponential dichotomy and the conditions in (29) hold, then for each > 0 there exists δ > 0 such that the set V in (30) is a smooth manifold of class C 1 containing the line (, +∞) × {0} and satisfying T(s,0) V = R × E for every s > . The proof of Theorem 3 is given in Sect. 5. In fact, we will obtain Theorem 3 from a corresponding version under less restrictive assumptions (see Theorem 5 in Sect. 5.6).

Smoothness of Invariant Manifolds

649

We note that in general we are not able to take = 0 in (30). The explanation is given at the end of Sect. 5.6, when we can already refer to the appropriate places in the proof. On the other hand, if the functions A(t) and f (t, x) are defined for every t > −ε, for some fixed ε > 0, then we can replace in (30), as well as in the statement of Theorem 3, by any number in (−ε, 0). We observe that in view of B2, in Theorem 3 we are assuming that q > 1 and not only q > 0, as in Theorem 2. The hypothesis B2 is somewhat related to the essential and in a certain sense optimal C 1+α hypothesis in the theory of nonuniformly hyperbolic dynamics (see [1] for details). Rigorous statements are formulated in the following section.

4.2. Stable manifolds of nonuniformly hyperbolic trajectories. We now explain how Theorem 3 can be used to show the existence of smooth stable manifolds for solutions of a given differential equation (possibly nonautonomous) that exhibit nonuniformly hyperbolic behavior. n n Consider a continuous function F : R+ 0 × R → R and the equation v = F (t, v).

(31)

Let now v0 (t) be a solution of (31). We say that v0 (t) is nonuniformly hyperbolic if the matrix function A(t) =

∂F (t, v0 (t)) ∂v

admits a nonuniform exponential dichotomy. For simplicity we continue to assume that A(t) has the block form in (12) with respect to the invariant decomposition Rn = E ×F . We are interested in constructing stable manifolds for nonuniformly hyperbolic solutions of (31). We use the same notation as in Sect. 4.1, and we set γ = ω + 3ϑ/q = a(2 + 1/q) + b/q + 3 max{a, b}/q, with a and b as in (20). Theorem 4. Assume that F is of class C 1 and let v0 (t) be a nonuniformly hyperbolic solution of (31) such that: 1. the function t → A(t) is of class C 1 ; 2. there exist c > 0 and q > 1 such that for every t ≥ 0 and y ∈ Rn , ∂F ≤ Cyq . (t, y + v (t)) − A(t) 0 ∂v If the conditions (29) hold, then for each > 0 there exist δ > 0 and a unique function ϕ ∈ Xγ such that the set V = {(s, ξ, ϕ(s, ξ )) + (0, v0 (s)) : (s, ξ ) ∈ Zγ with s > } is a C 1 manifold with the following properties: 1. (s, v0 (s)) ∈ V and T(s,v0 (s)) V = R × E for every s > ;

650

L. Barreira, C. Valls

2. V is forward invariant under solutions of (31), i.e., if s > and (s, vs ) ∈ V, then (t, v(t)) ∈ V for every t ≥ s, where v(t) = v(t, vs ) is the unique solution of (31) for t ≥ s with v(s) = vs ; 3. there exists D > 0 such that for every s > , (s, vs ), (s, v s ) ∈ V, and t ≥ s we have v(t, vs ) − v(t, v s ) ≤ Dea(t−s)+as vs − v s . Proof. The proof follows closely arguments in [4]. We shall reduce the study of Eq. (31) to that of (11). For this we consider the change of variables (t, y) = (t, v − v0 (t)). Letting y(t) = v(t) − v0 (t), where v(t) is a solution of (31), we obtain y (t) = F (t, v(t)) − F (t, v0 (t)) = F (t, y(t) + v0 (t)) − F (t, v0 (t)) = A(t)y(t) + G(t, y(t)), where G(t, y) = F (t, y + v0 (t)) − F (t, v0 (t)) − A(t)y.

(32)

By hypothesis A(t) satisfies Assumption B1 in Sect. 4.1. Furthermore, it follows from (32) that G is of class C 1 and clearly G(t, 0) = 0 for every t ≥ 0. It remains to establish property (10). For this we note that ∂Gi · y − z, G(t, y) − G(t, z) ≤ sup (t, y + r(z − y)) ∂y r∈[0,1],i=1,... ,n where G = (G1 , . . . , Gn ). Since ∂G ∂F (t, y) = (t, y + v0 (t)) − A(t) ∂y ∂v for every t ≥ 0 and y ∈ Rn , we obtain G(t, y) − G(t, z) ≤ c sup y + r(z − y)q y − z r∈[0,1]

≤ c max{yq , zq }y − z ≤ c(yq + zq )y − z. Thus, the function G satisfies Assumption B2 in Sect. 4.1. We can now apply Theorems 2 and 3 to obtain the desired statement. For example, if F is of class C 2 and there exist c > 0 and q > 1 such that 2 ∂ F ≤ cyq (t, y + v (t)) 0 ∂v 2 for every t ≥ 0 and y ∈ Rn , then the hypotheses in Theorem 4 hold. We call the set V in Theorem 4 a local stable manifold or simply a stable manifold of the solution v0 (t) (of Eq. (31)). In [13] Pugh gave an explicit example of a C 1 diffeomorphism which is not C 1+α for any α ∈ (0, 1), for which the statement in the stable manifold theorem fails. Of course that this not mean that all C 1 diffeomorphisms and flows with nonuniformly hyperbolic trajectories which lack higher regularity have no stable or unstable invariant manifolds. We note that Theorem 4 provides in particular examples of C 1 vector fields which need not be C 1+α for any α ∈ (0, 1) but for which each nonuniformly hyperbolic trajectory possesses a stable manifold. To the best of our

Smoothness of Invariant Manifolds

651

knowledge no similar statement was obtained before in the nonuniformly hyperbolic setting. Explicit examples can be constructed in the following manner. We assume that F (t, 0) = 0 for every t ≥ 0, and we consider the constant solution v0 = 0. We want to exhibit a C 1 function F such that: older continuous; 1. H := ∂F ∂v is continuous but not H¨ 2. the function t → H (t, 0) is of class C 1 ; 3. H (t, y) − H (t, 0) ≤ cyq for every t ≥ 0 and y ∈ Rn , and some constants c > 0 and q > 1. For this we consider a continuous function ρ : R+ → [0, 1] and a sequence pn ∈ R+ decreasing to zero such that: 1. ρ is of class C 1 outside the points pn ; 2. ρ is H¨older continuous with H¨older exponent at most pn in some open neighborhood of pn for each n ∈ N. We now define a function H by H (t, y) = H (t, 0) + f (y)ρ(y) for each y = 0, where t → H (t, 0) = A(t) and f : Rn → Rn are any C 1 functions such that v = A(t)v has only global solutions and f (y) ≤ cyq for every y ∈ Rn . One can easily verify that y → H (t, y) is H¨older continuous with exponent at most pn outside the ball of radius pn centered at the origin. Thus, H is not H¨older continuous, although it is continuous at (t, 0) and thus continuous. Integrating H , while imposing the condition F (t, 0) = 0 for every t ≥ 0, we find a function F as desired: namely, F satisfies the hypotheses of Theorem 4, but since H is not H¨older continuous, F is not of class C 1+α for any α. 5. Proof of Theorem 3 We establish in this section the C 1 regularity of the Lipschitz manifold V in (30). In view of clarity we will separate the proof into several steps. We consider the conditions qa + qa + b + 2 max{a, b} < 0, b + (q − 1)a + a < 0,

a + b < b,

b + qa + b < 0.

(33) (34)

One can easily verify that the conditions in (29) imply those in (33)–(34). We will establish Theorem 3 under these slightly weaker assumptions. Note that with the exception of the second condition in (33) the other three are automatically satisfied when q is sufficiently large (also by (29)). Furthermore, instead of the exponent ω + 3ϑ/q in (30) we consider the new exponent α = ω + a/q + b/q + max{a, b}/q = a(2 + 2/q) + 2b/q + max{a, b}/q.

(35)

Note that α ≤ ω + 3ϑ/q and thus in general α is slightly better than ω + 3ϑ/q, in the sense that it may correspond to a slightly larger neighborhood of initial conditions. From now on we consider the set V ⊂ W defined as in (30) but with Zω+3ϑ/q replaced by Zα . Without loss of generality we will always consider the norm (v, w) = v + w for (v, w) ∈ E × F .

652

L. Barreira, C. Valls

5.1. Auxiliary estimates. Here we establish several estimates that are needed in the proof of Theorem 3. We will always assume that B1 and B2 hold. In particular, f is now of class C 1 and the conditions (9)–(10) (or equivalently (14)–(15)) hold for some c > 0 and q > 1. We will also assume that the conditions (33)–(34) hold. These are standing assumptions that will be used throughout Sects. 5.1–5.6. We first give a bound for the derivatives of the perturbation. Lemma 1. We have

∂g ∂g ∂h ∂h q max ∂x , ∂y , ∂x , ∂y ≤ 2c(x, y) .

Proof. We first consider the derivative ∂g/∂x. Since g is differentiable, it follows from (15) that for every v ∈ E, ∂g v = lim g(t, x + vh, y) − g(t, x, y) ∂x h→0 |h| x + vh − x((x + vh, y)q + (x, y)q ) ≤ c lim ≤ 2c(x, y)q v. h→0 |h| Therefore, ∂g/∂x ≤ 2c(x, y)q . Proceeding in a similar manner with the derivatives ∂g/∂y, ∂h/∂x, and ∂h/∂y we obtain the desired statement. We now obtain norm bounds for the derivatives of the solution with respect to the initial conditions (note that by Hypotheses B1 and B2, the solution of (17) is indeed of class C 1 in the initial conditions). Fix s ≥ 0 and t ≥ 0. For each ρ ≥ s + t, we set pρ = (ρ, t (s, ξ, ϕ(s, ξ ))), and

∂x ∂y Sρ = |pρ + |pρ , ∂ξ ∂ξ

qρ = (ρ, x(pρ ), y(pρ )),

(36)

∂x ∂y Tρ = |pρ + |pρ , ∂η ∂η

where x and y are the functions in (16). Since τ is a semiflow, we have the identities x(pρ ) = x(ρ, s, ξ, ϕ(s, ξ )) and y(pρ ) = y(ρ, s, ξ, ϕ(s, ξ )).

(37)

The following lemma gives several exponential bounds, in particular for Sρ and Tρ , which are essential in the proof of Theorem 3. As described above, for technical reasons we need to slightly reduce the size of the neighborhood R(δe−ωs ). Namely, we consider the new neighborhood R(δe−αs ) ⊂ R(δe−ωs ) with α as in (35). In view of the first inequality in (33), we have qa + a < 0

and qa + b < 0.

(38)

Let also d = 2a + 2b + aq + max{a, b}, and θ = 2c(1 + κ)q D q δ q

and θ = 2c(1 + κ)q D q δ q (D1 + D2 ).

(39)

Smoothness of Invariant Manifolds

653

Lemma 2. Given δ > 0 sufficiently small, for each (s, ξ ) ∈ Zα , t ≥ 0, and ρ ≥ s + t we have ∂g ∂x ∂g ∂y qa(ρ−s)−ds |q + Sρ , (40) | | | ∂x ρ ∂ξ pρ ∂y qρ ∂ξ pρ ≤ θ e ∂g ∂x ∂g ∂y qa(ρ−s)−ds |q + Tρ , (41) | | | ∂x ρ ∂η pρ ∂y qρ ∂η pρ ≤ θ e with identical inequalities with g replaced by h, and Sρ ≤ D1 e(b+θ)(ρ−s−t)+a(s+t) ,

Tρ ≤ D2 e(b+θ)(ρ−s−t)+b(s+t) .

(42)

Proof. Set x(r) = x(r, s, ξ, η) and y(r) = y(r, s, ξ, η) (see (16)). Taking derivatives with respect to ξ and η in (17), given ρ ≥ s + t we obtain ρ ∂x ∂g ∂x ∂g ∂y |p = U (ρ, s + t) + U (ρ, r) |p + |p dr, ∂ξ ρ ∂x ∂ξ r ∂y ∂ξ r s+t ρ ∂y ∂h ∂x ∂h ∂y |pρ = V (ρ, r) |pr + |pr dr, ∂ξ ∂x ∂ξ ∂y ∂ξ s+t (43) ρ ∂g ∂x ∂g ∂y ∂x U (ρ, r) |p = |p + |p dr, ∂η ρ ∂x ∂η r ∂y ∂η r s+t ρ ∂y ∂h ∂x ∂h ∂y |pρ = V (ρ, s + t) + V (ρ, r) |pr + |pr dr, ∂η ∂x ∂η ∂y ∂η s+t with the partial derivatives of g and h computed at qr (see (36)). Recall that y(pr ) = ϕ(r, x(pr )) for every r ≥ s + t (see (28)), and thus (x, y)(pr ) ≤ (1 + κ)x(pr ). By Lemma 1 we conclude that ∂g ∂x ∂g ∂y q q |q | | | + ∂x r ∂ξ pr ∂y qr ∂ξ pr ≤ 2c(1 + κ) x(pr ) Sr . Since ξ ∈ R(δe−αs ) ⊂ R(δe−ωs ), it follows from (37) and Theorem 2 (making ξ = 0 in (27)) that x(pr )q = x(r, s, ξ, ϕ(s, ξ ))q ≤ D q δ q eqa(r−s)−ds . Thus,

∂g ∂x ∂g ∂y qa(r−s)−ds |q Sr , ∂x r ∂ξ |pr + ∂y |qr ∂ξ |pr ≤ θ e

which is the inequality in (40). We can obtain (41) in a similar manner. Furthermore, we have identical inequalities to those in (40) and (41) with g replaced by h. If follows from the first identity in (43), (20), and (40) that ρ ∂x |p ≤ D1 ea(ρ−s−t)+a(s+t) + θ D1 ea(ρ−r)+ar eqa(r−s)−ds Sr dr. ∂ξ ρ s+t In a similar manner, using the second identity in (43) and again (40), now with g replaced by h, we obtain ρ ∂y |p ≤ θ D2 eb(ρ−r)+br eqa(r−s)−ds Sr dr. ∂ξ ρ s+t

654

L. Barreira, C. Valls

Therefore, Sρ ≤ D1 e +θ

a(ρ−s−t)+a(s+t)

+θ

ρ

D1 ea(ρ−r)+ar eqa(r−s)−ds Sr dr

s+t

ρ

D2 eb(ρ−r)+br eqa(r−s)−ds Sr dr.

s+t

Using (38) and the fact that a < b (see (19)), we conclude that Sρ ≤ D1 ea(ρ−s−t)+a(s+t) ρ +θ D1 ea(ρ−r) Sr dr + θ s+t

≤ D1 e

ρ

D2 eb(ρ−r) Sr dr

s+t

b(ρ−s−t)+a(s+t)

+ θe

b(ρ−s−t)

ρ

e−b(r−s−t) Sr dr.

(44)

s+t

We now write Sr = e−b(r−s−t) Sr for each r ≥ s + t. It follows from (44) that ρ Sr dr Sρ ≤ D1 ea(s+t) + θ s+t

for every ρ ≥ s + t. Therefore, using Gronwall’s lemma we conclude that Sρ ≤ D1 ea(s+t) eθ(ρ−s−t) for every ρ ≥ s + t. This establishes the third inequality in the lemma. In a similar manner, using the third and fourth identities in (43), together with (20) and (41) (also with g replaced by h), we obtain ρ b(ρ−s−t)+b(s+t) +θ D1 ea(ρ−r) ear eqa(r−s)−ds Tr dr T ρ ≤ D2 e s+t ρ +θ D2 eb(ρ−τ )+br eqa(r−s)−ds Tr dr. s+t

Using (38) and the fact that a < b, this yields T ρ ≤ D2 e

b(ρ−s−t)+b(s+t)

+ θe

b(ρ−s−t)

ρ

e−b(r−s−t) Tr dr.

s+t

Writing Tr = e−b(r−s−t) Tr for each r ≥ s + t, we conclude that ρ Tr dr Tρ ≤ D2 eb(s+t) + θ s+t

for every ρ ≥ s + t. Thus, it follows from Gronwall’s lemma that Tρ ≤ D2 eb(s+t) eθ(ρ−s−t) for every ρ ≥ s + t. This establishes the last inequality in (42).

Smoothness of Invariant Manifolds

655

We note that by the dependence of θ and θ on δ (see (39)), these two constants can be made arbitrarily small by making δ sufficiently small. In particular, the exponent b + θ in (42) can be made arbitrarily close to b. The following statement considers a function F which occurs in the construction of the invariant families of cones in Sect. 5.3 below. The value F (x) is essentially the size of the cone at time x. Lemma 3. Consider the function F : [0, +∞) → R defined by F (x) =

γ ea1 x + ν(1 − ea2 x ) , 1 − ν(1 − ea2 x )

where a1 , a2 < 0, a1 ≥ a2 , and γ , ν ∈ (0, 1). If ν<

a1 γ , a2 (1 + γ )

(45)

then F (x) < γ for every x > 0. Proof. Assume first that a1 = a2 = a. Then F (x) =

aeax (γ − ν(1 + γ )) , (1 − ν(1 − eax ))2

and it follows from (45) that F (x) < 0. Therefore, F (x) < F (0) = γ for every x > 0. Assume now that a1 > a2 . In this case, F (x) =

γ a1 ea1 x − νγ a1 ea1 x − νa2 ea2 x − νγ a2 e(a1 +a2 )x + νγ a1 e(a1 +a2 )x . (1 − ν(1 − ea2 x ))2

Since a1 > a2 and a2 < 0 we have −νa2 ea2 x < −νa2 ea1 x and thus, F (x) ≤

ea1 x (γ a1 (1 − ν) − νa2 + νγ (a1 − a2 )ea2 x ) . (1 − ν(1 − ea2 x ))2

Furthermore, again since a1 > a2 and x ≥ 0, we have νγ (a1 − a2 )ea2 x ≤ νγ (a1 − a2 ) and hence, F (x) ≤

ea1 x (γ a1 (1 − ν) − νa2 + νγ (a1 − a2 )) ea1 x (γ a1 − νa2 (1 + γ )) ≤ . a x 2 (1 − ν(1 − e 2 )) (1 − ν(1 − ea2 x ))2

It follows from (45) that F (x) < 0. Therefore, F (x) < F (0) = γ for every x > 0. This completes the proof of the lemma.

656

L. Barreira, C. Valls

5.2. Adapted norms. Due to the nonuniformity of the norm bounds for the operators U (t, s) and V (t, s) in the notion of nonuniform exponential dichotomy (see (20)), we introduce a new family of “adapted” norms. Namely, we fix > 0 and s ≥ , and given r ≥ s and (v, w) ∈ E × F we define the new norms +∞ vr = U (σ, r)vea (σ −r) dσ, r r wr = V (r, σ )−1 we−b (σ −r) dσ, (46) s−

where a = −a − ς > 0,

b = b + ς > 0

(47)

for some ς > 0 such that ς ≤a

and ς = b − b − b.

(48)

We also set (v, w)r = vr + wr for each (v, w) ∈ E × F . Our choice of norms is certainly not unique; in particular one can easily change them so that (v, w)r is obtained from an inner product for which E and F are orthogonal. Nevertheless, the resulting stable distribution, which is an essential element in our proof of the C 1 regularity of the manifold V, is independent of the choice of norms. We now consider the relation between the norms · and · . Lemma 4. For every s ≥ , r ≥ s, and (v, w) ∈ E × F we have D1 ar e v, ς

(49)

D2 br ς(r+−s) e (e − 1)w, ς

(50)

C1 e−ar v ≤ vr ≤

C2 e−br w ≤ wr ≤ where C1 =

1 − ea−a−ς−a D1 (a − a + ς + a)

and C2 =

e(b−b+ς+b) − 1 D2 (b − b + ς + b)

.

Proof. By the definition of the norm vr in (46)–(47) and (20), we have +∞ U (σ, r)vea (σ −r) dσ vr = r +∞ D1 ea(σ −r)+ar ve(−a−ς)(σ −r) dσ ≤ r +∞ D1 ar e−ς(σ −r) dσ = e v. = D1 ear v ς r

(51)

Smoothness of Invariant Manifolds

657

For the other inequality in (49) we write vr ≥

r+1 r

≥

r+1

U (σ, r)−1 −1 vea (σ −r) dσ

r r+1

r

=

U (σ, r)vea (σ −r) dσ ≥

D1−1 ea(σ −r)−aσ ve(−a−ς)(σ −r) dσ

D1−1 ve−(a−a−ς)r

r+1

e(a−a−ς−a)σ dσ = C1 e−ar v.

r

In a similar manner, using the definition of the norm wr in (46)–(47) and (20), we have r wr = V (r, σ )−1 we−b (σ −r) dσ s− r

≤

D2 e−b(r−σ )+br we(−b−ς)(σ −r) dσ

s−

r

= D2 ebr w

e−ς(σ −r) dσ ≤

s−

D2 br e w(eς(r+−s) − 1). ς

The remaining inequality follows from r wr ≥ V (r, σ )−1 we−b (σ −r) dσ s− r

≥

r−

= =

D2−1 e−b(r−σ )−bσ we(−b−ς)(σ −r) dσ

D2−1 we(b−b+ς)r

w D2 (b − b + ς + b)

r

e−(b−b+ς+b)σ dσ

r−

e−br (e(b−b+ς+b) − 1) = C2 e−br w.

This completes the proof of the lemma.

Note that in view of (48) the constant C2 is well-defined. The above lemma shows that the norms · and · are equivalent (for each fixed > 0, ς > 0, and s ≥ ), although their ratio may deteriorate with exponential speed along each orbit (see (49) and (50)). Nevertheless, this deterioration, essentially given by the exponents a and b, i.e., by the nonuniformity in the exponential dichotomy, is small when compared to the values of the Lyapunov exponents (in view of (33)–(34)).

5.3. Existence of an invariant family of cones. The next step in our proof of Theorem 3 is to establish the existence of an invariant family of cones along each orbit of the semiflow τ . This is our main element towards the construction of an invariant distribution that later will be shown to coincide with the tangent bundle of V. This procedure will also allow us to discuss the continuity of the distribution, and thus of the tangent spaces, in terms of the cones.

658

L. Barreira, C. Valls {s + τ } × E × F

s+τ

s E

Cs (τ )

F Fig. 2. The cone Cs (τ ) at the point τ (s, ξ, η)

Fix γ > 0. For each s ≥ 0 and τ ≥ 0, we consider the cone Cs (τ ) = {(v, w) ∈ E × F : ws+τ < γ vs+τ } ∪ {(0, 0)}. We emphasize that Cs (τ ) is defined in terms of the new norms · given by (46) and not in terms of the original norm in E × F . Given (s, ξ, η) ∈ [0, +∞) × E × F , we think of the cone Cs (τ ) = Cs,ξ,η (τ ) sitting at the point (s, ξ, η) as a subset of the plane {s + τ } × E × F or, alternatively, of the tangent space Tτ (s,ξ,η) ({s + τ } × E × F ). This is a cone around the space E at time s + τ . See Fig. 2. Note that by (34) and (39), we can choose δ > 0 sufficiently small so that c1 = b + θ + (q − 1)a + a < 0

and c2 = b + θ + qa + b < 0.

(52)

The following lemma shows that the above family of cones is indeed invariant under the differential of τ . More precisely, we consider the partial derivatives of the second and third components of r (see (18)) with respect to (ξ, η) at the point (s, ξ, η) ∈ R+ × E × F , and we denote it by ∂(s,ξ,η) τ . The number γ > 0 will remain (arbitrarily) fixed from now on. Lemma 5. Given δ > 0 sufficiently small, for each (s, ξ ) ∈ Zα with s ≥ , and τ > t ≥ 0 we have (∂τ (s,ξ,ϕ(s,ξ )) t−τ )Cs (τ ) ⊂ Cs (t).

(53)

Proof. Given (vs+τ , ws+τ ) ∈ Cs (s + τ ) and r ∈ [s, s + τ ], we define the vector (vr , wr ) = (∂τ (s,ξ,ϕ(s,ξ )) r−(s+τ ) )(vs+τ , ws+τ ) ∈ Tr (s,ξ,ϕ(s,ξ )) ({s + r} × E × F ).

(54)

Smoothness of Invariant Manifolds

659

Ws+τ

Cs (τ )

Ws+t

qs+τ

(vs+t , ws+t )

qs+t

E

(vs+τ , ws+τ )

Cs (t) F Fig. 3. Preimages of vectors inside the cones along a given orbit at the times s + t and s + τ

See Fig. 3 for an illustration. Let now τ > t ≥ 0. By (54), we have (vr , wr ) = (∂t (s,ξ,ϕ(s,ξ )) r−(s+t) )(vs+t , ws+t ) for each r ∈ [s + t, s + τ ]. In a somewhat more explicit form, we can write ∂x ∂x vr vs+t ∂ξ ∂η = ∂y ∂y , wr ws+t ∂ξ ∂η

with the partial derivatives of x and y in (16) computed at pr (see (36)). In particular, vr =

∂x ∂x |pr vs+t + |p ws+t , ∂ξ ∂η r

wr =

∂y ∂y |pr vs+t + |p ws+t . ∂ξ ∂η r

We introduce the notation G(r) =

∂g ∂g wr , vr + ∂y ∂x

H (r) =

∂h ∂h vr + wr , ∂x ∂y

with the partial derivatives of g and h computed at qr (see (36)). Then ∂g ∂y ∂g ∂x ∂g ∂y ∂g ∂x G(r) = |p + |p vs+t + |p + |p ws+t , ∂x ∂ξ r ∂y ∂ξ r ∂x ∂η r ∂y ∂η r and in view of (40) and (41) in Lemma 2, we have ∂g ∂x ∂g ∂x ∂g ∂y ∂g ∂y G(r) ≤ |p + |p + |p + |p ∂x ∂ξ r ∂y ∂ξ r ∂x ∂η r ∂y ∂η r ×(vs+t + ws+t ) ≤ θ eqa(r−s)−ds (Sr + Tr )(vs+t , ws+t ) ≤ θ e(b+θ+qa)(r−s−t) eqat m(s) max{eat , ebt }(vs+t , ws+t ),

(55)

660

L. Barreira, C. Valls

where

m(s) = e−aqs−max{a,b}s max e−(b+2a)s , e−(a+2b)s . Note that eas+bs+max{a,b}s m(s) ≤ 1.

(56)

By the first inequality in (33) we obtain G(r) ≤ θe(b+θ+qa)(r−s−t) e−(a+b)t−max{a,b}t m(s)(vs+t , ws+t ).

(57)

We have a similar inequality with G(r) replaced by H (r), namely H (r) ≤ θ e(b+θ+qa)(r−s−t) e−(a+b)t−max{a,b}t m(s)(vs+t , ws+t ).

(58)

It follows from (43) and (55) with ρ = s + τ that vs+τ = U (s + τ, s + t)vs+t +

s+τ

U (s + τ, r)G(r) dr,

s+t

ws+t = V (s + τ, s + t)−1 ws+τ −

s+τ

V (r, s + t)−1 H (r) dr.

s+t

Therefore, by (20) and since a + a = −ς (see (47)), we have

+∞

U (σ, s + τ )vs+τ ea (σ −s−τ ) dσ s+τ +∞ s+τ a (σ −s−τ ) U (σ, s + t)vs+t + e = U (σ, r)G(r) dr dσ

vs+τ s+τ =

s+τ +∞

s+t

≤

U (σ, s + t)vs+t ea (σ −s−τ ) dσ s+t +∞ s+τ +e−a (s+τ ) U (σ, r) · G(r) dr ea σ dσ s+τ

s+t

≤ e−a (τ −t) vs+t s+t s+τ −a (s+τ ) G(r)e−ar+ar +e D1 s+t

= e−a (τ −t) vs+t s+t +

D1 a(s+τ ) e ς

+∞

e

(a +a)σ

dσ

s+τ s+τ s+t

G(r)e−ar+ar dr.

dr

Smoothness of Invariant Manifolds

661

Set now C3 = max{C1−1 , C2−1 }. By (57) and Lemma 4 (see (49)) we obtain

θ D1 vs+τ s+τ ≤ e(a+ς)(τ −t) vs+t s+t + C3 emax{a,b}s (vs+t , ws+t )s+t ς s+τ ×e−(a+b)t m(s) ea(s+τ −r) e(b+θ+qa)(r−s−t) ear dr s+t

θ D1 vs+t s+t + C3 emax{a,b}s (vs+t , ws+t )s+t ς s+τ ×eas e−bt m(s) ea(s+τ −r) e(b+θ+qa+a)(r−s−t) dr

≤e

(a+ς)(τ −t)

s+t

θ D1 C3 emax{a,b}s (vs+t , ws+t )s+t ≤ e(a+ς)(τ −t) vs+t s+t + ς s+τ ×eas e−bt m(s)ea(τ −t) e[b+θ+(q−1)a+a](r−s−t) dr. s+t

Using the constant c1 < 0 in (52), we conclude from (56) that

vs+τ s+τ ≤ e(a+ς)(τ −t) vs+t s+t θ D1 + C3 e−bt (1 − ec1 (τ −t) )ea(τ −t) (vs+t , ws+t )s+t . ς|c1 |

(59)

In a similar manner, using (20) we obtain

ws+t s+t

s+t

=

V (s + t, σ )−1 ws+t e−b (σ −s−t) dσ

s− s+t

s+τ −b (σ −s−t) −1 V (s + τ, σ )−1 ws+τ − e V (r, σ ) H (r) dr dσ s− s+t s+τ −b (τ −t) ≤e V (s + τ, σ )−1 ws+τ e−b (σ −s−τ ) dσ

=

+e

b (s+t)

s− s+t

s−

≤

s+τ

V (r, σ )

−1

· H (r) dr e−b σ dσ

s+t

−b (τ −t)

ws+τ s+τ e s+τ b (s+t) +e D2 H (r)e−br+br s+t

s+t

e s−

≤ e−b (τ −t) ws+τ s+τ + eb (s+t) e−ςs eς

D2 ς

(−b +b)σ

s+τ

s+t

dσ

dr

H (r)e−br+br dr,

662

L. Barreira, C. Valls

where in the last inequality we have used the identity b = b + ς (see (47)). It follows from (58) and Lemma 4 that ws+t s+t ≤ e−(b+ς)(τ −t) ws+τ s+τ θ D2 C3 emax{a,b}s (vs+t , ws+t )s+t +e(b+ς)(s+t) e−ςs eς ς s+τ −(a+b)t ×e m(s) e−br+br e(b+θ+qa)(r−s−t) dr s+t

ws+τ s+τ θ D2 C3 (vs+t , ws+t )s+t +e(b+ς)(s+t) e−ςs eς ς s+τ −b(s+t) bs −at max{a,b}s ×e e e e m(s) e(b+θ+qa+b)(r−s−t) dr

≤e

−(b+ς)(τ −t)

s+t

θD2 C3 (vs+t , ws+t )s+t ≤ e−(b+ς)(τ −t) ws+τ s+τ + e(ς−a)t eς ς s+τ bs+max{a,b}s ×e m(s) e(b+θ+qa+b)(r−s−t) dr. s+t

Using the constant c2 < 0 in (52), we conclude from (56) that ws+t s+t ≤ e−(b+ς)(τ −t) ws+τ s+τ θ D2 + C3 eς e(ς−a)t (1 − ec2 (τ −t) )(vs+t , ws+t )s+t . ς|c2 |

(60)

Since (vs+τ , ws+τ ) ∈ Cs (τ ) we have ws+τ s+τ ≤ γ vs+τ s+τ . It follows from (59) and (60) that ws+t s+t ≤ e−(b+ς)(τ −t) γ e(a+ς)(τ −t) vs+t s+t

θ D1 C3 e−bt (1 − ec1 (τ −t) )ea(τ −t) (vs+t , ws+t )s+t + ς|c1 | θ D2 + C3 eς e(ς−a)t (1 − ec2 (τ −t) )(vs+t , ws+t )s+t ς|c2 | ≤ e(a−b)(τ −t) γ vs+t s+t θ D1 C3 (1 − ec1 (τ −t) )(vs+t , ws+t )s+t +γ ς|c1 | θ D2 + C3 eς e(ς−a)t (1 − ec2 (τ −t) )(vs+t , ws+t )s+t . ς|c2 | Set now ν = (γ + 1)C3

θ ς

D1 D2 ς . + e |c1 | |c2 |

(61)

In view of (47)–(48) we have ς − a ≤ 0. Thus, setting c = min{c1 , c2 } < 0 we obtain ws+t s+t ≤ e(a−b)(τ −t) γ vs+t s+t + ν(1 − ec(τ −t) )(vs+t s+t + ws+t s+t ).

Smoothness of Invariant Manifolds

663

Therefore, [1 − ν(1 − ec(τ −t) )] · ws+t s+t ≤ [e(a−b)(τ −t) γ + ν(1 − ec(τ −t) )] · vs+t s+t . We now consider two cases: 1. if a − b ≤ c, then ws+t s+t ≤ F (τ − t)vs+t s+t , where F (τ − t) =

γ ec(τ −t) + ν(1 − ec(τ −t) ) ; 1 − ν(1 − ec(τ −t) )

2. if a − b > c, then ws+t s+t ≤ F (τ − t)vs+t s+t , where F (τ − t) =

γ e(a−b)(τ −t) + ν(1 − ec(τ −t) ) . 1 − ν(1 − ec(τ −t) )

When δ is sufficiently small (and thus when θ is also sufficiently small, in view of (39)), it follows from (61) that ν can be made arbitrarily small. By Lemma 3 we conclude that F (τ − t) < γ for every τ > t. This completes the proof of the lemma. 5.4. Construction and continuity of the stable spaces. We continue to fix γ > 0. Given (s, ξ ) ∈ Zα , we set (∂τ (s,ξ,ϕ(s,ξ )) −τ )Cs (τ ). (62) E(s, ξ, ϕ(s, ξ )) = τ ≥0

It is shown below (see Lemma 8) that E(s, ξ, ϕ(s, ξ )) is a vector space (which is independent of γ ), with the same dimension as E in the decomposition E × F . We will call it the stable space at the point (s, ξ, ϕ(s, ξ )). We first establish some auxiliary results concerning the speed at which the norms of vectors inside and outside the cones vary along a given orbit. We start with vectors inside the cones. Lemma 6. Given δ > 0 sufficiently small, for each (s, ξ ) ∈ Zα with s ≥ , τ ≥ 0, and (v, w) ∈ Cs (τ ) we have (∂τ (s,ξ,ϕ(s,ξ )) −τ )(v, w)s ≥

1 e−(a+ς)τ (v, w)s+τ . 2(1 + γ )

(63)

Proof. We use the same notation as in the proof of Lemma 5. Let τ > t = 0 and set (vs+τ , ws+τ ) = (v, w). We consider the vector (vs , ws ) given by (54) with r = s. By Lemma 5 (see (53)), we have (vs , ws ) ∈ Cs (0), and thus ws s ≤ γ vs s . It follows from (59) that θ D1 C3 (1 − ec1 τ )(1 + γ )eaτ vs s vs+τ s+τ ≤ e(a+ς)τ vs s + ς|c1 | θ D1 ≤ e(a+ς)τ 1 + C3 (1 + γ ) vs s . ς|c1 |

664

L. Barreira, C. Valls

In view of (39), for each δ sufficiently small we have θ D1 C3 (1 + γ ) ≤ 1, ς|c1 |

(64)

and thus vs s ≥

1 −(a+ς)τ vs+τ s+τ . e 2

(65)

Since (v, w) ∈ Cs (τ ), we have (vs+τ , ws+τ )s+τ ≤ (1 + γ )vs+τ s+τ . It follows from (65) that (vs , ws )s ≥ vs s ≥ ≥

1 −(a+ς)τ vs+τ s+τ e 2

1 e−(a+ς)τ (vs+τ , ws+τ )s+τ . 2(1 + γ )

This completes the proof of the lemma.

We now establish an analogous result to that in Lemma 6 for vectors outside the cones. Lemma 7. Given δ > 0 sufficiently small, for each (s, ξ ) ∈ Zα with s ≥ , τ ≥ 0, and z ∈ F we have (∂(s,ξ,ϕ(s,ξ )) τ )(0, z)s+τ ≥

1 (b+ς)τ (0, z)s . e 2

(66)

Proof. Let (z1 , z2 ) = (∂(s,ξ,ϕ(s,ξ )) τ )(0, z) =

∂x ∂ξ ∂y ∂ξ

∂x ∂η ∂y ∂η

0 , z

with the partial derivatives computed at the point ps+τ (see (36) and (37)). Since (z1 , z2 )s+τ ≥ z2 s+τ , it is sufficient to find a lower bound for z2 s+τ . We use the notation F (r) =

∂h ∂y ∂h ∂x |q |p z + |q |p z, ∂x r ∂η r ∂y r ∂η r

with pr and qr as in (36). It follows from (43) with t = 0 and ρ = s + τ that s+τ ∂h ∂x ∂h ∂y V (s + τ, s)z = z2 − |qr |pr z + |qr |pr z dr. V (r, s + τ )−1 ∂x ∂η ∂y ∂η s Using (60) with t = 0 and since c2 < 0, we obtain zs ≤ e−(b+ς)τ z2 s+τ +

θ D2 C3 eς zs . ς|c2 |

In view of (39), for each δ sufficiently small we have θ D2 C3 eς ≤ 1/2, ς|c2 |

(67)

Smoothness of Invariant Manifolds

665

and thus zs ≤ 2e−(b+ς)τ z2 s+τ . Therefore, since zs = (0, z)s , we conclude that (z1 , z2 )s+τ ≥ z2 s+τ ≥ This completes the proof of the lemma.

1 (b+ς)τ (0, z)s . e 2

We recall that −(a + ς) > 0 and b + ς > 0 (see (47)). Thus, the inequalities (63) and (66) say respectively that vectors inside the cones expand as time goes to the past, and that vectors in the F direction (which are outside the cones) expand as time goes to the future. With the help of Lemmas 6 and 7 we can now establish that the set E(s, ξ, ϕ(s, ξ )) defined by (62) is a vector space varying continuously with the pair (s, ξ ). Lemma 8. Given δ > 0 sufficiently small, the following properties hold: 1. for every (s, ξ ) ∈ Zα with s ≥ , the set E(s, ξ, ϕ(s, ξ )) is a subspace independent of γ with dim E(s, ξ, ϕ(s, ξ )) = dim E; 2. the map Zα ∩ ([, +∞) × E) (s, ξ ) → E(s, ξ, ϕ(s, ξ )) is continuous. Proof. Set D(τ ) = (∂τ (s,ξ,ϕ(s,ξ )) −τ )Cs (τ ). It follows readily from Lemma 5, applying ∂t (s,ξ,ϕ(s,ξ )) −t to both sides of (53), that for every τ > t ≥ 0, D(τ ) \ {(0, 0)} ⊂ int D(t) ⊂ int D(0) = Cs (0) \ {(0, 0)}.

(68)

Therefore, (D(τ ))τ ≥0 is a strictly decreasing family of closed sets inside the cone Cs (0), and thus, E(s, ξ, ϕ(s, ξ )) is a nonempty closed subset of Cs (0). Furthermore, for each k ∈ N the set D(k) contains a subspace Ek of dimension dim E. Therefore, by the compactness of the unit ball in Rn , there exists a subspace E ⊂ E(s, ξ, ϕ(s, ξ )) of dimension dim E (consider the subspaces Ek and an orthonormal basis for each of them; the compactness allows us to find a subsequence for which each of the components of the orthonormal basis converges). Given v ∈ E(s, ξ, ϕ(s, ξ )), we write v = v1 + v2 with v1 ∈ E and v2 ∈ F . We note that E is inside the cone Cs (0) while F is outside this cone, and thus, we can always write v in this form. Since v, v1 ∈ E(s, ξ, ϕ(s, ξ )) it follows from the definition of E(s, ξ, ϕ(s, ξ )) in (62) that (∂(s,ξ,ϕ(s,ξ )) τ )v, (∂(s,ξ,ϕ(s,ξ )) τ )v1 ∈ Cs (τ ). By Lemmas 6 and 7 we obtain v2 s ≤ 2e(−b−ς)τ (∂(s,ξ,ϕ(s,ξ )) τ )v2 s+τ = 2e(−b−ς)τ (∂(s,ξ,ϕ(s,ξ )) τ )(v − v1 )s+τ ≤ 4(1 + γ )e(a−b)τ (vs + v1 s ). By (19), letting τ →+∞ we obtain v2 =0 and thus v∈E . Therefore, E =E(s, ξ, ϕ(s, ξ )) and E(s, ξ, ϕ(s, ξ )) is a subspace of dimension dim E. Furthermore, if we replace γ by γ = γ in (62), proceeding as before we obtain again a subspace Eγ = E (γ ) of dimension dim E, which is contained in E(s, ξ, ϕ(s, ξ )) if γ < γ and which contains

666

L. Barreira, C. Valls

E(s, ξ, ϕ(s, ξ )) if γ > γ . But since Eγ and E(s, ξ, ϕ(s, ξ )) have the same dimension, we must have Eγ = E(s, ξ, ϕ(s, ξ )). Therefore, E(s, ξ, ϕ(s, ξ )) is independent of γ . It remains to establish the continuity in the last property. Recall that v ∈ E(s, ξ, ϕ(s, ξ )) if and only if (see (62)) (∂(s,ξ,ϕ(s,ξ )) τ )v ∈ Cs (τ ) for every τ ≥ 0. Consider a sequence (sk , ξk )k ∈ Zα ∩ ([, +∞) × E) converging to a point (s, ξ ) in the same set as k → +∞, and a sequence vk ∈ E(sk , ξk , ϕ(sk , ξk )) such that vk = 1 for each k ∈ N. Then (∂(sk ,ξk ,ϕ(sk ,ξk )) τ )vk ∈ Csk (τ ) for every k ∈ N and τ ≥ 0

(69)

(we stress that the cone in (69) is computed with respect to the norms ·sk +τ in (46)). We first assume that (vk )k converges and let v ∈ E × F be the limit of the sequence. It follows from the C 1 regularity of τ (which is an immediate consequence of the C 1 regularity in t and v of the vector field of the equation in (6)) and the Lipschitz property of ϕ in (22) that (s, ξ ) → ∂(s,ξ,ϕ(s,ξ )) τ is continuous, and hence, (∂(sk ,ξk ,ϕ(sk ,ξk )) τ )vk → (∂(s,ξ,ϕ(s,ξ )) τ )v

as k → +∞

for every τ ≥ 0.

Furthermore, since the norms in (46) are independent of ξ and vary continuously with s, we conclude from (69) that (∂(s,ξ,ϕ(s,ξ )) τ )v ∈ Cs (τ ) for every τ ≥ 0. Therefore, v ∈ E(s, ξ, ϕ(s, ξ )) (see (62)). When (vk )k does not converge, let (mk )k be some subsequence for which (vmk )k converges, say to a vector v ∈ E × F (recall that vk = 1 for each k, and thus there are always sublimits). Proceeding in a similar manner, it follows from (69) that (∂(s,ξ,ϕ(s,ξ )) τ )v ∈ Cs (τ ) for every τ ≥ 0. Therefore, v ∈ E(s, ξ, ϕ(s, ξ )), i.e., any sublimit of the sequence (vk )k is in E(s, ξ, ϕ(s, ξ )). It follows from the property dim E(sk , ξk , ϕ(sk , ξk )) = dim E(s, ξ, ϕ(s, ξ )) = dim E

for each k ∈ N,

that any sublimit of a sequence of orthonormal bases (with respect to the original norm ·) of the vector spaces E(sk , ξk , ϕ(sk , ξk )) (obtained by considering any subsequence (mk )k such that every component of the orthonormal bases converges) is also an orthonormal basis of E(s, ξ, ϕ(s, ξ )). Therefore, E(sk , ξk , ϕ(sk , ξk )) → E(s, ξ, ϕ(s, ξ )) and the map ξ → E(s, ξ, ϕ(s, ξ )) is continuous.

as k → ∞,

It follows readily from (62) and the inclusions in (68) that for every increasing sequence τk → +∞ as k → +∞ we have E(s, ξ, ϕ(s, ξ )) = (∂τk (s,ξ,ϕ(s,ξ )) −τk )Cs (τk ). k∈N

Smoothness of Invariant Manifolds

667

5.5. Behavior of the tangent sets. We now introduce sets that at each point of V contain all possible tangential behavior (with respect to V). Given s ≥ 0 and ξ , ξ ∈ R(δe−αs ) with ξ = ξ , we set ξ,ξ ϕ =

(ξ, ϕ(s, ξ )) − (ξ , ϕ(s, ξ )) (ξ, ϕ(s, ξ )) − (ξ , ϕ(s, ξ ))

,

and t(s,ξ ) ϕ = {v ∈ E × F : ξ,ξm ϕ → v for some sequence ξm → ξ }. We define the tangent set of the graph of ϕ at (s, ξ, ϕ(s, ξ )) (when restricted to {s} × E × F ) by V (s, ξ, ϕ(s, ξ )) = {λv : v ∈ t(s,ξ ) ϕ and λ ∈ R}. One can easily verify that the function ϕ is differentiable at (s, ξ ) when restricted to {s} × E × F if and only if V (s, ξ, ϕ(s, ξ )) is a subspace of dimension dim E. This is precisely the basis of our approach to establish the smoothness of V. In order to effect this approach we first establish a relation between the tangent sets and the invariant family of cones constructed in the former section. For each r ≥ s, we write x(r) = x(r, s, ξ, ϕ(s, ξ )) and x(r) = x(r, s, ξ , ϕ(s, ξ )).

(70)

ζ = 2c(1 + κ)q+1 D q δ q D1 .

(71)

We also set

We start with some auxiliary results. Lemma 9. Given δ ∈ (0, 1) sufficiently small, for each s ≥ 0, ξ , ξ ∈ R(δe−αs ), and τ > t ≥ 0 we have x(s + τ ) − x(s + τ ) ≤ D1 e(a+ζ )(τ −t)+a(s+t) x(s + t) − x(s + t).

(72)

Proof. For each r ∈ [s + t, s + τ ], it follows from (22) that (x(r), ϕ(r, x(r))) = (x(r), ϕ(r, x(r)) − ϕ(r, 0)) ≤ (1 + κ)x(r), (x(r), ϕ(r, x(r))) = (x(r), ϕ(r, x(r)) − ϕ(r, 0)) ≤ (1 + κ)x(r), and (x(r), ϕ(r, x(r))) − (x(r), ϕ(r, x(r))) ≤ (1 + κ)x(r) − x(r). By (15), we obtain g(r, x(r), ϕ(r, x(r))) − g(r, x(r), ϕ(r, x(r))) ≤ c(1 + κ)q+1 x(r) − x(r)(x(r)q + x(r)q ).

(73)

Using Theorem 2 (see (27)), this yields g(r, x(r), ϕ(r, x(r))) − g(r, x(r), ϕ(r, x(r))) ≤ 2c(1 + κ)q+1 x(r) − x(r)D q eqa(r−s)+aqs (ξ q + ξ q ) ≤ ηeqa(r−s)−(2a+2b+aq+max{a,b})s x(r) − x(r),

(74)

668

L. Barreira, C. Valls

where η = 2c(1 + κ)q+1 D q δ q < 2c(1 + κ)q+1 D q ,

(75)

since δ < 1. Note that the last constant in (75) is independent of δ. Therefore, setting ρ(r) = x(r) − x(r) and using (20), it follows from the identities

s+τ

x(s + τ ) = U (s + τ, s + t)x(s + t) + x(s + τ ) = U (s + τ, s + t)x(s + t) +

s+t s+τ

U (s + τ, r)g(r, x(r), ϕ(r, x(r))) dr, U (s + τ, r)g(r, x(r), ϕ(r, x(r))) dr,

s+t

that ρ(s + τ ) ≤ U (s + τ, s + t)ρ(s + t) s+τ + U (s + τ, r)ηeqa(r−s)−(2a+2b+aq+max{a,b})s ρ(r) dr s+t

≤ D1 ea(τ −t)+a(s+t) ρ(s + t) + D1 ηe−(a+2b+aq+max{a,b})s s+τ × ea(τ −t) eT1 (r−s) e−a(r−t−s) ρ(r) dr,

(76)

s+t

with T1 = qa + a. By (33), we have T1 < 0. Setting (σ ) = e−a(σ −t−s) ρ(σ ), it follows from (76) that for every τ > t, (s + τ ) ≤ D1 e

a(s+t)

ρ(s + t) + ζ

s+τ

(r) dr. s+t

Using Gronwall’s lemma for the function τ → (s + τ ), we obtain ρ(s + τ ) ≤ D1 ea(s+t)+ζ (τ −t) ρ(s + t)ea(τ −t) for every τ > t, which is the same as (72). This completes the proof.

Note now that by (19) and (33), we have −b + a + qa + b < 0

and

a − b + b < 0,

and hence, in view of (71), we can choose δ > 0 sufficiently small so that T2 = −b + a + ζ + qa + b < 0

and

a + ζ − b + b < 0.

(77)

We will use the notations χ (r) = x(r) − x(r)

and

ρ(r) = ϕ(r, x(r)) − ϕ(r, x(r)),

with x(r) and x(r) as in (70). The following is another auxiliary result.

(78)

Smoothness of Invariant Manifolds

669

Lemma 10. Given δ ∈ (0, 1) sufficiently small, for each s ≥ 0, ξ , ξ ∈ R(δe−αs ), and τ > t ≥ 0 we have ρ(s + t) ≤ D2 e−(b−b)(τ −t) eb(s+t) ρ(s + τ ) D1 D2 η (qa+a+b)t−(a+b+aq+max{a,b})s + χ (s + t). e |T2 | Proof. It follows from (17) and (28) that ϕ(s + t, x(s + t)) = V (s + τ, s + t)−1 ϕ(s + τ, x(s + τ )) s+τ V (r, s + t)−1 h(r, x(r), ϕ(r, x(r))) dr, −

(79)

s+t

ϕ(s + t, x(s + t)) = V (s + τ, s + t)−1 ϕ(s + τ, x(s + τ )) s+τ V (r, s + t)−1 h(r, x(r), ϕ(r, x(r))) dr. −

(80)

s+t

Proceeding in a similar manner to that in (73) and (74), with g replaced by h, we obtain h(r, x(r), ϕ(r, x(r))) − h(r, x(r), ϕ(r, x(r))) ≤ c(1 + κ)q+1 x(r) − x(r)(x(r)q + x(r)q ) ≤ ηeqa(r−s)−(2a+2b+aq+max{a,b})s χ (r), with η as in (75). It follows from Lemma 9, setting s + τ = r in (72), that for every r ≥ s + t we have h(r, x(r), ϕ(r, x(r))) − h(r, x(r), ϕ(r, x(r))) ≤ D1 ηeqa(r−s)−(2a+2b+aq+max{a,b})s e(a+ζ )(r−s−t)+a(s+t) χ (s + t). Subtracting (79) and (80), and using (20) we obtain ρ(s + t) ≤ V (s + τ, s + t)−1 ρ(s + τ ) + D1 ηeat χ (s + t) s+τ V (r, s + t)−1 eqa(r−s)−(a+2b+aq+max{a,b})s e(a+ζ )(r−t−s) dr × s+t −b(τ −t)+b(s+τ )

≤ D2 e

ρ(s + τ ) + D1 D2 ηe(qa+a+b)t s+τ −(a+b+aq+max{a,b})s χ (s + t) eT2 (r−s−t) dr ×e s+t −b(τ −t)+b(s+τ )

≤ D2 e ρ(s + τ ) D1 D2 η (qa+a+b)t−(a+b+aq+max{a,b})s e + χ (s + t), |T2 | with T2 as in (77). This completes the proof of the lemma.

We can now establish a relation between the tangent sets and the invariant family of cones along each orbit. Lemma 11. Given δ ∈ (0, 1) sufficiently small, for each (s, ξ ) ∈ Zα with s ≥ , and t ≥ 0 we have V (t (s, ξ, ϕ(s, ξ ))) ⊂ Cs (t).

670

L. Barreira, C. Valls

Proof. We proceed by contradiction. Namely, assume that for a fixed γ > 0 there exists t ≥ s such that V (t (s, ξ, ϕ(s, ξ )))\Cs (t) = ∅.

(81)

Then, there exists ξ ∈ R(δe−αs ) arbitrarily close to ξ for which ϕ(s + t, x(s + t)) − ϕ(s + t, x(s + t))s+t > γ x(s + t) − x(s + t)s+t , (82) where x and x are the functions in (70). Using the same notation as in (78), it follows from (82) and Lemma 4 that χ (s + t) < γ −1 C1 ea(s+t) ϕ(s + t, x(s + t)) − ϕ(s + t, x(s + t))s+t D2 C1 (a+b)(s+t)+ς(+t) ≤ ρ(s + t). e γς By Lemma 10, we obtain D22 C1 −(b−b)(τ −t) (a+2b)(s+t)+ς(+t) e ρ(s + τ ) e γς D1 D22 C1 η (qa+a+b)t−(a+b+aq+max{a,b})s (a+b)(s+t) + e χ (s + t) e γ ς|T2 | D 2 C1 ≤ 2 e−(b−b)(τ −t) e(a+2b)(s+t)+ς(+t) ρ(s + τ ) γς D1 D22 C1 η (qa+2a+2b)t −aqs−max{a,b}s + e χ (s + t). (83) e γ ς|T2 |

χ (s + t) ≤

By (33), we have qa + 2a + 2b < 0. In view of (75), we can choose δ > 0 sufficiently small so that D1 D22 C1 η (qa+2a+2b)t −aqs−max{a,b}s D1 D22 C1 η 1 e ≤ e ≤ . γ ς |T2 | γ ς |T2 | 2 Hence, it follows from (83) that χ (s + t) ≤ 2

D22 C1 −(b−b)(τ −t) (a+2b)(s+t)+ς(+t) e e ρ(s + τ ). γς

By (72) in Lemma 9, we conclude that χ (s + τ ) ≤

2D1 D22 C1 (a+ζ −b+b)(τ −t) (2a+2b)(s+t)+ς(+t) e e ρ(s + τ ). γς

Recall that s and t are fixed (see (81)). Therefore, by (77) (see also (78)), there exists τ > t such that 1 x(s + τ ) − x(s + τ ) < ϕ(s + τ, x(s + τ )) − ϕ(s + τ, x(s + τ )). κ But this contradicts the fact that the points (x(s + τ ), ϕ(s + τ, x(s + τ ))) and (x(s + τ ), ϕ(s + τ, x(s + τ ))) belong to the stable manifold, since ϕ possesses the Lipschitz property in (22). This completes the proof of the lemma.

Smoothness of Invariant Manifolds

671

5.6. Proof of Theorem 3. We have now all the tools that are needed to prove that the Lipschitz manifold W or more precisely that its subset V ⊂ W (see (30)) is in fact a smooth manifold of class C 1 . The following is a slightly more general version of Theorem 3. Theorem 5. Assume that B1 and B2 hold. If the equation v = A(t)v admits a nonuniform exponential dichotomy and the conditions in (33)–(34) hold, then for each > 0 there exists δ > 0 such that the set V = {(s, ξ, ϕ(s, ξ )) : (s, ξ ) ∈ Zα with s > },

(84)

with α as in (35), is a smooth manifold of class C 1 containing the line (, +∞) × {0} and satisfying T(s,0) V = R×E for every s > . In addition, we have V (s, ξ, ϕ(s, ξ )) = E(s, ξ, ϕ(s, ξ )) for each (s, ξ ) ∈ Zα with s > . Proof. We note that ξ,ξm ϕ → v as m → ∞ (with ξm → ξ as m → ∞) if and only if for every τ ≥ 0, lim

m→∞

(∂(s,ξ,ϕ(s,ξ )) τ )v τ (s, ξm , ϕ(s, ξm )) − τ (s, ξ, ϕ(s, ξ )) = . (∂(s,ξ,ϕ(s,ξ )) τ )v τ (s, ξm , ϕ(s, ξm )) − τ (s, ξ, ϕ(s, ξ ))

This implies that (∂(s,ξ,ϕ(s,ξ )) τ )V (s, ξ, ϕ(s, ξ )) = V (τ (s, ξ, ϕ(s, ξ ))).

(85)

Let now (s, ξ ) ∈ Zα with s > . By Lemma 11, we have V (τ (s, ξ, ϕ(s, ξ ))) ⊂ Cs (τ ) for every τ ≥ 0. Therefore, in view of (85), V (s, ξ, ϕ(s, ξ )) ⊂ (∂τ (s,ξ,ϕ(s,ξ )) −τ )Cs (τ ) for every τ ≥ 0, and hence, by (62), V (s, ξ, ϕ(s, ξ )) ⊂ E(s, ξ, ϕ(s, ξ )). On the other hand, for each v ∈ E \ {0} there exists a sequence tm → 0 such that ξ,ξ +tm v ϕ converges as m → +∞ (due to the compactness of the unit ball in Rn ). This implies that the first dim E components of V (s, ξ, ϕ(s, ξ )) project onto E. On the other hand, by Lemma 8, the space E(s, ξ, ϕ(s, ξ )) has dimension dim E and hence V (s, ξ, ϕ(s, ξ )) = E(s, ξ, ϕ(s, ξ )).

(86)

In particular, V (s, ξ, ϕ(s, ξ )) is a subspace of dimension dim E. Therefore (see the discussion in the beginning of Sect. 5.5), the function ϕ is differentiable at each point (s, ξ, ϕ(s, ξ )) when restricted to {s}×E ×F . Furthermore, it follows from the continuity of the map (s, ξ ) → E(s, ξ, ϕ(s, ξ )) given by Lemma 8 and the identity (86) that ϕ is of class C 1 on each plane {s} × E × F (since the tangent set varies continuously). This shows that the set V ∩ ({s} × Rn ) is a C 1 manifold for each s > , of dimension dim E. We now consider some ε = ε(s) > 0 such that s − ε > , and we define a map Fs : (−ε, ε) × R(δe−αs ) → R+ × Rn by Fs (t, ξ ) = t (s, ξ, ϕ(s, ξ )).

672

L. Barreira, C. Valls

We showed above that ξ → ϕ(s, ξ ) is of class C 1 (for each fixed s). Furthermore, it follows from B1 and B2 that the map (t, s, ξ, η) → t (s, ξ, η) is of class C 1 . Therefore, Fs is also of class C 1 (for each fixed s). In addition, one can verify that the map Fs is injective: if Fs (t, ξ ) = Fs (t , ξ ) then the first component of Fs gives s + t = s + t and hence t = t ; therefore, t (s, ξ, ϕ(s, ξ )) = t (s, ξ , ϕ(s, ξ )) and applying −t to both sides of the identity yields ξ = ξ . This shows that Fs is a parametrization of class C 1 on the rectangle (−ε, ε) × R(δe−αs ) of an open subset of V containing the graph V ∩ ({s} × Rn ). Since this procedure can be effected for every s, we conclude that V is a smooth manifold of class C 1 of dimension dim E + 1. For the remaining properties, note that by Theorem 2 (see (26)) we have +∞ ϕ(s, ξ ) = − V (τ, s)−1 h(τ −s (s, ξ, ϕ(s, ξ ))) dτ. s

Taking derivatives with respect to ξ , we obtain +∞ ∂ϕ ∂x ∂h ∂y −1 ∂h V (τ, s) (s, 0) = − (τ, 0) + (τ, 0) dτ, ∂ξ ∂x ∂ξ ∂y ∂ξ s with

∂x ∂ξ

and

∂y ∂ξ

computed at (τ, s, 0) ∈ R+ × R+ × Rn . By Lemma 1 we have ∂h ∂h (τ, 0) = (τ, 0) = 0, ∂x ∂y

and hence,

∂ϕ ∂ξ (s, 0)

= 0. This implies that

(T(s,0) V ) ∩ ({s} × Rn ) = {s} × E

for each s > .

(87)

Furthermore, since ϕ(s, 0) = 0 for every s > , we have (, +∞) × {0} ⊂ V and thus, R × {0} ⊂ T(s,0) V for every s > . Together with the identities in (87) and the fact that dim V = dim E + 1, we conclude that T(s,0) V = R × E. This completes the proof of the theorem. It follows immediately from (85) and (86) in the proof of the theorem that given (s, ξ ) ∈ Zα with s > we have (∂(s,ξ,ϕ(s,ξ )) τ )E(s, ξ, ϕ(s, ξ )) = E(τ (s, ξ, ϕ(s, ξ )))

(88)

for every τ ≥ 0. However, a priori (without considering the tangent sets), the identity in (88) must be considered nontrivial, due to the fact that the cones which are used to define the space E(τ (s, ξ, ϕ(s, ξ ))) in (88) are obtained from the norms · in (46) with s replaced by s + τ . In particular, it follows easily from the definition of the stable spaces in (62) that (∂(s,ξ,ϕ(s,ξ )) τ )E(s, ξ, ϕ(s, ξ )) ⊂ E(τ (s, ξ, ϕ(s, ξ ))) and a priori this inclusion could be proper. We note that when we consider cones instead of tangent spaces, the corresponding inclusion is indeed proper for the cones at the points (s, ξ, ϕ(s, ξ )) and τ (s, ξ, ϕ(s, ξ )), i.e., (∂(s,ξ,ϕ(s,ξ )) τ )Cs (0) ⊂ Cs+τ (τ ) and this inclusion is proper (since ws+τ < ws whenever w = 0).

Smoothness of Invariant Manifolds

673

We now explain the reason why Theorem 5 (and also Theorem 3) require the extra parameter which is absent in Theorem 2 concerning the existence of a Lipschitz manifold. This has to do with the expressions in (61), (64), and (67) which involve the product θC3 = θ max{C1−1 , C2−1 }, with θ as in (39), and with C1 and C2 = C2 () as in (51). Indeed, it follows from (51) that as → 0 the constant C2 approaches infinity, and thus θ and consequently δ = δ() must approach zero. On the other hand, we can fix an arbitrarily small positive , and choose a corresponding δ in Theorems 3 and 5. 6. Unstable Manifolds We now briefly consider the case of unstable manifolds. The theory is analogous to that for stable manifolds, and the proofs can be readily obtained by reversing time in the former notions and arguments. As such, we formulate the corresponding results without proof. − Consider a C 1 function A : R− 0 → Mn (R), with R0 = (−∞, 0]. As in the case of positive time, we assume that there exists a decomposition Rn = E × F (independent of t), with respect to which A(t) has the block form in (12) for every t ≤ 0. We also assume that the solution of the initial value problem in (7) with s ≤ 0 and vs = (ξ, η) ∈ E × F is global in the past. Thus, it can be written in the form v(t) = (U (t, s)ξ, V (t, s)η)

for t ≤ s,

where U (t, s) and V (t, s) are the evolution operators associated respectively with the blocks B(t) and C(t) (see (12)). In an analogous manner to that for positive time, we say that the equation v = A(t)v admits a nonuniform exponential dichotomy or that the evolution operators U (t, s) and V (t, s) define a nonuniform exponential dichotomy if there exist constants b ≤ b < 0 ≤ a ≤ a, a, b ≥ 0, and D1 , D2 ≥ 1 such that for every t ≤ s ≤ 0, U (t, s) ≤ D1 ea|t−s|+a|s| ,

U (t, s)−1 ≤ D1 e−a|t−s|+a|t| ,

V (t, s) ≤ D2 eb|t−s|+b|s| ,

V (t, s)−1 ≤ D2 e−b|t−s|+b|t| .

n n We also consider a C 1 function f : R− 0 × R → R such that f (t, 0) = 0 for every t ≤ 0, and we assume that there exist c > 0 and q > 1 such that (10) holds for every t ≤ 0 and u, v ∈ Rn . We continue to write f = (g, h) with values in E ×F . We consider the semiflow τ (now with τ ≤ 0) generated by Eq. (11) or equivalently by the system in (17) for ρ ≤ s. Again we look for unstable manifolds as graphs of Lipschitz functions. We first define the new constants

β = b(1 + 1/q) + a/q

and α = 2β + ϑ/q,

where ϑ = max{a, b}. Given δ > 0 and κ > 0, we consider a space Xuβ of continuous functions obtained as in Sect. 3 by replacing positive time by negative time: namely, consider the set

Zβu = (s, ξ ) : s ≤ 0 and ξ ∈ R(δe−β|s| ) ⊂ R− (89) 0 × F, and let Xuβ be the space of continuous functions ψ : Zβu → E such that for each s ≤ 0, ψ(s, 0) = 0 and ψ(s, x) − ψ(s, y) ≤ κx − y

for every x, y ∈ R(δe−β|s| ).

We also consider the set Zαu obtained as in (89) with β replaced by α.

674

L. Barreira, C. Valls

We now formulate our result on the existence of smooth unstable manifolds. All the assumptions formulated in this section are standing assumptions. Theorem 6. If the equation v = A(t)v admits a nonuniform exponential dichotomy, and the conditions qb + 4ϑ < min{b¯ − a, ¯ (2 − q)ϑ} and b + ϑ < a

(90)

hold, then for each < 0 there exist δ > 0 and a unique function ψ ∈ Xuβ such that the set Vu = {(s, ψ(s, ξ ), ξ ) : (s, ξ ) ∈ Zβu and s < } ⊂ (−∞, ) × Rn has the following properties: 1. Vu is a smooth manifold of class C 1 containing the line (−∞, )×{0} and satisfying T(s,0) Vu = R × F for every s < ; 2. Vu is invariant under the semiflow τ , i.e., if (s, ξ ) ∈ Zαu then τ (s, ψ(s, ξ ), ξ ) ∈ Vu for every τ ≤ 0; 3. for every (s, ξ ) ∈ Zαu we have s ψ(s, ξ ) = U (s, τ )g(τ −s (s, ψ(s, ξ ), ξ )) dτ ; −∞

4. there exists D > 0 such that for every s ≤ 0, ξ , ξ ∈ R(δe−α|s| ), and τ ≤ 0 we have τ (s, ψ(s, ξ ), ξ ) − τ (s, ψ(s, ξ ), ξ ) ≤ Deb|τ |+b|s| ξ − ξ . We call Vu a local unstable manifold or simply an unstable manifold of the origin. The existence of a Lipschitz unstable manifold was discussed in [4]. In a similar manner to that in the beginning of Sect. 5 we can replace the conditions in (90) by the slightly less restrictive assumptions qb + a + qb + 2 max{a, b} < 0, a + (q − 1)b + b < 0,

b + a < b,

a + qb + a < 0.

We can also establish analogous results to those in Sect. 4.2, concerning the existence of unstable manifolds for nonuniformly hyperbolic trajectories corresponding to positive Lyapunov exponents. This is a simple application of Theorem 6 together with similar arguments to those in Sect. 4.2. 7. Differential Equations with Nonzero Lyapunov Exponents We now explain how the results in the former sections can be applied to nonautonomous linear differential equations with nonzero Lyapunov exponents. Without loss of generality we only consider negative Lyapunov exponents since the case of positive Lyapunov exponents is entirely analogous. We assume that the functions A : R+ 0 → Mn (R) and n → Rn are of class C 1 and satisfy the conditions B1 and B2 in Sect. 4. f : R+ × R 0

Smoothness of Invariant Manifolds

675

7.1. Lyapunov exponents and Lyapunov coefficient. We define the Lyapunov exponent χ : Rn → R ∪ {−∞} for the equation in (7) by χ (v0 ) = lim sup t→+∞

1 logv(t), t

where v(t) is the solution of (7) with s = 0. It follows from the abstract theory of Lyapunov exponents (see [1] for a detailed description) that the function χ takes at most r ≤ n distinct values on Rn \ {0}, say −∞ ≤ χ1 < · · · < χk < 0 ≤ χk+1 < · · · < χr ,

(91)

for some r ≤ n and 0 ≤ k ≤ n. Note that k ≥ 1 if and only if there is at least one negative Lyapunov exponent. We assume from now on that there is at least one negative Lyapunov exponent. We emphasize that χk+1 may be zero. Moreover, for each i = 1, . . . , r the set Ei = {v0 ∈ Rn : χ (v0 ) ≤ χi }

(92)

is a linear space. We always assume that there exists a subspace F ⊂ Rn such that E = Ek (see (91)–(92)) together with F give a decomposition Rn = E × F , with respect to which A(t) has the block form in (12). We note that (for any norm in Rn ) we have χ (v0 ) ≤ χk < 0

for every v0 ∈ E,

and χ (v0 ) ≥ χk+1 ≥ 0 for every v0 ∈ Rn \ E (and thus for every v0 ∈ F \ {0}). We now introduce the classical notion of regularity coefficient. For this we need to consider the initial value problem w = −A(t)∗ w,

w(0) = w0 ,

(93)

with w0 ∈ Rn , where A(t)∗ denotes the transpose of A(t). We also consider the associated Lyapunov exponent χ : Rn → R ∪ {−∞} defined by χ (w0 ) = lim sup t→+∞

1 logw(t), t

where w(t) is the solution of (93). We define the regularity coefficient of χ and χ by (wi ) : 1 ≤ i ≤ n}, γ (χ , χ ) = min max{χ (vi ) + χ where the minimum is taken over all bases v1 , . . . , vn and w1 , . . . , wn of Rn such that vi , wj = δij for each i and j (here δij is the Kronecker symbol). We also consider the Lyapunov exponents associated with the blocks B(t) and C(t) in (12), i.e., with the pair x = B(t)x and x = −B(t)∗ x, as well as with the pair y = C(t)y and y = −C(t)∗ y. The corresponding regularity coefficients are respectively |E) γU = γ (χ |E, χ

and

γV = γ (χ |F, χ |F ).

Our study is based on the following statement, which shows that the evolution operators U (t, s) and V (t, s) in (13) always define a nonuniform exponential dichotomy.

676

L. Barreira, C. Valls

Proposition 7 ([3]). Assume that the matrix A(t) has the block form in (12) for every t ≥ 0, and that the equation v = A(t)v has at least one negative Lyapunov exponent. Then for each ε > 0 the evolution operators U (t, s) and V (t, s) define a nonuniform exponential dichotomy with a = χ1 + ε, b = χk+1 + ε,

a = χk + ε, b = χr + ε,

a = γU + 2ε, b = γV + 2ε.

(94) (95)

7.2. Construction of stable manifolds. We now present our results on the existence of smooth stable manifolds. We continue to assume that the matrix function A and the perturbation f satisfy the conditions B1 and B2 in Sect. 4. We also consider the conditions qχk + 4ϑ < χk − χr

and χk + ϑ < χk+1 ,

(96)

where ϑ = max{γU , γV }. In view of (96) we can choose ε > 0 such that qa + 4 max{a, b} − a + b < 0

and a − b + max{a, b} < 0,

(97)

where the constants in (97) take the values given by (94)–(95). We also consider the constants β and α given respectively by (21) and (35) again with the values of a and b in (94)–(95). The following is an immediate consequence of Theorems 2 and 5. Theorem 8. Assume that B1 and B2 hold. If the conditions in (96) hold, then for each ε > 0 satisfying (97) and > 0, there exist δ > 0 and a unique function ϕ ∈ Xβ such that the set V in (84) satisfies the following properties: 1. V is a smooth manifold of class C 1 containing the line (, +∞) × {0} and satisfying T(s,0) V = R × E for every s > ; 2. V is invariant under the semiflow τ , i.e., if (s, ξ ) ∈ Zα then τ (s, ξ, ψ(s, ξ )) ∈ V for every τ ≥ 0; 3. the identity (26) holds for every (s, ξ ) ∈ Zα ; 4. there exists D > 0 such that for every s ≥ 0, ξ , ξ ∈ R(δe−αs ), and τ ≥ 0 we have the inequality in (27). In a similar manner to that in Sect. 4.2 we can apply Theorem 8 (and the corresponding version for the case of unstable manifolds) to obtain smooth stable and unstable manifolds of nonuniformly hyperbolic trajectories corresponding respectively to the negative and to the positive Lyapunov exponents for the linear variational equation. References 1. Barreira, L., Pesin, Ya.: Lyapunov Exponents and Smooth Ergodic Theory. University Lecture Series 23. Providence, RI: Am. Math. Soc., 2002 2. Barreira, L., Pesin,Ya.: Smooth ergodic theory and nonuniformly hyperbolic dynamics. In: Handbook of Dynamical Systems 1B, B. Hasselblatt, A. Katok (eds.), Elsevier, to appear 3. Barreira, L., Valls, C.: Nonuniform exponential dichotomies and Lyapunov regularity. Preprint 4. Barreira, L., Valls, C.: Stable manifolds for nonautonomous equations without exponential dichotomy. J. Differ. Eqs. To appear 5. Chicone, C., Latushkin, Yu.: Evolution Semigroups in Dynamical Systems and Differential Equations. Mathematical Surveys and Monographs 70, Providence, RI: Am. Math. Soc., 1999

Smoothness of Invariant Manifolds

677

6. Coppel, W.: Dichotomies in Stability. Theory. Lect. Notes in Math. 629, Berlin-Heidelberg-New York: Springer, 1978 7. Hale, J.: Asymptotic Behavior of Dissipative Systems. Mathematical Surveys and Monographs 25, Providence, RI: Am. Math. Soc., 1988 8. Hale, J., Lunel, S.: Introduction to Functional Differential Equations. Applied Mathematical Sciences 99, Berlin-Heidelberg-New York: Springer, 1993 9. Henry, D.: Geometric Theory of Semilinear Parabolic Equations. Lect. Notes in Math. 840, BerlinHeidelberg-New York: Springer, 1981 10. Katok, A., Hasselblatt, B.: Introduction to the Modern Theory of Dynamical Systems, with a supplement byA. Katok and L. Mendoza. Encyclopedia of Mathematics and itsApplications 54, Cambridge: Cambridge University Press, 1995 11. Ma˜ne´ , R.: Lyapunov exponents and stable manifolds for compact transformations. In: Geometric Dynamics (Rio de Janeiro, 1981), ed. J. Palis, Lect. Notes in Math. 1007, Berlin-HeidelbergNew-York: Springer, 1983, pp. 522–577 12. Pesin,Ya.: Families of invariant manifolds corresponding to nonzero characteristic exponents. Math. USSR-Izv. 10, 1261–1305 (1976) ´ 13. Pugh, C.: The C 1+α hypothesis in Pesin theory. Inst. Hautes Etudes Sci. Publ. Math. 59, 143–161 (1984) 14. Ruelle, D.: Characteristic exponents and invariant manifolds in Hilbert space. Ann. Math. (2) 115, 243–290 (1982) 15. Sell, G., You, Y.: Dynamics of Evolutionary Equations. Applied Mathematical Sciences 143, BerlinHeidelberg-New York: Springer, 2002 16. Thieullen, P.: Fibr´es dynamiques asymptotiquement compacts. Exposants de Lyapunov. Entropie. Dimension. Ann. Inst. H. Poincar´e. Anal. Non Lin´eaire 4, 49–97 (1987) Communicated by J.L. Lebowitz

Commun. Math. Phys. 259, 679–709 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1331-8

Communications in

Mathematical Physics

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space Joshua T. Horwood1 , Raymond G. McLenaghan2 , Roman G. Smirnov3 1

Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom CB3 0WA, UK. E-mail: [email protected] 2 Department of Applied Mathematics, University of Waterloo, Waterloo, Ontario, Canada N2L 3G1. E-mail: [email protected] 3 Department of Mathematics and Statistics, Dalhousie University, Halifax, Nova Scotia, Canada B3H 3J5. E-mail: [email protected] Received: 15 November 2004 / Accepted: 21 January 2005 Published online: 8 July 2005 – © Springer-Verlag 2005

Abstract: The problem of the invariant classification of the orthogonal coordinate webs defined in Euclidean space is solved within the framework of Felix Klein’s Erlangen Program. The results are applied to the problem of integrability of the Calogero-Moser model. Electronic Supplementary Material: Supplementary material is available in the online version of this article at http://dx.doi.org/10.1007/s00220-005-1331-8. 1. Introduction In his famous Erlangen Program [11], Felix Klein introduced a unified point of view according to which many different branches of geometry could be integrated into a single system. As is well known, this standpoint stipulates that the main goal of any branch of geometry can be formulated as follows: “Given a manifold and a group of transformations of the manifold, to study the manifold configurations with respect to those features that are not altered by the transformations of the group.” ([13], p 67) The term “manifold of n dimensions” in this setting describes a set of n variables that independently take on the real values from −∞ to ∞ ([12], p 116). Motivated by this idea, one can assert that Euclidean geometry of E3 (Euclidean space) can be completely characterized by the invariants of the Euclidean group of transformations. As is well-known, this Lie group of (orientation-preserving) isometries, denoted here by I (E3 ), is a semi-direct product of the corresponding groups of rotations and translations. An important aspect of Euclidean geometry is the theory of orthogonal coordinate webs that originated in works of a number of eminent mathematicians of the past including St¨ackel [27], Bˆocher [4], Darboux [5] and Eisenhart [8] within the framework of the theory of separation of variables. Its modern developments can be found in the review by

680

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

Benenti [2] and the relevant references therein. In particular, it has been shown that there exist exactly eleven orthogonal coordinate webs which afford separation of variables for the Schr¨odinger and Hamilton-Jacobi equations defined in E3 . These coordinate webs are confocal quadrics determined by the Killing tensors of valence two having orthogonally integrable (normal) eigenvectors and distinct eigenvalues. Eisenhart’s results in E3 were extended by Olevsky [21] to three-dimensional spaces of non-zero constant curvature, while Kalnins, Miller and others generalized them to spaces of higher dimensions (see Kalnins [10] and the references therein). This work is a natural continuation of the project initiated in [15] (see also [18] and [26]) where isometry group invariants and covariants of valence-two Killing tensors are derived and used to classify orthogonal coordinate webs of the Euclidean and Minkowski planes. Accordingly, we approach the problem of classification of the eleven orthogonal webs in E3 from the viewpoint of the invariant theory of the isometry group I (E3 ). Recall that the standard approach to the study of Killing tensors defined in pseudo-Riemannian manifolds of constant curvature rests on the fact that they can be expressed in this case as sums of symmetrized tensor products of Killing vectors. In contrast to the conventional view, we consider the Killing tensors of valence two defined in E3 to be algebraic objects or elements of the corresponding vector space K2 (E3 ) and define the action of I (E3 ) in this vector space to derive I (E3 )-invariants of the valence-two Killing tensors. In line with the postulates of the Erlangen Program, we completely solve the problem of classification of the eleven orthogonal webs in E3 by employing the I (E3 )-invariants of the vector space K2 (E3 ) and its subspaces. Our solution is based on the result of Theorem 5.1 which describes the space of all isometry group invariants of the vector space of Killing tensors of valence two defined in E3 combined with a careful study of the corresponding vector space of Killing vectors (the Lie algebra of the isometry group of E3 ). It must be emphasized that the problem of the invariant classification of the orthogonal coordinate webs in E3 is significantly more complicated than the corresponding problems in two-dimensional pseudo-Riemannian spaces of constant curvature. Apart from the obvious difficulties in dealing with a vector space of a much higher dimension, one has to solve the problem of the normality of eigenvectors of valence-two Killing tensors. More specifically, the eleven orthogonal coordinate webs in E3 are generated by Killing tensors of valence two with normal eigenvectors. On the other hand, unlike the situation in two-dimensional spaces, in E3 not every Killing tensor of valence two with distinct eigenvalues has normal eigenvectors. Moreover, the normality condition is equivalent to a system of non-linear partial differential equations (PDEs), which makes it nearly impossible to verify directly. The problem of finding necessary and sufficient intrinsic conditions for the eigenvectors of a tensor field of valence two with pointwise distinct eigenvalues to be orthogonally integrable has a long history. It can be traced back to Schouten [24], where such conditions depending on the eigenvectors were derived. Tonolo [30] subsequently determined a set of eigenvector-independent necessary and sufficient conditions for the Ricci tensor defined in a three-dimensional space to have normal eigenvectors. This criterion was shown by Schouten to be applicable to arbitrary (Ricci or not) valence-two tensor fields defined in an arbitrary pseudo-Riemannian manifold. Later, Nijenhuis [20] derived an equivalent formulation of the criterion introduced originally by Tonolo in terms of the components of the Nijenhuis tensor of the tensor field in question. In view of their respective contributions, we refer to these remarkable formulae throughout this paper as the Tonolo-Schouten-Nijenhuis (TSN) conditions and

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

681

employ them to verify the normality of the eigenvectors of valence-two Killing tensors defined in E3 . In addition, we determine in each case the coordinate transformation from the given Cartesian coordinates to the corresponding coordinate system determined by the orthogonal web. As an illustration of the power of the new theory, we use it to obtain a concise solution to the problem of integrability of the Calogero-Moser Hamiltonian system defined in E3 via orthogonal separation of variables in the associated Hamilton-Jacobi equation. The paper is organized as follows. In Sect. 2, we give an overview of the invariant theory for vector spaces of Killing tensors defined on pseudo-Riemannian manifolds of constant curvature. This theory is then specialized in Sect. 3 to vector spaces of valencetwo Killing tensors in Euclidean space. In Sect. 4, we discuss Hamilton-Jacobi theory in the context of separation of variables and use it to derive canonical forms for the orthogonal coordinate webs of E3 . The fundamental invariants are derived in Sect. 5 and are used in Sect. 6 to classify the coordinate webs. Methods for transforming a given Killing tensor to canonical form are treated in Sect. 7. In Sect. 8, we summarize the steps in our algorithm and apply it in Sect. 9 to determine separable coordinates for the Calogero-Moser system in E3 . Finally, we draw conclusions in Sect. 10 and indicate future research directions. The reader will no doubt realize that our classification of the coordinate webs and our algorithm for determining separable coordinates for natural Hamiltonians in E3 is highly computational. Nevertheless, all computations are purely algebraic in nature, and thus are straightforward to implement in a computer algebra system. To complement the paper, we have written a Maple package, called the KillingTensor package, which performs all the steps in our algorithm. The package is available with the electronic version of the paper. 2. Invariant Theory of Killing Tensors In the past decade, the classical invariant theory of homogeneous polynomials has become an active field of research once again (see Olver [22] and the references therein). The theory emerged in the nineteenth century as the intrinsic study of vector spaces of homogeneous polynomials under the action of the general linear group. Two of the authors (RGM, RGS) and Dennis The have incorporated the basic ideas of classical invariant theory into the study of Killing tensors defined in pseudo-Riemannian spaces of constant curvature under the action of the isometry group [14–18]. This synergy of the two theories grew out of the observation that Killing tensors of the same valence defined in a space of constant curvature constitute a vector space or, more precisely, a representation space of the isometry group of the underlying space. Putting this observation in proper perspective allows one to extend the basic ideas of classical invariant theory to the study of Killing tensors. Indeed, let (M, g) be an n-dimensional pseudo-Riemannian manifold of constant curvature with metric tensor g. Definition 2.1. A Killing tensor K of valence p defined in (M, g) is a symmetric (p, 0) tensor satisfying the Killing tensor equation [K, g] = 0,

(2.1)

where [, ] denotes the Schouten bracket [25]. When p = 1, K is said to be a Killing vector (infinitesimal isometry) and Eq. (2.1) reads LK g = 0, where L denotes the Lie derivative operator.

(2.2)

682

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

The Schouten bracket is a real bilinear operator, which property together with (2.1) implies that the set Kp (M) of all Killing tensors of valence p defined in (M, g) is in fact a vector space. Its dimension d is determined by the Delong-Takeuchi-Thompson (DTT) formula [7, 28, 29] 1 n+p n+p−1 d = dim Kp (M) = , p 1. (2.3) n p+1 p Therefore the general element of Kp (M) is represented by d arbitrary parameters , a d , with respect to an appropriate basis. Alternatively, this fact can be verified by solving the corresponding Killing tensor equation (2.1) with respect to a fixed system of coordinates, in which case the parameters a 1 , . . . , a d appear as constants of integration in the general form of elements of Kp (M). Each element h of the isometry group I (M) induces, by the push forward map, a non-singular linear transformation ρ(h) of Kp (M). By Theorem 3.5 of McLenaghan et al [19], the map a1, . . .

ρ : I (M) → GL(Kp (M))

(2.4)

defines a representation of I (M). Indeed, ρ is a group isomorphism. Once the form of the general element K of Kp (M) is available with respect to some convenient system of coordinates on M, the explicit form of the transformation ρ(h)K (written more succinctly as h · K) may be written explicitly in terms of the parameters a 1 , . . . , a d . We shall be particularly concerned with the smooth real-valued functions on Kp (M) that are invariant under the group I (M). The precise definition of such I (M)-invariant functions of Kp (M) is as follows. Definition 2.2. Let (M, g) be a pseudo-Riemannian manifold of constant curvature. Let p 1 be fixed. A smooth function F : Kp (M) → R is said to be an I (M)-invariant of Kp (M) iff it satisfies the condition F (h · K) = F (K)

(2.5)

for K ∈ Kp (M) and for all h ∈ I (M). The main problem of any invariant theory is to describe the whole space of invariants of a vector space under the action of a group. To achieve this one has to determine a set of fundamental invariants with the property that any other invariant is an analytic function of the fundamental invariants (see [22] for more details). The fundamental theorem of invariants of a regular Lie group action [22] determines the number of fundamental invariants needed to define the whole of the space of I (M)-invariants. Theorem 2.1. Let G be a Lie group acting regularly on an n-dimensional manifold M with s-dimensional orbits. Then, in a neighbourhood N of each point p ∈ M, there exist n − s functionally independent G-invariants 1 , . . . , n−s . Any other G-invariant I defined near p can be locally uniquely expressed as an analytic function of the fundamental invariants through I = F (1 , . . . , n−s ). In order to determine the form of the invariants of Kp (M), we use the fact that the invariance of a function under an entire Lie group is equivalent to the invariance of the function under the infinitesimal transformations of the group given by the corresponding Lie algebra. The precise result is given in the following proposition [22].

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

683

Proposition 2.1. Let G be a connected Lie group of transformations acting regularly on a manifold M. A smooth real-valued function F : M → R is a G-invariant iff v(F ) = 0

(2.6)

for all p ∈ M and for every infinitesimal generator v of G. In our application, G is the representation ρ(I (M)) defined by (2.4) and the condition (2.6) is equivalent to U i (F ) = 0,

i = 1, . . . , r,

(2.7)

where the U i are vector fields which form a basis of the Lie algebra of the representation and r = dim I (M) = 21 n(n + 1). By Theorem 3.5 of [19], this Lie algebra is isomorphic to the Lie algebra of I (M). Such a basis may be computed directly as the basis of the tangent space to ρ(I (M)) at the identity if an explicit form of the representation is available. According to Theorem 2.1 of the present paper, the general solution of the system of first-order PDEs (2.7) is an analytic function F of a set of fundamental I (M)-invariants. The number of fundamental invariants is d − s, where d is given by (2.3) and s is the dimension of the orbits of ρ(I (M)) acting regularly in the space Kp (M). To determine s and the subspaces of Kp (M) where the isometry group I (M) acts with orbits of the same dimension, one can use the result of the following proposition [22]. Proposition 2.2. Let a Lie group G act on M and let p ∈ M. The vector space S|p = span{U i |p | U i ∈ g} spanned by all vector fields determined by the infinitesimal generators at p coincides with the tangent space to the orbit Op of G that passes through p, i.e. S|p = Tp (Op ). In particular, the dimension of Op equals the dimension of S|p . We are now prepared to apply the theory presented thus far to the vector space K2 (E3 ). 3. Invariant Theory of Killing Tensors of Valence two in Euclidean Space We now specialize the general theory of the previous section to the vector space K2 (E3 ) of valence-two Killing tensors in Euclidean space E3 . Recall the following well-known result in [23] from invariant theory. Theorem 3.1. The orbits of a compact linear group acting in a real vector space are separated by the fundamental (polynomial) invariants. We first note that in our case the group is non-compact and so in order to distinguish between the orbits of I (E3 ) acting in the vector space K2 (E3 ) we need to employ a more elaborate analysis than a mere computation of a set of fundamental invariants. It is well-known that in E3 , as in all manifolds of constant curvature, any Killing tensor is expressible as a sum of symmetrized products of Killing vectors. The six Killing vectors in E3 may be written in Cartesian coordinates x i viz Xi =

∂ , ∂x i

R i = k j i x j Xk ,

(3.1)

684

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

for i = 1, 2, 3, where ij k is the Levi-Civita permutation tensor1 . We also note the commutation relations [Xi , Xj ] = 0,

[Xi , R j ] = k ij X k ,

[R i , R j ] = k ij R k .

(3.2)

Thus the general Killing tensor in K2 (E3 ) may be expressed as K = Aij X i X j + 2B ij Xi R j + C ij R i R j ,

(3.3)

where the coefficients Aij , B ij and C ij are constant and satisfy the symmetry properties Aij = A(ij ) ,

C ij = C (ij ) .

(3.4)

It follows from (3.1) and (3.3) that the components of the general Killing tensor in K2 (E3 ) with respect to the natural basis are given by K ij = Aij + 2 (i k B j )k x + i mk j n C k x m x n .

(3.5)

For future reference, we give explicitly the six independent components of K ij . Noting the symmetries (3.4), it proves convenient to set (following [3])  a1 α3 α2 Aij = α3 a2 α1  , α 2 α1 a 3 

 b11 b12 b13 = b21 b22 b23  , b31 b32 b33 

B ij

 c1 γ 3 γ 2 =  γ3 c 2 γ1  γ2 γ1 c 3 

C ij

(3.6)

and x i = (x, y, z). From (3.5) we obtain K 11 K 22 K 33 K 23 K 31 K 12

= a1 − 2b12 z + 2b13 y + c2 z2 + c3 y 2 − 2γ1 yz, = a2 − 2b23 x + 2b21 z + c3 x 2 + c1 z2 − 2γ2 zx, = a3 − 2b31 y + 2b32 x + c1 y 2 + c2 x 2 − 2γ3 xy, = α1 + b31 z − b21 y + (b22 − b33 )x + (γ3 z + γ2 y − γ1 x)x − c1 yz, = α2 + b12 x − b32 z + (b33 − b11 )y + (γ1 x + γ3 z − γ2 y)y − c2 zx, = 7α3 + b23 y − b13 x + (b11 − b22 )z + (γ2 y + γ1 x − γ3 z)z − c3 xy. (3.7)

According to the DTT formula (2.3), the dimension of K2 (E3 ) is twenty which appears to disagree with (3.6) which lists twenty-one parameters. To reconcile this, we observe from (3.7) that only the differences of the diagonal coefficients b11 , b22 and b33 are involved. Defining β1 = b22 − b33 ,

β2 = b33 − b11 ,

β3 = b11 − b22 ,

(3.8)

yields the constraint β1 + β2 + β3 = 0, thereby showing that there are twenty independent parameters. In many of the computations which follow, it turns out to be more convenient to use the three bii parameters instead of two of the three βi . With this in 1 We are using the summation convention throughout and lowering and raising indices with the Euclidean metric gij = diag(1, 1, 1) and its inverse g ij .

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

685

mind, we (commit an abuse of notation and) let K2 (E3 ) be the space spanned by the twenty-one parameters a1 , a2 , a3 , α1 , α2 , α3 , b11 , b22 , b33 , b23 , b31 , b12 , b32 , b13 , b21 , c1 , c2 , c3 , γ1 , γ2 , γ3 . (3.9) We shall also refer to (3.9) as the Killing tensor parameters. We now consider the transformation rules for the Killing tensor parameters. The transformation from one set of Cartesian coordinates x i to another set x˜ i is given by x i = λj i x˜ j + δ i ,

(3.10)

where λj i ∈ SO(3) and δ i ∈ R3 . It is straightforward to show that the Killing vectors (3.1) transform according to ˜ j, X i = λj i X

˜ j + µj i X ˜ j, R i = λj i R

(3.11)

where µj i = k i λj k δ .

(3.12)

The Killing vector transformation rules (3.11) in conjunction with (3.3) lead to A˜ ij = Ak λi k λj + 2B k λ(i k µj ) + C k µi k µj , B˜ ij = B k λi k λj + C k λj µi k , C˜ ij = C k λi k λj .

(3.13)

These equations give the explicit form of the representation of I (E3 ) on K2 (E3 ) with respect to a Cartesian coordinate system on E3 . Equipped with these transformation rules, we can now derive the infinitesimal generators of I (E3 ) in the representation defined by (3.13). Let U m , m = 1, 2, 3, denote the generators associated to the Killing vectors Xm . Noting that such Killing vectors generate translations about the x m -axis, we set λj i = δj i in (3.13) and differentiate the resulting equations with respect to δ m to obtain ∂ B˜ ij ∂ C˜ ij ∂ A˜ ij (i j )k i jk = 2 mk B , = mk C , = 0. ∂δ m i ∂δ m i ∂δ m i δ =0

δ =0

δ =0

The corresponding differential operators are therefore U i = 2 (j i B k)

∂ ∂ + j i C k j k , j k ∂A ∂B

(3.14)

for i = 1, 2, 3, where the range of summation over the derivative operators is understood to be over only those parameters listed in (3.9). Next, let V m , m = 1, 2, 3, denote the generators associated to the Killing vectors R m , the generators of rotations about the x m -axis. For an infinitesimal rotation about the x 3 -axis by an angle θ 3 , the rotation λj i ∈ SO(3) is given by     cos θ 3 − sin θ 3 0 0 −1 0 i dλ j λj i =  sin θ 3 cos θ 3 0 ⇒ = 1 0 0 = 3j i . 3 dθ 0 0 0 ij 0 0 1 ij θ 3 =0

686

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

More generally, for an infinitesimal rotation about the x m -axis, dλj i dλi j j = ⇔ = i j m. mi dθ m m dθ m θ m =0 θ =0

It thus follows from (3.13) that ∂ ∂ + ( j i B k + k i B j ) j k j k ∂A ∂B ∂ k j + i C ) j k , ∂C

V i = ( j i Ak + k i Aj ) +( j i C k

(3.15)

for i = 1, 2, 3. As required by the general theory, the generators (3.14) and (3.15) satisfy the same commutation relations as the Killing vectors (3.1), namely [U i , U j ] = 0,

[U i , V j ] = k ij U k ,

[V i , V j ] = k ij V k .

For computational purposes, we shall require explicit expressions for the generators. It follows from (3.14) and (3.15) that Ui =

21 j =1

∂ Gi , ∂a j j

Vi =

21 j =1

Gi+3 j

∂ , ∂aj

(3.16)

for i = 1, 2, 3, where a j , j = 1, . . . , 21, are the twenty-one Killing tensor parameters ordered by (3.9) and 

0  2b13  −2b12 Gi j =   0  −2α2 2α3

−2b23 0 2b21 2α1 0 −2α3

2b32 b22 − b33 b12 −b13 0 −γ1 γ1 −2b31 −b21 b33 − b11 b23 γ2 0 −γ2 0 b31 −b32 b11 − b22 −γ3 γ3 0 −2α1 a3 − a2 −α3 α2 0 b23 + b32 −b23 − b32 2α2 α3 a1 − a3 −α1 −b31 − b13 0 b31 + b13 0 −α2 α1 a2 − a1 b12 + b21 −b12 − b21 0

−c3 γ3 0 c2 0 −γ2 0 −c1 γ1 −γ3 c3 0 γ2 0 −c2 0 −γ1 c1 b33 − b22 −b21 b13 b33 − b22 −b12 b31 b21 b11 − b33 −b32 b12 b11 − b33 −b23 b32 b22 − b11 −b31 b23 b22 − b11 −b13  0 0 0 0 0 0 0 0 0 0 0 0   0 0 0 0 0 0  . 0 2γ1 −2γ1 c3 − c2 −γ3 γ2    −2γ2 0 2γ2 γ3 c1 − c3 −γ1 2γ3 −2γ3 0 −γ2 γ1 c2 − c1 ij Finally, we observe that the coefficient matrix Gi j has rank six almost everywhere, and so, in view of Theorem 2.1, we expect fifteen fundamental I (E3 )-invariants. The computation and presentation of these invariants are treated in Sect. 5.

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

687

4. Hamilton-Jacobi Theory and Orthogonal Coordinate Webs Consider a Hamiltonian system defined on (M, g) by a natural Hamiltonian function of the form H = 21 g ij (x)pi pj + V (x),

i, j = 1, . . . n, (4.1) n ∂ ∂ with respect to the canonical Poisson bi-vector P = i=1 ∂x i ∧ ∂pi given in terms of the position-momenta coordinates (x, p) = (x i , pi ), i = 1, . . . , n on the cotangent bundle T ∗ (M). As is well-known, in many cases the Hamiltonian system defined by (4.1) can be integrated by quadratures by finding a complete integral W of the corresponding Hamilton-Jacobi equation which is a first-order PDE given by ∂W ∂W ∂W 1 ij . (4.2) g (x) i j + V (x) = E, pj = 2 ∂x ∂x ∂x i The geometrical meaning of Eq. (4.2) and its complete integral W is well-understood (see, for example, [1]). Thus, if the function F = 21 g ij W,i W,j + V − E = 0 is regular on the cotangent bundle T ∗ (M), then Eq. (4.2) defines a hypersurface in T ∗ (M). Furthermore, W is a complete integral of (4.2) iff the Lagrangian submanifold S ⊂ T ∗ (M) determined by the equations pi = W,i lies on the hypersurface defined by the HamiltonJacobi equation (4.2). Solving (4.2) is normally based on finding a canonical transformation to separable coordinates: (x, p) → (u, v) with respect to which the equation can be solved under the additive separation ansatz W (u; c) = ni=1 Wi (ui ; c) and the nondegeneracy condition det(∂ 2 W/∂ui ∂cj )n×n = 0, where c = (c1 , . . . , cn ) is a constant vector. Orthogonal separation of variables occurs in the case when the transformations to separable coordinates are point-transformations and the metric tensor g is diagonal with respect to the coordinates of separation (u, v). A useful criterion for orthogonal separability is given by Benenti [2]. Theorem 4.1. The Hamiltonian system defined by (4.1) is orthogonally separable if and only if there exists a valence-two Killing tensor K with (i) pointwise simple and real eigenvalues, (ii) orthogonally integrable (normal) eigenvectors and (iii) such that d(K dV ) = 0.

(4.3)

A Killing tensor satisfying conditions (i) and (ii) of Theorem 4.1 is called a characteristic Killing tensor (CKT). Let us elaborate on conditions (i) and (ii) in Theorem 4.1. On two-dimensional Riemannian manifolds of constant curvature these conditions are trivial, since every eigenvector ξ of K is normal and K has repeated eigenvalues iff it is a multiple of the metric g. In three-dimensions, E3 in particular, the situation is far more complicated. Computing eigenvectors of a symmetric 3 × 3 tensor is tedious and becomes virtually intractable if one considers Killing tensors with arbitrary parameters. Instead, we employ the Tonolo-Schouten-Nijenhuis (TSN) conditions, as introduced in Sect. 1, which are both necessary and sufficient for a given symmetric (Killing) tensor field to have integrable eigenvectors. These conditions read N [j k gi] = 0,

(4.4a)

= 0,

(4.4b)

N

N

[j k Ki]

m

[j k Ki]m K

= 0,

(4.4c)

688

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

where N i j k are the components of the Nijenhuis tensor of K ij given by N i j k = K i K [j,k] + K [j K i k], .

(4.5)

We remark that the TSN conditions (4.4a)–(4.4c) yield 10 quadratic, 35 cubic and 84 quartic equations, respectively, in the Killing tensor parameters. It is thus straightforward to verify, using (4.4), if a given Killing tensor satisfies condition (ii) of Theorem 4.1. However, we have been unable to solve the conditions (4.4) directly to obtain the most general Killing tensor admitting orthogonally integrable eigenvectors2 . Condition (i), namely the distinct eigenvalues condition, like condition (ii), can also be verified directly for a given Killing tensor: we simply compute the discriminant of the characteristic polynomial of K ij and verify that it does not vanish identically. Although a general solution of the TSN conditions (4.4) appears intractable, we may instead employ Eisenhart’s method [8] to derive all Killing tensors with normal eigenvectors up to equivalence. In particular, each representative Killing tensor characterizes separability of the Hamilton-Jacobi equation (4.2) in one of the eleven (orthogonally) separable coordinate systems in E3 . The Eisenhart method can be described as follows. Consider the Euclidean metric ds 2 = g11 (du1 )2 + g22 (du2 )2 + g33 (du3 )2 , with respect to separable coordinates ui . The method yields three canonical Killing tensors given by Kij = diag(λ1 g11 , λ2 g22 , λ3 g33 ), where the λi satisfy the linear system of PDEs ∂λi ∂ = (λi − λj ) j ln gii , (no sum) (4.6) j ∂u ∂u for i, j = 1, 2, 3 (see [8], Eqs. (1.8)). Trivially, any multiple of the metric g satisfies (4.6). It is straightforward to solve (4.6) for each of the eleven separable coordinate systems in E3 to obtain the two additional (non-trivial) canonical Killing tensors which we shall label K 1 and K 2 . We now summarize the results of this calculation. For each of the eleven separable coordinate systems, we give the corresponding coordinate transformation, ranges of the separable coordinates, the metric and the components of the canonical Killing tensors K 1 and K 2 .  x = x, y = y, z = z     −∞ < x, y, z < ∞  Cartesian: ds 2 = dx 2 + dy 2 + dz2 (4.7) (x, y, z)   K ij = diag(0, 1, 0)    1ij K2 = diag(0, 0, 1)

Circular cylindrical: (r, θ, z)

 x = r cos θ, y = r sin θ, z = z    r 0, 0 θ < 2π, −∞ < z < ∞   ds 2 = dr 2 + r 2 dθ 2 + dz2 ij   K1 = diag(0, r 4 , 0)    ij K2 = diag(0, 0, 1)

(4.8)

2 Steve Czapor (private communication) has simplified the situation considerably. Using Gr¨ obner basis theory, he has shown that (4.4a) and (4.4b) imply (4.4c), for any Killing tensor K ∈ K2 (E3 ).

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

689

 x = 21 (µ2 − ν 2 ), y = µν, z = z      µ 0, −∞ < ν < ∞, −∞ < z < ∞ ds 2 = (µ2 + ν 2 )(dµ2 + dν 2 ) + dz2  ij  K = diag(ν 2 g11 , −µ2 g22 , 0)    1ij K2 = diag(0, 0, 1)

(4.9)

Parabolic cylindrical: (µ, ν, z)

Elliptic-hyperbolic: (η, ψ, z)

Spherical: (r, θ, φ)

 x = a cosh η cos ψ, y = a sinh η sin ψ, z = z      η 0, 0 ψ < 2π, −∞ < z < ∞, a > 0 ds 2 = a 2 (cosh2 η − cos2 ψ)(dη2 + dψ 2 ) + dz2 (4.10) ij   K1 = diag(a 2 cos2 ψ g11 , a 2 cosh2 η g22 , 0)    ij K2 = diag(0, 0, 1)

 x = r sin θ cos φ, y = r sin θ sin φ, z = r cos θ      r 0, 0 θ < π, 0 φ < 2π ds 2 = dr 2 + r 2 dθ 2 + r 2 sin2 θ dφ 2 ij    K1 = diag(0, r 4 , r 4 sin2 θ)   ij K2 = diag(0, 0, r 4 sin4 θ)

(4.11)

 x = a sinh η sin θ cos ψ, y = a sinh η sin θ sin ψ, z = a cosh η cos θ     Prolate  η 0, 0 θ < π, 0 ψ < 2π, a > 0 ds 2 = a 2 (sinh2 η + sin2 θ )(dη2 + dθ 2 ) + a 2 sinh2 η sin2 θ dψ 2 spheroidal: ij   (η, θ, ψ)  K1 = diag − a 2 sin2 θ g11 , a 2 sinh2 η g22 , a 2 (sinh2 η − sin2 θ)g33   ij K2 = diag(0, 0, a 2 sinh2 η sin2 θ g33 ) (4.12)  x=a cosh η sin θ cos ψ, y = a cosh η sin θ sin ψ, z = a sinh η cos θ     Oblate  η 0, 0 θ < π, 0 ψ < 2π, a > 0 ds 2 = a 2 (cosh2 η − sin2 θ )(dη2 + dθ 2 ) + a 2 cosh2 η sin2 θ dψ 2 spheroidal: 2 2 ij  2 2 2 2 2 (η, θ, ψ)    K1ij = diag a sin θ g11 , a cosh η g22 , a (cosh η + sin θ)g33  K2 = diag(0, 0, a 2 cosh2 η sin2 θ g33 ) (4.13)

Parabolic: (µ, ν, ψ)

 x = µν cos ψ, y = µν sin ψ, z = 21 (µ2 − ν 2 )      µ 0, ν 0, 0 ψ < 2π ds 2 = (µ2 + ν 2 )(dµ2 + dν 2 ) + µ2 ν 2 dψ 2  ij  K = diag − ν 2 g11 , µ2 g22 , (µ2 − ν 2 )g33    1ij K2 = diag(0, 0, µ2 ν 2 g33 )

(4.14)

  rθ λ 2 2 r 2 (θ 2 − b2 )(b2 − λ2 ) 2 r 2 (c2 − θ 2 )(c2 − λ2 )  2  = ,y = ,z = x  2 2 2  bc b (c − b ) b2 (c2 − b2 )    r 0, b2 < θ 2 < c2 , 0 < λ2 < b2 ,   Conical: r 2 (θ 2 − λ2 ) r 2 (θ 2 − λ2 ) (4.15) 2 (r, θ, λ)  ds 2 = dr 2 + 2 dθ dλ2 +  2 − λ2 )(c2 − λ2 ) 2 )(c2 − θ 2 )  (θ − b (b   ij   K = diag(0, r 2 λ2 g22 , r 2 θ 2 g33 )    1ij K2 = diag(0, r 2 g22 , r 2 g33 )

690

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

Paraboloidal: (µ, ν, λ)

Ellipsoidal: (η, θ, λ)

 4(µ − b)(b − ν)(b − λ) 2 4(µ − c)(c − ν)(λ − c)   ,y = , x2 =   b−c b−c    z=µ+ν+λ−b−c     0 < ν η > b > θ > c > λ (η − θ )(η − λ) (θ − η)(θ − λ) ds 2 = dη2 + dθ 2   4(a − η)(b − η)(c − η) 4(a − θ )(b − θ )(c − θ)    (λ − η)(λ − θ )   + dλ2   4(a − λ)(b − λ)(c − λ)    ij  K1 = diag − (θ + λ)g11 , −(λ + η)g22 , −(η + θ)g33    ij K2 = diag(θ λg11 , ληg22 , ηθg33 ) (4.17)

As we are dealing with Hamiltonian systems defined in terms of Cartesian coordinates, the next step is to transform the components of each of the canonical Killing ij ij tensors K1 and K2 to Cartesian coordinates. This again is a routine calculation using the transformations from Cartesian to separable coordinates listed in (4.7)–(4.17) and the appropriate tensor transformation law. For each of the eleven separable cases, we take ij ij a linear combination of K1 , K2 and the metric g ij . Using (3.7), we then identify the constants in the linear combination and any essential parameters3 with the Killing tensor parameters (3.9) appearing in (3.7). We note that if the separable case under consideration has n essential parameters, then one can generally choose n+3 of the Killing tensor parameters in the identification. However, in the paraboloidal and ellipsoidal cases, it is convenient to choose more than n + 3 parameters in the identification so that the components of the resulting Killing tensor are polynomials in the parameters and the Cartesian coordinates. Consequently, this leads to algebraic constraints in the Killing tensor parameters. These constraints not only ensure that the resulting Killing tensor has normal eigenvectors, but also guarantees that one can always (uniquely) recover the original constants in the linear combination and all essential parameters from the identified Killing tensor parameters. Each of the CKTs constructed in this manner uniquely defines one of the eleven possible orthogonal coordinate webs in E3 . We now present the results of this procedure. For each of the eleven separable coordinate systems, we give the components of the corresponding CKT with respect to Cartesian coordinates and any restrictions on the Killing tensor parameters. 3

ij

ij

These refer to any parameters appearing in the canonical Killing tensors K1 and K2 .

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

1. Cartesian web

 a1 0 0 =  0 a2 0  . 0 0 a3

691



K ij

(4.18)

2. Circular cylindrical web  a1 + c3 y 2 −c3 xy 0 =  −c3 xy a1 + c3 x 2 0  . 0 0 a3 

K ij

(4.19)

3. Parabolic cylindrical web  a1 b23 y 0 = b23 y a1 − 2b23 x 0  . 0 0 a3 

K ij

4. Elliptic-hyperbolic web   a1 + c3 y 2 −c3 xy 0 K ij =  −c3 xy a2 + c3 x 2 0  , 0 0 a3

(4.20)

a1 − a2 > 0. c3

(4.21)

5. Spherical web  −c3 xy −c2 xz a1 + c 2 z 2 + c 3 y 2 . = −c3 xy a1 + c3 x 2 + c2 z 2 −c2 yz 2 2 −c2 xz −c2 yz a1 + c 2 x + c2 y 

K ij

6. Prolate spheroidal web   −c3 xy −c2 xz a1 + c2 z2 + c3 y 2 , K ij =  −c3 xy a1 + c3 x 2 + c2 z 2 −c2 yz 2 2 −c2 xz −c2 yz a3 + c2 x + c2 y

(4.22)

a3 − a1 >0. c2 (4.23)

7. Oblate spheroidal web   −c3 xy −c2 xz a1 + c2 z2 + c3 y 2 , K ij =  −c3 xy a1 + c3 x 2 + c2 z 2 −c2 yz 2 2 −c2 xz −c2 yz a3 + c2 x + c2 y

a3 − a1 <0. c2 (4.24)

8. Parabolic web  a1 − 2b12 z + c3 y 2 −c3 xy b12 x   = a1 − 2b12 z + c3 x 2 b12 y  . −c3 xy b12 y a1 b12 x 

K ij

(4.25)

692

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

9. Conical web K ij

 −c3 xy −c2 zx a1 + c 2 z 2 + c 3 y 2 . = −c3 xy a1 + c3 x 2 + c1 z 2 −c1 yz 2 2 −c2 zx −c1 yz a1 + c1 y + c2 x 

(4.26)

10. Paraboloidal web  K ij

 a1 − 2b12 z + c3 y 2 −c3 xy b12 x   = a2 + 2b21 z + c3 x 2 −b21 y  , −c3 xy −b21 y a3 b12 x

(4.27)

b12 [b12 b21 + c3 (a2 − a3 )] + b21 [b12 b21 + c3 (a1 − a3 )] = 0. 11. Ellipsoidal web   −c3 xy −c2 zx a1 + c2 z2 + c3 y 2 , K ij =  −c3 xy a 2 + c 3 x 2 + c1 z 2 −c1 yz −c2 zx −c1 yz a3 + c1 y 2 + c2 x 2

(4.28)

(a1 − a2 )c1 c2 + (a2 − a3 )c2 c3 + (a3 − a1 )c3 c1 = 0. We remark that the eleven Killing tensors (4.18)–(4.28) represent all possible Killing tensors with normal eigenvectors, up to equivalence. We have essentially computed the general solution of the TSN conditions (4.4) using Eisenhart’s method. As we shall see in Sect. 6, the fundamental invariants of K2 (E3 ) fail to discriminate amongst the eleven coordinate webs. In anticipation, we make the following key observation. The eleven orthogonal coordinate webs can be divided into three groups according to Table 4.1. More precisely, we say that a CKT K ∈ K2 (E3 ) is translational (rotational) if it admits a translational (rotational) Killing vector4 V ∈ K1 (E3 ) satisfying LV K = 0. This definition still lacks complete precision for we have not defined translational and rotational Killing vectors. Certainly, one can give a definition of such Killing vectors in terms of integral curves. But ideally we would like to give a definition in terms of algebraic invariants. We will revisit this problem in Sect. 6.1. For now, it suffices to note that the canonical translational CKTs (4.18)–(4.21) admit the Killing vector V = X 3 , while the canonical rotational CKTs (4.22)–(4.25) admit the Killing vector V = R 3 . We now pose the following problem: construct a subspace of K2 (E3 ) consisting of translational Killing tensors, say with V = X3 . To proceed we take the general Killing tensor from (3.3) and impose the condition LX3 K = 0. This gives a linear system of equations in the Killing tensor parameters which can be readily solved to yield   a1 + 2b13 y + c3 y 2 α3 − b13 x + b23 y − c3 xy α2 − β1 y K ij = α3 − b13 x + b23 y − c3 xy a2 − 2b23 x + c3 x 2 α1 + β1 x  . (4.29) α2 − β1 y α 1 + β1 x a3 We see that the four canonical translational CKTs are all special cases of (4.29). However, it follows that the Killing tensor (4.29) does not generally have normal eigenvectors. Consequently, we cannot take our subspace to be those Killing tensors of the form (4.29). Nevertheless, using the TSN conditions (4.4), it can be shown that α1 = α2 = β1 = 0 4 The circular cylindrical tensor (4.19) also admits a rotational Killing vector and can therefore be considered as both translational and rotational.

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

693

Table 4.1. The orthogonal coordinate webs in Euclidean space Translational webs Cartesian circular cylindrical parabolic cylindrical elliptic-hyperbolic

Rotational webs spherical prolate spheroidal oblate spheroidal parabolic

Asymmetric webs conical paraboloidal ellipsoidal

is a sufficient condition for (4.29) to have normal eigenvectors. This is not a necessary condition, for the constant Killing tensor K ij = Aij is of the form (4.29) and has normal eigenvectors. But, it is immediate from the transformation rules (3.13) that a CKT is Cartesian if and only if it is a constant Killing tensor. Moreover, it can be shown from (4.4) that if a Killing tensor of the form (4.29) has normal eigenvectors and is not Cartesian, then α1 = α2 = β1 = 0. Therefore, any translational Killing tensor with orthogonally integrable eigenvectors which is not Cartesian has the form  a1 + 2b13 y + c3 y 2 α3 − b13 x + b23 y − c3 xy 0 = α3 − b13 x + b23 y − c3 xy a2 − 2b23 x + c3 x 2 0 . 0 0 a3 

ij KT

(4.30)

We define the subspace KT2 (E3 ) of K2 (E3 ) to be the set of all Killing tensors of the form (4.30). We shall refer to this subspace as the space of translational Killing tensors, bearing in mind that it does not include all of the Cartesian CKTs. This does not pose any obstacle whatsoever in our goal of constructing a classification scheme, as we pointed out that Cartesian CKTs are trivially the constant Killing tensors. Finally, we remark that the space KT2 (E3 ) enjoys two nice features. Firstly, the form of the general translational Killing tensor (4.30) is invariant under the (orientation-preserving) isometry group I (E2 ), and secondly, the upper 2 × 2 block of (4.30) is the general Killing tensor on the Euclidean plane K2 (E2 ) (see, for example, [15]). Consequently, we can take advantage of known results in the literature for classifying the translational webs. We can perform a similar analysis for the rotational webs. It follows that any Killing tensor with normal eigenvectors admitting a Killing vector V = R 3 has the form  −c3 xy b12 x − c2 xz a1 − 2b12 z + c2 z2 + c3 y 2 = −c3 xy a1 − 2b12 z + c3 x 2 + c2 z2 b12 y − c2 yz  . b12 x − c2 xz b12 y − c2 yz a3 + c 2 x 2 + c 2 y 2 (4.31) 

ij

KR

2 (E3 ) of K2 (E3 ) to be the set of all Killing tensors of the We define the subspace KR form (4.31) and shall refer to this subspace as the space of rotational Killing tensors. We remark that the form of the general rotational Killing tensor (4.31) is also invariant under the isometry group I (R) (i.e. the group of translations about the z-axis) and that all canonical rotational CKTs (4.22)–(4.25) are special cases of (4.31). In conclusion, we have defined the following four vector spaces: K1 (E3 ), KT2 (E3 ), 2 KR (E3 ) and K2 (E3 ). For each of these four spaces, we need to compute the fundamental invariants under the action of the appropriate isometry group and classify the corresponding canonical forms. This is the topic of the next two sections.

694

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

5. Fundamental Invariants of Killing Tensors in Euclidean Space In this section we derive the fundamental invariants in each of the four vector spaces 2 (E3 ) and K2 (E3 ), under the action of their corresponding isometry K1 (E3 ), KT2 (E3 ), KR group. 5.1. The space of Killing vectors. The most general Killing vector V ∈ K1 (E3 ) may be expressed as V = Ai X i + C i R i ,

(5.1)

where Xi and R i , i = 1, 2, 3, are the Killing vectors defined in (3.1) and the coefficients Ai and C i are constant. For sake of convenience, we set Ai = (a1 , a2 , a3 ),

C i = (c1 , c2 , c3 ),

so that the space of Killing vector parameters is spanned by the six parameters (5.2)

a1 , a2 , a3 , c1 , c2 , c3 . For future reference, we note that (5.1) can be written in the form V = (a1 − c2 z + c3 y)X 1 + (a2 − c3 x + c1 z)X 2 + (a3 − c1 y + c2 x)X 3 .

(5.3)

As in Sect. 3, we can derive the transformation rules for the Killing vector parameters under the action of I (E3 ). It follows that A˜ i = Aj λi j + C j µi j , C˜ i = C j λi j , (5.4) which lead to the following infinitesimal generators of I (E3 ) in the representation defined by (5.4): ∂ ∂ ∂ U i = j ik C k j , V i = j ki Ak j + j ki C k j , (5.5) ∂A ∂A ∂C for i = 1, 2, 3. Using the formalism of Sect. 2, it follows from (5.5) that K1 (E3 ) admits two fundamental I (E3 )-invariants, namely 1 = C i Ci ,

2 = Ai Ci .

In the next section, we will use these invariants to classify the elements of define translational and rotational Killing vectors in terms of 1 and 2 .

(5.6) K1 (E3 )

and

5.2. The space of translational Killing tensors. In Sect. 4 we defined the space of translational Killing tensors KT2 (E3 ) to the set of Killing tensors of the form (4.30). We pointed out that the upper 2 × 2 block of (4.30) is the general Killing tensor in the space K2 (E2 ). The fundamental I (E2 )-invariants of this vector space are known (see, for example, [15]), and hence are also I (E2 )-invariants of KT2 (E3 ). It follows that KT2 (E3 ) admits two fundamental I (E2 )-invariants given by5 1 = c3 ,

2 = [b13 2 − b23 2 + c3 (a2 − a1 )]2 + 4(b13 b23 − α3 c3 )2

(5.7)

(see Proposition 4.1 in [15]). 5

The paper [15] only treats the space of “non-trivial” Killing tensors in K2 (E2 ). The invariants (5.7) form a complete set of fundamental I (E2 )-invariants for this vector subspace. It can be shown that KT2 (E3 ) admits two additional fundamental I (E2 )-invariants; we do not present them here as they play no role in classifying the elements of KT2 (E3 ).

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

695

5.3. The space of rotational Killing tensors. The set of all Killing tensors of the form 2 (E3 ). This subspace of K2 (E3 ) (4.31) defines the space of rotational Killing tensors KR is mapped to itself under the action of the isometry group I (R), the group of translations about the z-axis. Trivially, the Lie algebra i(R) is generated by the Killing vector field X3 and hence the corresponding infinitesimal generator in the parameter space is the 2 (E3 ) which reads generator U 3 in (3.16) restricted to KR U 3 = −2b12

∂ ∂ − c2 . ∂a1 ∂b12

(5.8)

Solving the PDE U 3 (F ) = 0 by the method of characteristics, we obtain the four 2 (E3 ), namely fundamental I (R)-invariants of KR 1 = c2 ,

2 = b12 2 + c2 (a3 − a1 ),

3 = a3 ,

4 = c3 .

(5.9)

5.4. The space of Killing tensors. We now briefly describe how to derive a complete set of invariants for the full vector space K2 (E3 ) of valence-two Killing tensors in Euclidean space. As mentioned at the end of Sect. 3, K2 (E3 ) admits fifteen fundamental I (E3 )-invariants which can be computed by solving the system of linear PDEs U i (F ) = 0,

V i (F ) = 0,

i = 1, 2, 3,

(5.10)

where the generators U i and V i are given by (3.16) and F is an analytic function in the Killing tensor parameters. Computationally, arriving at the general solution of (5.10) is non-trivial. In particular, we have found that the method of characteristics becomes intractable when applied to (5.10). However, we have successfully computed all fifteen invariants using the method of undetermined coefficients [6]. The simplest implementation of this method is to build monomial trial functions in the Killing tensor parameters up to a fixed degree, take a linear combination of these monomials and substitute the combination into (5.10) leading to a large (sparse) system of linear equations in the undetermined coefficients. This approach has two obvious disadvantages. Firstly, such an ansatz does not take advantage of the apparent structure and symmetry of (5.10). Consequently, as one must construct all monomials up to and including degree five to recover all fifteen fundamental invariants, approximately 50 000 undetermined coefficients are involved; the corresponding linear system requires almost ninety hours of CPU time to solve on a modest Sun workstation! Secondly, by virtue of the ansatz, the computed invariants are in expanded form and occupy fifteen pages of output. A more effective ansatz involves constructing scalar trial functions which are “tensorial”6 in the Aij , B ij and C ij . For example, trial functions which are cubic in the C ij include C i i Cj j Ck k ,

C ij Cij Ck k ,

C ij Cj k Cki ,

ikm j n C ij C k C mn .

6 As the Aij and B ij do not transform as tensors (see (3.13)), there is no reason why this ansatz should work.

696

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

Implementing the method of undetermined coefficients with this ansatz yields the following fifteen fundamental I (E3 )-invariants of K2 (E3 ): 1 = Bi i , 2 = Ci i , 3 = B ij Cij , 4 = C ij Cij , 5 = B ij Bj i + Aij Cij , 6 = B ij Cj k Cki ,

7 = C ij Cj k Cki ,

8 = C ij [Bj k (Bik + 2Bki ) + Aj k Cki ],

9 = ikm j n B ij B k B mn − 2(Bi [i Bj j ] + Aij Cij )Bk k + 6B ij Aj k Cki , 10 = B ij (Bi k Ckj − 2Bj k Cki ) − (B ij Bij + Aij Cij )Ck k + Ai i Cj [j Ck k] , 11 = im j kp B ij B k C mn Cn p + B ij [Bij C k Ck − Cj k (Ck Bi + 4C[k B]i )] +Aij Cij Ck [k C ] , 12 = Ai i [(Cj j Ck k + 3C j k Cj k )C − 4C j k Ck Cj ] − 6Aij Cij C k Ck +6B ij {Bij C k Ck − Cj k [(Bik − 2Bki )C + 4Ck Bi ]} +12im j kp B ij B k C mn Cn p , 13 = Aij (Bij Ck [k C ] + Bj k Ck Ci − 2Ci(j Bk) k C ) +Ai i C j k (Bj k C − Bk Cj ) − B ij [Bij B k Ck + 2Cj k Bki B +Bj k Bik C − (Bj k Bi + Bi k Bj )Ck ], 14 = 4Ai [i Aj j ] Ck [k C ] + 8Aij (Aj k Ck[i C] + Ak k C[j C]i ) + Aij Cij (Ak Ck +4Bk [k B ] ) + 4C ij Bj k Ak Bi + 16Aij Cj k B[k B]i , 15 = Aij Cij [(Ck k C − 3C k Ck )Cm m + 2C k C m Cmk ] −6Aij Cj k Cki C [ Cm m] − 12C ij Bj k (Ck Bi[ Cm] m + 2Bk Ci[ Cm] m ). (5.11) We have hence proven the following theorem. Theorem 5.1. Consider the vector space K2 (E3 ). Any algebraic I (E3 )-invariant I defined over K2 (E3 ) in terms of the Killing tensor parameters (3.9) where the isometry group I (E3 ) acts freely and regularly with six-dimensional orbits can be locally uniquely expressed as an analytic function I = F (1 , . . . , 15 ), where the fundamental I (E3 )invariants i , i = 1, . . . , 15, are given by (5.11). We close this section by addressing a minor computational issue. It is clear from (3.7) that the Killing tensor parameters b11 , b22 and b33 are not uniquely determined for a given Killing tensor K ∈ K2 (E3 ). Thus, how does one evaluate the invariants (5.11) on a given Killing tensor? Indeed, we require an invariant method for solving Eqs. (3.8) for the bii in terms of the known βi . This problem is easily rectified upon observing that Eqs. (3.8) in conjunction with the condition 1 = 0 ⇔ b11 + b22 + b33 = 0 yields a unique solution for the bii given by b11 = 13 (β3 − β2 ),

b22 = 13 (β1 − β3 ),

b33 = 13 (β2 − β1 ).

(5.12)

In what follows, we shall extract the parameters b11 , b22 and b33 from a given K ∈ K2 (E3 ) using (5.12). 6. Invariant Classification of Orthogonal Coordinate Webs in Euclidean Space In order to build a classification scheme for the orthogonal coordinate webs in Euclidean space based on the set of Killing tensors in K2 (E3 ) with normal eigenvectors and distinct

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

697

eigenvalues, one must first know how to classify elements in the vector spaces K1 (E3 ), 2 (E3 ). The classification of elements in these spaces are treated in SubKT2 (E3 ) and KR sects. 6.1, 6.2 and 6.3, respectively. We then use these results to classify the orthogonal coordinate webs of K2 (E3 ) in Subsect. 6.4. 6.1. Classification of Killing vectors. We shall classify the elements of K1 (E3 ) according to whether the fundamental invariants (5.6) are zero or non-zero. There are three cases to consider each of which gives rise to a canonical Killing vector. The classification scheme is summarized in Table 6.1: 1. 1 = 0, 2 = 0. In this case, the Killing vector V is of the form V = Ai X i ,

(6.1)

where it is assumed that the Ai are not all zero so that V is non-trivial. It follows that we can use the isometry group I (E3 ) to transform (6.1) to ˜ 3, V˜ = a˜ 3 X

(6.2)

for some a˜ 3 = 0. Indeed, the transformation rules (5.4) reduce to C˜ i = 0 and A˜ i = Aj λi j . Without loss of generality, we can set the components of the translation δ i = 0. The components of the rotation λj i can be computed by setting λ3 j = (Ak Ak )−1/2 Aj and then obtaining λ1 j and λ2 j by extending the vector λ3 j to a proper orthonormal basis in E3 (using the Gram-Schmidt algorithm or QR decomposition, for example). We observe that (6.2) defines a translation, hence we say that a Killing vector V ∈ K1 (E3 ) is translational iff 1 = 2 = 0. 2. 1 = 0, 2 = 0. Using the isometry group I (E3 ), we claim that any such Killing vector V can be transformed to ˜ 3, V˜ = c˜3 R

(6.3)

for some c˜3 = 0. Indeed, we can first make a translation x i = xˆ i + δ i so that Aˆ i = 0 in the new coordinates xˆ i . It follows from the transformation rules (5.4) and (3.12) that the δ i must satisfy the system of linear equations i j k C j δ k = Ai , which has a solution iff 2 = 0. ˜ i. 1. In the new coordinates xˆ i , it follows that V˜ = C i R 2. By a similar argument to that used in the previous case, we can find a rotation λj i such that the coordinate transformation xˆ i = λj i x˜ j puts V˜ into the form (6.3). We observe that (6.3) defines a rotation, hence we say that a Killing vector V ∈ K1 (E3 ) is rotational iff 1 = 0 and 2 = 0. Table 6.1. Invariant classification of Killing vectors in Euclidean space Classification translational rotational helicoidal

Invariants 1 = 0, 1 = 0, 1 = 0,

2 = 0 2 = 0 2 = 0

698

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

3. 1 = 0, 2 = 0. By similar arguments to those used in the previous two cases, it can be shown in this case that there exists a coordinate transformation (3.10) such that ˜ 3 + c˜3 R ˜ 3, V˜ = a˜ 3 X

(6.4)

for some non-zero a˜ 3 and c˜3 . The integral curves of such a Killing vector field are helices and so we say that a Killing vector V ∈ K1 (E3 ) is helicoidal iff both 1 = 0 and 2 = 0. We remark that we can further refine this classification into left and right-handed helicoidal Killing vectors according to the sign of 2 . 6.2. The space of translational Killing tensors. A classification scheme for the vector space of translational Killing tensors KT2 (E3 ) based on the fundamental I (E2 )-invariants (5.7) is provided in [15]. For completeness, we summarize this scheme in Table 6.2.

6.3. The space of rotational Killing tensors. Evaluating the I (R)-invariants (5.9) on each of the rotational CKTs (4.22)–(4.25) produces a classification scheme for the vec2 (E3 ). It turns out that we only need to use the invariants and in (5.9) tor space KR 1 2 to obtain a classification. As we pointed out in Sect. 4, we can also include the circular cylindrical web in this classification. Our classification scheme is detailed in Table 6.3.

6.4. The space of Killing tensors. The motivation for constructing invariant classification schemes in the vector spaces treated in the previous three subsections is due to the fact that we have been unable to obtain a scheme based solely on the fifteen fundamental I (E3 )-invariants of the full space K2 (E3 ) presented in (5.11). This is primarily because these invariants fail to discriminate amongst some of the canonical CKTs (4.18)–(4.28). To begin, let K ∈ K2 (E3 ) be the given CKT for which we wish to classify. As we showed in Sect. 4, if K is constant, then it necessarily characterizes a Cartesian web. Table 6.2. Invariant classification of translational Killing tensors in Euclidean space Orthogonal coordinate web Cartesian circular cylindrical parabolic cylindrical elliptic-hyperbolic

Invariants 1 = 0, 1 = 0, 1 = 0, 1 = 0,

2 2 2 2

=0 =0

= 0

= 0

Table 6.3. Invariant classification of rotational Killing tensors in Euclidean space Orthogonal coordinate web circular cylindrical spherical prolate spheroidal oblate spheroidal parabolic

Invariants 1 = 0, 1 = 0, 1 = 0, 1 = 0, 1 = 0,

2 2 2 2 2

=0 =0 >0 <0

= 0

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

699

Let us therefore assume for the remainder of this section that K is not constant. The classification of K involves two main steps: 1. Determine whether K characterizes a translational, rotational or asymmetric web (according to the type of Killing vector it admits). 2. Use the classification schemes in Tables 6.2 and 6.3 if K is translational or rotational, or, the classification scheme outlined in Table 6.4 if K characterizes an asymmetric web (i.e. K admits no Killing vector). To proceed with the first step, we let V be the general Killing vector from (5.3) and impose the condition LV K = 0.

(6.5)

Equation (6.5) results in a linear system of equations in the six Killing vector parameters (5.2) which can be readily solved. It follows that the general solution of (6.5) can be decomposed as V = 1 V 1 + · · · n V n , for some n 6, where i , i = 1, . . . , n, are arbitrary non-zero constants and {V 1 , . . . , V n } is a linearly independent set of Killing vectors. If n = 0, we conclude that K does not admit a Killing vector, and hence characterizes an asymmetric web. Otherwise, using Table 6.1, we classify each of the V i according to whether they are translational, rotational, or helicoidal. Therefore, if one of the V i is translational, then K characterizes a translational web, otherwise K characterizes a rotational web7 . We have now shown how to determine if K characterizes a translational, rotational or asymmetric web. To proceed, suppose that K is a translational (rotational) Killing tensor and let V be its corresponding translational (rotational) Killing vector. From the results of Subsect. 6.2 (6.3), we can use the isometry group I (E3 ) to bring V to the ˜ 3 (c˜3 R ˜ 3 ). Applying the corresponding coordinate transformation to canonical form a˜ 3 X 2 (E3 )). Finally, we can classify the Killing tensor K places it in the subspace KT2 (E3 ) (KR the transformed K using Table 6.2 (6.3). Suppose now that K characterizes an asymmetric web. Using the fundamental I (E3 )-invariants (5.11) of K2 (E3 ), we shall derive a scheme for classifying K. To begin, we evaluate the invariants 2 , 4 and 7 (see (5.11)) on the three asymmetric CKTs (4.26)–(4.28). It follows that for the conical and ellipsoidal tensors, (2 , 4 , 7 ) = (c1 + c2 + c3 , c1 2 + c2 2 + c3 2 , c1 3 + c2 3 + c3 3 ), while for the paraboloidal tensor, (2 , 4 , 7 ) = (c3 , c3 2 , c3 3 ). This motivates defining two auxiliary invariants 1 = 2 2 − 4 ,

2 = 2 3 − 7 ,

(6.6)

noting that 1 = 2 = 0 on the paraboloidal tensor (4.27). We claim that the vanishing of 1 and 2 is also a sufficient condition for K to characterize a paraboloidal web. Indeed, it follows that 1 = 2 = 0 in one or more of the following three cases c1 = c2 = 0,

c2 = c3 = 0,

c3 = c1 = 0.

7 It is impossible for all of the V to be helicoidal. Although the circular cylindrical web is the only i coordinate web which admits a helicoidal Killing vector, it also admits both translational and rotational Killing vectors. Clearly, if one of the V i is helicoidal, we can conclude immediately that K characterizes a circular cylindrical web.

700

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

The conical case (4.26) and the ellipsoidal case (4.28) cannot have c1 = c2 = 0, since this condition reduces them to the spherical and elliptic-hyperbolic tensors, respectively. Moreover, by a rotation, the two other cases are also impossible for conical and ellipsoidal tensors. Therefore, we conclude that K characterizes a paraboloidal web if and only if 1 = 2 = 0. Suppose now that K does not characterize a paraboloidal web for the remainder of this section. It is convenient to define 3 = 34 − 2 2 ,

(6.7)

noting that 3 = (c1 − c2 )2 + (c2 − c3 )2 + (c3 − c1 )2 on both the conical and ellipsoidal CKTs (4.26) and (4.28). Indeed, if 3 = 0, the web cannot possibly be a conical tensor since c1 = c2 = c3 reduces it to a special case of the spherical tensor. Therefore, if 3 = 0, then K necessarily characterizes an ellipsoidal web. To distinguish between a conical and ellipsoidal tensor in the case when 3 = 0, we define three additional auxiliary invariants given by 4 = 2 5 − 38 − 210 , 5 = 2 10 + 4 5 − 11 , 6 = 2 [22 (102 5 + 248 − 310 ) − 7211 + 12 ] −484 8 − 205 7 + 1615 .

(6.8)

It follows that the invariants (6.8) all evaluate to zero on the conical tensor (4.26). We claim that 4 = 5 = 6 = 0 (in conjunction with 3 = 0) is also a sufficient condition for a conical tensor. Indeed, arguing by contradiction, it follows that 4 = (a1 + a2 − 2a3 )c1 c2 + (a2 + a3 − 2a1 )c2 c3 + (a3 + a1 − 2a2 )c3 c1 , 5 = (c1 c2 + c2 c3 + c3 c1 )[(a1 + a2 − 2a3 )c3 + (a2 + a3 − 2a1 )c1 +(a3 + a1 − 2a2 )c2 ], 6 = 12c1 c2 c3 [(2a1 − a2 − a3 )c1 + (2a2 − a3 − a1 )c2 + (2a3 − a1 − a2 )c3 ], on the ellipsoidal tensor (4.28) and that 4 = 5 = 6 = 0 in one or more of the following five cases8 : c1 = c2 = 0,

c2 = c3 = 0,

c3 = c1 = 0,

c1 = c2 = c3 ,

a1 = a2 = a3 .

By previous arguments, the first three cases are impossible if the tensor is ellipsoidal. The fourth case can also be eliminated since 3 = 0. Finally, the fifth case is impossible since it reduces to a conical tensor. Therefore, we conclude that if 3 = 0, then the tensor is conical if and only if 4 = 5 = 6 = 0. This completes the derivation of the classification scheme for the set of asymmetric CKTs in Euclidean space. These results are summarized in Table 6.4. 8

This calculation is facilitated by use of the Maple Gr¨obner basis package.

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

701

Table 6.4. Invariant classification of asymmetric Killing tensors in Euclidean space Orthogonal coordinate web paraboloidal ellipsoidal conical

Invariants (1 , 2 ) = (0, 0) (1 , 2 ) = (0, 0), (1 , 2 ) = (0, 0), (1 , 2 ) = (0, 0),

3 = 0 3 = 0, 3 = 0,

or (4 , 5 , 6 ) = (0, 0, 0) (4 , 5 , 6 ) = (0, 0, 0)

7. Transformations to Canonical Form Once a CKT in Euclidean space has been classified using the scheme detailed in the previous section, the isometry group I (E3 ) can be used to transform the Killing tensor into its corresponding canonical form. This step leads directly to the transformation to separable coordinates. In this section we provide methods for determining the transformations to canonical form. As we shall see, the majority of the calculations amount to elementary linear algebra. The procedure can be summarized as follows. Suppose that K ij are the components of a CKT with respect to Cartesian coordinates x i . Under the action of I (E3 ), the transformation from the original set of Cartesian coordinates x i to another set x˜ i is given by (3.10), i.e. x i = λj i x˜ j + δ i .

(7.1)

∈ R3 Thus, we need to determine the rotation λj ∈ SO(3) and translation which brings K ij to its appropriate canonical form K˜ ij given by one of the eleven cases (4.18)–(4.28) (in the coordinates x˜ i ). Moreover, any essential parameters appearing in the tensor also need to be determined (e.g. the parameter a appearing in the elliptichyperbolic tensor listed in (4.10)). Once λj i , δ i and all essential parameters are known, the transformation from Cartesian coordinates x i to separable coordinates ui is i

x i = λj i T j (uk ) + δ i ,

δi

(7.2)

where x i = T i (uj ) is the standard coordinate transformation associated with the separable coordinates tabulated in (4.7)–(4.17). To carry out this procedure, we use the transformation rules relating the parameter matrices of K ij to those of its canonical form K˜ ij (see (3.6)). In matrix form, these transformation rules read ˜ = λt Aλ + 2 S(λt Bµ) + µt Cµ, A B˜ = λt Bλ + µt Cλ,

(7.3a) (7.3b)

C˜ = λt Cλ,

(7.3c)

where x = λx˜ + δ, S denotes the symmetric part and    1 1 1   λ1 λ2 λ3 x˜ x x = y  = x i , x˜ = y˜  = x˜ i , λ = λ1 2 λ2 2 λ3 2  = λj i , z˜ i z i λ1 3 λ2 3 λ3 3 ij  1   2 3 δ λ1 δ − λ1 3 δ 2 λ2 2 δ 3 − λ2 3 δ 2 λ3 2 δ 3 − λ3 3 δ 2 δ = δ 2  = δ i , µ = λ1 3 δ 1 − λ1 1 δ 3 λ2 3 δ 1 − λ2 1 δ 3 λ3 3 δ 1 − λ3 3 δ 1  = µj i . δ3 i λ1 1 δ 2 − λ1 2 δ 1 λ2 1 δ 2 − λ2 2 δ 1 λ3 1 δ 2 − λ3 2 δ 1 ij

702

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

The identities     2 2 −δ 1 δ 2 −δ 3 δ 1 0 δ 3 −δ 2 (δ ) + (δ 3 )2 µλt = −δ 3 0 δ 1  , µµt =  −δ 1 δ 2 (δ 3 )2 + (δ 1 )2 −δ 2 δ 3  , 2 3 1 1 2 3 1 δ −δ 0 −δ δ −δ δ (δ )2 + (δ 2 )2 (7.4) shall prove useful and, in addition, the inverse of (7.3) which reads ˜ t + 2 S(λBµ ˜ t, ˜ t ) + µCµ A = λAλ t t ˜ , ˜ + µCλ B = λBλ t ˜ . C = λCλ

(7.5a) (7.5b) (7.5c)

Our procedure for determining the transformation to canonical form for the cases of translational and rotational CKTs is provided in Subsects. 7.1 and 7.2, respectively. The three asymmetric CKTs are each treated separately in Subsects. 7.3–7.5. 7.1. The translational Killing tensors. Let us consider first the Cartesian CKT. Since ˜ = λt Aλ, it is necessarily a constant tensor, the transformation rules (7.3) reduce to A ˜ = diag(a˜ 1 , a˜ 2 , a˜ 3 ), on account of (4.18). Trivially, the a˜ i are the eigenvalues where A of A, the columns of λ are the (normalized) eigenvectors of A and the translation δ is arbitrary. Suppose now that the CKT is circular cylindrical, parabolic cylindrical or elliptichyperbolic. We may assume without loss of generality that the tensor has the form (4.30), since it must necessarily be of this form in order to carry out the classification scheme in Sect. 6. Consequently, the rotation and translation are of the form    1 cos φ − sin φ 0 δ λ =  sin φ cos φ 0 , δ = δ 2  , 0 0 1 0 and thus the problem reduces to finding φ, δ 1 and δ 2 . The derivation of these parameters for each of the three translational CKTs under consideration is provided in [15] (see Table 1, p. 1432). We now restate these results in our notation. 1. Circular cylindrical case: The rotation angle φ is arbitrary and δ1 =

b23 , c3

δ2 = −

b13 . c3

2. Parabolic cylindrical case: If b23 = 0, then tan φ = − and φ =

π 2

b13 , b23

for b23 = 0 (unique mod π ). The components of the translation are

δ1 =

b23 (a2 − a1 ) + 2α3 b13 , 2(b13 2 + b23 2 )

δ2 =

b13 (a2 − a1 ) − 2α3 b23 . 2(b13 2 + b23 2 )

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

703

3. Elliptic-hyperbolic case: Let σ1 = b13 2 − b23 2 + c3 (a2 − a1 ),

σ2 = α3 c3 − b13 b23 ,

= σ1 2 + 4σ2 2

(and note that is one of the fundamental invariants (5.7)). Then,  if σ2 = 0 and σ1 < 0,   0, if σ2 = 0 and σ1 > 0, tan φ = ∞, √   σ1 + if σ2 = 0, 2σ2 , (φ unique mod π), and δ 1 and δ 2 are the same as in the circular cylindrical case. Moreover, the essential parameter a satisfies √ a˜ 1 − a˜ 2 a2 = = 2. c˜3 c3 7.2. The rotational Killing tensors. By the same reasoning used in the previous subsection, we may assume without loss of generality that the rotational Killing tensor has the form (4.31). As the isometry group for this subspace of rotational webs is the group of translations about the z-axis, we set λj i = δj i and δ 1 = δ 2 = 0 in the transformation rules (7.3). The determination of δ 3 and hence the transformation to canonical form thus becomes a trivial calculation. It follows that b12 δ3 = c2 for the spherical, prolate spheroidal and oblate spheroidal cases and a1 − a3 δ3 = 2b12 for the parabolic case. Finally, the essential parameter a appearing in the transformation from Cartesian to prolate (oblate) spheroidal coordinates satisfies a˜ 3 − a˜ 1 2 = ± 2, c˜2 1 where 1 and 2 are the fundamental I (R)-invariants (5.9) and the positive (negative) signs correspond to the prolate (oblate) spheroidal tensor. a2 = ±

7.3. The conical case. The parameter matrices associated with the canonical form of the conical Killing tensor specialize to ˜ = a˜ 1 1, A

B˜ = 0,

C˜ = diag(c˜1 , c˜2 , c˜3 )

(see (4.26)). From (7.3c), C˜ = λt Cλ, hence, as in the Cartesian case, the c˜i are the eigenvalues of C and the columns of λ are the (normalized) eigenvectors of C. It follows that the essential parameters b and c satisfy b2 c˜2 − c˜1 = , 2 c c˜3 − c˜1 thus, in order to satisfy the condition b2 < c2 , we can order the eigenvalues such that c˜1 < c˜2 < c˜3 . Finally, substituting (7.3c) into (7.5b) leads to B = µλt C, which can easily be solved for δ noting the identity (7.4).

704

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

7.4. The paraboloidal case. The parameter matrices associated with the canonical form of the paraboloidal Killing tensor specialize to   0 b˜12 0 ˜ = diag(a˜ 1 , a˜ 2 , a˜ 3 ), B˜ = b˜21 0 0 , C˜ = diag(0, 0, c˜3 ), A (7.6) 0 0 0 subject to the constraint b˜12 [b˜12 b˜21 + c˜3 (a˜ 2 − a˜ 3 )] + b˜21 [b˜12 b˜21 + c˜3 (a˜ 1 − a˜ 3 )] = 0 (see (4.27)). It follows that the essential constants b and c satisfy  a˜ 3 a˜ 1 c = a˜ 2 − , c − b = a˜ 2 − , if c˜3 = 0,   2b˜12 2b˜12   ˜   b = a˜ 1 −a˜ 2 , c − b = b12 , if c˜3 = 0, b˜21 = 0, 2c˜3 ˜  c=     b =

2b12 a˜ 1 −a˜ 2 , 2b˜21 a˜ 1 −a˜ 3 , 2b˜12

c−b = c−b =

b˜21 2c˜3 , b˜12 +b˜21 2c˜3 ,

if c˜3 = 0, b˜12 = 0,

(7.7)

(7.8)

if c˜3 = 0, b˜12 = 0, b˜21 = 0,

together with the condition b > c. From (7.3c), C˜ = λt Cλ, hence it follows from (7.6) that C necessarily has a zero eigenvalue of multiplicity two and one other eigenvalue c˜3 . We now consider the two cases c˜3 = 0 and c˜3 = 0 separately. If c˜3 = 0, then it follows from (7.7) that b˜21 = −b˜12 . Moreover, (7.3c) is trivi2 ally satisfied and (7.3b) reduces to B˜ = λt Bλ. This implies that B˜ = λt B 2 λ, where 2 B˜ = diag(−b˜12 2 , −b˜12 2 , 0). Therefore, the negative eigenvalue of B 2 determines b˜12 ; we can take its sign to be positive without loss of generality. The normalized eigenvectors of B 2 determine λ up to a rotation in the eigenspace associated with the negative eigenvalue which fixes λ up to a parameter ψ. Finally, it follows that (7.5a) reduces to ˜ t . This equation can be solved for the a˜ i , δ i , and ψ which, in A = 2 S(Bλµt ) + λAλ general, yields multiple solutions; a particular solution satisfying the condition b > c (see (7.8)) can be selected. Finally, if c˜3 = 0, then the eigenproblem (7.3c) uniquely determines c˜3 and λ up to a parameter ψ. Equations (7.5a) and (7.5b) can then be solved for b˜12 , b˜21 , a˜ i , δ i , and ψ, in conjunction with the condition b > c. 7.5. The ellipsoidal case. The parameter matrices associated with the canonical form of the ellipsoidal Killing tensor specialize to ˜ = diag(a˜ 1 , a˜ 2 , a˜ 3 ), A

B˜ = 0,

C˜ = diag(c˜1 , c˜2 , c˜3 ),

(7.9)

subject to the constraint (a˜ 1 − a˜ 2 )c˜1 c˜2 + (a˜ 2 − a˜ 3 )c˜2 c˜3 + (a˜ 3 − a˜ 1 )c˜3 c˜1 = 0 (see (4.28)). It follows that the essential constants a, b and c satisfy  a˜ 1 −a˜ 2 a˜ 3 −a˜ 1   a − b = c˜3 , c − a = c˜2 , if c˜2 = 0, c˜3 = 0, a˜ 1 −a˜ 2 a˜ 1 −a˜ 2 a − b = c˜3 , c − b = c˜1 , if c˜2 = 0, c˜3 = 0,   a˜ 1 a˜ 1 c − a = a˜ 3c− c − b = a˜ 3c− if c˜2 = 0, c˜3 = 0, ˜2 , ˜1 ,

(7.10)

(7.11)

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

705

together with the condition a > b > c. As in the conical case, (7.3c) implies that the c˜i are the eigenvalues of C and the columns of λ are the (normalized) eigenvectors of C. There are two cases to consider: (1) the c˜i are all distinct and (2) the c˜i are all equal9 . If the c˜i are all distinct, then the matrix λ is uniquely determined (up to the ordering ˜ and of the columns) and δ can be computed as in the conical case. Upon obtaining A ˜ C, the condition a > b > c should be verified and, if necessary, the eigenvalues c˜i may need to be reordered accordingly. If all of the c˜i are equal, then (7.3c) is trivially satisfied. Equation (7.5b) reduces to B = c˜1 µλt which can be solved for δ using the identity (7.4). Finally, (7.5a) simplifies ˜ t . This eigenproblem can be solved to obtain λ. to A − c˜1 µµt = λAλ 8. Main Algorithm The above-presented considerations lead to a systematic and computationally efficient method of determining separable coordinates for the natural Hamiltonian (4.1). We emphasize that our algorithm is purely algebraic, and hence is well suited for implementation in a symbolic computer algebra system. Indeed, as we mentioned in Sect. 1, the algorithm has been fully implemented into Maple through the KillingTensor package. We now summarize the three main steps of the algorithm. (1) Impose the compatibility condition. Using the given potential V in terms of Cartesian coordinates x i and a generic Killing tensor K of the form (3.7), impose the compatibility condition (4.3) to obtain the equivalent conditions on the Killing tensor parameters (3.9). Computationally, this step amounts to solving a system of linear equations in the parameters (3.9). (2) Extract the orthogonal coordinate webs. Decompose the general solution obtained in step (1) into the form K = 0 g + 1 K 1 + · · · + n K n ,

(8.1)

where i , i = 1, . . . , n are arbitrary constants, g is the metric tensor and {K 1 , . . . , K n } is a linearly independent set of Killing tensors, noting that n 19 since dim K2 (E3 ) = 20. By Theorem 4.1, each K i must necessarily have normal eigenvectors and distinct eigenvalues if it is to characterize separation in one of the eleven separable coordinate systems. The former can be verified using the TSN conditions (4.4) while the latter can be verified efficiently by computing the discriminant of the characteristic polynomial of K i and checking that it does not vanish identically. Finally, relabel the K i so that K 1 , . . . , K m , m n, are CKTs. (3) Classify each Killing tensor and transform to canonical form. For each K i , i = 1, . . . , m, in step (2), classify K i using the scheme in Sect. 6. Finally, using the techniques described in Section 7, determine the transformation (7.1) which brings K i to its appropriate canonical form. The transformation to separable coordinates can be carried out using Eq. (7.2). Remark. Because of the non-linearity of the integrable eigenvector and distinct eigenvalue conditions as well as the non-linearity of the fundamental invariants, certain linear combinations of the K i , i = 1, . . . , n in step (2) may produce Killing tensors which characterize separability in coordinate systems not characterized by the K i , i = 1, . . . , m. In 9 The case of only two equal c˜ is impossible, for such a Killing tensor would characterize either an i elliptic-hyperbolic, prolate spheroidal or oblate spheroidal web.

706

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

fact, it may be possible to construct such a linear combination where the individual Killing tensors of the combination fail to have normal eigenvectors or distinct eigenvalues. This will be illustrated by the example in the next section. 9. Application: The Calogero-Moser System We apply the algorithm of Sect. 8 to the three-body inverse square Calogero-Moser system with equal masses. It is defined by the natural Hamiltonian (4.1) with potential V =

1 1 1 + + . (x − y)2 (y − z)2 (z − x)2

(9.1)

Solving the compatibility condition (4.3) with the potential (9.1) yields K = a1 g + α1 K 1 + b32 K 2 + c3 K 3 + γ3 K 4 , where

(9.2)



   011 −2yz (x + y − z)z (z + x − y)y ij ij −2zx (z + y − x)x  , K1 = 1 0 1 , K4 =  (x + y − z)z 110 (z + x − y)y (z + y − x)x −2xy     2 2 −zx y + z −xy 2y + 2z −x − y −z − x ij ij K2 =  −x − y 2z + 2x −y − z  , K3 =  −xy z2 + x 2 −yz  .(9.3) −z − x −y − z 2x + 2y −zx −yz x 2 + y 2

Using (4.4), we find that K has normal eigenvectors for all a1 , α3 , b32 , c3 and γ3 . However, it follows that only K 2 and K 4 have distinct eigenvalues. It is also useful to note that K in (9.2) admits a Killing vector V = (y − z)X 1 + (z − x)X 2 + (x − y)X 3 .

(9.4)

Using the Killing vector classification scheme in Sect. (6.1), it follows from Table (6.1) that V is rotational and the transformation √   2 √0 √2 1 x i = λj i x˜ j + δ i , λj i = √ −1 √3 √2 , δ i = 0, (9.5) 6 −1 − 3 2 ij brings (9.4) to the canonical form (6.3). Let us now apply step (3) of the algorithm in Sect. 8 to each of the CKTs in (9.3). ij Applying the transformation (9.5) to K2 yields   2˜z 0 −x˜ √ ij K˜ 2 = 3  0 2˜z −y˜  . −x˜ −y˜ 0 ˜ 2 ∈ K2 (E3 ) with respect to the transformed Cartesian coordinates It follows that K R i x˜ , and thus, by Table 6.3, we see that K 2 characterizes a parabolic web. Finally, we ij observe from (4.25) that K˜ 2 is already in canonical form. Therefore, it follows from

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

707

(9.5) together with (4.14) that the transformation to separable parabolic coordinates (µ, ν, ψ) is given by x= y= z=

√ 1 √2 µν cos ψ + √ (µ2 − ν 2 ), 3 2 3 − √1 µν cos ψ + √1 µν sin ψ + 6 2 − √1 µν cos ψ − √1 µν sin ψ + 6 2

1 √ (µ2 2 3 1 √ (µ2 2 3

− ν 2 ), − ν 2 ).

Proceeding similarly for K 4 , it follows that it enjoys the form   2 z˜ x˜ 2y˜ − z˜ 2 −2x˜ y˜ ij , K˜ 4 =  −2x˜ y˜ 2x˜ 2 − z˜ 2 y˜ z˜ 2 2 z˜ x˜ y˜ z˜ −x˜ − y˜ ij under the transformation (9.5). We conclude from Table 6.3 that K˜ 4 characterizes a spherical web and is in canonical form upon comparison with (4.22). Thus, the transformation to separable spherical coordinates is given by (9.5) together with (4.11). As mentioned in the remark at the end of Sect. 8, we can take various linear combinations of K 1 , . . . , K 4 in an attempt to find additional separable coordinate systems. For example, let K 5,6 = K 3 ± K 1 . It follows that K 5,6 both have distinct eigenvalues (even though K 1 does not). Applying the transformation (9.5) to K 5,6 yields   −x˜ y˜ −˜zx˜ ∓1 + y˜ 2 + z˜ 2 ij . K˜ 5,6 =  −x˜ y˜ ∓1 + z˜ 2 + x˜ 2 −y˜ z˜ 2 2 −˜zx˜ −x˜ y˜ ±2 + x˜ + y˜

From Table 6.3 we conclude that K 5 and K 6 characterize the prolate spheroidal and ij oblate spheroidal webs, respectively. Moreover, K˜ 5,6 is in canonical form (compare with (4.24) and (4.25)) and the essential parameter appearing in the √ transformation from Cartesian to prolate (or oblate) spheroidal coordinates is a = 3 for both cases (see Sect. 7.2). Finally, it follows that K 7 = K 1 + K 3 + K 4 has distinct eigenvalues and admits a translational Killing vector V = X1 + X2 + X3 .

(9.6)

It follows that the transformation (9.5) brings (9.6) to the canonical form (6.2). Applyij ing this transformation to K7 yields   −1 + 3y˜ 2 −3x˜ y˜ 0 ij K˜ 7 =  −3x˜ y˜ −1 + 3x˜ 2 0 , 0 0 2 which characterizes the circular cylindrical web. To summarize, we have shown using the algorithm of Sect. 8 that the CalogeroMoser system with potential (9.1) separates in five orthogonally separable coordinate systems, namely, circular cylindrical, spherical, prolate spheroidal, oblate spheroidal and parabolic. This result is consistent with that found in [3] and [9]. We conclude by making two remarks. Firstly, our analysis is exhaustive in the sense that we have found all possible orthogonally separable coordinate systems for which the

708

J.T. Horwood, R.G. McLenaghan, R.G. Smirnov

Calogero-Moser system separates, since it follows that the general Killing tensor (9.2) admits the rotational Killing vector (9.4) and hence can only characterize a rotational web. Secondly, the same conclusions from this section hold for a weighted CalogeroMoser system with unequal masses. More precisely, the Hamiltonian system (4.1) with potential V =

g2 g3 g1 + + 2 2 (m1 x − m2 y) (m2 y − m3 z) (m3 z − m1 x)2

separates in the five aforementioned coordinate systems, where gi and mi are constants and mi > 0. Moreover, in all five cases, the transformation to separable coordinates is given by   0 m 2 m3 M m1 MN −1 x i = λj i T j (uk ) + δ i , λj i = −m2 m3 2 MN m2 N m3 m1 M  , δ i = 0, −m3 m2 2 MN −m3 N m1 m2 M ij where M = (m1 2 m2 2 + m2 2 m3 2 + m3 2 m1 2 )−1/2 ,

N = (m2 2 + m3 2 )−1/2 .

The authors believe that this result is new. 10. Conclusions In this paper we solve a non-trivial problem of the geometry of orthogonal coordinate webs, namely the classification of the eleven orthogonal coordinate webs in E3 in terms of the invariants of the corresponding vector spaces of Killing two-tensors and vectors. Notably, the original solution presented here fits well the approach to geometry of Felix Klein presented in his Erlangen Program. Moreover, the results are successfully applied to the integrability problem of the Hamiltonian systems defined in E3 . From this viewpoint, the well-known Calogero-Moser super-separable Hamiltonian system has been integrated within the framework of the (orthogonal) Hamilton-Jacobi theory of separation of variables. The other three-dimensional pseudo-Riemannian flat space that is amendable to the methods developed in this work is Minkowski space E2,1 . The work in this direction is underway. Acknowledgement. The authors wish to thank the Department of Mathematics, University of Turin for hospitality during which part of this paper was written. They wish to express their appreciation for helpful discussions with Sergio Benenti, Claudia Chanu, Robin Deeley, Lorenzo Fatibene, Giovanni Rastelli and Dennis The. The research was supported in part by Natural Sciences and Engineering Research Council of Canada (NSERC) Discovery Grants (RGM, RGS), an NSERC Postgraduate Scholarship (JTH) and by a Senior Visiting Professorship of the Gruppo Nazionale di Fisica Matematica dell’Italia (RGM).

References 1. Benenti, S.: Hamiltonian Optics and Generating Families. Napoli Series on Physics and Astrophysics. Naples: Bibliopolis, 2003 2. Benenti, S.: Separability in Riemannian manifolds. Philos. Trans. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. (2004) (in press) 3. Benenti, S., Chanu, C., Rastelli, G.: The super-separability of the three-body inverse-square Calogero system. J. Math. Phys. 41, 4654–4678 (2000)

Invariant Classification of Orthogonally Separable Hamiltonian Systems in Euclidean Space

709

¨ 4. Bˆocher, M.: Uber die Reihenentwickelungen der Potentialtheorie (mit einem Vorwort von Felix Klein). Leipzig, 1894 5. Darboux, G.: Le¸cons sur les Systems Orthogonaux et les Coordin´ees Curvilignes. Paris: GauthierVillars, 1910 6. Deeley, R.J., Horwood, J.T., McLenaghan, R.G., Smirnov, R.G.: Theory of algebraic invariants of vector spaces of Killing tensors: methods for computing the fundamental invariants. In: Nikitin, A.G., Boyko, V.M., Popovych, R.O., Yehorchenko, I.A. (eds.) Proceedings of Institute of Mathematics of NAS of Ukraine, Vol 50, Proceedings of Fifth International Conference “Symmetry in Nonlinear Mathematical Physics”, Kiev 2003, Kiev: Institute of Mathematics, 2004, pp. 1079–1086 7. Delong, R.P., Jr.: Killing tensors and the Hamilton-Jacobi equation. University of Minnesota, 1982 (PhD thesis) 8. Eisenhart, L.P.: Separable systems of St¨ackel. Ann. Math. 35, 284–305 (1934) 9. Chanu, C.: Separation of variables and Killing tensors in the Euclidean three-space. University of Turin, 2001 (PhD thesis) 10. Kalnins, E.G.: Separation of Variables for Riemannian Spaces of Constant Curvature. New York: Longman Scientific & Technical, 1986 11. Klein, F.: Vergleichende Betrachtungen u¨ ber neuere gemetrische Forschungen. Erlangen: A. Duchert, 1872 ¨ 12. Klein, F.: Uber die sogenannte Nicht-Euklidische Geometrie (Zweiter Aufsatz). Math. Ann. 6, 112– 145 (1873) 13. Klein, F.: Vergleichende Betrachtungen u¨ ber neuere gemetrische Forschungen. Math. Ann. 43, 63– 100 (1893) (revised version of [11]) 14. McLenaghan, R.G., Smirnov, R.G., The, D.: The 1881 problem of Morera revisited. In: Kowalski, O., Krupka, D., Slov´ak, J. (eds.) Proceedings, The 8th Conference on Differential Geometry and its Applications, Opava 2001. Math. Publ. 3, 333–341 (2001) 15. McLenaghan, R.G., Smirnov, R.G., The, D.: Group invariant classification of separable Hamiltonian systems in the Euclidean plane and the O(4)-symmetric Yang-Mills theories of Yatsun. J. Math. Phys. 43, 1422–1440 (2002) 16. McLenaghan, R.G., Smirnov, R.G., The, D.: Group invariants of Killing tensors in the Minkowski plane. In: Abenda, S., Gaeta, G., Walcher, S. (eds.) Proceedings, Symmetry and Perturbation Theory, Cala Gonone 2002. Singapore: World Scientific, 2003, pp. 153–162 17. McLenaghan, R.G., Smirnov, R.G., The, D.: An invariant classification of orthogonal coordinate webs. In: Proceedings, Recent Advances in Lorentzian and Riemannian Geometries, Baltimore 2003. Contemp. Math. 337, 109–120 (2003) 18. McLenaghan, R.G., Smirnov, R.G., The, D.: An extension of the classical theory of invariants to pseudo-Riemannian geometry and Hamiltonian mechanics. J. Math. Phys. 45, 1079–1120 (2004) 19. McLenaghan, R.G., Milson, R., Smirnov, R.G.: Killing tensors as irreducible representations of the general linear group. C. R. Acad. Sc. Paris 339, 621–624 (2004) 20. Nijenhuis, A.: Xn−1 -forming sets of eigenvectors. Neder. Akad. Wetensch. Proc. 51A, 200-212 (1951) 21. Olevsky, M.N.: Separation of variables for the equation u + λu = 0 in spaces of constant curvature in two- and three-dimensions. Math. Sbornik 27, 379–427 (1950) 22. Olver, P.J.: Classical Theory of Invariants (Student Texts 44). Cambridge: Cambridge University Press, 1999 23. Onishchik, A.L., Vinberg, E.B.: Lie Groups and Lie Algebras. New York: Springer-Verlag, 1990 24. Schouten, J.A.: Ricci-Calculus. Second edition. Berlin: Springer-Verlag, 1954 ¨ 25. Schouten, J.A.: Uber Differentalkomitanten zweier kontravarianter Gr¨ossen. Proc. Kon. Ned. Akad. Amsterdam 43, 449–452 (1940) 26. Smirnov, R.G., Yue, J.: Covariants, joint invariants and the problem of equivalence in the invariant theory of Killing tensors defined in pseudo-Riemannian spaces of constant curvature. J. Math. Phys. 45, 4141–4163 (2004) ¨ 27. St¨ackel, P.: Uber die Integration der Hamilton-Jacobi’schen Differentialgleichung mittels Separation der Variablen. Habilitationsschrift: Halle a.S. 26 S. 8o , 1891 28. Takeuchi, M.: Killing tensor fields on spaces of constant curvature. Tsukuba J. Math. 7, 233–255 (1983) 29. Thompson, G.: Killing tensors in spaces of constant curvature. J. Math. Phys. 27, 2693–2699 (1986) 30. Tonolo, A.: Sulle variet`a Riemanniane normali a tre dimensioni. Pont. Acad. Sci. Acta 13, 29–53 (1949) Communicated by G.W. Gibbons

Commun. Math. Phys. 259, 711–728 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1381-y

Communications in

Mathematical Physics

Several Complex Variables and the Distribution of Resonances in Potential Scattering T. Christiansen Department of Mathematics, University of Missouri, Columbia, MO 65211, USA. E-mail: [email protected] Received: 22 November 2004 / Accepted: 27 January 2005 Published online: 28 June 2005 – © Springer-Verlag 2005

Abstract: We study resonances associated to Schr¨odinger operators with compactly supported potentials on Rd , d ≥ 3, odd. We consider potentials depending holomorphically on a parameter z ∈ Cm . For certain such families, for all z except those in a pluripolar set, the associated resonance–counting function has order of growth d. 1. Introduction In this paper we study the growth of the resonance–counting function for potential d scattering in odd dimension d ≥ 3. Let V ∈ L∞ comp (R ; C) and let NV (r) be the resonance–counting function for the Schr¨odinger operator + V . The purpose of this paper is to show that log NV (r) =d log r r→∞

lim sup

(1)

for many potentials V . By [22], this is the maximum value this limit can obtain. Previously, the only potentials known to satisfy (1) in dimension at least three were a class of radial potentials [21]. For a certain class of compactly supported potentials W (z) depending holomorphically on a parameter z, we show that (1) holds for V = W (z), for all z except those in a pluripolar set. In a probabilistic sense this greatly expands the number of potentials which are known to have resonance–counting function with maximal order of growth. We use this to show that potentials with this property are dense in the d ∞ d L∞ norm in L∞ comp (R ). We remark that there are complex-valued V ∈ Lcomp (R ; C) such that the limit in (1) is 0 [1]. d 2 −1 is defined for all but For odd d and V ∈ L∞ comp (R ; C), RV (λ) = ( + V − λ ) ∞ d finitely many λ with Im λ > 0. If χ ∈ Cc (R ), χ ≡ 1 on the support of V , then χ RV χ has a meromorphic continuation to C. The poles of this continuation are resonances, or

Partially supported by NSF grant DMS 0088922.

712

T. Christiansen

scattering poles. They do not depend on the choice of χ if χ ≡ 1 on the support of V . They are, in many ways, analogous to eigenvalues and correspond to decaying states. For an introduction to resonances and for a survey of some results on their distribution, see [18, 23, 25, 26]. Let RV be the set of poles of RV (λ), repeated with multiplicity. Let NV (r) = #{zj ∈ RV : |zj | < r}. Then, if d = 1, NV (r) 2 = diam(supp(V )) r→∞ r π lim

[2, 16, 20]. This is true for complex-valued potentials as well as for real-valued ones. Much less is known about the higher-dimensional case, and there is evidence that the question of distribution of resonances is more subtle. Zworski [22] showed that for d ≥ 3, odd, NV (r) ≤ C(r d + 1) and this order of growth is achieved by a class of radial potentials [21]. On the other hand, the best known lower bound to hold for a general class of potentials is, for non-trivial V ∈ Cc∞ (Rd ; R), lim sup r→∞

NV (r) >0 r

[15]. It is important that these are real-valued potentials, as this does not hold for all smooth complex-valued potentials. In [1], there is an example of a family of complexvalued potentials for which NV (r) ≡ 0 for all r. In fact, the example works in even dimensions as well (with some caveats for d = 2). The potentials can be chosen to be smooth. In this paper we show that there are many potentials with resonance–counting function with the maximum order of growth. This theorem can be viewed as providing a kind of probabilistic lower bound on the resonance counting function, as it gives no information for any given potential but says that “most” potentials in certain families have resonance counting function with maximal growth rate. The proof uses some results from several complex variables. Theorem 1.1. Let d ≥ 3 be odd and let ⊂ Cm be an open, connected set. Let j d V (z, x) = j0=1 fj (z)Vj (x) with fj holomorphic on and Vj ∈ L∞ comp (R ; C). Suppose lim sup r→∞

log NV (z0 ) (r) =d log r

for some z0 ∈ . Then log NV (z) (r) =d log r r→∞

lim sup

for z ∈ \ E, where E is a pluripolar set.

Resonances in Potential Scattering

713

We recall the definition of a pluripolar set in Sect. 2 and refer the reader to [6, 7, 9] for further details. We remark that pluripolar sets are quite small– in particular, they have Lebesgue measure zero, and there are further restrictions on them. By [21, Theorem 2] the condition on V (z0 ) is satisfied if V (z0 ) is the radial potential W (|x|), with W ∈ C 2 ([0, a]), W (a) = 0. One may also use Theorem 1.2 to generate such potentials. As an application of Theorem 1.1 and some further study of holomorphic functions whose zeros correspond to resonances, we obtain the following theorem. d Theorem 1.2. Suppose d is odd and V ∈ L∞ comp (R ; R) is bounded below by the characteristic function of a ball. Then

log NzV (r) =d log r r→∞

lim sup

for z ∈ C \ E, where E ⊂ C is a pluripolar set. We remark that if E ⊂ C is a pluripolar set, then E R has Lebesgue measure 0 in R [13, Sect. 3.2]. Earlier results for potentials of fixed sign are found in [8] and [17]. These papers studied the purely imaginary scattering poles associated to potentials d V ∈ L∞ comp (R ; R), where V or −V is bounded below by a positive multiple of the characteristic function of a ball. They showed that for such potentials, #{λj ∈ RV : λj ∈ iR, |λj | ≤ r} ≥ cV r d−1 for some constant cV > 0. A corollary of Theorem 1.1, Theorem 1.2, and the properties of pluripolar sets is Corollary 1.3. For d ≥ 3, odd, the set log NV (r) d = d (R ; R) : lim sup V ∈ L∞ comp log r r→∞ d ∞ is dense in L∞ comp (R ; R) under the L norm. The set log NV (r) d V ∈ L∞ (R ; C) : lim sup = d comp log r r→∞ d ∞ is dense in the set L∞ comp (R ; C) under the L norm. Moreover, the same results are ∞ ∞ ∞ true if we replace Lcomp by Cc and the L norm by the C ∞ topology. ∞ We remark that to prove the results for L∞ comp potentials in the L topology, one could use [21, Theorem 2] instead of Theorem 1.2. In the next section of this paper, we recall some definitions and facts from one and several complex variables. In addition, we prove an extension of [9, Corollary 1.42], a result about order of growth for functions of several complex variables. This result, combined with some facts about the determinant of the scattering matrix which are established in Sect. 3, enables us to prove our main theorem, Theorem 4.3, in Sect. 4. This result is somewhat stronger than Theorem 1.1. Section 5 is devoted to the proofs of Theorem 1.2 and Corollary 1.3. Throughout this paper, C, CV , C , and Cχ denote constants whose value may change from line to line. The dimension d is odd throughout.

714

T. Christiansen

2. Some Complex Analysis In this section we recall some definitions and results from complex analysis, and prove an extension of a result in several complex variables that we shall need. Let a1 , a2 , a3 .... be a sequence of non-zero complex numbers with |am | → ∞. The convergence exponent of this sequence is the greatest lower bound of the set ∞ 1 λ: converges . |am |λ m=1

If n(r) = #{aj : |aj | < r}, then lim sup r→∞

log n(r) , log r

which may be called the order of n(r), is the same as the convergence exponent for the sequence {aj }∞ j =1 . We shall abuse notation slightly and call this the convergence exponent for the set {al }, where we order {al } so that |a1 | ≤ |a2 | ≤ |a3 | ≤ ... to form the sequence. We now recall the definition and some facts about plurisubharmonic functions. For further details, see, for example, [6, 9]. Let ⊂ Cm be a domain; that is, an open, connected set. A function ϕ(z) which takes its values in [−∞, ∞) is plurisubharmonic in if • ϕ(z) is upper semi-continuous and ϕ ≡ −∞. • For every z ∈ and every r such that {z + uw : |u| ≤ r, u ∈ C} ⊂ , 2π −1 ϕ(z) ≤ (2π) ϕ(z + reiθ w)dθ. 0

We shall write ϕ ∈ PSH() if ϕ is plurisubharmonic on . Being plurisubharmonic is a local property. Let ⊂ Cm be a domain. If ϕ is upper semi-continuous on , ϕ ≡ −∞, and for every z ∈ there is a ρ(z) such that 2π −1 ϕ(z) ≤ (2π) ϕ(z + weiθ )dθ 0

for all w ∈ Cm , |w| < ρ(z), then we say that ϕ is locally plurisubharmonic on . But if ϕ is locally plurisubharmonic on , it is plurisubharmonic on (e.g. [9, Prop. I.19]). A set E ⊂ Cm is pluripolar if for each a ∈ E there is a neighborhood V of a and ϕ ∈ PSH(V ) such that E ∩ V ⊂ {z ∈ V : ϕ(z) = −∞}. This is equivalent to the definition given in [9] via the Josefson Theorem [6, Theorem 4.7.4]. For a function ϕ which is plurisubharmonic in θ1 < arg u < θ2 , we define the order ρ of ϕ in θ1 < arg u < θ2 as ρ = lim sup

r→∞

log supθ1 <arg u<θ2 ,|u|=r |ϕ(u)| log r

.

An important example of a plurisubharmonic function is log |f |, where f is holomorphic. Thus we shall make the following (standard) definition of order for a holomorphic function. For f holomorphic for θ1 < arg u < θ2 , the order ρ of f in θ1 < arg u < θ2 is ρ = lim sup

r→∞

log supθ1 <arg u<θ2 ,|u|=r log |f (u)| log r

.

Resonances in Potential Scattering

715

Since the two notions of order are so closely related, we use the same name and notation for each. We shall be concerned with functions that satisfy the following set of assumptions. Assumption (A0). For some open ⊂ Cm and some > 0, f (z, λ) is holomorphic on × {λ ∈ C : Im λ > −}. Moreover, there are constants Cf and α such that log |f (z, λ)| ≤ Cf (1 + |λ|α ) for λ ∈ R.

(2)

With the next two lemmas, we construct a plurisubharmonic function on × C whose order is related to the order of the function f in a half-plane. Some related results and techniques appear in Theorem I.28 and its proof in [9]. Lemma 2.1. Assume f satisfies assumption (A0). For some β > α, β ≥ 1, let M(z, r) = max( max log |f (z, λ)|, r β ). |λ|≤r Im λ≥0

Then there is an r0 ∈ R such that M(z, r) is a plurisubharmonic function of (z, u) ∈ × {u ∈ C : |u| > r0 }, where |u| = r. Proof. Note that since f is holomorphic, we actually have that M is continuous. Clearly, M ≡ −∞. Now we shall show that M is locally plurisubharmonic when r is sufficiently large. Since f is holomorphic, the maximum value of |f | on a compact set K is obtained on the boundary of K. The key idea here is that we require r to be so large that Cf (1 + r α ) < r β , where Cf , α are as in (2). For such values of r, M(z, r) is not the value of log |f (z, ±r)|. Choose r sufficiently large as above. Suppose M(z, r) = log |f (z, λ0 )|, with |λ0 | = r and Im λ0 > 0. Then to see that M is locally plurisubharmonic at (z, u) with |u| = r, consider w ∈ Cm+1 with w ≤ Im λ0 and write w = (w , wm+1 ). Note that (w , λ0 wm+1 /u) has the same norm as w and |λ0 + eiθ λ0 wm+1 /u| = |u + eiθ wm+1 |. Then M(z, |u|) = M(z, |λ0 |) 2π λ0 −1 log |f (z + eiθ w , λ0 + eiθ wm+1 )|dθ ≤ (2π) u 0 2π λ0 M(z + eiθ w , |λ0 + eiθ wm+1 |)dθ ≤ (2π)−1 u 0 2π M(z + eiθ w , |u + eiθ wm+1 |)dθ. = (2π)−1 0

Suppose, on the other hand, that M(z, r) = r β . Let w = (w , wm+1 ) ∈ Cm+1 . Then, if |u| = r, M(z, |u|) = M(z, r) = r β 2π ≤ (2π)−1 |u + wm+1 eiθ |β dθ ≤ (2π)−1

0 2π 0

M(z + w eiθ , |u + wm+1 eiθ |)dθ.

716

T. Christiansen

Thus M is locally plurisubharmonic, and thus subharmonic, in × {u ∈ C : |u| > r0 }, for some r0 .

Next, we modify M somewhat to obtain a function plurisubharmonic on × C. Lemma 2.2. Let , f , M, and r0 be as in Lemma 2.1. For (z, u) ∈ × C, set M(z, r0 + 1) if |u| < r0 + 1 M1 (z, u) = M(z, |u|) if |u| ≥ r0 + 1. Then M1 ∈ PSH( × C). Proof. We again use the fact that being plurisubharmonic is a local property. Clearly, if z0 ∈ , |u0 | = r0 + 1, then M1 is plurisubharmonic in a neighborhood of (z0 , u0 ). If |u0 | = r0 + 1, then, since M(z0 , •) is increasing and plurisubharmonic, for z0 ∈ , 2π M1 (z0 + w eiθ , u0 + wm+1 eiθ )dθ M1 (z0 , u0 ) = M(z0 , |u0 |) ≤ (2π)−1 0

for all w = (w , wm+1 ) ∈ Cm+1 , w sufficiently small.

With this preparation, we may now prove the following extension of [9, Cor. 1.42], which we shall apply in Sect. 4 to prove our main theorem. Proposition 2.3. Let ⊂ Cm be an open, connected set and let f satisfy assumptions (A0). Let ρ(z) be the order of gz (λ) = f (z, λ) in 0 < arg λ < π . If ρ(z) ≤ ρ0 for all z ∈ , ρ(z0 ) = ρ0 for some z0 ∈ , and ρ0 > max(α, 1), then ρ(z) = ρ0 for all z ∈ \ E, where E ⊂ is a pluripolar set. Proof. Choose a β ∈ R such that max(α, 1) < β < ρ0 , and let M1 (z, u) be as defined in Lemma 2.2. Note that the order of u → M1 (z, u) is max(ρ(z), β). Let be open, connected, and bounded, with ⊂ . Then by [9, Prop. 1.40], there is a sequence { q } of negative plurisubharmonic functions on such that −(ρ(z))−1 = lim sup q (z). q→∞

In addition, lim sup q→∞

and

1

q (z) + ρ0

lim sup q→∞

q (z0 ) +

1 ρ0

≤ 0,

= 0.

Thus, by [9, Prop. 1.39], ρ(z) = ρ0 for z ∈ \ E , for some pluripolar set E ⊂ . We can cover with having the properties as above. The set E is the union of the corresponding sets E , and is thus pluripolar.

We note that this proposition could also be proved by adapting arguments of [5].

Resonances in Potential Scattering

717

3. The Scattering Matrix and its Determinant In this section, we collect some facts about the scattering matrix and its determinant. d For V ∈ L∞ comp (R ; C), let SV (λ) be the associated scattering matrix and let sV (λ) = det SV (λ). It is meromorphic in the upper half-plane, with at most a finite number of poles there. This is a useful function in the study of resonances because for odd d its zeros in the upper half-plane coincide, with at most a finite number of exceptions, with poles of the resolvent in the lower half-plane, and the multiplicities agree (see [24, (3.7)] or [4]). That is, for all but finitely many λ0 , if Im λ0 > 0 is a zero of order m0 of sV (λ), then −λ0 is a pole of order m0 of χ RV (λ)χ . d Recall that for V ∈ L∞ comp (R ; C) the scattering matrix associated to +V is given by t SV (λ) = I + cd λd−2 πλ (V − V RV (λ)V )π−λ

(3)

where πλ is given by (πλ f )(ω) =

e−iλx·ω f (x)dx.

Here cd is independent of λ. d Lemma 3.1. Let V ∈ L∞ comp (R ; C). For λ ∈ R, there is a CV so that

d

≤ CV |λ|d−2

log s (λ) V

dλ

whenever |λ| is sufficiently large. Proof. We use d d log sV (λ) = tr (SV (λ))−1 SV (λ) . dλ dλ If V is real-valued, then for λ ∈ R, SV (λ) is unitary. Otherwise, we will use (3) to bound

(SV (λ))−1 = SV (−λ)

when λ ∈ R. By (3), t

SV (λ) − I = cd |λ|d−2 πλ χ (V − V RV (λ)V )χ π−λ

,

where χ ∈ Cc∞ (Rd ) is one on the support of V . Using [19, Cor. 3.7], t

L2 (Sd−1 )→L2 (Rd ) ≤ Cχ |λ|−(d−1)/2 .

πλ χ L2 (Rd )→L2 (Sd−1 ) ≤ Cχ |λ|−(d−1)/2 , χ π−λ

For λ ∈ R and |λ| is sufficiently large (depending on V ∞ and supp V ) V − V RV (λ)V ≤ CV . Thus we have, for such λ,

SV (λ) − I ≤ CV |λ|−1 .

718

T. Christiansen

Next, we bound

d SV (λ) = d I + cd λd−2 πλ χ (V − V RV (λ)V )χ )π t . −λ dλ dλ 1 1 We bound this just as in [3, Lemma 3.3], using the fact that

AB 1 ≤ A 2 B 2 . When |λ| is sufficiently large,

V − V RV (λ)V ≤ CV ,

d V RV (λ)V ≤ CV . dλ

t can be estimated using their explicit Moreover, the Hilbert-Schmidt norms of πλ χ , χ π−λ Schwartz kernels to see that t

πλ χ 2 ≤ Cχ , χ π−λ

2 ≤ Cχ .

The Hilbert-Schmidt norms of the derivatives of these operators are also bounded above by constants, so that d SV (λ) ≤ CV (1 + |λ|d−2 ). (4) dλ 1 This finishes the proof.

We shall consider holomorphic families of potentials that satisfy the following conditions. These potentials form a somewhat more general class than those of Theorem 1.1. Assumption (A1). Let ⊂ Cm be an open set, and let d V = V (z, x) ∈ H(z ; L∞ comp (Rx ; C)).

That is, V is holomorphic in the z variables and takes its values in compactly supported potentials. Moreover, we require that there be a fixed compact set K ⊂ Rd such that supp(V (z, •)) ⊂ K for all z ∈ . Proposition 3.2. Let ⊂ Cm be open and suppose that V (z, x) satisfies assumptions (A1). Let K ⊂ be a compact set. Then sV (z) (λ) has the following properties: a: There is a constant CK,0 ≥ 0 such that sV (z) (λ) is holomorphic on ×{λ : Im λ > CK,0 } for any open ⊂ K. b: The constant CK,0 can be chosen so that for z0 ∈ K, if λ0 is a zero of sV (z0 ) (λ) with Im λ0 > CK,0 , then −λ0 is a pole of RV (z0 ) (λ), and the multiplicities coincide. c: For Im λ > CK,0 , z ∈ K, there is a constant C (depending on V and K) so that d

|sV (z) (λ)| ≤ CeC|λ| . d: If z ∈ K, Im λ = C1 > CK,0 , then there is a constant C ( depending on V , K, and C1 ) so that d−2

|sV (z) (λ)| ≤ CeC|λ|

.

Resonances in Potential Scattering

719

Proof. When z ∈ K, V (z, •) L∞ is bounded, so that there is a CK,0 such that I + V (z)R0 (λ) is invertible when Im λ > CK,0 , z ∈ K. Thus RV (z) (λ) = R0 (λ)(I + V R0 (λ))−1 is holomorphic in × {λ : Im λ > CK,0 }, and, using the explicit expression for SV , so are SV (z) (λ) and sV (z) (λ). Using the relation (SV (λ))−1 = SV (−λ), we see that zeros of SV (λ) with Im λ > CK,0 correspond to poles of SV (λ) with Im λ < −CK,0 . If SV (z) (λ) is holomorphic in × {λ : Im λ > CK,0 }, then for z0 ∈ , the zeros of sV (z0 ) (λ) with Im λ > CK,0 correspond, with multiplicity, to the poles of RV (z0 ) (−λ) (e.g. [4, 24]). Property (c) follows as in [24] or [3], using the fact that supp V (z, •) and V (z, •) ∞ are bounded when z ∈ K. To prove the final property, we use the fact that | det(I + A)| ≤ e A 1 . Again as in [3] and Lemma 3.1, we have that t

SV (λ) − I 1 ≤ C|λ|d−2 πλ χ 2 V − V RV (λ)V

χ π−λ

2 ,

where χ ∈ Cc∞ (Rn ) is one on the support of V . Using the explicit kernels of πλ we obtain

πλ χ 2 ≤ eC(| Im λ|+1) . t . We obtain a similar estimate for χ π−λ 2

4. Proof of Theorem 1.1 For integers p ≥ 1, let G(u; p) be the canonical factor 2 /2+...+up /p

G(u; p) = (1 − u)eu+u d

.

∈ Z, d ≥ 2. Suppose the convergence exponent of than d and that λj ∈ R for all j . Then there is an

the sequence Lemma 4.1. Let {λj } is strictly less > 0 and a constant C such that for λ ∈ R,

 

λ

∞ ∞

d 

≤ C (1 + |λ|d − ).  log G(−t/λ ; d − 1)−log G(t/λ ; d − 1) dt j j

dt

0

j =1

j =1

Proof. Let n(r) = #{λj : |λj | < r}. Since the convergence exponent for the sequence is less than d , there is an > 0 and a constant C so that n(r) ≤ C (1 + r d − ). For the real part of the integral, we use   λ ∞ ∞ d  log Re G(−t/λj ; d − 1) − log G(t/λj ; d − 1) dt dt 0 j =1 j =1

∞

∞

= log

G(−λ/λj ; d − 1) − log

G(λ/λj ; d − 1)

. (5)

j =1

j =1

720

T. Christiansen

Then applying standard estimates for canonical products (e.g. [10, Theorem I.6]), we have

∞ ∞

log

≤ C (1 + |λ|d − ) |G(−λ/λ ; d − 1)| − log |G(λ/λ ; d − 1)| j j

j =1

j =1 for some > 0. For the imaginary part of the integral, we use | arg(1 − λ/λj ) − arg(1)| ≤ π, where the argument is chosen to be a continuous function of λ. We also use

d −1

1 p

≤ C|u|d −1 if |u| ≥ 1

Im u

p=1 p

and

| arg G(u; d − 1) − arg(1)| ≤ C|u|d if |u| ≤ 1. Then, using arguments similar to [10, Lemma 1.3 and Theorem 1.4],

  



λ ∞ ∞

d

Im 

  log G(−t/λ ; d − 1) − log G(t/λ ; d − 1) dt j j

0 dt

j =1 j =1

∞ ∞

=

arg G(−t/λj ; d − 1) − arg G(t/λj ; d − 1)

j =1

j =1 2λ ∞ t 1−d dn(t) + |λ|d t −d dn(t) ≤ Cn(2λ) + |λ|d −1 ≤ C (1 + |λ|

d −

1

2λ

).

d Lemma 4.2. Let d ≥ 3 be odd, and V ∈ L∞ comp (R ; C). Then sV (λ) is of order d in the half-plane {λ ∈ C : Im λ > 2 V ∞ + 1} if and only if RV has convergence exponent d.

Proof. We remark that since NV (r) ≤ C(r d + 1), the convergence exponent of RV is at most d. We shall actually prove the contrapositive of this lemma. Let g(λ) be holomorphic in a neighborhood of the closed upper half-plane, and let ng (r) be the number of zeros of g in the upper half-plane with norm less than r, counted with multiplicity. By using intermediate steps from the proof of [3, Lemma 3.2], r t r π ng (t) 1 1 g (s) dt = dsdt + t −1 log |g(reiθ )|dθ. t 2π g(s) 2π 0 0 −t 0 We apply this to sV (λ) (multiplied by a suitable polynomial if it has poles in the upper half-plane). Using Lemma 3.1 as well, we see that if sV has order strictly less than d in this region, RV has convergence exponent strictly less than d.

Resonances in Potential Scattering

721

Now suppose that RV has convergence exponent ρ strictly less than d. We may write [24] sV (λ) = αeig(λ)

P (−λ) , P (λ)

where α is a constant,

P (λ) =

G(λ/λj ; d − 1)

λj ∈RV ,λj =0

and g(λ) is a polynomial of order at most d. The canonical product P (λ) is of order max(ρ, d − 1). By the minimum modulus theorem, then, P (−λ)/P (λ) is of order max(ρ, d − 1) in the upper half plane in question. From Lemma 4.1, we know that

λ 0

d (log P (−t) − log P (t))dt

≤ C (1 + |λ|d− ) dt

for some > 0. Thus, using Lemma 3.1, we see that g must have order less than d. Therefore, sV (λ) has order strictly less than d in {λ ∈ C : Im λ ≥ 2 V ∞ + 1}.

Our main theorem allows a more general family of potentials than Theorem 1.1. Theorem 4.3. Let d ≥ 3 be odd and let ⊂ Cm be open and connected. Suppose V (z, x) satisfies Assumptions (A1), and for some z0 ∈ , RV (z0 ) has convergence exponent d. Then RV (z) has convergence exponent d for all z ∈ \ E, where E ⊂ is a pluripolar set. Proof. By Lemma 4.2, the order of sV (z0 ) (λ) in the upper half-plane is d. Given an open, connected ⊂ such that ⊂ is bounded, by Proposition 3.2 we may apply Proposition 2.3 to sV (z) (λ + iC + i). From Proposition 2.3 we see that in the upper half-plane sV (z) (λ + iC + i) has order d for z ∈ \ E , for some pluripolar set E . Again by Lemma 4.2, this means that the convergence exponent of RV (z) is d for z ∈ \ E . As in the proof of Proposition 2.3, we can cover by such , and the set E is the union of the corresponding sets E .

5. A Class of Potentials with Fixed Sign d For V ∈ L∞ comp (R ; C) let

BV (λ) =

V R0 (λ)|V |1/2 . |V |1/2

(6)

Then the poles of RV (λ) are the zeros of I + BV (λ). In this section we use this fact and a study of related holomorphic functions to prove Theorem 1.2. Throughout this section we assume that d is odd.

722

T. Christiansen

5.1. Lower bounds on a determinant. In this subsection we obtain lower bounds on det(I + BV2m (λ)) when m > d/4 and V can be bounded below by the characteristic function of the ball. In the next subsection we will use this lower bound and some results of several complex variables to prove Theorem 1.2. d Lemma 5.1. Let V ∈ L∞ comp (R ; R) satisfy V ≥ χB(a,x0 ) , where χB(a,x0 ) is the characteristic function of the ball of radius a > 0 centered at x0 . Let V0 = χB(a,0) . Then, for s ∈ R+ , m > d/4, m ∈ Z,

det(I + (BV (−is))2m ) ≥ det(I + (BV0 (−is))2m ). Before proving the lemma, we remark that the sign in front of (BV (−is))2m may appear puzzling, as the zeros of det(I + (BV (λ))2m ) do not, in general, include the poles of RV , while those of det(I − (BV (λ))2m ) do (compare Lemma 5.4). The sign is positive so that we may work with the determinant of the identity plus a positive operator. In the proof of Proposition 5.6 we introduce a complex parameter, and this allows us to reconcile the apparent differences. Proof. If A is a trace class operator, det(I + A) =

(I + µj (A)),

(7)

j

where µj are the eigenvalues of A repeated according to their multiplicity, and |µ1 (A)| ≥ |µ2 (A)| ≥ .... We note that for s ∈ R+ , BV2 (−is) is a positive, self-adjoint operator, so that all of its eigenvalues are non-negative. Let V1 = χB(a,x0 ) . Then, using the max-min principle, µj (BV2 (−is)) ≥ µj (BV21 (−is)) ≥ 0 and thus µj (BV2m (−is)) ≥ µj (BV2m (−is)) ≥ 0. 1 are the same as those of BV2m , using (7) finishes the proof Since the eigenvalues of BV2m 0 1 of the lemma.

Next we will describe the resolvent R0 (λ) in a way which will be useful for our purposes. Let σ12 < σ22 ≤ ... be the eigenvalues of the Laplacian on the sphere Sd−1 , repeated according to their multiplicity, and let {φj } be a corresponding set of orthonormal eigenfunctions. We use the notation of [11] for the Bessel functions Jν , the (1) modified Bessel functions Iν and Kν , and the Hankel function Hν . Let λ ∈ C and d−1 d be polar coordinates on R . Then (r, y) ∈ R+ × S 2i (R0 (λ)f )(r, y) π ∞ =

r

k=1 σ 2 =k(k+d−2) 0 j

+

∞

k=1 σ 2 =k(k+d−2) r

Hνk (λr)Jνk (λr ) φj (y)φ j (y )f (r , y )(r )d−1 dσy dr (rr )(d−2)/2 (1)

Sd−1 ∞

Hνk (λr )Jνk (λr) φj (y)φ j (y )f (r , y )(r )d−1 dσy dr . (rr )(d−2)/2 Sd−1 (1)

j

(8)

Resonances in Potential Scattering

723

Here νk = k +

d − 1. 2

To obtain a lower bound on the eigenvalues of BV20 (−is) we shall need some lower bounds on Bessel functions. Lemma 5.2. Let s, M ∈ R+ and ν − 1/2 ∈ N. Then there is a constant c > 0 such that ecν |Jν (−iνs)| ≥ c √ , ν cν e |Hν(1) (−iνs)| ≥ c √ ν when 3 < s < M and ν is sufficiently large. Proof. We use, from [12, 9.6.3 and 9.6.30] |Jν (−is)| = |Iν (s)|

(9)

and, from [12, 9.1.39, 9.6.4, and 9.6.31] Hν(1) (−is) =

−2i −3νπi/2 Kν (s) − 2e−νπi/2 Iν (s). e π

(10)

For 3 ≤ z ≤ M < ∞ there are constants c, C > 0 such that eνξ Iν (νz) ≥ c √ , ν |Kν (νz)| ≤ Ce−νξ when ν is sufficiently large and ξ = (1 + z2 )1/2 + ln this and (9), for some c > 0,

z 1+(1+z2 )1/2

[11, 10.7.16]. Applying

ecν |Jν (−iνs)| ≥ c √ ν when 3 ≤ s ≤ M and ν is sufficiently large. Similarly, using (10), the upper bound on Kν (νz) and the lower bound on Iν (νz), we obtain the second part of the lemma.

The following lemma shows that the holomorphic function det(I + (BV0 (−is))2m ) introduced in Lemma 5.1 has order at least d. d Lemma 5.3. For a > 0, let V0 = χB(a,0) ∈ L∞ comp (R ). Then, for s ∈ R+ , m > d/4, m ∈ Z, there is a constant c > 0 such that

det(I + (BV0 (−is))2m ) ≥ cecs when s is sufficiently large.

d

724

T. Christiansen

Proof. We first obtain a lower bound on some of the eigenvalues of BV20 (−is) = 2 1/2 (−is), (V0 R0 (−is)|V0 |1/2 )2 . Since V0 is radial, we can write BV20 (−is) = k Bk,V 0 d−1 where Bk,V0 (−is) acts on the eigenspace of the Laplacian on S with eigenvalue k(k+d−2) and multiplicity m(k) ≥ ck d−2 , for some c > 0. We will bound Bk,V0 (−is)

2 (−is). from below, giving us a lower bound on m(k) of the eigenvalues of Bk,V 0 Using (8),

Bk,V0 (−isr) ≥ χ[0,a/2] r −(n−2)/2 Jνk (−isr) L2 (Rd ) χ[a/2,a] r −(n−2)/2Hν(1) (−isr) L2 (Rd ) , k where χ[α,β] is the characteristic function of the interval [α, β]. By Lemma 5.2, for sa sa M ∈ R+ and 2M < νk < 12 there is some c > 0 such that a/2 ecνk cr dr (11)

χ[0,a/2] r −(n−2)/2 Jνk (−isr) 2L2 (Rd ) ≥ νk a/4 c ≥ ecνk . νk Here and throughout c is a positive constant whose value may change from line to line. Using Lemma 5.2 in a similar way, for M ∈ R+ ,

χ[a/2,a] r −(n−2)/2 Hν(1) (−isr) 2L2 (Rd ) ≥ c k for some c > 0 when

sa M

< νk <

sa 6 .

ecνk νk

(12)

Thus, with α = d/2 − 1,

Bk,V0 (−is) ≥

c eck k+α

sa sa d−2 eigenvalues of B 2 when M < k + α < 12 . Thus m(k) ≥ ck k,V0 are at least as large sa sa −2 ck as c(k + α) e when M < k + α < 12 . Then, taking M and s sufficiently large,

det(I + (BV0 (−is))

2m

)≥

sa sa
≥ exp

eck 1+c (k + α)2m

ck d−2

(ck d−2 − ck d−2 ln(k + α) + ck d−1 )

sa sa
≥ c exp(cs d ).

5.2. Proof of Theorem 1.2 and Corollary 1.3. In this subsection we use the results of Sect. 5.1, Theorem 1.1, and some results from [9] to prove Theorem 1.2. We also prove Corollary 1.3. If m > d/2 is an integer, det(I − (−1)m BVm (λ)) is a holomorphic function of λ. Moreover, its zeros include the poles of RV (λ). In fact, we can say more, as the next lemma shows (compare [22, Prop. 1]). d Lemma 5.4. Let m > d/2 be an integer and let ω = e2πi/m . Let V ∈ L∞ comp (R ; C). m m m Then the zeros of det(I − (−1) BV (λ)) correspond, with multiplicity, to ∪k=1 Rωk V .

Resonances in Potential Scattering

725

Proof. We note that I − (−1)m BVm (λ) =

m

(I + ωk BV (λ)) =

k=1

m

(I + Bωk V (λ)).

k=1

The lemma follows from using the fact that the zeros of I + BV correspond, with multiplicity, to RV .

We shall need some knowledge of det(I − (−1)m BVm (λ)) in the upper half plane. This is analogous to [2, Lemma 3.2]. d Lemma 5.5. Let V ∈ L∞ comp (R ; C), m > d/2 be an integer and

hV (λ) = det(I − (−1)m BVm (λ)). Then for 0 < θ < π , > 0, there is a C (depending on V , θ , and ) such that for r ∈ R+ , |hV (reiθ ) − 1| ≤ Cr −1 . Proof. We use the fact that | det(I + A) − 1| ≤ e A 1 [14, Lemma XIII.17.4]. For χ ∈ Cc∞ (Rd ),

χ R0 (reiθ )χ H s (Rd )→H s+2 (Rd ) ≤ C and

χ R0 (reiθ )χ H s (Rd )→H s (Rd ) ≤

C . r2

Here C denotes a positive constant whose value changes from line to line and may depend on parameters other than r. Therefore, for any > 0 and p > d/2,

χ R0 (reiθ )χ p ≤

C r 2−d/p−

and

BVm (reiθ ) 1 ≤

C . r 1−

d Proposition 5.6. Let V ∈ L∞ comp (R ; R) be bounded below by the characteristic funcπi

tion of a ball. Let m > d/4 be an integer and let ω = e m . Then lim sup r→∞

log( 2m j =1 Nωj zV (r)) log r

for all z ∈ C \ E, where E is a pluripolar set.

=d

726

T. Christiansen

2m (λ)). This is a holomorphic function of (λ, z) ∈ Proof. Consider the function det(I −BzV C2 . Moreover, as in [22, Prop. 3], it is (for fixed z) of order at most d in λ. On the other hand, if z2m = −1, by Lemma 5.3 it is of order at least d in λ. Thus, applying [9, 2m (λ)) is of order d Props. 1.39 and 1.40] as in the proof of Proposition 2.3, det(I − BzV for z ∈ C \ E for a pluripolar set E. Now fix z ∈ C \ E. Suppose

2m

lim sup

log(

j =1 Nωj zV (r))

log r

r→∞

= d < d.

Then we may write

2m (λ)) = αz eigz (λ) det(I − BzV

G(λ/λj ; d − 1)

(13)

λj ∈∪k Rωk zV , λj =0

where αz is a constant, gz is a polynomial of order at most d, and G(ζ ; d − 1) is the canonical factor of order d − 1. There are at most finitely many elements of ∪k Rωk zV in the upper half plane. Thus standard estimates on canonical products and the minimum modulus theorem show that for Im λ sufficiently large and 0 < θ1 < arg λ < θ2 < π , the canonical product in (13) must satisfy, for every > 0 and some C ,

1 −C |λ|d +

d +

e ≤

G(λ/λj ; d − 1) ≤ C eC |λ| .

C

λj ∈∪k Rωk zV

λ =0

j

2m (λ)) Thus, using Lemma 5.5, gz (λ) must be of order strictly less than d, and so det(I −BzV is of order strictly less than d in λ, a contradiction.

Now we can give the proof of Theorem 1.2. Proof of Theorem 1.2. Fix an integer m with m > d/4. By Proposition 5.6 and using the notation of that proposition, 2m

lim sup

log(

r→∞

j =1 Nωj zV (r))

log r

=d

for z ∈ C \ E , for some pluripolar set E . If z1 ∈ C \ E , then, there is some j1 such that lim sup r→∞

log Nωj1 z1 V (r) log r

= d.

Thus by Lemma 4.2 the potential V1 (z, x) = zV (x) satisfies the assumptions of Theorem 1.1 with z0 = ωj1 z1 . Applying Theorem 1.1 finishes the proof.

We may now give the proof of Corollary 1.3.

Resonances in Potential Scattering

727

d ∞ d Proof of Corollary 1.3. Let V0 ∈ L∞ comp (R ; C) and let > 0. Let V1 ∈ Cc (R ; R) be bounded below by the characteristic function of a ball. Then by Theorem 1.2, RzV1 has convergence exponent d for all z ∈ C \ E1 for some pluripolar set E1 . The set E1 ⊂ C R2 not only has Lebesgue measure zero, but its restriction to R is of Lebesgue measure zero in R (e.g. [13, Sect. 3.2]). Thus we may choose z1 ∈ R \ (E1 R ) so that z1 V1 L∞ < /2 and Rz1 V1 has convergence exponent d. Now consider V (z) = zz1 V1 + (1 − z)V0 . Then V (z, x) is in the framework of Theorem 1.1, V (1) = z1 V1 , and V (0) = V0 . Using Theorem 1.1, RV (z) has convergence exponent d for all z ∈ C \ E, where E is a pluripolar set. Thus we may find a z2 ∈ R \ (E R ) with |z2 | < (2( V0 L∞ + 1))−1 . The potential V2 (x) = V (z2 , x) thus has RV2 with convergence exponent d and V0 − V2 L∞ < . We remark that if V0 is real-valued, then so is V2 . With fairly straightforward modifications, the same proof gives the result for smooth potentials in the C ∞ topology.

Acknowledgement. We are pleased to thank D. Drasin, D. Edidin, C. Kiselman, and I. Verbitsky for helpful discussions.

References 1. Christiansen, T.: Schr¨odinger operators with complex-valued potentials and no resonances. http://arxiv.org/list/math-ph/0408052, 2004 2. Froese, R.: Asymptotic distribution of resonances in one dimension. J. Differ. Eqs. 137(2), 251–272 (1997) 3. Froese, R.: Upper bounds for the resonance counting function of Schr¨odinger operators in odd dimensions. Canad. J. Math. 50(3), 538–546 (1998) 4. Guillop´e, L., Zworski, M.: Scattering asymptotics for Riemann surfaces. Ann. Math. 145, 597–660 (1997) 5. Kiselman, C.: The use of conjugate convex functions in complex analysis. In: Complex analysis (Warsaw, 1979), Banach Center Publ. 11, Warsaw: PWN, 1983, pp. 131–142 6. Klimek, M.: Pluripotential theory. Oxford: Clarendon Press, 1991 7. Labutin, D.: Pluripolarity of sets with small Hausdorff measure. Manuscripta Math. 102(2), 163–167 (2000) 8. Lax, P.D., Phillips, R.S.: Decaying modes for the wave equation in the exterior of an obstacle. Commun. Pure Applied Math., 22, 737–787 (1969) 9. Lelong, P., Gruman, L.: Entire functions of several complex variables. Berlin: Springer Verlag, 1986 10. Levin, B.Ja.: Distribution of zeros of entire functions. Providence, RI: American Mathematical Society, 1964, viii+493 pp 11. Olver, F.W.J.: Asymptotics and special functions. New York: Academic Press, 1974 12. Olver, F.W.J.: Bessel functions of integer order. In: Handbook of mathematical functions with formulas, graphs, and mathematical tables, M. Abromowitz, I. Stegun (eds.). National Bureau of Standards Applied Math Series 55, 1968 13. Ransford, T.: Potential theory in the complex plane. Cambridge: Cambridge University Press, 1995 14. Reed, M., Simon, B.: Methods of modern mathematical physics IV: Analysis of operators. Boston: Academic Press, 1978 15. S´a Barreto, A.: Remarks on the distribution of resonances in odd dimensional Euclidean scattering. Asymptot. Anal. 27(2), 161–170 (2001) 16. Simon, B.: Resonances in one dimension and Fredholm determinants. J. Funct. Anal. 178(2), 396– 420 (2000) 17. Vasy, A.: Scattering poles for negative potentials. Commun. PDE 21(1&2), 185–194 (1997) 18. Vodev, G.: Resonances in the Euclidean scattering. Cubo Matem´atica Educacional 3(1), 317–360 (2001) 19. Yafaev, D.: Scattering theory: some old and new problems. Lecture Notes in Mathematics 1735. Berlin: Springer-Verlag, 2000, xvi+169 pp 20. Zworski, M.: Distribution of poles for scattering on the real line. J. Funct. Anal. 73(2), 277–296 (1987)

728

T. Christiansen

21. Zworski, M.: Sharp polynomial bounds on the number of scattering poles of radial potentials. J. Funct. Anal. 82, 370–403 (1989) 22. Zworski, M.: Sharp polynomial bounds on the number of scattering poles. Duke Math. J. 59(2), 311–323 (1989) 23. Zworski, M.: Counting scattering poles. In: Spectral and scattering theory (Sanda, 1992), Lecture Notes in Pure and Appl. Math. 161, New York: Dekker, 1994, pp. 301–331 ´ 24. Zworski, M.: Poisson formulae for resonances. In: S´eminaire sur les Equations aux D´eriv´ees Parti´ elles, 1996-1997, Exp. No. XIII, Palaiseau: Ecole Polytech., 1997, 14pp 25. Zworski, M.: Resonances in physics and geometry. Notices Am. Math. Soc. 46(3), 319–328 (1999) 26. Zworski, M.: Quantum resonances and partial differential equations. In: Proceedings of the International Congress of Mathematicians, Vol. III (Beijing, 2002), Beijing: Higher Ed. Press, 2002, pp. 243–252 Communicated by B. Simon

Commun. Math. Phys. 259, 729–759 (2005) Digital Object Identifier (DOI) 10.1007/s00220-005-1383-9

Communications in

Mathematical Physics

The Dirac Operator on SUq (2) Ludwik D¸abrowski1 , Giovanni Landi2 , Andrzej Sitarz3, , Walter van Suijlekom1 , Joseph C. V´arilly4, 1 2

Scuola Internazionale Superiore di Studi Avanzati, Via Beirut 2-4, 34014 Trieste, Italy Dipartimento di Matematica e Informatica, Universit`a di Trieste, Via Valerio 12/b, 34127 Trieste, and INFN, Sezione di Napoli, Napoli, Italy 3 Institute of Physics, Jagiellonian University, Reymonta 4, 30-059 Krak´ ow, Poland 4 Departamento de Matem´atica, Universidad de Costa Rica, 2060 San Jos´e, Costa Rica Received: 11 December 2004 / Accepted: 21 January 2005 Published online: 21 June 2005 – © Springer-Verlag 2005

Abstract: We construct a 3+ -summable spectral triple (A(SUq (2)), H, D) over the quantum group SUq (2) which is equivariant with respect to a left and a right action of Uq (su(2)). The geometry is isospectral to the classical case since the spectrum of the operator D is the same as that of the usual Dirac operator on the 3-dimensional round sphere. The presence of an equivariant real structure J demands a modification in the axiomatic framework of spectral geometry, whereby the commutant and first-order properties need be satisfied only modulo infinitesimals of arbitrary high order. 1. Introduction In this paper, we show how to successfully construct a (noncommutative) 3-dimensional spectral geometry on the manifold of the quantum group SUq (2). This is done by building a 3+ -summable spectral triple (A(SUq (2)), H, D) which is equivariant with respect to a left and a right action of Uq (su(2)). The geometry is isospectral to the classical case in the sense that the spectrum of the operator D is the same as that of the usual Dirac operator on the 3-sphere S3 SU (2), with the “round” metric. The possibility of such an isospectral deformation was suggested in [10] where the operator D was named the “true Dirac” operator. Subsequent investigations [13] seemed to rule out this deformation because some of the commutators [D, x], with x ∈ A(SUq (2)), failed to extend to bounded operators, a property which is essential to the definition of a spectral triple [7]. These difficulties are overcome here by constructing on a Hilbert space of spinors H a spin representation of the algebra A(SUq (2)) which differs slightly from the one used in [13]. Our spin representation is determined by requiring that it be equivariant with respect to a left and a right action of Uq (su(2)), a condition which is not present in Partially supported by Polish State Committee for Scientific Research (KBN) under grant 2 P03B 022 25. Regular Associate of the Abdus Salam ICTP, Trieste.

730

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

the previous approach. The role of Hopf-algebraic equivariance in producing interesting spectral triples has already met with some success [5, 12]; for a programmatic viewpoint, see [30]. Our construction of an isospectral noncommutative geometry on the manifold of SUq (2), which deforms the usual geometry on the 3-dimensional sphere, belongs to an interesting terrain where noncommutative geometry meets the underlying “spaces” of quantum groups. Recent examples [11, 12, 26, 29] are concerned with the “two-dimensional” spheres of Podle´s [27] and more general flag manifolds [22]. The left-equivariant spectral triple on SUq (2) constructed in [5] and fully analyzed in [9] is not isospectral and does not have a good limit at the classical value of the deformation parameter. After a brief review in Sect. 2 of SUq (2) and its symmetries, mainly to fix notation, we construct its left regular representation in Sect. 3 via equivariance, and transfer that construction to spinors in Sect. 4. On the Hilbert space of spinors, we consider in Sect. 5 a class of equivariant “Dirac” operators D. For such an operator D having a classical spectrum, that is, with eigenvalues depending linearly on “total angular momentum”, we prove boundedness of the commutators [D, x], for all x ∈ A(SUq (2)). In fact, this equivariant Dirac operator is essentially determined by a modified first-order condition, as is shown later on. Since the spectrum is classical, the deformation –from SU (2) to SUq (2)– is isospectral, and in particular the metric dimension of the spectral geometry is 3. The new feature of the spin geometry of SUq (2) is the nature of the real structure J , whose existence is addressed in Sect. 6. An equivariant J is constructed by suitably lifting to the Hilbert space of spinors H the antiunitary Tomita conjugation operator for the left regular representation of A(SUq (2)). However, this J is not the Tomita operator for the spin representation; for if it were, the spectral triple would inherit equivariance under the co-opposite symmetry algebra U1/q (su(2)), forcing it to be trivial. Therefore, the equivariant J we shall use does not intertwine the spin representation of A(SUq (2)) with its commutant, and it is not possible to satisfy all the desirable properties of a real spectral triple as set forth in [8, 15]. This rupture was already observed in [11]; just as in that paper, we must also weaken the first-order requirement on D. In Sect. 7, we rescue the formalism by showing that the commutant and first-order properties nevertheless do hold, up to infinitesimals of arbitrary high order. For that, we identify an ideal of trace-class operators containing all commutation defects; these defects vanish in the classical case. An appropriately modified first-order condition is given, which distinguishes Dirac operators with classical spectra. A discussion of the Connes–Moscovici local index formula for the spectral geometry presented in this paper is currently under investigation and will be soon reported elsewhere. 2. Algebraic Preliminaries Definition 2.1. Let q be a real number with 0 < q < 1, and let A = A(SUq (2)) be the ∗-algebra generated by a and b, subject to the following commutation rules: ba = qab,

b∗ a = qab∗ ,

a ∗ a + q 2 b∗ b = 1,

bb∗ = b∗ b,

aa ∗ + bb∗ = 1.

(2.1)

As a consequence, a ∗ b = qba ∗ and a ∗ b∗ = qb∗ a ∗ . This becomes a Hopf ∗-algebra under the coproduct

The Dirac Operator on SUq (2)

731

a := a ⊗ a − q b ⊗ b∗ , b := b ⊗ a ∗ + a ⊗ b, counit ε(a) = 1, ε(b) = 0, and antipode Sa = a ∗ , Sb = −qb, Sb∗ = −q −1 b∗ , Sa ∗ = a. Remark 2.2. Here we follow Majid’s “lexicographic convention” [23, 24] (where, with c = −qb∗ , d = a ∗ , a factor of q is needed to restore alphabetical order). Another much-used convention is related to ours by a ↔ a ∗ , b ↔ −b; see, for instance, [5, 9]. Definition 2.3. The Hopf ∗-algebra U = Uq (su(2)) is generated as an algebra by elements e, f, k, with k invertible, satisfying the relations ek = qke,

k 2 − k −2 = (q − q −1 )(f e − ef ),

kf = qf k,

(2.2)

and its coproduct is given by k = k ⊗ k,

e = e ⊗ k + k −1 ⊗ e,

f = f ⊗ k + k −1 ⊗ f.

Its counit , antipode S, and star structure ∗ are given respectively by (k) = 1, Sk = k −1 , k ∗ = k, (f ) = 0, Sf = −qf, f ∗ = e, (e) = 0, Se = −q −1 e, e∗ = f. There is an automorphism ϑ of Uq (su(2)) defined on the algebra generators by ϑ(k) := k −1 ,

ϑ(f ) := −e,

ϑ(e) := −f.

(2.3)

Remark 2.4. We recall that there is another convention for the generators of Uq (su(2)) in widespread use: see [19], for instance. The handy compendium [21] gives both versions, denoting by U˘ q (su(2)) the version which we adopt here. However, the parameter q of this paper corresponds to q −1 in [21], or alternatively, we keep the same q but exchange e and f of that book; the equivalence of these procedures is immediate from the above formulas (2.2). The older literature uses the convention which we follow here, with generators usually written as K = k, X+ = f , X − = e. We employ the so-called “q-integers”, defined for each n ∈ Z as [n] = [n]q :=

q n − q −n q − q −1

provided

q = 1.

(2.4)

Definition 2.5. There is a bilinear pairing between U and A, defined on generators by 1

k, a = q 2 ,

1

k, a ∗ = q − 2 ,

e, −qb∗ = f, b = 1,

with all other couples of generators pairing to 0. It satisfies (Sh)∗ , x = h, x ∗ ,

for all h ∈ U, x ∈ A.

(2.5)

We regard U as a subspace of the linear dual of A via this pairing. There are canonical left and right U-module algebra structures on A [32] such that g, h x := gh, x,

g, x h := hg, x,

for all g, h ∈ U, x ∈ A.

732

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

They are given by h x := (id ⊗h) x and x h := (h ⊗ id) x, or equivalently by h x := x(1) h, x(2) ,

x h := h, x(1) x(2) ,

(2.6)

using the Sweedler notation x =: x(1) ⊗ x(2) with implicit summation. The right and left actions of U on A are mutually commuting: (h a) g = (a(1) h, a(2) ) g = g, a(1) a(2) h, a(3) = h (g, a(1) a(2) ) = h (a g), and it follows from (2.5) that the star structure is compatible with both actions: h x ∗ = ((Sh)∗ x)∗ ,

x ∗ h = (x (Sh)∗ )∗ ,

for all

h ∈ U, x ∈ A.

On the generators, the left action is given explicitly by 1

1

1

1

k a = q 2 a, k a ∗ = q − 2 a ∗ , k b = q − 2 b, k b∗ = q 2 b∗ , f a = 0, f a ∗ = −qb∗ , f b = a, f b∗ = 0, ∗ e a = b, e a = 0, e b = 0, e b∗ = −q −1 a ∗ ,

(2.7)

and the right action is likewise given by 1

1

1

1

a k = q 2 a, a ∗ k = q − 2 a ∗ , b k = q 2 b, b∗ k = q − 2 b∗ , a f = −qb∗ , a ∗ f = 0, b f = a ∗ , b∗ f = 0, ∗ a e = 0, a e = b, b e = 0, b∗ e = −q −1 a.

(2.8)

We remark in passing that since A is also a Hopf algebra, the left and right actions are linked through the antipodes: S(Sh x) = Sx h. Indeed, it is immediate from (2.6) and the duality relation Sh, y = h, Sy that S(Sh x) = S(x(1) ) Sh, x(2) = S(x(1) ) h, S(x(2) ) = (Sx)(2) h, (Sx)(1) = Sx h. As noted in [14], for instance, the invertible antipode of U serves to transform the right action into a second left action of U on A, commuting with the first. Here we also use the automorphism ϑ of (2.3), and define h · x := x S −1 (ϑ(h)). Indeed, it is immediate that g · (h · x) = (x S −1 (ϑh)) S −1 (ϑg) = x (S −1 (ϑh)S −1 (ϑg)) = x (S −1 (ϑ(gh)) = gh · x, i.e., it is a left action. We tabulate this action directly from (2.8): 1

1

1

1

k · a = q 2 a, k · a ∗ = q − 2 a ∗ , k · b = q 2 b, k · b∗ = q − 2 b∗ , f · a = 0, f · a ∗ = qb, f · b = 0, f · b∗ = −a, ∗ ∗ −1 ∗ e · a = −b , e · a = 0, e · b = q a , e · b∗ = 0.

(2.9)

The Dirac Operator on SUq (2)

733

In the “classical” case q = 1, we use the well-known identifications SU (2) ≈ S3 ≈ Spin(4)/ Spin(3) = (SU (2) × SU (2))/SU (2); on quotienting out the diagonal SU (2) subgroup of Spin(4), we realize SU (2) as the base space of the principal spin bundle Spin(4) → S3 , with projection map (g, h) → gh−1 . The action of Spin(4) on SU (2) is given by (g, h) · x := gxh−1 , and the stabilizer of 1 is the diagonal SU (2) subgroup. We may choose to regard this as a pair of commuting actions of SU (2) on the base space SU (2), apart from the nuance of switching one of them from a right to a left action via the group inversion map. The foregoing pair of actions of Uq (su(2)) on A(SUq (2)) extends this scheme to the case q = 1. We recall [21] that A has a vector-space basis consisting of matrix elements of its l : 2l ∈ N, m, n = −l, . . . , l − 1, l }, where irreducible corepresentations, { tmn 1

0 = 1, t00

t 12

1 2,2

l = The coproduct has the matricial form tmn

j l trs tmn

=

1

= a,

t 12

1 2 ,− 2

= b.

l l k tmk ⊗tkn , while the product is given by

j +l

Cq

k=|j −l|

j l k j l k Cq tk , r m r +m s n s + n r+m,s+n

(2.10)

where the Cq (−) factors are q-Clebsch–Gordan coefficients [3, 20]. The Haar state on the C ∗ -completion C(SUq (2)), which we shall denote by ψ, is l ) := 0 if l > 0. (The Haar faithful, and it is determined by setting ψ(1) := 1 and ψ(tmn state is usually denoted by h, but here we use h for a generic element of U instead.) Let Hψ = L2 (SUq (2), ψ) be the Hilbert space of its GNS representation; then the GNS map η : C(SUq (2)) → Hψ is injective and satisfies l l ∗ l )2 = ψ((tmn ) tmn ) = η(tmn

q −2m , [2l + 1]

(2.11)

l ) are mutually orthogonal. From the formula and the vectors η(tmn

Cq

q −m l l 0 = (−1)l+m , 1 −m m 0 [2l + 1] 2

we see that the involution in C(SUq (2)) is given by l ∗ l (tmn ) = (−1)2l+m+n q n−m t−m,−n . 1

In particular, t 2 1

− 2 , 21

1

= −qb∗ and t 2 1

− 2 ,− 21

(2.12)

= a ∗ , as expected.

An orthonormal basis of Hψ is obtained by normalizing the matrix elements, using (2.11): 1

l ). |lmn := q m [2l + 1] 2 η(tmn

(2.13)

734

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

3. Equivariant Representation of A(SUq (2)) Let U be a Hopf algebra and let A be a left U-module algebra. A representation of A on a vector space V is called U-equivariant if there is also an algebra representation of U on V , satisfying the following compatibility relation: h(xξ ) = (h(1) x)(h(2) ξ ),

h ∈ U, x ∈ A, ξ ∈ V ,

where denotes the Hopf action of U on A. If A is instead a right U-module algebra, the appropriate compatibility relation is x(hξ ) = h(1) ((x h(2) )ξ ). Also, if A is an U-bimodule algebra (carrying commuting left and right Hopf actions of U), one can demand both of these conditions simultaneously for the pair of representations of A and U on the same vector space V . In the present case, it turns out to be simpler to consider equivariance under two commuting left Hopf actions, as exemplified in the previous section. We shall first work out in detail a construction of the regular representation of the Hopf algebra A(SUq (2)), showing how it is determined by its equivariance properties. We begin with the known representation theory [21] of Uq (su(2)). The irreducible finite dimensional representations σl of Uq (su(2)) are labelled by nonnegative half-integers l = 0, 21 , 1, 23 , 2, . . . , and they are given by σl (k) |lm = q m |lm, σl (f ) |lm = [l − m][l + m + 1] |l, m + 1, σl (e) |lm = [l − m + 1][l + m] |l, m − 1,

(3.1)

where the vectors |lm, for m = −l, −l + 1, . . . , l − 1, l, form a basis for the irreducible U-module Vl , and the brackets denote q-integers as in (2.4). Moreover, σl is a ∗-representation of Uq (su(2)), with respect to the hermitian scalar product on Vl for which the vectors |lm are orthonormal. Remark 3.1. The irreducible representations (3.1) coincide with those of U˘ q (su(2)) in [21], after exchange of e and f (see Remark 2.4). Further results on the representation theory of Uq (su(2)) are taken from [21, Chap. 3] without comment; in particular we use the q-Clebsch–Gordan coefficients found therein for the decomposition of tensor product representations. An alternative source for these coefficients is [3], although their 1 q 2 is our q. Definition 3.2. Let λ and ρ be mutually commuting representations of the Hopf algebra U on a vector space V . A representation π of the ∗-algebra A on V is (λ, ρ)-equivariant if the following compatibility relations hold: λ(h) π(x)ξ = π(h(1) · x) λ(h(2) )ξ, ρ(h) π(x)ξ = π(h(1) x) ρ(h(2) )ξ,

(3.2)

for all h ∈ U, x ∈ A and ξ ∈ V . We shall now exhibit an equivariant representation of A(SUq (2)) on the preHilbert space which is the (algebraic) direct sum V :=

∞ 2l=0

Vl ⊗ V l .

The Dirac Operator on SUq (2)

735

The two Uq (su(2)) symmetries λ and ρ will act on the first and the second leg of the tensor product respectively; both actions will be via the irreps (3.1). In other words, λ(h) = σl (h) ⊗ id,

ρ(h) = id ⊗σl (h)

on Vl ⊗ Vl .

We abbreviate |lmn := |lm ⊗ |ln, for m, n = −l, . . . , l − 1, l; these form an orthonormal basis for Vl ⊗ Vl , for each fixed l. (As we shall see, this is consistent with our labelling (2.13) of the orthonormal basis of Hψ in the previous section.) Also, we adopt a shorthand notation: l ± := l ± 21 ,

m± := m ± 21 ,

n± := n ± 21 .

Proposition 3.3. A (λ, ρ)-equivariant ∗-representation π of A(SUq (2)) on the Hilbert space V of (3.3) must have the following form: − + + + − + + π(a) |lmn = A+ lmn |l m n + Almn |l m n ,

+ − π(b) |lmn = Blmn |l + m+ n− + Blmn |l − m+ n− , + |l + m− n− + A − |l − m− n− , π(a ∗ ) |lmn = A ∗

π(b ) |lmn =

(3.3)

lmn lmn + |l + m− n+ + B − |l − m− n+ , B lmn lmn

± where the constants A± lmn and Blmn are, up to phase factors depending only on l, given by

A+ lmn

=

A− lmn = + = Blmn − = Blmn

[l + m + 1][l + n + 1] q [2l + 1][2l + 2] 1 2 (2l+m+n+1)/2 [l − m][l − n] q , [2l][2l + 1] 1 2 (m+n−1)/2 [l + m + 1][l − n + 1] q , [2l + 1][2l + 2] 1 2 (m+n−1)/2 [l − m][l + n] −q , [2l][2l + 1]

1

(−2l+m+n−1)/2

2

,

(3.4)

and the other coefficients are complex conjugates of these, namely, ± = (A∓± − − ) , A lmn l m n

± = (B ∓± − + ) . B lmn l m n

(3.5)

Proof. First of all, notice that hermiticity of π entails the relations (3.5). We now use the covariance properties (3.2). When h = k, they simplify to λ(k) π(x) ξ = π(k · x) λ(k) ξ,

ρ(k) π(x) ξ = π(k x) ρ(k) ξ.

(3.6)

Thus, for instance, when x = a we find the relations 1 1 λ(k) π(a) |lmn = π(q 2 a) q m |lmn = q m+ 2 π(a) |lmn, 1 1 ρ(k) π(a) |lmn = π(q 2 a) q n |lmn = q n+ 2 π(a) |lmn, 1

where we have invoked k · a = k a = q 2 a. We conclude that π(a) |lmn must lie in the closed span of the basis vectors |l m+ n+ . A similar argument with x = b in (3.6) shows 1 1 that π(b) increments n and decrements m by 21 , since k · b = q 2 b while k b = q − 2 b.

736

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

The analogous behaviour for x = a ∗ and x = b∗ follows in the same way from (2.7) and (2.9). Thus, π(a)|lmn is a (possibly infinite) sum π(a) |lmn =

l

Cl lmn |l m+ n+ ,

(3.7)

where the sum runs over nonnegative half-integers l = 0, 21 , 1, 23 , . . . . Next, we call on (3.2) with h = f , x = a, to get 1

λ(f ) π(a)ξ = π(f · a) λ(k)ξ + π(k −1 · a) λ(f )ξ = q − 2 π(a) λ(f )ξ, on account of (2.7). Consequently, λ(f )r π(a) = q −r/2 π(a) λ(f )r for r = 1, 2, 3, . . . . On applying λ(f )r to both sides of (3.7), we obtain on the left-hand side a multiple of π(a) |l, m + r, n, which vanishes for m + r > l; and on the right-hand side we get 1 + + l Cl lmn Dl mr |l , m + r, n , where Dl mr = 0 as long as m + r + 2 ≤ l . We 1 conclude that Cl lmn = 0 for l > l + 2 , by linear independence of these summands. To get a lower bound on the range l in (3.7), we consider the of the index ∗ − − analogous expansion π(a ) |lmn = l Cl lmn |l m n . Now λ(e)r π(a ∗ ) |lmn = q r/2 π(a ∗ ) λ(e)r |lmn ∝ π(a ∗ ) |l, m − r, n vanishes for m − r < −l; while λ(e)r |l m− n− = Fl mr |l , m− − r, n− with Fl mr = 0 for m − r − 21 ≥ −l . Again l lmn = 0 for l > l + 1 . However, since π is a ∗-representation, the we conclude that C 2 matrix element l m n | π(a) | lmn is the complex conjugate of lmn | π(a ∗ ) | l m n , 1 which vanishes for l > l + 2 , so that the indices in (3.7) satisfy l − 21 ≤ l ≤ l + 21 . Clearly, l = l is ruled out because l − m and l − m ± 21 must both be integers. Therefore, π(a) and also π(a ∗ ) have the structure indicated in (3.3). A parallel argument shows the corresponding result for π(b) and π(b∗ ). The coefficients which appear in (3.4) may be determined by further application of the equivariance relations. Since f a = 0 and e b = 0, then by applying ρ(f ) and ρ(e) to the first two relations of (3.3), we obtain the following recursion relations for ± the coefficients A± lmn , Blmn : −2 + 2 A+ Alm,n+1 [l + n + 1] 2 , lmn [l + n + 2] = q 1

1

1

−2 − 2 Alm,n+1 [l − n] 2 , A− lmn [l − n − 1] = q 1

1

1

+ + [l − n + 2] 2 = q 2 Blm,n−1 [l − n + 1] 2 , Blmn 1

1

1

− − [l + n − 1] 2 = q 2 Blm,n−1 [l + n] 2 . Blmn 1

1

1

Then, applying λ(f ) to the same pair of equations, we further find that −2 + 2 A+ Al,m+1,n [l + m + 1] 2 , lmn [l + m + 2] = q 1

1

1

−2 − 2 Al,m+1,n [l − m] 2 , A− lmn [l − m − 1] = q 1

1

1

+ + [l + m + 2] 2 = q − 2 Bl,m+1,n [l + m + 1] 2 , Blmn 1

− [l − m − 1] = q Blmn 1 2

1

− 21

1

− Bl,m+1,n [l − m] . 1 2

(3.8a)

The Dirac Operator on SUq (2)

737

These recursions are explicitly solved by (m+n)/2 A+ [l + m + 1] 2 [l + n + 1] 2 al+ , lmn = q 1

1

(m+n)/2 A− [l − m] 2 [l − n] 2 al− , lmn = q 1

1

+ Blmn = q (m+n)/2 [l + m + 1] 2 [l − n + 1] 2 bl+ , 1

− Blmn

=q

(m+n)/2

1 2

1

[l − m] [l + n]

(3.8b)

bl− ,

1 2

where al± , bl± depend only on l. Once more, we apply the equivariance relations (3.2); this time, we use 1

ρ(e)π(a) = π(e a)ρ(k) + π(k −1 a)ρ(e) = π(b)ρ(k) + q − 2 π(a)ρ(e). (3.9) Applied to |lmn, it yields an equation between linear combinations of |l + m+ n− and |l − m+ n− ; equating coefficients, we find bl+ = q l al+ ,

bl− = −q −l−1 al− .

Furthermore, applying also to |lmn the relation λ(e)π(b) = π(e · b)λ(k) + π(k −1 · b)λ(e) 1

= q −1 π(a ∗ )λ(k) + q − 2 π(b)λ(e),

(3.10)

we get, after a little simplification and use of (3.5), (a − 1 ) = q 2l+ 2 al+ . 3

l+ 2

It remains only to determine the parameters al+ . We turn to the algebra commutation relation ba = qab and compare coefficients in the expansion of π(b)π(a) |lmn = q π(a)π(b) |lmn. Those of |l + 1, m + 1, n and |l − 1, m + 1, n already coincide; but from the |l, m + 1, n terms, we get the identity q[2l + 2] |al+ |2 = [2l] |a + 1 |2 . l− 2

This can be solved immediately, to give al+ =

Cζl q −l 1

1

[2l + 1] 2 [2l + 2] 2

,

where C is a positive constant, and ζl is a phase factor which can be absorbed in the basis vectors |lmn; hereinafter we take ζl = 1 (we comment on that choice at the end of the section). Finally, from the relation a ∗ a + q 2 b∗ b = 1 we obtain 1 = 000 | π(a ∗ a + q 2 b∗ b) | 000 = |a0+ |2 + q 2 |b0+ |2 = (1 + q 2 )C 2 /[2] = q C 2 ,

738

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly 1

and thus C = q − 2 . We therefore find that 1

al+ =

1

q −l− 2 1

q l+ 2

, 1

al− =

, 1

bl− = −

[2l + 1] 2 [2l + 2] 2

1

1

[2l] 2 [2l + 1] 2

1

bl+ =

,

1

q− 2 1

[2l + 1] 2 [2l + 2] 2

q− 2 1

1

[2l] 2 [2l + 1] 2

,

and substitution in (3.8b) yields the coefficients (3.4).

It is easy to check that the formulas (3.3) give precisely the left regular representation πψ of A(SUq (2)). Indeed, that representation was implicitly given already by the product rule (2.10). From [3, (3.53)] we obtain 1 2 1 2

Cq

l l+ m m+

1

1

= q − 2 (l−m)

[l + m + 1] 2 1

[2l + 1] 2

,

1 1 [l − m + 1] 2 l l+ 2 (l+m) = q , 1 − 21 m m− [2l + 1] 2

1 1 2 1 2 l l− (l+m+1) [l − m] 2 =q , Cq 1 1 + [2l + 1] 2 2 mm 1 1 2 l l− − 21 (l−m+1) [l + m] Cq 21 = −q . 1 − 2 m m− [2l + 1] 2

Cq

By setting j = r = s =

1 2

(3.11)

1 2

in (2.10), we find

1 1 2 l l± 2 l l± l l± Cq 1 η(tm Cq 1 πψ (a)η(tmn ) = + n+ ). + + ± 2 mm 2 nn

Taking the normalization (2.13) into account, this becomes 1 l l+ l l+ + + + 2 Cq 1 Cq πψ (a)|lmn = q + |l m n 1 m m+ [2l + 2] 2 2 nn 1 1 1 2 l l− l l− − 21 [2l + 1] − + + 2 2 Cq 1 +q + Cq 1 n n+ |l m n 1 [2l] 2 2 mm 2 1

− 21

[2l + 1] 2

1

2 1 2

1

1

= q 2 (−2l+m+n−1)

1

[l + m + 1] 2 [l + n + 1] 2 1 2

[2l + 1] [2l + 2] 1

+q

1 2 (2l+m+n+1)

= π(a)|lmn.

|l + m+ n+

1

[l − m] 2 [l − n] 2 1

1 2

1

[2l] 2 [2l + 1] 2

|l − m+ n+

A similar calculation, using (3.11) again, shows that π(b) = πψ (b). Since a and b generate A as a ∗-algebra, we conclude that π = πψ . (It should be noted that πψ has

The Dirac Operator on SUq (2)

739

already been exhibited in [5] in the same way, albeit with a different convention for the algebra generators.) The identification (2.13) embeds the prehilbert space V densely in the Hilbert space Hψ , and the representation πψ extends to the GNS representation of C(SUq (2)) on Hψ , as described by the Peter-Weyl theorem [21, 32]. In like manner, all other representations of A exhibited in this paper extend to C ∗ -algebra representations of C(SUq (2)) on the appropriate Hilbert spaces. The only lack of uniqueness in the proof of Proposition 3.3 involved the choice of the phase factors ζl ; if Z is the linear operator on V which multiplies vectors in Vl ⊗Vl by ζl , then Z commutes with each λ(h) and ρ(g), and extends to a unitary operator on Hψ . In other words, any (λ, ρ)-equivariant representation π extends to Hψ and is unitarily equivalent to the left regular representation. The (standard) choice ζl = 1 ensures that ± all coefficients A± lmn and Blmn are real: it is indeed an extension of the Conden-Shortley phase convention [4]. 4. The Spin Representation The left regular representation π of A, constructed in the previous section, can be amplified to π = π ⊗id on V ⊗ C2 . In the commutative case when q = 1, this yields the spinor representation of SU (2), because the spinor bundle is parallelizable: S SU (2) × C2 , although one needs to specify the trivialization. The representation theory of U (and the corepresentation theory of A) follows the same pattern; only the Clebsch–Gordan coefficients need to be modified [20] when q = 1. To fix notations, we take W := V ⊗ C2 = V ⊗ V 1 , 2

and its Clebsch–Gordan decomposition is the (algebraic) direct sum W =

∞

Vl ⊗ V l ⊗ V 1 V 1 ⊕ 2

2l=0

2

∞ 2j =1

(Vj + 1 ⊗ Vj ) ⊕ (Vj − 1 ⊗ Vj ). 2

2

(4.1)

We rename the finite-dimensional spaces on the right-hand side as ↑

W = W0 ⊕

↑

↓

Wj ⊕ Wj ,

(4.2)

2j ≥1 ↑

↓

where Wj Vj + 1 ⊗ Vj and Wj Vj − 1 ⊗ Vj , so that 2

2

↑

dim Wj = (2j + 1)(2j + 2), ↓

dim Wj = 2j (2j + 1),

for j = 0, 21 , 1, 23 , . . . ,

for j = 21 , 1, 23 , . . . .

(4.3)

Definition 4.1. We amplify the representation ρ of U on V to ρ = ρ ⊗id on W = V ⊗C2 . However, we replace λ on V by its tensor product with σ 1 on C2 : 2

λ (h) := (λ ⊗ σ 1 )(h) = λ(h(1) ) ⊗ σ 1 (h(2) ). 2

2

740

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

It is straightforward to check that the representations λ and ρ on W commute, and that the representation π of A on W is (λ , ρ )-equivariant: λ (h) π (x)ψ = π (h(1) · x) λ (h(2) )ψ,

(4.4)

ρ (h) π (x)ψ = π (h(1) x) ρ (h(2) )ψ, for all h ∈ U, x ∈ A and ψ ∈ W .

To determine an explicit basis for W which is well-adapted to (λ , ρ )-equivariance, consider the following vectors in V ⊗ C2 : clm |lmn ⊗ | 21 , − 21 + slm |l, m − 1, n ⊗ | 21 , + 21 , −slm |lmn ⊗ | 21 , − 21 + clm |l, m − 1, n ⊗ | 21 , + 21 , where 1

clm := q −(l+m)/2

[l − m + 1] 2 1

[2l + 1] 2

1

,

slm := q (l−m+1)/2

[l + m] 2

1

[2l + 1] 2

are the q-Clebsch–Gordan coefficients corresponding to the above decomposition (4.1), 2 + s 2 = 1. These are eigenvectors for λ (C ), where C := qk 2 +q −1 k −2 + satisfying clm q q lm −1 2 (q − q ) ef is the Casimir element of U, with respective eigenvalues q 2l+2 + q −2l−2 and q 2l + q −2l . Thus, to get a good basis, one should offset the index l by ± 21 (as is also suggested by the decomposition (4.2) of W ). For j = l + 21 , µ = m − 21 , with µ = −j, . . . , j and n = −j − , . . . , j − , let |j µn↓ := Cj µ |j − µ+ n ⊗ | 21 , − 21 + Sj µ |j − µ− n ⊗ | 21 , + 21 ;

(4.5a)

and for j = l − 21 , µ = m − 21 , with µ = −j, . . . , j and n = −j + , . . . , j + , let |j µn↑ := −Sj +1,µ |j + µ+ n ⊗ | 21 , − 21 + Cj +1,µ |j + µ− n ⊗ | 21 , + 21 , (4.5b) where the coefficients are now 1

Cj µ := q −(j +µ)/2

[j − µ] 2 1

1

,

[2j ] 2

Sj µ := q (j −µ)/2

[j + µ] 2 1

.

(4.5c)

[2j ] 2

Notice that there are no ↓ vectors for j = 0. It is now straightforward, though tedious, to ↓ ↑ verify that these vectors are orthonormal bases for the respective subspaces Wj and Wj . The Hilbert space of spinors is H := Hψ ⊗ C2 , which is just the completion of the algebraic direct sum (4.2). We may decompose it as H = H↑ ⊕ H↓ , where H↑ and H↓ ↑ ↓ are the respective completions of 2j ≥0 Wj and 2j ≥1 Wj . Lemma 4.2. The basis vectors |j µn↑ and |j µn↓ are joint eigenvectors for λ (k) and ρ (k), and e, f are represented on them as ladder operators: λ (k)|j µn↑ = q µ |j µn↑,

ρ (k)|j µn↑ = q n |j µn↑,

λ (k)|j µn↓ = q µ |j µn↓,

ρ (k)|j µn↓ = q n |j µn↓.

(4.6a)

The Dirac Operator on SUq (2)

741

Moreover, 1

1

1

1

1

1

1 2

1 2

λ (f )|j µn↑ = [j − µ] 2 [j + µ + 1] 2 |j, µ + 1, n↑, λ (e)|j µn↑ = [j + µ] 2 [j − µ + 1] 2 |j, µ − 1, n↑, λ (f )|j µn↓ = [j − µ] 2 [j + µ + 1] 2 |j, µ + 1, n↓,

(4.6b)

λ (e)|j µn↓ = [j + µ] [j − µ + 1] |j, µ − 1, n↓, and 1

1

1

1

1

1

1 2

1 2

ρ (f )|j µn↑ = [j − n + 21 ] 2 [j + n + 23 ] 2 |j µ, n + 1, ↑, ρ (e)|j µn↑ = [j + n + 21 ] 2 [j − n + 23 ] 2 |j µ, n − 1, ↑, ρ (f )|j µn↓ = [j − n − 21 ] 2 [j + n + 21 ] 2 |j µ, n + 1, ↓,

(4.6c)

ρ (e)|j µn↓ = [j + n − 21 ] [j − n + 21 ] |j µ, n − 1, ↓. The representation π can now be computed in the new spinor basis by conjugating the form of π ⊗ id found in Proposition 3.3 by the basis transformation (4.5). However, it is more instructive to derive these formulas from the property of (λ , ρ )-equivariance. First, we introduce a handy notation. Definition 4.3. For j = 0, 21 , 1, 23 , . . . , with µ = −j, . . . , j and n = −j − 21 , . . . , j + 21 , we juxtapose the pair of spinors

|j µn↑ |j µn := , |j µn↓ with the convention that the lower component is zero when n = ±(j + 21 ) or j = 0. Furthermore, a matrix with scalar entries, A↑↑ A↑↓ A= , A↓↑ A↓↓ is understood to act on |j µn by the rule: A|j µn↑ = A↑↑ |j µn↑ + A↓↑ |j µn↓, A|j µn↓ = A↓↓ |j µn↓ + A↑↓ |j µn↑.

(4.7)

Proposition 4.4. The representation π := π ⊗ id of A is given by π (a) |j µn = αj+µn |j + µ+ n+ + αj−µn |j − µ+ n+ , π (b) |j µn = βj+µn |j + µ+ n− + βj−µn |j − µ+ n− , π (a ∗ ) |j µn = α˜ j+µn |j + µ− n− + α˜ j−µn |j − µ− n− , π (b∗ ) |j µn = β˜j+µn |j + µ− n+ + β˜j−µn |j − µ− n+ ,

(4.8)

742

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

where αj±µn and βj±µn are, up to phase factors depending only on j , the following triangular 2 × 2 matrices:   3 1/2 −j − 21 [j +n+ 2 ] 0 q 1 1  [2j +2]  αj+µn = q (µ+n− 2 )/2 [j + µ + 1] 2  , 1 1/2  1 [j −n+ 1 ]1/2 [j +n+ ] −j 2 2 2 q [2j +1] [2j +2] q [2j +1] 

αj−µn

j +1 1 1 q = q (µ+n− 2 )/2 [j − µ] 2 

[j −n+ 21 ]1/2 [2j +1]

0

 1 1  βj+µn = q (µ+n− 2 )/2 [j + µ + 1] 2 

1

[j +n+ 21 ]1/2 [2j ] [2j +1]

j + 21

[j −n− 21 ]1/2 [2j ]

q

[j −n+ 23 ]1/2 [2j +2]

1

−2 1 −q (µ+n− 21 )/2 2 =q [j − µ] 

[j +n+ 21 ]1/2 [2j +1]

0

  ,  0

[j +n+ 21 ]1/2 −q −j −1 [2j +1] [2j +2]



βj−µn

−q 2

q

− 21

[j −n+ 21 ]1/2 [2j +1]

 ,

(4.9)



−q j −

[j −n+ 21 ]1/2 [2j ] [2j +1] 

[j +n− 21 ]1/2 [2j ]

,

and the remaining matrices are the hermitian conjugates α˜ j±µn = (αj∓± µ− n− )† ,

β˜j±µn = (βj∓± µ− n+ )† .

Proof. The proof of Proposition 3.3 applies with minor changes. From the analogues of 1 1 (3.6) and the relations λ (f )π (a) = q − 2 π (a) λ (f ) and λ (e)π (a ∗ ) = q 2 π (a ∗ ) λ (e), applied to the spinors |j µn, together with the formulas (4.6a) and (4.6b), we determine that π (a) has the indicated form, where the αj±µn are 2 × 2 matrices. The other cases of (4.8) are handled similarly. To compute these matrices, we again use the commutation relations of λ (f ) with π (a) and π (b) to establish recurrence relations, analogous to (3.8a), which yield αj+µn = q (µ+n− 2 )/2 [j + µ + 1] 2 A+ j n,

αj−µn = q (µ+n− 2 )/2 [j − µ] 2 A− j n,

βj+µn = q (µ+n− 2 )/2 [j + µ + 1] 2 Bj+n ,

βj−µn = q (µ+n− 2 )/2 [j − µ] 2 Bj−n .

1 1

1 1

1

1

1

1

± The new matrices A± j n , Bj n may be further refined by using commutation relations 1

involving ρ (f ) and ρ (e). For instance, ρ (f )π (a) = q − 2 π (a) ρ (f ) entails

1 1 0 [j − n + 21 ] 2 [j + n + 25 ] 2 A+ 1 1 jn 0 [j − n − 21 ] 2 [j + n + 23 ] 2

1 1 [j − n + 21 ] 2 [j + n + 23 ] 2 0 + . = Aj,n+1 1 1 0 [j − n − 21 ] 2 [j + n + 21 ] 2 This yields four recurrence relations for the entries of A+ j n , one of which has only the trivial solution; we conclude that

1 [j + n + 23 ] 2 aj+↑↑ 0 + , Aj n = 1 1 [j − n + 21 ] 2 aj+↓↑ [j + n + 21 ] 2 aj+↓↓

The Dirac Operator on SUq (2)

743

where the aj+ are scalars depending only on j . In a similar fashion, we arrive at

1 1 [j − n + 21 ] 2 aj−↑↑ [j + n + 21 ] 2 aj−↑↓ − Aj n = , 1 0 [j − n − 21 ] 2 aj−↓↓

Bj+n

[j − n + 23 ] 2 bj+↑↑

=

1

0

,

[j + n + 21 ] 2 bj+↓↑ [j − n + 21 ] 2 bj+↓↓ 1

Bj−n =

1

[j + n + 21 ] 2 bj−↑↑ [j − n + 21 ] 2 bj−↑↓ 1

1

.

[j + n − 21 ] 2 bj−↓↓ 1

0

The analogue of (3.9) leads quickly to the relations bj+↑↑ = q j + 2 aj+↑↑ ,

bj+↓↑ = −q −j − 2 aj+↓↑ ,

1

3

bj−↑↑ = −q −j − 2 aj−↑↑ , 3

bj−↑↓ = q j − 2 aj−↑↓ , 1

bj+↓↓ = q j − 2 aj+↓↓ , 1

bj−↓↓ = −q −j − 2 aj−↓↓ . 1

(4.10)

Next, from the analogue of (3.10) we get (a −

j + 21 ,↑↑

) = q 2j +2 aj+↑↑ ,

(a −

j + 21 ,↑↓

) = −aj+↓↑ ,

(a −

j + 21 ,↓↓

) = q 2j +1 aj+↓↓ .

The aj+ parameters may be determined from π (b)π (a) |j µn = q π (a)π (b) |j µn. The coefficients of |j ± 1, µ + 1, n yield only the relation [2j + 1] a +

a+ j + 21 ,↓↓ j ↓↑

= [2j + 3] a +

a+ . j + 21 ,↓↑ j ↑↑

(4.11)

From the |j, µ + 1, n terms, we obtain + − − + + − 2 Bj−+ n+ A+ j n + Bj − n+ Aj n = q (Aj + n− Bj n + Aj − n− Bj n ). 1

Comparison of the diagonal entries on both sides gives two more relations: [2j + 1] |aj+↓↑ |2 = q 2j +1 [2j + 1] |a + 1 |2 − q[2j + 3] |aj+↑↑ |2 , j − 2 ,↑↑ + + 2 2 2j [2j + 1] |a 1 | = q q[2j + 1] |aj ↓↓ | − [2j − 1] |a + 1 |2 . j − 2 ,↓↑

j − 2 ,↓↓

Finally, the expectation of π (a ∗ a + q 2 b∗ b) = 1 in the vector states for |j µn↑ and |j µn↓ leads to the relations q 2j [2j + 1]2 |a +

j − 21 ,↑↑

q 2j [2j + 1]2 |aj+↓↓ |2 = 1.

|2 = 1,

Thus all coefficients are now determined, up to a few j -dependent phases: 1

aj+↑↑ = ζj

q −j − 2 , [2j + 2]

1

aj+↓↑ = ηj

q2 , [2j + 1] [2j + 2]

aj+↓↓ = ξj

q −j , [2j + 1]

(4.12)

with |ζj | = |ηj | = |ξj | = 1. The relation (4.11) also implies ζj + 1 ηj = ηj + 1 ξj . 2 2 As before, we may reset these phases to 1 by redefining |j µn↑ and |j µn↓, without breaking the (λ , ρ )-equivariance. Substituting (4.12) back in previous formulas then gives (4.9).

744

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

As already mentioned, formulas (4.9) for the matrices αj±µn and βj±µn could have been obtained also from a direct but tedious computation using Eqs. (4.5) and their inverses. Remark 4.5. Were we to consider a representation of A that need not be (λ , ρ )-equivariant, we could as well have defined our spinor space, like in [13], as C2 ⊗ V , instead of V ⊗ C2 . The Clebsch–Gordan decomposition of C2 ⊗ V would be that of Eq. (4.1), but the q-Clebsch–Gordan coefficients appearing in (4.5a) and (4.5b) would be different due to the rule for exchanging the first two columns in q-Clebsch–Gordan coefficients [21]: j l m l j m Cq = Cq , r s t −s −r −t which results in a substitution of q by q −1 in (4.5c). However, this is not the correct lifting of the (λ, ρ)-equivariant representation π of A to a (λ , ρ )-equivariant representation of A on spinor space. We already noted that π as defined by π ⊗ id on V ⊗ C2 is (λ , ρ )-equivariant, directly from (λ, ρ)-equivariance of π. One checks, simply by working out both sides of Eq. (4.4), that the noncocommutativity of Uq (su(2)) spoils (λ , ρ )-equivariance of the representation π := id ⊗ π of A on the tensor product C2 ⊗ V , where we now define ρ := id ⊗ρ, and λ (h) := (σ 1 ⊗ λ)(h) = σ 1 (h(1) ) ⊗ λ(h(2) ). 2

2

5. The Equivariant Dirac Operator Recall the central Casimir element Cq = qk 2 + q −1 k −2 + (q − q −1 )2 ef ∈ U. The symmetric operators λ (Cq ) and ρ (Cq ) on H, initially defined with dense domain W , ↑ ↓ extend to selfadjoint operators on H. The finite-dimensional subspaces Wj and Wj are their joint eigenspaces: λ (Cq )|j µn↑ = (q 2j +1 + q −2j −1 ) |j µn↑, ρ (Cq )|j µn↑ = (q 2j +2 + q −2j −2 ) ×|j µn↑, λ (Cq )|j µn↓ = (q 2j +1 + q −2j −1 ) |j µn↓, ρ (Cq )|j µn↓ = (q 2j + q −2j )|j µn↓, directly from (4.6). Let D be a selfadjoint operator on H which commutes strongly with λ (Cq ) and ↑ ↓ ρ (Cq ); then the finite-dimensional subspaces Wj and Wj reduce D. We look for the general form of such a selfadjoint operator D which is moreover (λ , ρ )-invariant in the sense that it commutes with λ (h) and ρ (h), for each h ∈ Uq (su(2)). ↑

↓

Lemma 5.1. The subspaces Wj and Wj are eigenspaces for D. ↑

↓

Proof. We may restrict to either the subspace Wj or Wj . Since λ (k) and ρ (k) are required to commute with D and moreover have distinct eigenvalues on these subspaces, it follows that D has a diagonal matrix with respect to the basis |j µn↑, respectively ↑ |j µn↓. If we provisionally write D|j µn↑ = dj µn |j µn↑, then the vanishing of ↑

↑

1

1

[D, λ (f )] |j µn↑ = (dj,µ+1,n − dj µn ) [j − µ] 2 [j + µ + 1] 2 |j, µ + 1, n↑,

The Dirac Operator on SUq (2)

745 ↑

for µ = −j, . . . , j − 1, shows that dj µn is independent of µ; and [D, ρ (f )] = 0 ↑

↓

likewise shows that dj µn does not depend on n. The same goes for dj µn , too. Thus we may write ↑

D|j µn↑ = dj |j µn↑, ↑

↓

D|j µn↓ = dj |j µn↓,

(5.1)

↓

where dj and dj are real eigenvalues of D. The respective multiplicities are (2j + 1)(2j + 2) and 2j (2j + 1), in view of (4.3). One of the conditions for the triple (A, H, D) to be a spectral triple, is boundedness of the commutators [D, π (x)] for x ∈ A. This naturally imposes certain restrictions on ↑ ↓ the eigenvalues dj , dj of the operator D. For convenience, we recall the representation π of a in the basis |j µn, written explicitly on |j µn↑ and |j µn↓ as in (4.7): π (a)|j µn↑ = αj±µn↑↑ |j ± µ+ n+ ↑ + αj+µn↓↑ |j + µ+ n+ ↓, ±

π (a)|j µn↓ =

±

αj±µn↓↓ |j ± µ+ n+ ↓ + αj−µn↑↓ |j − µ+ n+ ↑.

Then, a straightforward computation shows that ↑ ↑ αj±µn↑↑ (dj ± − dj )|j ± µ+ n+ ↑ [D, π (a)] |j µn↑ = ±

↓

↑

↑

↓

+αj+µn↓↑ (dj + − dj )|j + µ+ n+ ↓, ↓ ↓ D, π (a) |j µn↓ = αj±µn↓↓ (dj ± − dj )|j ± µ+ n+ ↓ ±

+αj−µn↑↓ (dj − − dj )|j − µ+ n+ ↑.

(5.2)

Recall that the standard Dirac operator D / on the sphere S3 , with the round metric, has 3 1 3 eigenvalues (2j + 2 ) for j = 0, 2 , 1, 2 , with respective multiplicities (2j + 1)(2j + 2); and −(2j + 21 ) for j = 21 , 1, 23 , with respective multiplicities 2j (2j + 1): see [1, 18], for instance. Notice that its spectrum is symmetric about 0. In [2] a “q-Dirac” operator D was proposed, which in our notation corresponds to ↓ ↑ ↑ taking dj = 2[2j + 1]/(q + q −1 ) and dj = −dj ; these are q-analogues of the classical eigenvalues of D / − 21 . For this particular choice of eigenvalues, it follows directly from the explicit form (4.9) of the matrices αj±µn that then the right-hand sides of (5.2) diverge, and therefore [D, π (a)] is unbounded. This was already noted in [10] and it was suggested that one should instead consider an operator D whose spectrum matches that of the classical Dirac operator. In fact, Proposition 7.3 below shows that this is essentially the only possibility for a Dirac operator satisfying a (modified) first-order condition. Let us then consider any operator D given by (5.1) –that is, a bi-equivariant one– with eigenvalues of the following form: ↑

↑

↑

d j = c1 j + c 2 ,

↓

↓

↓

dj = c1 j + c2 ,

(5.3)

746

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly ↑

↑

↓

↓

where c1 , c2 , c1 , c2 are independent of j . For brevity, we shall say that the eigenvalues are “linear in j ”. On the right-hand side of (5.2), the “diagonal” coefficients simplify to ↑

↑

↑

↓

αj±µn↑↑ (dj ± − dj ) = 21 αj±µn↑↑ c1 ,

↓

↓

αj±µn↓↓ (dj ± − dj ) = 21 αj±µn↓↓ c1 ,

(5.4)

which can be uniformly bounded with respect to j –see expressions (4.9). For the offdiagonal terms, involving αj+µn↓↑ and αj−µn↑↓ , the differences between the “up” and “down” eigenvalues are linear in j . Since 0 < q < 1, it is clear that [N ] ∼ (q −1 )N−1 3 for large N , and thus αj+µn↓↑ ∼ q 3j +n+ 2 ≤ q 2j +1 for large j . Similar easy estimates yield αj+µn↓↑ = O(q 2j +1 ), αj−µn↑↓ = O(q 2j ),

βj+µn↓↑ = O(q 2j + 2 ), 1

βj−µn↑↓ = O(q 2j + 2 ), 1

(5.5)

as j → ∞.

We therefore arrive at ↓

↑

|αj+µn↓↑ (dj + − dj − 1)| ≤ Cj q 2j ,

↑

↓

|αj−µn↑↓ (dj − − dj − 1)| ≤ C j q 2j , (5.6)

for some C > 0, C > 0, independent of j ; and similar estimates hold for the off-diagonal coefficients of π (b). ↑

↓

Proposition 5.2. Let D be any selfadjoint operator with eigenspaces Wj and Wj , and ↑

↓

eigenvalues (5.1). If the eigenvalues dj and dj are linear in j as in (5.3), then [D, π (x)] is a bounded operator for all x ∈ A. Proof. Since a and b generate A as a ∗-algebra, it is enough to consider the cases x = a and x = b. For x = a and any ξ ∈ H, the relations (5.2) and (5.4), together with the Schwarz inequality, give the estimate [D, π (a)] ξ 2 ≤

1 4

↑

↓

max{(c1 )2 , (c1 )2 } π (a)ξ 2 + ξ 2 η2 ,

where η is a vector whose components are estimated by (5.6), which establishes finiteness of η since 0 < q < 1. Therefore, [D, π (a)] is norm bounded. In the same way, we find that [D, π (b)] is bounded. Now, if D is a selfadjoint operator as in Proposition 5.2, and if the eigenvalues of D satisfy (5.3) and, moreover, ↓

↑

c1 = −c1 ,

↓

↑

↑

c2 = −c2 + c1 ,

(5.7)

then the spectrum of D coincides with that of the classical Dirac operator D / on the round sphere S3 , up to rescaling and addition of a constant. Thus, we can regard our spectral triple as an isospectral deformation of (C ∞ (S3 ), H, D / ), and in particular, its spectral dimension is 3. We summarize our conclusions in the following theorem. Theorem 5.3. The triple (A(SUq (2)), H, D), where the eigenvalues of D satisfy (5.3) and (5.7), is a 3+ -summable spectral triple.

The Dirac Operator on SUq (2)

747

At this point, it is appropriate to comment on the relation of our construction with that of [13]. There, a spinor representation is constructed by tensoring the left regular representation of A(SUq (2)) by C2 on the left. This spinor space is then decomposed into two subspaces, similar to our “up” and “down” subspaces, on which D acts diagonally with eigenvalues linear in the total spin number j . The corresponding decomposition of the representation π of A(SUq (2)) on spinor space is obtained by using the appropriate Clebsch–Gordan coefficients. However, contrary to what we have established above, in [13] it is found that a certain commutator [D, π (x)] is an unbounded operator. In particular, the off-diagonal terms in the representation of [13] do not have the compact nature we encountered in (5.5). They can be bounded from below by a positive constant, which leads, when multiplied by a term linear in j , to an unbounded operator. The origin of this notable contrast is the following. Since in [13] no condition of Uq (su(2))-equivariance is imposed a priori on the representation of A(SUq (2)), the spinor space W could be identified either with V ⊗ C2 or C2 ⊗ V , according to convenience. However, as we noted in Remark 4.5, the choice of C2 ⊗ V is not allowed by the condition of (λ , ρ )-equivariance, because Uq (su(2)) is not cocommutative. Indeed, repeating the construction of a spinor representation and Dirac operator on the spinor space C2 ⊗ V instead of V ⊗ C2 –hence ignoring equivariance– results eventually in unbounded commutators. 6. The Real Structure The next issue we address is the real structure J on the spectral triple (A(SUq (2)), H, D). We shall see that by requiring equivariance of J it is not possible to satisfy all the usual properties of a real spectral triple like in [8] or [15]. Among other things, these conditions entail for J that it intertwine a left action and a commuting right action of the algebra on the Hilbert space, which then gets a bimodule structure (the commutant property); and that the bounded commutators [D, a], for any element a in the algebra, commute with the opposite action by any b in the algebra (the first order condition on D). However, we shall be able to satisfy these two conditions only up to certain compact operators. 6.1. The Tomita operator of the regular representation. On the GNS representation space Hψ , there is a natural involution Tψ : η(x) → η(x ∗ ), with domain η(C(SUq (2))), which may be regarded as an unbounded (antilinear) operator on Hψ . The Tomita–Takesaki theory [31] shows that this operator is closable (we denote its closure also by Tψ ) and 1/2 that the polar decomposition Tψ =: Jψ ψ defines both the positive “modular operator” ψ and the antiunitary “modular conjugation” Jψ . It has already been noted by Chakraborty and Pal [6] that this Jψ has a simple expression in terms of the matrix elements of our chosen orthonormal basis for Hψ . Indeed, it follows immediately from (2.12) and (2.13) that Tψ |lmn = (−1)2l+m+n q m+n |l, −m, −n. One checks, using (3.3), that Tψ π(a) |000 = π(a ∗ ) |000,

Tψ π(b) |000 = π(b∗ ) |000.

Since π is the GNS representation for the state ψ, this is enough to conclude that Tψ η(x) = η(x ∗ )

for all

x ∈ A.

(6.1)

748

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

The adjoint antilinear operator, satisfying η | Tψ∗ | ξ = ξ | Tψ | η, is given by Tψ∗ |lmn = (−1)2l+m+n q −m−n |l, −m, −n, and since ψ = Tψ∗ Tψ , we see that every |lmn lies in Dom ψ with ψ |lmn = q 2m+2n |lmn. Consequently, Jψ |lmn = (−1)2l+m+n |l, −m, −n.

(6.2)

It is clear that Jψ2 = 1 on Hψ . Definition 6.1. Let π ◦ (x) := Jψ π(x ∗ ) Jψ−1 , so that π ◦ is a ∗-antirepresentation of A on Hψ . Equivalently, π ◦ is a ∗-representation of the opposite algebra A(SU1/q (2)). By Tomita’s theorem [31], π and π ◦ are commuting representations. As an example, we compute ◦

π (a) |lmn = (−1)2l+m+n Jψ π(a ∗ ) |l, −m, −n + − = (−1)2l+m+n Jψ A |l + , −m+ , −n+ + A = =

l,−m,−n l,−m,−n |l + − + + + − + + Al,−m,−n |l m n + Al,−m,−n |l m n A− |l + m+ n+ + A+ |l − m+ n+ , l + ,−m+ ,−n+ l − ,−m+ ,−n+

−

, −m+ , −n+

where, explicitly,

A+ l − ,−m+ ,−n+

[l + m + 1][l + n + 1] [2l + 1][2l + 2] 1 [l − m][l − n] 2 = q −(2l+m+n+1)/2 . [2l][2l + 1]

A− = q (2l−m−n+1)/2 l + ,−m+ ,−n+

1 2

,

A glance back at (3.4) shows that these coefficients are identical with those of π(a) |lmn, after substituting q → q −1 . A similar phenomenon occurs with the coefficients of π ◦ (b). We find, indeed, that ◦− − + + + + + π ◦ (a) |lmn = A◦+ lmn |l m n + Almn |l m n , ◦+ + + − ◦− − + − π ◦ (b) |lmn = Blmn |l m n + Blmn |l m n ,

where ± −1 A◦± lmn (q) = Almn (q ),

◦± ± Blmn (q) = q −1 Blmn (q −1 ).

We can now verify directly that the representations π to appeal to the theorem of Tomita. For instance,

and π ◦

(6.3)

commute, without need

A+ − A + A◦+ l + 1, m + 1, n + 1 | [π(a), π ◦ (a)] | lmn = A◦+ l + m+ n+ lmn l + m+ n+ lmn 1 [l + m + 1][l + m + 2][l + n + 1][l + n + 2] 2 =Q , [2l + 1][2l + 2]2 [2l + 3] where 1

Q = q 2 (2l

+ −m+ −n+ +1)

1

1

q 2 (−2l+m+n−1) − q 2 (−2l

+ +m+ +n+ −1)

1

q 2 (2l−m−n+1) = 0.

Likewise, l − 1, m + 1, n + 1 | [π(a), π ◦ (a)] | lmn = 0, and one checks that the matrix element l, m + 1, n + 1 | [π(a), π ◦ (a)] | lmn vanishes, too. The (λ, ρ)-equivariance of π is reflected in an analogous equivariance condition for π ◦ . We now identify this condition explicitly.

The Dirac Operator on SUq (2)

749

Lemma 6.2. The symmetry of the antirepresentation π ◦ of A on Hψ is given by the equivariance conditions: λ(h) π ◦ (x)ξ = π ◦ (h˜ (2) · x) λ(h(1) )ξ, ρ(h) π ◦ (x)ξ = π ◦ (h˜ (2) x) ρ(h(1) )ξ,

(6.4)

for all h ∈ U, x ∈ A and ξ ∈ V , and h → h˜ is the automorphism of U determined on generators by k˜ := k, f˜ := q −1 f , and e˜ := qe. Proof. We work only on the dense subspace V . From (3.1) and (6.2), we get at once Jψ λ(k)∗ Jψ−1 = λ(k −1 ),

Jψ λ(f )∗ Jψ−1 = −λ(f ),

Jψ λ(e)∗ Jψ−1 = −λ(e), (6.5)

and identical relations with ρ instead of λ. Write α for the antiautomorphism of U determined by α(k) := k −1 , α(f ) := −f , and α(e) := −e; so that Jψ λ(h)∗ Jψ−1 = λ(α(h)) for h ∈ U, and similarly with ρ instead of λ. Next, the first relation of (3.2) is equivalent to π(x) λ(Sh) = λ(Sh(1) ) π(h(2) · x).

(6.6)

Indeed, the left-hand side can be expanded as π(x) λ(ε(h(1) ) Sh(2) ) = λ(Sh(1) h(2) ) π(x) λ(Sh(3) ) = λ(Sh(1) ) π(h(2) · x) λ(h(3) ) λ(Sh(4) ) on applying (3.2); and the rightmost expression equals the right-hand side of (6.6). Taking hermitian adjoints and conjugating by Jψ , we get λ(α(Sh)) π ◦ (x) = π ◦ (h(2) · x) λ(α(Sh(1) )). It remains only to note that Sα = αS is an automorphism of U, whose inverse is the map h → h˜ above; and to repeat the argument with ρ instead of λ, changing only the left action of U in concordance with (3.2). An independent check of (6.4) is afforded by the following argument. We may ask which antirepresentations π ◦ of Hψ satisfy these equivariance conditions. It suffices to run the proof of Proposition 3.3, mutatis mutandis, to determine the possible form of such a π ◦ on the basis vectors |lmn. For instance, (3.9) is replaced by ρ(e)π ◦ (a) = π ◦ (e˜ a)ρ(k −1 ) + π ◦ (k˜ a)ρ(e) = q π ◦ (b)ρ(k −1 ) + q 2 π ◦ (a)ρ(e). 1

One finds that all formulas in that proof are reproduced, except for changes in the powers of q that appear; and, apart from the aforementioned phase ambiguities, one recovers precisely the form of π ◦ given by (6.3). Before proceeding, we indicate also the symmetry of the Tomita operator Tψ , analogous to (6.5) above. Combining (6.1) with (3.2), and recalling that η(x) = π(x) |000, we find that for generators h of U, Tψ λ(h)π(x) |000 = π(x ∗ ϑ(h)∗ ) |000. On the other hand, λ(ϑ −1 S(ϑ(h∗ )))Tψ π(x) |000 = π(x ∗ ϑ(h)∗ ) |000.

750

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

One checks easily on generators that ϑ −1 S(ϑ(h)∗ ) = S(h)∗ . Since the vector |000 is separating for the GNS representation, we conclude that Tψ λ(h) Tψ−1 = λ(Sh)∗ . Similarly, we find that Tψ ρ(h) Tψ−1 = ρ(Sh)∗ . In other words, the antilinear involutory automorphism h → (Sh)∗ of the Hopf ∗-algebra U is implemented by the Tomita operator for the Haar state of the dual Hopf ∗-algebra A. This is a known feature of quantum-group duality in the C ∗ -algebra framework; for this and several other implementations by spatial operators, see [25]. 6.2. The real structure on spinors. We are now ready to come back to spinors. Notice that Jψ does not appear explicitly in the equivariance conditions (6.4) for the right regular representation π ◦ of A on Hψ . Thus, we are now able to construct the “right multiplication” representation of A on spinors from its symmetry alone, and to deduce the conjugation operator J on spinors after the fact. Proposition 6.3. Let π ◦ be an antirepresentation of A on H = Hψ ⊕ Hψ satisfying the following equivariance conditions: λ (h) π ◦ (x)ξ = π ◦ (h˜ (2) · x) λ (h(1) )ξ, ρ (h) π ◦ (x)ξ = π ◦ (h˜ (2) x) ρ (h(1) )ξ.

(6.7)

Then, up to some phase factors depending only on the index j in the decomposition (4.2), π ◦ is given on the spinor basis by ◦− + + + − + + π ◦ (a) |j µn = αj◦+ µn |j µ n + αj µn |j µ n , ◦− + + − − + − π ◦ (b) |j µn = βj◦+ µn |j µ n + βj µn |j µ n , + − − − − − π ◦ (a ∗ ) |j µn = α˜ j◦+ ˜ j◦− µn |j µ n + α µn |j µ n ,

(6.8)

+ − + ˜ ◦− − − + π ◦ (b∗ ) |j µn = β˜j◦+ µn |j µ n + βj µn |j µ n , ◦± ◦± ± −1 where αj◦± µn and βj µn are the triangular 2 × 2 matrices, given by αj µn (q) = αj µn (q ) ± ± ± ◦± −1 −1 and βj µn (q) = q βj µn (q ), with αj µn and βj µn given by (4.9).

Proof. We retrace the steps of the proof of Proposition 4.4, mutatis mutandis. Since 1 k˜ · a = k · a = q 2 a, the relations involving λ (k) and ρ (k) are unchanged. We quickly conclude that π ◦ must have the form (6.8), and it remains to determine the coefficient matrices. The commutation relations of λ (f ) with π ◦ (a) and π ◦ (b) give: − 2 (µ+n− 2 ) [j + µ + 1] 2 A◦+ αj◦+ µn = q jn ,

− 2 (µ+n− 2 ) αj◦− [j − µ] 2 A◦− µn = q jn ,

− 2 (µ+n− 2 ) [j + µ + 1] 2 Bj◦+ βj◦+ µn = q n,

− 2 (µ+n− 2 ) βj◦− [j − µ] 2 Bj◦− µn = q n.

1 1

1 1

1 1

1

1

1

1

1

1

The Dirac Operator on SUq (2)

751

◦± The matrices A◦± j n , Bj n may be determined, as before, by the commutation relations 1

involving ρ (f ) and ρ (e). One finds that the n-dependent factors such as [j + n + 23 ] 2 ± ◦+ and so on, are the same as the respective entries of A± j n , Bj n ; let aj ↑↑ , etc., be the remaining factors which depend on j only. Then (4.10) is replaced by −j − 2 ◦+ bj◦+ aj ↑↑ , ↑↑ = q

j + 2 ◦+ bj◦+ aj ↓↑ , ↓↑ = −q

−j − 2 ◦+ bj◦+ aj ↓↓ , ↓↓ = q

j + 2 ◦− bj◦− aj ↑↑ , ↑↑ = −q

−j − 2 ◦− bj◦− aj ↑↓ , ↑↓ = q

j − 2 ◦− bj◦− aj ↓↓ . ↓↓ = −q

3

1

1

1

1

1

Next, we find (a ◦− 1

j + 2 ,↑↑

) = q −2j −2 aj◦+ ↑↑ ,

(a ◦− 1

j + 2 ,↑↓

) = −aj◦+ ↓↑ ,

(a ◦− 1

j + 2 ,↓↓

) = q −2j −1 aj◦+ ↓↓ .

Since π ◦ is an antirepresentation, ab = q −1 ba implies π ◦ (b)π ◦ (a) = q −1 π ◦ (a) The matrix elements of both sides lead to three relations:

π ◦ (b).

[2j + 1] a ◦+ 1

a ◦+ j + 2 ,↓↓ j ↓↑

= [2j + 3] a ◦+ 1

a ◦+ , j + 2 ,↓↑ j ↑↑

(6.9)

which is formally identical to (4.11), and 2 −2j −1 [2j + 1] |a ◦+ 1 |2 − q −1 [2j + 3] |aj◦+ |2 , [2j + 1] |aj◦+ ↓↑ | = q ↑↑ j − 2 ,↑↑ ◦+ ◦+ 2 −2j −1 2 |2 . q [2j + 1] |aj◦+ [2j + 1] |a 1 | = q ↓↓ | − [2j − 1] |a 1 j − 2 ,↓↑

j − 2 ,↓↓

Finally, the relation aa ∗ + bb∗ = 1 yields π ◦ (a ∗ )π ◦ (a) + π ◦ (b∗ )π ◦ (b) = 1; its diagonal matrix elements gives the last two relations: q −2j [2j + 1]2 |a ◦+ 1

j − 2 ,↑↑

2 q −2j [2j + 1]2 |aj◦+ ↓↓ | = 1.

|2 = 1,

All coefficients are now determined except for their phases: 1

aj◦+ ↑↑

=

ζj◦

qj+ 2 , [2j + 2]

1

aj◦+ ↓↑

=

ηj◦

q− 2 , [2j + 1] [2j + 2]

and (6.9) also entails the phase relations ζj◦ η◦

j + 21

◦ aj◦+ ↓↓ = ξj

= ηj◦ ξ ◦

j + 21

qj , [2j + 1]

(6.10)

. Once more, we choose all

phases to be +1 by convention. Substituting (6.10) back in previous formulas, we find ± −1 αj◦± µn (q) = αj µn (q ),

in perfect analogy with (6.3).

−1 ± −1 βj◦± µn (q) = q βj µn (q )

(6.11)

Definition 6.4. The conjugation operator J is the antilinear operator on H which is defined explicitly on the orthonormal spinor basis by J |j µn↑ := i 2(2j +µ+n) |j, −µ, −n, ↑, J |j µn↓ := i 2(2j −µ−n) |j, −µ, −n, ↓.

(6.12)

It is immediate from this presentation that J is antiunitary and that J 2 = −1, since each 4j ± 2(µ + n) is an odd integer.

752

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

Proposition 6.5. The invariant operator D of Sect. 5 commutes with the conjugation operator J : J DJ −1 = D.

(6.13)

Proof. This is clear from the diagonal form of both D and J on their common eigen↑ ↓ spaces Wj and Wj , given by the respective Eqs. (5.1) and (6.12). Remark 6.6. Proposition 6.5 is a minimal requirement for (A(SUq (2)), H, D, J ) to constitute a real spectral triple. However, here is where we part company with the axiom scheme for real spectral triples proposed in [8]. Indeed, the conjugation operator J that we have defined by (6.12) is not the modular conjugation for the spinor representation of A. That modular operator is Jψ ⊕Jψ , which does not have a diagonal form in our chosen spinor basis (unless q = 1). It is clear that conjugation of π (A(SUq (2)) by the modular operator would yield a representation of the opposite algebra A(SU1/q (2)), and the commutation relation analogous to (6.13) would then force D to be equivariant under the corresponding symmetry of U1/q (su(2)), denoted by (λ , ρ ) in our earlier Remark 4.5. It is not hard to check that this extra equivariance condition would force D to be merely a scalar operator, thereby negating the possibility of an equivariant 3+ -summable real spectral triple based on A(SUq (2)) with the modular conjugation operator. This result is consonant with the “no-go theorem” of Schm¨udgen [28] for nontrivial commutator representations of Woronowicz differential calculi on SUq (2). The remedy that we propose here is to modify J , in keeping with the symmetry of the spinor representation, to a non-Tomita conjugation operator. We shall see, however, that the expected properties of real spectral triples do hold “up to compact perturbations”. It should be noted that J satisfies the analogue of (6.5) for the representations λ and ρ : J λ (k)J −1 = λ (k −1 ),

J λ (e)J −1 = −λ (f ),

Jρ (k)J −1 = ρ (k −1 ),

Jρ (e)J −1 = −ρ (f ),

(6.14)

which follows directly from the definition (6.12) and the relations (4.6). Proposition 6.7. The antiunitary operator J intertwines the left and right spinor representations: J π (x ∗ ) J −1 = π ◦ (x),

for all x ∈ A.

(6.15)

Proof. It follows directly from the proof of Lemma 6.2, using the relations (6.14) instead of (6.5), that the antirepresentation x → J π (x ∗ ) J −1 complies with the equivariance conditions (6.7). By Proposition 6.3, it coincides with π ◦ up to an equivalence obtained by resetting the phase factors in (6.10). It remains only to check that ζj◦ = ηj◦ = ξj◦ = 1 for the aforementioned antirepresentation. This check is easily effected by calculating J π (a ∗ ) J −1 directly on the basis vectors |j µn↑. We compute

The Dirac Operator on SUq (2)

753

J π (a ∗ )J −1 |j µn↑ = i −2(2j −µ−n) J π (a ∗ ) |j, −µ, −n, ↑ + + = i −2(2j −µ−n) J α˜ j,−µ,−n,↑↑ |j + , −µ+ , −n+ ↑ + α˜ j,−µ,−n,↓↑ |j + , −µ+ , −n+ ↓ − + α˜ j,−µ,−n,↑↑ |j − , −µ+ , −n+ ↑ + + − = α˜ j,−µ,−n,↑↑ |j + µ+ n+ ↑ − α˜ j,−µ,−n,↓↑ |j + µ+ n+ ↓ + α˜ j,−µ,−n,↑↑ |j − µ+ n+ ↑

= αj−+ ,−µ+ ,−n+ ,↑↑ |j + µ+ n+ ↑ − αj−+ ,−µ+ ,−n+ ,↓↑ |j + µ+ n+ ↓ + αj+− ,−µ+ ,−n+ ,↑↑ |j − µ+ n+ ↑ 1 3 1 1 1 1 [j + µ + 1] 2 [j + n + ] 2 2 |j + µ+ n+ ↑ = q − 2 (µ+n− 2 ) q j + 2 [2j + 2] 1

1

[j + µ + 1] 2 [j − n + 21 ] 2 + + + |j µ n ↓ [2j + 1][2j + 2] 1 1 [j − µ] 2 [j − n + 21 ] 2 − + + + q −j −1 |j µ n ↑ [2j + 1] 1

+ q− 2

◦+ ◦− + + + + + + − + + = αj◦+ µn↑↑ |j µ n ↑ + αj µn↓↑ |j µ n ↓ + αj µn↑↑ |j µ n ↑

= π ◦ (a) |j µn↑, where the αj◦± µn coefficients are taken according to (6.11). In the same way, one finds that J π (b∗ )J −1 |j µn↑ = π ◦ (b) |j µn↑, again using (6.11) for βj◦± µn ; and similar calculations show that both sides of (6.15) coincide on the basis vector |j µn↓. (These four calculations, taken together, afford a direct proof of (6.15) without need to consider the symmetries of J .) 7. Algebraic Properties of the Spectral Triple In this section, we discuss the properties of the real spectral triple (A(SUq (2)), H, D, J ), in particular its commutant property and its first-order condition. We will see that these are only satisfied up to certain compact operators, quite similarly to [11]. We can simplify our discussion somewhat by replacing the spinor representation π of A = A(SUq (2)) of Proposition 4.4 by a so-called approximate representation π : A → B(H), such that π (x) − π (x) is a compact operator for each x ∈ A. In other words, although π need not preserve the algebra relations of A, the mappings π and π have the same image in the Calkin algebra B(H)/K(H), that is, they define the same ∗-homomorphism of A into the Calkin algebra. We denote by Lq the positive trace-class operator given by Lq |j µn := q j |j µn for

j ∈ 21 N,

and let Kq be the two-sided ideal of B(H) generated by Lq ; it consists of trace-class operators. The ideal Kq is indeed contained in the ideal of infinitesimals of order α, that is, compact operators whose nth singular value µn satisfies µn = O(n−α ), for all α > 0. Thus the following analysis holds modulo infinitesimals of arbitrary high order.

754

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

Proposition 7.1. The following equations define a mapping π : A → B(H) on generators, which is a ∗-representation modulo Kq , and is approximate to the spin representation π of Proposition 4.4 in the sense that π (x) − π (x) ∈ Kq for each x ∈ A: − + + + − + + π (a) |j µn = α + j µn |j µ n + α j µn |j µ n ,

π (b) |j µn = β + |j + µ+ n− + β − |j − µ+ n− , j µn j µn + − − − − − π (a ∗ ) |j µn = α˜ + ˜− j µn |j µ n + α j µn |j µ n ,

(7.1)

+ − π (b∗ ) |j µn = β˜ j µn |j + µ− n+ + β˜ j µn |j − µ− n+ ,

where

α− j µn β+ j µn β− j µn

1 − q 2j +2n+3 0 , 0 1 − q 2j +2n+1 q 1 − q 2j −2n+1 0 2j +µ+n+ 21 2j −2µ , := q 1−q 0 1 − q 2j −2n−1 q 1 − q 2j −2n+3 0 j +n− 21 2j +2µ+2 , (7.2) := q 1−q 0 1 − q 2j −2n+1 1 − q 2j +2n+1 0 , := −q j +µ 1 − q 2j −2µ 0 1 − q 2j +2n−1

α+ j µn :=

1 − q 2j +2µ+2

and ∓ α˜ ± j µn = α j ± µ− n− ,

± β˜ j µn = α ∓ . j ± µ− n+

(7.3)

Proof. First of all, we claim that the defining relations (2.1) are preserved by π modulo the ideal Kq of B(H), that is, π (b)π (a) − q π (a)π (b) ∈ Kq , and so on. Indeed, it can be verified by a direct but tedious check on the spinor basis that π (b)π (a) − q π (a)π (b) = L4q A, where A is a bounded operator; the same is true for each of the other relations listed in (2.1). It is well known, and easily checked from (2.1), that A is generated as a vector space by the products a k bl b∗m and bl b∗m a ∗n , for k, l, m, n ∈ N. We may thus define π (x) for any x ∈ A by extending (7.1) multiplicatively on such products, and then extending further by linearity. With this convention, we conclude that π (xy) − π (x)π (y) ∈ Kq

for all

x, y ∈ A.

(7.4)

The defining formulas also entail that π (x)∗ = π (x ∗ ) for each x ∈ A. If π (x) − π (x) ∈ Kq and π (y) − π (y) ∈ Kq , then π (xy) − π (x)π (y) = π (x) π (y) − π (y) + π (x) − π (x) π (y) ∈ Kq , and therefore π (xy) − π (xy) lies in Kq also; thus, it suffices to verify this property in the cases x = a, b.

The Dirac Operator on SUq (2)

755

On comparing the coefficients (7.2) with the corresponding ones of π (a) and π (b) from (4.9), we get, for instance, q 4j +4 1 − q 2j +2µ+2 1 − q 2j +2n+3 + + = q 4j +4 αj+µn↑↑ , αj µn↑↑ − α j µn↑↑ = 1 − q 4j +4 q 4j +2 1 − q 2j +2µ+2 1 − q 2j +2n+1 + + αj µn↓↓ − α j µn↓↓ = = q 4j +2 αj+µn↓↓ , 1 − q 4j +2 (7.5a) and similarly, 4j +2 − αj−µn↑↑ − α − αj µn↑↑ , j µn↑↑ = q

4j − αj−µn↓↓ − α − j µn↓↓ = q αj µn↓↓ .

(7.5b) 1

We estimate the off-diagonal terms, using the inequalities q ±µ ≤ q −j , q ±n ≤ q −j − 2 and [N ]−1 < q N−1 : 1

|αj+µn↓↑ |

=q

(µ+n+ 21 )/2

1

|αj−µn↑↓ | = q (µ+n+ 2 )/2 1

1

[j + µ + 1] 2 [j − n + 21 ] 2 q −2j −2 ≤ < q 2j −1 , [2j + 1] [2j + 2] [2j + 1] [2j + 2] 1

[j − µ] 2 [j + n + 21 ] 2 q −2j −1 ≤ < q 2j −2 . [2j ] [2j + 1] [2j ] [2j + 1]

On account of (7.5) and analogous relations for the coefficients of π (b), we find that π (a) − π (a) ≡ T π (a)T π (b) − π (b) ≡ T π (b)T

mod Kq , mod Kq ,

where T is the operator defined by

3 3 q 2j + 2 0 q2 0 T |j µn := |j µn = L2q |j µn. 1 1 0 q 2j + 2 0 q2

(7.6)

Clearly, T ∈ Kq , so that by boundedness of π (x) it follows that π (x) − π (x) ∈ Kq for x = a, b. Using the conjugation operator J , we can also define an approximate antirepresentation of A by π ◦ (x) := J π (x)J −1 . It is immediate that π ◦ (x) − π ◦ (x) ∈ Kq , with π ◦ as defined in Proposition 6.3. Explicitly, we can write ◦− + + + − + + π ◦ (a) |j µn = α ◦+ j µn |j µ n + α j µn |j µ n ,

π ◦ (b) |j µn = β ◦+ |j + µ+ n− + β ◦− |j − µ+ n− , j µn j µn + − − − − − π ◦ (a ∗ ) |j µn = α˜ ◦+ ˜ ◦− j µn |j µ n + α j µn |j µ n , ◦+ ◦− π ◦ (b∗ ) |j µn = β˜ j µn |j + µ− n+ + β˜ j µn |j − µ− n+ ,

where α ◦± ˜± j µn = α j,−µ,−n ,

± α˜ ◦± j µn = α j,−µ,−n ,

±

β ◦± = −β˜ j,−µ,−n , j µn

◦± β˜ j µn = −β ± . j,−µ,−n

It turns out that the approximate representations π and π ◦ almost commute, in the following sense.

756

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

Proposition 7.2. For each x, y ∈ A, the commutant [π ◦ (x), π (y)] lies in Kq . Proof. In view of our earlier remarks on the almost-multiplicativity of π , and thus also of π ◦ , it is enough to check this for the cases x, y = a, a ∗ , b, b∗ . We omit the detailed calculation, which we have performed with a symbolic computer program. In each case, the commutator [π ◦ (x), π (y)] decomposes as a direct sum of operators in the subspac↑ ↓ es Wj and Wj separately, in view of (7.2) and (6.12), and the explicit calculation shows that for each pair of generators x, y, we obtain [π ◦ (x), π (y)] = L2q A, where A is a bounded operator. If we further impose the first-order condition up to compact operators in the ideal Kq , it turns out that this (almost) determines the Dirac operator. Proposition 7.3. Up to rescaling, adding constants, and adding elements of Kq , there is only one operator D of the form (5.1) which satisfies the first order condition modulo Kq , that is, each [D, π (y)] is bounded, and [π ◦ (x), [D, π (y)]] ∈ Kq for all x, y ∈ A.

(7.7)

This operator D has eigenvalues that are linear in j . Proof. Suppose first that D is an equivariant selfadjoint operator of the type considered in Sect. 5, with eigenvalues linear in j ; that is, D is determined by (5.1) and (5.3). Since each operator appearing in (7.7) decomposes into a pair of operators on the “up” and “down” spinor subspaces, it is clear that the nested commutators are independent of the ↑ ↓ ↑ ↓ parameters c2 and c2 ; and that c1 and c1 are merely scale factors on both subspaces. Again we take x and y to be generators: explicit calculations show that in each case, [π ◦ (x), [D, π (y)]] = L2q B with B a bounded operator. To prove the converse, assume only that D satisfies the equivariance condition (5.1), and that [D, π (a)] and [D, π (b)] are bounded. We may decompose π (a) = π (a)+ + π (a)− according to whether the index j in (7.1) is raised or lowered; and similarly for π (b), π ◦ (a), and π ◦ (b). Proposition 7.2 shows that, modulo Kq : π (a)+ π ◦ (a)+ ≡ π ◦ (a)+ π (a)+ , π (a)− π ◦ (a)− ≡ π ◦ (a)− π (a)− , + ◦ − π (a) π (a) + π (a)− π ◦ (a)+ ≡ π ◦ (a)+ π (a)− + π ◦ (a)− π (a)+ . By (7.2), the operators π (a) and π (b), as well as D, are diagonal for the decomposition H = H↑ ⊕ H↓ . On the subspace H↑ , we obtain [[D, π (a)], π ◦ (a)] |j µn↑ = Dπ (a)π ◦ (a) + π ◦ (a)π (a)D − π (a)Dπ ◦ (a) − π ◦ (a)Dπ (a) |j µn↑ ↑ ↑ ↑ ↑ ↑ ↑ = (dj +1 + dj − 2dj + ) π (a)+ π ◦ (a)+ + (dj −1 + dj − 2dj − ) π (a)− π ◦ (a)− ↑

↑

+ 2dj (π (a)+ π ◦ (a)− + π (a)− π ◦ (a)+ ) − dj + (π (a)− π ◦ (a)+

↑ + π ◦ (a)− π (a)+ ) − dj − (π (a)+ π ◦ (a)− + π ◦ (a)+ π (a)− ) + R |j µn↑, (7.8)

The Dirac Operator on SUq (2)

757

where R ∈ Kq . On the subspace H↓ , we get the precisely analogous expression with the arrows reversed. In order that the expression on the right-hand side of (7.8) come from an element of Kq applied to |j µn↑, and likewise for |j µn↓, it is necessary and sufficient that the scalars ↑

↑

↑

↑

wj := dj +1 + dj − 2dj + , ↑

↓

↓

↓

↓

wj := dj +1 + dj − 2dj +

(7.9)

↓

satisfy wj = O(q j ) and wj = O(q j ) as j → ∞. ↑

↓

In the particular case where wj = 0 and wj = 0 for all j , (7.9) gives elementary ↑

↓

recurrence relations for dj and dj , whose solutions are precisely the expressions (5.3) that are linear in j , namely, ↑

↑

↑

dj = c1 j + c2 ,

↓

↓

↓

dj = c1 j + c2 .

The general case gives a pair of perturbed recurrence relations, that may be treated by generating-function methods [16]; their solutions differ from the linear case by terms that are O(q j ) as j → ∞. Thus, the corresponding operator D differs from one whose eigenvalues are linear in j by an element of Kq . We finish by summarizing the implications of the above Propositions 7.1, 7.2 and 7.3 for the spectral triple (A(SUq (2)), H, D, J ), where A(SUq (2)) acts on H via the spinor representation π . The representations π and π ◦ do not commute, since the conjugation operator J differs from the Tomita conjugation for π . However, we do obtain commutation “up to infinitesimals”; since [π ◦ (x), π (y)] ≡ [π ◦ (x), π (y)] mod Kq , Proposition 7.2 entails the analogous result for the exact representations: [π ◦ (x), π (y)] ∈ Kq

for all

x, y ∈ A.

To examine the first-order property, we note first if x, y ∈ A and [D, π (y) − π (y)] lies in Kq , then [π ◦ (x), [D, π (y)]] = π ◦ (x) + (π ◦ (x) − π ◦ (x)), [D, π (y) + (π (y) − π (y))] ≡ [π ◦ (x), [D, π (y)]] ≡ 0 mod Kq .

(7.10)

Since D commutes with the positive operator T defined in (7.6), we find in the case of a generator y = a, a ∗ , b or b∗ , that [D, π (y) − π (y)] = [D, T π (y)T ] = T [D, π (y)]T , which lies in Kq since [D, π (y)] is bounded, by Proposition 5.2. Thus, [D, π (y)] is bounded, too –as required by Proposition 7.3. The general case of [D, π (y) − π (y)] ∈ Kq then follows from (7.4). Thus (7.10) holds for general x, y ∈ A. Combining that with Proposition 7.3, we arrive at the following characterization of our spectral triple over A(SUq (2)).

758

L. D¸abrowski, G. Landi, A. Sitarz, W. van Suijlekom, J. C. V´arilly

Theorem 7.4. The real spectral triple (A(SUq (2)), H, D, J ) defined here, with A(SUq (2)) acting on H via the spinor representation π , satisfies both the commutant property and the first order condition up to infinitesimals: [π ◦ (x), π (y)] ∈ Kq , ◦ π (x), [D, π (y)] ∈ Kq ,

for all

x, y ∈ A(SUq (2)).

In [17] it was argued that there are obstructions to the construction of “deformed spectral triples” satisfying a type of first order condition for the Dirac operator. Theorem 7.4 above shows a way to overcome these obstructions. Acknowledgements. We thank Alain Connes and Mariusz Wodzicki for helpful discussions. AS and JCV ´ thank Alain Connes for the opportunity to visit the Institut des Hautes Etudes Scientifiques. AS also thanks the Institut Henri Poincar´e and SISSA for hospitality. LD acknowledges partial support by the EU Project INTAS 00-257. Support from the Vicerrector´ıa de Investigaci´on of the Universidad de Costa Rica is also acknowledged. JCV is grateful to the DAAD and to Florian Scheck for a warm welcome to the Johannes-Gutenberg Universit¨at, Mainz, during the course of this work.

References 1. B¨ar, C.: The Dirac operator on homogeneous spaces and its spectrum on 3-dimensional lens spaces. Arch. Math. 59, 65–79 (1992) 2. Bibikov, P.N., Kulish, P.P.: Dirac operators on the quantum group SUq (2) and the quantum sphere. J. Math. Sci. (N.Y.) 100, 2039–2050 (2000) 3. Biedenharn, L.C., Lohe, M.A.: Quantum group symmetry and q-tensor algebras. Singapore: World Scientific, 1995 4. Biedenharn, L.C., Louck, J.D.: Angular momentum in quantum physics: theory and applications. Reading, MA: Addison-Wesley, 1981 5. Chakraborty, P.S., Pal, A.: Equivariant spectral triples on the quantum SU (2) group. K-Theory 28, 107–126 (2003) 6. Chakraborty, P. S., Pal, A.: Remark on Poincar´e duality for SUq (2). http://arxiv.org/list/ math.OA/0211367, 2002 7. Connes, A.: Noncommutative geometry. London, San Diego: Academic Press, 1994 8. Connes, A.: Noncommutative geometry and reality. J. Math. Phys. 36, 6194–6231 (1995) 9. Connes, A.: Cyclic cohomology, quantum group symmetries and the local index formula for SUq (2). J. Inst. Math. Jussieu 3, 17–68 (2004) 10. Connes, A., Landi, G.: Noncommutative manifolds, the instanton algebra and isospectral deformations. Commun. Math. Phys. 221, 141–159 (2001) 11. D¸abrowski, L., Landi, G., Paschke, M., Sitarz, A.: The spectral geometry of the equatorial Podle´s sphere. C. R. Acad. Sci. Paris, Ser. I 340, 819–822 (2005). 12. D¸abrowski, L., Sitarz, A.: Dirac operator on the standard Podle´s quantum sphere. In: Noncommutative Geometry and Quantum Groups, Banach Centre Publications 61, Hajac, P.M., Pusz, W. (eds.), Warszawa: IMPAN, 2003, pp. 49–58 13. Goswami, D.: Some noncommutative geometric aspects of SUq (2). http://arxiv.org/list/ math-ph/0108003, 2001 14. Gover, A.R., Zhang, R.B.: Geometry of quantum homogeneous vector bundles and representation theory of quantum groups I. Rev. Math. Phys. 11, 533–552 (1999) 15. Gracia-Bond´ıa, J.M., V´arilly, J.C., Figueroa, H.: Elements of noncommutative geometry. Boston: Birkh¨auser, 2001 16. Graham, R.L., Knuth, D.E., Patashnik, O.: Concrete mathematics. Reading, MA: Addison-Wesley, 1989 17. Hawkins, E.: Noncommutative rigidity. Commun. Math. Phys. 246, 211–235 (2004) 18. Homma, Y.: A representation of Spin(4) on the eigenspinors of the Dirac operator on S3 . Tokyo J. Math. 23, 453–472 (2000) 19. Kassel, C.: Quantum Groups. Berlin: Springer, 1995

The Dirac Operator on SUq (2)

759

20. Kirillov, A.N., Reshetikhin, N.Yu.: Representations of the algebra Uq (sl(2)), q-orthogonal polynomials and invariants of links. In: Infinite dimensional lie algebras and groups, Kac, V.G. (ed.), Singapore: World Scientific, 1989, pp. 285–339 21. Klimyk, A.U., Schm¨udgen, K.: Quantum Groups and their Representations. New York: Springer, 1998 22. Kr¨ahmer, U.: Dirac operators on quantum flag manifolds. Lett. Math. Phys. 67, 49–59 (2004) 23. Majid, S.: Foundations of quantum group theory. Cambridge: Cambridge Univ. Press, 1995 24. Majid, S.: Noncommutative Riemannian and spin geometry of the standard q-sphere. Commun. Math. Phys. 256, 255–285 (2005) 25. Masuda, T., Nakagami, Y., Woronowicz, S.L.: A C ∗ -algebraic framework for quantum groups. Int. J. Math. 14, 903–1001 (2003) 26. Neshveyev, S., Tuset, L.: A local index formula for the quantum sphere. Commun. Math. Phys. 254, 323–341 (2005) 27. Podle´s, P.: Quantum spheres. Lett. Math. Phys. 14, 521–531 (1987) 28. Schm¨udgen, K.: Commutator representations of differential calculi on the quantum group SUq (2). J. Geom. Phys. 31, 241–264 (1999) 29. Schm¨udgen, K., Wagner, E.: Dirac operator and a twisted cyclic cocycle on the standard Podle´s quantum sphere. J. reine angew. Math. 574, 219–235 (2004) 30. Sitarz, A.: Equivariant spectral triples. In: Noncommutative geometry and quantum groups, banach centre publications 61, Hajac, P.M. Pusz, W. (eds.), Warszawa: IMPAN, 2003, pp. 231–263 31. Takesaki, M.: Tomita’s theory of modular Hilbert algebras. Lecture Notes in Mathematics 128, Berlin: Springer, 1970 32. Woronowicz, S.L.: Compact quantum groups. In: Quantum symmetries, Connes, A., Gaw¸edski, K., Zinn-Justin, J. (eds.), Amsterdam: Elsevier Science, 1998, pp. 845–884 Communicated by A. Connes